BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3831 Length=160 Score E Sequences producing significant alignments: (Bits) Value gi|15610967|ref|NP_218348.1| hypothetical protein Rv3831 [Mycoba... 324 2e-87 gi|340628800|ref|YP_004747252.1| hypothetical protein MCAN_38501... 321 2e-86 gi|118620020|ref|YP_908352.1| hypothetical protein MUL_5005 [Myc... 270 4e-71 gi|183985353|ref|YP_001853644.1| hypothetical protein MMAR_5385 ... 255 1e-66 gi|296166959|ref|ZP_06849375.1| conserved hypothetical protein [... 242 1e-62 gi|41406298|ref|NP_959134.1| hypothetical protein MAP0200c [Myco... 238 2e-61 gi|342860084|ref|ZP_08716736.1| hypothetical protein MCOL_14435 ... 234 4e-60 gi|333992778|ref|YP_004525392.1| hypothetical protein JDM601_413... 225 2e-57 gi|254822670|ref|ZP_05227671.1| hypothetical protein MintA_22264... 222 1e-56 gi|240172631|ref|ZP_04751290.1| hypothetical protein MkanA1_2517... 222 2e-56 gi|33863910|ref|NP_895470.1| hypothetical protein PMT1643 [Proch... 43.5 0.011 gi|88854525|ref|ZP_01129192.1| hypothetical protein A20C1_09914 ... 43.1 0.015 gi|170780649|ref|YP_001708981.1| putative integral membrane prot... 41.2 0.054 gi|323356511|ref|YP_004222907.1| hypothetical protein MTES_0063 ... 40.8 0.066 gi|148272116|ref|YP_001221677.1| hypothetical protein CMM_0936 [... 39.7 0.15 gi|124021945|ref|YP_001016252.1| hypothetical protein P9303_0232... 37.4 0.74 gi|159903688|ref|YP_001551032.1| hypothetical protein P9211_1147... 37.0 1.0 gi|109897233|ref|YP_660488.1| hypothetical protein Patl_0908 [Ps... 37.0 1.0 gi|145224916|ref|YP_001135594.1| hypothetical protein Mflv_4337 ... 36.2 1.6 gi|113955407|ref|YP_730669.1| hypothetical protein sync_1464 [Sy... 36.2 1.7 gi|148272115|ref|YP_001221676.1| hypothetical protein CMM_0935 [... 35.8 2.3 gi|119489269|ref|ZP_01622076.1| hypothetical protein L8106_07436... 35.0 4.0 gi|67525477|ref|XP_660800.1| hypothetical protein AN3196.2 [Aspe... 33.9 8.2 gi|66876421|gb|AAY57986.1| hypothetical protein [Lyngbya majuscu... 33.5 9.6 >gi|15610967|ref|NP_218348.1| hypothetical protein Rv3831 [Mycobacterium tuberculosis H37Rv] gi|15843455|ref|NP_338492.1| hypothetical protein MT3939 [Mycobacterium tuberculosis CDC1551] gi|31795005|ref|NP_857498.1| hypothetical protein Mb3861 [Mycobacterium bovis AF2122/97] 76 more sequence titlesLength=160 Score = 324 bits (831), Expect = 2e-87, Method: Compositional matrix adjust. Identities = 160/160 (100%), Positives = 160/160 (100%), Gaps = 0/160 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF Sbjct 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 Query 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR Sbjct 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 Query 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA Sbjct 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 >gi|340628800|ref|YP_004747252.1| hypothetical protein MCAN_38501 [Mycobacterium canettii CIPT 140010059] gi|340006990|emb|CCC46181.1| hypothetical protein MCAN_38501 [Mycobacterium canettii CIPT 140010059] Length=160 Score = 321 bits (823), Expect = 2e-86, Method: Compositional matrix adjust. Identities = 159/160 (99%), Positives = 159/160 (99%), Gaps = 0/160 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFS PECVYYVVGIASIALGWYFNIRF Sbjct 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSPPECVYYVVGIASIALGWYFNIRF 60 Query 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR Sbjct 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 Query 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA Sbjct 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 >gi|118620020|ref|YP_908352.1| hypothetical protein MUL_5005 [Mycobacterium ulcerans Agy99] gi|118572130|gb|ABL06881.1| conserved hypothetical membrane protein [Mycobacterium ulcerans Agy99] Length=160 Score = 270 bits (691), Expect = 4e-71, Method: Compositional matrix adjust. Identities = 126/159 (80%), Positives = 140/159 (89%), Gaps = 0/159 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHA LG+ VI WI++SNP+ + RPA G+WFS ECVYY VGIASIA GWYFNIRF Sbjct 1 MVSLLVHAVLGIAVISWIIASNPQAYARPAAGAWFSPLECVYYAVGIASIAFGWYFNIRF 60 Query 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 V++Y+HGA NP+WGPGSWA+Y+RLMFTNPAA SA QDYTIANVILLPLFS DGYRRGLR Sbjct 61 VREYSHGATNPIWGPGSWADYIRLMFTNPAAGSASQDYTIANVILLPLFSIVDGYRRGLR 120 Query 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVG 159 RPWLYFVSSLFTSFAFA FYFATIERQHRHE++R VG Sbjct 121 RPWLYFVSSLFTSFAFALGFYFATIERQHRHEQARDKVG 159 >gi|183985353|ref|YP_001853644.1| hypothetical protein MMAR_5385 [Mycobacterium marinum M] gi|183178679|gb|ACC43789.1| conserved hypothetical membrane protein [Mycobacterium marinum M] Length=160 Score = 255 bits (652), Expect = 1e-66, Method: Compositional matrix adjust. Identities = 129/159 (82%), Positives = 143/159 (90%), Gaps = 0/159 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHA LG+ VI WI++SNP+V+ RPA G+WFS ECVYY VGIASIALGWYFNIRF Sbjct 1 MVSLLVHAVLGIAVISWIIASNPQVYARPAAGAWFSPLECVYYAVGIASIALGWYFNIRF 60 Query 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 V++Y+HGA NP+WGPGSWA+Y+RLMFTNPAA SA QDYTIANVILLPLFS DGYRRGLR Sbjct 61 VREYSHGATNPIWGPGSWADYIRLMFTNPAAGSASQDYTIANVILLPLFSIVDGYRRGLR 120 Query 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVG 159 RPWLYFVSSLFTSFAFA AFYFATIERQHRHE++R VG Sbjct 121 RPWLYFVSSLFTSFAFALAFYFATIERQHRHEQARDKVG 159 >gi|296166959|ref|ZP_06849375.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295897698|gb|EFG77288.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=161 Score = 242 bits (617), Expect = 1e-62, Method: Compositional matrix adjust. Identities = 128/161 (80%), Positives = 142/161 (89%), Gaps = 1/161 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHA LG+ VIGWIV+SN KVF RPAGG FS ECVYYVVGIAS+ALGWYFNI + Sbjct 1 MVSLLVHAVLGLSVIGWIVASNSKVFARPAGGPLFSPLECVYYVVGIASVALGWYFNITY 60 Query 61 VQQYAHGAANPLWGP-GSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGL 119 V+QY+HG++NPLWG GSWAEY+RLMFTNPAA SA QDYTIANV+LLPLF+ DGYRRGL Sbjct 61 VEQYSHGSSNPLWGEHGSWAEYIRLMFTNPAADSASQDYTIANVVLLPLFTIVDGYRRGL 120 Query 120 RRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 R PWLYFVSSLFTSFAFAFAFYFAT+ERQ RHE++R TV A Sbjct 121 RHPWLYFVSSLFTSFAFAFAFYFATMERQRRHEQARETVDA 161 >gi|41406298|ref|NP_959134.1| hypothetical protein MAP0200c [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463108|ref|YP_879490.1| hypothetical protein MAV_0195 [Mycobacterium avium 104] gi|254773255|ref|ZP_05214771.1| hypothetical protein MaviaA2_01036 [Mycobacterium avium subsp. avium ATCC 25291] gi|41394646|gb|AAS02517.1| hypothetical protein MAP_0200c [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164395|gb|ABK65292.1| conserved hypothetical protein [Mycobacterium avium 104] gi|336461886|gb|EGO40741.1| Protein of unknown function (DUF2834) [Mycobacterium avium subsp. paratuberculosis S397] Length=161 Score = 238 bits (608), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 129/161 (81%), Positives = 140/161 (87%), Gaps = 1/161 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLLVHA LG VI WIV+SN KVF RPAGG FS E VYY+VGIAS+ALGWYFNI F Sbjct 1 MVSLLVHAVLGFSVIAWIVASNAKVFARPAGGPLFSPMEVVYYLVGIASVALGWYFNITF 60 Query 61 VQQYAHGAANPLWGP-GSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGL 119 VQQY+HG+ NPLWG GSWAEY+RLMFTNPAASSA QDYTIANV+LLPLF+ DGYRRGL Sbjct 61 VQQYSHGSTNPLWGEHGSWAEYIRLMFTNPAASSASQDYTIANVVLLPLFTIVDGYRRGL 120 Query 120 RRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 RRPWLYFVSSLFTSFAFAFAFYFAT+ERQ RHE++R TV A Sbjct 121 RRPWLYFVSSLFTSFAFAFAFYFATMERQRRHEQARETVPA 161 >gi|342860084|ref|ZP_08716736.1| hypothetical protein MCOL_14435 [Mycobacterium colombiense CECT 3035] gi|342132462|gb|EGT85691.1| hypothetical protein MCOL_14435 [Mycobacterium colombiense CECT 3035] Length=161 Score = 234 bits (596), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 124/160 (78%), Positives = 137/160 (86%), Gaps = 1/160 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLL HA LG+ VIGWIV++N VF RPA G FS E VYYVVGIAS+ALGWYFNI + Sbjct 1 MVSLLTHAVLGLAVIGWIVTANKGVFARPADGPLFSPMEVVYYVVGIASVALGWYFNITY 60 Query 61 VQQYAHGAANPLWGP-GSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGL 119 VQQY+HG+ NPLWG GSW EY++LMFTNPAASSA QDYTIANV+LLP+F+ DGYRRGL Sbjct 61 VQQYSHGSTNPLWGEHGSWLEYIKLMFTNPAASSASQDYTIANVVLLPIFTIVDGYRRGL 120 Query 120 RRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVG 159 R PWLYFVSSLFTSFAFAFAFYFAT+ERQ RHER RATVG Sbjct 121 RHPWLYFVSSLFTSFAFAFAFYFATMERQRRHERDRATVG 160 >gi|333992778|ref|YP_004525392.1| hypothetical protein JDM601_4138 [Mycobacterium sp. JDM601] gi|333488746|gb|AEF38138.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=157 Score = 225 bits (574), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 109/157 (70%), Positives = 126/157 (81%), Gaps = 1/157 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 M+SLLVHA LG+ +GWIV++N VF +P G FS E VYYV+GIASI LGWYFNIRF Sbjct 1 MISLLVHAVLGLATVGWIVAANRAVFAKPPQGGQFSPMEVVYYVIGIASIGLGWYFNIRF 60 Query 61 VQQYAHGAA-NPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGL 119 V +YA G NP+WGPGSW +Y++LM+TNPAA SA QDYTI NVILLPLF+ DGYRRGL Sbjct 61 VNEYAGGPNHNPIWGPGSWTQYIQLMYTNPAAGSASQDYTIINVILLPLFTVVDGYRRGL 120 Query 120 RRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRA 156 R PWLYFVSSLFTS AFA+AFYFAT+ERQ RH + A Sbjct 121 RHPWLYFVSSLFTSCAFAYAFYFATMERQRRHATATA 157 >gi|254822670|ref|ZP_05227671.1| hypothetical protein MintA_22264 [Mycobacterium intracellulare ATCC 13950] Length=165 Score = 222 bits (566), Expect = 1e-56, Method: Compositional matrix adjust. Identities = 119/154 (78%), Positives = 131/154 (86%), Gaps = 1/154 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSLL HA LG+ VIGWIV+SN KVF RPA G FS E VYYVVGIAS+ALGWYFNI F Sbjct 1 MVSLLTHAVLGLAVIGWIVTSNSKVFARPANGPLFSPMEIVYYVVGIASVALGWYFNITF 60 Query 61 VQQYAHGAANPLWGP-GSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGL 119 V +Y+ G+ NPLWG GSWAEY++LMFTNPAASSA QDYTIANVILLP+F+ DGYRRGL Sbjct 61 VHEYSQGSTNPLWGEHGSWAEYIKLMFTNPAASSASQDYTIANVILLPIFTIVDGYRRGL 120 Query 120 RRPWLYFVSSLFTSFAFAFAFYFATIERQHRHER 153 R PWLYFVSSLFTSFAFAFAFYFAT+ERQ RH + Sbjct 121 RHPWLYFVSSLFTSFAFAFAFYFATMERQRRHAQ 154 >gi|240172631|ref|ZP_04751290.1| hypothetical protein MkanA1_25175 [Mycobacterium kansasii ATCC 12478] Length=159 Score = 222 bits (565), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 106/160 (67%), Positives = 125/160 (79%), Gaps = 1/160 (0%) Query 1 MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASIALGWYFNIRF 60 MVSL+VHA LG+ V+ WIV+SNPKV+ +PAGG+W S ECVYY GIASI LGWYFNI F Sbjct 1 MVSLVVHALLGLAVVAWIVASNPKVYAKPAGGAWLSPLECVYYTAGIASIVLGWYFNIHF 60 Query 61 VQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLR 120 + AHG N + GPGS+ ++RL F NPAA S QDY IANV+LLP+F+ DGYRRGL+ Sbjct 61 MLD-AHGQGNLVSGPGSYPNFLRLQFANPAAGSGNQDYLIANVVLLPVFTIVDGYRRGLK 119 Query 121 RPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA 160 RPWL+FV+S FTSFAF A YFATIERQ RHERSR T+ A Sbjct 120 RPWLFFVASFFTSFAFPLACYFATIERQRRHERSRHTINA 159 >gi|33863910|ref|NP_895470.1| hypothetical protein PMT1643 [Prochlorococcus marinus str. MIT 9313] gi|33635494|emb|CAE21818.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9313] Length=119 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. Identities = 37/123 (31%), Positives = 57/123 (47%), Gaps = 13/123 (10%) Query 36 SLPECVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPG-SWAEYVRLMFTNPAASSA 94 SL + VY ++ I L NI F+QQY GP + +V L NPAA S Sbjct 6 SLLQWVYLILAITGAILPTLANIDFMQQY---------GPDFDISLFVALSNANPAAQSL 56 Query 95 GQDYTI-ANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHER 153 +D I A+ I ++ + R +R W+ +SS+ +FAFA + ER+ + Sbjct 57 SRDLIIGASAI--TIWIVVESRRLQMRHLWIVLLSSITIAFAFAAPLFLFLRERRLQEMA 114 Query 154 SRA 156 + A Sbjct 115 NHA 117 >gi|88854525|ref|ZP_01129192.1| hypothetical protein A20C1_09914 [marine actinobacterium PHSC20C1] gi|88816333|gb|EAR26188.1| hypothetical protein A20C1_09914 [marine actinobacterium PHSC20C1] Length=120 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 32/122 (27%), Positives = 51/122 (42%), Gaps = 14/122 (11%) Query 27 TRPAGGSWFSLPECVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMF 86 TRP ++ + V+ + + W+FN+ + Q G W Sbjct 6 TRPTILRHWNAKAITFAVLSVVGLVGTWFFNVLAIVQLRDYL-------GDW------FG 52 Query 87 TNPAASSAGQDYTIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIE 146 + PA +S G D + V + + R G++R WLY V S T+FAF F + A E Sbjct 53 SGPAVNSLGVDLLVVAVAG-SILIIIEARRLGMKRAWLYIVLSGITAFAFTFPLFLAMRE 111 Query 147 RQ 148 R+ Sbjct 112 RK 113 >gi|170780649|ref|YP_001708981.1| putative integral membrane protein [Clavibacter michiganensis subsp. sepedonicus] gi|169155217|emb|CAQ00318.1| putative integral membrane protein [Clavibacter michiganensis subsp. sepedonicus] Length=122 Score = 41.2 bits (95), Expect = 0.054, Method: Compositional matrix adjust. Identities = 37/122 (31%), Positives = 51/122 (42%), Gaps = 15/122 (12%) Query 40 CVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYT 99 VY V+ A + L W NIR V + A+ G S SS D Sbjct 15 VVYLVLAAAGLVLTWSANIRVVTEGRDFLADLSAGGAS-------------VSSLSWDLL 61 Query 100 IANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQ-HRHERSRATV 158 IA V + +F +G R +RR W+Y + + +FAFA + A E ER+ T Sbjct 62 IAAVASV-VFIVVEGRRLRMRRVWVYVLLAPLVAFAFALPLFLAAREMHLSAPERTEPTP 120 Query 159 GA 160 GA Sbjct 121 GA 122 >gi|323356511|ref|YP_004222907.1| hypothetical protein MTES_0063 [Microbacterium testaceum StLB037] gi|323272882|dbj|BAJ73027.1| hypothetical protein MTES_0063 [Microbacterium testaceum StLB037] Length=106 Score = 40.8 bits (94), Expect = 0.066, Method: Compositional matrix adjust. Identities = 33/121 (28%), Positives = 52/121 (43%), Gaps = 15/121 (12%) Query 40 CVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYT 99 ++ V+ +A + W FN+ + Q + + L+ + PA SS D Sbjct 1 MLFLVLAVAGLVGTWTFNVLAIVQMSDFLGD-------------LVTSGPAVSSITVDLL 47 Query 100 IANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVG 159 + I F + R G+R W Y V S T+FAF F + A + ++H R AT G Sbjct 48 VVA-IAGSAFIIIEARRLGMRFGWAYVVLSGITAFAFTFPLFLA-MRQRHLTARREATAG 105 Query 160 A 160 A Sbjct 106 A 106 >gi|148272116|ref|YP_001221677.1| hypothetical protein CMM_0936 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] gi|147830046|emb|CAN00975.1| hypothetical protein CMM_0936 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] Length=122 Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust. Identities = 34/119 (29%), Positives = 49/119 (42%), Gaps = 15/119 (12%) Query 40 CVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYT 99 VY V+ A + L W NIR V + A+ L P+ SS D Sbjct 15 VVYLVLAAAGLVLTWSANIRVVTEGRDFLAD-------------LSAGGPSVSSLSWDLL 61 Query 100 IANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQ-HRHERSRAT 157 IA V + +F +G R +RR W+Y + + +FA A + A E ER+ T Sbjct 62 IAAVASV-VFIVVEGRRLRMRRVWIYVLLAPLVAFAVALPVFLAAREMHLSAPERTEPT 119 >gi|124021945|ref|YP_001016252.1| hypothetical protein P9303_02321 [Prochlorococcus marinus str. MIT 9303] gi|123962231|gb|ABM76987.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9303] Length=107 Score = 37.4 bits (85), Expect = 0.74, Method: Compositional matrix adjust. Identities = 32/107 (30%), Positives = 49/107 (46%), Gaps = 13/107 (12%) Query 44 VVGIASIALGWYFNIRFVQQYAHGAANPLWGPG-SWAEYVRLMFTNPAASSAGQDYTI-A 101 ++ I L NI F+QQY GP + +V L NPAA S +D I A Sbjct 2 ILAITGAILPTLANIDFMQQY---------GPDFDISLFVALSNANPAAQSLSRDLIIGA 52 Query 102 NVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQ 148 + I ++ + R +R W+ +SS+ +FAFA + ER+ Sbjct 53 SAI--TIWIVVESRRLQMRHLWIVLLSSITIAFAFAAPLFLFLRERR 97 >gi|159903688|ref|YP_001551032.1| hypothetical protein P9211_11471 [Prochlorococcus marinus str. MIT 9211] gi|159888864|gb|ABX09078.1| Hypothetical protein P9211_11471 [Prochlorococcus marinus str. MIT 9211] Length=123 Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust. Identities = 29/100 (29%), Positives = 44/100 (44%), Gaps = 13/100 (13%) Query 40 CVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPG-SWAEYVRLMFTNPAASSAGQDY 98 C+Y ++ I L NI F+ Y GP +++L NPAA S +D Sbjct 10 CIYVLLAILGAVLPMLANIDFINNY---------GPSFDLDNFIKLANINPAAQSLSRDL 60 Query 99 TI-ANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFA 137 I A + +FS + R ++ WL +S +FAFA Sbjct 61 FIGAGATTIWMFS--EARRLKIKHFWLVIISMFVIAFAFA 98 >gi|109897233|ref|YP_660488.1| hypothetical protein Patl_0908 [Pseudoalteromonas atlantica T6c] gi|109699514|gb|ABG39434.1| conserved hypothetical protein [Pseudoalteromonas atlantica T6c] Length=107 Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust. Identities = 19/65 (30%), Positives = 31/65 (48%), Gaps = 1/65 (1%) Query 88 NPAASSAGQDYTIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIER 147 NP + A D ++ ++LL F DG R ++ WL + +L +F F Y E+ Sbjct 41 NPISIFAWLDVLVSALVLLT-FIVVDGKRNKVKYHWLAVLGTLCVGVSFGFPLYLYLKEK 99 Query 148 QHRHE 152 QH + Sbjct 100 QHLNS 104 >gi|145224916|ref|YP_001135594.1| hypothetical protein Mflv_4337 [Mycobacterium gilvum PYR-GCK] gi|315445245|ref|YP_004078124.1| hypothetical protein Mspyr1_36820 [Mycobacterium sp. Spyr1] gi|145217402|gb|ABP46806.1| hypothetical protein Mflv_4337 [Mycobacterium gilvum PYR-GCK] gi|315263548|gb|ADU00290.1| hypothetical protein Mspyr1_36820 [Mycobacterium sp. Spyr1] Length=121 Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust. Identities = 26/91 (29%), Positives = 41/91 (46%), Gaps = 4/91 (4%) Query 66 HGAANPLWGPG-SWAEYVRLMFTNPAASSAGQDYTIANVILLPLFSTTDGYRRGLRRPWL 124 H A L G G S +++R + N A +S D + V + +F + R G+ R WL Sbjct 26 HNVAFILSGQGESLLDFIRAAYANHAGASLTNDLLLLGVAVF-VFMVVEARRLGIARIWL 84 Query 125 YFVSSLFTSFAFAFAFYFATIERQHRHERSR 155 Y V S+ + + + I RQ R+R Sbjct 85 YLVISIGVAISVGLPLFL--IVRQVALARTR 113 >gi|113955407|ref|YP_730669.1| hypothetical protein sync_1464 [Synechococcus sp. CC9311] gi|113882758|gb|ABI47716.1| conserved hypothetical protein [Synechococcus sp. CC9311] Length=121 Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust. Identities = 31/119 (27%), Positives = 50/119 (43%), Gaps = 15/119 (12%) Query 36 SLPECVYYVVGIASIALGWYFNIRFVQQ-YAHGAANPL-----WGPGSWAEYVRLMFTNP 89 L +Y V + + WY +F+Q+ A G +PL + G WA N Sbjct 3 KLRLLIYAVTAVGGVVWPWYCIYQFIQETEALGLTDPLEIVELFSQGVWA--------NA 54 Query 90 AASSAGQDYTIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQ 148 +A D T+ + F + R ++ +LYFV++ SFAF+F + ER Sbjct 55 SAGFIAADLTLVLIAAFA-FIVAEAMRLKMKYWYLYFVATFGISFAFSFGLFMFNRERN 112 >gi|148272115|ref|YP_001221676.1| hypothetical protein CMM_0935 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] gi|147830045|emb|CAN00974.1| hypothetical protein CMM_0935 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] Length=115 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 29/116 (25%), Positives = 50/116 (44%), Gaps = 16/116 (13%) Query 40 CVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMF-TNPAASSAGQDY 98 V+ V+ +A + + W +NI + S +Y+ F + P+ SS D Sbjct 13 VVHLVLALAGVVVTWTYNITAIT--------------SGRDYLGDWFGSGPSVSSLTADV 58 Query 99 TIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERS 154 IA + + +F +G R G+R W++ V + AFA + E + R E S Sbjct 59 LIAAIAAV-VFILVEGRRLGIRFAWIFVVLIPLVALAFALPLFLGVREMRVRTEGS 113 >gi|119489269|ref|ZP_01622076.1| hypothetical protein L8106_07436 [Lyngbya sp. PCC 8106] gi|119454743|gb|EAW35888.1| hypothetical protein L8106_07436 [Lyngbya sp. PCC 8106] Length=112 Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust. Identities = 27/123 (22%), Positives = 54/123 (44%), Gaps = 15/123 (12%) Query 36 SLPECVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAG 95 +L + VY+++ IA + WY+N++F + GS E+V N A S Sbjct 2 NLKKIVYFILAIAGLIFPWYYNVQF------------FLTGSLGEFVAASSGNLATQSIS 49 Query 96 QDYTIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSR 155 D IA V+ ++ + R ++ ++Y + +++FA + R+ + E Sbjct 50 LDLFIATVV-GSIWMYFESKRLSIKFGFIYILIGFLIAYSFALPLFLYV--RESKLEDEL 106 Query 156 ATV 158 T+ Sbjct 107 DTI 109 >gi|67525477|ref|XP_660800.1| hypothetical protein AN3196.2 [Aspergillus nidulans FGSC A4] gi|40743773|gb|EAA62960.1| hypothetical protein AN3196.2 [Aspergillus nidulans FGSC A4] gi|259485845|tpe|CBF83213.1| TPA: unsaturated rhamnogalacturonan hydrolase (Eurofung) [Aspergillus nidulans FGSC A4] Length=370 Score = 33.9 bits (76), Expect = 8.2, Method: Compositional matrix adjust. Identities = 32/103 (32%), Positives = 41/103 (40%), Gaps = 9/103 (8%) Query 53 GWYFNIRFVQQYAHGAANPLWGPG-SWA-----EYVRLMFTNPAASSAGQ--DYTIANVI 104 GW FN + H AN W G SW E++ L+ P+ D IA V Sbjct 199 GWEFNANQATGHGHNFANARWARGNSWVTIVIPEFIELLDLQPSDPIRVHLVDTLIAQVE 258 Query 105 LLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIER 147 L T DGY R L +V S T+ FA+ A +R Sbjct 259 ALKRLQTNDGYWRTLLDHEDSYVESSATA-GFAWGILKAVRKR 300 >gi|66876421|gb|AAY57986.1| hypothetical protein [Lyngbya majuscula CCAP 1446/4] Length=107 Score = 33.5 bits (75), Expect = 9.6, Method: Compositional matrix adjust. Identities = 25/119 (22%), Positives = 52/119 (44%), Gaps = 13/119 (10%) Query 36 SLPECVYYVVGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAG 95 ++ + +Y ++ IA + WY+NI+F + GS AE+V N A S Sbjct 2 NIKKIIYLILAIAGLIFPWYYNIQF------------FLTGSLAEFVAASSGNLATQSIS 49 Query 96 QDYTIANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERS 154 D IA V+ ++ + R ++ ++Y + +++FA + E + ++ Sbjct 50 FDLFIATVV-GSIWIYFESKRLNMKFGFIYILIGFLIAYSFALPLFLYVRETKLEEQKE 107 Lambda K H 0.327 0.138 0.456 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127750398496 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40