BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0398c Length=213 Score E Sequences producing significant alignments: (Bits) Value gi|15607539|ref|NP_214912.1| hypothetical protein Rv0398c [Mycob... 426 2e-117 gi|340625424|ref|YP_004743876.1| putative secreted protein [Myco... 424 4e-117 gi|289744094|ref|ZP_06503472.1| conserved hypothetical protein [... 423 7e-117 gi|183980718|ref|YP_001849009.1| hypothetical protein MMAR_0695 ... 274 5e-72 gi|240168860|ref|ZP_04747519.1| hypothetical protein MkanA1_0608... 241 4e-62 gi|342859124|ref|ZP_08715778.1| hypothetical protein MCOL_09608 ... 216 1e-54 gi|254818882|ref|ZP_05223883.1| hypothetical protein MintA_03099... 205 3e-51 gi|254777208|ref|ZP_05218724.1| hypothetical protein MaviaA2_214... 203 1e-50 gi|336460384|gb|EGO39284.1| hypothetical protein MAPs_41820 [Myc... 196 2e-48 gi|41409973|ref|NP_962809.1| hypothetical protein MAP3875c [Myco... 194 7e-48 gi|296167854|ref|ZP_06850037.1| conserved hypothetical protein [... 193 1e-47 gi|118462232|ref|YP_883897.1| hypothetical protein MAV_4770 [Myc... 189 3e-46 gi|183980258|ref|YP_001848549.1| hypothetical protein MMAR_0225 ... 179 2e-43 gi|118619922|ref|YP_908254.1| hypothetical protein MUL_4889 [Myc... 161 5e-38 gi|340345909|ref|ZP_08669039.1| hypothetical protein HMPREF9136_... 39.7 0.29 gi|71905867|ref|YP_283454.1| PAS [Dechloromonas aromatica RCB] >... 38.5 0.65 gi|333990313|ref|YP_004522927.1| hypothetical protein JDM601_167... 38.5 0.72 gi|17230345|ref|NP_486893.1| UDP-glucose 4-epimerase [Nostoc sp.... 37.4 1.6 gi|197116672|ref|YP_002137099.1| Cache/HAMP/PAS/PAC domain-conta... 35.0 6.8 >gi|15607539|ref|NP_214912.1| hypothetical protein Rv0398c [Mycobacterium tuberculosis H37Rv] gi|15839781|ref|NP_334818.1| hypothetical protein MT0408 [Mycobacterium tuberculosis CDC1551] gi|31791574|ref|NP_854067.1| hypothetical protein Mb0404c [Mycobacterium bovis AF2122/97] 68 more sequence titlesLength=213 Score = 426 bits (1094), Expect = 2e-117, Method: Compositional matrix adjust. Identities = 213/213 (100%), Positives = 213/213 (100%), Gaps = 0/213 (0%) Query 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ Sbjct 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 Query 61 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA Sbjct 61 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 Query 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR Sbjct 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 Query 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG Sbjct 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 >gi|340625424|ref|YP_004743876.1| putative secreted protein [Mycobacterium canettii CIPT 140010059] gi|340003614|emb|CCC42736.1| putative secreted protein [Mycobacterium canettii CIPT 140010059] Length=213 Score = 424 bits (1090), Expect = 4e-117, Method: Compositional matrix adjust. Identities = 212/213 (99%), Positives = 213/213 (100%), Gaps = 0/213 (0%) Query 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ Sbjct 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 Query 61 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 LFSFG+PTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA Sbjct 61 LFSFGNPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 Query 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR Sbjct 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 Query 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG Sbjct 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 >gi|289744094|ref|ZP_06503472.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289756466|ref|ZP_06515844.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|294995154|ref|ZP_06800845.1| secreted protein [Mycobacterium tuberculosis 210] 7 more sequence titles Length=213 Score = 423 bits (1088), Expect = 7e-117, Method: Compositional matrix adjust. Identities = 212/213 (99%), Positives = 213/213 (100%), Gaps = 0/213 (0%) Query 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 MGVIARVVGVAACGLSLAVLAAAPTAGA+PTGALPPMTSSGSGPVIGDGDAALRQRISQQ Sbjct 1 MGVIARVVGVAACGLSLAVLAAAPTAGADPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 Query 61 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA Sbjct 61 LFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARA 120 Query 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR Sbjct 121 YRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRR 180 Query 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG Sbjct 181 GEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 213 >gi|183980718|ref|YP_001849009.1| hypothetical protein MMAR_0695 [Mycobacterium marinum M] gi|183174044|gb|ACC39154.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=216 Score = 274 bits (701), Expect = 5e-72, Method: Compositional matrix adjust. Identities = 155/209 (75%), Positives = 171/209 (82%), Gaps = 1/209 (0%) Query 4 IARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFS 63 I + VA C LSLA LA TA A+PT LPPMTS+GSGP+IG G +L Q ISQQL S Sbjct 8 IRCIATVATCVLSLATLAVPATAAADPTVTLPPMTSTGSGPIIGGGSTSLGQLISQQLIS 67 Query 64 FGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRR 123 F P +QEVDGSDAAQF+ AAA V DR++ASVFL LQR LGCQ + AGSG GFGARAYRR Sbjct 68 FDKPDIQEVDGSDAAQFVAAAAGVDDRELASVFLLLQRALGCQPDPAGSGTGFGARAYRR 127 Query 124 TDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEE 183 +DGQWGGAMLV+AKSTV+DVDALKAC+KSGWR+ATAG P SMCN+GWTYPPF RRG E Sbjct 128 SDGQWGGAMLVIAKSTVTDVDALKACIKSGWRRATAGGPDSMCNSGWTYPPFTG-RRGGE 186 Query 184 GYFVLLAGTASDFCSAPNANYRTTASSWP 212 GYFVLLAGTASDFCS PNA YRTTASSWP Sbjct 187 GYFVLLAGTASDFCSVPNAKYRTTASSWP 215 >gi|240168860|ref|ZP_04747519.1| hypothetical protein MkanA1_06085 [Mycobacterium kansasii ATCC 12478] Length=208 Score = 241 bits (616), Expect = 4e-62, Method: Compositional matrix adjust. Identities = 127/206 (62%), Positives = 148/206 (72%), Gaps = 3/206 (1%) Query 7 VVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGD 66 VV VA CGLSL L AP A A+PT LPPMTS+GSGP+I AA +SQQL + Sbjct 6 VVAVAVCGLSLGALTVAPKAAADPTVTLPPMTSTGSGPMIAGTGAA--PGLSQQLTGLNN 63 Query 67 PTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRRTDG 126 P VQEVDGSDAAQFITAAA+V D +A+ F+ L R L CQ N AGSG FGARAYRR DG Sbjct 64 PNVQEVDGSDAAQFITAAASVTDPQLAAPFVLLHRALACQGNGAGSGPPFGARAYRRADG 123 Query 127 QWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYF 186 +WGGAMLV AKS ++VDAL +CVK+ WR++TAG +MCNNGWT P D RR E Y+ Sbjct 124 RWGGAMLVAAKSATANVDALVSCVKTNWRRSTAGGENAMCNNGWTTPTDGDLRR-TEVYY 182 Query 187 VLLAGTASDFCSAPNANYRTTASSWP 212 +LLAGTA+DFC+ NA Y T AS WP Sbjct 183 ILLAGTAADFCTGLNARYATNASGWP 208 >gi|342859124|ref|ZP_08715778.1| hypothetical protein MCOL_09608 [Mycobacterium colombiense CECT 3035] gi|342133365|gb|EGT86568.1| hypothetical protein MCOL_09608 [Mycobacterium colombiense CECT 3035] Length=206 Score = 216 bits (551), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 123/213 (58%), Positives = 148/213 (70%), Gaps = 9/213 (4%) Query 1 MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQ 60 M + AR + A L+L P AGA+ MTSSG+GP+IG G AA Q ISQQ Sbjct 1 MRITARAMRTMAALLALGATTGMPAAGADAALPA--MTSSGAGPIIGGGSAA-GQGISQQ 57 Query 61 LFSFGDP-TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGAR 119 LFSFG+P TVQEVDGSDAAQFITAAA A++ +A F L+R L CQ N AG FGAR Sbjct 58 LFSFGNPNTVQEVDGSDAAQFITAAAGSANQQLAGPFALLRRALTCQTNNAG----FGAR 113 Query 120 AYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTR 179 AYRR DGQWGGAMLV AKS +VD+L AC K+ WR+ TAG +MCNNGWT P ++R Sbjct 114 AYRRNDGQWGGAMLVAAKSATPNVDSLVACAKTNWRRTTAGGEAAMCNNGWTTPTSYESR 173 Query 180 RGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 RGE Y++LLAGTA+DFCS NA Y++ A++WP Sbjct 174 RGET-YYILLAGTAADFCSDLNAKYKSDANAWP 205 >gi|254818882|ref|ZP_05223883.1| hypothetical protein MintA_03099 [Mycobacterium intracellulare ATCC 13950] Length=211 Score = 205 bits (522), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 108/175 (62%), Positives = 129/175 (74%), Gaps = 7/175 (4%) Query 39 SSGSGPVIGDGDAALRQRISQQLFSFGDPTV-QEVDGSDAAQFITAAAAVADRDVASVFL 97 SSGSGP+IG G AA Q I+QQLFSFG+P V QEVDGSDAAQFIT+AA A+ +A F Sbjct 42 SSGSGPIIGAGSAAA-QGIAQQLFSFGNPNVVQEVDGSDAAQFITSAAGAANPQLAGPFS 100 Query 98 PLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKA 157 L+R L CQ + +G FGARAYRR+DGQWGGAMLV AKS +D L CVK+ WR+ Sbjct 101 LLRRALACQTDNSG----FGARAYRRSDGQWGGAMLVAAKSATPSLDGLVGCVKTNWRRT 156 Query 158 TAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 AG T+MCNNGWT P D+RRGE Y++LLAGTA+DFCS NA Y++ AS+WP Sbjct 157 AAGGETAMCNNGWTTPTSYDSRRGET-YYILLAGTAADFCSDLNAKYKSDASAWP 210 >gi|254777208|ref|ZP_05218724.1| hypothetical protein MaviaA2_21414 [Mycobacterium avium subsp. avium ATCC 25291] Length=206 Score = 203 bits (517), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 109/177 (62%), Positives = 129/177 (73%), Gaps = 7/177 (3%) Query 37 MTSSGSGPVIGDGDAALRQRISQQLFSFGDP-TVQEVDGSDAAQFITAAAAVADRDVASV 95 MTS GS P+IG G AA Q ISQQLFSFG+P VQE+DGSDAAQFITAAA A++ +A+ Sbjct 35 MTSGGSRPIIGGGSAAA-QGISQQLFSFGNPGVVQEIDGSDAAQFITAAAGAANQQLAAP 93 Query 96 FLPLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWR 155 F L+R L Q N AG FGARAYRR DGQWGGAMLV AKS D L AC K+ WR Sbjct 94 FSLLRRALTYQDNNAG----FGARAYRRNDGQWGGAMLVAAKSATPSPDNLAACAKTNWR 149 Query 156 KATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 +ATAG T+MCN+GWT P ++RRGE Y++LLAGTA+DFCS NA Y++ A+ WP Sbjct 150 RATAGGETAMCNSGWTTPTSYESRRGET-YYILLAGTAADFCSDLNAKYKSDATGWP 205 >gi|336460384|gb|EGO39284.1| hypothetical protein MAPs_41820 [Mycobacterium avium subsp. paratuberculosis S397] Length=206 Score = 196 bits (498), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 101/159 (64%), Positives = 119/159 (75%), Gaps = 6/159 (3%) Query 55 QRISQQLFSFGDP-TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSG 113 Q ISQQLFSFG+P VQEVDGSDAAQFITAAA A++ +A+ F L+R L CQ N AG Sbjct 52 QAISQQLFSFGNPGVVQEVDGSDAAQFITAAAGAANQQLAAPFSLLRRALTCQDNNAG-- 109 Query 114 AGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYP 173 FGARAYRR DGQWGGAMLVVAKS D L AC K+ WR+ATAG T+MCN+GWT P Sbjct 110 --FGARAYRRNDGQWGGAMLVVAKSATPSPDNLAACAKTNWRRATAGGETAMCNSGWTTP 167 Query 174 PFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 ++RRGE Y++LLAGTA+DFCS NA Y++ A+ WP Sbjct 168 TSYESRRGET-YYILLAGTAADFCSDLNAKYKSDATGWP 205 >gi|41409973|ref|NP_962809.1| hypothetical protein MAP3875c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41398806|gb|AAS06425.1| hypothetical protein MAP_3875c [Mycobacterium avium subsp. paratuberculosis K-10] Length=206 Score = 194 bits (493), Expect = 7e-48, Method: Compositional matrix adjust. Identities = 100/159 (63%), Positives = 118/159 (75%), Gaps = 6/159 (3%) Query 55 QRISQQLFSFGDP-TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSG 113 Q ISQQLFSFG+P VQEVDGSDAAQFITAAA A++ +A+ F L+R L CQ N AG Sbjct 52 QAISQQLFSFGNPGVVQEVDGSDAAQFITAAAGAANQQLAAPFSLLRRALTCQDNNAG-- 109 Query 114 AGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYP 173 FGARAYRR DGQWGGAMLV AKS D L AC K+ WR+ATAG T+MCN+GWT P Sbjct 110 --FGARAYRRNDGQWGGAMLVAAKSATPSPDNLAACAKTNWRRATAGGETAMCNSGWTTP 167 Query 174 PFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 ++RRGE Y++LLAGTA+DFCS NA Y++ A+ WP Sbjct 168 TSYESRRGET-YYILLAGTAADFCSDLNAKYKSDATGWP 205 >gi|296167854|ref|ZP_06850037.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896979|gb|EFG76602.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=206 Score = 193 bits (491), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 110/184 (60%), Positives = 134/184 (73%), Gaps = 8/184 (4%) Query 29 EPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVA 88 +PT LPPM SSG GP+IG G A I+QQLF FG P VQEVDGSDAAQFITAAA A Sbjct 30 DPTVTLPPMASSGGGPIIGGGSNA---GIAQQLFGFGAPNVQEVDGSDAAQFITAAAGSA 86 Query 89 DRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKA 148 + +A+ F ++R L CQ N AG FGARAYRRTDGQWGGAM+V AKS S++DAL A Sbjct 87 NPQLAAPFSLMRRALACQPNDAG----FGARAYRRTDGQWGGAMVVAAKSATSNLDALVA 142 Query 149 CVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTA 208 C K+ WR+ATAG +SMCN+GW+ P ++RRGE Y+VLLAGTA DFC+A N ++ + Sbjct 143 CAKTNWRRATAGNQSSMCNSGWSTPNTYESRRGET-YYVLLAGTADDFCTAVNGKFKDNS 201 Query 209 SSWP 212 + WP Sbjct 202 NGWP 205 >gi|118462232|ref|YP_883897.1| hypothetical protein MAV_4770 [Mycobacterium avium 104] gi|118163519|gb|ABK64416.1| conserved hypothetical protein [Mycobacterium avium 104] Length=206 Score = 189 bits (479), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 98/159 (62%), Positives = 117/159 (74%), Gaps = 6/159 (3%) Query 55 QRISQQLFSFGDP-TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSG 113 Q ISQQLFSFG+P VQE+DGSDAAQFITAAA A++ +A+ F L+R L CQ N AG Sbjct 52 QGISQQLFSFGNPGVVQEIDGSDAAQFITAAAGAANQQLAAPFSLLRRALTCQDNNAG-- 109 Query 114 AGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYP 173 FGARAYRR DGQWGGAMLV AKS D L AC K+ WR+ATAG T+MCN+G T P Sbjct 110 --FGARAYRRNDGQWGGAMLVAAKSATPSPDNLAACAKTNWRRATAGGETAMCNSGCTTP 167 Query 174 PFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 ++RRGE Y++LLAGTA+DFCS NA Y++ A+ WP Sbjct 168 TSYESRRGET-YYILLAGTAADFCSDLNAKYKSDATGWP 205 >gi|183980258|ref|YP_001848549.1| hypothetical protein MMAR_0225 [Mycobacterium marinum M] gi|183173584|gb|ACC38694.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=206 Score = 179 bits (454), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 105/206 (51%), Positives = 136/206 (67%), Gaps = 7/206 (3%) Query 8 VGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDP 67 + +AA L + AP A P LPP+ S+G GP+IG GD A + RI++QL S G+ Sbjct 7 MNIAAWVLCAGAVVGAPPVQAAPGAVLPPIPSTGGGPIIGGGDEAAQSRIARQLTSVGNA 66 Query 68 TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQ-NTAGSGAGFGARAYRRTDG 126 VQE DG+DAA FI +A VAD +AS F+PLQR LGCQ+ NT+ FGARAYRR+DG Sbjct 67 DVQEGDGADAASFIMDSARVADSRLASAFVPLQRALGCQKVNTS-----FGARAYRRSDG 121 Query 127 QWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYF 186 WGGAMLV+A+S SD+ AL C++S W ATAG +MC +GWTYP R E Y Sbjct 122 GWGGAMLVIAESATSDMQALTGCIRSVWPAATAGGSNAMCAHGWTYPTSGQDHR-PETYH 180 Query 187 VLLAGTASDFCSAPNANYRTTASSWP 212 +LLAGTA+DFC+ + +Y A++WP Sbjct 181 ILLAGTAADFCAKFDEDYANFATNWP 206 >gi|118619922|ref|YP_908254.1| hypothetical protein MUL_4889 [Mycobacterium ulcerans Agy99] gi|118572032|gb|ABL06783.1| conserved hypothetical secreted protein [Mycobacterium ulcerans Agy99] Length=207 Score = 161 bits (408), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 104/213 (49%), Positives = 133/213 (63%), Gaps = 14/213 (6%) Query 8 VGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDP 67 + +AA L + AP A P LPP+ S+G GP+IG GD A + RI++QL S G+ Sbjct 1 MNIAAWVLCAGAVVGAPPVQAAPGAVLPPIPSTGGGPIIGGGDEAAQSRIARQLMSVGNA 60 Query 68 TVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQ-NTAGSGAGFGARAYRRTD- 125 VQE DG+DAA FI +A VAD +AS F PLQR LGCQ+ NT+ FGARAYRR+D Sbjct 61 DVQEGDGADAASFIMDSAQVADSRLASAFAPLQRALGCQKVNTS-----FGARAYRRSDG 115 Query 126 ------GQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTR 179 G GG MLV+AKS SD+ AL C++S W ATAG +MC +GWTYP Sbjct 116 GGGGGGGGGGGGMLVIAKSATSDMQALTGCIRSVWPAATAGGSNAMCAHGWTYPTSGQDH 175 Query 180 RGEEGYFVLLAGTASDFCSAPNANYRTTASSWP 212 R E Y +LLAGTA+DFC+ + +Y A++WP Sbjct 176 R-PETYHILLAGTAADFCAKLDEDYANFATNWP 207 >gi|340345909|ref|ZP_08669039.1| hypothetical protein HMPREF9136_0036 [Prevotella dentalis DSM 3688] gi|339612896|gb|EGQ17692.1| hypothetical protein HMPREF9136_0036 [Prevotella dentalis DSM 3688] Length=996 Score = 39.7 bits (91), Expect = 0.29, Method: Compositional matrix adjust. Identities = 42/125 (34%), Positives = 53/125 (43%), Gaps = 23/125 (18%) Query 26 AGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAA 85 A A PT LP SG V +G RQR ++ L+ G PTV ++ SD F T Sbjct 29 AVARPTSYLP----SGLDIVCING----RQRYTRALY--GGPTVFRLETSDRPVFATYYN 78 Query 86 AVADRD------VASVFLPLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKST 139 R V + LPL C+ G G R YR D QWGG L V + Sbjct 79 KGGHRHIDFALRVGTAKLPLDSTAWCEARYQG-----GRRTYRLRDPQWGGGELHV--TA 131 Query 140 VSDVD 144 ++D D Sbjct 132 LADAD 136 >gi|71905867|ref|YP_283454.1| PAS [Dechloromonas aromatica RCB] gi|71845488|gb|AAZ44984.1| PAS/PAC sensor hybrid histidine kinase [Dechloromonas aromatica RCB] Length=673 Score = 38.5 bits (88), Expect = 0.65, Method: Compositional matrix adjust. Identities = 26/85 (31%), Positives = 39/85 (46%), Gaps = 5/85 (5%) Query 18 AVLAAAPTAGAEPTGALP----PMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQEVD 73 +VL A P A EP LP + G ++ + D +R+ + QQL G P ++ D Sbjct 527 SVLMALPLAAREPDIDLPGEEAALPHGGELVLLVEDDPNVRRVVRQQLIDLGHPVIEAED 586 Query 74 GSDAAQFITAAAAVADRDVASVFLP 98 GS A I +A V+ V +P Sbjct 587 GSQALDMIDQIHDIA-VVVSDVIMP 610 >gi|333990313|ref|YP_004522927.1| hypothetical protein JDM601_1673 [Mycobacterium sp. JDM601] gi|333486281|gb|AEF35673.1| hypothetical protein JDM601_1673 [Mycobacterium sp. JDM601] Length=100 Score = 38.5 bits (88), Expect = 0.72, Method: Compositional matrix adjust. Identities = 18/52 (35%), Positives = 27/52 (52%), Gaps = 0/52 (0%) Query 72 VDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRR 123 +DG Q A A ++D+ ++ PL +V+G + N G FG R YRR Sbjct 3 IDGHAIGQLDYIADAAEEQDLVALIRPLSQVIGVEVNAYGPADTFGERTYRR 54 >gi|17230345|ref|NP_486893.1| UDP-glucose 4-epimerase [Nostoc sp. PCC 7120] gi|17131947|dbj|BAB74552.1| UDP-glucose 4-epimerase [Nostoc sp. PCC 7120] Length=336 Score = 37.4 bits (85), Expect = 1.6, Method: Compositional matrix adjust. Identities = 33/111 (30%), Positives = 46/111 (42%), Gaps = 11/111 (9%) Query 12 ACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQE 71 A L +L AGAEP G L MT S + DAAL++R+ ++F PT Sbjct 159 ASDLRYVILRYFNVAGAEPGGRLGQMTKDASHLIRVTCDAALKRRLGVKIFGTDFPT--- 215 Query 72 VDGSDAAQFITAAAAVADRDVASVFLPL---QRVLGCQQNTAGSGAGFGAR 119 DG+ +I A V+L ++L C G G G+ R Sbjct 216 PDGTAIRDYIHVEDLATAHLDALVYLEQGNSSQILNC-----GYGQGYSVR 261 >gi|197116672|ref|YP_002137099.1| Cache/HAMP/PAS/PAC domain-containing sensor histidine kinase response regulator [Geobacter bemidjiensis Bem] gi|197086032|gb|ACH37303.1| sensor histidine kinase response regulator, Cache, HAMP, PAS, PAC, PAS and PAC domain-containing protein [Geobacter bemidjiensis Bem] Length=1113 Score = 35.0 bits (79), Expect = 6.8, Method: Composition-based stats. Identities = 17/51 (34%), Positives = 26/51 (51%), Gaps = 0/51 (0%) Query 48 DGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLP 98 + DA +R+ + + L FG ++ VDG DA Q AAA + V +P Sbjct 995 EDDAVIREMVGELLEEFGYRVIKAVDGEDAVQTFRGAAAEVQLVILDVIMP 1045 Lambda K H 0.317 0.130 0.398 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 251148961304 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40