BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3755c
Length=199
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610891|ref|NP_218272.1| hypothetical protein Rv3755c [Mycob... 404 5e-111
gi|340628726|ref|YP_004747178.1| hypothetical protein MCAN_37751... 402 2e-110
gi|289445354|ref|ZP_06435098.1| conserved hypothetical protein [... 400 6e-110
gi|308232556|ref|ZP_07416452.2| hypothetical protein TMAG_00245 ... 375 3e-102
gi|240168514|ref|ZP_04747173.1| hypothetical protein MkanA1_0432... 368 2e-100
gi|183985266|ref|YP_001853557.1| hypothetical protein MMAR_5298 ... 360 5e-98
gi|118619513|ref|YP_907845.1| hypothetical protein MUL_4373 [Myc... 358 2e-97
gi|339296563|gb|AEJ48674.1| hypothetical protein CCDC5079_3485 [... 352 2e-95
gi|296166873|ref|ZP_06849290.1| conserved hypothetical protein [... 327 5e-88
gi|254822448|ref|ZP_05227449.1| hypothetical protein MintA_21116... 322 2e-86
gi|41406374|ref|NP_959210.1| hypothetical protein MAP0276 [Mycob... 320 7e-86
gi|342860000|ref|ZP_08716652.1| hypothetical protein MCOL_14010 ... 316 1e-84
gi|336459998|gb|EGO38908.1| hypothetical protein MAPs_44750 [Myc... 303 8e-81
gi|108801926|ref|YP_642123.1| hypothetical protein Mmcs_4963 [My... 277 7e-73
gi|118463305|ref|YP_879609.1| hypothetical protein MAV_0322 [Myc... 269 2e-70
gi|333992676|ref|YP_004525290.1| hypothetical protein JDM601_403... 261 3e-68
gi|118472131|ref|YP_890547.1| hypothetical protein MSMEG_6329 [M... 255 3e-66
gi|169627370|ref|YP_001701019.1| hypothetical protein MAB_0265 [... 251 6e-65
gi|120406539|ref|YP_956368.1| hypothetical protein Mvan_5597 [My... 248 4e-64
gi|145221803|ref|YP_001132481.1| hypothetical protein Mflv_1211 ... 242 2e-62
gi|226303816|ref|YP_002763774.1| hypothetical protein RER_03270 ... 216 2e-54
gi|226363493|ref|YP_002781275.1| hypothetical protein ROP_40830 ... 204 5e-51
gi|111021131|ref|YP_704103.1| hypothetical protein RHA1_ro04151 ... 203 1e-50
gi|312137788|ref|YP_004005124.1| hypothetical protein REQ_02940 ... 198 4e-49
gi|54022199|ref|YP_116441.1| hypothetical protein nfa2350 [Nocar... 197 5e-49
gi|333917869|ref|YP_004491450.1| hypothetical protein AS9A_0190 ... 195 3e-48
gi|296141718|ref|YP_003648961.1| hypothetical protein Tpau_4051 ... 192 2e-47
gi|229492484|ref|ZP_04386287.1| conserved hypothetical protein [... 186 2e-45
gi|262200334|ref|YP_003271542.1| hypothetical protein Gbro_0307 ... 184 5e-45
gi|343926155|ref|ZP_08765664.1| hypothetical protein GOALK_056_0... 176 2e-42
gi|296394997|ref|YP_003659881.1| hypothetical protein Srot_2615 ... 175 4e-42
gi|317508898|ref|ZP_07966535.1| hypothetical protein HMPREF9336_... 172 3e-41
gi|326385122|ref|ZP_08206791.1| hypothetical protein SCNU_19350 ... 135 3e-30
gi|256374320|ref|YP_003097980.1| hypothetical protein Amir_0163 ... 133 1e-29
gi|331694217|ref|YP_004330456.1| hypothetical protein Psed_0330 ... 132 4e-29
gi|325001737|ref|ZP_08122849.1| hypothetical protein PseP1_23386... 132 4e-29
gi|319948571|ref|ZP_08022700.1| hypothetical protein ES5_04316 [... 101 6e-20
gi|258650775|ref|YP_003199931.1| hypothetical protein Namu_0524 ... 94.0 1e-17
gi|163758824|ref|ZP_02165911.1| hypothetical protein HPDFL43_154... 81.6 5e-14
gi|153008519|ref|YP_001369734.1| hypothetical protein Oant_1188 ... 79.0 4e-13
gi|330820254|ref|YP_004349116.1| hypothetical protein bgla_2g115... 73.2 2e-11
gi|295699345|ref|YP_003607238.1| hypothetical protein BC1002_370... 72.8 2e-11
gi|15966517|ref|NP_386870.1| hypothetical protein SMc03980 [Sino... 72.0 4e-11
gi|51892410|ref|YP_075101.1| hypothetical protein STH1272 [Symbi... 72.0 4e-11
gi|307727694|ref|YP_003910907.1| hypothetical protein BC1003_570... 72.0 4e-11
gi|323529891|ref|YP_004232043.1| hypothetical protein BC1001_560... 72.0 4e-11
gi|91778317|ref|YP_553525.1| hypothetical protein Bxe_B1793 [Bur... 71.2 8e-11
gi|187780008|ref|ZP_02996481.1| hypothetical protein CLOSPO_0360... 71.2 8e-11
gi|170695809|ref|ZP_02886950.1| protein of unknown function DUF1... 70.9 1e-10
gi|296160708|ref|ZP_06843522.1| protein of unknown function DUF1... 70.1 2e-10
>gi|15610891|ref|NP_218272.1| hypothetical protein Rv3755c [Mycobacterium tuberculosis H37Rv]
gi|15843375|ref|NP_338412.1| hypothetical protein MT3862 [Mycobacterium tuberculosis CDC1551]
gi|31794925|ref|NP_857418.1| hypothetical protein Mb3781c [Mycobacterium bovis AF2122/97]
61 more sequence titles
Length=199
Score = 404 bits (1037), Expect = 5e-111, Method: Compositional matrix adjust.
Identities = 198/199 (99%), Positives = 199/199 (100%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY
Sbjct 1 MNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS
Sbjct 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT
Sbjct 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVDSDGFIVDYPGLAERM
Sbjct 181 VTVDSDGFIVDYPGLAERM 199
>gi|340628726|ref|YP_004747178.1| hypothetical protein MCAN_37751 [Mycobacterium canettii CIPT
140010059]
gi|340006916|emb|CCC46106.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=199
Score = 402 bits (1033), Expect = 2e-110, Method: Compositional matrix adjust.
Identities = 197/199 (99%), Positives = 199/199 (100%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NAVPSDLTPRVWPA+LTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY
Sbjct 1 MNAVPSDLTPRVWPAILTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS
Sbjct 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT
Sbjct 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVDSDGFIVDYPGLAERM
Sbjct 181 VTVDSDGFIVDYPGLAERM 199
>gi|289445354|ref|ZP_06435098.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289418312|gb|EFD15513.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=199
Score = 400 bits (1028), Expect = 6e-110, Method: Compositional matrix adjust.
Identities = 197/199 (99%), Positives = 198/199 (99%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANN AFGAHY
Sbjct 1 MNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNLAFGAHY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS
Sbjct 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT
Sbjct 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVDSDGFIVDYPGLAERM
Sbjct 181 VTVDSDGFIVDYPGLAERM 199
>gi|308232556|ref|ZP_07416452.2| hypothetical protein TMAG_00245 [Mycobacterium tuberculosis SUMu001]
gi|308369221|ref|ZP_07666692.1| hypothetical protein TMBG_02293 [Mycobacterium tuberculosis SUMu002]
gi|308376218|ref|ZP_07668221.1| hypothetical protein TMHG_02822 [Mycobacterium tuberculosis SUMu008]
11 more sequence titles
Length=184
Score = 375 bits (962), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 184/184 (100%), Positives = 184/184 (100%), Gaps = 0/184 (0%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL
Sbjct 1 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 60
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE 135
TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE
Sbjct 61 TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE 120
Query 136 RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGL 195
RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGL
Sbjct 121 RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGL 180
Query 196 AERM 199
AERM
Sbjct 181 AERM 184
>gi|240168514|ref|ZP_04747173.1| hypothetical protein MkanA1_04327 [Mycobacterium kansasii ATCC
12478]
Length=199
Score = 368 bits (945), Expect = 2e-100, Method: Compositional matrix adjust.
Identities = 175/199 (88%), Positives = 192/199 (97%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+N+ PSD RVW AMLTWRAQD+SRMESVR+Q+SGKRIRANGRIVAAATA NPAFGA+Y
Sbjct 1 MNSAPSDPARRVWSAMLTWRAQDVSRMESVRIQVSGKRIRANGRIVAAATATNPAFGAYY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDE+GATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALD+DLVFS
Sbjct 61 DLQTDESGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDVDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLG+HE+AES+ALP+VYVNVPEM+VDAATVSYTSEGRLD IKLRSPVADT+
Sbjct 121 PFFNALPIRRLGIHEKAESLALPMVYVNVPEMTVDAATVSYTSEGRLDAIKLRSPVADTS 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVD++GFIVDYPGLAER+
Sbjct 181 VTVDAEGFIVDYPGLAERI 199
>gi|183985266|ref|YP_001853557.1| hypothetical protein MMAR_5298 [Mycobacterium marinum M]
gi|183178592|gb|ACC43702.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=199
Score = 360 bits (925), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 174/199 (88%), Positives = 187/199 (94%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+N+V SD R+W AMLTWRAQD+SRMESVR+Q+SGKRIRANGRIVAAAT +NPAFGA Y
Sbjct 1 MNSVSSDPARRLWTAMLTWRAQDVSRMESVRIQVSGKRIRANGRIVAAATTSNPAFGAFY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGE RA YNGALDIDLVFS
Sbjct 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGESRAGYNGALDIDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLG+HERAE I LP+VYVNVPEMSVDAATVSY+SEGRLDGIKLRSPVADTT
Sbjct 121 PFFNALPIRRLGIHERAEMITLPMVYVNVPEMSVDAATVSYSSEGRLDGIKLRSPVADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVD +GFI+DYPGLAER+
Sbjct 181 VTVDDEGFILDYPGLAERI 199
>gi|118619513|ref|YP_907845.1| hypothetical protein MUL_4373 [Mycobacterium ulcerans Agy99]
gi|118571623|gb|ABL06374.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=199
Score = 358 bits (919), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 173/199 (87%), Positives = 186/199 (94%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+N+V SD R+W MLTWRAQD+SRMESVR+Q+SGKRIRANGRIVAAATA+NPAFGA Y
Sbjct 1 MNSVSSDPARRLWTGMLTWRAQDVSRMESVRIQVSGKRIRANGRIVAAATASNPAFGAFY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGE RA YNGALDIDLVFS
Sbjct 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGESRAGYNGALDIDLVFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
FFNALPIRRLG+HERAE I LP+VYVNVPEMSVDAATVSY+SEGRLDGIKLRSPVADTT
Sbjct 121 SFFNALPIRRLGIHERAEMITLPMVYVNVPEMSVDAATVSYSSEGRLDGIKLRSPVADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
VTVD +GFI+DYPGLAER+
Sbjct 181 VTVDDEGFILDYPGLAERI 199
>gi|339296563|gb|AEJ48674.1| hypothetical protein CCDC5079_3485 [Mycobacterium tuberculosis
CCDC5079]
Length=173
Score = 352 bits (903), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 173/173 (100%), Positives = 173/173 (100%), Gaps = 0/173 (0%)
Query 27 MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL 86
MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL
Sbjct 1 MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL 60
Query 87 AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY 146
AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY
Sbjct 61 AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY 120
Query 147 VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM
Sbjct 121 VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 173
>gi|296166873|ref|ZP_06849290.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897750|gb|EFG77339.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=184
Score = 327 bits (839), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 155/184 (85%), Positives = 173/184 (95%), Gaps = 0/184 (0%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
MLTWRAQD+SRMESVR+Q+SGKRIRANGRIVAAATA NPAFGA+YDLQTDETGATKR G+
Sbjct 1 MLTWRAQDVSRMESVRIQVSGKRIRANGRIVAAATATNPAFGAYYDLQTDETGATKRLGM 60
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE 135
TVTLAERER L+IARDEENMWLVTDHQGE RAAYNGALD+D+VFSPFFNALPIRRLGLHE
Sbjct 61 TVTLAERERVLSIARDEENMWLVTDHQGEHRAAYNGALDVDVVFSPFFNALPIRRLGLHE 120
Query 136 RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGL 195
RA+S+ALPVVYV++P+MS+ A V+YT G LDGIKLRSPVADTTV+VD +GFIVDYPGL
Sbjct 121 RADSVALPVVYVHLPDMSITADLVTYTCAGGLDGIKLRSPVADTTVSVDEEGFIVDYPGL 180
Query 196 AERM 199
AER+
Sbjct 181 AERI 184
>gi|254822448|ref|ZP_05227449.1| hypothetical protein MintA_21116 [Mycobacterium intracellulare
ATCC 13950]
Length=199
Score = 322 bits (824), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 154/199 (78%), Positives = 175/199 (88%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NA PSD + RVW AMLTWRAQD+SRMESVR+Q+SGKRI+ANGRIVAAAT NPAFGA+Y
Sbjct 1 MNAAPSDPSRRVWQAMLTWRAQDVSRMESVRLQVSGKRIKANGRIVAAATEANPAFGAYY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DL TDETGATKR G+TVTLAERER + ARDEENMWLVTDHQGE RAAYNGALD+D+ FS
Sbjct 61 DLLTDETGATKRLGMTVTLAERERVFSFARDEENMWLVTDHQGEHRAAYNGALDVDVEFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGLHE+A S+ LPVVYVNVPEMS+ A TVSY+S G IK+ +P+ADTT
Sbjct 121 PFFNALPIRRLGLHEQAASVTLPVVYVNVPEMSIIADTVSYSSAGSRGEIKVHTPIADTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
V+VD +GFIVDYPGLAER+
Sbjct 181 VSVDDEGFIVDYPGLAERI 199
>gi|41406374|ref|NP_959210.1| hypothetical protein MAP0276 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254773331|ref|ZP_05214847.1| hypothetical protein MaviaA2_01416 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41394722|gb|AAS02593.1| hypothetical protein MAP_0276 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=199
Score = 320 bits (820), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 154/199 (78%), Positives = 173/199 (87%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NA PSD + RVW AMLTWRAQD+SRMESVR+Q+SG RI+ANGRIVAAAT NPAFGA+Y
Sbjct 1 MNAAPSDPSRRVWQAMLTWRAQDVSRMESVRLQVSGNRIKANGRIVAAATDANPAFGAYY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DLQTDETGATKR G+TVTLAERER + ARDEENMWLVTD QGE RAAYNGALD+D+ FS
Sbjct 61 DLQTDETGATKRLGMTVTLAERERVFSFARDEENMWLVTDPQGEHRAAYNGALDVDVEFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGL ERA S+ LPVVYVNVPEMS+ A TVSY+S G D IK+ SP++DTT
Sbjct 121 PFFNALPIRRLGLQERAASVTLPVVYVNVPEMSITADTVSYSSTGSRDEIKVHSPISDTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
V+VD GFIVDYPGLAER+
Sbjct 181 VSVDEQGFIVDYPGLAERI 199
>gi|342860000|ref|ZP_08716652.1| hypothetical protein MCOL_14010 [Mycobacterium colombiense CECT
3035]
gi|342132378|gb|EGT85607.1| hypothetical protein MCOL_14010 [Mycobacterium colombiense CECT
3035]
Length=199
Score = 316 bits (809), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 151/199 (76%), Positives = 173/199 (87%), Gaps = 0/199 (0%)
Query 1 VNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHY 60
+NA PS+ + RVW MLTWRAQD RMESVR+Q+SG RI+ANGRI+AAAT +PAFGA+Y
Sbjct 1 MNAAPSEPSRRVWQTMLTWRAQDALRMESVRLQVSGNRIKANGRIIAAATDAHPAFGAYY 60
Query 61 DLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFS 120
DL TDE GATKR G+TVTLAERER + ARDEENMWLVTDHQGE RAAYNGALD+D+ FS
Sbjct 61 DLLTDEAGATKRLGMTVTLAERERVFSFARDEENMWLVTDHQGEHRAAYNGALDVDVEFS 120
Query 121 PFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTT 180
PFFNALPIRRLGL+ERA S+ LPVVYVNVPEMS+ A TVSY+S G LD IKLRSP++DTT
Sbjct 121 PFFNALPIRRLGLYERAASVTLPVVYVNVPEMSITADTVSYSSTGSLDEIKLRSPISDTT 180
Query 181 VTVDSDGFIVDYPGLAERM 199
V+VD +GFIVDYPGLAER+
Sbjct 181 VSVDDEGFIVDYPGLAERI 199
>gi|336459998|gb|EGO38908.1| hypothetical protein MAPs_44750 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=184
Score = 303 bits (776), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 145/184 (79%), Positives = 162/184 (89%), Gaps = 0/184 (0%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
MLTWRAQD+SRMESVR+Q+SG RI+ANGRIVAAAT NPAFGA+YDLQTDETGATKR G+
Sbjct 1 MLTWRAQDVSRMESVRLQVSGNRIKANGRIVAAATDANPAFGAYYDLQTDETGATKRLGM 60
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE 135
TVTLAERER + ARDEENMWLVTD QGE RAAYNGALD+D+ FSPFFNALPIRRLGL E
Sbjct 61 TVTLAERERVFSFARDEENMWLVTDPQGEHRAAYNGALDVDVEFSPFFNALPIRRLGLQE 120
Query 136 RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGL 195
RA S+ LPVVYVNVPEMS+ A TVSY+S G D IK+ SP++DTTV+VD GFIVDYPGL
Sbjct 121 RAASVTLPVVYVNVPEMSITADTVSYSSTGSRDEIKVHSPISDTTVSVDEQGFIVDYPGL 180
Query 196 AERM 199
AER+
Sbjct 181 AERI 184
>gi|108801926|ref|YP_642123.1| hypothetical protein Mmcs_4963 [Mycobacterium sp. MCS]
gi|119871078|ref|YP_941030.1| hypothetical protein Mkms_5051 [Mycobacterium sp. KMS]
gi|126437907|ref|YP_001073598.1| hypothetical protein Mjls_5344 [Mycobacterium sp. JLS]
gi|108772345|gb|ABG11067.1| protein of unknown function DUF1089 [Mycobacterium sp. MCS]
gi|119697167|gb|ABL94240.1| protein of unknown function DUF1089 [Mycobacterium sp. KMS]
gi|126237707|gb|ABO01108.1| protein of unknown function DUF1089 [Mycobacterium sp. JLS]
Length=200
Score = 277 bits (708), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 132/188 (71%), Positives = 158/188 (85%), Gaps = 1/188 (0%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTWRA D++RMESVRVQLSGKRI+A GRIVAAA +PAF A YDL TDE GATKR
Sbjct 13 WPAVLTWRAHDVARMESVRVQLSGKRIKAYGRIVAAACDAHPAFSASYDLVTDEHGATKR 72
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGE-RRAAYNGALDIDLVFSPFFNALPIRRL 131
+TVTLAERERQ++ ARDEENMWLV D Q + +RAA++GALD+D+V SPFFN LPIRR
Sbjct 73 LAMTVTLAERERQVSFARDEENMWLVRDQQNQMKRAAFDGALDVDVVLSPFFNTLPIRRA 132
Query 132 GLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVD 191
GLHE +ESI +PVVYV +PE SV+ A++SY+ DGIK++SPVADTT+TVD+DGFIVD
Sbjct 133 GLHEHSESITVPVVYVRLPEFSVEQASISYSGGPDSDGIKVQSPVADTTITVDADGFIVD 192
Query 192 YPGLAERM 199
YPGLA R+
Sbjct 193 YPGLAARI 200
>gi|118463305|ref|YP_879609.1| hypothetical protein MAV_0322 [Mycobacterium avium 104]
gi|118164592|gb|ABK65489.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=166
Score = 269 bits (687), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 129/166 (78%), Positives = 144/166 (87%), Gaps = 0/166 (0%)
Query 34 LSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEE 93
+SG RI+ANGRIVAAAT NPAFGA+YDLQTDETGATKR G+TVTLAERER + ARDEE
Sbjct 1 MSGNRIKANGRIVAAATDANPAFGAYYDLQTDETGATKRLGMTVTLAERERVFSFARDEE 60
Query 94 NMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMS 153
NMWLVTD QGE RAAYNGALD+D+ FSPFFNALPIRRLGL ERA S+ LPVVYVNVPEMS
Sbjct 61 NMWLVTDPQGEHRAAYNGALDVDVEFSPFFNALPIRRLGLQERAASVTLPVVYVNVPEMS 120
Query 154 VDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
+ A TVSY+S G D IK+ SP++DTTV+VD GFIVDYPGLAER+
Sbjct 121 ITADTVSYSSTGSRDEIKVHSPISDTTVSVDEQGFIVDYPGLAERI 166
>gi|333992676|ref|YP_004525290.1| hypothetical protein JDM601_4035 [Mycobacterium sp. JDM601]
gi|333488643|gb|AEF38035.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=196
Score = 261 bits (668), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 132/190 (70%), Positives = 153/190 (81%), Gaps = 5/190 (2%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTWRA D RMESVRVQLSG RI+ANGRIVA AT +PAF A+YDL TDE+GATKR
Sbjct 9 WPAILTWRAPDAPRMESVRVQLSGNRIKANGRIVAGATDAHPAFSAYYDLATDESGATKR 68
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLG 132
GLTVT+AER+RQL IARDEENMWL+TD +G+ RAAY+GALDID+VFSPFFN LPIRR
Sbjct 69 LGLTVTVAERDRQLVIARDEENMWLITDSRGQSRAAYDGALDIDVVFSPFFNTLPIRRAR 128
Query 133 LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVAD---TTVTVDSDGFI 189
LHERA ++ALP VY+ +PEMSV AA SY S GI + +P D TTVTVD DGF+
Sbjct 129 LHERAAAVALPTVYLWLPEMSVVAAEASYRSTEA--GITVLTPGTDRDGTTVTVDDDGFV 186
Query 190 VDYPGLAERM 199
+DYPGLA R+
Sbjct 187 IDYPGLAARI 196
>gi|118472131|ref|YP_890547.1| hypothetical protein MSMEG_6329 [Mycobacterium smegmatis str.
MC2 155]
gi|118173418|gb|ABK74314.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=171
Score = 255 bits (651), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 120/173 (70%), Positives = 146/173 (85%), Gaps = 2/173 (1%)
Query 27 MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL 86
MESVRVQL GKRI+A GRIVAAAT ++PAF A YDL TDETGATKR LTVTLAERERQL
Sbjct 1 MESVRVQLQGKRIKAYGRIVAAATESHPAFSASYDLVTDETGATKRLSLTVTLAERERQL 60
Query 87 AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY 146
+IARDEE+ WLV DH +++ + GALD+D++FSPFFNALPIRR+GLH R +S++LPV Y
Sbjct 61 SIARDEESQWLVQDHSQTKKSDFGGALDVDVIFSPFFNALPIRRVGLHTRTDSVSLPVAY 120
Query 147 VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
V +PE+SV+ +SY+S DGIKL SPVA+TT+TVDS+GFI+DYPGLAER+
Sbjct 121 VRLPELSVETVNISYSSGA--DGIKLHSPVAETTITVDSEGFILDYPGLAERI 171
>gi|169627370|ref|YP_001701019.1| hypothetical protein MAB_0265 [Mycobacterium abscessus ATCC 19977]
gi|169239337|emb|CAM60365.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=192
Score = 251 bits (640), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 120/186 (65%), Positives = 152/186 (82%), Gaps = 2/186 (1%)
Query 14 PAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRF 73
PA+LTWRA D SRMES RVQLSG+RIRA+GR VA A+ +PAF A YDL TDETG+T R
Sbjct 9 PAVLTWRAHDASRMESTRVQLSGRRIRAHGRFVAGASDAHPAFSASYDLVTDETGSTNRL 68
Query 74 GLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGL 133
L+ T+AERERQL+IARDEE MW V +H+G R+A++GALD+D+VFSPFFNALPIRR GL
Sbjct 69 SLSTTVAERERQLSIARDEEGMWTVQNHEGATRSAFDGALDVDVVFSPFFNALPIRRTGL 128
Query 134 HERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYP 193
+++ S LPVVYV +P+++V AT+SY ++G GIK+ SPVADTTVTVD +GF+++YP
Sbjct 129 YQQEGSAVLPVVYVTLPDLAVSPATISYRNDG--TGIKVVSPVADTTVTVDDEGFLLEYP 186
Query 194 GLAERM 199
GLA R+
Sbjct 187 GLAVRI 192
>gi|120406539|ref|YP_956368.1| hypothetical protein Mvan_5597 [Mycobacterium vanbaalenii PYR-1]
gi|119959357|gb|ABM16362.1| protein of unknown function DUF1089 [Mycobacterium vanbaalenii
PYR-1]
Length=197
Score = 248 bits (633), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 129/189 (69%), Positives = 158/189 (84%), Gaps = 5/189 (2%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
W A+LTWRA D+SRMES RVQ+SG RI+A GRIVAAAT+ +PAF A YDL TDE GATKR
Sbjct 12 WRAVLTWRAHDVSRMESARVQVSGDRIKAYGRIVAAATSAHPAFSASYDLVTDEAGATKR 71
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGE-RRAAYNGALDIDLVFSPFFNALPIRRL 131
LTVTLAERERQL+IARDEENMWLV +H G+ R+AY+GALD+D++FSPFFNALPIRR
Sbjct 72 LSLTVTLAERERQLSIARDEENMWLVQEHSGQTSRSAYDGALDVDVIFSPFFNALPIRRT 131
Query 132 GLHERAESIALPVVYVNVPEMSVDAATVSYTS-EGRLDGIKLRSPVADTTVTVDSDGFIV 190
G+++ S+ +PVVYV VP+++VD T+SY + +G GI+L SPVA+T VTVDSDGFI+
Sbjct 132 GVYKDGGSVTVPVVYVRVPDLAVDVETISYAAVDG---GIRLHSPVAETVVTVDSDGFIL 188
Query 191 DYPGLAERM 199
DYPGLAER+
Sbjct 189 DYPGLAERI 197
>gi|145221803|ref|YP_001132481.1| hypothetical protein Mflv_1211 [Mycobacterium gilvum PYR-GCK]
gi|315446460|ref|YP_004079339.1| hypothetical protein Mspyr1_49710 [Mycobacterium sp. Spyr1]
gi|145214289|gb|ABP43693.1| protein of unknown function DUF1089 [Mycobacterium gilvum PYR-GCK]
gi|315264763|gb|ADU01505.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=198
Score = 242 bits (618), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/189 (69%), Positives = 155/189 (83%), Gaps = 4/189 (2%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTWRA D RMESVRVQLSGKR++A GR+VAAAT+ +PAF A YDL TDE GATKR
Sbjct 12 WPAVLTWRAHDEPRMESVRVQLSGKRVKAYGRVVAAATSAHPAFSASYDLVTDELGATKR 71
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGE-RRAAYNGALDIDLVFSPFFNALPIRRL 131
LTVTLAERERQL+IARDEENMWLV +H G+ R+A++GALD+D+VFSPFFNALPIRRL
Sbjct 72 LSLTVTLAERERQLSIARDEENMWLVQEHSGQTSRSAFDGALDVDMVFSPFFNALPIRRL 131
Query 132 G-LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIV 190
G L ES+ +PVVYV V ++SV ++SY + GI L SPVADT++TVD+DGFI+
Sbjct 132 GVLPGSGESVTVPVVYVRVHDLSVVVESISYAATD--SGISLTSPVADTSITVDADGFIL 189
Query 191 DYPGLAERM 199
DYPGLAER+
Sbjct 190 DYPGLAERI 198
>gi|226303816|ref|YP_002763774.1| hypothetical protein RER_03270 [Rhodococcus erythropolis PR4]
gi|226182931|dbj|BAH31035.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=195
Score = 216 bits (549), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 98/187 (53%), Positives = 139/187 (75%), Gaps = 2/187 (1%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTW+A + RMESVRVQL+G RI+A+GRI+ A + +PAF A YDL TDE G T+R
Sbjct 11 WPAVLTWQAHNAPRMESVRVQLNGNRIKASGRIIGGACSEHPAFSASYDLVTDEAGITRR 70
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLG 132
+ LA ERQ++I+RDEE W+V + +R+A++GALD+D++ SPFFN LPIRR+G
Sbjct 71 LSVRTALAAGERQMSISRDEEGTWMVENGASHQRSAFDGALDVDVILSPFFNTLPIRRVG 130
Query 133 LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDY 192
L E +PVVYVN+ ++ V+ AT++Y+S DGI + SPV+ +T+ VD++GF++DY
Sbjct 131 LQNDIEDAEVPVVYVNLLDLRVEGATITYSSG--TDGISVLSPVSSSTLAVDTEGFVLDY 188
Query 193 PGLAERM 199
PGLA R+
Sbjct 189 PGLATRI 195
>gi|226363493|ref|YP_002781275.1| hypothetical protein ROP_40830 [Rhodococcus opacus B4]
gi|226241982|dbj|BAH52330.1| hypothetical protein [Rhodococcus opacus B4]
Length=195
Score = 204 bits (520), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 104/187 (56%), Positives = 139/187 (75%), Gaps = 2/187 (1%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTWRA + RMESVRVQL+G RI+A GRI+ +PAF A YDL TDE+G T+R
Sbjct 11 WPAVLTWRADNAPRMESVRVQLNGDRIKAAGRIIGGECPEHPAFSASYDLVTDESGVTRR 70
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLG 132
L ++A ERQ++I+RDEE W+V +R+ ++GALD+D+V SPFFNALPIRR G
Sbjct 71 LSLRTSVAAGERQMSISRDEEGTWMVEHGANHQRSTFDGALDVDMVLSPFFNALPIRRYG 130
Query 133 LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDY 192
LH +E I +PVVYVN+ ++ V+ A ++Y+S DGI + SPV+ ++VTVD DGFI+DY
Sbjct 131 LHLGSEDIEVPVVYVNLLDLRVEGAILTYSSGP--DGIHVLSPVSSSSVTVDRDGFIIDY 188
Query 193 PGLAERM 199
PGLAER+
Sbjct 189 PGLAERI 195
>gi|111021131|ref|YP_704103.1| hypothetical protein RHA1_ro04151 [Rhodococcus jostii RHA1]
gi|110820661|gb|ABG95945.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=195
Score = 203 bits (517), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/187 (56%), Positives = 139/187 (75%), Gaps = 2/187 (1%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPA+LTWRA + RMESVRVQL+G RI+A GRI+ +PAF A YDL TDE+G T+R
Sbjct 11 WPAVLTWRADNAPRMESVRVQLNGDRIKAAGRIIGGECPEHPAFSASYDLVTDESGITRR 70
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLG 132
L ++A ERQ++I+RDEE W+V +R+ ++GALD+D+V SPFFNALPIRR G
Sbjct 71 LSLRTSVAAGERQMSISRDEEGTWMVEHGANHQRSTFDGALDVDMVLSPFFNALPIRRYG 130
Query 133 LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDY 192
LH +E + +PVVYVN+ ++ V+ A ++Y+S DGI + SPV+ ++VTVD DGFI+DY
Sbjct 131 LHLGSEDVEVPVVYVNLLDLRVEGAILTYSSGP--DGIHVLSPVSSSSVTVDRDGFIIDY 188
Query 193 PGLAERM 199
PGLAER+
Sbjct 189 PGLAERI 195
>gi|312137788|ref|YP_004005124.1| hypothetical protein REQ_02940 [Rhodococcus equi 103S]
gi|325676111|ref|ZP_08155793.1| hypothetical protein HMPREF0724_13576 [Rhodococcus equi ATCC
33707]
gi|311887127|emb|CBH46436.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325553151|gb|EGD22831.1| hypothetical protein HMPREF0724_13576 [Rhodococcus equi ATCC
33707]
Length=171
Score = 198 bits (503), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 92/173 (54%), Positives = 124/173 (72%), Gaps = 2/173 (1%)
Query 27 MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL 86
MESVRVQ+SG RI+A GRIV A +PAF A YDL TDE G T+R L LA ER +
Sbjct 1 MESVRVQMSGNRIKATGRIVGGACPEHPAFSASYDLVTDENGVTRRLSLHTALAAGERHM 60
Query 87 AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY 146
+I+RDEE +W+V R+ + GA D+D+V SPFFN LPIRR GL +E I +PVVY
Sbjct 61 SISRDEEGVWMVETGTTHLRSGFAGAKDVDVVLSPFFNTLPIRRFGLQHESEDIQVPVVY 120
Query 147 VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
VN+P+++V A+++Y+S DGI + SPV+ +++ VD+DGF++DYPGLAER+
Sbjct 121 VNLPDLAVQEASLTYSSGA--DGIHVLSPVSSSSIKVDADGFVLDYPGLAERI 171
>gi|54022199|ref|YP_116441.1| hypothetical protein nfa2350 [Nocardia farcinica IFM 10152]
gi|54013707|dbj|BAD55077.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=189
Score = 197 bits (502), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 105/192 (55%), Positives = 133/192 (70%), Gaps = 3/192 (1%)
Query 8 LTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDET 67
+ PR WPA+LTWRA + SRMESVRV L+G RIRA GR++ +PAF A YDL TDE
Sbjct 1 MAPR-WPAILTWRAHNASRMESVRVTLNGNRIRAAGRMIGGDCDEHPAFSASYDLVTDEN 59
Query 68 GATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALP 127
G TKR L T A ER +IARDEE+ WLV R+ + GALD+D+V SPFFN LP
Sbjct 60 GVTKRLSLRTTTAAGERHASIARDEEDYWLVDAGNSHVRSTFGGALDVDVVLSPFFNTLP 119
Query 128 IRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDG 187
IRR GL + + +PVVYV +P++ V A+++Y+S DGI + SPV+ TVTVD DG
Sbjct 120 IRRFGLQHAVDEVVVPVVYVRLPDLLVQEASLTYSSGA--DGISVLSPVSSATVTVDPDG 177
Query 188 FIVDYPGLAERM 199
F++DYPGLAER+
Sbjct 178 FLLDYPGLAERI 189
>gi|333917869|ref|YP_004491450.1| hypothetical protein AS9A_0190 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480090|gb|AEF38650.1| hypothetical protein AS9A_0190 [Amycolicicoccus subflavus DQS3-9A1]
Length=171
Score = 195 bits (496), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 93/173 (54%), Positives = 125/173 (73%), Gaps = 2/173 (1%)
Query 27 MESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQL 86
MESVR+QLSGK+I+A GRI+ A A +PAF A YDL TDE GAT+R L T+A ER L
Sbjct 1 MESVRIQLSGKKIKAAGRIIGADCAEHPAFSASYDLITDEFGATRRLSLRATVARGERVL 60
Query 87 AIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVY 146
+++RD E W V + R+ + GALD+D++ SPFFNALPIRRL LH A + +PVVY
Sbjct 61 SVSRDTEGYWTVHEGNTSTRSRFGGALDVDVILSPFFNALPIRRLDLHTEANDVQVPVVY 120
Query 147 VNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
V++P+++V T++Y+S +GI + SPVA TVTVDSDGF++DYP L +R+
Sbjct 121 VSLPDLTVREETLTYSSHA--EGIHVFSPVASATVTVDSDGFLIDYPALGQRI 171
>gi|296141718|ref|YP_003648961.1| hypothetical protein Tpau_4051 [Tsukamurella paurometabola DSM
20162]
gi|296029852|gb|ADG80622.1| protein of unknown function DUF1089 [Tsukamurella paurometabola
DSM 20162]
Length=192
Score = 192 bits (489), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/192 (50%), Positives = 127/192 (67%), Gaps = 2/192 (1%)
Query 8 LTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDET 67
+T WP MLTWR+ D + +ESVRVQ++G RI+A GRI+AA +A+ PAF A YDL TD+
Sbjct 2 ITGGSWPRMLTWRSDDGNLLESVRVQVTGDRIKAYGRIIAAPSADGPAFNASYDLVTDDE 61
Query 68 GATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALP 127
G TKR + A E Q+ IARD E+ WLV QG R ++GAL +D++ S FFNAL
Sbjct 62 GVTKRLSVHALTASGEAQVTIARDGESHWLVQGAQGAERGFFSGALSVDVLRSAFFNALT 121
Query 128 IRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDG 187
IRR L E + +PVVYV +P + V +SY + DGI + SPV+ + ++VD DG
Sbjct 122 IRRYNLQSHVEDVDVPVVYVELPTLQVKETVISYAAAA--DGITVISPVSSSKLSVDEDG 179
Query 188 FIVDYPGLAERM 199
F+VDYPGLA R+
Sbjct 180 FVVDYPGLARRV 191
>gi|229492484|ref|ZP_04386287.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229320470|gb|EEN86288.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=166
Score = 186 bits (472), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 84/168 (50%), Positives = 124/168 (74%), Gaps = 2/168 (1%)
Query 32 VQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARD 91
+QL+G RI+A+GRI+ A + +PAF A YDL TDE G T+R + LA ERQ++I+RD
Sbjct 1 MQLNGNRIKASGRIIGGACSEHPAFSASYDLVTDEAGITRRLSVRTALAAGERQMSISRD 60
Query 92 EENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPE 151
EE W+V + +R+A++GALD+D++ SPFFN LPIRR+GL E +PVVYVN+ +
Sbjct 61 EEGTWMVENGASHQRSAFDGALDVDVILSPFFNTLPIRRVGLQNDIEDAEVPVVYVNLLD 120
Query 152 MSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
+ V+ AT++Y+S DGI + SPV+ +T++VD++GF++DYPGLA R+
Sbjct 121 LRVEGATITYSSGA--DGISVLSPVSSSTLSVDTEGFVLDYPGLATRI 166
>gi|262200334|ref|YP_003271542.1| hypothetical protein Gbro_0307 [Gordonia bronchialis DSM 43247]
gi|262083681|gb|ACY19649.1| protein of unknown function DUF1089 [Gordonia bronchialis DSM
43247]
Length=184
Score = 184 bits (468), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 94/186 (51%), Positives = 130/186 (70%), Gaps = 4/186 (2%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
MLTWR + R+E VR+ ++G RI+A GRI+AAAT ++ AF A Y+L T+++G TKR +
Sbjct 1 MLTWRGEGTDRLEQVRLHVNGTRIKAYGRIIAAATDDHEAFSASYELVTNDSGVTKRLSI 60
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGER-RAAYNGALDIDLVFSPFFNALPIRRLGLH 134
+ A E Q AI RDEE WLV GE R+ +NGA+D+DL SP FNALPIRRLGL
Sbjct 61 HLVRASGETQFAINRDEEQHWLVHAPGGESIRSDFNGAMDVDLALSPMFNALPIRRLGLA 120
Query 135 E-RAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYP 193
E+ +PVVYV +P+ V+ T++YT + DG+ + SPVA++T+T+D +GF+VDYP
Sbjct 121 AGSGEATEVPVVYVYLPQGVVEPGTLTYTP--KPDGLGVVSPVANSTLTIDDNGFVVDYP 178
Query 194 GLAERM 199
GLA R+
Sbjct 179 GLATRV 184
>gi|343926155|ref|ZP_08765664.1| hypothetical protein GOALK_056_00230 [Gordonia alkanivorans NBRC
16433]
gi|343763784|dbj|GAA12590.1| hypothetical protein GOALK_056_00230 [Gordonia alkanivorans NBRC
16433]
Length=216
Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/188 (48%), Positives = 124/188 (66%), Gaps = 3/188 (1%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
+ +LTWR +D R+E VR+ +SG R++A GRI+AA T ++ AF A Y+LQT E+G TKR
Sbjct 31 FKTVLTWRGEDTDRLEQVRLVVSGTRMKAYGRIIAAKTDDHEAFSASYELQTTESGVTKR 90
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGE-RRAAYNGALDIDLVFSPFFNALPIRRL 131
+ + E Q I RD E WL+ GE R+ ++GA D+DL SP FNALP+RR
Sbjct 91 LTVHLICEAGETQFGITRDNEGTWLIRRPDGEIIRSDFDGAEDVDLALSPMFNALPLRRK 150
Query 132 GLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVD 191
L E ++ +PVVY+ +P V AAT+SYT+ + GI L SP+A TT+T+D +GF+ D
Sbjct 151 ALTEADGAVDVPVVYMYLPSGEVKAATMSYTATAK--GIDLVSPLATTTLTLDDNGFVTD 208
Query 192 YPGLAERM 199
YPGLA R+
Sbjct 209 YPGLARRV 216
>gi|296394997|ref|YP_003659881.1| hypothetical protein Srot_2615 [Segniliparus rotundus DSM 44985]
gi|296182144|gb|ADG99050.1| protein of unknown function DUF1089 [Segniliparus rotundus DSM
44985]
Length=197
Score = 175 bits (443), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 87/187 (47%), Positives = 133/187 (72%), Gaps = 2/187 (1%)
Query 13 WPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKR 72
WPAMLTW++ D +R+ESVRV LSG R+RA GRI+AAA ++ AF A YDL T++ G T+R
Sbjct 13 WPAMLTWQSSDATRLESVRVNLSGSRVRAYGRIIAAANEDHEAFSASYDLVTNDEGITER 72
Query 73 FGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLG 132
++V A ++Q++I RD + WLV ++ +NGALD+D+ FS FFN L +RR G
Sbjct 73 LSVSVLRASGDQQVSITRDAQGGWLVQTISSAVKSGFNGALDVDMQFSAFFNTLLLRRAG 132
Query 133 LHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDY 192
LH+ + + +PV+Y+ VPE+ + T+ Y ++ +G+++ SPV+++ V +DSDGFI+DY
Sbjct 133 LHQGPQDLDVPVMYLRVPELELSEVTLQYRADP--NGVRVVSPVSESVVVIDSDGFILDY 190
Query 193 PGLAERM 199
PGL+ R+
Sbjct 191 PGLSRRV 197
>gi|317508898|ref|ZP_07966535.1| hypothetical protein HMPREF9336_02907 [Segniliparus rugosus ATCC
BAA-974]
gi|316252782|gb|EFV12215.1| hypothetical protein HMPREF9336_02907 [Segniliparus rugosus ATCC
BAA-974]
Length=197
Score = 172 bits (436), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 91/190 (48%), Positives = 131/190 (69%), Gaps = 2/190 (1%)
Query 10 PRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGA 69
P+ WP MLTW +++ES RV L+G RIRA GRI++AATA++ AF A YDL TDE G
Sbjct 10 PKSWPTMLTWNGHTATQLESARVNLAGNRIRAYGRIISAATADHEAFSASYDLVTDEEGG 69
Query 70 TKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIR 129
+R ++V A +RQ+++ RD E WLV ++ +NGALD+D+ S FFN L +R
Sbjct 70 AQRLSVSVLRAGGDRQVSVTRDTEGNWLVHTLGSVVKSGFNGALDVDMERSSFFNTLLLR 129
Query 130 RLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFI 189
RLGLH +A I +PV+Y+ +PE+ V T+ Y ++ GI++ SPV+++ +T+DSDGFI
Sbjct 130 RLGLHLQARDIDVPVMYLRLPELEVSEVTLQYRADPV--GIRVVSPVSESVITIDSDGFI 187
Query 190 VDYPGLAERM 199
+DYPGLA R+
Sbjct 188 LDYPGLARRV 197
>gi|326385122|ref|ZP_08206791.1| hypothetical protein SCNU_19350 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196155|gb|EGD53360.1| hypothetical protein SCNU_19350 [Gordonia neofelifaecis NRRL
B-59395]
Length=198
Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/196 (41%), Positives = 112/196 (58%), Gaps = 6/196 (3%)
Query 2 NAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYD 61
+A P+D P V AMLTWR D R+E VR+ LSG R+RA GRIVAAAT A+ A Y+
Sbjct 5 DASPAD--PGV-KAMLTWRGVDGDRLEQVRLNLSGSRVRAYGRIVAAATETTEAYSASYE 61
Query 62 LQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSP 121
L T+++G T+R + + A E + I+RD + W+V R+ ++GA IDL SP
Sbjct 62 LVTNDSGVTRRLSVRLLRAGGESSIDISRDMDGRWMVQTSTSTVRSDFDGAEVIDLELSP 121
Query 122 FFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDG-IKLRSPVADTT 180
FF LP+RR G+ E +PVV + +P+ +D+ SY G DG + + P
Sbjct 122 FFKGLPVRRFGIAEGVRRDDIPVVTLRLPDCEIDSVPKSYV--GLADGRVTVIGPNGSRE 179
Query 181 VTVDSDGFIVDYPGLA 196
+ VD G + DY G+A
Sbjct 180 LEVDDAGIVRDYGGIA 195
>gi|256374320|ref|YP_003097980.1| hypothetical protein Amir_0163 [Actinosynnema mirum DSM 43827]
gi|255918623|gb|ACU34134.1| protein of unknown function DUF1089 [Actinosynnema mirum DSM
43827]
Length=207
Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/188 (39%), Positives = 114/188 (61%), Gaps = 7/188 (3%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
M+TW+ + R+E VRV +S ++RA+GRIVA+ A F ++L E GA R L
Sbjct 23 MVTWQGCSVPRLEQVRVLVSEHKLRASGRIVASGPAEQ--FNGSFELSVGEDGAVTRLLL 80
Query 76 -TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLH 134
T T+AE ER ++++R + +W+V G RA ++GA+D+D+ F+ F A+P+RRLGLH
Sbjct 81 RTATVAE-ERHVSLSRSSDGVWMVDRGHGGERADFDGAVDVDVEFAVLFAAIPVRRLGLH 139
Query 135 ERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDG---IKLRSPVADTTVTVDSDGFIVD 191
LPVV V++P++ V Y + DG + +R+ + VTVD++G +VD
Sbjct 140 REPGEAELPVVRVSLPDLDVTVVRRGYRTASVGDGGSVVAIRAEDGEQDVTVDAEGLVVD 199
Query 192 YPGLAERM 199
YP +A+R+
Sbjct 200 YPDVAQRI 207
>gi|331694217|ref|YP_004330456.1| hypothetical protein Psed_0330 [Pseudonocardia dioxanivorans
CB1190]
gi|326948906|gb|AEA22603.1| protein of unknown function DUF1089 [Pseudonocardia dioxanivorans
CB1190]
Length=197
Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/190 (39%), Positives = 105/190 (56%), Gaps = 7/190 (3%)
Query 15 AMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFG 74
M+TW+A+D +E RV + RA GR+V T + A Y L +E + +R
Sbjct 3 GMVTWQAEDEVGLEGARVLIGPTGFRALGRVVR--TGPHGELTASYRLTLNEDHSVERLS 60
Query 75 LTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLH 134
+T AERER L + R E+ WL+ D G R+ Y+GA+D+DL SP FN LPIRRLGLH
Sbjct 61 VTAATAERERHLTMNRTEDGFWLLDDGSGSTRSDYDGAIDVDLERSPLFNTLPIRRLGLH 120
Query 135 ERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDG-----IKLRSPVADTTVTVDSDGFI 189
+PV++V++P +SV+ Y + +DG + S +TVD+DG +
Sbjct 121 TEHGDHVIPVLFVSLPTLSVELVDQHYRTVSVVDGDAPAVVNFSSGEFSADLTVDADGIV 180
Query 190 VDYPGLAERM 199
YPGLA R+
Sbjct 181 DHYPGLARRV 190
>gi|325001737|ref|ZP_08122849.1| hypothetical protein PseP1_23386 [Pseudonocardia sp. P1]
Length=196
Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 111/195 (57%), Gaps = 15/195 (7%)
Query 16 MLTWR----AQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATK 71
ML+WR A + +ES R+ ++G RA GR++ A Y L GA
Sbjct 4 MLSWRSGPEAGAATGLESARITVAGGGFRAVGRMIRGTPEG--VLTASYRLVVAADGALS 61
Query 72 RFGLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRL 131
R + V AE E+QL I+R + +WLV D G R A++GA D+DL FSP FNALPIRRL
Sbjct 62 RLAVDVATAEGEQQLTISRSTDGVWLVDDGSGGTRGAFSGARDVDLAFSPVFNALPIRRL 121
Query 132 GLHERAESIALPVVYVNVPEMSVDAATVSY------TSEG-RLDGIKLRSPVADTTVTVD 184
GLH LP+V+V++P ++V+A +Y ++ G + G AD +TVD
Sbjct 122 GLHRDPAEHVLPMVFVDLPTLAVEATEQTYRTVRSASAAGPAVVGFAAGDVAAD--MTVD 179
Query 185 SDGFIVDYPGLAERM 199
+DGF++DYPG+A R+
Sbjct 180 ADGFVLDYPGIATRV 194
>gi|319948571|ref|ZP_08022700.1| hypothetical protein ES5_04316 [Dietzia cinnamea P4]
gi|319437770|gb|EFV92761.1| hypothetical protein ES5_04316 [Dietzia cinnamea P4]
Length=183
Score = 101 bits (252), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/184 (39%), Positives = 103/184 (56%), Gaps = 3/184 (1%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
M TW ++ +E VRV G RA GRIV+AA + AF YD + A +R GL
Sbjct 1 MYTWISETGRVIEQVRVVPRGDSARARGRIVSAAHPEHVAFTVEYDAEIGSDRALRRVGL 60
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGER-RAAYNGALDIDLVFSPFFNALPIRRLGLH 134
TV+ E ER + +A D+E WL+ D G R R +G +D+D+ +S FF ++ IRRLGLH
Sbjct 61 TVSTEEYERSIDLACDDEGAWLLDDPSGTRSRVGGDGVVDVDVTYSVFFASVMIRRLGLH 120
Query 135 ERAESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPG 194
+ S V+ V+ + V TV+ +S+ + + + A T+ TVD+ G I+D PG
Sbjct 121 AQPGSAEERVLSVDSMTLDVTEDTVTLSSDD--EQVHGFTATASTSATVDAGGMIIDVPG 178
Query 195 LAER 198
L+ R
Sbjct 179 LSRR 182
>gi|258650775|ref|YP_003199931.1| hypothetical protein Namu_0524 [Nakamurella multipartita DSM
44233]
gi|258554000|gb|ACV76942.1| protein of unknown function DUF1089 [Nakamurella multipartita
DSM 44233]
Length=194
Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/187 (34%), Positives = 97/187 (52%), Gaps = 6/187 (3%)
Query 16 MLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGL 75
W + D R+E+VRV ++ + +RA+G +V +FGA Y + D G T+R L
Sbjct 10 FFAWSSDDGRRLETVRVVITERGLRASGYLV---RVGRNSFGASYSVLCDAAGRTRRVTL 66
Query 76 TVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHE 135
A ER L++ R WL + + ALD+D V S F N++ IRRLGLH+
Sbjct 67 HSDSAITERGLSLTRTPGGPWLDGAGKSPPMPDLDLALDVDFVASVFSNSMAIRRLGLHQ 126
Query 136 RAESIALPVVYVNVPEMSVDAATVSY-TSEGRLDGIKLR--SPVADTTVTVDSDGFIVDY 192
R ++ V VN P+++V+ Y T E G +LR P ++VDS+GF++D
Sbjct 127 RLGQESVVVAEVNFPDLTVEPVVHHYRTIELTEHGARLRHQGPNGHHELSVDSEGFVLDV 186
Query 193 PGLAERM 199
L+ R+
Sbjct 187 AHLSYRL 193
>gi|163758824|ref|ZP_02165911.1| hypothetical protein HPDFL43_15412 [Hoeflea phototrophica DFL-43]
gi|162284114|gb|EDQ34398.1| hypothetical protein HPDFL43_15412 [Hoeflea phototrophica DFL-43]
Length=188
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/185 (32%), Positives = 85/185 (46%), Gaps = 13/185 (7%)
Query 19 WRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVT 78
WR + +E + ++ + IRA +V + FG HY + D + F + T
Sbjct 12 WRPVEGEGLEHLTLRQTANSIRAESVVVGSEAGET--FGIHYQIDCDAGWHVRAFAIQST 69
Query 79 LAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAE 138
+R L + D W + D G + ++G LDID +PF N LPIRR+
Sbjct 70 SGDR---LEMQSDGGGRWKLGD--GTPQPQFDGCLDIDFTGTPFSNTLPIRRIDPAPADG 124
Query 139 SIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVAD----TTVTVDSDGFIVDYPG 194
+I L V+YV+ + A T YT R G K R D + VDSDGF+ DYP
Sbjct 125 NIRLRVLYVSFASLRPLADTQIYTCIDR--GRKYRYQAEDRPFVAELPVDSDGFVTDYPD 182
Query 195 LAERM 199
L ER+
Sbjct 183 LFERI 187
>gi|153008519|ref|YP_001369734.1| hypothetical protein Oant_1188 [Ochrobactrum anthropi ATCC 49188]
gi|151560407|gb|ABS13905.1| protein of unknown function DUF1089 [Ochrobactrum anthropi ATCC
49188]
Length=189
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/192 (31%), Positives = 87/192 (46%), Gaps = 17/192 (8%)
Query 14 PAMLTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRF 73
P + WR + +E + + +G+ IRA +V +NP +G Y + + F
Sbjct 7 PTVARWRPLEGEGLEHLNIGPAGRTIRAES-VVIGDRGDNP-YGVRYSIDCNSVWHVLHF 64
Query 74 GLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGL 133
+ T R L + D + W + G+ ++G +DIDL +PF N LPIRRLGL
Sbjct 65 LIETTAGHR---LELVSDGDGRW--STMAGDALPEFDGCIDIDLAGTPFTNTLPIRRLGL 119
Query 134 HERAESIALPVVYVNVPEMSVDAATVSYTS--EGRLDGIKLRSPVADTTVT----VDSDG 187
+ ++ L ++YV YT EGR + R AD T T VD DG
Sbjct 120 TPESGTVQLDMLYVPFDSFRPLRDQQRYTCLEEGR----RYRYEAADRTFTAELPVDEDG 175
Query 188 FIVDYPGLAERM 199
+ DYP L R+
Sbjct 176 LVTDYPTLFRRL 187
>gi|330820254|ref|YP_004349116.1| hypothetical protein bgla_2g11560 [Burkholderia gladioli BSR3]
gi|327372249|gb|AEA63604.1| hypothetical protein bgla_2g11560 [Burkholderia gladioli BSR3]
Length=184
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/186 (32%), Positives = 87/186 (47%), Gaps = 15/186 (8%)
Query 19 WRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVT 78
W + D +E + + G A I+ + +G Y L D T R L+V
Sbjct 6 WASLDSDGIEHLTLSRDGDGYLAESVIIGRHD-DGRRYGLAYRLACDGHWRTTRATLSVM 64
Query 79 LAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAE 138
L++ RD E W D +G A G +D+D+ +P+ N LPIRRLGL R E
Sbjct 65 GGA---TLSLLRDREGRW--QDGEGRPLPALEGCVDLDIAATPYTNTLPIRRLGLR-RDE 118
Query 139 SIALPVVYVNVPEMSVDAATVSYTS-----EGRLDGIKLRSPVADTTVTVDSDGFIVDYP 193
A+ V YV+VP+++V AT +Y R +GI R ++VD DG +++Y
Sbjct 119 RRAIEVAYVSVPDLAVSRATQAYVCIEPGRRYRYEGIDGRFTAG---LSVDDDGLVLEYD 175
Query 194 GLAERM 199
L R+
Sbjct 176 TLFRRL 181
>gi|295699345|ref|YP_003607238.1| hypothetical protein BC1002_3705 [Burkholderia sp. CCGE1002]
gi|295438558|gb|ADG17727.1| protein of unknown function DUF1089 [Burkholderia sp. CCGE1002]
Length=184
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/159 (34%), Positives = 78/159 (50%), Gaps = 14/159 (8%)
Query 46 VAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGER 105
V A+G HY ++ D T+ L + +L + D E W D G
Sbjct 31 VVVGQRYGKAYGLHYAVRCDARWRTRYAHLKIVGGG---ELELHGDGEGHW--HDGHGLA 85
Query 106 RAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYT--- 162
+A G +DID+ +P+ N LPIRRL L E E + V Y++ P++ V A +YT
Sbjct 86 LSAIEGCIDIDIAATPYTNTLPIRRLQLAE-GERQPIEVAYISTPDLQVTRAEQAYTCIE 144
Query 163 --SEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
+E R +GI R A+ + VDSDG ++DYP L R+
Sbjct 145 LNAEYRYEGI-FREFTAN--LRVDSDGLVIDYPTLFARL 180
>gi|15966517|ref|NP_386870.1| hypothetical protein SMc03980 [Sinorhizobium meliloti 1021]
gi|334317522|ref|YP_004550141.1| hypothetical protein Sinme_2820 [Sinorhizobium meliloti AK83]
gi|15075788|emb|CAC47343.1| Conserved hypothetical protein [Sinorhizobium meliloti 1021]
gi|333812824|gb|AEG05493.1| protein of unknown function DUF1089 [Sinorhizobium meliloti BL225C]
gi|334096516|gb|AEG54527.1| protein of unknown function DUF1089 [Sinorhizobium meliloti AK83]
gi|336034241|gb|AEH80173.1| hypothetical protein SM11_chr2928 [Sinorhizobium meliloti SM11]
Length=196
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/170 (33%), Positives = 81/170 (48%), Gaps = 17/170 (10%)
Query 36 GKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENM 95
G IRA ++ A A+GA Y + D F + T +R L + D
Sbjct 33 GAAIRAESVLIGERGAT--AYGARYRIDCDAGWRVFSFLIETTQGQR---LHLMSDGHGH 87
Query 96 WLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYV--NVPEMS 153
W D G ++G +DIDL +PF N LPIRRLGL ++ + L ++YV + E +
Sbjct 88 WRKAD--GTALPQFDGCVDIDLAGTPFTNTLPIRRLGLTRQSGTARLNMLYVPFDSFEPT 145
Query 154 VDAATVSYTSEGRLDGIKLRSPVADTTVT----VDSDGFIVDYPGLAERM 199
VD + +G+L R AD + T VD DG ++DYP L +R+
Sbjct 146 VDGQHYTCLDDGKL----YRYEAADGSFTADLPVDEDGLVLDYPTLFQRL 191
>gi|51892410|ref|YP_075101.1| hypothetical protein STH1272 [Symbiobacterium thermophilum IAM
14863]
gi|51856099|dbj|BAD40257.1| conserved hypothetical protein [Symbiobacterium thermophilum
IAM 14863]
Length=180
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/187 (31%), Positives = 82/187 (44%), Gaps = 16/187 (8%)
Query 17 LTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLT 76
L W A D + E + +Q G I A+ V + Y L+ D R +T
Sbjct 5 LRWAAVDSTETEELVLQTEGGGIVADA--VVSGVQGGDGTDVTYHLELDPRWQVLRLSVT 62
Query 77 VTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHER 136
E +R++ + RD W D G G D+D+ +PF N LPIRRL L E
Sbjct 63 ----EGDREVDLTRDSRGTW--RDAGGAVLPELQGCADVDISVTPFTNTLPIRRLQLAE- 115
Query 137 AESIALPVVYVNVPEMSVDAATVSYTSEGRLDGIKLRSPVAD----TTVTVDSDGFIVDY 192
ES + V YV VP + + YT+ L G + R D ++VD GF+++Y
Sbjct 116 GESAEIRVAYVQVPGLVLRPVRQRYTN---LGGGRYRYEALDGGYTAVLSVDESGFVLEY 172
Query 193 PGLAERM 199
PG R+
Sbjct 173 PGRFRRL 179
>gi|307727694|ref|YP_003910907.1| hypothetical protein BC1003_5702 [Burkholderia sp. CCGE1003]
gi|307588219|gb|ADN61616.1| protein of unknown function DUF1089 [Burkholderia sp. CCGE1003]
Length=184
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (33%), Positives = 77/159 (49%), Gaps = 14/159 (8%)
Query 46 VAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGER 105
V ++G HY ++ D T+ L + A +L + D E W D G
Sbjct 31 VVVGQRYGKSYGLHYAVRCDTQWRTRYAWLKIVGAG---ELELHGDGEGHW--RDGHGLL 85
Query 106 RAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTS-- 163
+A G +DID+ +PF N LPIRRL L ++ E L V Y++ P++ V +Y+
Sbjct 86 LSAIEGCIDIDIAATPFTNTLPIRRLQL-QQGERRPLQVAYISTPDLQVTRVEQAYSCVV 144
Query 164 ---EGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
E R +GI R A+ + VD DG ++DYP L R+
Sbjct 145 PNREYRYEGI-FRDFTAN--MKVDEDGLVIDYPTLFTRL 180
>gi|323529891|ref|YP_004232043.1| hypothetical protein BC1001_5609 [Burkholderia sp. CCGE1001]
gi|323386893|gb|ADX58983.1| protein of unknown function DUF1089 [Burkholderia sp. CCGE1001]
Length=184
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/191 (30%), Positives = 93/191 (49%), Gaps = 22/191 (11%)
Query 17 LTWRAQDISRMESVRVQLSGKRIRANGRIVAAATANN---PAFGAHYDLQTDETGATKRF 73
L W +Q+ +E + + +R +G V + ++G HY+++ D T+
Sbjct 4 LRWASQEGDGIEHLVFE-----VREDGFQVESVVVGQRYGKSYGLHYEVRCDTQWRTRYA 58
Query 74 GLTVTLAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGL 133
L + A +L + D + W D G +A G +DID+ +PF N LPIRRL L
Sbjct 59 RLKIVGAG---ELELHGDGDGHW--RDGHGLMLSAIEGCIDIDIAATPFTNTLPIRRLQL 113
Query 134 HERAESIALPVVYVNVPEMSVDAATVSYTS-----EGRLDGIKLRSPVADTTVTVDSDGF 188
+ E +L V Y++ P++ V +Y+ E R +GI R+ A+ + VD DG
Sbjct 114 AQ-GERRSLQVAYISTPDLQVTRVEQAYSCIELGREYRYEGI-FRNFTAN--MKVDEDGL 169
Query 189 IVDYPGLAERM 199
++DYP L R+
Sbjct 170 VLDYPTLFTRL 180
>gi|91778317|ref|YP_553525.1| hypothetical protein Bxe_B1793 [Burkholderia xenovorans LB400]
gi|91690977|gb|ABE34175.1| Hypothetical protein Bxe_B1793 [Burkholderia xenovorans LB400]
Length=184
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/160 (33%), Positives = 77/160 (49%), Gaps = 15/160 (9%)
Query 45 IVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGE 104
+V P +G HY ++ D T+ + + +L + D E W D G
Sbjct 31 VVVGQRYGKP-YGLHYKVRCDAQWRTRYAWMKIVGGG---ELELHGDGEGHW--RDGHGL 84
Query 105 RRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTS- 163
+A G +DID+ +PF N LPIRRL L E E + V Y++ P++ V A +Y+
Sbjct 85 VLSAIEGCIDIDIAATPFTNTLPIRRLQLAE-GERRPISVAYISTPDLQVTRAGQAYSCI 143
Query 164 ----EGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
E R +GI R AD + VD DG ++DYP L R+
Sbjct 144 ALHREYRYEGI-FRDFTAD--LKVDEDGLVIDYPTLFTRL 180
>gi|187780008|ref|ZP_02996481.1| hypothetical protein CLOSPO_03604 [Clostridium sporogenes ATCC
15579]
gi|187773633|gb|EDU37435.1| hypothetical protein CLOSPO_03604 [Clostridium sporogenes ATCC
15579]
Length=200
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/186 (27%), Positives = 87/186 (47%), Gaps = 11/186 (5%)
Query 19 WRAQDISRMESVRVQLSGKRIRANGRIVAAATANNPAFGAHYDLQTDETGATKRFGLTVT 78
W+ + +E + + + + I+ N I+ +N Y++ D K+F + +
Sbjct 16 WKTFNGVGLEHLLLLKNHENIKVNSVILTMR--DNMPVRILYNMYCDLDWKVKKFDIEI- 72
Query 79 LAERERQLAIARDEENMWLVTDHQGERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAE 138
++ + + + D W T+ E G +DID+ +PF N +PIRRL L + E
Sbjct 73 FCDKHKNIILQSDGNGNW--TNDTNELVEDLKGCIDIDISITPFTNTIPIRRL-LLKVGE 129
Query 139 SIALPVVYVNVPEMSVDAATVSYTS-EGRLDGIKLRSPVADTTVT----VDSDGFIVDYP 193
S + VVYV++ S+ YT + L+G K R + T VD +G ++DYP
Sbjct 130 SKEIKVVYVDIYNYSLIPVKQRYTCLDSNLNGYKYRYENLNNGFTAEFFVDKEGVVIDYP 189
Query 194 GLAERM 199
L ER+
Sbjct 190 DLFERV 195
>gi|170695809|ref|ZP_02886950.1| protein of unknown function DUF1089 [Burkholderia graminis C4D1M]
gi|170139233|gb|EDT07420.1| protein of unknown function DUF1089 [Burkholderia graminis C4D1M]
Length=184
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/159 (33%), Positives = 76/159 (48%), Gaps = 14/159 (8%)
Query 46 VAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGER 105
V ++G HY ++ D T+ L + A +L + D E W D G
Sbjct 31 VVVGQRYGKSYGLHYTVRCDVQWRTRHAWLKIVGAG---ELELHGDGEGHW--RDGHGLV 85
Query 106 RAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTS-- 163
+A G +DID+ +PF N LPIRRL L + E L V Y++ P++ V +Y+
Sbjct 86 LSAIEGCIDIDIAATPFTNTLPIRRLQLAQ-GERRPLQVAYISTPDLQVTRVEQAYSCIE 144
Query 164 ---EGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
E R +GI R A+ + VD DG ++DYP L R+
Sbjct 145 LNREYRYEGI-FRDFTAN--MKVDDDGLVIDYPTLFSRL 180
>gi|296160708|ref|ZP_06843522.1| protein of unknown function DUF1089 [Burkholderia sp. Ch1-1]
gi|295889011|gb|EFG68815.1| protein of unknown function DUF1089 [Burkholderia sp. Ch1-1]
Length=184
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (32%), Positives = 77/160 (49%), Gaps = 15/160 (9%)
Query 45 IVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQGE 104
+V P +G HY ++ D T+ L + +L + D + W D G
Sbjct 31 VVVGQRYGKP-YGLHYKVRCDAQWRTRYAWLKIVGGG---ELELHGDGDGHW--RDGHGL 84
Query 105 RRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVSYTS- 163
+A G +DID+ +PF N LPIRRL L E E + V Y++ P++ + A +Y+
Sbjct 85 VLSAIEGCIDIDIAATPFTNTLPIRRLQLAE-GERRPISVAYISTPDLQITRAGQAYSCI 143
Query 164 ----EGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM 199
E R +GI R AD + VD DG ++DYP L R+
Sbjct 144 ALHREYRYEGI-FRDFTAD--LKVDEDGLVIDYPTLFTRL 180
Lambda K H
0.319 0.133 0.387
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 215040480604
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40