BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1480
Length=317
Sequences producing significant alignments: Score (Bits) E Value
gi|15608618|ref|NP_215996.1| hypothetical protein Rv1480 [Mycoba... 626 2e-177
gi|340626495|ref|YP_004744947.1| hypothetical protein MCAN_14971... 625 3e-177
gi|339631548|ref|YP_004723190.1| hypothetical protein MAF_15030 ... 625 4e-177
gi|289569505|ref|ZP_06449732.1| conserved hypothetical protein [... 624 9e-177
gi|15840941|ref|NP_335978.1| hypothetical protein MT1527 [Mycoba... 598 3e-169
gi|240172224|ref|ZP_04750883.1| hypothetical protein MkanA1_2311... 555 3e-156
gi|118463999|ref|YP_882480.1| hypothetical protein MAV_3298 [Myc... 527 1e-147
gi|15827967|ref|NP_302230.1| hypothetical protein ML1809 [Mycoba... 525 3e-147
gi|41407304|ref|NP_960140.1| hypothetical protein MAP1206 [Mycob... 523 1e-146
gi|342858771|ref|ZP_08715426.1| hypothetical protein MCOL_07836 ... 523 1e-146
gi|254819549|ref|ZP_05224550.1| hypothetical protein MintA_06479... 522 3e-146
gi|183982300|ref|YP_001850591.1| hypothetical protein MMAR_2287 ... 509 2e-142
gi|118617150|ref|YP_905482.1| hypothetical protein MUL_1489 [Myc... 507 1e-141
gi|296170659|ref|ZP_06852234.1| conserved hypothetical protein [... 504 1e-140
gi|108799421|ref|YP_639618.1| hypothetical protein Mmcs_2454 [My... 495 4e-138
gi|333990874|ref|YP_004523488.1| hypothetical protein JDM601_223... 492 4e-137
gi|169629809|ref|YP_001703458.1| hypothetical protein MAB_2725c ... 489 2e-136
gi|118470346|ref|YP_887463.1| hypothetical protein MSMEG_3148 [M... 488 5e-136
gi|145224244|ref|YP_001134922.1| hypothetical protein Mflv_3660 ... 474 6e-132
gi|315444580|ref|YP_004077459.1| hypothetical protein Mspyr1_300... 474 1e-131
gi|120403734|ref|YP_953563.1| hypothetical protein Mvan_2750 [My... 459 3e-127
gi|226306559|ref|YP_002766519.1| hypothetical protein RER_30720 ... 451 8e-125
gi|54025449|ref|YP_119691.1| hypothetical protein nfa34790 [Noca... 440 2e-121
gi|111024161|ref|YP_707133.1| hypothetical protein RHA1_ro07211 ... 439 3e-121
gi|333919587|ref|YP_004493168.1| hypothetical protein AS9A_1919 ... 434 6e-120
gi|226366408|ref|YP_002784191.1| hypothetical protein ROP_69990 ... 434 7e-120
gi|317508724|ref|ZP_07966377.1| hypothetical protein HMPREF9336_... 419 2e-115
gi|312139645|ref|YP_004006981.1| hypothetical protein REQ_22470 ... 416 2e-114
gi|296393888|ref|YP_003658772.1| hypothetical protein Srot_1478 ... 416 3e-114
gi|262202332|ref|YP_003273540.1| hypothetical protein Gbro_2405 ... 405 6e-111
gi|296139787|ref|YP_003647030.1| hypothetical protein Tpau_2079 ... 400 1e-109
gi|343927989|ref|ZP_08767455.1| hypothetical protein GOALK_099_0... 389 3e-106
gi|326382236|ref|ZP_08203928.1| hypothetical protein SCNU_04801 ... 372 3e-101
gi|257056240|ref|YP_003134072.1| hypothetical protein Svir_22370... 362 3e-98
gi|331697177|ref|YP_004333416.1| hypothetical protein Psed_3373 ... 361 7e-98
gi|302527161|ref|ZP_07279503.1| hypothetical protein SSMG_03543 ... 354 1e-95
gi|300786827|ref|YP_003767118.1| hypothetical protein AMED_4950 ... 352 4e-95
gi|134100329|ref|YP_001105990.1| putative von Willebrand factor,... 342 4e-92
gi|258652509|ref|YP_003201665.1| hypothetical protein Namu_2299 ... 338 5e-91
gi|256376277|ref|YP_003099937.1| hypothetical protein Amir_2146 ... 326 4e-87
gi|291299991|ref|YP_003511269.1| hypothetical protein Snas_2494 ... 301 1e-79
gi|284990592|ref|YP_003409146.1| hypothetical protein Gobs_2087 ... 293 2e-77
gi|145595545|ref|YP_001159842.1| hypothetical protein Strop_3027... 288 8e-76
gi|284030498|ref|YP_003380429.1| hypothetical protein Kfla_2561 ... 286 4e-75
gi|330466228|ref|YP_004403971.1| hypothetical protein VAB18032_1... 285 8e-75
gi|330469088|ref|YP_004406831.1| hypothetical protein VAB18032_2... 284 1e-74
gi|319949308|ref|ZP_08023385.1| hypothetical protein ES5_07786 [... 284 1e-74
gi|288919020|ref|ZP_06413361.1| conserved hypothetical protein [... 281 7e-74
gi|238060067|ref|ZP_04604776.1| hypothetical protein MCAG_01033 ... 278 9e-73
gi|315504834|ref|YP_004083721.1| hypothetical protein ML5_4060 [... 272 6e-71
>gi|15608618|ref|NP_215996.1| hypothetical protein Rv1480 [Mycobacterium tuberculosis H37Rv]
gi|31792675|ref|NP_855168.1| hypothetical protein Mb1516 [Mycobacterium bovis AF2122/97]
gi|121637411|ref|YP_977634.1| hypothetical protein BCG_1542 [Mycobacterium bovis BCG str. Pasteur
1173P2]
74 more sequence titles
Length=317
Score = 626 bits (1614), Expect = 2e-177, Method: Compositional matrix adjust.
Identities = 316/317 (99%), Positives = 317/317 (100%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL
Sbjct 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG
Sbjct 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV
Sbjct 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD
Sbjct 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFVASRRRGALAGHQ
Sbjct 301 IVRFVASRRRGALAGHQ 317
>gi|340626495|ref|YP_004744947.1| hypothetical protein MCAN_14971 [Mycobacterium canettii CIPT
140010059]
gi|340004685|emb|CCC43829.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=317
Score = 625 bits (1611), Expect = 3e-177, Method: Compositional matrix adjust.
Identities = 315/317 (99%), Positives = 317/317 (100%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL
Sbjct 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG
Sbjct 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV
Sbjct 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGVVREF+IDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD
Sbjct 241 GDVVLQDAESGVVREFTIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFVASRRRGALAGHQ
Sbjct 301 IVRFVASRRRGALAGHQ 317
>gi|339631548|ref|YP_004723190.1| hypothetical protein MAF_15030 [Mycobacterium africanum GM041182]
gi|339330904|emb|CCC26575.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=317
Score = 625 bits (1611), Expect = 4e-177, Method: Compositional matrix adjust.
Identities = 315/317 (99%), Positives = 317/317 (100%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL
Sbjct 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG
Sbjct 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV
Sbjct 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD
Sbjct 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFVASR+RGALAGHQ
Sbjct 301 IVRFVASRQRGALAGHQ 317
>gi|289569505|ref|ZP_06449732.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289543259|gb|EFD46907.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=317
Score = 624 bits (1608), Expect = 9e-177, Method: Compositional matrix adjust.
Identities = 315/317 (99%), Positives = 317/317 (100%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRR+DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL
Sbjct 61 ESRLYQPGDDVRRVDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG
Sbjct 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV
Sbjct 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD
Sbjct 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFVASRRRGALAGHQ
Sbjct 301 IVRFVASRRRGALAGHQ 317
>gi|15840941|ref|NP_335978.1| hypothetical protein MT1527 [Mycobacterium tuberculosis CDC1551]
gi|13881147|gb|AAK45792.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
Length=303
Score = 598 bits (1543), Expect = 3e-169, Method: Compositional matrix adjust.
Identities = 303/303 (100%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 15 MLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRM 74
MLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRM
Sbjct 1 MLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRM 60
Query 75 DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSG 134
DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSG
Sbjct 61 DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSG 120
Query 135 GGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPER 194
GGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPER
Sbjct 121 GGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPER 180
Query 195 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR 254
RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR
Sbjct 181 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR 240
Query 255 EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALA 314
EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALA
Sbjct 241 EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALA 300
Query 315 GHQ 317
GHQ
Sbjct 301 GHQ 303
>gi|240172224|ref|ZP_04750883.1| hypothetical protein MkanA1_23114 [Mycobacterium kansasii ATCC
12478]
Length=317
Score = 555 bits (1430), Expect = 3e-156, Method: Compositional matrix adjust.
Identities = 294/317 (93%), Positives = 310/317 (98%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TESKAPAVVHPPS+LRGDIDDPKL+AALRTLELTVK KLDGVLHGDHLGLIPGPG+EPG
Sbjct 1 MTESKAPAVVHPPSLLRGDIDDPKLSAALRTLELTVKHKLDGVLHGDHLGLIPGPGTEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTA CEKRDL
Sbjct 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTAVCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNR+GALIANGA MTRVPARTGRQHQHTMLRTIATMP+APAGVRG
Sbjct 121 AVAAAAAITFLNSGGGNRIGALIANGATMTRVPARTGRQHQHTMLRTIATMPKAPAGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLAVAIDALRRPERRRGMAV+ISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPD+
Sbjct 181 DLAVAIDALRRPERRRGMAVVISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDI 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDV+LQDAESGV REF+ID L++DFARAAAAHRADVART+RGCGAP+L+LRTDRDWLAD
Sbjct 241 GDVILQDAESGVTREFTIDTQLQNDFARAAAAHRADVARTLRGCGAPVLTLRTDRDWLAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFVASRRRGA+AG Q
Sbjct 301 IVRFVASRRRGAMAGVQ 317
>gi|118463999|ref|YP_882480.1| hypothetical protein MAV_3298 [Mycobacterium avium 104]
gi|254775743|ref|ZP_05217259.1| hypothetical protein MaviaA2_13895 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118165286|gb|ABK66183.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=316
Score = 527 bits (1357), Expect = 1e-147, Method: Compositional matrix adjust.
Identities = 281/317 (89%), Positives = 297/317 (94%), Gaps = 1/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+T+ K P V+HPPSM RG IDDPKL+AALRTLELTVK+KLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTDPKRP-VLHPPSMQRGQIDDPKLSAALRTLELTVKRKLDGVLHGDHLGLIPGPGSEPG 59
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR YQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGT CEKRDL
Sbjct 60 ESREYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTTVCEKRDL 119
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGA MTRVPAR+GRQH+ T+LRTIAT P+AP GVRG
Sbjct 120 AVAAAAAITFLNSGGGNRLGALIANGATMTRVPARSGRQHEQTLLRTIATTPRAPVGVRG 179
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLA AIDALRRPERRRGMAVIISDFLGPINW RPLRAIAARHEVLAIEVLDPRDVELPD+
Sbjct 180 DLATAIDALRRPERRRGMAVIISDFLGPINWQRPLRAIAARHEVLAIEVLDPRDVELPDI 239
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAE+GV REF+ID LRDDFA+AAAAHRADVARTIR CGAP+L+LRTDRDW+AD
Sbjct 240 GDVVLQDAETGVTREFTIDAQLRDDFAKAAAAHRADVARTIRSCGAPILTLRTDRDWIAD 299
Query 301 IVRFVASRRRGALAGHQ 317
IVRFV SRRRGALAG Q
Sbjct 300 IVRFVESRRRGALAGRQ 316
>gi|15827967|ref|NP_302230.1| hypothetical protein ML1809 [Mycobacterium leprae TN]
gi|221230444|ref|YP_002503860.1| hypothetical protein MLBr_01809 [Mycobacterium leprae Br4923]
gi|13093520|emb|CAC30762.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933551|emb|CAR71904.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=320
Score = 525 bits (1352), Expect = 3e-147, Method: Compositional matrix adjust.
Identities = 272/311 (88%), Positives = 290/311 (94%), Gaps = 0/311 (0%)
Query 7 PAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQ 66
P V HPPSM RG IDDPKL+AALRTLELTVK+KLDGVLHGDHLGLI GPGSEPGESR+YQ
Sbjct 10 PGVFHPPSMQRGQIDDPKLSAALRTLELTVKRKLDGVLHGDHLGLISGPGSEPGESRVYQ 69
Query 67 PGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAA 126
PGDDVRRMDWAVTARTTHPHVRQMIADRELETW+V+DMSASLDFGT CEKRDLAVAAAA
Sbjct 70 PGDDVRRMDWAVTARTTHPHVRQMIADRELETWMVIDMSASLDFGTTICEKRDLAVAAAA 129
Query 127 AITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAI 186
AITFLNSGGGNRLGALI NGA MTRVPAR+GRQH+ T+LRTIAT P+AP GVRGDL VAI
Sbjct 130 AITFLNSGGGNRLGALICNGARMTRVPARSGRQHEQTLLRTIATTPKAPVGVRGDLTVAI 189
Query 187 DALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQ 246
DALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVL IEVLDPRDV LPD+G+VVLQ
Sbjct 190 DALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLGIEVLDPRDVALPDIGEVVLQ 249
Query 247 DAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVA 306
DAE+GV REF+ID ALRDDFARAAAAH ADV+R++R CGAPL+SLRTDRDW+ADIVRFV
Sbjct 250 DAETGVTREFTIDAALRDDFARAAAAHCADVSRSLRNCGAPLMSLRTDRDWIADIVRFVE 309
Query 307 SRRRGALAGHQ 317
SRRRGALAG Q
Sbjct 310 SRRRGALAGRQ 320
>gi|41407304|ref|NP_960140.1| hypothetical protein MAP1206 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395656|gb|AAS03523.1| hypothetical protein MAP_1206 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336458033|gb|EGO37020.1| hypothetical protein MAPs_17310 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=316
Score = 523 bits (1348), Expect = 1e-146, Method: Compositional matrix adjust.
Identities = 280/317 (89%), Positives = 296/317 (94%), Gaps = 1/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+T+ K P V+HPPSM RG IDDPKL+AALRTLELTVK+KLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTDPKRP-VLHPPSMQRGQIDDPKLSAALRTLELTVKRKLDGVLHGDHLGLIPGPGSEPG 59
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR YQPG DVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGT CEKRDL
Sbjct 60 ESREYQPGADVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTTVCEKRDL 119
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGALIANGA MTRVPAR+GRQH+ T+LRTIAT P+AP GVRG
Sbjct 120 AVAAAAAITFLNSGGGNRLGALIANGATMTRVPARSGRQHEQTLLRTIATTPRAPVGVRG 179
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLA AIDALRRPERRRGMAVIISDFLGPINW RPLRAIAARHEVLAIEVLDPRDVELPD+
Sbjct 180 DLATAIDALRRPERRRGMAVIISDFLGPINWQRPLRAIAARHEVLAIEVLDPRDVELPDI 239
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAE+GV REF+ID LRDDFA+AAAAHRADVARTIR CGAP+L+LRTDRDW+AD
Sbjct 240 GDVVLQDAETGVTREFTIDAQLRDDFAKAAAAHRADVARTIRSCGAPILTLRTDRDWIAD 299
Query 301 IVRFVASRRRGALAGHQ 317
IVRFV SRRRGALAG Q
Sbjct 300 IVRFVESRRRGALAGRQ 316
>gi|342858771|ref|ZP_08715426.1| hypothetical protein MCOL_07836 [Mycobacterium colombiense CECT
3035]
gi|342134475|gb|EGT87655.1| hypothetical protein MCOL_07836 [Mycobacterium colombiense CECT
3035]
Length=316
Score = 523 bits (1348), Expect = 1e-146, Method: Compositional matrix adjust.
Identities = 276/317 (88%), Positives = 298/317 (95%), Gaps = 1/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+T++K P VVHPPSM RG IDDPKL+AALRTLELTVK+KLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTDAKRP-VVHPPSMQRGQIDDPKLSAALRTLELTVKRKLDGVLHGDHLGLIPGPGSEPG 59
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR YQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGT CEKRDL
Sbjct 60 ESREYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTTVCEKRDL 119
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGA+++ GA +TRVPAR+GRQH+ T+LRTIAT P+AP GVRG
Sbjct 120 AVAAAAAITFLNSGGGNRLGAIVSTGANITRVPARSGRQHEQTLLRTIATTPRAPVGVRG 179
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLA AIDALRRPERRRGMAVIISDFLGPINWMRPLRA+AARHEVLA+EVLDPRD+ELPD+
Sbjct 180 DLATAIDALRRPERRRGMAVIISDFLGPINWMRPLRAVAARHEVLAVEVLDPRDIELPDI 239
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGV REF+ID LRDDFA+AAAAHRADVARTIR CGAP+L+LRTDRDW+AD
Sbjct 240 GDVVLQDAESGVTREFTIDAQLRDDFAKAAAAHRADVARTIRSCGAPILTLRTDRDWIAD 299
Query 301 IVRFVASRRRGALAGHQ 317
IVRFV SRRRGALAG Q
Sbjct 300 IVRFVESRRRGALAGRQ 316
>gi|254819549|ref|ZP_05224550.1| hypothetical protein MintA_06479 [Mycobacterium intracellulare
ATCC 13950]
Length=317
Score = 522 bits (1345), Expect = 3e-146, Method: Compositional matrix adjust.
Identities = 276/317 (88%), Positives = 295/317 (94%), Gaps = 0/317 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+T K + VHPPSM RG IDDPKL+AALRTLELTVK+KLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTGPKGRSAVHPPSMQRGQIDDPKLSAALRTLELTVKRKLDGVLHGDHLGLIPGPGSEPG 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR YQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGT CEKRDL
Sbjct 61 ESREYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTTVCEKRDL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAITFLNSGGGNRLGA+++ GA +TRVPAR+GRQH+ T+LRTIAT P+AP GVRG
Sbjct 121 AVAAAAAITFLNSGGGNRLGAIVSTGANITRVPARSGRQHEQTLLRTIATTPRAPVGVRG 180
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DLA AIDALRRPERRRGMAVIISDFLGPINWMRPLRA+AARHEVLAIEVLDPRDVELPD+
Sbjct 181 DLATAIDALRRPERRRGMAVIISDFLGPINWMRPLRAVAARHEVLAIEVLDPRDVELPDI 240
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAESGV REF+ID LRDDFA+AAAAHRADVARTIR CGAP+L+LRTDRDW+AD
Sbjct 241 GDVVLQDAESGVTREFTIDAQLRDDFAKAAAAHRADVARTIRSCGAPVLTLRTDRDWIAD 300
Query 301 IVRFVASRRRGALAGHQ 317
IVRFV SRRRGALAG Q
Sbjct 301 IVRFVESRRRGALAGRQ 317
>gi|183982300|ref|YP_001850591.1| hypothetical protein MMAR_2287 [Mycobacterium marinum M]
gi|183175626|gb|ACC40736.1| conserved protein [Mycobacterium marinum M]
Length=318
Score = 509 bits (1312), Expect = 2e-142, Method: Compositional matrix adjust.
Identities = 264/305 (87%), Positives = 287/305 (95%), Gaps = 0/305 (0%)
Query 3 ESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGES 62
E+ P VV+PPSM RGDI+DPKLAAAL+TLEL V+ KLDGVLHGD+LGL+PGPGSEPGES
Sbjct 4 EAVGPRVVNPPSMQRGDINDPKLAAALKTLELAVRHKLDGVLHGDYLGLLPGPGSEPGES 63
Query 63 RLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAV 122
R+YQPGDDVRRMDW+VTARTT PHVRQMIADRELETWLVVDMSASLDFGTA CEKRDLAV
Sbjct 64 RIYQPGDDVRRMDWSVTARTTTPHVRQMIADRELETWLVVDMSASLDFGTAVCEKRDLAV 123
Query 123 AAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDL 182
AAAAAI+FLNSGGGNRLGALI+NGA +TRVPAR+GRQH TMLRTIAT P+AP GVRGDL
Sbjct 124 AAAAAISFLNSGGGNRLGALISNGATLTRVPARSGRQHLQTMLRTIATTPKAPVGVRGDL 183
Query 183 AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGD 242
AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAI+ARHEVLAIEVLDPRDVELPD+GD
Sbjct 184 AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAISARHEVLAIEVLDPRDVELPDIGD 243
Query 243 VVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIV 302
VVLQDAE+GV REF+ID L++DFARAAAAHRADV RTIRGCGAP++SLRTDRDW+ADIV
Sbjct 244 VVLQDAETGVTREFTIDAQLQNDFARAAAAHRADVVRTIRGCGAPVMSLRTDRDWIADIV 303
Query 303 RFVAS 307
RFV S
Sbjct 304 RFVTS 308
>gi|118617150|ref|YP_905482.1| hypothetical protein MUL_1489 [Mycobacterium ulcerans Agy99]
gi|118569260|gb|ABL04011.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=318
Score = 507 bits (1305), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 263/305 (87%), Positives = 286/305 (94%), Gaps = 0/305 (0%)
Query 3 ESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGES 62
E+ P VV+PPSM RGDI+DPKLAAAL+TLEL V+ KLDGVLHGD+LGL+PGPGSEPGES
Sbjct 4 EAVGPRVVNPPSMQRGDINDPKLAAALKTLELAVRHKLDGVLHGDYLGLLPGPGSEPGES 63
Query 63 RLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAV 122
+YQPGDDVRRMDW+VTARTT PHVRQMIADRELETWLVVDMSASLDFGTA CEKRDLAV
Sbjct 64 LIYQPGDDVRRMDWSVTARTTTPHVRQMIADRELETWLVVDMSASLDFGTAVCEKRDLAV 123
Query 123 AAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDL 182
AAAAAI+FLNSGGGNRLGALI+NGA +TRVPAR+GRQH TMLRTIAT P+AP GVRGDL
Sbjct 124 AAAAAISFLNSGGGNRLGALISNGATLTRVPARSGRQHLQTMLRTIATTPKAPVGVRGDL 183
Query 183 AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGD 242
AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAI+ARHEVLAIEVLDPRDVELPD+GD
Sbjct 184 AVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAISARHEVLAIEVLDPRDVELPDIGD 243
Query 243 VVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIV 302
VVLQDAE+GV REF+ID L++DFARAAAAHRADV RTIRGCGAP++SLRTDRDW+ADIV
Sbjct 244 VVLQDAETGVTREFTIDAQLQNDFARAAAAHRADVVRTIRGCGAPVMSLRTDRDWIADIV 303
Query 303 RFVAS 307
RFV S
Sbjct 304 RFVTS 308
>gi|296170659|ref|ZP_06852234.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894648|gb|EFG74382.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=303
Score = 504 bits (1297), Expect = 1e-140, Method: Compositional matrix adjust.
Identities = 269/303 (89%), Positives = 282/303 (94%), Gaps = 0/303 (0%)
Query 15 MLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRM 74
M RG I DPKLAAALR LELTVK+KLDGVLHGDHLGLIPGPGSEPGESR YQPGDDVRRM
Sbjct 1 MQRGQIVDPKLAAALRQLELTVKRKLDGVLHGDHLGLIPGPGSEPGESREYQPGDDVRRM 60
Query 75 DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSG 134
DWAVTART HPHVRQMIADRELETW+VVDMSASLDFGT CEKRDLAVAAAAAITFLNSG
Sbjct 61 DWAVTARTMHPHVRQMIADRELETWMVVDMSASLDFGTVGCEKRDLAVAAAAAITFLNSG 120
Query 135 GGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPER 194
GGNRLGALIANG M RVPAR+GRQH+ T+LRTIAT P+AP GVRGDLA AIDALRRPER
Sbjct 121 GGNRLGALIANGQTMVRVPARSGRQHEQTLLRTIATTPRAPVGVRGDLATAIDALRRPER 180
Query 195 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR 254
RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPD+GDVVLQDAESGV R
Sbjct 181 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDIGDVVLQDAESGVTR 240
Query 255 EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALA 314
EF+ID LRDDFA+AAAAHRA+VARTIR CGAP+L+LRTDRDW+ADIVRFV SRRRGALA
Sbjct 241 EFTIDAQLRDDFAKAAAAHRAEVARTIRSCGAPVLTLRTDRDWIADIVRFVESRRRGALA 300
Query 315 GHQ 317
G Q
Sbjct 301 GRQ 303
>gi|108799421|ref|YP_639618.1| hypothetical protein Mmcs_2454 [Mycobacterium sp. MCS]
gi|119868534|ref|YP_938486.1| hypothetical protein Mkms_2499 [Mycobacterium sp. KMS]
gi|126435075|ref|YP_001070766.1| hypothetical protein Mjls_2491 [Mycobacterium sp. JLS]
gi|108769840|gb|ABG08562.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119694623|gb|ABL91696.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126234875|gb|ABN98275.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=315
Score = 495 bits (1274), Expect = 4e-138, Method: Compositional matrix adjust.
Identities = 256/315 (82%), Positives = 283/315 (90%), Gaps = 1/315 (0%)
Query 1 VTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+TES +V PS+ RG I DP L+AALR LELTV++KLDGVLHGDHLGLIPGPGSEPG
Sbjct 1 MTESDGRSV-DLPSLQRGQIRDPALSAALRKLELTVRRKLDGVLHGDHLGLIPGPGSEPG 59
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR+YQPGDDVRRMDW+VTARTT PHVR+MIADRELETWLVVDMSASLDFGTA CEKRDL
Sbjct 60 ESRIYQPGDDVRRMDWSVTARTTVPHVREMIADRELETWLVVDMSASLDFGTAGCEKRDL 119
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAI FLNSGGGNRLGA+I+NG M RVPA +GR H+ +LRTIAT P+AP GVRG
Sbjct 120 AVAAAAAIAFLNSGGGNRLGAVISNGQTMRRVPALSGRMHEQEVLRTIATTPKAPPGVRG 179
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
+LA AIDALRRPERRRGMAV+ISDFLGPI+WMRPLRAIA RHEVL IEVLDPRDVELP+V
Sbjct 180 NLAEAIDALRRPERRRGMAVVISDFLGPIDWMRPLRAIAGRHEVLGIEVLDPRDVELPEV 239
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
GDVVLQDAE+GV REF+ID LR+DF RAAA HRA+VART+R CGAPLLSLRTDRDW+AD
Sbjct 240 GDVVLQDAETGVTREFTIDHQLREDFERAAAEHRAEVARTLRRCGAPLLSLRTDRDWIAD 299
Query 301 IVRFVASRRRGALAG 315
+VRFVASRRRGA+AG
Sbjct 300 VVRFVASRRRGAMAG 314
>gi|333990874|ref|YP_004523488.1| hypothetical protein JDM601_2234 [Mycobacterium sp. JDM601]
gi|333486842|gb|AEF36234.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=324
Score = 492 bits (1266), Expect = 4e-137, Method: Compositional matrix adjust.
Identities = 270/304 (89%), Positives = 284/304 (94%), Gaps = 0/304 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPSMLRG I DPKLAAALRTLELTVK+KLDGVLHGDHLGLIPGPGSEPGESR YQPGDDV
Sbjct 19 PPSMLRGGIRDPKLAAALRTLELTVKRKLDGVLHGDHLGLIPGPGSEPGESRPYQPGDDV 78
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
RRMDW+VTARTTHPHVRQMIADRELETWLVVDMSAS+DFGTA CEKRDLAVAAAAAI +L
Sbjct 79 RRMDWSVTARTTHPHVRQMIADRELETWLVVDMSASMDFGTATCEKRDLAVAAAAAIGYL 138
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
NSGGGNRLGAL+ANG + RVPAR+GR H+ T+LRTIAT+P+APAGVRGDLA AIDALRR
Sbjct 139 NSGGGNRLGALVANGDQVLRVPARSGRNHEQTLLRTIATIPRAPAGVRGDLAAAIDALRR 198
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
PERRRGMAVIISDFLGPINWMRPLRAIAARHEVL IEVLDPRDVELPDVGDVVLQD ESG
Sbjct 199 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLGIEVLDPRDVELPDVGDVVLQDTESG 258
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
V REF+ID LRDDFARAAAAHRADVA +R CGAPLLSLRTDRDW+ADIVRFV SRRRG
Sbjct 259 VTREFTIDAKLRDDFARAAAAHRADVAHALRSCGAPLLSLRTDRDWIADIVRFVESRRRG 318
Query 312 ALAG 315
ALAG
Sbjct 319 ALAG 322
>gi|169629809|ref|YP_001703458.1| hypothetical protein MAB_2725c [Mycobacterium abscessus ATCC
19977]
gi|169241776|emb|CAM62804.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=323
Score = 489 bits (1259), Expect = 2e-136, Method: Compositional matrix adjust.
Identities = 246/315 (79%), Positives = 280/315 (89%), Gaps = 3/315 (0%)
Query 4 SKAPAVVHP---PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
+ +PA P PS+ RG+I DP+L+AALRTLEL +++KLDGVLHG+HLGLIPGPGSEPG
Sbjct 3 TPSPASGRPVDIPSLRRGEIRDPQLSAALRTLELKIRRKLDGVLHGNHLGLIPGPGSEPG 62
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESRLYQPGDDVRRMDW+VTARTT PHVRQMIADRELETWLVVDMSASLDFGT CEKRDL
Sbjct 63 ESRLYQPGDDVRRMDWSVTARTTSPHVRQMIADRELETWLVVDMSASLDFGTTNCEKRDL 122
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRG 180
AVAAAAAI FLNSGGGNRLGA+IANG + R+PAR+GR H+ +LR+IATMP+AP GVRG
Sbjct 123 AVAAAAAIIFLNSGGGNRLGAIIANGDKIVRLPARSGRAHEQDILRSIATMPKAPQGVRG 182
Query 181 DLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV 240
DL+VAIDALRRPERRRGMAVIISDFLGPINWMRPLRAI RHEVL IEVLDPRDVELPDV
Sbjct 183 DLSVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIGGRHEVLGIEVLDPRDVELPDV 242
Query 241 GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLAD 300
G+V+LQDAE+G+ +E+ ID LR DF +AAA H +VART+R CGAPLLSLRTDRDW++D
Sbjct 243 GEVLLQDAETGITKEYRIDENLRRDFQQAAARHHEEVARTLRRCGAPLLSLRTDRDWISD 302
Query 301 IVRFVASRRRGALAG 315
IVRFV+ +RRGA+AG
Sbjct 303 IVRFVSQQRRGAVAG 317
>gi|118470346|ref|YP_887463.1| hypothetical protein MSMEG_3148 [Mycobacterium smegmatis str.
MC2 155]
gi|118171633|gb|ABK72529.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=315
Score = 488 bits (1256), Expect = 5e-136, Method: Compositional matrix adjust.
Identities = 248/307 (81%), Positives = 275/307 (90%), Gaps = 0/307 (0%)
Query 9 VVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPG 68
V PS+ RG+I DP L+AALR LELTV++KLDGVLHGDHLGL+PGPGSEPGESR+Y+PG
Sbjct 8 TVDLPSLQRGEIRDPALSAALRKLELTVRRKLDGVLHGDHLGLLPGPGSEPGESRMYEPG 67
Query 69 DDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAI 128
DDVRRMDW+VTARTT PHVRQMIADRELETWLVVDMSASLDFGTA CEKRDLAVAAAAAI
Sbjct 68 DDVRRMDWSVTARTTTPHVRQMIADRELETWLVVDMSASLDFGTAGCEKRDLAVAAAAAI 127
Query 129 TFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDA 188
FLNSGGGNRLGA+IANG M RVPA +GR H+ +LR IAT P+AP GVRGDL+ AIDA
Sbjct 128 AFLNSGGGNRLGAVIANGDTMRRVPALSGRMHERELLRAIATTPKAPTGVRGDLSAAIDA 187
Query 189 LRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDA 248
LRRPERRRGMAVIISDFLGPINWMRPLRAIA RHEVL IE+LDPRDVELP VGDV+LQD
Sbjct 188 LRRPERRRGMAVIISDFLGPINWMRPLRAIAGRHEVLGIEILDPRDVELPPVGDVILQDT 247
Query 249 ESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASR 308
E+GV REF++D LR DF +AAAAHR +VART+R CGAPLLSLRTDRDW+AD++RFVA+R
Sbjct 248 ETGVTREFTVDEQLRHDFEQAAAAHREEVARTLRRCGAPLLSLRTDRDWIADVMRFVANR 307
Query 309 RRGALAG 315
RRGALAG
Sbjct 308 RRGALAG 314
>gi|145224244|ref|YP_001134922.1| hypothetical protein Mflv_3660 [Mycobacterium gilvum PYR-GCK]
gi|145216730|gb|ABP46134.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=316
Score = 474 bits (1221), Expect = 6e-132, Method: Compositional matrix adjust.
Identities = 252/311 (82%), Positives = 278/311 (90%), Gaps = 0/311 (0%)
Query 4 SKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESR 63
S++ V PSM RG+I DP L+AALR LELTV++KLDGVLHGDHLGL+PGPGSEPGESR
Sbjct 5 SRSSRSVDIPSMKRGEIRDPALSAALRKLELTVRRKLDGVLHGDHLGLLPGPGSEPGESR 64
Query 64 LYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVA 123
+YQPGDDVRRMDW+VTARTTHPHVRQMIADRELETWLVVDMSASLDFGTA CEKRDLAVA
Sbjct 65 VYQPGDDVRRMDWSVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTAGCEKRDLAVA 124
Query 124 AAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLA 183
AAA+I FLNSGGGNR+GA+IANG + RVPA +GR H+ +LRTIATMP+A GVRGDLA
Sbjct 125 AAASIAFLNSGGGNRIGAIIANGETVRRVPALSGRMHEQELLRTIATMPKAAPGVRGDLA 184
Query 184 VAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDV 243
AIDALRRPERRRGMAVIISDFLGPINWMRPLRAIA RHEVL IE++DPRDVELP VGDV
Sbjct 185 AAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAGRHEVLGIEIIDPRDVELPVVGDV 244
Query 244 VLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVR 303
VLQD E+G REF+ID LR DF +AAAAHRADVART+R C APLL+LRTDRDW+AD+VR
Sbjct 245 VLQDTETGRTREFTIDEQLRSDFEKAAAAHRADVARTLRRCDAPLLTLRTDRDWIADVVR 304
Query 304 FVASRRRGALA 314
FVASRRRGALA
Sbjct 305 FVASRRRGALA 315
>gi|315444580|ref|YP_004077459.1| hypothetical protein Mspyr1_30060 [Mycobacterium sp. Spyr1]
gi|315262883|gb|ADT99624.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=316
Score = 474 bits (1219), Expect = 1e-131, Method: Compositional matrix adjust.
Identities = 252/311 (82%), Positives = 278/311 (90%), Gaps = 0/311 (0%)
Query 4 SKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESR 63
S++ V PSM RG+I DP L+AALR LELTV++KLDGVLHGDHLGL+PGPGSEPGESR
Sbjct 5 SRSSRSVDIPSMKRGEIRDPALSAALRKLELTVRRKLDGVLHGDHLGLLPGPGSEPGESR 64
Query 64 LYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVA 123
+YQPGDDVRRMDW+VTARTTHPHVRQMIADRELETWLVVDMSASLDFGTA CEKRDLAVA
Sbjct 65 VYQPGDDVRRMDWSVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTAGCEKRDLAVA 124
Query 124 AAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLA 183
AAA+I FLNSGGGNR+GA+IANG + RVPA +GR H+ +LRTIATMP+A GVRGDLA
Sbjct 125 AAASIAFLNSGGGNRIGAIIANGETVRRVPALSGRMHEQELLRTIATMPKAAPGVRGDLA 184
Query 184 VAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDV 243
AIDALRRPERRRGMAVIISDFLGPINWMRPLRAIA RHEVL IE++DPRDVELP VGDV
Sbjct 185 AAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAGRHEVLGIEIIDPRDVELPVVGDV 244
Query 244 VLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVR 303
VLQD E+G REF+ID LR DF +AAAAHRADVART+R C APLL+LRTDRDW+AD+VR
Sbjct 245 VLQDTETGHTREFTIDEQLRSDFEKAAAAHRADVARTLRRCDAPLLTLRTDRDWIADVVR 304
Query 304 FVASRRRGALA 314
FVASRRRGALA
Sbjct 305 FVASRRRGALA 315
>gi|120403734|ref|YP_953563.1| hypothetical protein Mvan_2750 [Mycobacterium vanbaalenii PYR-1]
gi|119956552|gb|ABM13557.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=302
Score = 459 bits (1181), Expect = 3e-127, Method: Compositional matrix adjust.
Identities = 245/300 (82%), Positives = 269/300 (90%), Gaps = 0/300 (0%)
Query 15 MLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRM 74
M RG+I DP LAAALR LELTV++KLDGVLHGDHLGL+PGPGSEPGESR YQPGDDVRRM
Sbjct 1 MQRGEIRDPALAAALRKLELTVRRKLDGVLHGDHLGLLPGPGSEPGESRAYQPGDDVRRM 60
Query 75 DWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSG 134
DW+VTARTTHPHVRQMIADRELETWLVVD+SASLDFGTA CEKRDLAVAAAA+I FLNSG
Sbjct 61 DWSVTARTTHPHVRQMIADRELETWLVVDVSASLDFGTANCEKRDLAVAAAASIAFLNSG 120
Query 135 GGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPER 194
GGNR+GA+I+NG M RVPA +GR H+ +LR IAT P+AP GVRGDLA AIDALRRPER
Sbjct 121 GGNRIGAVISNGETMRRVPALSGRMHEQELLRAIATTPRAPVGVRGDLAAAIDALRRPER 180
Query 195 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR 254
RRGM VIISDFLGPINWMRPLRAIA RHEVL IE++DPRDVELP VGDVVLQD E+G R
Sbjct 181 RRGMVVIISDFLGPINWMRPLRAIAGRHEVLGIEIIDPRDVELPAVGDVVLQDTETGRTR 240
Query 255 EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALA 314
EF+ID LR DFA+AAAAHRA+VART+R C A LL+LRTDRDW+AD+VRFVASRRRGALA
Sbjct 241 EFTIDEQLRTDFAKAAAAHRAEVARTLRRCDALLLTLRTDRDWIADVVRFVASRRRGALA 300
>gi|226306559|ref|YP_002766519.1| hypothetical protein RER_30720 [Rhodococcus erythropolis PR4]
gi|229493654|ref|ZP_04387439.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|226185676|dbj|BAH33780.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
gi|229319615|gb|EEN85451.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=330
Score = 451 bits (1159), Expect = 8e-125, Method: Compositional matrix adjust.
Identities = 219/304 (73%), Positives = 255/304 (84%), Gaps = 0/304 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS G++ DPKL AALRTLELTV+++LDGVLHGDHLGLIPGPGSEPG++R YQPGDDV
Sbjct 19 PPSFRSGELSDPKLTAALRTLELTVRRRLDGVLHGDHLGLIPGPGSEPGDAREYQPGDDV 78
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
R+MDW+VTARTTHPHVRQ +ADRELETWLV+D+SASLDFGTA CEKRDL VAAAAAIT L
Sbjct 79 RQMDWSVTARTTHPHVRQSVADRELETWLVIDLSASLDFGTAGCEKRDLVVAAAAAITHL 138
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
S GGNR+GA+I+ GA TR+PAR GR H MLR IAT P AP GVRGDL AI+ALRR
Sbjct 139 TSSGGNRVGAIISTGAQTTRIPARGGRIHAQAMLRKIATTPHAPDGVRGDLVGAIEALRR 198
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P+RRRG+AV+ISDFLGPI+W R LRAI+ RH+VL +EV+DPRD+ELPDVGDVVL D ESG
Sbjct 199 PQRRRGLAVVISDFLGPIDWERSLRAISGRHDVLGVEVVDPRDLELPDVGDVVLHDPESG 258
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
REF+ P LR DFAR AA HR +V + +R CGAPL+SL TDRDW+AD+VRF+++RR
Sbjct 259 RTREFTTTPQLRADFARVAAEHRVEVRQALRRCGAPLMSLHTDRDWIADVVRFISARRHS 318
Query 312 ALAG 315
AG
Sbjct 319 YGAG 322
>gi|54025449|ref|YP_119691.1| hypothetical protein nfa34790 [Nocardia farcinica IFM 10152]
gi|54016957|dbj|BAD58327.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=318
Score = 440 bits (1131), Expect = 2e-121, Method: Compositional matrix adjust.
Identities = 216/298 (73%), Positives = 261/298 (88%), Gaps = 0/298 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS G++ D +L+AAL+TLELTV+++LDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV
Sbjct 8 PPSFRAGELSDARLSAALKTLELTVRRRLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 67
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
R+MDW+VTARTTHPHVRQMIADRELETW+VVD+SASLDFGTA C+KRDLA+AAAAAIT+L
Sbjct 68 RQMDWSVTARTTHPHVRQMIADRELETWMVVDLSASLDFGTAACQKRDLAIAAAAAITYL 127
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
SGGGNR+GA++A G + R+PAR+GR H T+LR+IAT P A GVRGDL AI++LRR
Sbjct 128 TSGGGNRIGAVVATGEQLVRIPARSGRIHAQTLLRSIATTPHARDGVRGDLRGAIESLRR 187
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P+R+RG+AVIISDFLG I+W R LRAI+ARH++LA+EVLDPRD+ELPDVGDVVL D E+G
Sbjct 188 PQRKRGLAVIISDFLGEIDWQRSLRAISARHDLLAVEVLDPRDLELPDVGDVVLHDPETG 247
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
REFS+ P LR DFA AA HR VA+ +R CGAP+L+L+TDRDW+AD+VRFV++RR
Sbjct 248 RTREFSVTPTLRADFAAAAQRHRDQVAQALRSCGAPVLTLQTDRDWIADVVRFVSTRR 305
>gi|111024161|ref|YP_707133.1| hypothetical protein RHA1_ro07211 [Rhodococcus jostii RHA1]
gi|110823691|gb|ABG98975.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=324
Score = 439 bits (1129), Expect = 3e-121, Method: Compositional matrix adjust.
Identities = 215/298 (73%), Positives = 256/298 (86%), Gaps = 0/298 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS G++ DPKL+AALRTLELTV+++LDGVLHGDHLGLIPGPGSEPG++R YQPGDDV
Sbjct 14 PPSFRSGELRDPKLSAALRTLELTVRRRLDGVLHGDHLGLIPGPGSEPGDAREYQPGDDV 73
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
R+MDW+VTARTTHPHVRQ +ADRELETWLVVD+S+SLDFGTA CEKRDL VAAAAA+T L
Sbjct 74 RQMDWSVTARTTHPHVRQSVADRELETWLVVDLSSSLDFGTAGCEKRDLVVAAAAAVTHL 133
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
SGGGNR+GA+++ GA TR+PAR GR H MLR IAT P AP GVRGDL A+++LRR
Sbjct 134 TSGGGNRIGAIVSTGAQTTRIPARGGRIHAQAMLRQIATTPHAPDGVRGDLQGAVESLRR 193
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P+RRRG+AV+ISDFLGPI+W R LRAI+ RH+VL +EVLDPRD+ELPD+GDVVL D ESG
Sbjct 194 PQRRRGLAVVISDFLGPIDWERSLRAISGRHDVLGVEVLDPRDLELPDIGDVVLHDPESG 253
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
REF+ P LR DFARAA HR V +++R CGAPLLSLRTDRDW++D+VRFV++R+
Sbjct 254 RTREFTTTPQLRADFARAADEHRLQVEQSLRRCGAPLLSLRTDRDWISDVVRFVSARK 311
>gi|333919587|ref|YP_004493168.1| hypothetical protein AS9A_1919 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481808|gb|AEF40368.1| hypothetical protein AS9A_1919 [Amycolicicoccus subflavus DQS3-9A1]
Length=334
Score = 434 bits (1117), Expect = 6e-120, Method: Compositional matrix adjust.
Identities = 213/298 (72%), Positives = 250/298 (84%), Gaps = 0/298 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS G++ DP+LAAALRTLELTV++KLDGVLHGDHLGLIPGPGSEPG++R+YQPGDDV
Sbjct 22 PPSFASGELRDPQLAAALRTLELTVRRKLDGVLHGDHLGLIPGPGSEPGDARMYQPGDDV 81
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
R+MDW+VTARTTHPHVRQ IADRELETW+V+D+SASLDFGTA +KRDLAVAA AAIT+L
Sbjct 82 RQMDWSVTARTTHPHVRQTIADRELETWIVLDLSASLDFGTADSDKRDLAVAACAAITYL 141
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
GGGNR+GA++A GA RVPA +G H+ T+LR IAT P+A +G RG L AIDALRR
Sbjct 142 TGGGGNRIGAIVATGADTIRVPAGSGMHHRQTLLRKIATTPRAASGTRGSLDAAIDALRR 201
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P+RRRG+AVIISDFLGPI+W R LRA++ RH++LA+EVLDPRD+ELPDVG V LQD ESG
Sbjct 202 PQRRRGLAVIISDFLGPIDWERSLRAVSGRHDLLAVEVLDPRDLELPDVGLVTLQDPESG 261
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
+E LR+DFA AA HR DVA +R CG+PLLSLRTD DW+ADIVRFV SRR
Sbjct 262 ATKEVKTTRKLRNDFAAAAEQHRDDVAAVLRRCGSPLLSLRTDHDWIADIVRFVVSRR 319
>gi|226366408|ref|YP_002784191.1| hypothetical protein ROP_69990 [Rhodococcus opacus B4]
gi|226244898|dbj|BAH55246.1| hypothetical protein [Rhodococcus opacus B4]
Length=324
Score = 434 bits (1117), Expect = 7e-120, Method: Compositional matrix adjust.
Identities = 213/298 (72%), Positives = 255/298 (86%), Gaps = 0/298 (0%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS G++ DPKL+AALRTLELTV+++LDGVLHGDH+GLIPGPGSEPG++R YQPGDDV
Sbjct 14 PPSFRSGELRDPKLSAALRTLELTVRRRLDGVLHGDHMGLIPGPGSEPGDAREYQPGDDV 73
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
R+MDW+VTARTTHPHVRQ +ADRELETWLVVD+S+SLDFGTA CEKRDL VAAAAA+T L
Sbjct 74 RQMDWSVTARTTHPHVRQSVADRELETWLVVDLSSSLDFGTAGCEKRDLVVAAAAAVTHL 133
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
SGGGNR+GA+++ GA R+PAR GR H MLR IAT P AP GVRGDL A+++LRR
Sbjct 134 TSGGGNRIGAIVSTGAQTIRIPARGGRIHAQAMLRRIATTPHAPDGVRGDLQGAVESLRR 193
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P+RRRG+AV+ISDFLGPI+W R LRAI+ RH+VL +EVLDPRD+ELPD+GDVVL D ESG
Sbjct 194 PQRRRGLAVVISDFLGPIDWERSLRAISGRHDVLGVEVLDPRDLELPDLGDVVLHDPESG 253
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
REF+ P LR DFARAA HR V +++R CGAPLLSLRTDRDW++D+VRFV++R+
Sbjct 254 RTREFTTTPQLRADFARAADEHRLQVEQSLRRCGAPLLSLRTDRDWISDVVRFVSARK 311
>gi|317508724|ref|ZP_07966377.1| hypothetical protein HMPREF9336_02749 [Segniliparus rugosus ATCC
BAA-974]
gi|316252972|gb|EFV12389.1| hypothetical protein HMPREF9336_02749 [Segniliparus rugosus ATCC
BAA-974]
Length=310
Score = 419 bits (1078), Expect = 2e-115, Method: Compositional matrix adjust.
Identities = 204/303 (68%), Positives = 249/303 (83%), Gaps = 0/303 (0%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
PS +G + +PKLAAAL+TLELTV++KLDG LHGDHLGL+PGPGSEPG+SR+Y PGDDVR
Sbjct 6 PSFAQGTLRNPKLAAALKTLELTVRRKLDGQLHGDHLGLLPGPGSEPGDSRVYVPGDDVR 65
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
+MDW+VTARTTHPHVRQM ADRELETWLVVD+SAS+DFGT CEKRDLAVAAA+AI L
Sbjct 66 QMDWSVTARTTHPHVRQMTADRELETWLVVDLSASMDFGTTNCEKRDLAVAAASAIVHLT 125
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
+ GNR GA+IA G + RV AR+GRQH +L TIAT P++ G RG+LA AIDALRRP
Sbjct 126 TAPGNRHGAIIATGTDIIRVNARSGRQHVQNLLSTIATTPKSVEGRRGNLAQAIDALRRP 185
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
+RR+G+AV+ISDFLGP++W RPLR + RHEVL IEVLDPRD+ELP VG+VVL+DAESG
Sbjct 186 QRRKGLAVVISDFLGPVDWDRPLRGVGGRHEVLGIEVLDPRDLELPPVGEVVLRDAESGE 245
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGA 312
++ + + +LR FA AA H+ V R +RG G +LSLRTD+DW+A+IVRFVA+RRRG
Sbjct 246 IKSYRVTDSLRQRFAEAAREHQEAVHRALRGAGGGVLSLRTDKDWIAEIVRFVAARRRGL 305
Query 313 LAG 315
+ G
Sbjct 306 VNG 308
>gi|312139645|ref|YP_004006981.1| hypothetical protein REQ_22470 [Rhodococcus equi 103S]
gi|325676909|ref|ZP_08156582.1| hypothetical protein HMPREF0724_14365 [Rhodococcus equi ATCC
33707]
gi|311888984|emb|CBH48297.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325552457|gb|EGD22146.1| hypothetical protein HMPREF0724_14365 [Rhodococcus equi ATCC
33707]
Length=318
Score = 416 bits (1070), Expect = 2e-114, Method: Compositional matrix adjust.
Identities = 216/306 (71%), Positives = 253/306 (83%), Gaps = 2/306 (0%)
Query 4 SKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESR 63
++ P V PS G + DP+L AALRTLELTV+++LDGVLHGDHLGLIPGPGSEPG++R
Sbjct 2 TRKPGAV--PSFRSGSLRDPELTAALRTLELTVRRRLDGVLHGDHLGLIPGPGSEPGDAR 59
Query 64 LYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVA 123
YQPGDDVR+MDW+VTARTTHPHVRQ +ADRELETWLVVD+S+SLDFGTA CEKRDLAVA
Sbjct 60 EYQPGDDVRQMDWSVTARTTHPHVRQTVADRELETWLVVDLSSSLDFGTALCEKRDLAVA 119
Query 124 AAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLA 183
AA+A+T+L G GNR+GA++A G M R+PAR GR H MLR IAT P A GVRGDL
Sbjct 120 AASAVTYLAGGSGNRIGAVVATGDRMLRIPARGGRVHAQAMLRRIATTPHAAEGVRGDLR 179
Query 184 VAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDV 243
A+++LRRPERRRG+AV+ISDFLGPI+W R LRA++ RHE+L IEVLDPRD+ELPD GDV
Sbjct 180 GALESLRRPERRRGLAVVISDFLGPIDWERSLRALSGRHELLGIEVLDPRDLELPDAGDV 239
Query 244 VLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVR 303
VL D ESG REF+ P LR DFA+AAAAHR V T+R CGAP LSL TDRDW+AD+VR
Sbjct 240 VLFDPESGRTREFATTPRLRADFAQAAAAHRHAVESTLRRCGAPRLSLSTDRDWIADVVR 299
Query 304 FVASRR 309
FV+SRR
Sbjct 300 FVSSRR 305
>gi|296393888|ref|YP_003658772.1| hypothetical protein Srot_1478 [Segniliparus rotundus DSM 44985]
gi|296181035|gb|ADG97941.1| protein of unknown function DUF58 [Segniliparus rotundus DSM
44985]
Length=310
Score = 416 bits (1069), Expect = 3e-114, Method: Compositional matrix adjust.
Identities = 199/303 (66%), Positives = 249/303 (83%), Gaps = 0/303 (0%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
PS +G + +PKLAAAL+TLELTVK+KLDG LHGDHLGL+PGPGSEPG+SR Y PGDDVR
Sbjct 6 PSFAQGTLRNPKLAAALKTLELTVKRKLDGQLHGDHLGLLPGPGSEPGDSRTYVPGDDVR 65
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
+MDW+VTARTTHPHVRQM+ADRELETW+VVD+SAS+DFGT CEKRDLA+AAA+AI L
Sbjct 66 QMDWSVTARTTHPHVRQMVADRELETWIVVDLSASMDFGTTNCEKRDLAIAAASAIVHLT 125
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
+ GNR GA+IA G+ + RV AR+GRQH +L TIAT P++P G RG+LA AID+LRRP
Sbjct 126 TAPGNRHGAIIATGSDLVRVNARSGRQHVQNLLSTIATTPKSPDGQRGNLAAAIDSLRRP 185
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
RR+G+AV+ISDFLGP++W RPLR I ARHE+L +EVLDPRD++LP VG+VVL+DAESG
Sbjct 186 LRRKGLAVVISDFLGPVDWDRPLRGIGARHELLGVEVLDPRDLDLPAVGEVVLRDAESGQ 245
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGA 312
+ + I +LR F+ AA H++ V RT+R G +L LRTD+DW+ +IVRFV++RRRG
Sbjct 246 INSYRITESLRARFSEAAKEHQSLVHRTLRSAGGGVLPLRTDKDWIGEIVRFVSARRRGL 305
Query 313 LAG 315
+ G
Sbjct 306 VNG 308
>gi|262202332|ref|YP_003273540.1| hypothetical protein Gbro_2405 [Gordonia bronchialis DSM 43247]
gi|262085679|gb|ACY21647.1| protein of unknown function DUF58 [Gordonia bronchialis DSM 43247]
Length=312
Score = 405 bits (1040), Expect = 6e-111, Method: Compositional matrix adjust.
Identities = 204/303 (68%), Positives = 245/303 (81%), Gaps = 0/303 (0%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
P + G +++P+L AALRTLELTV++KLDGVL G+HLGLIPGPGSEPGE+R YQPGDD+R
Sbjct 8 PDLGAGLLEEPQLTAALRTLELTVRRKLDGVLQGEHLGLIPGPGSEPGEAREYQPGDDIR 67
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
RM+W+VTARTT PHVRQM+ADRELETWLVVD SASLDFGTA C KR+LAVAAAAAI L
Sbjct 68 RMEWSVTARTTQPHVRQMVADRELETWLVVDASASLDFGTANCTKRELAVAAAAAIVHLT 127
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
+ GGNR GALI G + R+PAR+GR H T+L+ IAT ++ GVRGDL I+ALRRP
Sbjct 128 TEGGNRHGALIVTGDDVVRIPARSGRAHAQTLLKAIATTRRSSPGVRGDLKGGIEALRRP 187
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
+RRRG+AV+ISDFLGPI+W R LRAI A HE+LA+EVLD RD+ELPD+G+V L DAESG
Sbjct 188 QRRRGLAVVISDFLGPIDWERSLRAIGAHHELLAVEVLDRRDLELPDIGEVTLADAESGE 247
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGA 312
+RE ++ LR DF AA AH+ V RTIR CG P+L+LRTDRDW+ D ++FVA RRRG
Sbjct 248 IREVTVTDKLRADFGAAARAHQQKVHRTIRSCGGPVLTLRTDRDWMTDTIKFVAQRRRGL 307
Query 313 LAG 315
AG
Sbjct 308 AAG 310
>gi|296139787|ref|YP_003647030.1| hypothetical protein Tpau_2079 [Tsukamurella paurometabola DSM
20162]
gi|296027921|gb|ADG78691.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=310
Score = 400 bits (1029), Expect = 1e-109, Method: Compositional matrix adjust.
Identities = 198/303 (66%), Positives = 246/303 (82%), Gaps = 1/303 (0%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
PS G++D+ ++ A+L+TLEL V+++LDGVL GDH GL+PGPGSEPGESR Y PGDDVR
Sbjct 7 PSFAGGEVDETRMKASLKTLELLVRRRLDGVLKGDHQGLLPGPGSEPGESRPYTPGDDVR 66
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
MDW+VTARTTHPHVRQMIADREL+TW+VVD+SAS+DFG+ KRDLAVAA+AA+T L
Sbjct 67 LMDWSVTARTTHPHVRQMIADRELQTWIVVDLSASMDFGSVSGTKRDLAVAASAAVTHLV 126
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
+G NR+G ++ NG+ RV R GR H+ +LRTIA P+A G RGDL A+D+LRRP
Sbjct 127 AGAANRVGCIVTNGSTTLRVQPRAGRAHRQLVLRTIAGAPRAIEGTRGDLRGALDSLRRP 186
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
E+ RG+ V+ISDFLG I+++R LR +AA+HEVLA+EVLDPRDVELPDVG++ L+DAE+G
Sbjct 187 EQPRGLIVVISDFLGDIDYVRELRGLAAKHEVLAVEVLDPRDVELPDVGEIALRDAETGA 246
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGA 312
VRE ++ P L+ FA AA HR DVART+RG GAP+L LRTDRDWLADI RFVA+RRRG
Sbjct 247 VRELTVTPELQARFADAAQKHRQDVARTLRGVGAPVLELRTDRDWLADIGRFVAARRRG- 305
Query 313 LAG 315
LAG
Sbjct 306 LAG 308
>gi|343927989|ref|ZP_08767455.1| hypothetical protein GOALK_099_01210 [Gordonia alkanivorans NBRC
16433]
gi|343762212|dbj|GAA14381.1| hypothetical protein GOALK_099_01210 [Gordonia alkanivorans NBRC
16433]
Length=312
Score = 389 bits (1000), Expect = 3e-106, Method: Compositional matrix adjust.
Identities = 200/303 (67%), Positives = 240/303 (80%), Gaps = 0/303 (0%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
PS+ G +++P+L AAL+ LELTV++KLDGVL G+HLGLIPGPGSEPGE+R YQPGDDVR
Sbjct 8 PSLGAGLLEEPQLTAALKMLELTVRRKLDGVLQGEHLGLIPGPGSEPGEARTYQPGDDVR 67
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
RM+W+VTARTT PHVRQMIADRELETWLVVD SASLDFGT C KRDLAVAAAAAI L
Sbjct 68 RMEWSVTARTTQPHVRQMIADRELETWLVVDASASLDFGTVGCTKRDLAVAAAAAIVHLT 127
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
+GGGNR GAL+ G + RVPAR GR H +L+ IAT ++ GVRGDL I+ALRRP
Sbjct 128 TGGGNRHGALVVTGDDVVRVPARAGRAHAQNLLKAIATTHRSAPGVRGDLKAGIEALRRP 187
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
+RRRG+AV+ISDFLGPI+W R LRAI A HE+L +EVLDPRD+ELP +G+V L DAESG
Sbjct 188 QRRRGLAVVISDFLGPIDWERSLRAIGAHHELLGVEVLDPRDLELPAIGEVTLADAESGE 247
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGA 312
+ + ++ L+ DFA AA AH+ V RT+R CG LSLRTDR+W+ D V+F+A RRRG
Sbjct 248 IHDVTVTEDLQRDFAAAARAHQQRVHRTLRSCGGATLSLRTDREWITDTVKFIAQRRRGL 307
Query 313 LAG 315
AG
Sbjct 308 AAG 310
>gi|326382236|ref|ZP_08203928.1| hypothetical protein SCNU_04801 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198966|gb|EGD56148.1| hypothetical protein SCNU_04801 [Gordonia neofelifaecis NRRL
B-59395]
Length=312
Score = 372 bits (956), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 198/309 (65%), Positives = 238/309 (78%), Gaps = 0/309 (0%)
Query 7 PAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQ 66
PA PS G ++DP+L AAL++LELTV++KLDGVL G+HLGLIPGPGSEPGE+R YQ
Sbjct 2 PADSGLPSFGAGTLNDPELTAALKSLELTVRRKLDGVLQGEHLGLIPGPGSEPGEAREYQ 61
Query 67 PGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAA 126
PGDD+RRM+W+VTART PHVRQM+ADRELETW VVD SASLDFG+ KRD+A+AAA+
Sbjct 62 PGDDIRRMEWSVTARTGTPHVRQMVADRELETWFVVDASASLDFGSVGRSKRDIAMAAAS 121
Query 127 AITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAI 186
AI L +GGGNR GA+I G + RVPAR GR H +L+ IAT ++ GVRGDL +
Sbjct 122 AIVHLTAGGGNRHGAIIVTGDDIVRVPARAGRAHDQALLKAIATTRRSAPGVRGDLDAGL 181
Query 187 DALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQ 246
+ALRRP RRRG+AV+ISDFLG I+W R LRAI A H++L +EVLDPRDVELP VG V L+
Sbjct 182 EALRRPLRRRGLAVVISDFLGEIDWARSLRAIGAHHDLLGVEVLDPRDVELPAVGPVTLR 241
Query 247 DAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVA 306
DAESG + + + +R D+ARAAA HRADV RT R G P+LSLR+DRDWLAD VRFV
Sbjct 242 DAESGEIVDIDVTAQVRADYARAAAEHRADVLRTFRSAGGPVLSLRSDRDWLADTVRFVG 301
Query 307 SRRRGALAG 315
RRR AG
Sbjct 302 MRRRTMAAG 310
>gi|257056240|ref|YP_003134072.1| hypothetical protein Svir_22370 [Saccharomonospora viridis DSM
43017]
gi|256586112|gb|ACU97245.1| uncharacterized conserved protein [Saccharomonospora viridis
DSM 43017]
Length=317
Score = 362 bits (930), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 191/299 (64%), Positives = 236/299 (79%), Gaps = 4/299 (1%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
P +LRG+ ++ A LRTLEL V+ +LDG+L G+HLGL+PGPGSEPGE+R YQPGDDVR
Sbjct 17 PPILRGE----RMEAGLRTLELDVRHRLDGLLQGNHLGLVPGPGSEPGEARQYQPGDDVR 72
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
R+DWAVTARTT PH+R+ +ADRELETW+V D+S SLDFGTA CEKRDL V A AAI L
Sbjct 73 RIDWAVTARTTTPHIRETVADRELETWVVADLSPSLDFGTAACEKRDLVVCAVAAIAHLT 132
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
GGGNR+GAL++NGA R+P R GR H ++R +ATMP+A G RGDLA +D LRRP
Sbjct 133 RGGGNRIGALLSNGAETVRIPPRGGRGHARELVRRVATMPRAKEGTRGDLAALVDKLRRP 192
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
RRRG+AV+ISDFLGP+ W RPLRA++ARH+++A+EVLDPRDVELP++G VVL D E+G
Sbjct 193 PRRRGLAVVISDFLGPLTWERPLRALSARHDLVAVEVLDPRDVELPEIGSVVLADPETGR 252
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
RE + LR +FA AA+AHRA+VAR IR GA L LRTD DW+AD+VRF +R+RG
Sbjct 253 QREVHVSALLRKEFAAAASAHRAEVARAIRQAGAGHLVLRTDSDWIADVVRFAVARKRG 311
>gi|331697177|ref|YP_004333416.1| hypothetical protein Psed_3373 [Pseudonocardia dioxanivorans
CB1190]
gi|326951866|gb|AEA25563.1| protein of unknown function DUF58 [Pseudonocardia dioxanivorans
CB1190]
Length=325
Score = 361 bits (927), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 183/291 (63%), Positives = 228/291 (79%), Gaps = 0/291 (0%)
Query 20 IDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVT 79
+ D +L AALRTLEL+V+ +LDG+L G+HLGL+PGPG+EPGE+R+YQPGDDVRRMDWAVT
Sbjct 28 LRDGRLEAALRTLELSVRGRLDGLLQGNHLGLVPGPGTEPGEARVYQPGDDVRRMDWAVT 87
Query 80 ARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRL 139
ARTT PH+R+ +ADRELETW+VVD+S SLD GTA CEKRDLAVAA AA+ L GGGNR+
Sbjct 88 ARTTEPHIRETVADRELETWVVVDLSPSLDMGTAACEKRDLAVAAVAAVAHLTRGGGNRI 147
Query 140 GALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMA 199
GAL+ G RVPAR G H ++R +A +P+AP G RGDLA A++ LRRP RRRG+A
Sbjct 148 GALVTTGEHTVRVPARGGVAHARGLVRRVAEVPRAPEGTRGDLAEALEQLRRPARRRGLA 207
Query 200 VIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREFSID 259
V++SDFLG W R LR ++ARH++LA+EVLDP +++LPD G VVL D E+G RE +
Sbjct 208 VVVSDFLGEPTWERALRGLSARHDLLAVEVLDPAELDLPDAGTVVLADPETGRQREVHVT 267
Query 260 PALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRR 310
P LR +FA AA AHR VA T+R CGA L+LRTD DW+ADIVRF +R+R
Sbjct 268 PLLRREFAAAAGAHRDRVATTLRRCGAARLTLRTDSDWIADIVRFALARKR 318
>gi|302527161|ref|ZP_07279503.1| hypothetical protein SSMG_03543 [Streptomyces sp. AA4]
gi|302436056|gb|EFL07872.1| hypothetical protein SSMG_03543 [Streptomyces sp. AA4]
Length=317
Score = 354 bits (909), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 189/298 (64%), Positives = 234/298 (79%), Gaps = 4/298 (1%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
P +LRG+ ++ A LRTLEL V+++LDG+L G+HLGL+PGPGSEPGE+R YQPGDDVR
Sbjct 17 PPILRGE----RMEAGLRTLELEVRRRLDGLLQGNHLGLVPGPGSEPGEARPYQPGDDVR 72
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
RMDWAVTARTT PH+R+ +ADRELETW+V DMSASLDFGTA CEKRDL V A AA+ L
Sbjct 73 RMDWAVTARTTTPHIRETVADRELETWVVADMSASLDFGTALCEKRDLVVCATAAVAHLT 132
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
GGGNR+GAL++NG +TR+PAR G H ++R +A P+AP GVRGDLA A++ LRRP
Sbjct 133 GGGGNRIGALVSNGEGITRLPARGGLPHARGLVRRLAETPRAPEGVRGDLAGALEKLRRP 192
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
RRRG+AV++SDFLGP++W RPLRA+A RHE++AIE++DPRDV+LPDVG VVL D E+G
Sbjct 193 PRRRGLAVVLSDFLGPMDWERPLRALAGRHELIAIEIIDPRDVDLPDVGTVVLADPETGR 252
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRR 310
RE LR +F AA AHR VA +R GA L LRTD DW+AD+VRFV +R+R
Sbjct 253 QREVHASALLRKEFGAAANAHRQAVAAALRRAGAAHLVLRTDSDWIADMVRFVVARKR 310
>gi|300786827|ref|YP_003767118.1| hypothetical protein AMED_4950 [Amycolatopsis mediterranei U32]
gi|299796341|gb|ADJ46716.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340528313|gb|AEK43518.1| hypothetical protein RAM_25200 [Amycolatopsis mediterranei S699]
Length=314
Score = 352 bits (904), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 188/298 (64%), Positives = 230/298 (78%), Gaps = 4/298 (1%)
Query 13 PSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVR 72
P +LRGD +L A LRTLEL V+++LDG+L G+HLGL+PGPGSEPGE+R YQPGDDVR
Sbjct 14 PPVLRGD----RLEAGLRTLELDVRRRLDGLLQGNHLGLVPGPGSEPGEARPYQPGDDVR 69
Query 73 RMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLN 132
RMDWAVTARTT PH+R+ +ADRELETW+V D+SASLDFGTA CEKRDL V A AA+ L
Sbjct 70 RMDWAVTARTTTPHIRETVADRELETWVVADLSASLDFGTALCEKRDLVVCAVAAVAHLT 129
Query 133 SGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRP 192
GGGNR+GALI+ GA TR+PAR G H ++R +A P+A G RGD A A++ALRRP
Sbjct 130 GGGGNRIGALISTGADTTRIPARGGLAHARGLVRKLAETPRAAEGTRGDFAQALEALRRP 189
Query 193 ERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
RRRG+AV+ISDFLG +W RPLRA+ RHE++AIEVLDPRD++LP+VG VVL D E+G
Sbjct 190 PRRRGLAVVISDFLGDESWERPLRALGGRHELIAIEVLDPRDIDLPEVGTVVLADPETGK 249
Query 253 VREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRR 310
RE LR +F AA AHR VA ++R GA L+LRTD DW+AD+VRFV +R+R
Sbjct 250 QREVHASALLRKEFGAAAHAHRQKVAASLRRAGAAHLTLRTDADWIADMVRFVVARKR 307
>gi|134100329|ref|YP_001105990.1| putative von Willebrand factor, type A [Saccharopolyspora erythraea
NRRL 2338]
gi|291008771|ref|ZP_06566744.1| putative von Willebrand factor, type A [Saccharopolyspora erythraea
NRRL 2338]
gi|133912952|emb|CAM03065.1| putative von Willebrand factor, type A [Saccharopolyspora erythraea
NRRL 2338]
Length=315
Score = 342 bits (877), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 184/304 (61%), Positives = 232/304 (77%), Gaps = 5/304 (1%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PPS+ D +L AAL++LELTV+ +LDG+L G+HLGL+PGPG+EPGE+R+YQPGDDV
Sbjct 15 PPSL-----DSGRLQAALKSLELTVRGRLDGLLQGNHLGLVPGPGTEPGEARIYQPGDDV 69
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
RRMDWAVTAR PH+RQ +ADRELETW+ +D+S SLDFG+A C+KR+LAVA AA+T L
Sbjct 70 RRMDWAVTARMNEPHIRQTVADRELETWVALDLSPSLDFGSAACDKRELAVAGLAAVTHL 129
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
SGGGNR+GA++ NG +TR+PAR G + +L+ + MP+A G RGDLA +++LRR
Sbjct 130 TSGGGNRIGAVVDNGERLTRMPARGGSAYARALLKKVVEMPRAEEGTRGDLARLVESLRR 189
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P RRRG+AV+ISDFLGP+ W R LR + RH +LAIEVLDPRDVELPDVG V+L D E+G
Sbjct 190 PPRRRGLAVVISDFLGPLEWQRALRGLGTRHSLLAIEVLDPRDVELPDVGTVLLSDPETG 249
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
RE P LR +FA AAAAHR +VA +R G L LRTD DW+AD+VRFV +R+RG
Sbjct 250 KQREVRTTPVLRKEFAAAAAAHREEVAAALRRAGCAHLVLRTDSDWIADVVRFVMARKRG 309
Query 312 ALAG 315
G
Sbjct 310 WSGG 313
>gi|258652509|ref|YP_003201665.1| hypothetical protein Namu_2299 [Nakamurella multipartita DSM
44233]
gi|258555734|gb|ACV78676.1| conserved hypothetical protein [Nakamurella multipartita DSM
44233]
Length=331
Score = 338 bits (868), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 182/291 (63%), Positives = 230/291 (80%), Gaps = 0/291 (0%)
Query 21 DDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTA 80
D +L AAL TLELTV+++LDG+L G+HLGL+PGPG+EPG++R Y PGDDVRRMDW+VTA
Sbjct 20 DPTRLDAALSTLELTVRRRLDGLLQGNHLGLVPGPGTEPGDARPYYPGDDVRRMDWSVTA 79
Query 81 RTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLG 140
RTT PH+RQ +ADRELETWLV D+SASLDFGT CEKRDL VAAAAA+ L GGGNR+G
Sbjct 80 RTTEPHIRQTVADRELETWLVADLSASLDFGTVGCEKRDLVVAAAAAVGHLTRGGGNRIG 139
Query 141 ALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAV 200
A++A+G+ + RVPAR GR H +LRT+A P+A G RGDLA A++ LRRP RRRG+ V
Sbjct 140 AIVASGSQLARVPARGGRPHLEYLLRTLANNPRATPGDRGDLATALEQLRRPPRRRGLVV 199
Query 201 IISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREFSIDP 260
+ISDF+GP++W RPLR ++ARH++LA+EV+DPRD+ELP VG V L D E+G +E S
Sbjct 200 VISDFIGPVDWERPLRGLSARHDLLAVEVIDPRDLELPAVGLVTLVDPETGRSKEVSTSA 259
Query 261 ALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
LR FA+A+A HRA VA +R GA L LRTD DW+AD++RF+ R+RG
Sbjct 260 GLRAAFAKASAEHRAQVAGALRRAGAAQLVLRTDGDWIADVLRFIVGRKRG 310
>gi|256376277|ref|YP_003099937.1| hypothetical protein Amir_2146 [Actinosynnema mirum DSM 43827]
gi|255920580|gb|ACU36091.1| protein of unknown function DUF58 [Actinosynnema mirum DSM 43827]
Length=334
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 187/299 (63%), Positives = 228/299 (77%), Gaps = 5/299 (1%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PP++ G ++ AALRTLEL V ++LDG+L G+HLGL+PGPGSEPGE+R YQPGDDV
Sbjct 34 PPALHGG-----RMEAALRTLELEVNRRLDGLLQGNHLGLVPGPGSEPGEARPYQPGDDV 88
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
RRMDWAVTARTT PH+R+ +ADRELETW+ VD+S SLDFGTA CEKRDL VA AA L
Sbjct 89 RRMDWAVTARTTVPHIRETVADRELETWVAVDLSPSLDFGTAACEKRDLVVAGVAAAAHL 148
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
GGGNR+GAL++ G + RVPAR G H ++R +A P+AP G RGDLA ++ LRR
Sbjct 149 TRGGGNRIGALVSTGEQVVRVPARGGLAHARGLVRKVAETPRAPEGTRGDLAQLVEQLRR 208
Query 192 PERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESG 251
P RRRG+ V+ISDFLG + W RPLRA++ARH++LAIEV+DPRDV+LPDVG VVL D E+G
Sbjct 209 PPRRRGLVVVISDFLGELEWQRPLRALSARHDLLAIEVVDPRDVDLPDVGTVVLSDPETG 268
Query 252 VVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRR 310
RE P LR +FA AAA HRA+VA +R GA L LRTD DW+AD VRFV +R+R
Sbjct 269 RQREVVASPLLRREFAAAAAEHRAEVAAGLRRAGAGHLVLRTDSDWIADTVRFVVARKR 327
>gi|291299991|ref|YP_003511269.1| hypothetical protein Snas_2494 [Stackebrandtia nassauensis DSM
44728]
gi|290569211|gb|ADD42176.1| protein of unknown function DUF58 [Stackebrandtia nassauensis
DSM 44728]
Length=325
Score = 301 bits (770), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 168/294 (58%), Positives = 202/294 (69%), Gaps = 13/294 (4%)
Query 29 LRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTARTTHPHVR 88
L L+L + KLDG+LHGD+LGL+PGPG+EPGESR Y+PGDDVRRMDW VTARTT PHVR
Sbjct 23 LNHLQLLINNKLDGLLHGDYLGLLPGPGTEPGESREYRPGDDVRRMDWPVTARTTTPHVR 82
Query 89 QMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGAA 148
IADRELETWL VD+SASLDFGTA C KRDLA+AA AA+ L GGNR+GA++ G
Sbjct 83 TTIADRELETWLAVDLSASLDFGTAKCLKRDLAIAATAAMAHLTVRGGNRIGAVMGAGGP 142
Query 149 MTRVPARTGRQHQHTMLRTIATM-PQAPAGV-----------RGDLAVAIDALRRPERRR 196
VPA G +LR +A + P+ PA R DL++ ++ L RP RRR
Sbjct 143 PRVVPAAPGHSGAQMLLRKVAGLRPERPAKPGRFRRVAAKPGRTDLSLLVERLHRPPRRR 202
Query 197 GMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREF 256
G AVIISDFL W RP+R +A RH+VLAIE++DPR++ELPDVG + LQD E+G V E
Sbjct 203 GFAVIISDFLAEDGWERPIRKLAVRHDVLAIEIVDPRELELPDVGVMELQDPETGAVMEI 262
Query 257 SI-DPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
D A R +A AA A R +A +R GA L L TD DWL DIVRFVAS+R
Sbjct 263 QTHDAAFRRQYAHAAQAQRTQIASGLRRAGAARLRLSTDSDWLRDIVRFVASQR 316
>gi|284990592|ref|YP_003409146.1| hypothetical protein Gobs_2087 [Geodermatophilus obscurus DSM 43160]
gi|284063837|gb|ADB74775.1| protein of unknown function DUF58 [Geodermatophilus obscurus DSM 43160]
Length=350
Score = 293 bits (751), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 176/310 (57%), Positives = 212/310 (69%), Gaps = 11/310 (3%)
Query 12 PPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDV 71
PP G D L+ LELTV+++LDG+L GDHLGL+PG G+E G+SR Y PGDDV
Sbjct 43 PPRFAEGPAD-----VLLQRLELTVRRRLDGLLQGDHLGLVPGSGTEAGDSRSYHPGDDV 97
Query 72 RRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFL 131
RRMDW VTART PHVR+ IADRELETW VVD+SASLDFGTA C KRDLA+A AA++ L
Sbjct 98 RRMDWPVTARTQVPHVRETIADRELETWAVVDLSASLDFGTAACTKRDLAIAGLAAVSHL 157
Query 132 NSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRR 191
GGNRLGA++ G + R PA GR +LR + P+A G RGDLA A+++LRR
Sbjct 158 TVHGGNRLGAVVTTGERVDRYPATAGRLAADRLLRAVVATPRAEGGRRGDLAAALESLRR 217
Query 192 PERRRGMAVIISDFLGP-----INWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQ 246
P RRRG+ V++SDFLG +W RPLR + ARHE+LAIEV+DPR++ELPDVG + +
Sbjct 218 PPRRRGLVVVVSDFLGSDASGFPDWERPLRGLRARHELLAIEVVDPRELELPDVGLLTVV 277
Query 247 DAESGVVREFSI-DPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFV 305
D ESG E D A R FA AAA R +A +R GA L LRTDRDWL D+VRFV
Sbjct 278 DPESGQTLEVPTGDAAFRTRFAEGAAAQRRAIAAALRRAGAGHLQLRTDRDWLMDVVRFV 337
Query 306 ASRRRGALAG 315
A RRR G
Sbjct 338 ADRRRAGSGG 347
>gi|145595545|ref|YP_001159842.1| hypothetical protein Strop_3027 [Salinispora tropica CNB-440]
gi|145304882|gb|ABP55464.1| protein of unknown function DUF58 [Salinispora tropica CNB-440]
Length=339
Score = 288 bits (737), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 175/331 (53%), Positives = 219/331 (67%), Gaps = 26/331 (7%)
Query 2 TESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGE 61
T++ P V P + R + A L L L V +KLDG+L GD++GL+PGPGSE G+
Sbjct 15 TDATGPGSVDPTARRRTE-------ATLSRLHLLVTRKLDGLLQGDYVGLLPGPGSEAGD 67
Query 62 SRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLA 121
SR Y+PGDDVRRMDW VTARTT PHVR+ +ADRELETWL VD+SASLDFGT KRD+
Sbjct 68 SREYRPGDDVRRMDWPVTARTTMPHVRRTVADRELETWLAVDLSASLDFGTGRWLKRDVV 127
Query 122 VAAAAAITFLNSGGGNRLGALIANGA---------------AMTRVPARTGRQHQHTMLR 166
VAAAAA+ L S GGNR+GA+I G+ TR+PAR+GR+ ++R
Sbjct 128 VAAAAALAHLTSRGGNRIGAVIGTGSEPAAGGRGAPAAGPGRFTRLPARSGRREVQALVR 187
Query 167 TIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFLG-PINWMRPLRAIAARHEVL 225
+A P RGDL +D L RP RRRG+AVI+SDFL P W RPLR + RH+VL
Sbjct 188 AVAGTEIRPG--RGDLGALVDLLNRPPRRRGVAVIVSDFLAPPAQWARPLRKLRVRHDVL 245
Query 226 AIEVLDPRDVELPDVGDVVLQDAESGVVREFSI-DPALRDDFARAAAAHRADVARTIRGC 284
AIEVLDPR++ELPDVG + + D E+G + E DP LR +A AAA RA++A +R
Sbjct 246 AIEVLDPRELELPDVGVLPVVDPETGELHEVRTGDPQLRRRYAEAAATQRAEIAAALRAG 305
Query 285 GAPLLSLRTDRDWLADIVRFVASRRRGALAG 315
GA L LRTDRDWL D+VRFVA++R + G
Sbjct 306 GAAHLRLRTDRDWLLDMVRFVAAQRHTRIRG 336
>gi|284030498|ref|YP_003380429.1| hypothetical protein Kfla_2561 [Kribbella flavida DSM 17836]
gi|283809791|gb|ADB31630.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
Length=319
Score = 286 bits (731), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 153/302 (51%), Positives = 206/302 (69%), Gaps = 12/302 (3%)
Query 28 ALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTARTTHPHV 87
ALR LELTV ++L+G LHG+HLGL+PGPG+E E+R YQ GDDVRRMDWAVTARTT PHV
Sbjct 11 ALRRLELTVVRRLEGYLHGEHLGLLPGPGTELAEAREYQVGDDVRRMDWAVTARTTMPHV 70
Query 88 RQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGA 147
R +IADRELETW +VD+SAS+DFGT+ EKR+LAVAA A + FL G+R G L+ +
Sbjct 71 RDLIADRELETWALVDLSASMDFGTSQLEKRELAVAAVATVGFLTHRLGDRFGGLMLRDS 130
Query 148 AMTRVPARTGRQHQHTMLRT-IATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFL 206
+ R PAR+GR + +LR +A R DLA A++++ R +R+RG+ V++SDFL
Sbjct 131 TLRRWPARSGRLALYGLLRALLAEKDHGEHKARSDLAGALESMARTQRKRGLRVVVSDFL 190
Query 207 GPIN----------WMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREF 256
P + W R +R + A+H+VLA+E++DPR++ELP++G V++ D E+G VRE
Sbjct 191 TPEDGEIDARMEPSWERAMRKLTAQHQVLAVEIVDPRELELPNIGVVMIGDPETGAVREI 250
Query 257 SIDP-ALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALAG 315
+R ++A AA A R +R GA L LRTDRDW+AD VRFV + +R A
Sbjct 251 DTRKRRVRQEYAAAALAQRERTRTALRRVGAGHLVLRTDRDWVADTVRFVLAYKRVAPRL 310
Query 316 HQ 317
HQ
Sbjct 311 HQ 312
>gi|330466228|ref|YP_004403971.1| hypothetical protein VAB18032_11265 [Verrucosispora maris AB-18-032]
gi|328809199|gb|AEB43371.1| hypothetical protein VAB18032_11265 [Verrucosispora maris AB-18-032]
Length=330
Score = 285 bits (728), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 166/303 (55%), Positives = 213/303 (71%), Gaps = 17/303 (5%)
Query 29 LRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDD-VRRMDWAVTARTTHPHV 87
LR LELTV ++LDG+LHG+ GL+PGPGSEP SR Y+PG+D VRRMDW+VTARTT PHV
Sbjct 18 LRRLELTVTRRLDGLLHGERRGLLPGPGSEPAGSREYRPGEDEVRRMDWSVTARTTVPHV 77
Query 88 RQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGA 147
R++ ADREL TWL+VD S S+++GTA +KR+LAVAA AAI FL +G GNRLGA +
Sbjct 78 REVDADRELTTWLLVDASPSMEYGTAELDKRELAVAAVAAIGFLTAGAGNRLGAQVLTPY 137
Query 148 AMTRVPARTGRQHQHTMLRTIATMPQA---PAGVRG----DLAVAIDALRRPERRRGMAV 200
+ RVPAR GR H +LR + P+ P G R DL+ A+ A+ R RRG+ V
Sbjct 138 GLHRVPARGGRSHLIGLLRGLLAAPRQTGEPDGHRTADEIDLSAALAAVHRTAHRRGLVV 197
Query 201 IISDFL--------GPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGV 252
+ISDFL P +W RPLR ++ RH+VLA+EV+DPR++ELPDVG + L D E+G
Sbjct 198 VISDFLDGLPDDPRSPASWERPLRRLSVRHQVLAVEVVDPRELELPDVGLITLADPETGR 257
Query 253 VRE-FSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRG 311
RE ++ DP LR+ +A AAAA R + + +R GA L+LRTDRDW ADIVR V ++RR
Sbjct 258 RREVWTGDPGLRERYAAAAAAQRDQLRQALRRAGATHLALRTDRDWAADIVRHVHTQRRL 317
Query 312 ALA 314
ALA
Sbjct 318 ALA 320
>gi|330469088|ref|YP_004406831.1| hypothetical protein VAB18032_25665 [Verrucosispora maris AB-18-032]
gi|328812059|gb|AEB46231.1| hypothetical protein VAB18032_25665 [Verrucosispora maris AB-18-032]
Length=340
Score = 284 bits (727), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 171/312 (55%), Positives = 213/312 (69%), Gaps = 23/312 (7%)
Query 19 DIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAV 78
D D + A L L+L V +KLDG+L GD+ GL+PGPGSE GESR Y+PGDDVRRMDW V
Sbjct 22 DATDNRSGAVLSRLQLLVTRKLDGLLQGDYAGLLPGPGSEAGESREYRPGDDVRRMDWPV 81
Query 79 TARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNR 138
TARTT PHVR+ +ADRELETWL VD+SASLDFGT KRD+ +AAAAAIT L GGNR
Sbjct 82 TARTTTPHVRRTVADRELETWLAVDLSASLDFGTGRWLKRDVVIAAAAAITHLTVRGGNR 141
Query 139 LGALIANG-------------------AAMTRVPARTGRQHQHTMLRTIATMPQAPAGVR 179
+GA++ +G + R+PAR+GR+ MLR IA P R
Sbjct 142 IGAVVGSGDEVPAPRRGRRSAPAPVGPGRLVRMPARSGRKEAQGMLRAIAATESRPG--R 199
Query 180 GDLAVAIDALRRPERRRGMAVIISDFLG-PINWMRPLRAIAARHEVLAIEVLDPRDVELP 238
DL +D L RP RRRG+AV+ISDFL P W RPLR + RH+VLAIEV+DPR++ELP
Sbjct 200 SDLGALVDMLNRPPRRRGVAVVISDFLAPPTQWARPLRKLRVRHDVLAIEVVDPRELELP 259
Query 239 DVGDVVLQDAESGVVREF-SIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDW 297
DVG + + D E+G + E + DP+LR +A AAAA RA+++ +R GA L LRTDRDW
Sbjct 260 DVGVLPVVDPETGQLHEVQTADPSLRHRYAAAAAAQRAEISAAMRAAGAAHLRLRTDRDW 319
Query 298 LADIVRFVASRR 309
L D+VRFVA++R
Sbjct 320 LLDMVRFVAAQR 331
>gi|319949308|ref|ZP_08023385.1| hypothetical protein ES5_07786 [Dietzia cinnamea P4]
gi|319437028|gb|EFV92071.1| hypothetical protein ES5_07786 [Dietzia cinnamea P4]
Length=327
Score = 284 bits (726), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 154/299 (52%), Positives = 195/299 (66%), Gaps = 5/299 (1%)
Query 17 RGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDW 76
R D D AAL LEL V ++LDGVL+GDH GL+PGPG+EPGE+R Y+PGDDVR MDW
Sbjct 9 RDDPVDASSRAALTQLELLVTRRLDGVLNGDHRGLLPGPGTEPGEARAYEPGDDVRTMDW 68
Query 77 AVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGG 136
+VTARTT PH+RQ IADRELETWLVVD++ SLD KR LA AA A + FL++GGG
Sbjct 69 SVTARTTVPHIRQTIADRELETWLVVDLTPSLDVQGRHGVKRRLAEAAVATVGFLSAGGG 128
Query 137 NRLGALIANGAAMTRVPARTGRQHQHTMLRTI--ATMPQAPAGVRGDLAVAIDALRRPER 194
+R+G ++ +PA GR H +L + A +P G D + A+R R
Sbjct 129 SRVGMVLTGDGRPRVLPATGGRDHVRRLLEEVSRAATAVSPGGALDD---CLHAVRNAAR 185
Query 195 RRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVR 254
R G+ V+ISDFL ++W R LR + RHE LA+ V DP D+ LP VG +LQD +G V
Sbjct 186 RHGLVVVISDFLSEVDWERSLRVLGTRHEFLAVHVADPLDIALPVVGAALLQDPATGEVL 245
Query 255 EFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGAL 313
E +D AL D+ RAA HR V +R CGAP+L+LRTDRDW+ D++ FV RRRG L
Sbjct 246 ELDVDDALAADYRRAAGEHRERVHSALRRCGAPVLALRTDRDWIRDVIDFVGIRRRGGL 304
>gi|288919020|ref|ZP_06413361.1| conserved hypothetical protein [Frankia sp. EUN1f]
gi|288349560|gb|EFC83796.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=335
Score = 281 bits (720), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 167/325 (52%), Positives = 215/325 (67%), Gaps = 20/325 (6%)
Query 2 TESKAPAVV-HPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPG 60
T + P V+ PP+ P + LR LEL V ++LDG+L GDHLGL+PG G+E
Sbjct 7 TGATTPQVISQPPAT------SPSVERTLRGLELAVNRRLDGMLLGDHLGLLPGQGTEKA 60
Query 61 ESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDL 120
ESR Y GDDVRRMDWAVTARTT PHV ++ADRELETW +VD++AS +FGT KR+L
Sbjct 61 ESREYHVGDDVRRMDWAVTARTTVPHVHDLVADRELETWALVDLTASQEFGTTSIRKREL 120
Query 121 AVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQHQHTMLRTIATMPQAP----- 175
A+AA AAI FL + GNR+GAL +PAR GRQ T+LR + ++P+A
Sbjct 121 AIAATAAIGFLTARTGNRMGALALTPTGPQLIPARPGRQGLRTLLRALLSVPEAAHDRPV 180
Query 176 ----AGVRGDLAVAIDALRRPERRRGMAVIISDFLGP-INWMRPLRAIAARHEVLAIEVL 230
DLA A+ A+ RP RRRG+AV+ISDFL + W RP+RA+AARH++LA+EVL
Sbjct 181 RRPDHSAATDLAAAVAAMDRPRRRRGLAVVISDFLSTDLGWERPMRALAARHQLLAVEVL 240
Query 231 DPRDVELPDVGDVVLQDAESGVVREFSIDP-ALRDDFARAAAAHRADVARTIRGCGAPLL 289
DP ++ LP VG + + DAE+G V E +R+ ++RAAA HRA VA T+R GA L
Sbjct 241 DPAELALPSVGLLSVVDAETGAVLEVPTSSRRVRERYSRAAAEHRAQVALTLRRVGAGHL 300
Query 290 SLRTDRDWLADIVRFVASRR--RGA 312
LRTD DWL DIVR+V++ R RGA
Sbjct 301 VLRTDSDWLVDIVRYVSAARVTRGA 325
>gi|238060067|ref|ZP_04604776.1| hypothetical protein MCAG_01033 [Micromonospora sp. ATCC 39149]
gi|237881878|gb|EEP70706.1| hypothetical protein MCAG_01033 [Micromonospora sp. ATCC 39149]
Length=336
Score = 278 bits (711), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 174/327 (54%), Positives = 214/327 (66%), Gaps = 26/327 (7%)
Query 7 PAVVHPPSMLRGDIDD---PKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESR 63
P HP R I P+ A L L+L V +KLDG+L GD+ GL+PGPGSE GESR
Sbjct 3 PPTPHPFPGTRSPISPAAAPRTEAVLSRLQLLVTRKLDGLLQGDYAGLLPGPGSEAGESR 62
Query 64 LYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVA 123
Y+PGDDVRRMDW VTARTT PHVR+ +ADRELETWL VD+SASLDFGT KRD+ VA
Sbjct 63 EYRPGDDVRRMDWPVTARTTMPHVRRTVADRELETWLAVDLSASLDFGTGRWLKRDVVVA 122
Query 124 AAAAITFLNSGGGNRLGALIANGAA-------------------MTRVPARTGRQHQHTM 164
AA A+T L GGNR+GA++ GA + R+PAR GR+ +
Sbjct 123 AAVALTHLTVRGGNRIGAVVGTGAVSRAPAGRRRGAAPPPDPGRLVRLPARGGRREAQGL 182
Query 165 LRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFLG-PINWMRPLRAIAARHE 223
LR I P R DL +DAL RP RRRG+AV+ISDFL P W RPLR + RH+
Sbjct 183 LRAIVGTEIRPG--RSDLGALVDALNRPPRRRGVAVVISDFLAPPQQWARPLRKLRVRHD 240
Query 224 VLAIEVLDPRDVELPDVGDVVLQDAESGVVREF-SIDPALRDDFARAAAAHRADVARTIR 282
VLAIEV+DPR++ELPDVG + + D E+G + E + DP LR +A AAAA RA+++ +R
Sbjct 241 VLAIEVVDPRELELPDVGVLPVVDPETGELHEVQTADPRLRRRYAEAAAAQRAEISACLR 300
Query 283 GCGAPLLSLRTDRDWLADIVRFVASRR 309
G GA L LRTD DWL D+VRFVA++R
Sbjct 301 GAGAAHLRLRTDTDWLLDMVRFVAAQR 327
>gi|315504834|ref|YP_004083721.1| hypothetical protein ML5_4060 [Micromonospora sp. L5]
gi|315411453|gb|ADU09570.1| protein of unknown function DUF58 [Micromonospora sp. L5]
Length=309
Score = 272 bits (695), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 169/288 (59%), Positives = 206/288 (72%), Gaps = 6/288 (2%)
Query 26 AAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTARTTHP 85
AAL L+L V +KLDG+L GD+ GL+PGPGSE GESR Y+PGDDVRRMDW VTARTT P
Sbjct 15 GAALARLQLMVTRKLDGLLQGDYAGLLPGPGSEAGESREYRPGDDVRRMDWPVTARTTMP 74
Query 86 HVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIAN 145
HVR+ +ADRELETWL VDMSASLDFGT KRD+AVAA AA+ L GGNR+GA++
Sbjct 75 HVRRTVADRELETWLAVDMSASLDFGTGRWLKRDVAVAAVAALAHLTVRGGNRIGAVVGT 134
Query 146 G--AAMTRVPARTGRQHQHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIIS 203
G M R+PAR+GR+ +LR +A P R DL +D L RP RRRG+AV+IS
Sbjct 135 GGPGTMLRLPARSGRKEAQGLLRAVAGAEIRPG--RSDLGALVDMLNRPPRRRGVAVVIS 192
Query 204 DFLG-PINWMRPLRAIAARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREF-SIDPA 261
DFL P W RPLR + RH+VLA+EV+DPR++ELPDVG + + D ESG + E + DP
Sbjct 193 DFLAPPQQWGRPLRKLRVRHDVLAVEVVDPRELELPDVGVLPVVDPESGELHEVQTADPG 252
Query 262 LRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRR 309
LR +A AAAA R +A +R GA L LRTDRDWL D+VRFVA++R
Sbjct 253 LRRRYAEAAAAQRGAIAAELRAAGAAHLRLRTDRDWLLDMVRFVAAQR 300
Lambda K H
0.322 0.136 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 552325845108
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40