BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2714
Length=324
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609851|ref|NP_217230.1| hypothetical protein Rv2714 [Mycoba... 653 0.0
gi|289746516|ref|ZP_06505894.1| conserved hypothetical protein [... 651 0.0
gi|260656116|pdb|2WAM|A Chain A, Crystal Structure Of Mycobacter... 649 0.0
gi|31793886|ref|NP_856379.1| hypothetical protein Mb2733 [Mycoba... 649 0.0
gi|341602627|emb|CCC65303.1| conserved hypothetical alanine and ... 647 0.0
gi|308232221|ref|ZP_07415330.2| conserved alanine and leucine ri... 613 1e-173
gi|183982012|ref|YP_001850303.1| hypothetical protein MMAR_1998 ... 591 4e-167
gi|118618678|ref|YP_907010.1| hypothetical protein MUL_3357 [Myc... 588 3e-166
gi|41408929|ref|NP_961765.1| hypothetical protein MAP2831 [Mycob... 579 3e-163
gi|118466834|ref|YP_882785.1| hypothetical protein MAV_3608 [Myc... 578 5e-163
gi|240169684|ref|ZP_04748343.1| hypothetical protein MkanA1_1025... 574 6e-162
gi|342858417|ref|ZP_08715072.1| hypothetical protein MCOL_06066 ... 574 6e-162
gi|296171817|ref|ZP_06852931.1| conserved hypothetical protein [... 572 4e-161
gi|15827483|ref|NP_301746.1| hypothetical protein ML1009 [Mycoba... 566 1e-159
gi|254776048|ref|ZP_05217564.1| hypothetical protein MaviaA2_154... 565 2e-159
gi|254821019|ref|ZP_05226020.1| hypothetical protein MintA_13877... 561 4e-158
gi|333991095|ref|YP_004523709.1| hypothetical protein JDM601_245... 538 7e-151
gi|120403436|ref|YP_953265.1| hypothetical protein Mvan_2446 [My... 516 2e-144
gi|145224532|ref|YP_001135210.1| hypothetical protein Mflv_3951 ... 512 3e-143
gi|118471616|ref|YP_887078.1| hypothetical protein MSMEG_2746 [M... 508 5e-142
gi|108799140|ref|YP_639337.1| hypothetical protein Mmcs_2173 [My... 506 3e-141
gi|169630116|ref|YP_001703765.1| hypothetical protein MAB_3033 [... 479 2e-133
gi|226366199|ref|YP_002783982.1| hypothetical protein ROP_67900 ... 437 9e-121
gi|111023763|ref|YP_706735.1| hypothetical protein RHA1_ro06805 ... 437 1e-120
gi|325672696|ref|ZP_08152392.1| hypothetical protein HMPREF0724_... 435 4e-120
gi|312139417|ref|YP_004006753.1| hypothetical protein REQ_20090 ... 433 2e-119
gi|54025753|ref|YP_119995.1| hypothetical protein nfa37830 [Noca... 432 4e-119
gi|226306284|ref|YP_002766244.1| hypothetical protein RER_27970 ... 424 6e-117
gi|296139539|ref|YP_003646782.1| hypothetical protein Tpau_1825 ... 338 5e-91
gi|333919416|ref|YP_004492997.1| hypothetical protein AS9A_1748 ... 336 2e-90
gi|326384469|ref|ZP_08206149.1| hypothetical protein SCNU_16094 ... 335 9e-90
gi|262202165|ref|YP_003273373.1| hypothetical protein Gbro_2232 ... 330 2e-88
gi|343926982|ref|ZP_08766470.1| hypothetical protein GOALK_077_0... 324 1e-86
gi|336177595|ref|YP_004582970.1| hypothetical protein FsymDg_159... 309 4e-82
gi|319949154|ref|ZP_08023243.1| hypothetical protein ES5_07072 [... 301 1e-79
gi|111221414|ref|YP_712208.1| hypothetical protein FRAAL1976 [Fr... 299 5e-79
gi|312198471|ref|YP_004018532.1| Proteasome assembly chaperone 2... 298 7e-79
gi|86739954|ref|YP_480354.1| hypothetical protein Francci3_1247 ... 292 5e-77
gi|332670110|ref|YP_004453118.1| hypothetical protein Celf_1598 ... 290 2e-76
gi|330466792|ref|YP_004404535.1| hypothetical protein VAB18032_1... 289 3e-76
gi|159037347|ref|YP_001536600.1| hypothetical protein Sare_1722 ... 289 4e-76
gi|145594281|ref|YP_001158578.1| hypothetical protein Strop_1737... 289 5e-76
gi|302866657|ref|YP_003835294.1| hypothetical protein Micau_2173... 288 8e-76
gi|158316964|ref|YP_001509472.1| hypothetical protein Franean1_5... 286 4e-75
gi|291301221|ref|YP_003512499.1| hypothetical protein Snas_3749 ... 281 9e-74
gi|238063779|ref|ZP_04608488.1| hypothetical protein MCAG_04745 ... 279 5e-73
gi|119717076|ref|YP_924041.1| hypothetical protein Noca_2852 [No... 279 5e-73
gi|302529899|ref|ZP_07282241.1| conserved hypothetical protein [... 278 6e-73
gi|317506475|ref|ZP_07964276.1| hypothetical protein HMPREF9336_... 278 7e-73
gi|257055520|ref|YP_003133352.1| ATP-grasp superfamily enzyme [S... 277 1e-72
>gi|15609851|ref|NP_217230.1| hypothetical protein Rv2714 [Mycobacterium tuberculosis H37Rv]
gi|15842252|ref|NP_337289.1| hypothetical protein MT2787 [Mycobacterium tuberculosis CDC1551]
gi|148662555|ref|YP_001284078.1| hypothetical protein MRA_2742 [Mycobacterium tuberculosis H37Ra]
23 more sequence titles
Length=324
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/324 (100%), Positives = 324/324 (100%), Gaps = 0/324 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH
Sbjct 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA
Sbjct 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP
Sbjct 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ
Sbjct 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDDPT 324
DELGAEFERFLAQQAEKKSDDDPT
Sbjct 301 DELGAEFERFLAQQAEKKSDDDPT 324
>gi|289746516|ref|ZP_06505894.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289758843|ref|ZP_06518221.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294994194|ref|ZP_06799885.1| hypothetical protein Mtub2_06688 [Mycobacterium tuberculosis
210]
7 more sequence titles
Length=324
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/324 (99%), Positives = 323/324 (99%), Gaps = 0/324 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH
Sbjct 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA
Sbjct 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP
Sbjct 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ
Sbjct 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA LAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDDPT 324
DELGAEFERFLAQQAEKKSDDDPT
Sbjct 301 DELGAEFERFLAQQAEKKSDDDPT 324
>gi|260656116|pdb|2WAM|A Chain A, Crystal Structure Of Mycobacterium Tuberculosis Unknown
Function Protein Rv2714
gi|260656117|pdb|2WAM|B Chain B, Crystal Structure Of Mycobacterium Tuberculosis Unknown
Function Protein Rv2714
gi|260656118|pdb|2WAM|C Chain C, Crystal Structure Of Mycobacterium Tuberculosis Unknown
Function Protein Rv2714
Length=351
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/322 (100%), Positives = 322/322 (100%), Gaps = 0/322 (0%)
Query 2 ARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHL 61
ARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHL
Sbjct 30 ARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHL 89
Query 62 KAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAG 121
KAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAG
Sbjct 90 KAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAG 149
Query 122 LEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPS 181
LEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPS
Sbjct 150 LEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPS 209
Query 182 ISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQL 241
ISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQL
Sbjct 210 ISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQL 269
Query 242 PLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGD 301
PLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGD
Sbjct 270 PLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGD 329
Query 302 ELGAEFERFLAQQAEKKSDDDP 323
ELGAEFERFLAQQAEKKSDDDP
Sbjct 330 ELGAEFERFLAQQAEKKSDDDP 351
>gi|31793886|ref|NP_856379.1| hypothetical protein Mb2733 [Mycobacterium bovis AF2122/97]
gi|121638589|ref|YP_978813.1| hypothetical protein BCG_2727 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224991081|ref|YP_002645770.1| hypothetical alanine and leucine rich protein [Mycobacterium
bovis BCG str. Tokyo 172]
19 more sequence titles
Length=324
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/324 (99%), Positives = 322/324 (99%), Gaps = 0/324 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH
Sbjct 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA
Sbjct 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP
Sbjct 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
ISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ
Sbjct 181 WISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA LAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDDPT 324
DELGAEFERFLAQQAEKKSDDDPT
Sbjct 301 DELGAEFERFLAQQAEKKSDDDPT 324
>gi|341602627|emb|CCC65303.1| conserved hypothetical alanine and leucine rich protein [Mycobacterium
bovis BCG str. Moreau RDJ]
Length=324
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/324 (99%), Positives = 322/324 (99%), Gaps = 0/324 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH
Sbjct 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA
Sbjct 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPM+VPHTRPITMTAHSNNRELISDFQP
Sbjct 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMSVPHTRPITMTAHSNNRELISDFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
ISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ
Sbjct 181 WISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA LAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDDPT 324
DELGAEFERFLAQQAEKKSDDDPT
Sbjct 301 DELGAEFERFLAQQAEKKSDDDPT 324
>gi|308232221|ref|ZP_07415330.2| conserved alanine and leucine rich protein [Mycobacterium tuberculosis
SUMu001]
gi|308369836|ref|ZP_07419233.2| conserved alanine and leucine rich protein [Mycobacterium tuberculosis
SUMu002]
gi|308371108|ref|ZP_07423843.2| conserved alanine and leucine rich protein [Mycobacterium tuberculosis
SUMu003]
19 more sequence titles
Length=305
Score = 613 bits (1581), Expect = 1e-173, Method: Compositional matrix adjust.
Identities = 305/305 (100%), Positives = 305/305 (100%), Gaps = 0/305 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL
Sbjct 1 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 60
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA
Sbjct 61 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 120
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM
Sbjct 121 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 180
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE
Sbjct 181 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 240
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS 319
QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS
Sbjct 241 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS 300
Query 320 DDDPT 324
DDDPT
Sbjct 301 DDDPT 305
>gi|183982012|ref|YP_001850303.1| hypothetical protein MMAR_1998 [Mycobacterium marinum M]
gi|183175338|gb|ACC40448.1| conserved protein [Mycobacterium marinum M]
Length=325
Score = 591 bits (1524), Expect = 4e-167, Method: Compositional matrix adjust.
Identities = 292/322 (91%), Positives = 304/322 (95%), Gaps = 0/322 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MA DQ EA++Y+PGQ GMYELEFPAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA H
Sbjct 1 MAHDQDPGEAQDYQPGQSGMYELEFPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAATH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LK LDTELVASFAIDELLDYRSRRP+MTFKTDHFT DDPELSLYALRDS+GTPFLLLA
Sbjct 61 LKDGLDTELVASFAIDELLDYRSRRPMMTFKTDHFTKYDDPELSLYALRDSVGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
G+EPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRP+TMTAHSNN ELI+DFQP
Sbjct 121 GMEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPVTMTAHSNNPELIADFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
ISEIQVP SASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSL
Sbjct 181 WISEIQVPASASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLD 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA L +A+A+V AKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALTDASAQVGAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDD 322
DELGAEFERFLAQQAEKK DDD
Sbjct 301 DELGAEFERFLAQQAEKKFDDD 322
>gi|118618678|ref|YP_907010.1| hypothetical protein MUL_3357 [Mycobacterium ulcerans Agy99]
gi|118570788|gb|ABL05539.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=325
Score = 588 bits (1517), Expect = 3e-166, Method: Compositional matrix adjust.
Identities = 291/322 (91%), Positives = 303/322 (95%), Gaps = 0/322 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
MA DQ EA++Y+PGQ GMYELEFPAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA H
Sbjct 1 MAHDQDPGEAQDYQPGQSGMYELEFPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAATH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LK LDTELVASFAIDELLDYRSRR +MTFKTDHFT DDPELSLYALRDS+GTPFLLLA
Sbjct 61 LKDGLDTELVASFAIDELLDYRSRRSMMTFKTDHFTKYDDPELSLYALRDSVGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
G+EPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRP+TMTAHSNN ELI+DFQP
Sbjct 121 GMEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPVTMTAHSNNPELIADFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
ISEIQVP SASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSL
Sbjct 181 WISEIQVPASASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLD 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA L +A+A+V AKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALTDASAQVGAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDD 322
DELGAEFERFLAQQAEKK DDD
Sbjct 301 DELGAEFERFLAQQAEKKFDDD 322
>gi|41408929|ref|NP_961765.1| hypothetical protein MAP2831 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397288|gb|AAS05148.1| hypothetical protein MAP_2831 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336458923|gb|EGO37879.1| PAC2 family [Mycobacterium avium subsp. paratuberculosis S397]
Length=323
Score = 579 bits (1492), Expect = 3e-163, Method: Compositional matrix adjust.
Identities = 285/315 (91%), Positives = 298/315 (95%), Gaps = 0/315 (0%)
Query 8 DEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDT 67
D +Y+PGQ GMYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA HLKAALD+
Sbjct 6 DAGDQYQPGQAGMYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAAKHLKAALDS 65
Query 68 ELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLK 127
ELVASFAIDELLDYRSRRPLMTFKTDHFTH DDPELSLYALRDS+GTPFLLLAGLEPDLK
Sbjct 66 ELVASFAIDELLDYRSRRPLMTFKTDHFTHYDDPELSLYALRDSVGTPFLLLAGLEPDLK 125
Query 128 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQV 187
WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNN ELI +FQP I+EIQV
Sbjct 126 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNPELIKNFQPWIAEIQV 185
Query 188 PGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLA 247
PGSASNLLEYRMAQHGHEVVG+TVHVPHYLTQTDYPAAAQALLEQVAKT SL+LPL L+
Sbjct 186 PGSASNLLEYRMAQHGHEVVGYTVHVPHYLTQTDYPAAAQALLEQVAKTASLELPLTALS 245
Query 248 EAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEF 307
EAA ++AKIDEQV+ASAEVAQVVAALERQYDAFIDAQENRSLLTRD DLPSGDELGAEF
Sbjct 246 EAAEVIRAKIDEQVEASAEVAQVVAALERQYDAFIDAQENRSLLTRDGDLPSGDELGAEF 305
Query 308 ERFLAQQAEKKSDDD 322
ERFLAQQAEKK DDD
Sbjct 306 ERFLAQQAEKKFDDD 320
>gi|118466834|ref|YP_882785.1| hypothetical protein MAV_3608 [Mycobacterium avium 104]
gi|118168121|gb|ABK69018.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=324
Score = 578 bits (1489), Expect = 5e-163, Method: Compositional matrix adjust.
Identities = 284/315 (91%), Positives = 298/315 (95%), Gaps = 0/315 (0%)
Query 8 DEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDT 67
D +Y+PGQ GMYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA HLKAALD+
Sbjct 6 DAGDQYQPGQAGMYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAAKHLKAALDS 65
Query 68 ELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLK 127
ELVASFAIDELLDYRSRRPLMTFKTDHFTH DDPELSLYALRDS+GTPFLLLAGLEPDLK
Sbjct 66 ELVASFAIDELLDYRSRRPLMTFKTDHFTHYDDPELSLYALRDSVGTPFLLLAGLEPDLK 125
Query 128 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQV 187
WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNN ELI +FQP I+EIQV
Sbjct 126 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNPELIKNFQPWIAEIQV 185
Query 188 PGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLA 247
PGSASNLLEYRMAQHGHEVVG+TVHVPHYLTQTDYPAAAQALLEQVAKT SL+LPL L+
Sbjct 186 PGSASNLLEYRMAQHGHEVVGYTVHVPHYLTQTDYPAAAQALLEQVAKTASLELPLTALS 245
Query 248 EAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEF 307
EAA ++AKIDEQV+ASAEVAQVVA+LERQYDAFIDAQENRSLLTRD DLPSGDELGAEF
Sbjct 246 EAAEVIRAKIDEQVEASAEVAQVVASLERQYDAFIDAQENRSLLTRDGDLPSGDELGAEF 305
Query 308 ERFLAQQAEKKSDDD 322
ERFLAQQAEKK DDD
Sbjct 306 ERFLAQQAEKKFDDD 320
>gi|240169684|ref|ZP_04748343.1| hypothetical protein MkanA1_10252 [Mycobacterium kansasii ATCC
12478]
Length=325
Score = 574 bits (1480), Expect = 6e-162, Method: Compositional matrix adjust.
Identities = 295/322 (92%), Positives = 308/322 (96%), Gaps = 0/322 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
M DQ DEA++Y+PGQPGMY+LE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAAAH
Sbjct 1 MTHDQDRDEAQDYQPGQPGMYDLELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
LK +LDTELVASFAIDELLDYRSRRPLMTFKTDHFT DDPELSLYALRDS+GTPFLLLA
Sbjct 61 LKGSLDTELVASFAIDELLDYRSRRPLMTFKTDHFTSYDDPELSLYALRDSVGTPFLLLA 120
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNN ELI+DFQP
Sbjct 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNPELIADFQP 180
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
ISEIQVP SASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQ+LLEQVA+TGSL+
Sbjct 181 WISEIQVPASASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQSLLEQVARTGSLE 240
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPLA LAEAAAE++AKIDEQVQAS EVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
Sbjct 241 LPLAALAEAAAEIRAKIDEQVQASTEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
Query 301 DELGAEFERFLAQQAEKKSDDD 322
DELGAEFERFLAQQAEKK DDD
Sbjct 301 DELGAEFERFLAQQAEKKRDDD 322
>gi|342858417|ref|ZP_08715072.1| hypothetical protein MCOL_06066 [Mycobacterium colombiense CECT
3035]
gi|342134121|gb|EGT87301.1| hypothetical protein MCOL_06066 [Mycobacterium colombiense CECT
3035]
Length=325
Score = 574 bits (1480), Expect = 6e-162, Method: Compositional matrix adjust.
Identities = 280/315 (89%), Positives = 296/315 (94%), Gaps = 0/315 (0%)
Query 8 DEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDT 67
D +Y+PGQ GMYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAI+LAAAHLKA LDT
Sbjct 8 DPGDQYQPGQAGMYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIKLAAAHLKAVLDT 67
Query 68 ELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLK 127
ELVASFAIDELLDYRSRRPLMTFKTDHFTH +DPELSLYA+RD++GTPFLLLAG+EPDLK
Sbjct 68 ELVASFAIDELLDYRSRRPLMTFKTDHFTHYEDPELSLYAMRDTVGTPFLLLAGMEPDLK 127
Query 128 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQV 187
WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRP TMTAHSNN ELI++FQP ISEIQV
Sbjct 128 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPTTMTAHSNNPELIANFQPWISEIQV 187
Query 188 PGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLA 247
PGSASNLLEYRM QHGHEVVG+TVHVPHYLTQTDYPAAAQALLEQVAKT SL+LPL L
Sbjct 188 PGSASNLLEYRMGQHGHEVVGYTVHVPHYLTQTDYPAAAQALLEQVAKTASLELPLTALT 247
Query 248 EAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEF 307
EAA ++AKIDEQV+ASAEVAQVVAALERQYDAFIDAQENRSLL RDEDLPSGDELGAEF
Sbjct 248 EAAEVIRAKIDEQVEASAEVAQVVAALERQYDAFIDAQENRSLLARDEDLPSGDELGAEF 307
Query 308 ERFLAQQAEKKSDDD 322
ERFLAQQAEKK DDD
Sbjct 308 ERFLAQQAEKKYDDD 322
>gi|296171817|ref|ZP_06852931.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295893953|gb|EFG73721.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=321
Score = 572 bits (1473), Expect = 4e-161, Method: Compositional matrix adjust.
Identities = 279/310 (90%), Positives = 295/310 (96%), Gaps = 0/310 (0%)
Query 13 YEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVAS 72
Y+PGQ GMYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLA++HLKAALDTELVAS
Sbjct 9 YQPGQAGMYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLASSHLKAALDTELVAS 68
Query 73 FAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFI 132
FAIDELLDYRSRRPLMTFKTDHFT+ D+PELSLYALRD+IGTPFLLLAG+EPDLKWERFI
Sbjct 69 FAIDELLDYRSRRPLMTFKTDHFTNYDEPELSLYALRDTIGTPFLLLAGMEPDLKWERFI 128
Query 133 TAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSAS 192
TAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNN ELI++F P ISEIQVPGSAS
Sbjct 129 TAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNPELIAEFTPWISEIQVPGSAS 188
Query 193 NLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAE 252
NLLEYRMAQHGHEVVG+TVHVPHYLTQTDYPAAA+ALLEQVAK SL+LPL L EAAA
Sbjct 189 NLLEYRMAQHGHEVVGYTVHVPHYLTQTDYPAAAEALLEQVAKIASLELPLTALTEAAAV 248
Query 253 VQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLA 312
++ KIDEQV+ASAEVAQVV ALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLA
Sbjct 249 IRTKIDEQVEASAEVAQVVTALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLA 308
Query 313 QQAEKKSDDD 322
QQAEKK DDD
Sbjct 309 QQAEKKRDDD 318
>gi|15827483|ref|NP_301746.1| hypothetical protein ML1009 [Mycobacterium leprae TN]
gi|221229960|ref|YP_002503376.1| hypothetical protein MLBr_01009 [Mycobacterium leprae Br4923]
gi|467098|gb|AAA17281.1| B2235_F1_6 [Mycobacterium leprae]
gi|13093033|emb|CAC31390.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933067|emb|CAR71104.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=326
Score = 566 bits (1460), Expect = 1e-159, Method: Compositional matrix adjust.
Identities = 287/320 (90%), Positives = 302/320 (95%), Gaps = 0/320 (0%)
Query 5 QGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAA 64
Q D+ + Y+PGQPGMY LEFPAPQL +SDGRGPVL+HALEGFSDAGHAIRLAA HLKAA
Sbjct 7 QDPDDEQHYQPGQPGMYVLEFPAPQLLASDGRGPVLIHALEGFSDAGHAIRLAATHLKAA 66
Query 65 LDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEP 124
L+TELVASFAIDELLDYRSRRPLMTFKTDHFTH DDPELSLYALRDS+GTPFLLLAG+EP
Sbjct 67 LNTELVASFAIDELLDYRSRRPLMTFKTDHFTHYDDPELSLYALRDSVGTPFLLLAGMEP 126
Query 125 DLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISE 184
DLKWERFITAVRLLAERLGVRQTI LGTVPMAVPHTRPIT+TAHSNN ELI+DF P I+E
Sbjct 127 DLKWERFITAVRLLAERLGVRQTISLGTVPMAVPHTRPITLTAHSNNGELIADFTPWITE 186
Query 185 IQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLA 244
IQVPGSASNLLEYRM QHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTG+LQLPL+
Sbjct 187 IQVPGSASNLLEYRMGQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGALQLPLS 246
Query 245 VLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELG 304
LAEAAAE++AKIDEQVQAS EVAQVVAALERQYDAFIDAQENRSLL RDEDLPSGDELG
Sbjct 247 ALAEAAAEIRAKIDEQVQASTEVAQVVAALERQYDAFIDAQENRSLLRRDEDLPSGDELG 306
Query 305 AEFERFLAQQAEKKSDDDPT 324
AEFERFLAQQAEKK DDD T
Sbjct 307 AEFERFLAQQAEKKRDDDLT 326
>gi|254776048|ref|ZP_05217564.1| hypothetical protein MaviaA2_15450 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=306
Score = 565 bits (1457), Expect = 2e-159, Method: Compositional matrix adjust.
Identities = 278/303 (92%), Positives = 290/303 (96%), Gaps = 0/303 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
MYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA HLKAALD+ELVASFAIDELL
Sbjct 1 MYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAAKHLKAALDSELVASFAIDELL 60
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRPLMTFKTDHFTH DDPELSLYALRDS+GTPFLLLAGLEPDLKWERFITAVRLLA
Sbjct 61 DYRSRRPLMTFKTDHFTHYDDPELSLYALRDSVGTPFLLLAGLEPDLKWERFITAVRLLA 120
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNN ELI +FQP I+EIQVPGSASNLLEYRM
Sbjct 121 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNPELIKNFQPWIAEIQVPGSASNLLEYRM 180
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
AQHGHEVVG+TVHVPHYLTQTDYPAAAQALLEQVAKT SL+LPL L+EAA ++AKIDE
Sbjct 181 AQHGHEVVGYTVHVPHYLTQTDYPAAAQALLEQVAKTASLELPLTALSEAAEVIRAKIDE 240
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS 319
QV+ASAEVAQVVAALERQYDAF+DAQENRSLLTRD DLPSGDELGAEFERFLAQQAEKK
Sbjct 241 QVEASAEVAQVVAALERQYDAFVDAQENRSLLTRDGDLPSGDELGAEFERFLAQQAEKKF 300
Query 320 DDD 322
DDD
Sbjct 301 DDD 303
>gi|254821019|ref|ZP_05226020.1| hypothetical protein MintA_13877 [Mycobacterium intracellulare
ATCC 13950]
Length=306
Score = 561 bits (1447), Expect = 4e-158, Method: Compositional matrix adjust.
Identities = 275/303 (91%), Positives = 289/303 (96%), Gaps = 0/303 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
MYELE PAPQLS+SDGRGPVLVHALEGFSDAGHAIRLAA+HLKA LDTELVASFAIDELL
Sbjct 1 MYELELPAPQLSTSDGRGPVLVHALEGFSDAGHAIRLAASHLKAVLDTELVASFAIDELL 60
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRPLMTFKTDHFT+ DDPELSLYALRD++GTPFLLLAG+EPDLKWERFITAVRLLA
Sbjct 61 DYRSRRPLMTFKTDHFTNYDDPELSLYALRDTVGTPFLLLAGMEPDLKWERFITAVRLLA 120
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ERL VRQTIGLGTVPMAVPHTRP TMTAHSNN ELI++FQP I+EIQVPGSASNLLEYRM
Sbjct 121 ERLNVRQTIGLGTVPMAVPHTRPTTMTAHSNNPELIANFQPWIAEIQVPGSASNLLEYRM 180
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKT SL+LPL L EAA ++AKIDE
Sbjct 181 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTASLELPLTALTEAAEVIRAKIDE 240
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS 319
QV+ASAEVAQVVAALERQYDAFIDAQENRSLL+RDEDLPSGDELGAEFERFLAQQAEKK
Sbjct 241 QVEASAEVAQVVAALERQYDAFIDAQENRSLLSRDEDLPSGDELGAEFERFLAQQAEKKF 300
Query 320 DDD 322
DDD
Sbjct 301 DDD 303
>gi|333991095|ref|YP_004523709.1| hypothetical protein JDM601_2455 [Mycobacterium sp. JDM601]
gi|333487063|gb|AEF36455.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=327
Score = 538 bits (1385), Expect = 7e-151, Method: Compositional matrix adjust.
Identities = 261/321 (82%), Positives = 289/321 (91%), Gaps = 0/321 (0%)
Query 4 DQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKA 63
D G+D R Y+ Q GMYELE PAPQL+S DG GPVL+HALEGFSDAGHAIRLAA HLK
Sbjct 6 DTGSDRDRHYQAQQGGMYELEVPAPQLTSPDGEGPVLIHALEGFSDAGHAIRLAAGHLKT 65
Query 64 ALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLE 123
ALD+ELVASFAID+LLDYRSRRP+MTFKTDHFTH +PELSLYALRDS GTPFLLLAG+E
Sbjct 66 ALDSELVASFAIDDLLDYRSRRPVMTFKTDHFTHYAEPELSLYALRDSAGTPFLLLAGME 125
Query 124 PDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSIS 183
PDLKWERFITAVRLLAERLGVR+TIGLGT+PMAVPHTRP+T+TAHSNNRELI+DF P I+
Sbjct 126 PDLKWERFITAVRLLAERLGVRRTIGLGTIPMAVPHTRPVTLTAHSNNRELIADFTPWIA 185
Query 184 EIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPL 243
E+QVPGSASNLLEYRMAQHGHEVVGFTVHVPHY++QTDYP AA+ALL Q A+TGSLQLPL
Sbjct 186 EVQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYVSQTDYPEAAEALLRQAAQTGSLQLPL 245
Query 244 AVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDEL 303
L+ AAA+++AKI+EQV+ASAEVAQVV ALERQYDAFI AQENRSLL RDE+LPS DEL
Sbjct 246 TELSRAAADIRAKINEQVEASAEVAQVVTALERQYDAFIAAQENRSLLARDEELPSADEL 305
Query 304 GAEFERFLAQQAEKKSDDDPT 324
GAEFERFLAQ+A K DD T
Sbjct 306 GAEFERFLAQEARKDRGDDGT 326
>gi|120403436|ref|YP_953265.1| hypothetical protein Mvan_2446 [Mycobacterium vanbaalenii PYR-1]
gi|119956254|gb|ABM13259.1| protein of unknown function DUF75 [Mycobacterium vanbaalenii
PYR-1]
Length=329
Score = 516 bits (1329), Expect = 2e-144, Method: Compositional matrix adjust.
Identities = 250/319 (79%), Positives = 283/319 (89%), Gaps = 3/319 (0%)
Query 8 DEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDT 67
D EY+P Q GMYELEFP PQLSS DGRGPVL+HALEGFSDAGHAIRL+A HLK LDT
Sbjct 7 DPGHEYQPEQTGMYELEFPGPQLSSPDGRGPVLIHALEGFSDAGHAIRLSAQHLKDTLDT 66
Query 68 ELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLK 127
ELVASFAIDELLDYRSRRPLMTFKTDHFTH D PEL+LYALRD+ GTPFLLLAGLEPDL+
Sbjct 67 ELVASFAIDELLDYRSRRPLMTFKTDHFTHYDQPELNLYALRDTAGTPFLLLAGLEPDLR 126
Query 128 WERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQV 187
WERFITAVRLL+ERLGVR+ IGLG++PMAVPHTRP+T+TAHSN++ELI++ QP ++E+QV
Sbjct 127 WERFITAVRLLSERLGVRRVIGLGSIPMAVPHTRPMTLTAHSNDKELIAEHQPWVNEVQV 186
Query 188 PGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLA 247
PGSASNLLE+RMAQHG+EVVGFTVHVPHYL QTDYP+AA+ LL +VA+ GSL++P L
Sbjct 187 PGSASNLLEFRMAQHGYEVVGFTVHVPHYLAQTDYPSAAETLLSEVARNGSLEIPTTKLT 246
Query 248 EAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEF 307
+AAAEV KI+EQV SAEVAQVV ALERQYDAF+ AQENRSLL RDEDLPSG+ELGAEF
Sbjct 247 QAAAEVFDKINEQVAGSAEVAQVVEALERQYDAFVAAQENRSLLARDEDLPSGEELGAEF 306
Query 308 ERFLAQQA---EKKSDDDP 323
ERFLAQQA ++K DDP
Sbjct 307 ERFLAQQAGEKKRKDGDDP 325
>gi|145224532|ref|YP_001135210.1| hypothetical protein Mflv_3951 [Mycobacterium gilvum PYR-GCK]
gi|315444863|ref|YP_004077742.1| hypothetical protein Mspyr1_32960 [Mycobacterium sp. Spyr1]
gi|145217018|gb|ABP46422.1| protein of unknown function DUF75 [Mycobacterium gilvum PYR-GCK]
gi|315263166|gb|ADT99907.1| Protein of unknown function DUF75 [Mycobacterium sp. Spyr1]
Length=330
Score = 512 bits (1319), Expect = 3e-143, Method: Compositional matrix adjust.
Identities = 248/319 (78%), Positives = 282/319 (89%), Gaps = 1/319 (0%)
Query 4 DQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKA 63
D D Y+P Q GMYELEFP PQLS+ DGRGPVL+HALEGFSDAGHAI+LAAAHLK
Sbjct 3 DDAHDSGSRYQPEQTGMYELEFPGPQLSTPDGRGPVLIHALEGFSDAGHAIKLAAAHLKN 62
Query 64 ALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLE 123
+LDTELVASFAIDELLDYRSRRPLMTFKTDHFTH D+PEL+LYAL DS+GTPFLLL+G+E
Sbjct 63 SLDTELVASFAIDELLDYRSRRPLMTFKTDHFTHYDEPELNLYALHDSVGTPFLLLSGME 122
Query 124 PDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSIS 183
PDL+WERF+TA+RLLAERLGVR+ IGLG++PMAVPHTRP+T+TAHSN++ELI++ QP +
Sbjct 123 PDLRWERFVTAIRLLAERLGVRRVIGLGSIPMAVPHTRPMTLTAHSNDKELIAEHQPWVG 182
Query 184 EIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPL 243
E+QVPGSASNLLE+RMAQHG+EVVGFTVHVPHYL QTDYP+AA+ LL +VA+T SL +P
Sbjct 183 EVQVPGSASNLLEFRMAQHGYEVVGFTVHVPHYLAQTDYPSAAETLLAEVARTASLDIPT 242
Query 244 AVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDEL 303
A L AAA V KI+EQV ASAEVAQVV ALERQYDAF+ AQENRSLL RDEDLPSG+EL
Sbjct 243 AELTTAAAVVFDKINEQVTASAEVAQVVDALERQYDAFVAAQENRSLLARDEDLPSGEEL 302
Query 304 GAEFERFLAQQA-EKKSDD 321
GAEFERFLAQQA EKK D
Sbjct 303 GAEFERFLAQQAGEKKRKD 321
>gi|118471616|ref|YP_887078.1| hypothetical protein MSMEG_2746 [Mycobacterium smegmatis str.
MC2 155]
gi|118172903|gb|ABK73799.1| conserved hypothetical alanine and leucine rich protein [Mycobacterium
smegmatis str. MC2 155]
Length=348
Score = 508 bits (1308), Expect = 5e-142, Method: Compositional matrix adjust.
Identities = 253/312 (82%), Positives = 282/312 (91%), Gaps = 0/312 (0%)
Query 11 REYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELV 70
+ Y+P Q GMYELEFPAPQLSSSDGRGPVL+HALEGFSDAGHAIRLAA HLK +LDTELV
Sbjct 9 QRYQPDQSGMYELEFPAPQLSSSDGRGPVLIHALEGFSDAGHAIRLAAEHLKKSLDTELV 68
Query 71 ASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWER 130
ASFAIDELLDYRSRRPLMTFKTDHFT ++PEL+LYAL D++GTPFLLLAGLEPDL+WER
Sbjct 69 ASFAIDELLDYRSRRPLMTFKTDHFTAYEEPELNLYALHDTVGTPFLLLAGLEPDLRWER 128
Query 131 FITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGS 190
FITAVRLLAE+LGVRQ IGLGT+PMAVPHTRP+ +TAHSNN+ELI++ P + E+QVP S
Sbjct 129 FITAVRLLAEQLGVRQVIGLGTIPMAVPHTRPVNLTAHSNNKELIAEHTPWVGEVQVPAS 188
Query 191 ASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAA 250
SNLLE+RMAQHGHEVVGFTVHVPHYL QT YP AA+ALL +VA+TGSL+LPLA L+EA
Sbjct 189 VSNLLEFRMAQHGHEVVGFTVHVPHYLAQTAYPPAAEALLAEVARTGSLELPLAALSEAG 248
Query 251 AEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERF 310
AEV KI+EQV+AS EVAQVV+ALERQYDAFI AQENRSLL RDEDLPSGDELGAEFERF
Sbjct 249 AEVYTKINEQVEASPEVAQVVSALERQYDAFIAAQENRSLLARDEDLPSGDELGAEFERF 308
Query 311 LAQQAEKKSDDD 322
LAQQA +K DD
Sbjct 309 LAQQAGEKFKDD 320
>gi|108799140|ref|YP_639337.1| hypothetical protein Mmcs_2173 [Mycobacterium sp. MCS]
gi|119868255|ref|YP_938207.1| hypothetical protein Mkms_2219 [Mycobacterium sp. KMS]
gi|126434748|ref|YP_001070439.1| hypothetical protein Mjls_2162 [Mycobacterium sp. JLS]
gi|108769559|gb|ABG08281.1| protein of unknown function DUF75 [Mycobacterium sp. MCS]
gi|119694344|gb|ABL91417.1| protein of unknown function DUF75 [Mycobacterium sp. KMS]
gi|126234548|gb|ABN97948.1| protein of unknown function DUF75 [Mycobacterium sp. JLS]
Length=334
Score = 506 bits (1302), Expect = 3e-141, Method: Compositional matrix adjust.
Identities = 239/312 (77%), Positives = 280/312 (90%), Gaps = 0/312 (0%)
Query 11 REYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELV 70
++Y+P Q GMYELEFPAPQLS++DGRGPVL+HALEGFSDAGH +RLA AHLK +LDTELV
Sbjct 10 QQYQPEQTGMYELEFPAPQLSAADGRGPVLLHALEGFSDAGHVVRLATAHLKNSLDTELV 69
Query 71 ASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWER 130
ASFAIDELLDYRSRRPLMTFKTDHF+ ++PEL+LYA+ D++GTPFLLLAG+EPDL+WER
Sbjct 70 ASFAIDELLDYRSRRPLMTFKTDHFSAYEEPELNLYAMHDTVGTPFLLLAGMEPDLRWER 129
Query 131 FITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGS 190
FITAVRLLAE+LGVRQTIGLG++PMAVPHTRP+TMTAHSNN+ELI++ P + E+QVP S
Sbjct 130 FITAVRLLAEQLGVRQTIGLGSIPMAVPHTRPVTMTAHSNNKELIAEHTPWVGEVQVPAS 189
Query 191 ASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAA 250
S+LLE+RMAQHGHEVVG+TV+VPHYL+QT YP AA++LL +VAKT +LQ+PL L EA
Sbjct 190 VSSLLEFRMAQHGHEVVGYTVYVPHYLSQTAYPPAAESLLAEVAKTAALQIPLTALGEAG 249
Query 251 AEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERF 310
AEV KI+EQV+AS EVAQVV ALERQYDAF+ AQENRSLL DEDLPSGDELGAEFERF
Sbjct 250 AEVYTKINEQVEASVEVAQVVTALERQYDAFVAAQENRSLLAHDEDLPSGDELGAEFERF 309
Query 311 LAQQAEKKSDDD 322
LAQQA +K +D
Sbjct 310 LAQQAGEKDKED 321
>gi|169630116|ref|YP_001703765.1| hypothetical protein MAB_3033 [Mycobacterium abscessus ATCC 19977]
gi|169242083|emb|CAM63111.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=330
Score = 479 bits (1234), Expect = 2e-133, Method: Compositional matrix adjust.
Identities = 232/324 (72%), Positives = 278/324 (86%), Gaps = 3/324 (0%)
Query 1 MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAH 60
M ++G E + Y+P Q GMYELEFP PQL+S+DGRGPVLVHAL+GFSD+GHA++LAAAH
Sbjct 1 MTSNEGV-EPQPYKPDQSGMYELEFPGPQLASADGRGPVLVHALQGFSDSGHAVKLAAAH 59
Query 61 LKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLA 120
L+ L++ELVASFAID+LLDYRSRRP+MTFK+DHFT PEL+LYAL+D+ GTPFLLLA
Sbjct 60 LRQTLESELVASFAIDDLLDYRSRRPVMTFKSDHFTEYATPELNLYALKDTKGTPFLLLA 119
Query 121 GLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQP 180
GLEPDLKWERF+ A+RLLAE+LGVR+TIGLG +PMAVPHTRPIT+TAH N+R+ + +
Sbjct 120 GLEPDLKWERFVNAIRLLAEQLGVRKTIGLGAIPMAVPHTRPITLTAHGNDRKTLDEHPG 179
Query 181 SISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQ 240
I E+QVPGSASNLLE+R+AQHGH+VVGF VHVPHYL QTDYP A+Q LLE+VA+TG L
Sbjct 180 WIDEVQVPGSASNLLEFRLAQHGHDVVGFAVHVPHYLAQTDYPEASQRLLEEVARTGDLD 239
Query 241 LPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG 300
LPL L+EAAA+V+ +I+EQV+ S EVAQVV ALERQYDAF+ AQENRSLL RDE+LPSG
Sbjct 240 LPLQELSEAAAKVRNQINEQVEGSEEVAQVVQALERQYDAFVAAQENRSLLARDEELPSG 299
Query 301 DELGAEFERFLAQQAEKKSDDDPT 324
DEL EFERFLA+QA K DDP+
Sbjct 300 DELAGEFERFLAEQA--KFGDDPS 321
>gi|226366199|ref|YP_002783982.1| hypothetical protein ROP_67900 [Rhodococcus opacus B4]
gi|226244689|dbj|BAH55037.1| hypothetical protein [Rhodococcus opacus B4]
Length=322
Score = 437 bits (1125), Expect = 9e-121, Method: Compositional matrix adjust.
Identities = 207/300 (69%), Positives = 250/300 (84%), Gaps = 1/300 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFPAPQLSS+DG+GPVL+H LEGFSDAGHA++LA HL+ +L++ELVASFA+D
Sbjct 4 QSKMYELEFPAPQLSSADGQGPVLIHGLEGFSDAGHAVKLATTHLRESLESELVASFAVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP MTFK DHF+ D PEL+LYAL+D+ GTPFLLLAG+EPDL+WERF TAVR
Sbjct 64 ELVDYRSRRPTMTFKADHFSDYDQPELNLYALKDTAGTPFLLLAGMEPDLRWERFTTAVR 123
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LLAE+LGVR+T+G+ +PMA+PHTRP+ +TAHS N++LI D Q E+QVPGSAS+L+E
Sbjct 124 LLAEQLGVRRTVGINAIPMAIPHTRPLGVTAHSTNKDLIKDHQRWSGELQVPGSASSLIE 183
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RMAQHGHE VGF+VHVPHYL QTDYP AA+ LLE V+ L LPLA L EAAA V+ +
Sbjct 184 LRMAQHGHESVGFSVHVPHYLAQTDYPGAAETLLENVSDVTDLDLPLAALGEAAARVREQ 243
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRS-LLTRDEDLPSGDELGAEFERFLAQQA 315
+DE + + EV VV ALERQYDA++ AQE +S LL +EDLPSGDELGAEFERFLA+QA
Sbjct 244 VDEHIAGNEEVQTVVHALERQYDAYVTAQEQQSTLLASEEDLPSGDELGAEFERFLAEQA 303
>gi|111023763|ref|YP_706735.1| hypothetical protein RHA1_ro06805 [Rhodococcus jostii RHA1]
gi|110823293|gb|ABG98577.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=322
Score = 437 bits (1123), Expect = 1e-120, Method: Compositional matrix adjust.
Identities = 207/300 (69%), Positives = 250/300 (84%), Gaps = 1/300 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFPAPQLSS+DG+GPVL+H LEGFSDAGHA++LA HL+ +L++ELVASFA+D
Sbjct 4 QSKMYELEFPAPQLSSADGQGPVLIHGLEGFSDAGHAVKLATTHLRESLESELVASFAVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP MTFK DHF+ D PEL+LYAL+D+ GTPFLLLAG+EPDL+WERF TAVR
Sbjct 64 ELVDYRSRRPTMTFKADHFSDYDQPELNLYALKDTAGTPFLLLAGMEPDLRWERFTTAVR 123
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LLAE+LGVR+T+G+ +PMA+PHTRP+ +TAHS N++LI D Q E+QVPGSAS+L+E
Sbjct 124 LLAEQLGVRRTVGINAIPMAIPHTRPLGVTAHSTNKDLIKDHQRWSGELQVPGSASSLIE 183
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RMAQHGHE VGF+VHVPHYL QTDYP AA+ LLE V+ L LPLA L EAAA V+ +
Sbjct 184 LRMAQHGHESVGFSVHVPHYLAQTDYPGAAETLLENVSDVTDLDLPLAALGEAAARVREQ 243
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRS-LLTRDEDLPSGDELGAEFERFLAQQA 315
+DE + + EV VV ALERQYDA++ AQE +S LL +EDLPSGDELGAEFERFLA+QA
Sbjct 244 VDEHIAGNEEVQTVVHALERQYDAYVTAQEQQSTLLASEEDLPSGDELGAEFERFLAEQA 303
>gi|325672696|ref|ZP_08152392.1| hypothetical protein HMPREF0724_10173 [Rhodococcus equi ATCC
33707]
gi|325556573|gb|EGD26239.1| hypothetical protein HMPREF0724_10173 [Rhodococcus equi ATCC
33707]
Length=317
Score = 435 bits (1119), Expect = 4e-120, Method: Compositional matrix adjust.
Identities = 207/307 (68%), Positives = 252/307 (83%), Gaps = 1/307 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFPAP LSS+DG+GPVL+H LEGFSDAGHA++LA HL+ +L+TELVASFA+D
Sbjct 4 QSKMYELEFPAPHLSSADGQGPVLIHGLEGFSDAGHAVKLATKHLRESLETELVASFAVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP MTFK DHF+ D P+L+LYALRD+ GTPFLLLAG+EPDL+WERF TAVR
Sbjct 64 ELIDYRSRRPTMTFKADHFSDFDAPQLNLYALRDTAGTPFLLLAGMEPDLRWERFTTAVR 123
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LLAE+LGVR+T+G+ +PMA+PHTRP+++TAHS N+ELI D E+QVPGSAS+LLE
Sbjct 124 LLAEQLGVRRTVGINAIPMAIPHTRPLSVTAHSTNKELIEDHHRWSGELQVPGSASSLLE 183
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RM+QHGHE VGF+VHVPHYL QT+YPAAA+ LLE V + L+LPL L EAAA V+ +
Sbjct 184 LRMSQHGHESVGFSVHVPHYLAQTEYPAAAETLLENVMEIADLELPLVALGEAAARVREQ 243
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRS-LLTRDEDLPSGDELGAEFERFLAQQA 315
IDE + ++ EV VV ALE QYD+++ AQE +S LL DEDLPSGDELGAEFERFLA+QA
Sbjct 244 IDEHITSNEEVQSVVKALENQYDSYVAAQEQQSTLLAGDEDLPSGDELGAEFERFLAEQA 303
Query 316 EKKSDDD 322
+ D
Sbjct 304 RMDGEGD 310
>gi|312139417|ref|YP_004006753.1| hypothetical protein REQ_20090 [Rhodococcus equi 103S]
gi|311888756|emb|CBH48068.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=317
Score = 433 bits (1113), Expect = 2e-119, Method: Compositional matrix adjust.
Identities = 206/307 (68%), Positives = 251/307 (82%), Gaps = 1/307 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFPAP LSS+DG+GPVL+H LEGFSDAGHA++LA HL+ +L+TELVASFA+D
Sbjct 4 QSKMYELEFPAPHLSSADGQGPVLIHGLEGFSDAGHAVKLATKHLRESLETELVASFAVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP M FK DHF+ D P+L+LYALRD+ GTPFLLLAG+EPDL+WERF TAVR
Sbjct 64 ELIDYRSRRPTMMFKADHFSDFDAPQLNLYALRDTAGTPFLLLAGMEPDLRWERFTTAVR 123
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LLAE+LGVR+T+G+ +PMA+PHTRP+++TAHS N+ELI D E+QVPGSAS+LLE
Sbjct 124 LLAEQLGVRRTVGINAIPMAIPHTRPLSVTAHSTNKELIEDHHRWSGELQVPGSASSLLE 183
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RM+QHGHE VGF+VHVPHYL QT+YPAAA+ LLE V + L+LPL L EAAA V+ +
Sbjct 184 LRMSQHGHESVGFSVHVPHYLAQTEYPAAAETLLENVMEIADLELPLVALGEAAARVREQ 243
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRS-LLTRDEDLPSGDELGAEFERFLAQQA 315
IDE + ++ EV VV ALE QYD+++ AQE +S LL DEDLPSGDELGAEFERFLA+QA
Sbjct 244 IDEHITSNEEVQSVVKALENQYDSYVAAQEQQSTLLAGDEDLPSGDELGAEFERFLAEQA 303
Query 316 EKKSDDD 322
+ D
Sbjct 304 RMDGEGD 310
>gi|54025753|ref|YP_119995.1| hypothetical protein nfa37830 [Nocardia farcinica IFM 10152]
gi|54017261|dbj|BAD58631.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=310
Score = 432 bits (1111), Expect = 4e-119, Method: Compositional matrix adjust.
Identities = 210/304 (70%), Positives = 251/304 (83%), Gaps = 1/304 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
MYELEFPAPQLSS+DG GPVLVH LEGF+DAGHA+RLA HL+ +L++ELVASF +DELL
Sbjct 7 MYELEFPAPQLSSADGSGPVLVHGLEGFTDAGHAVRLATTHLRESLESELVASFDVDELL 66
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRPLMTFKTDHF+ +PEL+L+ALRD+ GTPFLLLAGLEPDL+WE+F TAVRLLA
Sbjct 67 DYRSRRPLMTFKTDHFSDYAEPELNLWALRDTAGTPFLLLAGLEPDLRWEKFTTAVRLLA 126
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
E+LGVR++IGL +PMA+PHTRP+ +TAHS++R LI+D Q E+QVPGSAS+LLEYRM
Sbjct 127 EQLGVRRSIGLSAIPMAIPHTRPLGITAHSSDRSLIADHQRWPGELQVPGSASSLLEYRM 186
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
AQHGHE +GF+VHVPHYL QT YP AAQ LLE VA L+LPLA L EAAA V+ +++E
Sbjct 187 AQHGHESLGFSVHVPHYLAQTAYPEAAQTLLEHVADNAGLELPLAALGEAAARVREQVNE 246
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENR-SLLTRDEDLPSGDELGAEFERFLAQQAEKK 318
+ + EV VV ALERQYD+F+ AQE + SLL D DLPSG+ELGAEFERFLA+Q
Sbjct 247 HIAGNPEVETVVHALERQYDSFVTAQERQSSLLAADGDLPSGEELGAEFERFLAEQGGYD 306
Query 319 SDDD 322
D D
Sbjct 307 GDKD 310
>gi|226306284|ref|YP_002766244.1| hypothetical protein RER_27970 [Rhodococcus erythropolis PR4]
gi|229490863|ref|ZP_04384698.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|226185401|dbj|BAH33505.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
gi|229322253|gb|EEN88039.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=319
Score = 424 bits (1091), Expect = 6e-117, Method: Compositional matrix adjust.
Identities = 200/300 (67%), Positives = 248/300 (83%), Gaps = 1/300 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFPAPQL+++DG+GP+L+H LEG+SDAGHA++LA HL+ +L+TELVASFA+D
Sbjct 4 QSRMYELEFPAPQLAAADGQGPILIHGLEGYSDAGHAVKLATTHLRESLETELVASFAVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP MTFK DHF+ D P L+LYALRD+ GTPFLLLAG+EPDLKWERF TAVR
Sbjct 64 ELIDYRSRRPTMTFKADHFSDYDAPALNLYALRDTAGTPFLLLAGMEPDLKWERFTTAVR 123
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LL+E+LGVR+TIGL +PMA+PHTRP+ +TAHS N++LI D Q E+QVPGSAS+LLE
Sbjct 124 LLSEQLGVRRTIGLNAIPMAIPHTRPLGVTAHSTNKDLIQDHQRWSGELQVPGSASSLLE 183
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RM+QHGHE +GF+VHVPHYL QTDYP AA+ LLE V++ L+LPL L EAAA V+ +
Sbjct 184 LRMSQHGHEAMGFSVHVPHYLAQTDYPGAAETLLENVSEVSDLELPLVALGEAAARVREQ 243
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENR-SLLTRDEDLPSGDELGAEFERFLAQQA 315
++E + + EV VV ALERQYD F+ AQE + SLL + DLPSGDE+GAEFE+FLA+QA
Sbjct 244 VNEHIAGNEEVQTVVHALERQYDTFVAAQEQQSSLLAGEADLPSGDEIGAEFEKFLAEQA 303
>gi|296139539|ref|YP_003646782.1| hypothetical protein Tpau_1825 [Tsukamurella paurometabola DSM
20162]
gi|296027673|gb|ADG78443.1| protein of unknown function DUF75 [Tsukamurella paurometabola
DSM 20162]
Length=311
Score = 338 bits (868), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 168/306 (55%), Positives = 221/306 (73%), Gaps = 2/306 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q MYELEFP P +S +DG GPVLVHAL+G++DAGHA++L HL + L TELVASF +D
Sbjct 4 QSHMYELEFPGPAVSDTDGNGPVLVHALDGYADAGHALKLLREHLTSNLTTELVASFDVD 63
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
EL+DYRSRRP+MTF+ D FT ++P+L+LYA+RDS G PFLLLAG EPDL+WE F++AV
Sbjct 64 ELIDYRSRRPMMTFE-DRFTGVEEPQLNLYAVRDSAGKPFLLLAGAEPDLRWEGFVSAVA 122
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LAER GV+ +GL +PMAVPHTRP+++T H +N +L + + +++PGSA+ +LE
Sbjct 123 GLAERFGVKTVVGLHAIPMAVPHTRPVSVTGHGSNPDLRKNLRSWDGAMRIPGSAAGMLE 182
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
RMA G++ VG +VHVPHYL Q DYP A +L + T LQLP L AAE++ +
Sbjct 183 LRMADKGYDTVGLSVHVPHYLAQNDYPEAVLGMLGALRSTVDLQLPDGELPAEAAELREQ 242
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRD-EDLPSGDELGAEFERFLAQQA 315
ID QV +SAE+ QVV ALE QYD A + R LL D E +P+GD+L +EFE FLA+QA
Sbjct 243 IDAQVSSSAEITQVVEALEHQYDEATHAAQPRELLIADGEAIPTGDDLASEFEAFLAEQA 302
Query 316 EKKSDD 321
+ D+
Sbjct 303 GDEGDE 308
>gi|333919416|ref|YP_004492997.1| hypothetical protein AS9A_1748 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481637|gb|AEF40197.1| hypothetical protein AS9A_1748 [Amycolicicoccus subflavus DQS3-9A1]
Length=355
Score = 336 bits (862), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 175/305 (58%), Positives = 224/305 (74%), Gaps = 1/305 (0%)
Query 17 QPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAID 76
Q +YELEFP+P + S VLVH LEGF+DAG A+RLA HL+ +L+TELVASF++D
Sbjct 39 QARIYELEFPSPHIEVSADSTLVLVHGLEGFADAGQAVRLATDHLRQSLETELVASFSVD 98
Query 77 ELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVR 136
+L+DYRSRRP MTF +DHF+ PELSLYA +D+ G PFLLL+G+EPD KWE+F +AVR
Sbjct 99 DLVDYRSRRPPMTFTSDHFSSYQAPELSLYAAKDTNGVPFLLLSGMEPDFKWEKFTSAVR 158
Query 137 LLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLE 196
LLAE+ GV +++GL +PMA PHTRP+ + HS+N + Q +E+QVPGSAS LLE
Sbjct 159 LLAEQFGVTRSVGLSAIPMATPHTRPLGVIGHSSNPGEVPAEQRLGTEVQVPGSASALLE 218
Query 197 YRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAK 256
YRM QHG + GF+VHVPHYL Q+ YPAAA LL+ ++ L +PLA L EAAA+V +
Sbjct 219 YRMGQHGFDARGFSVHVPHYLAQSPYPAAAVTLLKHLSDVSGLSVPLAALEEAAADVTRQ 278
Query 257 IDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAE 316
++EQV+AS E VV ALE+QYD E +LL D DLPSGDELGA+FE+FLA+Q
Sbjct 279 VEEQVEASPEAVAVVRALEQQYDIGARQSEETNLLALDGDLPSGDELGAQFEQFLAEQ-N 337
Query 317 KKSDD 321
+SDD
Sbjct 338 AESDD 342
>gi|326384469|ref|ZP_08206149.1| hypothetical protein SCNU_16094 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196814|gb|EGD54008.1| hypothetical protein SCNU_16094 [Gordonia neofelifaecis NRRL
B-59395]
Length=329
Score = 335 bits (858), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 159/297 (54%), Positives = 221/297 (75%), Gaps = 1/297 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+Y+LEFPAP + S+DG GPVL+HALEG++DAGHA+ LAA HL+ AL++ELVA+F DEL+
Sbjct 11 LYDLEFPAPAVYSADGDGPVLIHALEGYADAGHAVALAATHLREALESELVATFNADELI 70
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP ++F + F + +L+++A+RD+ G PFLLL G EPDL+WE+F TA+ LA
Sbjct 71 DYRSRRPTISFSGEKFDGIEMHQLTVHAVRDNSGVPFLLLDGPEPDLRWEQFTTAISALA 130
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ER V Q +GL ++PMAVPHTRP ++TAH N+ + I D + +++P S S LLE R+
Sbjct 131 ERFNVSQVVGLNSIPMAVPHTRPASITAHGNDSDSIGDLNRWGNPMKLPASVSMLLELRL 190
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G+ VG + HVPHYL Q++YP A+ ALLE + + L LP+ L AA E++A+ID
Sbjct 191 GEAGYRTVGLSAHVPHYLAQSNYPGASAALLEAIGQASGLDLPVTALENAAEEMRAQIDG 250
Query 260 QVQASAEVAQVVAALERQYDAFIDAQ-ENRSLLTRDEDLPSGDELGAEFERFLAQQA 315
+V ++AEVA VV +LE QYDA++ A+ E SLL D+++PSGDELGAEFE+FLA+ A
Sbjct 251 EVASNAEVASVVTSLENQYDAYMRAKNEQASLLAADQEMPSGDELGAEFEKFLAEHA 307
>gi|262202165|ref|YP_003273373.1| hypothetical protein Gbro_2232 [Gordonia bronchialis DSM 43247]
gi|262085512|gb|ACY21480.1| protein of unknown function DUF75 [Gordonia bronchialis DSM 43247]
Length=342
Score = 330 bits (846), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 165/307 (54%), Positives = 220/307 (72%), Gaps = 2/307 (0%)
Query 16 GQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAI 75
G +YEL FPAPQL S + GPVL+HALEGF+DAGHA+ LAA HL+ +LD++L+A+F
Sbjct 7 GDDHLYELAFPAPQLGSGES-GPVLIHALEGFADAGHAVALAATHLRDSLDSQLLATFNS 65
Query 76 DELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAV 135
DEL+DYRSRRP +TF + FT PEL+++A+RD+ G FLLL+G EPDL+WE+F+ AV
Sbjct 66 DELMDYRSRRPTITFSGETFTEVAMPELTMHAIRDNAGRGFLLLSGSEPDLRWEQFVDAV 125
Query 136 RLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLL 195
R L++ GV +GL +PMAVPHTRP ++TAH ++ + + D S +++P SAS LL
Sbjct 126 RRLSDHFGVTDVVGLNAIPMAVPHTRPPSITAHGSDPDALGDLPRWGSAMKLPASASMLL 185
Query 196 EYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQA 255
E RM QH + G +VHVPHYL QT+YPAA+ LL V++ L LP A L AA +V+
Sbjct 186 ELRMGQHHYRAAGLSVHVPHYLAQTNYPAASARLLAAVSELTGLDLPTAALESAAEKVRG 245
Query 256 KIDEQVQASAEVAQVVAALERQYDAFIDAQENR-SLLTRDEDLPSGDELGAEFERFLAQQ 314
+ID +V + E+ VVAALE QYD+F AQ+ R SLL +E+LPSGDELGAE ERFLA+Q
Sbjct 246 QIDNEVSGNEEIESVVAALETQYDSFTQAQQERASLLAAEEELPSGDELGAELERFLAEQ 305
Query 315 AEKKSDD 321
+ +D
Sbjct 306 IRQGGED 312
>gi|343926982|ref|ZP_08766470.1| hypothetical protein GOALK_077_00060 [Gordonia alkanivorans NBRC
16433]
gi|343763040|dbj|GAA13396.1| hypothetical protein GOALK_077_00060 [Gordonia alkanivorans NBRC
16433]
Length=326
Score = 324 bits (830), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 172/307 (57%), Positives = 231/307 (76%), Gaps = 3/307 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL FPAP+++ +DG GPVL+HALEGF+DAGHA+ LAAAHL+ +L++ELVA+F+ DEL+
Sbjct 11 LYELAFPAPKVTRADGTGPVLIHALEGFADAGHAVALAAAHLRDSLESELVATFSSDELM 70
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP ++F + FT + P L+L+A+RD+ G FLLLAG EPDL+WE+F+ AVR L+
Sbjct 71 DYRSRRPTISFSGETFTEVEMPALTLHAIRDNSGKGFLLLAGAEPDLRWEQFVDAVRRLS 130
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
+RLGV IGL +PMAVPHTRP ++TAH ++ + + D S +++P SAS LLE RM
Sbjct 131 DRLGVTDVIGLNAIPMAVPHTRPPSITAHGSDPDALGDLPRWGSAMKLPASASMLLELRM 190
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+H + G +VHVPHYL QT+YPAA+ LL VA+ L LPLA L AA +V+A++D
Sbjct 191 GEHDYRASGLSVHVPHYLAQTNYPAASARLLSAVAELAGLDLPLAALESAAEKVRAQVDT 250
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENR-SLLTRDEDLPSGDELGAEFERFLAQQAEKK 318
+V+ ++E+ VVAALE QYD F A E R SLL +E LPSGDELGAE ERFLA+QA ++
Sbjct 251 EVEGNSEIESVVAALETQYDTFTQAAEERASLLAAEESLPSGDELGAELERFLAEQAAEQ 310
Query 319 S--DDDP 323
+ DD+P
Sbjct 311 TPKDDEP 317
>gi|336177595|ref|YP_004582970.1| hypothetical protein FsymDg_1591 [Frankia symbiont of Datisca
glomerata]
gi|334858575|gb|AEH09049.1| hypothetical protein FsymDg_1591 [Frankia symbiont of Datisca
glomerata]
Length=309
Score = 309 bits (791), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 159/304 (53%), Positives = 203/304 (67%), Gaps = 6/304 (1%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YE+ P++ GR PVLV AL G DAG AIRLA HL LD L+A+F +D+LL
Sbjct 7 LYEVADDLPEI----GR-PVLVEALTGVVDAGGAIRLARDHLLTTLDNRLIATFDVDQLL 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP M F DH+ H D+P L L+ + DS GTPFLLL+G EPDL+W+RFI AV +LA
Sbjct 62 DYRSRRPFMIFSEDHWEHYDEPLLGLHLVDDSAGTPFLLLSGPEPDLQWKRFIAAVGILA 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ERLGVR T+GL +PMAVPHTRP +TAH+ R+LI ++P + +Q PGSA +L+EY
Sbjct 122 ERLGVRLTVGLNAIPMAVPHTRPCGVTAHATRRDLIIGYEPWVRRVQAPGSAGHLIEYLR 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHYL+QTDYPAA ++LL ++K L LPL L A+A V++ ID
Sbjct 182 GRDGLDAMGFAAHVPHYLSQTDYPAATESLLTSLSKATGLMLPLDGLRSASAAVRSNIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSL-LTRDEDLPSGDELGAEFERFLAQQAEKK 318
Q+ E A +V ALE QYD FI + L DE+LP+ DEL A ERFLA+Q E
Sbjct 242 QLANGGEAAALVTALEEQYDTFIQGRTGSDLPAAEDEELPTADELAAALERFLAEQTEPD 301
Query 319 SDDD 322
D
Sbjct 302 GPPD 305
>gi|319949154|ref|ZP_08023243.1| hypothetical protein ES5_07072 [Dietzia cinnamea P4]
gi|319437140|gb|EFV92171.1| hypothetical protein ES5_07072 [Dietzia cinnamea P4]
Length=316
Score = 301 bits (770), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/306 (50%), Positives = 206/306 (68%), Gaps = 2/306 (0%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+Y L P P L S DGRGPVLVH LEGFSDAG AI+ + HL+ +LD++L+ F +DEL+
Sbjct 7 LYRLIEPVPDLRSEDGRGPVLVHGLEGFSDAGLAIQGVSEHLRESLDSQLIVEFDVDELV 66
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP + + D F ++P + ++A++ S GT FLLL+GLEPDLKW+ F +V LA
Sbjct 67 DYRSRRPHLKYSFDRFADYNEPTIQMHAVKASDGTSFLLLSGLEPDLKWDGFTESVIDLA 126
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
GVR +IGLG +P+ VPHTRP +AH+++ +LI F E VPG+ ++LLE RM
Sbjct 127 GSFGVRMSIGLGAMPLGVPHTRPTNSSAHASDVDLIKGFSAWPGEFSVPGNVTSLLELRM 186
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
A+HG GFTVHVP YL+QT YPAA L+ +AK L+LP A L +AA E A+++
Sbjct 187 AEHGIPSAGFTVHVPQYLSQTAYPAAVLHLVGSIAKIADLELPTAELEKAAEEFTAQVNA 246
Query 260 QVQASAEVAQVVAALERQYDAFIDAQ-ENRSLLTRDEDLPSGDELGAEFERFLAQQ-AEK 317
Q+ S E+ V +E+QYD F++ + + SL + LPSGDE+GAEFERFLAQQ +
Sbjct 247 QIAQSPEILTAVELMEKQYDEFMETRLGSDSLNPGGKPLPSGDEIGAEFERFLAQQTGDG 306
Query 318 KSDDDP 323
DDP
Sbjct 307 GQGDDP 312
>gi|111221414|ref|YP_712208.1| hypothetical protein FRAAL1976 [Frankia alni ACN14a]
gi|111148946|emb|CAJ60625.1| conserved hypothetical protein [Frankia alni ACN14a]
Length=312
Score = 299 bits (765), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 151/285 (53%), Positives = 192/285 (68%), Gaps = 2/285 (0%)
Query 34 DGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTD 93
D P+++ AL G DAG+A+ LA HL ALD +VA+F +D+LLDYRSRRP M F D
Sbjct 20 DAHRPIMLEALTGVVDAGNAVSLAGEHLLTALDHRIVATFDVDQLLDYRSRRPTMIFSED 79
Query 94 HFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTV 153
H+ DP L+LY LRD TPFLLL G EPDL+W+RF AVR L RLGVR T+GL V
Sbjct 80 HWESYTDPVLALYQLRDESDTPFLLLTGPEPDLQWKRFTAAVRGLVARLGVRLTVGLNAV 139
Query 154 PMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHV 213
PMAVPHTRP T+TAH +++EL+ ++P + +QVPGSA +LLEY + + G + +GF VHV
Sbjct 140 PMAVPHTRPATITAHGSSKELVVGYEPWLRRLQVPGSAGHLLEYELGRDGRDAMGFAVHV 199
Query 214 PHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAA 273
PHYL QT YPAA + LL V+K L LPL L AA VQ +++ Q+ E A +V A
Sbjct 200 PHYLAQTTYPAATEVLLTSVSKATGLMLPLDGLRSAAVAVQDEVNSQIAQGGEAAALVHA 259
Query 274 LERQYDAFIDAQENRSLLTRD--EDLPSGDELGAEFERFLAQQAE 316
LE QYDA+ + SL T D + LP+ DELG ERFLA+Q+E
Sbjct 260 LEEQYDAYQRGRRGPSLPTIDAEQKLPTADELGEALERFLAEQSE 304
>gi|312198471|ref|YP_004018532.1| Proteasome assembly chaperone 2 [Frankia sp. EuI1c]
gi|311229807|gb|ADP82662.1| Proteasome assembly chaperone 2 [Frankia sp. EuI1c]
Length=303
Score = 298 bits (763), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 152/298 (52%), Positives = 204/298 (69%), Gaps = 6/298 (2%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YE+ P L GR PV++ A+ G D+G+A+RLA+ HL +L+ E+VA+F ID LL
Sbjct 7 LYEVHGDLPDL----GR-PVMLEAMTGVVDSGNAVRLASEHLLTSLEHEVVATFDIDLLL 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP MTF DH+ H +DP L+LYALRD TPFLLLAG EPDL W+RF TA+R L
Sbjct 62 DYRSRRPAMTFVEDHWEHYEDPVLALYALRDRADTPFLLLAGPEPDLMWKRFSTAIRELT 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL +R +GL +PMAVPHTRP + H +EL++ ++P + ++QVPGSA +LLE+
Sbjct 122 RRLNLRLAVGLNAIPMAVPHTRPTGLIVHGTRKELVAGYEPWVRQVQVPGSAGHLLEFEF 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +G VPHYL QTD+PAA + LL V+KT L LPL L AAA V+ ++D
Sbjct 182 GKEGRDAMGLAALVPHYLNQTDFPAATEVLLTSVSKTTGLMLPLDGLQSAAATVRGEVDL 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLT-RDEDLPSGDELGAEFERFLAQQAE 316
++ E+A +V ALE QYDA+ +E+ L T + EDLP+ DELG E ERFLA+Q+E
Sbjct 242 ELAKGGEMASLVHALEEQYDAYKRGKESGGLPTVQPEDLPTADELGEELERFLAEQSE 299
>gi|86739954|ref|YP_480354.1| hypothetical protein Francci3_1247 [Frankia sp. CcI3]
gi|86566816|gb|ABD10625.1| protein of unknown function DUF75 [Frankia sp. CcI3]
Length=312
Score = 292 bits (747), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 148/289 (52%), Positives = 192/289 (67%), Gaps = 5/289 (1%)
Query 38 PVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTH 97
PV++ A+ G DAG A+ LA HL ALD L+A+F ID+LLDYRSRRP M F D +
Sbjct 24 PVMLEAMTGVVDAGSAVSLAGEHLMTALDHRLLATFDIDQLLDYRSRRPTMVFSEDRWES 83
Query 98 SDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAV 157
+DP L+LY LRD GTPFLLLAG EPDL+W+RF A+R L RLGVR T+GL +PMAV
Sbjct 84 YEDPVLALYLLRDEAGTPFLLLAGPEPDLQWKRFTVALRGLVARLGVRLTVGLNAIPMAV 143
Query 158 PHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYL 217
PHTRP+ ++AH+ ++LI ++P + +QVPGSA +LLE+ + + G + +GF HVPHYL
Sbjct 144 PHTRPLVVSAHATRKDLIVGYEPWLRRLQVPGSAGHLLEFELGREGRDAMGFAAHVPHYL 203
Query 218 TQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQ 277
QT YPAA + LL V+K L LPL L AA +Q ++D Q+ E A +V+ALE Q
Sbjct 204 AQTTYPAATEVLLTSVSKATGLLLPLDGLRSAAVAIQDEVDSQIARGGEAAALVSALEEQ 263
Query 278 YDAFIDAQENRSLLTRD--EDLPSGDELGAEFERFLAQQAEKKSDDDPT 324
YDA+ + SL D + LP+ DELG ERFLA+Q E D PT
Sbjct 264 YDAYQRGRRGPSLPAADDVQPLPTADELGDALERFLAEQTEP---DGPT 309
>gi|332670110|ref|YP_004453118.1| hypothetical protein Celf_1598 [Cellulomonas fimi ATCC 484]
gi|332339148|gb|AEE45731.1| protein of unknown function DUF75 [Cellulomonas fimi ATCC 484]
Length=312
Score = 290 bits (742), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/281 (52%), Positives = 189/281 (68%), Gaps = 0/281 (0%)
Query 34 DGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTD 93
DG GPVLVHA+ GF DAG A +L A HL L + +F +D+LLDYRSRRP+MTF +
Sbjct 24 DGAGPVLVHAVRGFVDAGSAGQLVAEHLTEELGATRLVTFDVDQLLDYRSRRPVMTFDST 83
Query 94 HFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTV 153
++ DPEL++ + D+ G PFLLL G+EPD++WER++ AVR + ER V+ T+G+ V
Sbjct 84 TWSDYADPELAVDVVEDAAGVPFLLLHGVEPDVQWERYVAAVRQIVERFDVQLTVGVHGV 143
Query 154 PMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHV 213
PM +PHTRP+++TAH+ EL++D +QVP SAS LLE R+ Q GH+ +GF VHV
Sbjct 144 PMGIPHTRPVSVTAHATRPELVADQASWFGRVQVPASASALLELRLGQSGHDAMGFAVHV 203
Query 214 PHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAA 273
PHYL Q+ YP A+ A L + + L L L EAA E + +I+ QV S EVA VV A
Sbjct 204 PHYLAQSAYPRASVAALHGIERATGLDLRAGALTEAAQEAEREIERQVAGSEEVATVVRA 263
Query 274 LERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQ 314
LE QYDAF + SLL DLP+ DELGAEFERFLA+Q
Sbjct 264 LEEQYDAFARSIGRTSLLASSTDLPTADELGAEFERFLAEQ 304
>gi|330466792|ref|YP_004404535.1| hypothetical protein VAB18032_14115 [Verrucosispora maris AB-18-032]
gi|328809763|gb|AEB43935.1| hypothetical protein VAB18032_14115 [Verrucosispora maris AB-18-032]
Length=303
Score = 289 bits (740), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 151/302 (50%), Positives = 198/302 (66%), Gaps = 6/302 (1%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL P L PVL+ AL GF DAG+A RLA L +L++ +A F +D+L
Sbjct 7 LYELTDDLPDLGQ-----PVLIQALTGFVDAGNASRLAREQLLTSLESRPIARFDLDQLF 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP+MTF DH+ D PEL L+ L D TPFLLL G EPDL+WERF+ AV LA
Sbjct 62 DYRSRRPVMTFVEDHWESYDTPELELHLLHDDDETPFLLLTGPEPDLQWERFVAAVAGLA 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL VR T+GL +PMAVPHTRP +TAH+ RELI ++P + +QVPGS +LLE+R+
Sbjct 122 TRLDVRLTVGLNAIPMAVPHTRPAGVTAHATRRELIVGYEPWLQRVQVPGSVGHLLEFRL 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHY+ Q +YPAAA+ LL V+++ L LP L AA V+ +ID
Sbjct 182 GEAGRDALGFAAHVPHYVAQAEYPAAAEVLLASVSRSTGLLLPRDGLRSAAEAVRVEIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDED-LPSGDELGAEFERFLAQQAEKK 318
QV S E A +V ALE QYDA+ +E ++LL + LP+ +ELGAE ERFLA+Q
Sbjct 242 QVAQSEEAATLVQALEEQYDAYARGREGKNLLAAENGPLPTAEELGAELERFLAEQTRPN 301
Query 319 SD 320
++
Sbjct 302 NE 303
>gi|159037347|ref|YP_001536600.1| hypothetical protein Sare_1722 [Salinispora arenicola CNS-205]
gi|157916182|gb|ABV97609.1| protein of unknown function DUF75 [Salinispora arenicola CNS-205]
Length=306
Score = 289 bits (740), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 154/305 (51%), Positives = 196/305 (65%), Gaps = 8/305 (2%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL P L PVL+ AL GF DAG+A RLA L +LD VA F +D+L
Sbjct 7 LYELADDLPDLGQ-----PVLIQALSGFVDAGNATRLAREQLLTSLDARPVARFDLDQLF 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP+MTF DH+ D P L L+ LRD TPFLLL G EPDL+WERF+ AV LA
Sbjct 62 DYRSRRPVMTFVEDHWESYDAPALELHLLRDDADTPFLLLTGPEPDLQWERFVAAVAGLA 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL VR T+GL +PMAVPHTR +TAH+ REL + ++P + +QVPGS LLEYR+
Sbjct 122 TRLDVRLTVGLNAIPMAVPHTRRTGVTAHATRRELTAGYEPWLQRVQVPGSVGYLLEYRL 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHY+ QT+YPAAA+ LL V+++ L LP L A V+ +ID
Sbjct 182 GEQGRDALGFAAHVPHYVAQTEYPAAAEVLLSSVSRSTGLLLPCDELRAATEAVRTEIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLL-TRDEDLPSGDELGAEFERFLAQQAEKK 318
QV + + A +V ALE QYDAF + +LL T LP+ DELGAE ERFLA+Q +
Sbjct 242 QVAQTEDAAALVQALEEQYDAFTRGRGQPNLLNTGAGSLPTADELGAELERFLAEQ--TR 299
Query 319 SDDDP 323
+D+P
Sbjct 300 PNDNP 304
>gi|145594281|ref|YP_001158578.1| hypothetical protein Strop_1737 [Salinispora tropica CNB-440]
gi|145303618|gb|ABP54200.1| protein of unknown function DUF75 [Salinispora tropica CNB-440]
Length=306
Score = 289 bits (739), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 154/305 (51%), Positives = 198/305 (65%), Gaps = 8/305 (2%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL P L PVL+ AL GF DAG+A RLA L +LD VA F +D+L
Sbjct 7 LYELTDDLPDLGQ-----PVLIQALSGFVDAGNATRLAREQLLTSLDARPVARFDLDQLF 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP+MTF DH+ D P L L+ LRD TPFLLL G EPDL+WERF+ AV L+
Sbjct 62 DYRSRRPVMTFVEDHWESYDAPALELHLLRDDADTPFLLLTGPEPDLQWERFVAAVAGLS 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL VR T+GL +PMAVPHTR +TAH+ REL + ++P + +QVPGS +LLEYR+
Sbjct 122 ARLDVRLTVGLNAIPMAVPHTRRTGVTAHATRRELTAGYEPWLQRVQVPGSIGHLLEYRL 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHY+ QT+YPAAA+ LL V+++ L LP L A V+ +ID
Sbjct 182 GEQGRDALGFAAHVPHYVAQTEYPAAAEVLLASVSRSTGLLLPSDGLRAATEAVRTEIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLL-TRDEDLPSGDELGAEFERFLAQQAEKK 318
QV + + A +V ALE QYDAF + +LL T E LP+ DELGAE ERFLA+Q +
Sbjct 242 QVAQTEDAAALVQALEEQYDAFTRGRGQPNLLSTGTEALPTADELGAELERFLAEQ--TR 299
Query 319 SDDDP 323
+D+P
Sbjct 300 PNDNP 304
>gi|302866657|ref|YP_003835294.1| hypothetical protein Micau_2173 [Micromonospora aurantiaca ATCC
27029]
gi|315503071|ref|YP_004081958.1| hypothetical protein ML5_2285 [Micromonospora sp. L5]
gi|302569516|gb|ADL45718.1| hypothetical protein Micau_2173 [Micromonospora aurantiaca ATCC
27029]
gi|315409690|gb|ADU07807.1| hypothetical protein ML5_2285 [Micromonospora sp. L5]
Length=306
Score = 288 bits (737), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 148/296 (50%), Positives = 196/296 (67%), Gaps = 6/296 (2%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL P+L PVL+ AL GF DAG+A RLA L +LD ++A F +D++
Sbjct 7 LYELTDELPELGQ-----PVLIQALTGFVDAGNATRLAREQLLTSLDARVIARFDVDQIF 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP+MTF DH+ D P L L+ L D TPFLLL G EPDL+WERF+ AV L+
Sbjct 62 DYRSRRPVMTFVEDHWESYDAPALELHLLHDDDETPFLLLTGPEPDLQWERFVAAVAGLS 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL VR T+GL ++PMAVPHTRP +TAH+ +ELI+ +P + ++QVP +LLEYR+
Sbjct 122 ARLDVRLTVGLNSIPMAVPHTRPSGVTAHATRKELIAGHEPWLQKVQVPAGVGHLLEYRL 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHY+ Q +YPAAA+ALL V+++ L LP+ L AA V+ +ID
Sbjct 182 GEQGRDALGFAAHVPHYVAQAEYPAAAEALLSAVSRSTGLLLPVEALRTAAEAVRVEIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDED-LPSGDELGAEFERFLAQQ 314
QV + E A +V ALE QYD F + +SLL + LP+ DELGAE ERFLA+Q
Sbjct 242 QVTQTEEAATLVQALEEQYDTFARGRGEKSLLAGETGPLPTADELGAELERFLAEQ 297
>gi|158316964|ref|YP_001509472.1| hypothetical protein Franean1_5208 [Frankia sp. EAN1pec]
gi|158112369|gb|ABW14566.1| protein of unknown function DUF75 [Frankia sp. EAN1pec]
Length=307
Score = 286 bits (731), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 142/281 (51%), Positives = 186/281 (67%), Gaps = 2/281 (0%)
Query 38 PVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTH 97
PVL+ A+ G DAG A+ LA+ HL AL E + +F +D+L+DYRSRRP M F DH+
Sbjct 20 PVLIEAMTGVVDAGGAVGLASEHLTTALQHERIVTFDVDQLMDYRSRRPPMVFYEDHWES 79
Query 98 SDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAV 157
DDP L++ L D GTPFLLL G EPDL W+RF AV+ + LGVR ++GL +PMAV
Sbjct 80 YDDPVLAIELLHDEAGTPFLLLCGPEPDLHWKRFTKAVQAVMAELGVRMSVGLNAIPMAV 139
Query 158 PHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYL 217
PHTRP +TAH+ +EL+ ++P + + VPGSA +LLEY + + G + +GF HVPHYL
Sbjct 140 PHTRPCGVTAHATRKELLVGYEPWVRRLSVPGSAGHLLEYEIGRSGADAMGFAAHVPHYL 199
Query 218 TQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQ 277
Q YPAA +ALL V+K+ L LPL L AA EV+ ++D Q+ E A VV A+E Q
Sbjct 200 AQATYPAATEALLSSVSKSTGLLLPLDGLRSAALEVRGEVDSQIARGGEAADVVKAIEEQ 259
Query 278 YDAFIDAQENRSLLTRD--EDLPSGDELGAEFERFLAQQAE 316
YDAF +E L D E LP+G+ELGA ERFLA+Q+E
Sbjct 260 YDAFHRGREGEHLPVVDDSEPLPTGEELGAALERFLAEQSE 300
>gi|291301221|ref|YP_003512499.1| hypothetical protein Snas_3749 [Stackebrandtia nassauensis DSM
44728]
gi|290570441|gb|ADD43406.1| protein of unknown function DUF75 [Stackebrandtia nassauensis
DSM 44728]
Length=301
Score = 281 bits (719), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 143/304 (48%), Positives = 199/304 (66%), Gaps = 5/304 (1%)
Query 15 PGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFA 74
P +Y +E P +S G V++ L GF DAG A + +L LD ++VASF
Sbjct 2 PNGEDLYTVEADTPDIS-----GAVMLVELRGFMDAGQAGQGVTEYLLKELDHQVVASFD 56
Query 75 IDELLDYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITA 134
+DEL+DYR RRP+MTF TDH+ D P L +Y +RD +G PFLLL+G EPDL+WERF A
Sbjct 57 VDELIDYRGRRPVMTFDTDHWVDYDAPRLRVYLMRDDVGVPFLLLSGDEPDLRWERFAEA 116
Query 135 VRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNL 194
V+ L E+ G+R T+ L +PM PHTRP+ +TAH + L+ + +++ +QVPG+A+ L
Sbjct 117 VQSLIEKFGIRLTVALHGIPMGAPHTRPLGVTAHGTDASLLPSGERTLNRLQVPGNAAAL 176
Query 195 LEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQ 254
LE R+ Q GH+ +GF VHVPHYL Q YP A+ LLE + + L + + L E V
Sbjct 177 LELRLGQAGHDAIGFAVHVPHYLAQASYPNASVRLLESLHQATGLSVSVESLREEGRVVD 236
Query 255 AKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQ 314
A++D QV+AS EV+ VVAALERQYD F D++ + + +E+LP+GDELG +FERFLA+Q
Sbjct 237 AEVDSQVRASQEVSDVVAALERQYDMFDDSRPSLLVEESEEELPTGDELGEQFERFLAEQ 296
Query 315 AEKK 318
+
Sbjct 297 QRRS 300
>gi|238063779|ref|ZP_04608488.1| hypothetical protein MCAG_04745 [Micromonospora sp. ATCC 39149]
gi|237885590|gb|EEP74418.1| hypothetical protein MCAG_04745 [Micromonospora sp. ATCC 39149]
Length=305
Score = 279 bits (713), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 152/296 (52%), Positives = 193/296 (66%), Gaps = 5/296 (1%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YEL P+L PVL+ AL GF DAG+A RLA L +LD VASF +D+L
Sbjct 7 LYELSDDLPELGQ-----PVLIQALTGFVDAGNATRLAREQLLTSLDARPVASFDVDQLY 61
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP MTF DH+ D P L ++ L D TPFLLL G EPDL+WERF+ AV LA
Sbjct 62 DYRSRRPSMTFVEDHWEEYDAPTLRVHLLNDDDETPFLLLTGPEPDLQWERFVAAVAGLA 121
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
RL VR T+GL ++PMAVPHTRP +TAH+ RELIS ++P + +QVPG+ +LLEYR+
Sbjct 122 ARLDVRLTVGLNSIPMAVPHTRPTGVTAHATRRELISGYEPWLQRVQVPGTVGHLLEYRL 181
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ G + +GF HVPHY+ Q +YPAAA+ LL V+++ L LP L AA V+ +ID
Sbjct 182 GEQGRDALGFAAHVPHYVAQAEYPAAAEVLLASVSRSTGLLLPRDGLRSAAEVVRVEIDR 241
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQA 315
QV + + A +VAALE QYDAF + L LP+ DELGAE ERFLA+Q
Sbjct 242 QVAQTEDAAALVAALEEQYDAFARGRGENLLAAEAGPLPTADELGAELERFLAEQG 297
>gi|119717076|ref|YP_924041.1| hypothetical protein Noca_2852 [Nocardioides sp. JS614]
gi|119537737|gb|ABL82354.1| protein of unknown function DUF75 [Nocardioides sp. JS614]
Length=311
Score = 279 bits (713), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 149/298 (50%), Positives = 201/298 (68%), Gaps = 3/298 (1%)
Query 28 PQLSSSDGRGPV-LVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRP 86
P+L + RG + +V L+GF DAG+A AA HL + +VA+F +DE DYR+RRP
Sbjct 13 PELDDARSRGALTMVLVLDGFLDAGNAAGRAAQHLVDLSEGPVVATFDVDEFHDYRARRP 72
Query 87 LMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQ 146
M+F DH+ D P L + L D+ GTP+LLL G EPD +WE F AVR + ER GV +
Sbjct 73 PMSFVRDHYDAYDAPRLVVRLLADTGGTPYLLLHGPEPDNRWEAFCRAVREVVERFGVSR 132
Query 147 TIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEV 206
+G+G+VPMAVPHTRPI +T H+N+ ELI+ P E+++P SA LLE R+ + GH+
Sbjct 133 VVGMGSVPMAVPHTRPIAITHHANSPELITGESPWRGELRIPSSAQALLEVRLGEWGHDA 192
Query 207 VGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAE 266
+GF H+PHYL Q DYP A+ ALLEQV G L + L+ L A + +A+I + A+ E
Sbjct 193 MGFVAHIPHYLAQMDYPRASAALLEQVEIAGRLTVDLSGLRAEAEDREAEIARYLAANEE 252
Query 267 VAQVVAALERQYDAFIDAQEN-RSLLTRDEDLPSGDELGAEFERFLAQQAEKKSDDDP 323
VA+VVAALERQYDAF A+E+ SLL RD+ LP+G+E+G EFERFLA ++ DD+P
Sbjct 253 VAEVVAALERQYDAFERAEESGTSLLARDQRLPTGEEIGKEFERFLA-GLDRPGDDEP 309
>gi|302529899|ref|ZP_07282241.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302438794|gb|EFL10610.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=304
Score = 278 bits (712), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 142/301 (48%), Positives = 186/301 (62%), Gaps = 5/301 (1%)
Query 20 MYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELL 79
+YE++ P L G VL+H EGF DAG A RL HL ++ +VA F +D L+
Sbjct 8 LYEVDSDVPDLD-----GAVLLHFFEGFMDAGSAGRLVTDHLTGEVENRIVARFDVDRLI 62
Query 80 DYRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLA 139
DYRSRRP M + DH+ + PEL + L D G PFLLL+G EPD +WE F AVR L
Sbjct 63 DYRSRRPAMIYAVDHWEEYEAPELVVRLLHDEDGIPFLLLSGPEPDREWELFAAAVRQLV 122
Query 140 ERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRM 199
ER GVR T+G +PM PHTRP+ +TAH+ L+ + QP + +QVPGS + +LEYR
Sbjct 123 ERWGVRLTVGYHGIPMGAPHTRPLGVTAHATREHLVGEHQPLPNRMQVPGSIAAMLEYRF 182
Query 200 AQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDE 259
+ GH+ +GF HVPHYL Q+ YPAAA +L+ + K L+LP L AA +A+ID
Sbjct 183 GEWGHDAMGFAAHVPHYLAQSTYPAAALTILDSIGKATGLRLPDGELRTAAEVAKAEIDR 242
Query 260 QVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKS 319
QV S E VV ALERQYD F +A + L E +P+ DELG++FERFLA+Q S
Sbjct 243 QVAESEESVDVVRALERQYDTFTEASGHSLLAESQEHMPTADELGSQFERFLAEQGGDGS 302
Query 320 D 320
+
Sbjct 303 E 303
>gi|317506475|ref|ZP_07964276.1| hypothetical protein HMPREF9336_00646 [Segniliparus rugosus ATCC
BAA-974]
gi|316255236|gb|EFV14505.1| hypothetical protein HMPREF9336_00646 [Segniliparus rugosus ATCC
BAA-974]
Length=313
Score = 278 bits (712), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 152/287 (53%), Positives = 189/287 (66%), Gaps = 1/287 (0%)
Query 37 GPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFT 96
G VL+H+LEGF DAG A +LA AHL +L +A+F ID LLDYRSRRP + F + F
Sbjct 25 GLVLIHSLEGFLDAGQAPKLATAHLLESLPATALATFDIDALLDYRSRRPPLKFAKNSFA 84
Query 97 HSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMA 156
+ ++P L LY LRD GTPFLLLAGLEPDL WERF+ AV +A GV ++IGL + MA
Sbjct 85 NYEEPLLRLYGLRDLNGTPFLLLAGLEPDLMWERFVAAVEKVARHFGVTRSIGLSALAMA 144
Query 157 VPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHY 216
VPHTRP + AHS + LI+D + E + GSAS LLE R+AQH +GFTV+VPHY
Sbjct 145 VPHTRPPVVMAHSADPVLIADHRKYDGEALISGSASALLELRLAQHDIPSLGFTVYVPHY 204
Query 217 LTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALER 276
LT YPA+A LLEQVA+ L LPL L E A +I+EQV AS EV + +AALE
Sbjct 205 LTNASYPASALGLLEQVAQNSGLALPLEALRETIAATHEQIEEQVSASDEVQRAIAALED 264
Query 277 QYDAFIDAQENR-SLLTRDEDLPSGDELGAEFERFLAQQAEKKSDDD 322
QYD E+ L+ E+LPS +ELGA+FERFLA + E +D
Sbjct 265 QYDGHAQTAEDELPPLSELEELPSAEELGAQFERFLATRPEPSPGED 311
>gi|257055520|ref|YP_003133352.1| ATP-grasp superfamily enzyme [Saccharomonospora viridis DSM 43017]
gi|256585392|gb|ACU96525.1| ATP-grasp superfamily enzyme [Saccharomonospora viridis DSM 43017]
Length=307
Score = 277 bits (709), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/280 (51%), Positives = 186/280 (67%), Gaps = 0/280 (0%)
Query 39 VLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHS 98
VL++ +GF DAG A + HL A D +VA F +D LLDYRSRRP MTF DH+
Sbjct 22 VLLYHFDGFVDAGSAGGVVVDHLLAECDGPVVARFDVDRLLDYRSRRPTMTFAADHWADY 81
Query 99 DDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVP 158
++PEL++ LRD+ PFLL G EPD +WE F+ AVR L +R VR + + +PM VP
Sbjct 82 EEPELAVRLLRDADEVPFLLFTGPEPDREWEAFVAAVRGLVQRWRVRLLVNVHGIPMGVP 141
Query 159 HTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLT 218
HTRP+ +TAH+ EL+ ++ ++IQVPGSA+ LLEYR+ Q GH+V+GFT HVPHYL
Sbjct 142 HTRPLGITAHATRPELVRSYRTVFNQIQVPGSAAALLEYRLGQAGHDVIGFTAHVPHYLA 201
Query 219 QTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQY 278
Q+ YPAAA L + V + L++PLA L EAA +ID QV+ +AE A VV ALE+QY
Sbjct 202 QSRYPAAALRLFDAVTEATGLRVPLADLREAAHAANLEIDRQVRDNAEAADVVRALEQQY 261
Query 279 DAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKK 318
DAF A +LL + LPSGDEL F+RFLA+Q + +
Sbjct 262 DAFTAAAPGSNLLADADSLPSGDELAEHFQRFLAEQQQDR 301
Lambda      K        H
   0.316    0.132    0.371

Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 574046524410
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
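
Note on reading the scores: the bit scores and Expect values in the alignment headers can be approximately recomputed from the raw scores (the numbers in parentheses) using the gapped Lambda and K and the effective search space listed in this footer. The sketch below applies the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * K * exp(-Lambda * S). The function names are hypothetical and not part of BLAST. Since the printed parameters are rounded and this search used compositional matrix adjustment, the recomputed E-values should be expected to match the report only to within a small factor.

```python
import math

# Values copied from the statistics footer of this report (gapped parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 574046524410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Approximate E-value for a raw score (hypothetical helper)."""
    return search_space * k * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores taken from two alignment headers in this report.
    for raw in (740, 719):
        print(raw, int(bit_score(raw)), f"{expect_value(raw):.1e}")
```

For the rows checked here, the integer part of the computed bit score matches the printed value (for example, raw score 719 gives about 281.6 and is printed as 281), which suggests the report truncates rather than rounds; that is an observation from this output, not a documented guarantee.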