BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2411c
Length=551
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609548|ref|NP_216927.1| hypothetical protein Rv2411c [Mycob... 1125 0.0
gi|289754518|ref|ZP_06513896.1| conserved hypothetical protein [... 1117 0.0
gi|294994481|ref|ZP_06800172.1| hypothetical protein Mtub2_08193... 1108 0.0
gi|340627422|ref|YP_004745874.1| hypothetical protein MCAN_24451... 1087 0.0
gi|15827247|ref|NP_301510.1| hypothetical protein ML0605 [Mycoba... 960 0.0
gi|466974|gb|AAA17160.1| u1937b [Mycobacterium leprae] 956 0.0
gi|183983712|ref|YP_001852003.1| hypothetical protein MMAR_3732 ... 954 0.0
gi|41408321|ref|NP_961157.1| hypothetical protein MAP2223c [Myco... 951 0.0
gi|240170817|ref|ZP_04749476.1| hypothetical protein MkanA1_1599... 951 0.0
gi|254774589|ref|ZP_05216105.1| hypothetical protein MaviaA2_079... 950 0.0
gi|296170533|ref|ZP_06852117.1| UDP-N-acetylmuramate dehydrogena... 941 0.0
gi|118618943|ref|YP_907275.1| hypothetical protein MUL_3675 [Myc... 941 0.0
gi|336458242|gb|EGO37223.1| hypothetical protein MAPs_15370 [Myc... 931 0.0
gi|342857811|ref|ZP_08714467.1| hypothetical protein MCOL_03005 ... 930 0.0
gi|254819868|ref|ZP_05224869.1| hypothetical protein MintA_08084... 925 0.0
gi|118467439|ref|YP_888839.1| hypothetical protein MSMEG_4570 [M... 883 0.0
gi|108800475|ref|YP_640672.1| hypothetical protein Mmcs_3509 [My... 882 0.0
gi|145223253|ref|YP_001133931.1| hypothetical protein Mflv_2666 ... 877 0.0
gi|120404865|ref|YP_954694.1| hypothetical protein Mvan_3911 [My... 872 0.0
gi|169628725|ref|YP_001702374.1| hypothetical protein MAB_1635 [... 833 0.0
gi|226360418|ref|YP_002778196.1| hypothetical protein ROP_10040 ... 805 0.0
gi|111018294|ref|YP_701266.1| hypothetical protein RHA1_ro01284 ... 804 0.0
gi|333918819|ref|YP_004492400.1| hypothetical protein AS9A_1148 ... 795 0.0
gi|229493138|ref|ZP_04386930.1| conserved hypothetical protein [... 786 0.0
gi|226307253|ref|YP_002767213.1| hypothetical protein RER_37660 ... 784 0.0
gi|262203073|ref|YP_003274281.1| hypothetical protein Gbro_3183 ... 781 0.0
gi|296140493|ref|YP_003647736.1| hypothetical protein Tpau_2799 ... 779 0.0
gi|317507854|ref|ZP_07965555.1| hypothetical protein HMPREF9336_... 778 0.0
gi|343926798|ref|ZP_08766291.1| hypothetical protein GOALK_072_0... 755 0.0
gi|296392908|ref|YP_003657792.1| hypothetical protein Srot_0474 ... 754 0.0
gi|256375351|ref|YP_003099011.1| hypothetical protein Amir_1213 ... 753 0.0
gi|302531218|ref|ZP_07283560.1| DUF404 domain-containing protein... 723 0.0
gi|300791010|ref|YP_003771301.1| hypothetical protein AMED_9210 ... 715 0.0
gi|158313932|ref|YP_001506440.1| hypothetical protein Franean1_2... 630 1e-178
gi|312198642|ref|YP_004018703.1| hypothetical protein FraEuI1c_4... 630 2e-178
gi|111223940|ref|YP_714734.1| hypothetical protein FRAAL4547 [Fr... 625 5e-177
gi|288920571|ref|ZP_06414877.1| protein of unknown function DUF4... 625 7e-177
gi|336177739|ref|YP_004583114.1| hypothetical protein FsymDg_174... 619 3e-175
gi|119960999|ref|YP_947876.1| hypothetical protein AAur_2132 [Ar... 609 5e-172
gi|116670678|ref|YP_831611.1| hypothetical protein Arth_2131 [Ar... 607 2e-171
gi|148271675|ref|YP_001221236.1| hypothetical protein CMM_0496 [... 602 7e-170
gi|170780706|ref|YP_001709038.1| hypothetical protein CMS_0252 [... 602 8e-170
gi|88856624|ref|ZP_01131280.1| hypothetical protein A20C1_10595 ... 601 1e-169
gi|336115926|ref|YP_004570692.1| hypothetical protein MLP_02750 ... 599 5e-169
gi|220912637|ref|YP_002487946.1| hypothetical protein Achl_1882 ... 599 5e-169
gi|325963241|ref|YP_004241147.1| hypothetical protein Asphe3_185... 598 9e-169
gi|258654683|ref|YP_003203839.1| hypothetical protein Namu_4571 ... 592 4e-167
gi|336320339|ref|YP_004600307.1| hypothetical protein Celgi_1220... 585 1e-164
gi|326330358|ref|ZP_08196668.1| hypothetical protein NBCG_01793 ... 583 4e-164
gi|323358427|ref|YP_004224823.1| hypothetical protein MTES_1979 ... 582 6e-164
>gi|15609548|ref|NP_216927.1| hypothetical protein Rv2411c [Mycobacterium tuberculosis H37Rv]
gi|15841929|ref|NP_336966.1| hypothetical protein MT2484 [Mycobacterium tuberculosis CDC1551]
gi|31793590|ref|NP_856083.1| hypothetical protein Mb2434c [Mycobacterium bovis AF2122/97]
75 more sequence titles
Length=551
Score = 1125 bits (2909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/551 (100%), Positives = 551/551 (100%), Gaps = 0/551 (0%)
Query 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA
Sbjct 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK
Sbjct 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG
Sbjct 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE
Sbjct 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR
Sbjct 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK
Sbjct 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG
Sbjct 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQ 540
SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQ
Sbjct 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQ 540
Query 541 QQQQQQQQAFH 551
QQQQQQQQAFH
Sbjct 541 QQQQQQQQAFH 551
>gi|289754518|ref|ZP_06513896.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289695105|gb|EFD62534.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|339295308|gb|AEJ47419.1| hypothetical protein CCDC5079_2229 [Mycobacterium tuberculosis
CCDC5079]
gi|339298927|gb|AEJ51037.1| hypothetical protein CCDC5180_2200 [Mycobacterium tuberculosis
CCDC5180]
Length=548
Score = 1117 bits (2888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 547/548 (99%), Positives = 548/548 (100%), Gaps = 0/548 (0%)
Query 4 VSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD 63
+SLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD
Sbjct 1 MSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD 60
Query 64 ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE 123
ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE
Sbjct 61 ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE 120
Query 124 CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR 183
CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR
Sbjct 121 CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR 180
Query 184 VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP 243
VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP
Sbjct 181 VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP 240
Query 244 TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID 303
TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID
Sbjct 241 TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID 300
Query 304 DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK 363
DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK
Sbjct 301 DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK 360
Query 364 PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD 423
PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD
Sbjct 361 PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD 420
Query 424 DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV 483
DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV
Sbjct 421 DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV 480
Query 484 VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ 543
VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ
Sbjct 481 VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ 540
Query 544 QQQQQAFH 551
QQQQQAFH
Sbjct 541 QQQQQAFH 548
>gi|294994481|ref|ZP_06800172.1| hypothetical protein Mtub2_08193 [Mycobacterium tuberculosis
210]
Length=624
Score = 1108 bits (2865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 544/546 (99%), Positives = 545/546 (99%), Gaps = 0/546 (0%)
Query 4 VSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD 63
+SLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD
Sbjct 1 MSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSD 60
Query 64 ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE 123
ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE
Sbjct 61 ASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALE 120
Query 124 CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR 183
CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR
Sbjct 121 CYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFR 180
Query 184 VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP 243
VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP
Sbjct 181 VLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADP 240
Query 244 TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID 303
TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID
Sbjct 241 TVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRID 300
Query 304 DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK 363
DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK
Sbjct 301 DAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEK 360
Query 364 PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD 423
PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD
Sbjct 361 PLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRD 420
Query 424 DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV 483
DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV
Sbjct 421 DPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRV 480
Query 484 VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ 543
VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ
Sbjct 481 VNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQ 540
Query 544 QQQQQA 549
QQQQQ
Sbjct 541 QQQQQG 546
>gi|340627422|ref|YP_004745874.1| hypothetical protein MCAN_24451 [Mycobacterium canettii CIPT
140010059]
gi|340005612|emb|CCC44776.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=550
Score = 1087 bits (2812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/551 (99%), Positives = 548/551 (99%), Gaps = 1/551 (0%)
Query 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA
Sbjct 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK
Sbjct 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRD RG
Sbjct 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDDRG 180
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE
Sbjct 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR
Sbjct 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK
Sbjct 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG
Sbjct 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQ 540
SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQ
Sbjct 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQ-P 539
Query 541 QQQQQQQQAFH 551
QQQQQQQQAFH
Sbjct 540 QQQQQQQQAFH 550
>gi|15827247|ref|NP_301510.1| hypothetical protein ML0605 [Mycobacterium leprae TN]
gi|221229725|ref|YP_002503141.1| hypothetical protein MLBr_00605 [Mycobacterium leprae Br4923]
gi|8039819|sp|Q49755.2|Y605_MYCLE RecName: Full=Uncharacterized protein ML0605
gi|2398688|emb|CAB16148.1| hypothetical protein MLCL536.05c [Mycobacterium leprae]
gi|13092796|emb|CAC30113.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932832|emb|CAR70698.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=561
Score = 960 bits (2481), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/536 (89%), Positives = 499/536 (94%), Gaps = 6/536 (1%)
Query 1 MRRVSLPNQLNET------RRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKG 54
M +VSLP+QL ET R RS R ERIFGGYNTSD+Y+MAFDEMFD QG VRGPYKG
Sbjct 1 MSQVSLPSQLKETGPRLQSRCRSSARSERIFGGYNTSDIYSMAFDEMFDVQGNVRGPYKG 60
Query 55 IYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERG 114
IYAELAPSDASELKARA+AL RAFIDQGITFSLSGQERPFPLDLVPRVISA EW+RLERG
Sbjct 61 IYAELAPSDASELKARAEALARAFIDQGITFSLSGQERPFPLDLVPRVISASEWSRLERG 120
Query 115 ITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDL 174
ITQRVKALE YLDDIYGDQEILRDGVIPRRL+TSCEHFHRQAVGI+PPNGVRIHVAGIDL
Sbjct 121 ITQRVKALEMYLDDIYGDQEILRDGVIPRRLITSCEHFHRQAVGIIPPNGVRIHVAGIDL 180
Query 175 IRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRN 234
IRD G+FRVLEDNLRSPSGVSYVMENRRT+ARVFPNLFATHRVRAVDDYASHLLRALRN
Sbjct 181 IRDDSGNFRVLEDNLRSPSGVSYVMENRRTIARVFPNLFATHRVRAVDDYASHLLRALRN 240
Query 235 SAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQ 294
SAATNEADPTVVVLTPGV N+AYFEHSLLARQMGVELVEGRDLFCRDNQVYM TTEGERQ
Sbjct 241 SAATNEADPTVVVLTPGVANAAYFEHSLLARQMGVELVEGRDLFCRDNQVYMCTTEGERQ 300
Query 295 VDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPT 354
VDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SSAIGNGVGDDKLVYTYVPT
Sbjct 301 VDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTYVPT 360
Query 355 MIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAEL 414
M+EYYL EKPLLANV+TLRCWLDDER+EVLDRI +LVLKPVEGSGGYGIVFGP+AS+ EL
Sbjct 361 MMEYYLREKPLLANVDTLRCWLDDERQEVLDRIHDLVLKPVEGSGGYGIVFGPDASEKEL 420
Query 415 AAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTR 474
AA S+KIRDDPRSWIAQP+MELSTVPT++ TLAPRYVDLRPFAVNDGN+VWVLPGGLTR
Sbjct 421 AAASKKIRDDPRSWIAQPVMELSTVPTQVGSTLAPRYVDLRPFAVNDGNDVWVLPGGLTR 480
Query 475 VALVEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDAS 530
VALVEGSRVVNSSQGGGSKDTWVLAP AS ARELGAA+IV SLPQ DP D S
Sbjct 481 VALVEGSRVVNSSQGGGSKDTWVLAPHASYGARELGAAEIVCSLPQSSPDPVPDGS 536
>gi|466974|gb|AAA17160.1| u1937b [Mycobacterium leprae]
Length=558
Score = 956 bits (2472), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/533 (89%), Positives = 497/533 (94%), Gaps = 6/533 (1%)
Query 4 VSLPNQLNET------RRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYA 57
+SLP+QL ET R RS R ERIFGGYNTSD+Y+MAFDEMFD QG VRGPYKGIYA
Sbjct 1 MSLPSQLKETGPRLQSRCRSSARSERIFGGYNTSDIYSMAFDEMFDVQGNVRGPYKGIYA 60
Query 58 ELAPSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQ 117
ELAPSDASELKARA+AL RAFIDQGITFSLSGQERPFPLDLVPRVISA EW+RLERGITQ
Sbjct 61 ELAPSDASELKARAEALARAFIDQGITFSLSGQERPFPLDLVPRVISASEWSRLERGITQ 120
Query 118 RVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRD 177
RVKALE YLDDIYGDQEILRDGVIPRRL+TSCEHFHRQAVGI+PPNGVRIHVAGIDLIRD
Sbjct 121 RVKALEMYLDDIYGDQEILRDGVIPRRLITSCEHFHRQAVGIIPPNGVRIHVAGIDLIRD 180
Query 178 HRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAA 237
G+FRVLEDNLRSPSGVSYVMENRRT+ARVFPNLFATHRVRAVDDYASHLLRALRNSAA
Sbjct 181 DSGNFRVLEDNLRSPSGVSYVMENRRTIARVFPNLFATHRVRAVDDYASHLLRALRNSAA 240
Query 238 TNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDV 297
TNEADPTVVVLTPGV N+AYFEHSLLARQMGVELVEGRDLFCRDNQVYM TTEGERQVDV
Sbjct 241 TNEADPTVVVLTPGVANAAYFEHSLLARQMGVELVEGRDLFCRDNQVYMCTTEGERQVDV 300
Query 298 IYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIE 357
IYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SSAIGNGVGDDKLVYTYVPTM+E
Sbjct 301 IYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTYVPTMME 360
Query 358 YYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAV 417
YYL EKPLLANV+TLRCWLDDER+EVLDRI +LVLKPVEGSGGYGIVFGP+AS+ ELAA
Sbjct 361 YYLREKPLLANVDTLRCWLDDERQEVLDRIHDLVLKPVEGSGGYGIVFGPDASEKELAAA 420
Query 418 SQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVAL 477
S+KIRDDPRSWIAQP+MELSTVPT++ TLAPRYVDLRPFAVNDGN+VWVLPGGLTRVAL
Sbjct 421 SKKIRDDPRSWIAQPVMELSTVPTQVGSTLAPRYVDLRPFAVNDGNDVWVLPGGLTRVAL 480
Query 478 VEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDAS 530
VEGSRVVNSSQGGGSKDTWVLAP AS ARELGAA+IV SLPQ DP D S
Sbjct 481 VEGSRVVNSSQGGGSKDTWVLAPHASYGARELGAAEIVCSLPQSSPDPVPDGS 533
>gi|183983712|ref|YP_001852003.1| hypothetical protein MMAR_3732 [Mycobacterium marinum M]
gi|183177038|gb|ACC42148.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=557
Score = 954 bits (2466), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/528 (89%), Positives = 495/528 (94%), Gaps = 0/528 (0%)
Query 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
M +VSL + +RRRS R ERIFGGYN+SDVY+ AFDEMFDAQG VRGPYKGIYAELA
Sbjct 1 MNQVSLTEPMQASRRRSQARPERIFGGYNSSDVYSQAFDEMFDAQGNVRGPYKGIYAELA 60
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
PSDASELKARADAL RAF+DQGITFSLSGQERPFPLDLVPRVISA EW+RLERGI QRVK
Sbjct 61 PSDASELKARADALDRAFLDQGITFSLSGQERPFPLDLVPRVISAAEWSRLERGIIQRVK 120
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALE YLDDIYGDQEILRDGVIPRRL+TSCEHFHR+AVGI+PPNGVRIHVAGIDLIRD RG
Sbjct 121 ALEMYLDDIYGDQEILRDGVIPRRLITSCEHFHREAVGIIPPNGVRIHVAGIDLIRDERG 180
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
DFRVLEDNLRSPSGVSYV+ENRRTMARVFPNLFATHRVRAVDDY SHLLRALRNSAATNE
Sbjct 181 DFRVLEDNLRSPSGVSYVIENRRTMARVFPNLFATHRVRAVDDYPSHLLRALRNSAATNE 240
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGE QVDVIYR
Sbjct 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGECQVDVIYR 300
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SS+IGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSSIGNGVGDDKLVYTYVPTMIEYYL 360
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
EKPLLANV+T RCWLD+EREEVLDR++ELVLKPVEGSGGYGIVFGP+AS ELAAV +K
Sbjct 361 REKPLLANVDTYRCWLDEEREEVLDRLKELVLKPVEGSGGYGIVFGPDASDKELAAVGKK 420
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
IRDDPRSWIAQPMMELSTVPTRIE +LAPRYVDLRPFAVNDGN+VWVLPGGLTRVA VEG
Sbjct 421 IRDDPRSWIAQPMMELSTVPTRIEDSLAPRYVDLRPFAVNDGNDVWVLPGGLTRVAQVEG 480
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVD 528
SRVVNSSQGGGSKDTWVLAPR+ A RELG AQ++RSLP+ + + + D
Sbjct 481 SRVVNSSQGGGSKDTWVLAPRSLATGRELGGAQVLRSLPRTVPEQSPD 528
>gi|41408321|ref|NP_961157.1| hypothetical protein MAP2223c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396677|gb|AAS04540.1| hypothetical protein MAP_2223c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=558
Score = 951 bits (2459), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/527 (90%), Positives = 490/527 (93%), Gaps = 2/527 (0%)
Query 4 VSLPNQLNETRRR-SPTRGERIFGGYNT-SDVYAMAFDEMFDAQGIVRGPYKGIYAELAP 61
+SL NQL +T R R ERIFGGYN SD Y MAFDEMFDA G VRGPYKGIYAELAP
Sbjct 1 MSLSNQLEDTGRGFRAARSERIFGGYNVASDAYDMAFDEMFDAAGAVRGPYKGIYAELAP 60
Query 62 SDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKA 121
SDASELKARA+AL RAF+DQGITFSLSGQERPFPLDLVPRVISA EW RLERGITQRVKA
Sbjct 61 SDASELKARAEALSRAFLDQGITFSLSGQERPFPLDLVPRVISAAEWARLERGITQRVKA 120
Query 122 LECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGD 181
LE YLDDIYGDQEIL DGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRD G+
Sbjct 121 LEMYLDDIYGDQEILNDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDEEGN 180
Query 182 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 241
FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYA+HLLRALRNSAATNEA
Sbjct 181 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYAAHLLRALRNSAATNEA 240
Query 242 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 301
DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR
Sbjct 241 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 300
Query 302 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLH 361
IDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTYVPTMIEYYLG 360
Query 362 EKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKI 421
EKPLLANVETLRCWLDDEREEVLDRI ELVLKPVEGSGGYGIVFGPEAS ELAAV++KI
Sbjct 361 EKPLLANVETLRCWLDDEREEVLDRIDELVLKPVEGSGGYGIVFGPEASDKELAAVAKKI 420
Query 422 RDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGS 481
RDDPRSWIAQPMMELSTVPT++ +LAPRYVDLRPFAVNDG +VWVLPGGLTRVALVEGS
Sbjct 421 RDDPRSWIAQPMMELSTVPTQVGSSLAPRYVDLRPFAVNDGEDVWVLPGGLTRVALVEGS 480
Query 482 RVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVD 528
RVVNSSQGGGSKDTWVLA RAS+ ELGAA++VRSLP + DP VD
Sbjct 481 RVVNSSQGGGSKDTWVLASRASSGDHELGAAEVVRSLPTAMPDPLVD 527
>gi|240170817|ref|ZP_04749476.1| hypothetical protein MkanA1_15997 [Mycobacterium kansasii ATCC
12478]
Length=544
Score = 951 bits (2457), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/545 (89%), Positives = 508/545 (94%), Gaps = 5/545 (0%)
Query 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
M RVSL + + TRRRS R ERIF GY+ SD YA+AFDEMFDAQG VRGPYKGIYAELA
Sbjct 1 MTRVSLSDPIEATRRRSSARSERIFDGYHKSDGYALAFDEMFDAQGNVRGPYKGIYAELA 60
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
P+DASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISA EWTRLERGI QRV+
Sbjct 61 PTDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAAEWTRLERGIIQRVQ 120
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALE YLDDIYGDQEILRDGVIPRRLVTSCEHFHR+AVGIVPPNGVRIHVAGIDLIRD RG
Sbjct 121 ALERYLDDIYGDQEILRDGVIPRRLVTSCEHFHREAVGIVPPNGVRIHVAGIDLIRDDRG 180
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDY +HLLRALRNSAATNE
Sbjct 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYPAHLLRALRNSAATNE 240
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGVYN AYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR
Sbjct 241 ADPTVVVLTPGVYNPAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDDA+LDPLQFRADSVLGVAGLVNAARAGNVV+SS+IGNGVGDDKLVYTYVP MIEYYL
Sbjct 301 RIDDAYLDPLQFRADSVLGVAGLVNAARAGNVVISSSIGNGVGDDKLVYTYVPAMIEYYL 360
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
EKPLLANVET RCWL+DEREEVLDRI ELVLKPVEGSGGYGIVFGP+AS+ ELAAV +K
Sbjct 361 REKPLLANVETYRCWLEDEREEVLDRIGELVLKPVEGSGGYGIVFGPQASEKELAAVGKK 420
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
IRD+PRSWIAQPMMELSTVPTRIEG+LAPRYVDLRPFAVNDGN++WVLPGGLTRVALVEG
Sbjct 421 IRDNPRSWIAQPMMELSTVPTRIEGSLAPRYVDLRPFAVNDGNDIWVLPGGLTRVALVEG 480
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCD-PTVDASGYEPHDQQP 539
SRVVNSSQGGGSKDTWVLAPRASAA RELG AQ+VRSLP+ + + P D+ P ++Q
Sbjct 481 SRVVNSSQGGGSKDTWVLAPRASAADRELGRAQVVRSLPRVVPEQPPTDS----PRNEQS 536
Query 540 QQQQQ 544
QQQQ+
Sbjct 537 QQQQK 541
>gi|254774589|ref|ZP_05216105.1| hypothetical protein MaviaA2_07958 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=558
Score = 950 bits (2455), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/527 (90%), Positives = 490/527 (93%), Gaps = 2/527 (0%)
Query 4 VSLPNQLNETRRR-SPTRGERIFGGYNT-SDVYAMAFDEMFDAQGIVRGPYKGIYAELAP 61
+SL NQL +T R R ERIFGGYN SD Y MAFDEMFDA G VRGPYKGIYAELAP
Sbjct 1 MSLSNQLEDTGRGFRAARSERIFGGYNVASDAYDMAFDEMFDAAGAVRGPYKGIYAELAP 60
Query 62 SDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKA 121
SDASELKARA+AL RAF+DQGITFSLSGQERPFPLDLVPRVISA EW RLERGITQRVKA
Sbjct 61 SDASELKARAEALSRAFLDQGITFSLSGQERPFPLDLVPRVISAAEWARLERGITQRVKA 120
Query 122 LECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGD 181
LE YLDDI GDQEIL DGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRD +G+
Sbjct 121 LEMYLDDIDGDQEILNDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDEKGN 180
Query 182 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 241
FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYA+HLLRALRNSAATNEA
Sbjct 181 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYAAHLLRALRNSAATNEA 240
Query 242 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 301
DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR
Sbjct 241 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 300
Query 302 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLH 361
IDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTYVPTMIEYYLG 360
Query 362 EKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKI 421
EKPLLANVETLRCWLDDEREEVLDRI ELVLKPVEGSGGYGIVFGPEAS ELAAV++KI
Sbjct 361 EKPLLANVETLRCWLDDEREEVLDRIDELVLKPVEGSGGYGIVFGPEASDKELAAVAKKI 420
Query 422 RDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGS 481
RDDPRSWIAQPMMELSTVPT++ +LAPRYVDLRPFAVNDG +VWVLPGGLTRVALVEGS
Sbjct 421 RDDPRSWIAQPMMELSTVPTQVGSSLAPRYVDLRPFAVNDGEDVWVLPGGLTRVALVEGS 480
Query 482 RVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVD 528
RVVNSSQGGGSKDTWVLA RAS+ ELGAA++VRSLP + DP VD
Sbjct 481 RVVNSSQGGGSKDTWVLASRASSGDHELGAAEVVRSLPTAMPDPLVD 527
>gi|296170533|ref|ZP_06852117.1| UDP-N-acetylmuramate dehydrogenase [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894765|gb|EFG74490.1| UDP-N-acetylmuramate dehydrogenase [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=558
Score = 941 bits (2433), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/545 (87%), Positives = 497/545 (92%), Gaps = 6/545 (1%)
Query 4 VSLPNQLNETRR-RSPTRGERIFGGYNTS-DVYAMAFDEMFDAQGIVRGPYKGIYAELAP 61
+SLP+QL + R R ERIFGGYN S D+Y+ AFDEMFDAQG VRGPYKGIYAELAP
Sbjct 1 MSLPSQLEDRGRGLRAARAERIFGGYNASPDLYSAAFDEMFDAQGAVRGPYKGIYAELAP 60
Query 62 SDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKA 121
SDASELKARA+ALGRAFIDQGITFSLSGQERPFPLDLVPRVISA EW+RLERGI QRVKA
Sbjct 61 SDASELKARAEALGRAFIDQGITFSLSGQERPFPLDLVPRVISAAEWSRLERGIIQRVKA 120
Query 122 LECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGD 181
LE YLDDIYGDQEIL D +IPRRLVTSCEHFHRQA+GIVPPNGVRIHVAGIDLIRD +G+
Sbjct 121 LEMYLDDIYGDQEILSDDIIPRRLVTSCEHFHRQAMGIVPPNGVRIHVAGIDLIRDEKGN 180
Query 182 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 241
FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA
Sbjct 181 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 240
Query 242 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 301
DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTT+GE QVDVIYRR
Sbjct 241 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTDGEVQVDVIYRR 300
Query 302 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLH 361
IDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTYVPTMIEYYLG 360
Query 362 EKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKI 421
EKPLLANVET RCWLDDEREEVLDRI ELVLKPVEGSGGYGIVFGPEAS ELA V++K+
Sbjct 361 EKPLLANVETYRCWLDDEREEVLDRIDELVLKPVEGSGGYGIVFGPEASDKELATVAKKV 420
Query 422 RDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGS 481
RDDPRSWIAQPMMELSTVPT+I TLAPRYVDLRPFAVNDG++VWVLPGGLTRVALVEGS
Sbjct 421 RDDPRSWIAQPMMELSTVPTQIGNTLAPRYVDLRPFAVNDGDDVWVLPGGLTRVALVEGS 480
Query 482 RVVNSSQGGGSKDTWVLA-PRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQ 540
RVVNSSQGGGSKDTWVLA R SA ELGAA++VRSLP+ + DP D + P Q Q
Sbjct 481 RVVNSSQGGGSKDTWVLASSRTSADEHELGAAEVVRSLPESMPDPASDGA---PRRTQTQ 537
Query 541 QQQQQ 545
Q ++
Sbjct 538 SQTRE 542
>gi|118618943|ref|YP_907275.1| hypothetical protein MUL_3675 [Mycobacterium ulcerans Agy99]
gi|118571053|gb|ABL05804.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=557
Score = 941 bits (2432), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/528 (88%), Positives = 492/528 (94%), Gaps = 0/528 (0%)
Query 1 MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
M +VSL + +RRRS R ERIFGGYN+SDVY+ AFDEMFDAQG VRGPYKGIYAELA
Sbjct 1 MNQVSLTEPMQASRRRSQARPERIFGGYNSSDVYSQAFDEMFDAQGNVRGPYKGIYAELA 60
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
PSDASELKARADAL RAF+DQGITFSLSGQERPFPLDLVPRVISA EW+RLERGI QRVK
Sbjct 61 PSDASELKARADALDRAFLDQGITFSLSGQERPFPLDLVPRVISAAEWSRLERGIIQRVK 120
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALE YLDDIYGDQEILRDGVIPRRL+TSCEHFHR+A+GI+ PNGVRIHVAGIDLIR+ G
Sbjct 121 ALEMYLDDIYGDQEILRDGVIPRRLITSCEHFHREAMGIITPNGVRIHVAGIDLIRNECG 180
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
DFRVLEDNLRSPSGVSYV+ENRRTMARVFPNLFATHRVRAVDDY SHLLRALRNSAATNE
Sbjct 181 DFRVLEDNLRSPSGVSYVIENRRTMARVFPNLFATHRVRAVDDYPSHLLRALRNSAATNE 240
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGVYNSA+FEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGE QVDVIYR
Sbjct 241 ADPTVVVLTPGVYNSAHFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGECQVDVIYR 300
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+SS+IGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSSIGNGVGDDKLVYTYVPTMIEYYL 360
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
EKPLLANV+T RCWLD+EREEVLDR++ELVLKPVEGSGGYGIVFGP+AS ELAAV +K
Sbjct 361 REKPLLANVDTYRCWLDEEREEVLDRLKELVLKPVEGSGGYGIVFGPDASDKELAAVGKK 420
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
IRDDPRSWIAQPMMELSTVPTRIE +LAPRYVDLRPFAVNDGN+VWVLPGGLTRVA VEG
Sbjct 421 IRDDPRSWIAQPMMELSTVPTRIEDSLAPRYVDLRPFAVNDGNDVWVLPGGLTRVAQVEG 480
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVD 528
SRVVNSSQGGGSK TWVLAPR+ A RELG AQ++RSLP+ + + + D
Sbjct 481 SRVVNSSQGGGSKATWVLAPRSLATGRELGGAQVLRSLPRTVPEQSPD 528
>gi|336458242|gb|EGO37223.1| hypothetical protein MAPs_15370 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=531
Score = 931 bits (2407), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/499 (92%), Positives = 473/499 (95%), Gaps = 0/499 (0%)
Query 30 TSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSG 89
SD Y MAFDEMFDA G VRGPYKGIYAELAPSDASELKARA+AL RAF+DQGITFSLSG
Sbjct 2 ASDAYDMAFDEMFDAAGAVRGPYKGIYAELAPSDASELKARAEALSRAFLDQGITFSLSG 61
Query 90 QERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSC 149
QERPFPLDLVPRVISA EW RLERGITQRVKALE YLDDIYGDQEIL DGVIPRRLVTSC
Sbjct 62 QERPFPLDLVPRVISAAEWARLERGITQRVKALEMYLDDIYGDQEILNDGVIPRRLVTSC 121
Query 150 EHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVF 209
EHFHRQAVGIVPPNGVRIHVAGIDLIRD +G+FRVLEDNLRSPSGVSYVMENRRTMARVF
Sbjct 122 EHFHRQAVGIVPPNGVRIHVAGIDLIRDEKGNFRVLEDNLRSPSGVSYVMENRRTMARVF 181
Query 210 PNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGV 269
PNLFATHRVRAVDDYA+HLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGV
Sbjct 182 PNLFATHRVRAVDDYAAHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGV 241
Query 270 ELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARA 329
ELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARA
Sbjct 242 ELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARA 301
Query 330 GNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRE 389
GNVV+SSAIGNGVGDDKLVYTYVPTMIEYYL EKPLLANVETLRCWLDDEREEVLDRI E
Sbjct 302 GNVVISSAIGNGVGDDKLVYTYVPTMIEYYLGEKPLLANVETLRCWLDDEREEVLDRIDE 361
Query 390 LVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAP 449
LVLKPVEGSGGYGIVFGPEAS ELAAV++KIRDDPRSWIAQPMMELSTVPT++ +LAP
Sbjct 362 LVLKPVEGSGGYGIVFGPEASDKELAAVAKKIRDDPRSWIAQPMMELSTVPTQVGSSLAP 421
Query 450 RYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAAREL 509
RYVDLRPFAVNDG +VWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA RAS+ EL
Sbjct 422 RYVDLRPFAVNDGEDVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLASRASSGDHEL 481
Query 510 GAAQIVRSLPQPLCDPTVD 528
GAA++VRSLP + DP VD
Sbjct 482 GAAEVVRSLPTAMPDPLVD 500
>gi|342857811|ref|ZP_08714467.1| hypothetical protein MCOL_03005 [Mycobacterium colombiense CECT
3035]
gi|342135144|gb|EGT88310.1| hypothetical protein MCOL_03005 [Mycobacterium colombiense CECT
3035]
Length=560
Score = 930 bits (2404), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/548 (87%), Positives = 500/548 (92%), Gaps = 3/548 (0%)
Query 4 VSLPNQLNETRRR-SPTRGERIFGGYN-TSDVYAMAFDEMFDAQGIVRGPYKGIYAELAP 61
+SL NQL++++R R ERIFGGYN +SD Y MAFDEMFDAQG VRGPYKGIYAELAP
Sbjct 1 MSLTNQLDDSKRGFRAARAERIFGGYNGSSDAYDMAFDEMFDAQGAVRGPYKGIYAELAP 60
Query 62 SDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKA 121
SDASELKARA+AL RAF+DQGITFSLSGQERPFPLDLVPRVISA EWTRLERGITQRVKA
Sbjct 61 SDASELKARAEALSRAFLDQGITFSLSGQERPFPLDLVPRVISAAEWTRLERGITQRVKA 120
Query 122 LECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGD 181
LE YLDD+YGDQEIL DGVIPRRLVTSCEHFHRQA+GIVPPNGVRIHVAGIDLIRD +G
Sbjct 121 LEMYLDDVYGDQEILNDGVIPRRLVTSCEHFHRQAMGIVPPNGVRIHVAGIDLIRDEKGV 180
Query 182 FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 241
+RVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA
Sbjct 181 WRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEA 240
Query 242 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 301
DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR
Sbjct 241 DPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR 300
Query 302 IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLH 361
IDDAFLDPLQFRADS+LGVAGLVNAARAGNV +SSAIGNGVGDDKLVYTYVPTMIEYYL
Sbjct 301 IDDAFLDPLQFRADSMLGVAGLVNAARAGNVSISSAIGNGVGDDKLVYTYVPTMIEYYLG 360
Query 362 EKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKI 421
EKPLLANVETLRCWLDDEREE LDRI ELV+KPVEGSGGYGIVFGPEAS ELAA ++KI
Sbjct 361 EKPLLANVETLRCWLDDEREEALDRIDELVIKPVEGSGGYGIVFGPEASAKELAAAAKKI 420
Query 422 RDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGS 481
RDDPRSWIAQPMMELSTVPT+I TLAPRYVDLRPFAVNDGN+V+VLPGGLTRVALVEGS
Sbjct 421 RDDPRSWIAQPMMELSTVPTQIGNTLAPRYVDLRPFAVNDGNDVFVLPGGLTRVALVEGS 480
Query 482 RVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEP-HDQQPQ 540
RVVNSSQGGGSKDTWVLA RAS ELGAA++VRSLP+ + DP D+ QQPQ
Sbjct 481 RVVNSSQGGGSKDTWVLASRASGGEHELGAAEVVRSLPESMPDPLEDSPRLTSVTSQQPQ 540
Query 541 QQQQQQQQ 548
Q++
Sbjct 541 PTDHPQRE 548
>gi|254819868|ref|ZP_05224869.1| hypothetical protein MintA_08084 [Mycobacterium intracellulare
ATCC 13950]
Length=528
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/509 (90%), Positives = 477/509 (94%), Gaps = 1/509 (0%)
Query 36 MAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPFP 95
MAFDEMFDAQG VRGPYKGIYAELAPSDASELKARA+AL RAF+DQGITFSLSGQERPFP
Sbjct 1 MAFDEMFDAQGAVRGPYKGIYAELAPSDASELKARAEALSRAFLDQGITFSLSGQERPFP 60
Query 96 LDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQ 155
LDLVPRVISA EW RLERGITQRVKALE YLDDIYGDQEIL DGVIPRRLVTSCEHFHRQ
Sbjct 61 LDLVPRVISAAEWARLERGITQRVKALEMYLDDIYGDQEILNDGVIPRRLVTSCEHFHRQ 120
Query 156 AVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFAT 215
A+GIVPPNGVRIHVAGIDLIRD +G+FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFAT
Sbjct 121 AMGIVPPNGVRIHVAGIDLIRDEKGNFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFAT 180
Query 216 HRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGR 275
HRVR+VDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGR
Sbjct 181 HRVRSVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGR 240
Query 276 DLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLS 335
D+FCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV+S
Sbjct 241 DMFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVIS 300
Query 336 SAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPV 395
SAIGNGVGDDKLVYTYVPTMIEYYL EKPLLANVETLRCWLDDEREEVLDRI ELVLKPV
Sbjct 301 SAIGNGVGDDKLVYTYVPTMIEYYLGEKPLLANVETLRCWLDDEREEVLDRIDELVLKPV 360
Query 396 EGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLR 455
EGSGGYGIVFGPEAS+ ELAAV++KIRDDPRSWIAQPMMELSTVPT++ LAPRYVDLR
Sbjct 361 EGSGGYGIVFGPEASEKELAAVAKKIRDDPRSWIAQPMMELSTVPTQVGSALAPRYVDLR 420
Query 456 PFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIV 515
PFAVNDGN+VWVLPGGLTR ALVEGSRVVNSSQGGGSKDTWVLA RASA EL AA++V
Sbjct 421 PFAVNDGNDVWVLPGGLTRTALVEGSRVVNSSQGGGSKDTWVLASRASAGDHELEAAEVV 480
Query 516 RSLPQPLCDPTVDASGYEPHDQQPQQQQQ 544
R+LP + DP +D S QQPQ ++
Sbjct 481 RALPTSMPDPMLDDSP-RLASQQPQPTER 508
>gi|118467439|ref|YP_888839.1| hypothetical protein MSMEG_4570 [Mycobacterium smegmatis str.
MC2 155]
gi|118168726|gb|ABK69622.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=542
Score = 883 bits (2282), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/520 (83%), Positives = 469/520 (91%), Gaps = 0/520 (0%)
Query 11 NETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKAR 70
N T + + +F GYN YA AFDEMFDA G VRGPYKGI+AELAP+DASEL+AR
Sbjct 11 NATGSSARVKQRGVFDGYNKLGHYAKAFDEMFDASGNVRGPYKGIFAELAPTDASELQAR 70
Query 71 ADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIY 130
ADALGRAF DQGITFSLSGQERPFPLDLVPRVISA EW+RLERGI QRVKALE YLDDIY
Sbjct 71 ADALGRAFTDQGITFSLSGQERPFPLDLVPRVISAAEWSRLERGIRQRVKALEMYLDDIY 130
Query 131 GDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLR 190
G+QEILRDGVIPRRLVTSCEHFHR+A GIVPPNGVRIHVAGIDLIRD +GDFRVLEDNLR
Sbjct 131 GEQEILRDGVIPRRLVTSCEHFHREAAGIVPPNGVRIHVAGIDLIRDDKGDFRVLEDNLR 190
Query 191 SPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTP 250
SPSGVSYVMENRRTMARVFPNLFATHRVRAV DYASHLLRALRN+A TN ADPTVVVLTP
Sbjct 191 SPSGVSYVMENRRTMARVFPNLFATHRVRAVGDYASHLLRALRNAAPTNVADPTVVVLTP 250
Query 251 GVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPL 310
GVYNSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDDAFLDP+
Sbjct 251 GVYNSAYFEHSLLARQMGVELVEGRDLFCRDNVVYMRTTEGERQVDVIYRRIDDAFLDPM 310
Query 311 QFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVE 370
QFR DSVLGVAGL+NAARAGNVV+SSA+GNGVGDDKLVYTYVPT+IEYYL EKP+LANV+
Sbjct 311 QFRPDSVLGVAGLLNAARAGNVVISSAVGNGVGDDKLVYTYVPTIIEYYLGEKPVLANVD 370
Query 371 TLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIA 430
T RCWLDDEREEVLDRI ELV+KPVEGSGGYGIVFGP+A+ EL +++KIR+DPR+WIA
Sbjct 371 TYRCWLDDEREEVLDRIEELVIKPVEGSGGYGIVFGPDATPKELTTIAKKIRNDPRAWIA 430
Query 431 QPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGG 490
QP+M+LSTVPT+I+ L PR+VDLRPFAVNDGN+VWVLPGGLTRVAL E S VVNSSQGG
Sbjct 431 QPVMQLSTVPTQIDNKLVPRHVDLRPFAVNDGNDVWVLPGGLTRVALPENSLVVNSSQGG 490
Query 491 GSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDAS 530
GSKDTWVLA RAS A REL AA++VR+LP+ P D++
Sbjct 491 GSKDTWVLASRASVADRELAAAEVVRALPKSGRGPKADSA 530
>gi|108800475|ref|YP_640672.1| hypothetical protein Mmcs_3509 [Mycobacterium sp. MCS]
gi|119869613|ref|YP_939565.1| hypothetical protein Mkms_3581 [Mycobacterium sp. KMS]
gi|126436098|ref|YP_001071789.1| hypothetical protein Mjls_3521 [Mycobacterium sp. JLS]
gi|108770894|gb|ABG09616.1| protein of unknown function DUF404 [Mycobacterium sp. MCS]
gi|119695702|gb|ABL92775.1| protein of unknown function DUF404 [Mycobacterium sp. KMS]
gi|126235898|gb|ABN99298.1| protein of unknown function DUF404 [Mycobacterium sp. JLS]
Length=567
Score = 882 bits (2279), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/533 (83%), Positives = 473/533 (89%), Gaps = 6/533 (1%)
Query 4 VSLPNQLNETRRRSPTRGER------IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYA 57
VSL E+ R S T G R IF GYN+ Y AFDEMFD QG VRGPYKGI+A
Sbjct 18 VSLRTLPTESSRSSRTNGARTKRHEGIFDGYNSVGGYDKAFDEMFDPQGNVRGPYKGIFA 77
Query 58 ELAPSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQ 117
EL P+DAS+L+ARADAL RAFI+QGITFSLSGQERP PLDLVPRVISA EWTRLERGITQ
Sbjct 78 ELEPADASDLQARADALDRAFINQGITFSLSGQERPLPLDLVPRVISAAEWTRLERGITQ 137
Query 118 RVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRD 177
RV+ALE YLDDIYG+Q ILRDGVIPRRLVTSCEHFHR+AVGI PPNGVRIHVAGIDLIRD
Sbjct 138 RVRALEAYLDDIYGEQHILRDGVIPRRLVTSCEHFHREAVGISPPNGVRIHVAGIDLIRD 197
Query 178 HRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAA 237
G FRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRN+AA
Sbjct 198 EHGSFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNAAA 257
Query 238 TNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDV 297
+NEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTT GERQVDV
Sbjct 258 SNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNTVYMRTTAGERQVDV 317
Query 298 IYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIE 357
IYRRIDDAFLDP+QFR DSVLGVAGL+NAARAGNVV+SSA+GNGVGDDKLVYTYVPT+IE
Sbjct 318 IYRRIDDAFLDPMQFRPDSVLGVAGLLNAARAGNVVISSAVGNGVGDDKLVYTYVPTIIE 377
Query 358 YYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAV 417
YYL EKPLLANV+T RCWLD+EREEVLDR+ ELV+KPVEGSGGYGIVFGP+AS EL +
Sbjct 378 YYLGEKPLLANVDTYRCWLDEEREEVLDRVTELVIKPVEGSGGYGIVFGPDASDKELNTI 437
Query 418 SQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVAL 477
+KIR+DPR WIAQP+M+LSTVPT+I G LAPR+VDLRPFAVNDG++VWVLPGGLTRVAL
Sbjct 438 CKKIRNDPRGWIAQPVMQLSTVPTQIGGKLAPRHVDLRPFAVNDGDDVWVLPGGLTRVAL 497
Query 478 VEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDAS 530
EGS VVNSSQGGGSKDTWVLA RASAA REL AA++VRSLP+ VD +
Sbjct 498 PEGSLVVNSSQGGGSKDTWVLASRASAADRELAAAEVVRSLPKSAKANKVDKN 550
>gi|145223253|ref|YP_001133931.1| hypothetical protein Mflv_2666 [Mycobacterium gilvum PYR-GCK]
gi|315443713|ref|YP_004076592.1| hypothetical protein Mspyr1_21030 [Mycobacterium sp. Spyr1]
gi|145215739|gb|ABP45143.1| protein of unknown function DUF404 [Mycobacterium gilvum PYR-GCK]
gi|315262016|gb|ADT98757.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=544
Score = 877 bits (2266), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/515 (83%), Positives = 466/515 (91%), Gaps = 0/515 (0%)
Query 16 RSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALG 75
R+ R + +FGGYN Y+ AFDEMFDAQG VRGPYKGI+ EL P+D S+L+ARA+ALG
Sbjct 15 RNAKRHDGVFGGYNKLGSYSQAFDEMFDAQGNVRGPYKGIHKELGPADVSDLEARAEALG 74
Query 76 RAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEI 135
RAF DQGITFSLSGQERPFPLDLVPRVISA EWTRLERGI QRV+ALE YLDDIYG+QEI
Sbjct 75 RAFTDQGITFSLSGQERPFPLDLVPRVISAAEWTRLERGIRQRVQALEMYLDDIYGEQEI 134
Query 136 LRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGV 195
LRDGVIPRRL+TSCEHFHR+AVGIVPPNGVRIHVAGIDLIRD +G+FRVLEDNLRSPSGV
Sbjct 135 LRDGVIPRRLITSCEHFHREAVGIVPPNGVRIHVAGIDLIRDEQGNFRVLEDNLRSPSGV 194
Query 196 SYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNS 255
SYVMENRRTMARVFPNLFATHRVRAV DYASHLLRALRN+AA N ADPTVVVLTPGVYNS
Sbjct 195 SYVMENRRTMARVFPNLFATHRVRAVGDYASHLLRALRNAAANNVADPTVVVLTPGVYNS 254
Query 256 AYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRAD 315
AYFEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD FLDP+ F+ D
Sbjct 255 AYFEHSLLARQMGVELVEGRDLFCRDNAVYMRTTEGERQVDVIYRRIDDEFLDPMVFKPD 314
Query 316 SVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCW 375
SVLGVAG++NAARAGNVV+SSA+GNGVGDDKLVYTYVPT+IEYYL EKPLLANV+T RCW
Sbjct 315 SVLGVAGILNAARAGNVVISSAVGNGVGDDKLVYTYVPTIIEYYLGEKPLLANVDTFRCW 374
Query 376 LDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMME 435
LDDEREEVLDR+ ELV+KPVEGSGGYGIVFGP+AS+ ELA +++KI DPR WIAQP+M+
Sbjct 375 LDDEREEVLDRVDELVIKPVEGSGGYGIVFGPDASEKELATITKKIIADPRGWIAQPVMQ 434
Query 436 LSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDT 495
LSTVPT+I +LAPR+VDLRPFAVNDGN+VWVLPGGLTRVAL EGS VVNSSQGGGSKDT
Sbjct 435 LSTVPTQIGDSLAPRHVDLRPFAVNDGNDVWVLPGGLTRVALPEGSLVVNSSQGGGSKDT 494
Query 496 WVLAPRASAAARELGAAQIVRSLPQPLCDPTVDAS 530
WVLA R S A REL AA++VRSLP+ TV S
Sbjct 495 WVLASRTSVADRELAAAEVVRSLPKAPSSKTVGKS 529
>gi|120404865|ref|YP_954694.1| hypothetical protein Mvan_3911 [Mycobacterium vanbaalenii PYR-1]
gi|119957683|gb|ABM14688.1| protein of unknown function DUF404 [Mycobacterium vanbaalenii
PYR-1]
Length=519
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/496 (86%), Positives = 461/496 (93%), Gaps = 0/496 (0%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+FGGYN Y+ AFDEMFDAQG VRGPYKGI+ ELAPSDASEL+AR+DALGRAF DQGI
Sbjct 1 MFGGYNKLGSYSQAFDEMFDAQGNVRGPYKGIHKELAPSDASELEARSDALGRAFTDQGI 60
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TFSLSGQERPFPLDLVPRVISA EWTRLERGI QRV+ALE YLDDIYG+QEILRDGVIPR
Sbjct 61 TFSLSGQERPFPLDLVPRVISAAEWTRLERGIRQRVQALEMYLDDIYGEQEILRDGVIPR 120
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RL+TSCEHFHR+AVGIVPPNGVRIHVAGIDLIRD +G+FRVLEDNLRSPSGVSYVMENRR
Sbjct 121 RLITSCEHFHREAVGIVPPNGVRIHVAGIDLIRDAQGNFRVLEDNLRSPSGVSYVMENRR 180
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
TMARVFPNLFATHRVRAV DY+SHLLRALRN+AA+N ADPTVVVLTPGVYNSAYFEHSLL
Sbjct 181 TMARVFPNLFATHRVRAVGDYSSHLLRALRNAAASNVADPTVVVLTPGVYNSAYFEHSLL 240
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD FLDP+QF+ DSVLGVAG+
Sbjct 241 ARQMGVELVEGRDLFCRDNFVYMRTTEGERQVDVIYRRIDDDFLDPMQFKPDSVLGVAGI 300
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+NAARAGNVV+SSA+GNGVGDDKLVYTYVPT+IEYYL EKPLLANV+T RCWLDDEREEV
Sbjct 301 LNAARAGNVVISSAVGNGVGDDKLVYTYVPTIIEYYLGEKPLLANVDTFRCWLDDEREEV 360
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LDR+ ELV+KPVEGSGGYGIVFGP+AS ELAA+++KI DPR WIAQP+++LSTVPT+I
Sbjct 361 LDRVGELVIKPVEGSGGYGIVFGPDASDRELAAITKKIIADPRGWIAQPVVQLSTVPTQI 420
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRAS 503
LAPR+VDLRPFAVNDG+EVWVLPGGLTRVAL EGS VVNSSQGGGSKDTWVLA R S
Sbjct 421 GDELAPRHVDLRPFAVNDGDEVWVLPGGLTRVALPEGSLVVNSSQGGGSKDTWVLASRTS 480
Query 504 AAARELGAAQIVRSLP 519
A REL AA++VRSLP
Sbjct 481 IADRELAAAEVVRSLP 496
>gi|169628725|ref|YP_001702374.1| hypothetical protein MAB_1635 [Mycobacterium abscessus ATCC 19977]
gi|169240692|emb|CAM61720.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=556
Score = 833 bits (2151), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/505 (83%), Positives = 459/505 (91%), Gaps = 4/505 (0%)
Query 20 RGERIFGGY----NTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALG 75
R ++IFGGY + Y+ AFDEMFDA G VRGPYKGIYAELAP+DA++L ARADALG
Sbjct 20 RDDQIFGGYRELVSEKGSYSKAFDEMFDADGNVRGPYKGIYAELAPTDAADLAARADALG 79
Query 76 RAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEI 135
RAFIDQGITFSLSGQERPFPLDLVPRVI+A EW+RLERGI QRV+ALE YL DIYGDQEI
Sbjct 80 RAFIDQGITFSLSGQERPFPLDLVPRVIAAAEWSRLERGIAQRVRALEMYLADIYGDQEI 139
Query 136 LRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGV 195
LRD VIPRRLVTSCEHFHR+A GI PPNGVRIHVAGIDL+RD +G FRVLEDNLRSPSGV
Sbjct 140 LRDEVIPRRLVTSCEHFHREAAGINPPNGVRIHVAGIDLVRDAQGTFRVLEDNLRSPSGV 199
Query 196 SYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNS 255
SYVMENRRTMARVFP+LFATHRVRAVDDY+SHLLRALR SAATNEADPTVVVLTPGV NS
Sbjct 200 SYVMENRRTMARVFPDLFATHRVRAVDDYSSHLLRALRKSAATNEADPTVVVLTPGVANS 259
Query 256 AYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRAD 315
AYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDD +LDP+QFR D
Sbjct 260 AYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDTYLDPMQFRPD 319
Query 316 SVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCW 375
SVLGVAGL+NAARAGNVV+SSA+GNGVGDDKLVYTYVPT+IEYYL EKP++ANV+T RCW
Sbjct 320 SVLGVAGLLNAARAGNVVISSAVGNGVGDDKLVYTYVPTIIEYYLGEKPIVANVDTFRCW 379
Query 376 LDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMME 435
LD+EREEVLDR+ LV+KPVEGSGGYGIVFGP+AS+ E AA+++KI+ DPR W+AQP+++
Sbjct 380 LDEEREEVLDRLEHLVIKPVEGSGGYGIVFGPDASEKERAAIAKKIKADPRGWVAQPVVQ 439
Query 436 LSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDT 495
LSTVPT+I+ L PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS VVNSSQGGGSKDT
Sbjct 440 LSTVPTKIDDQLVPRHVDLRPFAVNDGDDVWVLPGGLTRVALPEGSLVVNSSQGGGSKDT 499
Query 496 WVLAPRASAAARELGAAQIVRSLPQ 520
WVLA RAS A REL A++V LPQ
Sbjct 500 WVLASRASVAERELAGAELVSELPQ 524
>gi|226360418|ref|YP_002778196.1| hypothetical protein ROP_10040 [Rhodococcus opacus B4]
gi|226238903|dbj|BAH49251.1| hypothetical protein [Rhodococcus opacus B4]
Length=541
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/507 (77%), Positives = 440/507 (87%), Gaps = 0/507 (0%)
Query 14 RRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADA 73
R R P +F GY Y +AFDEMFD G VR PYKG++ L P+D ++L AR+DA
Sbjct 4 RPRKPAEPAHVFDGYTDIGRYGLAFDEMFDRDGTVRPPYKGVFKALEPADRADLAARSDA 63
Query 74 LGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQ 133
LGRAFIDQG+TFSLSGQERPFPLDLVPRVI+A EWTRLE+GI QRV+ALE +LDD+YG+Q
Sbjct 64 LGRAFIDQGVTFSLSGQERPFPLDLVPRVIAAAEWTRLEKGIKQRVQALEMFLDDVYGEQ 123
Query 134 EILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPS 193
ILRD V+P+RLVTSCEHFHR+A GIVPPNGVRIHVAGIDL+RD G FRVLEDNLRSPS
Sbjct 124 RILRDHVLPKRLVTSCEHFHREASGIVPPNGVRIHVAGIDLVRDENGVFRVLEDNLRSPS 183
Query 194 GVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVY 253
GVSYVMENRRTMARVFP+LF +HRVR+V DYASHLLRALR SAA NEADPTVVVLTPGV
Sbjct 184 GVSYVMENRRTMARVFPDLFMSHRVRSVGDYASHLLRALRASAALNEADPTVVVLTPGVA 243
Query 254 NSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFR 313
NSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD +LDP+ FR
Sbjct 244 NSAYFEHSLLARQMGVELVEGRDLFCRDNMVYMRTTEGERQVDVIYRRIDDDYLDPMHFR 303
Query 314 ADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLR 373
DSVLGVAG++NAARAGNVV+SSA+GNGVGDDKLVYTYVP +I+YYL EKPLLANV+T R
Sbjct 304 PDSVLGVAGVLNAARAGNVVISSAVGNGVGDDKLVYTYVPQIIDYYLGEKPLLANVDTFR 363
Query 374 CWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPM 433
CWLD+EREEVLDR+ ELV+KPVEGSGGYGIVFGP+AS EL +++KI+ DPR WIAQP+
Sbjct 364 CWLDEEREEVLDRVGELVIKPVEGSGGYGIVFGPDASPKELNTITRKIKADPRGWIAQPV 423
Query 434 MELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSK 493
++LSTVPT++ L PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS VVNSSQGGGSK
Sbjct 424 VQLSTVPTKVGDELVPRHVDLRPFAVNDGDDVWVLPGGLTRVALPEGSLVVNSSQGGGSK 483
Query 494 DTWVLAPRASAAARELGAAQIVRSLPQ 520
DTWVLA R+S REL ++V + Q
Sbjct 484 DTWVLASRSSDEERELAGEELVAAPAQ 510
>gi|111018294|ref|YP_701266.1| hypothetical protein RHA1_ro01284 [Rhodococcus jostii RHA1]
gi|110817824|gb|ABG93108.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=599
Score = 804 bits (2076), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/502 (78%), Positives = 437/502 (88%), Gaps = 0/502 (0%)
Query 14 RRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADA 73
R R P +F GY Y +AFDEMFD G VR PYKG++ L P+D ++L AR+DA
Sbjct 60 RPRKPAEPAHVFDGYTDVGRYGLAFDEMFDRDGTVRPPYKGVFKALEPADRADLAARSDA 119
Query 74 LGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQ 133
LGRAFIDQG+TFSLSGQERPFPLDLVPRVI+A EWTRLE+GI QRV+ALE +LDD+YG+Q
Sbjct 120 LGRAFIDQGVTFSLSGQERPFPLDLVPRVIAAAEWTRLEKGIKQRVQALEMFLDDVYGEQ 179
Query 134 EILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPS 193
ILRD V+P+RLVTSCEHFHR+A GIVPPNGVRIHVAGIDL+RD G FRVLEDNLRSPS
Sbjct 180 RILRDHVLPKRLVTSCEHFHREASGIVPPNGVRIHVAGIDLVRDENGVFRVLEDNLRSPS 239
Query 194 GVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVY 253
GVSYVMENRRTMARVFP+LF +HRVR+V DYASHLLRALR SAA NEADPTVVVLTPGV
Sbjct 240 GVSYVMENRRTMARVFPDLFMSHRVRSVGDYASHLLRALRASAALNEADPTVVVLTPGVA 299
Query 254 NSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFR 313
NSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD +LDP+ FR
Sbjct 300 NSAYFEHSLLARQMGVELVEGRDLFCRDNMVYMRTTEGERQVDVIYRRIDDDYLDPMHFR 359
Query 314 ADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLR 373
DSVLGVAG++NAARAGNVV+SSA+GNGVGDDKLVYTYVP +I+YYL EKPLLANV+T R
Sbjct 360 PDSVLGVAGVLNAARAGNVVISSAVGNGVGDDKLVYTYVPQIIDYYLGEKPLLANVDTFR 419
Query 374 CWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPM 433
CWLD+E EEVLDR+ ELV+KPVEGSGGYGIVFGP+AS EL +++KI+ DPR WIAQP+
Sbjct 420 CWLDEECEEVLDRVDELVIKPVEGSGGYGIVFGPDASPKELNTIARKIKADPRGWIAQPV 479
Query 434 MELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSK 493
++LSTVPT++ L PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS VVNSSQGGGSK
Sbjct 480 VQLSTVPTKVGDELVPRHVDLRPFAVNDGDDVWVLPGGLTRVALPEGSLVVNSSQGGGSK 539
Query 494 DTWVLAPRASAAARELGAAQIV 515
DTWVLA R+S REL ++V
Sbjct 540 DTWVLASRSSDEERELAGEELV 561
>gi|333918819|ref|YP_004492400.1| hypothetical protein AS9A_1148 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481040|gb|AEF39600.1| hypothetical protein AS9A_1148 [Amycolicicoccus subflavus DQS3-9A1]
Length=552
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/546 (73%), Positives = 458/546 (84%), Gaps = 11/546 (2%)
Query 5 SLPNQLNETRRRSPTRGERIFGGYNTS----DVYAMAFDEMFDAQGIVRGPYKGIYAELA 60
+L L+++ + GE +FGGY + + YA+A DEMFD +G VR YKGI+ LA
Sbjct 13 ALYEALHKSDDPASGNGEYVFGGYADTGPHYEHYALAHDEMFDGEGNVRSAYKGIFKALA 72
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
P+ A++L ARADALGRAF+DQGITFSLSGQERPFPLDL+PRVI+A EWT+LERGI QRV+
Sbjct 73 PATANDLAARADALGRAFLDQGITFSLSGQERPFPLDLIPRVIAAGEWTKLERGIKQRVQ 132
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALE +LDD+YG+Q ILRDGV+PRRL+TSC+HFHR+A GIVPPN VRIHVAGIDLIRD G
Sbjct 133 ALELFLDDVYGEQNILRDGVLPRRLITSCQHFHREAAGIVPPNEVRIHVAGIDLIRDDYG 192
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
FRVLEDNLRSPSGVSYV+ENRRTM RVFP+LFA+HRVRAV DY ++LLRALRNSAA NE
Sbjct 193 TFRVLEDNLRSPSGVSYVLENRRTMTRVFPDLFASHRVRAVADYPAYLLRALRNSAALNE 252
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPGV NSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGE+QVDVIYR
Sbjct 253 ADPTVVVLTPGVANSAYFEHSLLARQMGVELVEGRDLFCRDNIVYMRTTEGEQQVDVIYR 312
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDD FLDPLQFR +SVLGV G++NAARAGNVV+SSA+GNGVGDDKL+YTYVPT+IEYYL
Sbjct 313 RIDDEFLDPLQFRPNSVLGVPGILNAARAGNVVISSAVGNGVGDDKLIYTYVPTIIEYYL 372
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
+EKP L NV+T RCW+ +E EEVLDRI ELV+KPVEGSGGYGIVFGP+AS A+L +S++
Sbjct 373 NEKPSLPNVDTFRCWIPEELEEVLDRIDELVVKPVEGSGGYGIVFGPDASPAQLKKLSRQ 432
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
+RD PR WIAQP+++LSTVPT+ LAPR+VDLRPFAVNDG++VWVLPGGLTRVAL EG
Sbjct 433 LRDSPRDWIAQPVVQLSTVPTKSGDELAPRHVDLRPFAVNDGDDVWVLPGGLTRVALTEG 492
Query 481 SRVVNSSQGGGSKDTWVLA-PRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQP 539
S VVNSSQGGGSKDTWVLA R++A REL ++V + T + P
Sbjct 493 SLVVNSSQGGGSKDTWVLATTRSAAQDRELAGEELVSEV------KTAHKAETGPELAID 546
Query 540 QQQQQQ 545
Q+QQQQ
Sbjct 547 QEQQQQ 552
>gi|229493138|ref|ZP_04386930.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229319869|gb|EEN85698.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=541
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/510 (76%), Positives = 442/510 (87%), Gaps = 3/510 (0%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY+ Y +AFDEMF+ G VRGPYKG+Y LAP+ +++L ARADALGRAFIDQG+
Sbjct 24 VFDGYSDIGRYELAFDEMFEPDGSVRGPYKGVYKALAPTSSADLAARADALGRAFIDQGV 83
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TFSLSGQERPFPLDLVPRVI+A EW+RLE+GI QRVKALE +L DIYG+Q ILRD V+PR
Sbjct 84 TFSLSGQERPFPLDLVPRVIAAQEWSRLEKGIKQRVKALELFLADIYGEQRILRDHVLPR 143
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLVTSCEHFHR+A GIVPPNGVRIHVAGIDL+RD G+FRVLEDNLRSPSGVSYVMENRR
Sbjct 144 RLVTSCEHFHREAAGIVPPNGVRIHVAGIDLVRDEAGEFRVLEDNLRSPSGVSYVMENRR 203
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
TM RVFP+LF +H+VRAV DYA+HLLRALR AA NEADPTVVVLTPG+ NSAYFEHSLL
Sbjct 204 TMTRVFPDLFMSHKVRAVGDYATHLLRALRAGAALNEADPTVVVLTPGIANSAYFEHSLL 263
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD +LDP+ FR DS+LGVAGL
Sbjct 264 ARQMGVELVEGRDLFCRDNMVYMRTTEGERQVDVIYRRIDDEYLDPMHFRPDSILGVAGL 323
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+NAARAGNVV+SSA+GNGVGDDKLVYTYVP +I+YYL EKPLL NV+T RCWLD+E E+V
Sbjct 324 LNAARAGNVVISSAVGNGVGDDKLVYTYVPQIIDYYLGEKPLLQNVDTFRCWLDEECEQV 383
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LDR+ ELV+KPVEGSGGYGIVFGP+AS ELAA+++KI+ DPR WIAQP+++LSTVPT+I
Sbjct 384 LDRVAELVIKPVEGSGGYGIVFGPDASPKELAAITRKIKADPRGWIAQPLVQLSTVPTKI 443
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRAS 503
+ L+PR+VDLRPFAVNDG +VWVLPGGLTRVAL EGS VVNSSQGGGSKDTWVLA R S
Sbjct 444 DDVLSPRHVDLRPFAVNDGEDVWVLPGGLTRVALPEGSLVVNSSQGGGSKDTWVLASRTS 503
Query 504 AAARELGAAQIVRSLP---QPLCDPTVDAS 530
EL ++V P +P+ P + S
Sbjct 504 DEDPELSGEELVSEPPESAEPVQGPELSTS 533
>gi|226307253|ref|YP_002767213.1| hypothetical protein RER_37660 [Rhodococcus erythropolis PR4]
gi|226186370|dbj|BAH34474.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=540
Score = 784 bits (2024), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/510 (76%), Positives = 442/510 (87%), Gaps = 3/510 (0%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY+ Y +AFDEMF+ G VR PYKG+Y LAP+ +++L ARADALGRAFIDQG+
Sbjct 24 VFDGYSDIGRYELAFDEMFEPDGSVRAPYKGVYKALAPTSSADLAARADALGRAFIDQGV 83
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TFSLSGQERPFPLDLVPRVI+A EW+RLE+GI QRVKALE +L DIYG+Q ILRD V+PR
Sbjct 84 TFSLSGQERPFPLDLVPRVIAAQEWSRLEKGIKQRVKALELFLADIYGEQRILRDHVLPR 143
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLVTSCEHFHR+A GIVPPNGVRIHVAGIDL+RD G+FRVLEDNLRSPSGVSYVMENRR
Sbjct 144 RLVTSCEHFHREAAGIVPPNGVRIHVAGIDLVRDEAGEFRVLEDNLRSPSGVSYVMENRR 203
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
TM RVFP+LF +H+VRAV DYA+HLLRALR AA NEADPTVVVLTPG+ NSAYFEHSLL
Sbjct 204 TMTRVFPDLFMSHKVRAVGDYATHLLRALRAGAALNEADPTVVVLTPGIANSAYFEHSLL 263
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDLFCRDN VYMRTTEGERQVDVIYRRIDD +LDP+ FR DS+LGVAGL
Sbjct 264 ARQMGVELVEGRDLFCRDNMVYMRTTEGERQVDVIYRRIDDEYLDPMHFRPDSILGVAGL 323
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+NAARAGNVV+SSA+GNGVGDDKLVYTYVP +I+YYL EKPLL NV+T RCWLD+E E+V
Sbjct 324 LNAARAGNVVISSAVGNGVGDDKLVYTYVPQIIDYYLGEKPLLQNVDTFRCWLDEECEQV 383
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LDR+ ELV+KPVEGSGGYGIVFGP+AS ELAA+++KI+ DPR WIAQP+++LSTVPT+I
Sbjct 384 LDRVAELVIKPVEGSGGYGIVFGPDASPKELAAITRKIKADPRGWIAQPLVQLSTVPTKI 443
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRAS 503
+ L+PR+VDLRPFAVNDG +VWVLPGGLTRVAL EGS VVNSSQGGGSKDTWVLA R+S
Sbjct 444 DDVLSPRHVDLRPFAVNDGEDVWVLPGGLTRVALPEGSLVVNSSQGGGSKDTWVLASRSS 503
Query 504 AAARELGAAQIVRSLP---QPLCDPTVDAS 530
EL ++V P +P+ P + S
Sbjct 504 DEDPELSGEELVSEPPESAEPVQGPELSTS 533
>gi|262203073|ref|YP_003274281.1| hypothetical protein Gbro_3183 [Gordonia bronchialis DSM 43247]
gi|262086420|gb|ACY22388.1| protein of unknown function DUF404 [Gordonia bronchialis DSM
43247]
Length=606
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/509 (75%), Positives = 436/509 (86%), Gaps = 6/509 (1%)
Query 12 ETRRRSPTRGE-----RIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASE 66
++R ++ + GE +F GY +S Y A+DEMFD+ G VR PY+GI+ + + ++
Sbjct 49 QSRSKAASDGEVPSATGLFAGY-SSGPYGRAYDEMFDSSGDVRTPYRGIHKSMGRQERAD 107
Query 67 LKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYL 126
L+ R +ALG A++DQG+TFSLSG+ERPFPLD+VPRVISA EW +LE G+TQRV+ALE +L
Sbjct 108 LETRVEALGNAYLDQGVTFSLSGKERPFPLDVVPRVISAAEWNKLEAGVTQRVQALELFL 167
Query 127 DDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLE 186
DDIYG+QEILRDGV+P+RLV SCEHFHRQA I PPNGVRIHVAGIDLIRD GDFRVLE
Sbjct 168 DDIYGEQEILRDGVLPKRLVHSCEHFHRQAANIKPPNGVRIHVAGIDLIRDENGDFRVLE 227
Query 187 DNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVV 246
DNLRSPSGVSYV+ENRR MARVFP+LFATHRVRAV DY SHLLRALR SAA NEADP +V
Sbjct 228 DNLRSPSGVSYVLENRRAMARVFPDLFATHRVRAVADYPSHLLRALRASAAFNEADPNIV 287
Query 247 VLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAF 306
VLTPGV NSAYFEHSLLAR MGVELVEGRDLFCRDN VYMRTTEGE++VDVIYRRIDD F
Sbjct 288 VLTPGVANSAYFEHSLLARLMGVELVEGRDLFCRDNVVYMRTTEGEQRVDVIYRRIDDDF 347
Query 307 LDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLL 366
LDP+QFR DS+LGVAGL+NAARAGNVV+SSA+GNGVGDDKL+YTYVP +IEYYL EKP L
Sbjct 348 LDPMQFRPDSMLGVAGLLNAARAGNVVISSAVGNGVGDDKLIYTYVPEIIEYYLGEKPSL 407
Query 367 ANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPR 426
NV+TLRCWLD E EEVLDRI ELV+KPVEGSGGYGIVFGP+AS+AEL A+++K+R DPR
Sbjct 408 QNVDTLRCWLDHECEEVLDRIDELVVKPVEGSGGYGIVFGPDASKAELDALARKVRSDPR 467
Query 427 SWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNS 486
WIAQP+++LSTVPT+I L PR+VDLRPFAVNDG VWVLPGGLTRVAL EGS VVNS
Sbjct 468 GWIAQPVVQLSTVPTKIGDDLRPRHVDLRPFAVNDGESVWVLPGGLTRVALPEGSLVVNS 527
Query 487 SQGGGSKDTWVLAPRASAAARELGAAQIV 515
SQGGGSKDTWVLA R S RE+ A++V
Sbjct 528 SQGGGSKDTWVLASRGSEDEREMSGAKVV 556
>gi|296140493|ref|YP_003647736.1| hypothetical protein Tpau_2799 [Tsukamurella paurometabola DSM
20162]
gi|296028627|gb|ADG79397.1| protein of unknown function DUF404 [Tsukamurella paurometabola
DSM 20162]
Length=553
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/510 (76%), Positives = 433/510 (85%), Gaps = 3/510 (0%)
Query 11 NETRRRSPTR---GERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASEL 67
+ TRR + R R+F GY+ + + AFDEMF G VR PYK ++ L+ +D S+L
Sbjct 9 SHTRRPAAPRLSDDARLFAGYDDAPSFGAAFDEMFADDGTVRSPYKRVFEALSSADESDL 68
Query 68 KARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLD 127
AR DALG AFIDQG+TFSL G+ERPFPLDLVPRVI+A EW RLE+GI QRV+ALE +LD
Sbjct 69 AARVDALGAAFIDQGVTFSLEGRERPFPLDLVPRVIAAGEWNRLEKGIKQRVRALEMFLD 128
Query 128 DIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLED 187
DIY +QEILRD V+P+RLVTSC HFHRQA GI PPNGVRIHVAGIDLIRD G FRVLED
Sbjct 129 DIYSEQEILRDQVVPKRLVTSCAHFHRQAAGIRPPNGVRIHVAGIDLIRDAEGTFRVLED 188
Query 188 NLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVV 247
NLRSPSGVSYVMENRRTMA+VFP+LF HRVRAV DY+SHLLRALR SAA+NEADPTVVV
Sbjct 189 NLRSPSGVSYVMENRRTMAQVFPDLFLRHRVRAVGDYSSHLLRALRRSAASNEADPTVVV 248
Query 248 LTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFL 307
LTPG+ NSAYFEHSLLARQMGVELVEGRDLFCRDN VYMRTT GE+QVDVIYRRIDD FL
Sbjct 249 LTPGMANSAYFEHSLLARQMGVELVEGRDLFCRDNVVYMRTTGGEQQVDVIYRRIDDDFL 308
Query 308 DPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLA 367
DP+QFR DSVLGVAGL+NAARAGNVV+SSA+GNGVGDDKL YTYVP +I+YYL EKPLL
Sbjct 309 DPMQFRPDSVLGVAGLLNAARAGNVVISSAVGNGVGDDKLTYTYVPEIIDYYLGEKPLLQ 368
Query 368 NVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRS 427
NV+TLRCWLD+EREEVLDRI ELV+KPVEGSGGYGIVFGP+AS ELA + +K+ DPR
Sbjct 369 NVDTLRCWLDEEREEVLDRIDELVIKPVEGSGGYGIVFGPDASDKELATMRRKVAADPRG 428
Query 428 WIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSS 487
WIAQP+++LSTVPT+I + PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS VVNSS
Sbjct 429 WIAQPVVQLSTVPTKIGESARPRHVDLRPFAVNDGDDVWVLPGGLTRVALPEGSLVVNSS 488
Query 488 QGGGSKDTWVLAPRASAAARELGAAQIVRS 517
QGGGSKDTWVLA R+S A EL +V S
Sbjct 489 QGGGSKDTWVLAARSSVAEAELEGEALVPS 518
>gi|317507854|ref|ZP_07965555.1| hypothetical protein HMPREF9336_01927 [Segniliparus rugosus ATCC
BAA-974]
gi|316253896|gb|EFV13265.1| hypothetical protein HMPREF9336_01927 [Segniliparus rugosus ATCC
BAA-974]
Length=556
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/480 (78%), Positives = 421/480 (88%), Gaps = 0/480 (0%)
Query 25 FGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGIT 84
F GY S + FDEMF+ G R PY+G++ L P D+ +L ARADALGRAFI+QGIT
Sbjct 71 FDGYAESPGFEKNFDEMFEQDGSSRAPYRGVFQALEPLDSDDLTARADALGRAFINQGIT 130
Query 85 FSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRR 144
FSLSGQERPFPLDL+PRVI+A EWT+LERGITQRV+ALE +LDD+YGDQEILRDGVIPR
Sbjct 131 FSLSGQERPFPLDLIPRVIAAAEWTKLERGITQRVRALEAFLDDVYGDQEILRDGVIPRA 190
Query 145 LVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRT 204
L+ SC+HFHRQA GI PPNGVRIHVAGID+IRD +G FRVLEDNLR+PSGVSYVMENRRT
Sbjct 191 LIFSCQHFHRQAAGIRPPNGVRIHVAGIDIIRDGQGTFRVLEDNLRNPSGVSYVMENRRT 250
Query 205 MARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLA 264
M RVFP+LF THRVR VDDY +HLLRALR +A TNE DPTVVVLTPGV NSA+FEHSLLA
Sbjct 251 MTRVFPDLFGTHRVRPVDDYPAHLLRALRAAAPTNEDDPTVVVLTPGVANSAHFEHSLLA 310
Query 265 RQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLV 324
RQMGVELVEGRDLFCRDN VYMRTTEGE QVDVIYRRIDD +LDPLQF+ +S+LGVAG+V
Sbjct 311 RQMGVELVEGRDLFCRDNVVYMRTTEGEVQVDVIYRRIDDEYLDPLQFKPESLLGVAGIV 370
Query 325 NAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVL 384
NAARAGNVV+SSA+GNGVGDDKLVYTYVPTM+EYYL EKPLLANV+T RCW+ +E EE L
Sbjct 371 NAARAGNVVISSAVGNGVGDDKLVYTYVPTMVEYYLGEKPLLANVDTFRCWIPEELEETL 430
Query 385 DRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIE 444
DRI ELVLKPVEGSGGYGIVFGP+AS+ ELA +++K+R +PR WIAQP+M+LSTVPT++
Sbjct 431 DRINELVLKPVEGSGGYGIVFGPDASEKELATLARKVRANPRDWIAQPVMQLSTVPTKVG 490
Query 445 GTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASA 504
++PR+VDLRPFAVNDG VWVLPGGLTRVAL EGS VVNSSQGGGSKDTWVL R+ A
Sbjct 491 DKVSPRHVDLRPFAVNDGENVWVLPGGLTRVALKEGSLVVNSSQGGGSKDTWVLGSRSQA 550
>gi|343926798|ref|ZP_08766291.1| hypothetical protein GOALK_072_00190 [Gordonia alkanivorans NBRC
16433]
gi|343763158|dbj|GAA13217.1| hypothetical protein GOALK_072_00190 [Gordonia alkanivorans NBRC
16433]
Length=502
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/471 (78%), Positives = 418/471 (89%), Gaps = 1/471 (0%)
Query 48 VRGPYKGIYAELAPSDASEL-KARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAP 106
+R PY+GIY ++ D+S+L +AR +ALGRA++DQG+TFSLSGQERPFPLD+VPRVISA
Sbjct 1 MRTPYRGIYKAMSDEDSSDLVEARVEALGRAYLDQGVTFSLSGQERPFPLDIVPRVISAG 60
Query 107 EWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVR 166
EW++LE GITQRV+ALE +LDDIYG+QEILRDGV+P+RLV SCEHFHRQA I PPNGVR
Sbjct 61 EWSKLEAGITQRVQALELFLDDIYGEQEILRDGVLPKRLVHSCEHFHRQAANIRPPNGVR 120
Query 167 IHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYAS 226
IHVAGIDLIRD GDFRVLEDNLRSPSGVSYV+ENRR MARVFP+LF+ HRVRAV DY S
Sbjct 121 IHVAGIDLIRDENGDFRVLEDNLRSPSGVSYVLENRRAMARVFPDLFSKHRVRAVADYPS 180
Query 227 HLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYM 286
HLLRALR SAA NEADP +VVLTPGV NSAYFEHSLLAR MGVELVEGRDLFCRDN VYM
Sbjct 181 HLLRALRASAAFNEADPNIVVLTPGVANSAYFEHSLLARLMGVELVEGRDLFCRDNVVYM 240
Query 287 RTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDK 346
RTTEGE++VDVIYRRIDD FLDP+QFR DS+LGVAGL+NAARAGNVV+SSA+GNGVGDDK
Sbjct 241 RTTEGEQRVDVIYRRIDDDFLDPMQFRPDSMLGVAGLLNAARAGNVVISSAVGNGVGDDK 300
Query 347 LVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFG 406
L+YTYVP +IEYYL EKP L NV+TLRCWL +E EEVLDRI ELV+KPVEGSGGYGIVFG
Sbjct 301 LIYTYVPEIIEYYLGEKPSLQNVDTLRCWLPEECEEVLDRIDELVVKPVEGSGGYGIVFG 360
Query 407 PEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVW 466
PEA++AEL +++K+R+DPR WIAQP+++LSTVPT+I + PR+VDLRPFAVNDG VW
Sbjct 361 PEATKAELDTLARKVRNDPRGWIAQPVVQLSTVPTKIGNEIRPRHVDLRPFAVNDGESVW 420
Query 467 VLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRS 517
VLPGGLTRVAL EGS VVNSSQGGGSKDTWVLA R S A REL A++V +
Sbjct 421 VLPGGLTRVALPEGSLVVNSSQGGGSKDTWVLASRTSEAERELSGAKVVTT 471
>gi|296392908|ref|YP_003657792.1| hypothetical protein Srot_0474 [Segniliparus rotundus DSM 44985]
gi|296180055|gb|ADG96961.1| protein of unknown function DUF404 [Segniliparus rotundus DSM
44985]
Length=528
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/496 (77%), Positives = 428/496 (87%), Gaps = 5/496 (1%)
Query 14 RRRSPTRG-----ERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELK 68
+R+S T+G + F Y S + FDEMF+ G R PY+G++ L P D+ +L
Sbjct 27 QRQSQTQGGGAQLDGAFEDYTESPGFEKNFDEMFEQDGSARAPYRGVFQALEPLDSDDLN 86
Query 69 ARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDD 128
ARA+ALGRAFI+QGITFSLSGQERPFPLDLVPRVI+A EW +LE+GITQRV+ALE +LDD
Sbjct 87 ARAEALGRAFINQGITFSLSGQERPFPLDLVPRVIAAAEWAKLEKGITQRVRALEAFLDD 146
Query 129 IYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDN 188
+YGDQEILRDGVIPR L+ SC+HFHRQA GI PPNGVRIHVAGID+IRD +G FRVLEDN
Sbjct 147 VYGDQEILRDGVIPRSLIFSCQHFHRQASGIRPPNGVRIHVAGIDIIRDGQGTFRVLEDN 206
Query 189 LRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVL 248
LR+PSGVSYVMENRRTM RVFP+LF THRVR VDDY +HLLRALR +AATNE DPTVVVL
Sbjct 207 LRNPSGVSYVMENRRTMTRVFPDLFGTHRVRPVDDYPAHLLRALRAAAATNEDDPTVVVL 266
Query 249 TPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLD 308
TPGV NSA+FEHSLLARQMGVELVEGRDLFCRDN VYMRTTEGE QVDVIYRRIDD +LD
Sbjct 267 TPGVANSAHFEHSLLARQMGVELVEGRDLFCRDNIVYMRTTEGEVQVDVIYRRIDDEYLD 326
Query 309 PLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLAN 368
PLQF+ +S+LGVAG+VNAARAGNVV+SSA+GNGVGDDKLVYTYVP+MIEYYL EKPLLAN
Sbjct 327 PLQFKPESLLGVAGIVNAARAGNVVISSAVGNGVGDDKLVYTYVPSMIEYYLGEKPLLAN 386
Query 369 VETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSW 428
V+T RCW+ E E+ LDRI ELVLKPVEGSGGYGIVFGP+AS+ ELAA+S+KIR +PR W
Sbjct 387 VDTYRCWIPHELEQTLDRINELVLKPVEGSGGYGIVFGPDASEKELAAMSRKIRANPRDW 446
Query 429 IAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQ 488
+AQP+M+LSTVPT+I +APR+VDLRPFAVNDG VWVLPGGLTRVAL EGS VVNSSQ
Sbjct 447 VAQPVMQLSTVPTKIGDKVAPRHVDLRPFAVNDGENVWVLPGGLTRVALKEGSLVVNSSQ 506
Query 489 GGGSKDTWVLAPRASA 504
GGGSKDTWVL R+ A
Sbjct 507 GGGSKDTWVLGNRSQA 522
>gi|256375351|ref|YP_003099011.1| hypothetical protein Amir_1213 [Actinosynnema mirum DSM 43827]
gi|255919654|gb|ACU35165.1| protein of unknown function DUF404 [Actinosynnema mirum DSM 43827]
Length=553
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/510 (74%), Positives = 428/510 (84%), Gaps = 1/510 (0%)
Query 8 NQLNETRRRSPTRGERIFGGY-NTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASE 66
+QL T R+ R +F GY + +A A+DEMF A VR Y+ ++ +APS A E
Sbjct 13 SQLRRTGPRAEARLGELFEGYLDPRRPHAGAYDEMFGADASVRPAYRALHDSIAPSRAPE 72
Query 67 LKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYL 126
L ARA+AL RAF+DQGITFSLSGQERPFPLDL+PRVI+A EW++LERGI QRV+ALE +L
Sbjct 73 LNARAEALDRAFVDQGITFSLSGQERPFPLDLIPRVITAGEWSKLERGIVQRVRALEMFL 132
Query 127 DDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLE 186
DIYGD +I+RDGVIPRRL+TSCEHFHR+A I PPNGVR+HV+G+DL+RD G FRVLE
Sbjct 133 ADIYGDAQIVRDGVIPRRLITSCEHFHREAARISPPNGVRVHVSGVDLVRDEAGVFRVLE 192
Query 187 DNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVV 246
DNLRSPSGVSYVMENRRTMARVFP+LFA HRVR+V DYA HLLRALRNSAA N ADPTVV
Sbjct 193 DNLRSPSGVSYVMENRRTMARVFPDLFAQHRVRSVGDYAVHLLRALRNSAAPNAADPTVV 252
Query 247 VLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAF 306
VLTPGV NSAYFEHSLLARQMGVELVEGRDLFCRDN VY+RTTEGERQVDVIYRRIDD F
Sbjct 253 VLTPGVANSAYFEHSLLARQMGVELVEGRDLFCRDNLVYLRTTEGERQVDVIYRRIDDTF 312
Query 307 LDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLL 366
LDP+ R DSVLGVAGL+NAARAGNVV+++A+GNGV DDKLVYTY+P ++EYYL EKPLL
Sbjct 313 LDPVHLRPDSVLGVAGLLNAARAGNVVIANAVGNGVADDKLVYTYLPEILEYYLGEKPLL 372
Query 367 ANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPR 426
NV+T RCWL DER VLD + ELV+KPVEGSGGYGIVFGP+A+ EL A+ + IR +PR
Sbjct 373 PNVDTYRCWLPDERGHVLDSLAELVVKPVEGSGGYGIVFGPQATTRELNALRRTIRANPR 432
Query 427 SWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNS 486
WIAQP+++LSTVPT+I LAPR+VDLRPFAVNDGN V+VLPGGLTRVAL EGS +VNS
Sbjct 433 GWIAQPVVQLSTVPTKIGDRLAPRHVDLRPFAVNDGNFVFVLPGGLTRVALPEGSLIVNS 492
Query 487 SQGGGSKDTWVLAPRASAAARELGAAQIVR 516
SQGGGSKDTWVLA R+S REL +VR
Sbjct 493 SQGGGSKDTWVLAARSSTVERELAEPGLVR 522
>gi|302531218|ref|ZP_07283560.1| DUF404 domain-containing protein [Streptomyces sp. AA4]
gi|302440113|gb|EFL11929.1| DUF404 domain-containing protein [Streptomyces sp. AA4]
Length=553
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/495 (73%), Positives = 419/495 (85%), Gaps = 0/495 (0%)
Query 15 RRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADAL 74
RR+ G + G +A A+DEMF G VR PY+ +Y +A DAS+L R+ AL
Sbjct 25 RRAARPGAQFDGYLAPERPHAGAYDEMFAPDGTVRAPYRALYGSIAALDASDLTNRSQAL 84
Query 75 GRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQE 134
RA +DQGITFSLSGQERPFPLDLVPRV+ A EW+RLERG+TQRV+ALE +L D+YGD++
Sbjct 85 DRAMVDQGITFSLSGQERPFPLDLVPRVLQATEWSRLERGVTQRVRALEAFLADVYGDRQ 144
Query 135 ILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSG 194
ILRDGV+PRRL+TSCEHFHR+A GI PPNGVRIHV+G+DL+RD G FRVLEDNLR+PSG
Sbjct 145 ILRDGVLPRRLITSCEHFHREAYGIKPPNGVRIHVSGVDLVRDEEGTFRVLEDNLRNPSG 204
Query 195 VSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYN 254
VSYVMENRRTMARVFP+LFA HRVR V DYASHLLRALR +AA N ADP VVVLTPGVYN
Sbjct 205 VSYVMENRRTMARVFPDLFARHRVRPVGDYASHLLRALRAAAAPNVADPMVVVLTPGVYN 264
Query 255 SAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRA 314
SAYFEHSLLAR MGVELVEGRD+FCRDN VY+RTTEGERQVDVIYRRIDD FLDP+ R
Sbjct 265 SAYFEHSLLARLMGVELVEGRDMFCRDNVVYLRTTEGERQVDVIYRRIDDDFLDPVHHRP 324
Query 315 DSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRC 374
DSVLGVAG++NAARAGNVV+++A+GNGVGDDKLVYTYVP M+ YYL+EKP+L NV+T RC
Sbjct 325 DSVLGVAGILNAARAGNVVVANAVGNGVGDDKLVYTYVPEMVRYYLNEKPILPNVDTFRC 384
Query 375 WLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMM 434
WL DE + V+ ELV+KPVEGSGGYGIVFGPEAS+ EL A+ +K+R + R WIAQP++
Sbjct 385 WLPDEFDHVMQHADELVIKPVEGSGGYGIVFGPEASKKELDALRRKVRANRRGWIAQPVV 444
Query 435 ELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKD 494
+LSTVP++++ LAPR+VDLRPFAVNDG E++VLPGGLTRVAL EGS VVNSSQGGGSKD
Sbjct 445 QLSTVPSKVDDRLAPRHVDLRPFAVNDGKEIFVLPGGLTRVALPEGSLVVNSSQGGGSKD 504
Query 495 TWVLAPRASAAAREL 509
TWVLA R+S + +EL
Sbjct 505 TWVLASRSSTSEQEL 519
>gi|300791010|ref|YP_003771301.1| hypothetical protein AMED_9210 [Amycolatopsis mediterranei U32]
gi|299800524|gb|ADJ50899.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532707|gb|AEK47912.1| hypothetical protein RAM_47235 [Amycolatopsis mediterranei S699]
Length=552
Score = 715 bits (1846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/509 (71%), Positives = 425/509 (84%), Gaps = 6/509 (1%)
Query 6 LPNQLNETRRRSPTR----GERIFGGYNTSD-VYAMAFDEMFDAQGIVRGPYKGIYAELA 60
LP R+ + +R G+R F GY + D +A A+DEMF A G VRGPY+ +Y +A
Sbjct 11 LPPSGRRARKAATSRITRPGDR-FEGYLSPDRPHAGAYDEMFAADGSVRGPYRALYESIA 69
Query 61 PSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVK 120
DA +L +R AL RA +DQGITFSLSGQERPFPLDLVPRVI A EWT++ERG+ QRV+
Sbjct 70 ALDAHDLNSRTLALDRAMVDQGITFSLSGQERPFPLDLVPRVIQAAEWTKIERGVAQRVR 129
Query 121 ALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRG 180
ALE +L DIYGD+ ILR+GV+PRRL+TSC HF R+A GI PPNGVRIHV+G+DL+RD G
Sbjct 130 ALEAFLADIYGDRLILREGVLPRRLITSCVHFQREAFGINPPNGVRIHVSGVDLVRDEEG 189
Query 181 DFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE 240
FRVLEDNLR+PSGVSYVMENRRTMARVFP+LFA HRVR V DYASHLLRALR +AA N
Sbjct 190 TFRVLEDNLRNPSGVSYVMENRRTMARVFPDLFAQHRVRPVGDYASHLLRALRAAAAANV 249
Query 241 ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYR 300
ADPTVVVLTPG++NSAYFEHSLLAR MGVELVEGRD+FCRDN VY+RTTEGERQVDVIYR
Sbjct 250 ADPTVVVLTPGIHNSAYFEHSLLARLMGVELVEGRDMFCRDNVVYLRTTEGERQVDVIYR 309
Query 301 RIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYL 360
RIDD FLDP+ +R DSVLGVAG+ NAARAGNVV+++AIGNGVGDDKLVYTYVP M++YYL
Sbjct 310 RIDDEFLDPVHYRPDSVLGVAGVQNAARAGNVVIANAIGNGVGDDKLVYTYVPEMVKYYL 369
Query 361 HEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQK 420
+EKPLL NV+T RCWL DE + V+ + ELV+KPV+GSGGYGIVFGPEA++ +L + +K
Sbjct 370 NEKPLLPNVDTFRCWLPDEFDHVMAHLDELVVKPVDGSGGYGIVFGPEATKKDLDTLRRK 429
Query 421 IRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG 480
+R R WIAQP+++LSTVP +++ LAPR+VDLRPFAVNDG +++VLPGGLTRVAL EG
Sbjct 430 VRAHRRGWIAQPVVQLSTVPAKVDDRLAPRHVDLRPFAVNDGKDIFVLPGGLTRVALPEG 489
Query 481 SRVVNSSQGGGSKDTWVLAPRASAAAREL 509
S VVNSSQGGGSKDTWVLA RAS A REL
Sbjct 490 SLVVNSSQGGGSKDTWVLASRASTAEREL 518
>gi|158313932|ref|YP_001506440.1| hypothetical protein Franean1_2098 [Frankia sp. EAN1pec]
gi|158109337|gb|ABW11534.1| protein of unknown function DUF404 [Frankia sp. EAN1pec]
Length=526
Score = 630 bits (1626), Expect = 1e-178, Method: Compositional matrix adjust.
Identities = 310/466 (67%), Positives = 372/466 (80%), Gaps = 0/466 (0%)
Query 35 AMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPF 94
A A+DE+FDA R Y ++ L P +S+L AR AL RAF D GITF+L G+ERPF
Sbjct 6 AAAWDEVFDAAHRPREVYTALHDALQPLSSSDLAARKIALDRAFRDAGITFNLFGEERPF 65
Query 95 PLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHR 154
PLDLVPR++S EW +ERG+TQRV+ALE +LDD+YG ++L DG++PRRLV S HFHR
Sbjct 66 PLDLVPRLLSCDEWDVIERGVTQRVRALEAFLDDVYGRADVLADGIVPRRLVLSSSHFHR 125
Query 155 QAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFA 214
A GI PPNGVR HV+GIDL+RD RGDFRVLEDN+R PSGVSYV+ENRR M RVFP LF+
Sbjct 126 AAHGIDPPNGVRAHVSGIDLVRDERGDFRVLEDNVRVPSGVSYVIENRRAMTRVFPELFS 185
Query 215 THRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEG 274
THRVR V DYA+HLL ALR +A ADPTVVVLTPGVYNSAYFEH+LLARQMGVELVEG
Sbjct 186 THRVRPVADYATHLLHALRAAAPPEVADPTVVVLTPGVYNSAYFEHALLARQMGVELVEG 245
Query 275 RDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVL 334
RDL R+N+V MRTTEG++ V V+YRR+DD +LDPL FR +S++G AGL+NAARAGNV +
Sbjct 246 RDLSVRNNRVTMRTTEGDQPVHVVYRRVDDDWLDPLHFRPESMVGCAGLLNAARAGNVTI 305
Query 335 SSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKP 394
++A+GNGV DDKL+YTYVP +I YYL E+P L NV+T R D+R VLD + LV+KP
Sbjct 306 ANAVGNGVADDKLMYTYVPDLIRYYLGEEPALGNVDTFRLEDPDQRAHVLDNLESLVVKP 365
Query 395 VEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDL 454
V+GSGG GIV GP+A++AEL A+ ++ DPR WIAQ +++LST PT + L PR+VDL
Sbjct 366 VDGSGGKGIVIGPQATEAELVALRARVLADPRGWIAQRVVKLSTSPTLADDRLGPRHVDL 425
Query 455 RPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAP 500
RPFAVNDGN +WVLPGGLTRVAL GS VVNSSQGGGSKDTWVLAP
Sbjct 426 RPFAVNDGNRIWVLPGGLTRVALPRGSLVVNSSQGGGSKDTWVLAP 471
>gi|312198642|ref|YP_004018703.1| hypothetical protein FraEuI1c_4843 [Frankia sp. EuI1c]
gi|311229978|gb|ADP82833.1| protein of unknown function DUF404 [Frankia sp. EuI1c]
Length=609
Score = 630 bits (1625), Expect = 2e-178, Method: Compositional matrix adjust.
Identities = 316/484 (66%), Positives = 370/484 (77%), Gaps = 0/484 (0%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY A+DE+FD G+ R Y +Y L P + +L AR AL RAF D GI
Sbjct 22 LFEGYPAEVQATAAWDEVFDPAGVPREVYAALYDALQPLSSGDLAARKAALDRAFRDAGI 81
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF L G+ERPFPLDLVPR++S EW +ERG+ QRV+ALE +L DIYG EIL DG++PR
Sbjct 82 TFILFGEERPFPLDLVPRLLSGSEWDTIERGVVQRVRALEAFLADIYGRAEILDDGIVPR 141
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLV S HFHR A GI PPNGVR HV+GIDLIRD +G FRVLEDN+R PSGVSYV+ENRR
Sbjct 142 RLVMSSSHFHRAAHGIDPPNGVRCHVSGIDLIRDEQGRFRVLEDNVRVPSGVSYVIENRR 201
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
M RVFP LFATHRVR V DYASHLL ALR +A ADPTVVVLTPG+YNSAYFEH+LL
Sbjct 202 AMTRVFPELFATHRVRPVADYASHLLHALRAAAPPEVADPTVVVLTPGIYNSAYFEHALL 261
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDL RDNQV MRTTEGE+ V VIYRRIDD +LDPL FR +SV+G AGL
Sbjct 262 ARQMGVELVEGRDLQVRDNQVTMRTTEGEQPVHVIYRRIDDDWLDPLHFRPESVVGCAGL 321
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+NAARAG V +++ +GNGV DDKL+YTYVP +I YYL E+P+L NV+T R D+R V
Sbjct 322 INAARAGEVTIANGVGNGVADDKLMYTYVPDLIRYYLGEEPVLPNVDTYRVEDPDQRAYV 381
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LD + ELV+KPV+GSGG GIV GP+A+ ELA + ++ DPR WIAQ ++ LST PT
Sbjct 382 LDHLDELVVKPVDGSGGKGIVIGPQATDEELATLRGQVTADPRGWIAQRLVRLSTSPTLS 441
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRAS 503
L PR++DLRPFAVNDG+ +WVLPGGLTRVAL GS VVNSSQGGGSKDTWVLAP+ +
Sbjct 442 GDRLGPRHIDLRPFAVNDGSRIWVLPGGLTRVALPRGSFVVNSSQGGGSKDTWVLAPQLA 501
Query 504 AAAR 507
R
Sbjct 502 DGER 505
>gi|111223940|ref|YP_714734.1| hypothetical protein FRAAL4547 [Frankia alni ACN14a]
gi|111151472|emb|CAJ63189.1| Conserved hypothetical protein [Frankia alni ACN14a]
Length=559
Score = 625 bits (1612), Expect = 5e-177, Method: Compositional matrix adjust.
Identities = 318/495 (65%), Positives = 379/495 (77%), Gaps = 6/495 (1%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY A+DE+F+A R Y +Y L P +++L AR AL RAF D GI
Sbjct 4 LFEGYPAEAAATAAWDEVFEASNTPRDVYAALYDALQPLSSADLAARKVALDRAFRDAGI 63
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF+L G+ERPFPLDLVPR++ EW +ERG+TQRV+ALE +L D+YG E+L DG++PR
Sbjct 64 TFNLFGEERPFPLDLVPRLLDGDEWDVIERGVTQRVQALEAFLADVYGPAEVLADGIVPR 123
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLV + HFHR A GI PPNGVR HV+GIDLIRD +G FRVLEDN+R PSGVSYV+ENRR
Sbjct 124 RLVLTSAHFHRAAHGIDPPNGVRAHVSGIDLIRDEQGGFRVLEDNVRVPSGVSYVIENRR 183
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
M RVFP LFATHRVR V DYA+HLL ALR +A ADPTVVVLTPGVYNSAYFEH+LL
Sbjct 184 AMTRVFPELFATHRVRPVADYATHLLHALRAAAPPEVADPTVVVLTPGVYNSAYFEHALL 243
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDL RDN+V MRTTEGE+ V V+YRR+DD +LDPL FR +S++G AGL
Sbjct 244 ARQMGVELVEGRDLTVRDNKVTMRTTEGEQPVHVVYRRVDDDWLDPLHFRPESMVGCAGL 303
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
VNAAR G+V +++A+GNGV DDKL+YTYVP +I YYL E+P+L NV+T R D+R V
Sbjct 304 VNAARGGHVTIANAVGNGVADDKLMYTYVPELIRYYLGEEPILPNVDTYRLEDPDQRAHV 363
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LD + LV+KPV+GSGG GIV GP+AS AELA + ++ +DPR WIAQ +++LST PT
Sbjct 364 LDHLDTLVVKPVDGSGGKGIVIGPQASDAELAELRVRVSEDPRGWIAQRVVKLSTSPTLT 423
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
+ L PR+VDLRPFAVNDG +VWVLPGGLTRVAL GS VVNSSQGGGSKDTWVLA
Sbjct 424 DDRLGPRHVDLRPFAVNDGTKVWVLPGGLTRVALPRGSLVVNSSQGGGSKDTWVLAAERN 483
Query 500 PRASA--AARELGAA 512
PR A AR GAA
Sbjct 484 PREPALPMARPPGAA 498
>gi|288920571|ref|ZP_06414877.1| protein of unknown function DUF404 [Frankia sp. EUN1f]
gi|288348064|gb|EFC82335.1| protein of unknown function DUF404 [Frankia sp. EUN1f]
Length=483
Score = 625 bits (1612), Expect = 7e-177, Method: Compositional matrix adjust.
Identities = 310/477 (65%), Positives = 373/477 (79%), Gaps = 0/477 (0%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY V A A+DE+FD R Y ++ L P +++L AR AL RAF D GI
Sbjct 4 LFEGYAAEVVAAAAWDEVFDPTHRPRDVYSALHDALQPLSSADLAARKVALDRAFRDAGI 63
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF+L G+ERPFPLDLVPR++S EW +ERG+ QRV+ALE +L D+YG E+L DG++PR
Sbjct 64 TFNLFGEERPFPLDLVPRLLSGDEWEVIERGVVQRVRALEAFLADVYGPAEVLADGIVPR 123
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLV S HFHR A G+ PPNGVR HV+GIDL+RD GDFRVLEDN+R PSGVSYV+ENRR
Sbjct 124 RLVLSSSHFHRAAHGVDPPNGVRAHVSGIDLVRDENGDFRVLEDNVRVPSGVSYVIENRR 183
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
M RVFP LFATHRVR V DYA+HLL ALR +A ADPTVVVLTPGVYN+AYFEH+LL
Sbjct 184 AMTRVFPELFATHRVRPVADYATHLLHALRAAAPPEVADPTVVVLTPGVYNAAYFEHALL 243
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDL R+N+V MRTTEGE+ V VIYRR+DD +LDPL FR +S++G AGL
Sbjct 244 ARQMGVELVEGRDLSVRNNRVTMRTTEGEQPVHVIYRRVDDDWLDPLHFRPESMVGCAGL 303
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+N ARAGNV +++A+GNGV DDKL+YTYVP +I YYL E+P+L N++T R D+R V
Sbjct 304 LNVARAGNVTIANAVGNGVADDKLMYTYVPDLIRYYLGEEPVLRNIDTFRLEEPDQRAHV 363
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LD + LV+KPV+GSGG GIV GP+A++AELA + ++ DPR WIAQP+++LST PT
Sbjct 364 LDNLDALVVKPVDGSGGKGIVIGPQATEAELAELRARVLGDPRGWIAQPVVKLSTSPTLA 423
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAP 500
L PR+VDLRPFAVNDGN +WVLPGGLTRVAL GS VVNSSQGGGSKDTWVLAP
Sbjct 424 GDRLGPRHVDLRPFAVNDGNRIWVLPGGLTRVALPRGSLVVNSSQGGGSKDTWVLAP 480
>gi|336177739|ref|YP_004583114.1| hypothetical protein FsymDg_1744 [Frankia symbiont of Datisca
glomerata]
gi|334858719|gb|AEH09193.1| protein of unknown function DUF404 [Frankia symbiont of Datisca
glomerata]
Length=570
Score = 619 bits (1597), Expect = 3e-175, Method: Compositional matrix adjust.
Identities = 330/551 (60%), Positives = 388/551 (71%), Gaps = 27/551 (4%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F GY A+DE+F+A R Y +Y L P +S+L AR AL RAF D GI
Sbjct 4 LFEGYAAQ--ATQAWDEVFEAPDTPRPLYASLYDALRPLSSSDLAARKAALDRAFRDAGI 61
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF+L G+ERPFPLDLVPR++ EW +ERG+TQRV+ALE +L DIYG E+L DG++PR
Sbjct 62 TFNLFGEERPFPLDLVPRLLDNSEWDVIERGVTQRVRALEAFLTDIYGRAEVLADGIVPR 121
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
RLV S HFHR A GI PPNGVR HV+GIDLIRD +G FRVLEDN+R PSGVSYV+ENRR
Sbjct 122 RLVLSSAHFHRAAHGIDPPNGVRAHVSGIDLIRDEQGGFRVLEDNVRVPSGVSYVVENRR 181
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
M RVFP LFATHRVR V DYA+HLL ALR +A + ADPTVVVLTPGVYN AYFEH+LL
Sbjct 182 AMTRVFPELFATHRVRPVADYATHLLHALRAAAPPDVADPTVVVLTPGVYNPAYFEHALL 241
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
ARQMGVELVEGRDL +N V MRTTEG+R V V+YRRIDD +LDPL FR +SV+G AGL
Sbjct 242 ARQMGVELVEGRDLTVHNNNVTMRTTEGDRPVHVVYRRIDDDWLDPLHFRPESVVGCAGL 301
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
+NAARAG V +++A+GNGV DDKL+YTYVP +I YYL E+P+LANV+T R D+R+ V
Sbjct 302 LNAARAGRVTIANAVGNGVADDKLMYTYVPDLIRYYLGEEPVLANVDTYRLEDPDQRDHV 361
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
L + ELVLKPV+GSGG GIV G +AS AEL A+ KI DPR WIAQ ++ LST PT
Sbjct 362 LGHLDELVLKPVDGSGGKGIVIGEQASAAELDALRLKIEADPRGWIAQRVVRLSTSPTLA 421
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
L PR+VDLRPFAVNDG VWVLPGGLTRVAL GS VVNSSQGGGSKDTWVLA
Sbjct 422 GDRLGPRHVDLRPFAVNDGRRVWVLPGGLTRVALPRGSLVVNSSQGGGSKDTWVLASPES 481
Query 500 ------PRASAAARELGAAQIVRSLPQPLCD---------------PTVDASGYEPHDQQ 538
PR+ A P P CD P+ DA + Q+
Sbjct 482 SRESISPRSRPPGNVPQVADGPDVGPFPSCDQQQQQQQQREGPQPRPSQDAGPGQGQRQE 541
Query 539 PQQQQQQQQQA 549
P+Q Q + Q+
Sbjct 542 PRQGQSEHGQS 552
>gi|119960999|ref|YP_947876.1| hypothetical protein AAur_2132 [Arthrobacter aurescens TC1]
gi|119947858|gb|ABM06769.1| putative Domain of unknown function (DUF404/DUF407) [Arthrobacter
aurescens TC1]
Length=530
Score = 609 bits (1570), Expect = 5e-172, Method: Compositional matrix adjust.
Identities = 297/495 (60%), Positives = 371/495 (75%), Gaps = 10/495 (2%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F Y+ + + A+DEMF + R Y + L +++ ARAD++ R F+D+G+
Sbjct 16 LFQDYSEAAARSGAYDEMFAQGHVARRSYGQVSGALRELSLADVTARADSMARTFLDRGV 75
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF +G+ERPFPLD+VPRVI A EW LERG+ QRVKALE +L+D+YG ++ DGVIPR
Sbjct 76 TFDYAGEERPFPLDIVPRVIPADEWNVLERGVAQRVKALEAFLNDVYGRMAVVTDGVIPR 135
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
+LVT+ HFHR G P GVR+HV+GID++RD G FRVLEDN+R PSGVSYV+ENRR
Sbjct 136 QLVTTSAHFHRAVHGFEPSGGVRVHVSGIDVVRDAAGTFRVLEDNVRVPSGVSYVLENRR 195
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
MA+ P F +R V++Y LL ALR +A DPTVVVLTPGV+NSAYFEH+LL
Sbjct 196 AMAKGLPEAFGQQHIRPVEEYPRRLLSALRKTAPAGVDDPTVVVLTPGVFNSAYFEHTLL 255
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
A MGVELVEGRDL CR N+VYMRTT GE++VDVIY+RIDD FLDPLQFR+DS+LG GL
Sbjct 256 AGLMGVELVEGRDLICRGNRVYMRTTAGEQRVDVIYKRIDDDFLDPLQFRSDSMLGCPGL 315
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
VNAARAG V +++A+GNGV DDKLVY+YVP +I YYLHE+P++ANV+T R + RE V
Sbjct 316 VNAARAGGVTIANAVGNGVADDKLVYSYVPDLIRYYLHEEPIIANVDTFRLEEKEAREHV 375
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LDR+ ELV+KPV+GSGG G+V GP+A++ EL A+ +++ DPR WIAQP+++LSTVPT
Sbjct 376 LDRLDELVVKPVDGSGGKGLVIGPDATKDELDALRKRVIADPRGWIAQPVLQLSTVPTLS 435
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS +VNSSQGGGSKDTWVLA
Sbjct 436 GDKFGPRHVDLRPFAVNDGDDVWVLPGGLTRVALKEGSLIVNSSQGGGSKDTWVLADSPQ 495
Query 500 ------PRASAAARE 508
PR S RE
Sbjct 496 LPAEIIPRQSVTVRE 510
>gi|116670678|ref|YP_831611.1| hypothetical protein Arth_2131 [Arthrobacter sp. FB24]
gi|116610787|gb|ABK03511.1| protein of unknown function DUF404 [Arthrobacter sp. FB24]
Length=520
Score = 607 bits (1565), Expect = 2e-171, Method: Compositional matrix adjust.
Identities = 296/495 (60%), Positives = 374/495 (76%), Gaps = 10/495 (2%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F Y+ + + A+DEMF R Y + L +++ ARAD++ R F+D+G+
Sbjct 6 LFQDYSEAAGRSGAYDEMFTPGQEARKSYGQVAGALRELSLTDVTARADSMARTFLDRGV 65
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF +G+ERPFPLD+VPRVI A EWT LE+G+ QRV+ALE +L+D+Y ++ DGVIPR
Sbjct 66 TFDFAGEERPFPLDIVPRVIPADEWTVLEKGVAQRVRALEAFLNDVYDKMSVVADGVIPR 125
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
+LVT+ HFHRQ G P GVR+H++GID++RD G FRVLEDN+R PSGVSYV+ENRR
Sbjct 126 QLVTTSAHFHRQVHGFEPAGGVRVHISGIDVVRDAAGTFRVLEDNVRVPSGVSYVLENRR 185
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
MA+ P F +R V++Y LL ALR +A + DPTVVVLTPGV+NSAYFEH+LL
Sbjct 186 AMAKGLPEAFGQQLIRPVEEYPRRLLSALRKTAPSGVDDPTVVVLTPGVFNSAYFEHTLL 245
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
A MGVELVEGRDL CR N+VYMRTT+GE++VDVIY+RIDD FLDPLQFRADS+LG GL
Sbjct 246 AGLMGVELVEGRDLICRGNRVYMRTTDGEQRVDVIYKRIDDDFLDPLQFRADSMLGCPGL 305
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
VNAARAG V +++A+GNGV DDKLVY+YVP +I YYL+E+P++ANV+T R + RE V
Sbjct 306 VNAARAGGVTIANAVGNGVADDKLVYSYVPDLIRYYLNEEPVIANVDTFRLEEKEAREHV 365
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LDR+ ELV+KPV+GSGG G+V GP+AS+ EL A+ +++ DPR WIAQP+++LSTVPT
Sbjct 366 LDRLDELVVKPVDGSGGKGLVIGPDASKEELDALRKRVIADPRGWIAQPVLQLSTVPTLS 425
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
PR+VDLRPFAVNDG++VWVLPGGLTRVAL EGS +VNSSQGGGSKDTWVL+
Sbjct 426 GDKFGPRHVDLRPFAVNDGDDVWVLPGGLTRVALKEGSLIVNSSQGGGSKDTWVLSDSPE 485
Query 500 ------PRASAAARE 508
PR S A RE
Sbjct 486 VPVEALPRPSIAVRE 500
>gi|148271675|ref|YP_001221236.1| hypothetical protein CMM_0496 [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147829605|emb|CAN00520.1| conserved hypothetical protein [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length=571
Score = 602 bits (1551), Expect = 7e-170, Method: Compositional matrix adjust.
Identities = 312/541 (58%), Positives = 381/541 (71%), Gaps = 19/541 (3%)
Query 24 IFGGYNT------SDVYAMAFDEMF------DAQGIVRGPYKGIYAELAPSDASELKARA 71
+F GY T + AM FDEMF + R Y+ I+A L+ ELK R
Sbjct 4 LFEGYGTLAAARRASGGAMPFDEMFRDPPVAGEPAVARAAYREIHAALSRMTKEELKDRT 63
Query 72 DALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYG 131
DAL +++ QG+TF +G+ERPFPLD VPRVI EW+RLE+G+ QRV+ALE +L D+YG
Sbjct 64 DALATSYLAQGVTFDFAGEERPFPLDAVPRVIEQAEWSRLEKGVAQRVRALEAFLADVYG 123
Query 132 DQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRS 191
Q +RDGVIP RL++S HFHRQA GI P NGVRI V+GIDL+RD G+ RVLEDN+R
Sbjct 124 PQRAIRDGVIPARLISSSSHFHRQAAGIDPANGVRIQVSGIDLVRDEAGEMRVLEDNVRV 183
Query 192 PSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPG 251
PSGVSYV+ NRR MA+ P LF + RVR V DY + LL+ALR SA DP VVVLTPG
Sbjct 184 PSGVSYVISNRRVMAQTLPELFVSMRVRPVGDYPNKLLQALRASAPDGVEDPNVVVLTPG 243
Query 252 VYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQ 311
VYNSAYFEH+LLAR MGVELVEGRDLFC +V+MRTT G +VDVIYRR+DD FLDPLQ
Sbjct 244 VYNSAYFEHTLLARLMGVELVEGRDLFCSGGRVWMRTTGGPMRVDVIYRRVDDEFLDPLQ 303
Query 312 FRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVET 371
FRADS+LG GL+ AAR GNV +++A+GNGV DDKLVYTY+P +I YYL E ++ NV+T
Sbjct 304 FRADSMLGSPGLMLAARLGNVTIANAVGNGVADDKLVYTYLPDLIRYYLAEDAIIPNVDT 363
Query 372 LRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQ 431
R D EEVLDR+ ELV+KPV+GSGG G+V GP AS ELA + ++ DPR WIAQ
Sbjct 364 WRLEEPDSLEEVLDRLPELVVKPVDGSGGKGLVVGPAASAGELAELRARLLKDPRGWIAQ 423
Query 432 PMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGG 491
P+++LST+PT +E + PR+ DLRPFAVNDG ++WVLPGGLTRVAL EG VVNSSQGGG
Sbjct 424 PVVQLSTIPTLVEDGMRPRHADLRPFAVNDGRDIWVLPGGLTRVALPEGQLVVNSSQGGG 483
Query 492 SKDTWVLAPRA-SAAARE-----LGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQ 545
SKDTWV+ AA RE L A Q + P+ A PHD +P+ + Q
Sbjct 484 SKDTWVVGDSGFPAATRERSVQTLVADQAAVTTSIPIIQNGEKAPDQSPHD-RPRNRDQH 542
Query 546 Q 546
+
Sbjct 543 E 543
>gi|170780706|ref|YP_001709038.1| hypothetical protein CMS_0252 [Clavibacter michiganensis subsp.
sepedonicus]
gi|169155274|emb|CAQ00375.1| conserved hypothetical protein [Clavibacter michiganensis subsp.
sepedonicus]
Length=566
Score = 602 bits (1551), Expect = 8e-170, Method: Compositional matrix adjust.
Identities = 307/537 (58%), Positives = 378/537 (71%), Gaps = 12/537 (2%)
Query 24 IFGGYNT------SDVYAMAFDEMF------DAQGIVRGPYKGIYAELAPSDASELKARA 71
+F GY T + AM FDEMF + R Y+ I+A L+ ELK R
Sbjct 4 LFEGYGTLAAARRASGGAMPFDEMFRDPPVAGEPAVARAAYREIHAALSRMTKEELKDRT 63
Query 72 DALGRAFIDQGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYG 131
DAL +++ QG+TF +G+ERPFPLD VPRVI EW+RLE+G+ QRV+ALE +L D+YG
Sbjct 64 DALATSYLAQGVTFDFAGEERPFPLDAVPRVIEQAEWSRLEKGVAQRVRALEAFLADVYG 123
Query 132 DQEILRDGVIPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRS 191
Q +RDGVIP RL++S HFHRQA GI P NGVRI V+GIDL+RD G+ RVLEDN+R
Sbjct 124 PQRAIRDGVIPARLISSSSHFHRQAAGIDPANGVRIQVSGIDLVRDEAGEMRVLEDNVRV 183
Query 192 PSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPG 251
PSGVSYV+ NRR MA+ P LF + RVR V DY + LL+ALR SA DP VVVLTPG
Sbjct 184 PSGVSYVISNRRVMAQTLPELFVSMRVRPVGDYPNKLLQALRASAPDGVEDPNVVVLTPG 243
Query 252 VYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQ 311
VYNSAYFEH+LLAR MGVELVEGRDLFC +V+MRTT G +VDVIYRR+DD FLDPLQ
Sbjct 244 VYNSAYFEHTLLARLMGVELVEGRDLFCSGGRVWMRTTGGPMRVDVIYRRVDDEFLDPLQ 303
Query 312 FRADSVLGVAGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVET 371
FRADS+LG GL+ AAR GNV +++A+GNGV DDKLVYTY+P +I YYL E ++ NV+T
Sbjct 304 FRADSMLGSPGLMLAARLGNVTIANAVGNGVADDKLVYTYLPDLIRYYLAEDAIIPNVDT 363
Query 372 LRCWLDDEREEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQ 431
R D EEVLDR+ ELV+KPV+GSGG G+V GP AS ELA + ++ DPR WIAQ
Sbjct 364 WRLEEPDSLEEVLDRLPELVVKPVDGSGGKGLVVGPAASAGELAELRARLLKDPRGWIAQ 423
Query 432 PMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGG 491
P+++LST+PT +E + PR+ DLRPFAVNDG ++WVLPGGLTRVAL EG VVNSSQGGG
Sbjct 424 PVVQLSTIPTLVEDGMRPRHADLRPFAVNDGRDIWVLPGGLTRVALPEGQLVVNSSQGGG 483
Query 492 SKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQQQQ 548
SKDTWV+ AA + Q + + + SG + DQ P + + + Q
Sbjct 484 SKDTWVVGGSGFPAATRERSVQTLVADQAAVTTSIPIVSGEKAPDQSPHDRPRNRDQ 540
>gi|88856624|ref|ZP_01131280.1| hypothetical protein A20C1_10595 [marine actinobacterium PHSC20C1]
gi|88814085|gb|EAR23951.1| hypothetical protein A20C1_10595 [marine actinobacterium PHSC20C1]
Length=555
Score = 601 bits (1549), Expect = 1e-169, Method: Compositional matrix adjust.
Identities = 295/479 (62%), Positives = 363/479 (76%), Gaps = 3/479 (0%)
Query 24 IFGGYNTSD---VYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFID 80
+F GY +S A+DEMF A +R PY+ I+ LA EL+ R +AL +++
Sbjct 4 LFDGYTSSASKRTGPAAWDEMFSADSEIRRPYREIHDALAQMTQEELRGRTEALADSYLA 63
Query 81 QGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGV 140
QG+TF +G+ERPFPLD VPRVI EW ++E G+ QRV+ALE +L DIYG Q ++DGV
Sbjct 64 QGVTFDFAGEERPFPLDPVPRVIDLSEWRQVESGVKQRVRALEAFLADIYGPQNAIKDGV 123
Query 141 IPRRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVME 200
IP R++TS HFHRQA GI P NGVRI V+GIDLIRD G +RVLEDN+R PSGVSYV+
Sbjct 124 IPARMITSSSHFHRQAAGIEPANGVRIQVSGIDLIRDEVGAWRVLEDNVRVPSGVSYVIS 183
Query 201 NRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEH 260
NRR MA+ P LF + RVR V DY LL+ALR SA + +PTVVVLTPGVYNSAYFEH
Sbjct 184 NRRVMAQTLPELFVSMRVRPVGDYPHKLLQALRASAPSGIEEPTVVVLTPGVYNSAYFEH 243
Query 261 SLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGV 320
+LLAR MGVELVEGRDLFC +V+MRTT G +VDVIYRR+DD FLDPLQFRADS+LG
Sbjct 244 TLLARLMGVELVEGRDLFCSGGRVWMRTTAGPTRVDVIYRRVDDEFLDPLQFRADSMLGS 303
Query 321 AGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDER 380
G++ AAR GNV +++A+GNGV DDKLVYTY+P + EYYL EK ++ NV+T R
Sbjct 304 PGMMLAARLGNVTIANAVGNGVADDKLVYTYLPDLTEYYLGEKAIIPNVQTWRLEDPGAL 363
Query 381 EEVLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVP 440
EEVLDR+ ELV+KPV+GSGG G+V GP AS+ ELA + ++R DPR WIAQP+++LST+P
Sbjct 364 EEVLDRLDELVVKPVDGSGGKGLVIGPAASKDELATLKTQLRKDPRGWIAQPVVQLSTIP 423
Query 441 TRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA 499
T ++ + PR+ DLRPFAVNDG++VWVLPGGLTRVAL EG VVNSSQGGGSKDTWV+
Sbjct 424 TVVDDGMRPRHADLRPFAVNDGSDVWVLPGGLTRVALPEGQLVVNSSQGGGSKDTWVVG 482
>gi|336115926|ref|YP_004570692.1| hypothetical protein MLP_02750 [Microlunatus phosphovorus NM-1]
gi|334683704|dbj|BAK33289.1| hypothetical protein MLP_02750 [Microlunatus phosphovorus NM-1]
Length=551
Score = 599 bits (1544), Expect = 5e-169, Method: Compositional matrix adjust.
Identities = 291/465 (63%), Positives = 354/465 (77%), Gaps = 0/465 (0%)
Query 35 AMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPF 94
+AFDEM DA G VR Y +Y L S A EL++ A++L + G+TF + G ERPF
Sbjct 11 GIAFDEMIDADGAVRAAYSTVYETLRRSSADELRSIAESLANNYTQAGVTFDVGGVERPF 70
Query 95 PLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHR 154
PLD+VPRVI A +W ++ G+ QR++ALE +L D+Y D ++ DGVIPR+L+TS H+HR
Sbjct 71 PLDVVPRVIPADDWEIIDSGVAQRIRALEAFLADVYADGRVMTDGVIPRQLITSSSHYHR 130
Query 155 QAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFA 214
GI PPNGVR+HV GIDLIR GD RVLEDN+R PSGVSYVM NR M P F
Sbjct 131 AVWGIQPPNGVRVHVGGIDLIRTPDGDVRVLEDNVRVPSGVSYVMTNRSAMVTAMPEAFG 190
Query 215 THRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEG 274
T R+R V Y LL ALR +A DPTVVVLTPGVYNSAYFEH+LLAR MGVELVEG
Sbjct 191 TQRIRPVAGYPQRLLAALRKAAPYGIDDPTVVVLTPGVYNSAYFEHTLLARTMGVELVEG 250
Query 275 RDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVL 334
RDL C+ +VYMRTT G R+VDVIYRRIDD F+DP+ FR+DS+LGV GL+NA R+G V L
Sbjct 251 RDLECQRGRVYMRTTAGLRRVDVIYRRIDDDFIDPVHFRSDSMLGVTGLLNAVRSGGVTL 310
Query 335 SSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKP 394
++AIGNGV DDKL+YTYVP +I YYL+E+P++ NV+T R DD REEV+DR+ ELV+KP
Sbjct 311 ANAIGNGVADDKLIYTYVPDLIRYYLNEEPIIRNVDTWRLEEDDAREEVMDRLDELVVKP 370
Query 395 VEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDL 454
V+GSGG GIV GP AS ELAA+ +++ DDPR WIAQP+++LSTVPT I L PR+VDL
Sbjct 371 VDGSGGKGIVIGPHASAEELAALRRRVTDDPRGWIAQPLVQLSTVPTLIGSGLEPRHVDL 430
Query 455 RPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA 499
RPFAVN G+++WVLPGGLTRVAL +G VVNSSQGGGSKDTWVL+
Sbjct 431 RPFAVNSGDDIWVLPGGLTRVALPKGELVVNSSQGGGSKDTWVLS 475
>gi|220912637|ref|YP_002487946.1| hypothetical protein Achl_1882 [Arthrobacter chlorophenolicus
A6]
gi|219859515|gb|ACL39857.1| protein of unknown function DUF404 [Arthrobacter chlorophenolicus
A6]
Length=518
Score = 599 bits (1544), Expect = 5e-169, Method: Compositional matrix adjust.
Identities = 295/495 (60%), Positives = 367/495 (75%), Gaps = 10/495 (2%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F Y+ + A+DEMF R Y+ + L +++ ARAD++ R F+D+G+
Sbjct 4 LFQDYSEAAGRTGAYDEMFAPGQQARPSYEQVADALRKLSLADVSARADSMARTFLDRGV 63
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF +G+ERPFPLD+VPRVI A EW +ERG+ QRV+ALE +L+D+Y ++ DGVIPR
Sbjct 64 TFDFAGEERPFPLDIVPRVIPAAEWDVMERGVAQRVRALEAFLNDVYDKMTVVSDGVIPR 123
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
+LVT+ HFHRQ G P GVR+H++GID++RD G FRVLEDN+R PSGVSYV+ENRR
Sbjct 124 QLVTTSAHFHRQVHGFEPAGGVRVHISGIDVVRDAAGTFRVLEDNVRVPSGVSYVLENRR 183
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
MA+ P F +R V++Y LL ALR +A + DPTVVVLTPGV+NSAYFEH+LL
Sbjct 184 AMAKGLPEAFGQQLIRPVEEYPRRLLSALRKTAPSGVDDPTVVVLTPGVFNSAYFEHTLL 243
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
A MGVELVEGRDL CR N+VYMRTT GE++VDVIY+RIDD FLDPLQFRADS+LG GL
Sbjct 244 AGLMGVELVEGRDLICRGNRVYMRTTAGEQRVDVIYKRIDDDFLDPLQFRADSMLGCPGL 303
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
VNAARAG V +++A+GNGV DDKLVY+YVP +I YYL E+P++ANV+T R + RE
Sbjct 304 VNAARAGGVTIANAVGNGVADDKLVYSYVPDLIRYYLSEEPIIANVDTFRLEEKEAREYT 363
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LD + ELV+KPV+GSGG G+V GP+AS EL A+ Q+I DPR WIAQP+++LSTVPT
Sbjct 364 LDNLAELVVKPVDGSGGKGLVIGPDASNDELDALRQRIIADPRGWIAQPVLQLSTVPTLS 423
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
PR+VDLRPFAVNDG+ VWVLPGGLTRVAL EGS +VNSSQGGGSKDTWVLA
Sbjct 424 GDKFGPRHVDLRPFAVNDGDNVWVLPGGLTRVALKEGSLIVNSSQGGGSKDTWVLADSPQ 483
Query 500 ------PRASAAARE 508
PR S + RE
Sbjct 484 MPVESVPRQSISVRE 498
>gi|325963241|ref|YP_004241147.1| hypothetical protein Asphe3_18570 [Arthrobacter phenanthrenivorans
Sphe3]
gi|323469328|gb|ADX73013.1| uncharacterized conserved protein [Arthrobacter phenanthrenivorans
Sphe3]
Length=520
Score = 598 bits (1542), Expect = 9e-169, Method: Compositional matrix adjust.
Identities = 295/495 (60%), Positives = 366/495 (74%), Gaps = 10/495 (2%)
Query 24 IFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGI 83
+F Y+ + A+DEMF R Y + L +++ ARAD++ R F+D+G+
Sbjct 6 LFQDYSVAAGRTGAYDEMFAPGQQARDSYGQVADALRKLSLADVSARADSMARTFLDRGV 65
Query 84 TFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPR 143
TF +G+ERPFPLD+VPRVI A EW LERG+ QRV+ALE +L+D+Y ++ DGVIPR
Sbjct 66 TFDFAGEERPFPLDIVPRVIPAAEWDVLERGVAQRVRALEAFLNDVYDKMTVVSDGVIPR 125
Query 144 RLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRR 203
+LVT+ HFHRQ G P GVR+H++GID++RD G FRVLEDN+R PSGVSYV+ENRR
Sbjct 126 QLVTTSAHFHRQVHGFEPAGGVRVHISGIDVVRDAAGTFRVLEDNVRVPSGVSYVLENRR 185
Query 204 TMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLL 263
MA+ P F +R V++Y LL ALR +A DPTVVVLTPGV+NSAYFEH+LL
Sbjct 186 AMAKGLPEAFGQQLIRPVEEYPRRLLSALRKTAPAGVDDPTVVVLTPGVFNSAYFEHTLL 245
Query 264 ARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGL 323
A MGVELVEGRDL CR N+VYMRTT GE++VDVIY+RIDD FLDPLQFRADS+LG GL
Sbjct 246 AGLMGVELVEGRDLICRGNRVYMRTTAGEQRVDVIYKRIDDDFLDPLQFRADSMLGCPGL 305
Query 324 VNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEV 383
VNAARAG V +++A+GNGV DDKLVY+YVP +I YYL E+P++ANV+T R + RE
Sbjct 306 VNAARAGGVTIANAVGNGVADDKLVYSYVPDLIRYYLSEEPIIANVDTYRLEEKEAREYT 365
Query 384 LDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRI 443
LD + ELV+KPV+GSGG G+V GP+AS+ EL A+ Q++ DPR WIAQP+++LSTVPT
Sbjct 366 LDNLSELVVKPVDGSGGKGLVIGPDASKDELDALRQRVIADPRGWIAQPVLQLSTVPTLS 425
Query 444 EGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---- 499
PR+VDLRPFAVNDG+ VWVLPGGLTRVAL EGS +VNSSQGGGSKDTWVLA
Sbjct 426 GDKFGPRHVDLRPFAVNDGDNVWVLPGGLTRVALKEGSLIVNSSQGGGSKDTWVLADSPQ 485
Query 500 ------PRASAAARE 508
PR S + RE
Sbjct 486 MPVETVPRPSISLRE 500
>gi|258654683|ref|YP_003203839.1| hypothetical protein Namu_4571 [Nakamurella multipartita DSM
44233]
gi|258557908|gb|ACV80850.1| protein of unknown function DUF404 [Nakamurella multipartita
DSM 44233]
Length=545
Score = 592 bits (1527), Expect = 4e-167, Method: Compositional matrix adjust.
Identities = 293/493 (60%), Positives = 363/493 (74%), Gaps = 12/493 (2%)
Query 34 YAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERP 93
+A A+DEMF A G +R Y+ ++A L DA++LKARAD +GR F+DQGITF+L G ERP
Sbjct 10 FARAWDEMFAAPGEIRPAYESVFAALQTMDAADLKARADIMGRTFLDQGITFALGGVERP 69
Query 94 FPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFH 153
FPLDL+PR+++A EW +E+G+ QRV+ALE +L D+YG I DGV+P+RLVT+ HFH
Sbjct 70 FPLDLIPRIVTAAEWQTVEKGVPQRVRALEAFLADVYGQGRIFTDGVVPKRLVTTSPHFH 129
Query 154 RQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLF 213
RQ +G+ +G R+ ++G+DLIRD +G+FRVLEDN+R PSGVSYV+ENR+ +A+V
Sbjct 130 RQVMGMSAQDGARVVISGVDLIRDEKGEFRVLEDNVRVPSGVSYVLENRQAVAQVLSEAG 189
Query 214 ATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVE 273
A VR V +Y LL ALR A N DP VVVLTPGVYNSAYFEH+LLAR+MGVELVE
Sbjct 190 ADQLVRPVSEYPGQLLAALRAVAPWNVTDPNVVVLTPGVYNSAYFEHTLLAREMGVELVE 249
Query 274 GRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVV 333
GRDL CR+N+V++RTT E V VIYRRIDD FLDP+QFRADS+LG GL+NAARAGN+
Sbjct 250 GRDLICRNNRVFLRTTSSEMPVHVIYRRIDDEFLDPMQFRADSLLGSPGLINAARAGNLT 309
Query 334 LSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLK 393
+++A+GNG+ DDKLVYTYVP +I YYL E+P+L NV+T R + D RE L+ + ELVLK
Sbjct 310 IANAVGNGIADDKLVYTYVPDIIRYYLSEEPILQNVDTYRMEVPDHREYALEHLAELVLK 369
Query 394 PVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVD 453
PV+GSGG GIV G A +A L + I ++PR WIAQ + LSTVPT I + PR+VD
Sbjct 370 PVDGSGGKGIVIGSRADRAVLRKARETILENPRGWIAQREIALSTVPTLIGEKMRPRHVD 429
Query 454 LRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAARELGAAQ 513
LRPFAVN+G VWVLPGGLTRVAL EG VVNSSQGGGSKDTWVL
Sbjct 430 LRPFAVNNGRSVWVLPGGLTRVALPEGELVVNSSQGGGSKDTWVL------------GGP 477
Query 514 IVRSLPQPLCDPT 526
I PQP D T
Sbjct 478 IPEPEPQPAADAT 490
>gi|336320339|ref|YP_004600307.1| hypothetical protein Celgi_1220 [Cellvibrio gilvus ATCC 13127]
gi|336103920|gb|AEI11739.1| protein of unknown function DUF404 [Cellvibrio gilvus ATCC 13127]
Length=533
Score = 585 bits (1507), Expect = 1e-164, Method: Compositional matrix adjust.
Identities = 299/515 (59%), Positives = 374/515 (73%), Gaps = 13/515 (2%)
Query 35 AMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPF 94
+A+DEM + R Y+ ++A LA A EL+ RADAL R+++ QG+TF +G+ERPF
Sbjct 11 GVAWDEMLEPSAGPRAAYRQVHAALAQLSAGELRGRADALARSYLTQGVTFDFAGEERPF 70
Query 95 PLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHR 154
PLD+VPRVI+ EW + G+ QRV+ALE +L D+YG Q + DGV+PR ++ S H+HR
Sbjct 71 PLDVVPRVIAGDEWEHVAPGVAQRVRALEAFLADVYGPQNAVADGVLPRSVIVSSTHYHR 130
Query 155 QAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFA 214
GI PPNGVR+HV+GIDL+RD +RVLEDN+R PSGVSYV+ NRR MA+ FP LFA
Sbjct 131 AVRGIAPPNGVRVHVSGIDLVRDSLDGWRVLEDNVRVPSGVSYVLSNRRAMAQSFPELFA 190
Query 215 THRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEG 274
R+R V DY LL AL +A DPTVVVLTPGV+NSAYFEHSLLAR MGVELVEG
Sbjct 191 ALRIRPVADYPRRLLAALMAAAPAGVDDPTVVVLTPGVFNSAYFEHSLLARTMGVELVEG 250
Query 275 RDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVL 334
RDLFC +V+MRTT+G R+VDVIYRR+DD FLDP+ FRADS+LG GL+ AR G V +
Sbjct 251 RDLFCSGGRVWMRTTQGRRRVDVIYRRVDDEFLDPVTFRADSLLGSPGLMTCARNGTVTI 310
Query 335 SSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKP 394
++A+GNGV DDKLVYTYVP +I YYL E+P+LANV+T R EEVLDR+ ELV+KP
Sbjct 311 ANAVGNGVADDKLVYTYVPDLIRYYLGEEPVLANVDTWRLEEPGALEEVLDRLDELVVKP 370
Query 395 VEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDL 454
V+GSGG G+V GP AS+ ELAA+ ++ DPR WIAQP+++LST+PT +E L PR+ DL
Sbjct 371 VDGSGGKGLVVGPAASRDELAALRARLIADPRGWIAQPVVQLSTIPTLVEDGLRPRHTDL 430
Query 455 RPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA---PRASAAARELGA 511
RPFA+NDG +VWVLPGGLTRVAL EG VVNSSQGGGSKDTWVL PR + A +
Sbjct 431 RPFAINDGTDVWVLPGGLTRVALPEGRLVVNSSQGGGSKDTWVLGDAPPRRATAVPQAHP 490
Query 512 AQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQQ 546
+ ++P +DA+ P D + QQQQ
Sbjct 491 VPVSNAVP-------IDAN---PQDIRAHVMQQQQ 515
>gi|326330358|ref|ZP_08196668.1| hypothetical protein NBCG_01793 [Nocardioidaceae bacterium Broad-1]
gi|325951895|gb|EGD43925.1| hypothetical protein NBCG_01793 [Nocardioidaceae bacterium Broad-1]
Length=498
Score = 583 bits (1502), Expect = 4e-164, Method: Compositional matrix adjust.
Identities = 295/477 (62%), Positives = 362/477 (76%), Gaps = 4/477 (0%)
Query 23 RIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQG 82
++F Y T D AFDEMF A G +R PY+ + L ASEL +R +A+ +++DQG
Sbjct 3 QMFDAYETRD---PAFDEMF-AGGELRPPYQRLGDSLRRLSASELISRVEAMQASYLDQG 58
Query 83 ITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIP 142
+TF + G+ER FPLD+VPRVI WT +++G+ QRVKALE +L D+Y + ++ DGVIP
Sbjct 59 VTFDIGGEERAFPLDIVPRVIERDAWTTIDKGVQQRVKALELFLADVYDEGKVFEDGVIP 118
Query 143 RRLVTSCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENR 202
R ++T+ H+HR A G+ PPNGVR+ V+GIDL+RD+ G+FRVLEDN+R PSGVSYVM NR
Sbjct 119 REVITTSSHYHRAAAGVHPPNGVRVQVSGIDLVRDNAGEFRVLEDNVRVPSGVSYVMTNR 178
Query 203 RTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSL 262
R ++ P A HR+R V +Y LL ALR +A ADPTVVVLTPGVYN AYFEH+L
Sbjct 179 RAISAALPETIAEHRIRPVANYPQKLLAALRAAAPAGVADPTVVVLTPGVYNGAYFEHAL 238
Query 263 LARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAG 322
LAR MGVELVEGRDL CR+ QV MRTT+G V VIYRRIDD FLDP+ FR DS+LG G
Sbjct 239 LARTMGVELVEGRDLVCRNGQVLMRTTKGLAPVHVIYRRIDDEFLDPVHFRPDSMLGCVG 298
Query 323 LVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREE 382
L++AAR GNV L++A+GNGV DDKLVYTY+P +I YYL E P++ NV+T R REE
Sbjct 299 LIDAARMGNVTLANAVGNGVADDKLVYTYMPDIIRYYLAEDPIIKNVDTWRMGDATSREE 358
Query 383 VLDRIRELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTR 442
VLDR+ ELVLKPV+GSGG GIV GP AS EL + KI DDPRSWIAQP+++LSTVPT
Sbjct 359 VLDRLDELVLKPVDGSGGKGIVIGPAASARELEVLRGKILDDPRSWIAQPVVQLSTVPTF 418
Query 443 IEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLA 499
I+G L R+VDLRPFAVNDG++VWVLPGGLTRVAL EG +VNSS+GGGSKDTWVLA
Sbjct 419 IDGDLGARHVDLRPFAVNDGDKVWVLPGGLTRVALAEGELIVNSSRGGGSKDTWVLA 475
>gi|323358427|ref|YP_004224823.1| hypothetical protein MTES_1979 [Microbacterium testaceum StLB037]
gi|323274798|dbj|BAJ74943.1| uncharacterized conserved protein [Microbacterium testaceum StLB037]
Length=594
Score = 582 bits (1500), Expect = 6e-164, Method: Compositional matrix adjust.
Identities = 292/491 (60%), Positives = 365/491 (75%), Gaps = 13/491 (2%)
Query 37 AFDEMFDAQGIVRGP---------YKGIYAELAPSDASELKARADALGRAFIDQGITFSL 87
AFDEMF G+ P Y+ +Y LA EL+ R ++L +++ QG+TF
Sbjct 23 AFDEMF---GVPASPGEAAPSREAYRELYQTLAQMTQEELRGRTESLASSYLAQGVTFDF 79
Query 88 SGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVT 147
+G+ERPFPLD VPRVI+ EW+R+E G+ QRV+ALE +LDD YG+Q +RDG++P L++
Sbjct 80 AGEERPFPLDAVPRVIAYDEWSRIEAGVKQRVRALEAFLDDAYGNQHCVRDGILPAGLIS 139
Query 148 SCEHFHRQAVGIVPPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMAR 207
S ++F+RQA GI NGVRI V+GIDLIRD G+ RVLEDN+R PSGVSYV+ NRR MA+
Sbjct 140 SSQYFYRQAAGIRSANGVRIQVSGIDLIRDEHGEMRVLEDNVRVPSGVSYVISNRRVMAQ 199
Query 208 VFPNLFATHRVRAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQM 267
P LF + RVR V DY + LL ALR SA DP +VVLTPGVYNSAYFEH+LLAR M
Sbjct 200 TLPELFVSMRVRPVGDYPNKLLAALRASAPPGIDDPNIVVLTPGVYNSAYFEHTLLARLM 259
Query 268 GVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAA 327
GVELVEGRDL C +V+MRTT G ++VDVIYRR+DD FLDPLQFRADS+LG GL+ AA
Sbjct 260 GVELVEGRDLLCIGGKVFMRTTRGPQRVDVIYRRVDDDFLDPLQFRADSMLGAPGLMLAA 319
Query 328 RAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRI 387
R GNV +++A+GNGV DDKL+YTYVP +I YYL E+P+L NV+T R EEVLDR+
Sbjct 320 RLGNVTIANAVGNGVADDKLLYTYVPDLIRYYLAEEPILKNVDTWRLEDPGALEEVLDRL 379
Query 388 RELVLKPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTL 447
ELV+KPV+GSGG G+V GP+AS AEL A+ +++ DPR WIAQP++ LST+PT +E +
Sbjct 380 PELVVKPVDGSGGKGLVVGPDASPAELDALRKRLLADPRGWIAQPVVMLSTIPTLVEDGM 439
Query 448 APRYVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAAR 507
PR+ DLRPFAVNDG+++WVLPGGLTRVAL EG VVNSSQGGGSKDTWV+ A+ +
Sbjct 440 RPRHADLRPFAVNDGDDIWVLPGGLTRVALPEGQLVVNSSQGGGSKDTWVVG-GAAPSHV 498
Query 508 ELGAAQIVRSL 518
E G Q V L
Sbjct 499 EYGQGQGVSGL 509
Lambda K H
0.320 0.137 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 1222700780868
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40