BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3212
Length=407
Sequences producing significant alignments: Score (Bits) E Value
gi|15610348|ref|NP_217728.1| hypothetical protein Rv3212 [Mycoba... 798 0.0
gi|31794390|ref|NP_856883.1| hypothetical protein Mb3238 [Mycoba... 796 0.0
gi|121639099|ref|YP_979323.1| hypothetical protein BCG_3239 [Myc... 793 0.0
gi|240172121|ref|ZP_04750780.1| hypothetical protein MkanA1_2258... 645 0.0
gi|15827354|ref|NP_301617.1| hypothetical protein ML0810 [Mycoba... 617 8e-175
gi|183981368|ref|YP_001849659.1| hypothetical protein MMAR_1345 ... 593 1e-167
gi|118618016|ref|YP_906348.1| hypothetical protein MUL_2534 [Myc... 588 5e-166
gi|296169011|ref|ZP_06850677.1| conserved hypothetical protein [... 572 3e-161
gi|342861170|ref|ZP_08717819.1| hypothetical protein MCOL_19902 ... 552 3e-155
gi|336459519|gb|EGO38456.1| hypothetical protein MAPs_02470 [Myc... 546 3e-153
gi|254776599|ref|ZP_05218115.1| hypothetical protein MaviaA2_182... 545 4e-153
gi|41409411|ref|NP_962247.1| hypothetical protein MAP3313 [Mycob... 544 1e-152
gi|254822020|ref|ZP_05227021.1| hypothetical protein MintA_18952... 540 1e-151
gi|118462952|ref|YP_883307.1| hypothetical protein MAV_4160 [Myc... 532 4e-149
gi|333991567|ref|YP_004524181.1| hypothetical protein JDM601_292... 459 5e-127
gi|118472510|ref|YP_886295.1| hypothetical protein MSMEG_1929 [M... 436 5e-120
gi|315445552|ref|YP_004078431.1| hypothetical protein Mspyr1_400... 429 5e-118
gi|108798363|ref|YP_638560.1| hypothetical protein Mmcs_1392 [My... 426 3e-117
gi|145225255|ref|YP_001135933.1| hypothetical protein Mflv_4677 ... 425 8e-117
gi|120402789|ref|YP_952618.1| hypothetical protein Mvan_1790 [My... 424 2e-116
gi|126434047|ref|YP_001069738.1| hypothetical protein Mjls_1446 ... 421 2e-115
gi|167968330|ref|ZP_02550607.1| conserved alanine and valine ric... 405 9e-111
gi|169630613|ref|YP_001704262.1| hypothetical protein MAB_3532 [... 327 2e-87
gi|226365825|ref|YP_002783608.1| hypothetical protein ROP_64160 ... 298 8e-79
gi|325675726|ref|ZP_08155410.1| hypothetical protein HMPREF0724_... 283 5e-74
gi|312140645|ref|YP_004007981.1| hypothetical protein REQ_33050 ... 281 1e-73
gi|226305683|ref|YP_002765643.1| hypothetical protein RER_21960 ... 280 4e-73
gi|111023318|ref|YP_706290.1| hypothetical protein RHA1_ro06355 ... 269 5e-70
gi|229489582|ref|ZP_04383445.1| conserved hypothetical protein [... 264 2e-68
gi|343927605|ref|ZP_08767073.1| hypothetical protein GOALK_097_0... 229 5e-58
gi|54026547|ref|YP_120789.1| hypothetical protein nfa45740 [Noca... 222 9e-56
gi|134097655|ref|YP_001103316.1| hypothetical protein SACE_1059 ... 219 5e-55
gi|257054730|ref|YP_003132562.1| hypothetical protein Svir_06640... 212 8e-53
gi|291008800|ref|ZP_06566773.1| hypothetical protein SeryN2_3011... 207 2e-51
gi|333921426|ref|YP_004495007.1| hypothetical protein AS9A_3769 ... 206 6e-51
gi|326383297|ref|ZP_08204985.1| hypothetical protein SCNU_10184 ... 200 5e-49
gi|262203394|ref|YP_003274602.1| hypothetical protein Gbro_3516 ... 197 3e-48
gi|256374970|ref|YP_003098630.1| hypothetical protein Amir_0823 ... 196 4e-48
gi|300783034|ref|YP_003763325.1| hypothetical protein AMED_1107 ... 186 5e-45
gi|296138843|ref|YP_003646086.1| hypothetical protein Tpau_1115 ... 186 8e-45
gi|331694933|ref|YP_004331172.1| hypothetical protein Psed_1068 ... 181 3e-43
gi|324998439|ref|ZP_08119551.1| hypothetical protein PseP1_06707... 179 1e-42
gi|317507499|ref|ZP_07965224.1| hypothetical protein HMPREF9336_... 175 1e-41
gi|172040178|ref|YP_001799892.1| putative secreted protein [Cory... 175 2e-41
gi|237786052|ref|YP_002906757.1| hypothetical protein ckrop_1485... 173 4e-41
gi|337290207|ref|YP_004629228.1| hypothetical protein CULC22_005... 172 1e-40
gi|302524384|ref|ZP_07276726.1| predicted protein [Streptomyces ... 171 2e-40
gi|334696328|gb|AEG81125.1| putative secreted protein [Corynebac... 171 3e-40
gi|19551996|ref|NP_599998.1| hypothetical protein NCgl0736 [Cory... 167 4e-39
gi|344043729|gb|EGV39417.1| hypothetical protein CgS9114_13161 [... 166 7e-39
>gi|15610348|ref|NP_217728.1| hypothetical protein Rv3212 [Mycobacterium tuberculosis H37Rv]
gi|15842799|ref|NP_337836.1| hypothetical protein MT3308 [Mycobacterium tuberculosis CDC1551]
gi|148663073|ref|YP_001284596.1| hypothetical protein MRA_3251 [Mycobacterium tuberculosis H37Ra]
62 more sequence titles
Length=407
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/407 (100%), Positives = 407/407 (100%), Gaps = 0/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL
Sbjct 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV
Sbjct 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR
Sbjct 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP
Sbjct 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS
Sbjct 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG
Sbjct 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG
Sbjct 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
>gi|31794390|ref|NP_856883.1| hypothetical protein Mb3238 [Mycobacterium bovis AF2122/97]
gi|31619986|emb|CAD95330.1| CONSERVED HYPOTHETICAL ALANINE VALINE RICH PROTEIN [Mycobacterium
bovis AF2122/97]
Length=407
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/407 (99%), Positives = 406/407 (99%), Gaps = 0/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL
Sbjct 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV
Sbjct 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR
Sbjct 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP
Sbjct 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
IQRIVPEPG RPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS
Sbjct 241 IQRIVPEPGARPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG
Sbjct 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG
Sbjct 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
>gi|121639099|ref|YP_979323.1| hypothetical protein BCG_3239 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224991591|ref|YP_002646280.1| hypothetical alanine valine rich protein [Mycobacterium bovis
BCG str. Tokyo 172]
gi|121494747|emb|CAL73228.1| Conserved hypothetical alanine valine rich protein [Mycobacterium
bovis BCG str. Pasteur 1173P2]
gi|224774706|dbj|BAH27512.1| hypothetical alanine valine rich protein [Mycobacterium bovis
BCG str. Tokyo 172]
gi|341603138|emb|CCC65816.1| conserved hypothetical alanine valine rich protein [Mycobacterium
bovis BCG str. Moreau RDJ]
Length=407
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/407 (99%), Positives = 405/407 (99%), Gaps = 0/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL
Sbjct 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV
Sbjct 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR
Sbjct 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQA LRLVLLRPGKEDDEP
Sbjct 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQAGLRLVLLRPGKEDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
IQRIVPEPG RPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS
Sbjct 241 IQRIVPEPGARPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG
Sbjct 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG
Sbjct 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
>gi|240172121|ref|ZP_04750780.1| hypothetical protein MkanA1_22589 [Mycobacterium kansasii ATCC
12478]
Length=407
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/407 (85%), Positives = 367/407 (91%), Gaps = 0/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ DI AAA I VVVAV A+LIWWTSDARATISRPAA P P PAREVP++L
Sbjct 1 MVKPERRTRGDILAAAAIVVVVAVVAALIWWTSDARATISRPAATPAPNPTPAREVPSTL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLW+A SPA+RVPV VGGTV TGDGR VDGRDPATGE+LWSYARDTDLCGVTWVY YAV
Sbjct 61 KQLWSAPSPASRVPVAVGGTVVTGDGRHVDGRDPATGETLWSYARDTDLCGVTWVYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGCGQVSTIDGSTGRRG ARS YADPRVRL SDGTTVLSAG TRLELWRSDMVR
Sbjct 121 AVYRDDRGCGQVSTIDGSTGRRGPARSSYADPRVRLSSDGTTVLSAGRTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YGE DARVKPSNRGL SGCTLESAAASS++V+VLEAC NQAD+RLVLLRPGKE+DEP
Sbjct 181 MLSYGETDARVKPSNRGLHSGCTLESAAASSSSVAVLEACANQADVRLVLLRPGKEEDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
QRIVPEPG+RPGSGARVL VSQNNTAVYLP SGAQPRVDVIDETG TV+STLL KPPS
Sbjct 241 EQRIVPEPGIRPGSGARVLTVSQNNTAVYLPGESGAQPRVDVIDETGTTVASTLLPKPPS 300
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
A S++G+LVTWWTGDALLVFD+G LTQRYTIAAGET APVGPGVMMAGQL+VPVTG
Sbjct 301 PEATVSQSGSLVTWWTGDALLVFDSGKLTQRYTIAAGETAAPVGPGVMMAGQLIVPVTGA 360
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSGANNRYIPV RPPS SAVIP V GSRVIEQRGD++VALG
Sbjct 361 IGVYDPVSGANNRYIPVDRPPSGSAVIPVVVGSRVIEQRGDSVVALG 407
>gi|15827354|ref|NP_301617.1| hypothetical protein ML0810 [Mycobacterium leprae TN]
gi|221229832|ref|YP_002503248.1| hypothetical protein MLBr_00810 [Mycobacterium leprae Br4923]
gi|13092903|emb|CAC30320.1| putative membrane protein [Mycobacterium leprae]
gi|219932939|emb|CAR70905.1| putative membrane protein [Mycobacterium leprae Br4923]
Length=407
Score = 617 bits (1592), Expect = 8e-175, Method: Compositional matrix adjust.
Identities = 325/407 (80%), Positives = 346/407 (86%), Gaps = 0/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRTK D AA TI VV+A SLIWWTSDA+AT SRPA + P P PAREVPT+
Sbjct 1 MVRPERRTKADTIAAMTITVVMAAMVSLIWWTSDAQATHSRPATIPAPNPTPAREVPTAF 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
QLW AASPAT PVVVGG V TGDG Q+DGR+P TGES WSYARD+DLCGV+WVYHYAV
Sbjct 61 NQLWAAASPATTAPVVVGGAVITGDGHQIDGRNPVTGESRWSYARDSDLCGVSWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGCGQVSTIDGSTGRR AARS YADP VRL SDGT VLSAGDTRLELWRSDMVR
Sbjct 121 AVYRDDRGCGQVSTIDGSTGRREAARSSYADPHVRLSSDGTAVLSAGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
MLAYGEIDARVKP RGL SGCTLES AASS+AV+VLEAC NQ DL+LVLLRPGKEDDEP
Sbjct 181 MLAYGEIDARVKPPARGLHSGCTLESTAASSSAVAVLEACANQDDLQLVLLRPGKEDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q +V EP VR GSGARVL VS +TAVYLP +G QPRVDVIDETG TV+STLL KPPS
Sbjct 241 QQHLVAEPRVRSGSGARVLTVSDTHTAVYLPGEAGTQPRVDVIDETGTTVASTLLTKPPS 300
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
+SAV S+ GNLVTWWTGD L+VF+ NLT RYTIAAGETTAPVGPGVMMAGQLLVPVTG
Sbjct 301 SSAVVSQAGNLVTWWTGDTLMVFNQSNLTLRYTIAAGETTAPVGPGVMMAGQLLVPVTGK 360
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYD SGANNRYIPV RPPS+SAVIPAVSGS V EQRGDTLVALG
Sbjct 361 IGVYDLFSGANNRYIPVRRPPSSSAVIPAVSGSTVFEQRGDTLVALG 407
>gi|183981368|ref|YP_001849659.1| hypothetical protein MMAR_1345 [Mycobacterium marinum M]
gi|183174694|gb|ACC39804.1| conserved alanine and valine rich protein [Mycobacterium marinum
M]
Length=404
Score = 593 bits (1530), Expect = 1e-167, Method: Compositional matrix adjust.
Identities = 327/407 (81%), Positives = 358/407 (88%), Gaps = 3/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ DI AAA I VV+A+ LIWWTSDARAT+SRPAA P P+PAREVP +L
Sbjct 1 MVKPERRTRGDILAAAAIVVVIALVTLLIWWTSDARATVSRPAAAPAPNPSPAREVPGTL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTA SPATRVPVV GGTVATG GR V+GR+P TGE+LWSY+RDTDLCGV+WVY YAV
Sbjct 61 KQLWTANSPATRVPVVAGGTVATGTGRLVEGRNPTTGETLWSYSRDTDLCGVSWVYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGCGQVSTIDGSTGRRG ARS YADP+VRL SDGTTVLSAG TRLELWRSDMVR
Sbjct 121 AVYRDDRGCGQVSTIDGSTGRRGPARSSYADPKVRLSSDGTTVLSAGSTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YGE DARVKP+NRGL SGCTLESAAASS+AVSVLEAC +QADLRLVLLRPGKE+DEP
Sbjct 181 MLSYGETDARVKPANRGLHSGCTLESAAASSSAVSVLEACQDQADLRLVLLRPGKEEDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
QRIVPEPG+R GSGARVL+VSQNNTAVYLP+ QP V+VIDETG T +STLL KPPS
Sbjct 241 EQRIVPEPGIRAGSGARVLIVSQNNTAVYLPS---PQPHVEVIDETGTTTASTLLPKPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
+AV S+TGNLVTWWTGDALLVF+ G LTQRYTIAAG+T AP+GPGVMMAGQLLVPVTG
Sbjct 298 PAAVVSQTGNLVTWWTGDALLVFNTGKLTQRYTIAAGDTAAPLGPGVMMAGQLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
+GVYDPVSGA+ RYIPV R P+TSAVIP V GS VIEQRGD +VALG
Sbjct 358 LGVYDPVSGASIRYIPVDRQPNTSAVIPVVVGSTVIEQRGDAVVALG 404
>gi|118618016|ref|YP_906348.1| hypothetical protein MUL_2534 [Mycobacterium ulcerans Agy99]
gi|118570126|gb|ABL04877.1| conserved alanine and valine rich protein [Mycobacterium ulcerans
Agy99]
Length=404
Score = 588 bits (1516), Expect = 5e-166, Method: Compositional matrix adjust.
Identities = 324/407 (80%), Positives = 357/407 (88%), Gaps = 3/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ DI AAA I VV+A+ LIWWTSDARAT+SRPAA P P+PAREVP +L
Sbjct 1 MVKPERRTRGDILAAAAIVVVIALVTLLIWWTSDARATVSRPAAAPAPNPSPAREVPGTL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTA SPATRVP+V GGTVATG GR V+GR+P TGE+LWSY+RDTDLCGV+WVY YAV
Sbjct 61 KQLWTANSPATRVPMVAGGTVATGAGRLVEGRNPTTGETLWSYSRDTDLCGVSWVYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGCGQVSTIDGSTGRRG ARS YADP+VRL SDGTTVLSAG TRLELWRSDMVR
Sbjct 121 AVYRDDRGCGQVSTIDGSTGRRGPARSSYADPKVRLSSDGTTVLSAGSTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YGE DARVK +NRGL SGCTLESAAASS+AVSVLEAC +QADLRLVLLRPGKE+DEP
Sbjct 181 MLSYGETDARVKAANRGLHSGCTLESAAASSSAVSVLEACQDQADLRLVLLRPGKEEDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
QRIVPEPG+R GSGARVL+VSQNNTAVYLP+ QP V+VIDETG T +STLL KPPS
Sbjct 241 EQRIVPEPGIRAGSGARVLIVSQNNTAVYLPS---PQPHVEVIDETGTTTASTLLPKPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
+AV S+TGNLVTWWTGDALLVF+ G LTQRYTIAAG+T AP+GPGVMMAGQ+LVPVTG
Sbjct 298 PAAVVSQTGNLVTWWTGDALLVFNTGKLTQRYTIAAGDTAAPLGPGVMMAGQVLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
+GVYDPVSGA+ RYIPV R P+TSAVIP V GS VIEQRGD +VALG
Sbjct 358 LGVYDPVSGASIRYIPVDRQPNTSAVIPVVVGSTVIEQRGDAVVALG 404
>gi|296169011|ref|ZP_06850677.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896353|gb|EFG76009.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=404
Score = 572 bits (1475), Expect = 3e-161, Method: Compositional matrix adjust.
Identities = 309/407 (76%), Positives = 346/407 (86%), Gaps = 3/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRT+ D+ AAA IAVVVA A+LIWWTSDARATISRPAA P P PAR+VP +L
Sbjct 1 MVRPERRTRGDVLAAAAIAVVVAAVAALIWWTSDARATISRPAAEPAPNPTPARQVPATL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
QLWTAASPAT PV VGGTV TGDGR+VDGRDP +G+S WSY RD+ LCGV+WVYHYAV
Sbjct 61 GQLWTAASPATTAPVTVGGTVITGDGRRVDGRDPGSGQSRWSYERDSALCGVSWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGCGQVST+DGSTGRRG ARSGYAD VRL SDGTTVLSAGDT +ELWRSDMVR
Sbjct 121 AVYRDDRGCGQVSTLDGSTGRRGPARSGYADQHVRLSSDGTTVLSAGDTHVELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
M+AYGE DARVKPS RGL SGC L SAAASS+AVSVLE+C NQAD+RLVLLRPGK+DDEP
Sbjct 181 MVAYGETDARVKPSARGLHSGCKLVSAAASSSAVSVLESCVNQADVRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
QRIV EPG+ SGARVL V +NNTAVYLP+ +PRVDVIDETG TV+STLL KPPS
Sbjct 241 QQRIVDEPGITADSGARVLAVWENNTAVYLPS---PRPRVDVIDETGTTVASTLLPKPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
+ S G+L+TWWTGD++LVF+ GNL+ RYTIAAG+ TAP+GPG MMAG+LL+PVTG
Sbjct 298 SRGAVSHAGSLITWWTGDSVLVFETGNLSLRYTIAAGDKTAPLGPGAMMAGRLLIPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSGAN RYIPV R PS SAV+PAV+GSRV EQRGDT+VALG
Sbjct 358 IGVYDPVSGANERYIPVHRAPSESAVVPAVAGSRVFEQRGDTVVALG 404
>gi|342861170|ref|ZP_08717819.1| hypothetical protein MCOL_19902 [Mycobacterium colombiense CECT
3035]
gi|342131614|gb|EGT84884.1| hypothetical protein MCOL_19902 [Mycobacterium colombiense CECT
3035]
Length=408
Score = 552 bits (1423), Expect = 3e-155, Method: Compositional matrix adjust.
Identities = 313/411 (77%), Positives = 341/411 (83%), Gaps = 7/411 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRTK D+ AAA IAVVVA A+LIWW+SDARAT SRPAAV P PAPA++VP L
Sbjct 1 MVRPERRTKGDMLAAAAIAVVVAAVAALIWWSSDARATSSRPAAVPAPNPAPAKQVPAGL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLW+AASPAT PVV GG V T GRQ+DGRDP TG+S WSYARD DLCGVTW+Y YAV
Sbjct 61 KQLWSAASPATTAPVVAGGEVVTAAGRQIDGRDPGTGQSRWSYARDIDLCGVTWLYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
VYR DRGCGQVST+DGSTGRRG ARSGYADP+VRL SDG TVLS GDTRLELWRSDMVR
Sbjct 121 GVYRDDRGCGQVSTLDGSTGRRGPARSGYADPKVRLSSDGMTVLSVGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
M+AYGE DARVKPS RGL SGC L SAAASS AVSVLE+C NQADLRLVLLRPGK+DDEP
Sbjct 181 MVAYGETDARVKPSTRGLHSGCKLISAAASSQAVSVLESCANQADLRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q IV EPG+ P SGARVL V++N TAVYLPA QPRVDVID+TGATVSST L KPPS
Sbjct 241 QQHIVAEPGITPDSGARVLTVAENTTAVYLPA---PQPRVDVIDQTGATVSSTPLPKPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
SAV S G+ VTWWTGD+L+VFDA L RYTIAAG+ TAP+GPGVMMAG+LLVPVTG
Sbjct 298 RSAVVSHPGSQVTWWTGDSLMVFDANTLALRYTIAAGDRTAPLGPGVMMAGRLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTR----PPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDP+SGAN RYIPV R PP+ V PAV GSRV+EQRGDTLVALG
Sbjct 358 IGVYDPLSGANERYIPVDRSTSAPPAGKPVFPAVIGSRVLEQRGDTLVALG 408
>gi|336459519|gb|EGO38456.1| hypothetical protein MAPs_02470 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=408
Score = 546 bits (1406), Expect = 3e-153, Method: Compositional matrix adjust.
Identities = 312/411 (76%), Positives = 345/411 (84%), Gaps = 7/411 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRT+ DI AAATIAVVVA A+LIWWTSDARAT+SRPAAV P P+PAR+VP SL
Sbjct 1 MVRPERRTRGDIVAAATIAVVVAATAALIWWTSDARATVSRPAAVPAPNPSPARQVPASL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPAT PVVVGG VATG GR++DGRDPATG+S WSYARDTDLCG++W+Y YAV
Sbjct 61 KQLWTAASPATTEPVVVGGVVATGAGRRIDGRDPATGQSRWSYARDTDLCGLSWLYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
VYR DRGCGQVST+DGSTGRRG ARSGYADPRVRL SDG TVLS GDTRLELWRSDMVR
Sbjct 121 GVYRDDRGCGQVSTLDGSTGRRGPARSGYADPRVRLSSDGMTVLSVGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML YGE DARVKPS RGL SGC L SAAASS+AVSVLE+C NQADLRLVLLRPGK+DDEP
Sbjct 181 MLTYGETDARVKPSARGLHSGCRLISAAASSSAVSVLESCANQADLRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q IV EPG+ P SGARVL V + TAVYLPA +PRVDVIDETG TVSST L +PPS
Sbjct 241 QQHIVAEPGIAPDSGARVLAVREATTAVYLPA---PRPRVDVIDETGTTVSSTPLPRPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
SA S G+++TWWTGD+L+VFD L RYTIAAG+ TAP+GPG MMAG+LLVPVTG
Sbjct 298 PSAAVSHAGSVMTWWTGDSLMVFDVNTLALRYTIAAGDKTAPLGPGAMMAGRLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTR----PPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDP+SGA+ RYIPV R P +AV+PAVSGSRV EQRGDT+VALG
Sbjct 358 IGVYDPISGASERYIPVDRATSGAPPGAAVVPAVSGSRVFEQRGDTVVALG 408
>gi|254776599|ref|ZP_05218115.1| hypothetical protein MaviaA2_18291 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=408
Score = 545 bits (1405), Expect = 4e-153, Method: Compositional matrix adjust.
Identities = 312/411 (76%), Positives = 345/411 (84%), Gaps = 7/411 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRT+ DI AAATIAVVVA A+LIWWTSDARAT+SRPAAV P P+PAR+VP SL
Sbjct 1 MVRPERRTRGDIVAAATIAVVVAATAALIWWTSDARATVSRPAAVPAPNPSPARQVPASL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPAT PVVVGG VATG GR++DGRDPATG+S WSYARDTDLCG++W+Y YAV
Sbjct 61 KQLWTAASPATTEPVVVGGVVATGAGRRIDGRDPATGQSRWSYARDTDLCGLSWLYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
VYR DRGCGQVST+DGSTGRRG ARSGYADPRVRL SDG TVLS GDTRLELWRSDMVR
Sbjct 121 GVYRDDRGCGQVSTLDGSTGRRGPARSGYADPRVRLSSDGMTVLSVGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML YGE DARVKPS RGL SGC L SAAASS+AVSVLE+C NQADLRLVLLRPGK+DDEP
Sbjct 181 MLTYGETDARVKPSARGLHSGCRLISAAASSSAVSVLESCANQADLRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q IV EPG+ P SGARVL V + TAVYLPA +PRVDVIDETG TVSST L +PPS
Sbjct 241 QQHIVAEPGIAPDSGARVLAVREATTAVYLPA---PRPRVDVIDETGTTVSSTPLPRPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
SA S G+++TWWTGD+L+VFD L RYTIAAG+ TAP+GPG MMAG+LLVPVTG
Sbjct 298 PSAAVSHAGSVMTWWTGDSLMVFDVNTLALRYTIAAGDKTAPLGPGAMMAGRLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTR----PPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDP+SGA+ RYIPV R P +AV+PAVSGSRV EQRGDT+VALG
Sbjct 358 IGVYDPISGASERYIPVDRATGGAPPGAAVVPAVSGSRVFEQRGDTVVALG 408
>gi|41409411|ref|NP_962247.1| hypothetical protein MAP3313 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398242|gb|AAS05863.1| hypothetical protein MAP_3313 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=408
Score = 544 bits (1401), Expect = 1e-152, Method: Compositional matrix adjust.
Identities = 311/411 (76%), Positives = 344/411 (84%), Gaps = 7/411 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRT+ DI AAATIAVVVA A+LIWWTSDARAT+SRPAAV P P+PAR+VP SL
Sbjct 1 MVRPERRTRGDIVAAATIAVVVAATAALIWWTSDARATVSRPAAVPAPNPSPARQVPASL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPAT PVVVGG VATG GR++DGRDPATG+S WSYARDTDLCG++W+Y YAV
Sbjct 61 KQLWTAASPATTEPVVVGGVVATGAGRRIDGRDPATGQSRWSYARDTDLCGLSWLYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
VYR DRGCGQVST+DGSTGRRG ARSGYADPRVRL SDG TVLS GDTRLELWRSDMVR
Sbjct 121 GVYRDDRGCGQVSTLDGSTGRRGPARSGYADPRVRLSSDGMTVLSVGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML Y E DARVKPS RGL SGC L SAAASS+AVSVLE+C NQADLRLVLLRPGK+DDEP
Sbjct 181 MLTYSETDARVKPSARGLHSGCRLISAAASSSAVSVLESCANQADLRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q IV EPG+ P SGARVL V + TAVYLPA +PRVDVIDETG TVSST L +PPS
Sbjct 241 QQHIVAEPGIAPDSGARVLAVREATTAVYLPA---PRPRVDVIDETGTTVSSTPLPRPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
SA S G+++TWWTGD+L+VFD L RYTIAAG+ TAP+GPG MMAG+LLVPVTG
Sbjct 298 PSAAVSHAGSVMTWWTGDSLMVFDVNTLALRYTIAAGDKTAPLGPGAMMAGRLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTR----PPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDP+SGA+ RYIPV R P +AV+PAVSGSRV EQRGDT+VALG
Sbjct 358 IGVYDPISGASERYIPVDRATSGAPPGAAVVPAVSGSRVFEQRGDTVVALG 408
>gi|254822020|ref|ZP_05227021.1| hypothetical protein MintA_18952 [Mycobacterium intracellulare
ATCC 13950]
Length=408
Score = 540 bits (1392), Expect = 1e-151, Method: Compositional matrix adjust.
Identities = 312/411 (76%), Positives = 340/411 (83%), Gaps = 7/411 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRTK DI AAATIAVVVAV A+LIWWTSDARAT SRPAAV P P+PAR+VP L
Sbjct 1 MVRPERRTKRDILAAATIAVVVAVTAALIWWTSDARATSSRPAAVPAPNPSPARQVPAGL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTA+SPAT PVVVGG V TG G ++DGRD TG+S WSYARDT LCGV+W+Y YAV
Sbjct 61 KQLWTASSPATTEPVVVGGVVVTGAGVRIDGRDAGTGQSRWSYARDTSLCGVSWLYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
VYR DRGCGQVST+DGSTG RG ARSGYADPRVRL SDG TVLS GDTRLELWRSDMVR
Sbjct 121 GVYRDDRGCGQVSTLDGSTGHRGPARSGYADPRVRLSSDGMTVLSVGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
M+AYGE DARVKPS+RGL SGC L SAAASS+AVSVLE+C NQADLRLVLLRPGK+DDEP
Sbjct 181 MVAYGETDARVKPSSRGLHSGCRLISAAASSSAVSVLESCANQADLRLVLLRPGKDDDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
Q +V EPG+ P SGARVL V TAVYLPA QPRVDVIDETGATV+STLL KPPS
Sbjct 241 QQHVVAEPGIAPDSGARVLAVRDTTTAVYLPA---PQPRVDVIDETGATVASTLLPKPPS 297
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
SA S G+L+TWWTGD+L+VFD L RYTIAAG+ TAP+GPGVMMAG+LLVPVTG
Sbjct 298 RSAAVSHAGSLMTWWTGDSLMVFDVNTLALRYTIAAGDKTAPLGPGVMMAGRLLVPVTGA 357
Query 361 IGVYDPVSGANNRYIPVTR----PPSTSAVIPAVSGSRVIEQRGDTLVALG 407
IGVYDPVSG RYIPV R PP AV+PAVSGSRV EQRGDT+VALG
Sbjct 358 IGVYDPVSGTGERYIPVDRAANPPPPGRAVVPAVSGSRVFEQRGDTVVALG 408
>gi|118462952|ref|YP_883307.1| hypothetical protein MAV_4160 [Mycobacterium avium 104]
gi|118164239|gb|ABK65136.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=396
Score = 532 bits (1371), Expect = 4e-149, Method: Compositional matrix adjust.
Identities = 292/384 (77%), Positives = 322/384 (84%), Gaps = 7/384 (1%)
Query 28 LIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGR 87
LIWWTSDARAT+SRPAAV P P+PAR+VP SLKQLWTAASPAT PVVVGG VATG GR
Sbjct 16 LIWWTSDARATVSRPAAVPAPNPSPARQVPASLKQLWTAASPATTEPVVVGGVVATGAGR 75
Query 88 QVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARS 147
++DGRDPATG+S WSYARDTDLCG++W+Y YAV VYR DRGCGQVST+DGSTGRRG ARS
Sbjct 76 RIDGRDPATGQSRWSYARDTDLCGLSWLYRYAVGVYRDDRGCGQVSTLDGSTGRRGPARS 135
Query 148 GYADPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESA 207
GYADPRVRL SDG TVLS GDTRLELWRSDMVRML YGE DARVKPS RGL SGC L SA
Sbjct 136 GYADPRVRLSSDGMTVLSVGDTRLELWRSDMVRMLTYGETDARVKPSARGLHSGCRLISA 195
Query 208 AASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTA 267
AASS+AVSVLE+C NQADLRLVLLRPGK+DDEP Q IV EPG+ P SGARVL V + TA
Sbjct 196 AASSSAVSVLESCANQADLRLVLLRPGKDDDEPQQHIVAEPGIAPDSGARVLAVREATTA 255
Query 268 VYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGN 327
VYLPA +PRVDVIDETG TVSST L +PPS SA S G+++TWWTGD+L+VFD
Sbjct 256 VYLPA---PRPRVDVIDETGTTVSSTPLPRPPSPSAAVSHAGSVMTWWTGDSLMVFDVNT 312
Query 328 LTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTR----PPST 383
L RYTIAAG+ TAP+GPG MMAG+LLVPVTG IGVYDP+SGA+ RYIPV R P
Sbjct 313 LALRYTIAAGDKTAPLGPGAMMAGRLLVPVTGAIGVYDPISGASERYIPVDRATGGAPPG 372
Query 384 SAVIPAVSGSRVIEQRGDTLVALG 407
+AV+PAVSGSRV EQRGDT+VALG
Sbjct 373 AAVVPAVSGSRVFEQRGDTVVALG 396
>gi|333991567|ref|YP_004524181.1| hypothetical protein JDM601_2927 [Mycobacterium sp. JDM601]
gi|333487535|gb|AEF36927.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=411
Score = 459 bits (1180), Expect = 5e-127, Method: Compositional matrix adjust.
Identities = 263/414 (64%), Positives = 311/414 (76%), Gaps = 10/414 (2%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MV+PERRT D+ AAA IAVVV VA +IWWTSDARAT+SRPA TP A VP +L
Sbjct 1 MVRPERRTAGDVLAAAVIAVVVVVAGVIIWWTSDARATVSRPALDEATTPQSAAMVPAAL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASP TR PV+V GTV TG+G Q+ GRDPATGE WSYARD DLCGV+W+Y YAV
Sbjct 61 KQLWTAASPVTRAPVMVSGTVITGEGPQLTGRDPATGEERWSYARDVDLCGVSWIYRYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVY RGCGQVST+ +TGRRG AR+ YAD V + S+G++VLSAG TRLELWRSDMVR
Sbjct 121 AVYPDSRGCGQVSTVTAATGRRGPARTSYADRSVNVTSEGSSVLSAGSTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
+L+YGEIDARVKP+ RG GCTL SAAA+S+AVSVLEAC Q DL+L LLR GKE+DEP
Sbjct 181 VLSYGEIDARVKPTARGRGQGCTLVSAAAASSAVSVLEACPGQDDLQLTLLRAGKEEDEP 240
Query 241 IQRIVPEPGVRPGSGARVLVV----SQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLA 296
+ VP+PGV SGARVL V S NTAVYLP QPRV+V+D+TG T+++T+L
Sbjct 241 ETQHVPQPGVAADSGARVLTVTDADSGTNTAVYLPT---PQPRVEVVDQTGTTIATTMLP 297
Query 297 KPPSTSAVA---SRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQL 353
PS ++ A SR LV WWTGDA++VFD+G LT RYTI A P+GP MMA +L
Sbjct 298 AKPSPASAALQVSRPTGLVCWWTGDAVMVFDSGTLTYRYTIPAAGAAVPLGPAAMMADRL 357
Query 354 LVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
L+PVTGG+GV++ +GA R IPV+RP AV PAVSG V+EQRGDT+VALG
Sbjct 358 LIPVTGGVGVFNQRTGAAERVIPVSRPAGVRAVFPAVSGPMVLEQRGDTVVALG 411
>gi|118472510|ref|YP_886295.1| hypothetical protein MSMEG_1929 [Mycobacterium smegmatis str.
MC2 155]
gi|118173797|gb|ABK74693.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=405
Score = 436 bits (1120), Expect = 5e-120, Method: Compositional matrix adjust.
Identities = 250/407 (62%), Positives = 298/407 (74%), Gaps = 4/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ DI AAA I VVVAVAA LIWWTSDARAT+SRPAA VP PA EVP L
Sbjct 1 MVKPERRTRGDIVAAAVIVVVVAVAAGLIWWTSDARATLSRPAAAPVPYLTPAAEVPAGL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
++WTA SP T PVV GG V TGDGR VDGRDP +G LW+YARD DLCGVT VY YAV
Sbjct 61 HEMWTAPSPKTTAPVVAGGAVVTGDGRTVDGRDPISGAVLWTYARDADLCGVTSVYSYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVY RGCGQVSTIDG TG RG AR+ +ADP V+L +DG TVLS GD+RLELWRSDMVR
Sbjct 121 AVYPDVRGCGQVSTIDGRTGMRGPARTAFADPEVKLSTDGVTVLSGGDSRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YG +DAR+KP + C SAAASS+AVSV+E+C ++RL LLRP E+D P
Sbjct 181 MLSYGALDARIKP-DVPASPVCRQLSAAASSSAVSVIESCPKTDEVRLTLLRPADEEDTP 239
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
R V PGV SGA+V+ VS TA+Y+P +P+VDVIDETGATV+ST+L KP +
Sbjct 240 DLRYVELPGVTDESGAQVIAVSDTTTAIYVPT---PEPKVDVIDETGATVASTILPKPAA 296
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
+ +R G+LVTWWTGD+++VFDA L +YT++ APVGP MAG+LLVPV G
Sbjct 297 PQSTTTRAGDLVTWWTGDSVMVFDAAGLRYKYTVSPAGPHAPVGPATAMAGKLLVPVDDG 356
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
V+DP +G +R+I + R PS S V+PAV+GS V+EQRG LVALG
Sbjct 357 YDVFDPETGTGDRHISLPRTPSVSPVVPAVAGSIVLEQRGTELVALG 403
>gi|315445552|ref|YP_004078431.1| hypothetical protein Mspyr1_40090 [Mycobacterium sp. Spyr1]
gi|315263855|gb|ADU00597.1| hypothetical protein Mspyr1_40090 [Mycobacterium sp. Spyr1]
Length=390
Score = 429 bits (1102), Expect = 5e-118, Method: Compositional matrix adjust.
Identities = 226/387 (59%), Positives = 280/387 (73%), Gaps = 5/387 (1%)
Query 21 VVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGT 80
V+ + A+++WWTSDARAT S PAA +P+ PA VP SL QLWTA S T P+VVGG
Sbjct 9 VIVLVAAVVWWTSDARATRSTPAAEPLPSLKPAVAVPDSLTQLWTARSGETTRPLVVGGA 68
Query 81 VATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTG 140
V TGDG+ + GR+PATG+ +WSYARD DLCGVTWVY+YAVAVY RGCGQVST+D TG
Sbjct 69 VVTGDGQAMQGREPATGDVVWSYARDLDLCGVTWVYNYAVAVYPDARGCGQVSTVDAKTG 128
Query 141 RRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQS 200
RRG AR+ YAD V L DG+TVLS G+TRLE+WRSDMVRML+YG +DA +KP
Sbjct 129 RRGPARTSYADRHVTLSGDGSTVLSQGETRLEMWRSDMVRMLSYGALDAPIKPGVPATPL 188
Query 201 GCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLV 260
C SA ASS +VSVLEAC +Q DLRL LLRP E+D P + V + GV G+GARV+
Sbjct 189 -CRFVSAGASSDSVSVLEACESQ-DLRLTLLRPSDEEDSPEAKYVQQTGVADGTGARVVA 246
Query 261 VSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDAL 320
VS TA+Y+P +PR+D+ID+TGATV +T++ PPS A S+ G+LVTWWTGDAL
Sbjct 247 VSDTYTALYVPT---PKPRLDLIDDTGATVDTTIVDGPPSPEATMSKAGDLVTWWTGDAL 303
Query 321 LVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRP 380
+VF +L +YT+AA APVGP +MAG+LLVPVT G V+DP +G ++IPV RP
Sbjct 304 MVFSGNDLRYKYTVAATGPDAPVGPATIMAGKLLVPVTTGYDVFDPETGTGEKHIPVQRP 363
Query 381 PSTSAVIPAVSGSRVIEQRGDTLVALG 407
P V+PAV+G+ V+E RGD LVALG
Sbjct 364 PVDGPVVPAVAGTTVLELRGDDLVALG 390
>gi|108798363|ref|YP_638560.1| hypothetical protein Mmcs_1392 [Mycobacterium sp. MCS]
gi|119867460|ref|YP_937412.1| hypothetical protein Mkms_1410 [Mycobacterium sp. KMS]
gi|108768782|gb|ABG07504.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119693549|gb|ABL90622.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=403
Score = 426 bits (1096), Expect = 3e-117, Method: Compositional matrix adjust.
Identities = 253/407 (63%), Positives = 305/407 (75%), Gaps = 4/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ D+ AA IA V+AV A+L+WWTSDARATIS+PA AVP P PA +VP++L
Sbjct 1 MVKPERRTRGDVMAAVAIAAVIAVIAALVWWTSDARATISQPAEQAVPDPEPAADVPSAL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
++LW+AASP TR+PVVVGG+V TGDG V GRDPATG+++WSY+RD DLCGVT VYHYAV
Sbjct 61 RELWSAASPKTRLPVVVGGSVVTGDGSTVAGRDPATGDTVWSYSRDVDLCGVTSVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVY RGCGQVSTI+G TG+RG AR+G+ADP V L +DGTTVLSAG++RLELWRSDMVR
Sbjct 121 AVYPDSRGCGQVSTINGRTGKRGNARTGFADPAVTLSTDGTTVLSAGESRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YG +DARVKP + C L SAAASS+AVSVLEAC + DLRL LLRP E+D P
Sbjct 181 MLSYGALDARVKP-DVPAAPLCRLTSAAASSSAVSVLEACPKEPDLRLTLLRPSDEEDVP 239
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
+ V V S ARV+ VS+ TAVYLP QP V+VIDETG TV+STL+ P S
Sbjct 240 DIKYVELADVPADSDARVIAVSETTTAVYLPT---PQPTVNVIDETGTTVASTLMTSPAS 296
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
AVASR G+L+TWWTGD+++VFD L +YT+ + P+GP MA +LLVPVT G
Sbjct 297 PDAVASRAGDLITWWTGDSVMVFDGSGLRYKYTVTPSGPSLPLGPAAEMADRLLVPVTDG 356
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
V++P SG R+IP+ RPP AVIPAV G RV+E RG LV LG
Sbjct 357 YDVFEPGSGTGERHIPLARPPVDGAVIPAVVGDRVVELRGGELVGLG 403
>gi|145225255|ref|YP_001135933.1| hypothetical protein Mflv_4677 [Mycobacterium gilvum PYR-GCK]
gi|145217741|gb|ABP47145.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=378
Score = 425 bits (1092), Expect = 8e-117, Method: Compositional matrix adjust.
Identities = 224/382 (59%), Positives = 276/382 (73%), Gaps = 5/382 (1%)
Query 26 ASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGD 85
A+++WWTSDARAT S PAA +P+ PA VP SL QLW A S T P+VVGG V TGD
Sbjct 2 AAVVWWTSDARATRSTPAAEPLPSLKPAVAVPDSLTQLWAARSGETTRPLVVGGAVVTGD 61
Query 86 GRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAA 145
G+ + GR+PATG+ +WSYARD DLCGVTWVY+YAVAVY RGCGQVST+D TGRRG A
Sbjct 62 GQAMQGREPATGDVVWSYARDLDLCGVTWVYNYAVAVYPDARGCGQVSTVDAKTGRRGPA 121
Query 146 RSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLE 205
R+ YAD V L DG+TVLS G+TRLE+WRSDMVRML+YG +DA +KP C
Sbjct 122 RTSYADRHVTLSGDGSTVLSQGETRLEMWRSDMVRMLSYGALDAPIKPGVPATPL-CRFV 180
Query 206 SAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLVVSQNN 265
SA ASS +VSVLEAC +Q DLRL LLRP E+D P + V + GV G+GARV+ VS
Sbjct 181 SAGASSDSVSVLEACESQ-DLRLTLLRPSDEEDSPEAKYVQQTGVADGTGARVVAVSDTY 239
Query 266 TAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDA 325
TA+Y+P +PR+D+ID+TGATV +T++ PPS A S+ G+LVTWWTGDAL+VF
Sbjct 240 TALYVPT---PKPRLDLIDDTGATVDTTIVDGPPSPEATMSKAGDLVTWWTGDALMVFSG 296
Query 326 GNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSA 385
+L +YT+AA APVGP +MAG+LLVPVT G V+DP +G ++IPV RPP
Sbjct 297 NDLRYKYTVAATGPDAPVGPATIMAGKLLVPVTTGYDVFDPETGTGEKHIPVQRPPVDGP 356
Query 386 VIPAVSGSRVIEQRGDTLVALG 407
V+PAV+G+ V+E RGD LVALG
Sbjct 357 VVPAVAGTTVLELRGDDLVALG 378
>gi|120402789|ref|YP_952618.1| hypothetical protein Mvan_1790 [Mycobacterium vanbaalenii PYR-1]
gi|119955607|gb|ABM12612.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=403
Score = 424 bits (1089), Expect = 2e-116, Method: Compositional matrix adjust.
Identities = 248/409 (61%), Positives = 307/409 (76%), Gaps = 8/409 (1%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ D+ AAA IA VVA+ A+++WWTSDARAT SRPAA VP+ PA VP SL
Sbjct 1 MVKPERRTRADLVAAAAIAGVVALVAAVVWWTSDARATESRPAAEPVPSLKPAAAVPDSL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+Q WT S T +PVVVGG V TGDG+ ++GRDPATG +LW YARD +LCGVTWVY YAV
Sbjct 61 EQRWTTGSAKTTLPVVVGGAVVTGDGQAMEGRDPATGATLWRYARDLELCGVTWVYSYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVY RGCGQVST+D TG RG +R+ YAD V L +DGTTVLSAG TRLE+WRSDMVR
Sbjct 121 AVYPDVRGCGQVSTVDAKTGLRGPSRTSYADREVTLSADGTTVLSAGATRLEMWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSG--CTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDD 238
ML+YG +DA +KP G+ + C SA AS+++VSVLE+C +QADLRL LLRP E+D
Sbjct 181 MLSYGALDAPIKP---GVPASPLCRFVSAGASASSVSVLESCESQADLRLTLLRPSDEED 237
Query 239 EPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKP 298
P + V + GV GSGARV+ V+ + TA+Y+P +PR+D+ID+TGA + ST++ P
Sbjct 238 TPELKYVQQKGVADGSGARVVAVTDSATALYVPT---PKPRIDIIDDTGAVIDSTVVPAP 294
Query 299 PSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVT 358
PS A ASR G+LVTWWTGDAL+VF A NL +YT+AA + APVGP +MAGQLLVPVT
Sbjct 295 PSPDATASRVGDLVTWWTGDALMVFSANNLQYKYTVAASGSDAPVGPATIMAGQLLVPVT 354
Query 359 GGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
G V+DP++G +++IPV RP V+PAV+GS V+EQRGD LVALG
Sbjct 355 TGYDVFDPMTGTGDKHIPVQRPQVAGPVVPAVAGSTVLEQRGDELVALG 403
>gi|126434047|ref|YP_001069738.1| hypothetical protein Mjls_1446 [Mycobacterium sp. JLS]
gi|126233847|gb|ABN97247.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=403
Score = 421 bits (1081), Expect = 2e-115, Method: Compositional matrix adjust.
Identities = 252/407 (62%), Positives = 305/407 (75%), Gaps = 4/407 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRT+ D+ AA IA V+AV A+L+WWTSDARATIS+PA AVP P PA +VP++L
Sbjct 1 MVKPERRTRGDVMAAVAIAAVIAVTAALVWWTSDARATISQPAEQAVPDPEPAADVPSAL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
++LW+AASP TR+PVVVGG+V TGDG V GRDPATG+++WSY+RD DLCGVT VYHYAV
Sbjct 61 RELWSAASPKTRLPVVVGGSVVTGDGSTVAGRDPATGDTVWSYSRDVDLCGVTSVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVY RGCGQV+TI+G TG+RG AR+G+ADP V L +DGTTVLSAG++RLELWRSDMVR
Sbjct 121 AVYPDSRGCGQVTTINGRTGKRGNARTGFADPAVTLSTDGTTVLSAGESRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
ML+YG +DARVKP + C L SAAASS+AVSVLEAC + DLRL LLRP E+D P
Sbjct 181 MLSYGALDARVKP-DVPAAPLCRLTSAAASSSAVSVLEACPKEPDLRLTLLRPSDEEDVP 239
Query 241 IQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS 300
+ V V S ARV+ VS+ TAVYLP QP V+VIDETG TV+STL+ P S
Sbjct 240 DIKYVELADVLADSDARVIAVSETTTAVYLPT---PQPTVNVIDETGTTVASTLMTSPAS 296
Query 301 TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGG 360
AVASR G+L+TWWTGD+++VFD L +YT+ + P+GP MA +LLVPVT G
Sbjct 297 PDAVASRAGDLITWWTGDSVMVFDGSGLRYKYTVTPSGPSLPLGPAAEMADRLLVPVTDG 356
Query 361 IGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
V++P SG R+IP+ RPP AVIPAV G RV+E RG LV LG
Sbjct 357 YDVFEPGSGTGERHIPLARPPVDGAVIPAVVGDRVVELRGGELVGLG 403
>gi|167968330|ref|ZP_02550607.1| conserved alanine and valine rich protein [Mycobacterium tuberculosis
H37Ra]
Length=364
Score = 405 bits (1040), Expect = 9e-111, Method: Compositional matrix adjust.
Identities = 229/235 (98%), Positives = 231/235 (99%), Gaps = 0/235 (0%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL
Sbjct 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV
Sbjct 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR
Sbjct 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGK 235
MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLR + PG+
Sbjct 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRACAVTPGQ 235
>gi|169630613|ref|YP_001704262.1| hypothetical protein MAB_3532 [Mycobacterium abscessus ATCC 19977]
gi|169242580|emb|CAM63608.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=406
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 199/414 (49%), Positives = 260/414 (63%), Gaps = 15/414 (3%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERRT+ DI AAA IAVVVAV + IWWTSDARAT+SRPAA + P A VP S+
Sbjct 1 MIAPERRTRADIIAAAVIAVVVAVTGTTIWWTSDARATVSRPAAGDIKRPMSATRVPDSV 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYA-RDTDLCGVTWVYHYA 119
++LW+ AS AT+ PV+ G V + DG V DPATG LWSYA R+ DLCG A
Sbjct 61 RELWSTASAATKGPVIASGAVVSADGHDVVAHDPATGAQLWSYARRNLDLCGAIGFIDDA 120
Query 120 VAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMV 179
VAVYR RGCGQV+ IDG TGRRGA RS D +V L +DGT VL+ G TRLELWRSDMV
Sbjct 121 VAVYRDARGCGQVTMIDGQTGRRGALRSSANDSKVSLSTDGTYVLALGSTRLELWRSDMV 180
Query 180 RMLAYGEIDARVKPSNRGLQS--GCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
R L YG + V P N Q CTL+S A ++ ++VLE C LRL L +P +D
Sbjct 181 RTLEYGRV---VAPLNANSQPRVDCTLKSGAVGASVLAVLETCPQDPTLRLTLQKPTPKD 237
Query 238 DEPIQRIVPE--PGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLL 295
++ + + PGV GS A+VL V+ +AVYLP G + + V D+ G V +T+L
Sbjct 238 NDKPEELYSAVLPGVERGSAAKVLAVADTRSAVYLP---GTRNELVVFDDRGMRVGATVL 294
Query 296 AKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLV 355
+ + A++ G+++TWWTG +LV A +L+ R+ + TT P+GPG MAG+LL+
Sbjct 295 PGEIAQTNAAAQAGDVITWWTGSQVLVLSASDLSYRFVLPT--TTKPLGPGTNMAGELLI 352
Query 356 PVTGGIGVYDPVSGANNRYIPVTRPPS--TSAVIPAVSGSRVIEQRGDTLVALG 407
PV GGI V++ +G + I V R + S VI AV G+ ++EQRG + ALG
Sbjct 353 PVEGGIDVFNMTTGEFRKSIVVQRNTADEKSPVISAVVGNTLVEQRGSRIFALG 406
>gi|226365825|ref|YP_002783608.1| hypothetical protein ROP_64160 [Rhodococcus opacus B4]
gi|226244315|dbj|BAH54663.1| hypothetical protein [Rhodococcus opacus B4]
Length=404
Score = 298 bits (764), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 192/409 (47%), Positives = 244/409 (60%), Gaps = 9/409 (2%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERR++ D+ AA IAV V VA +++W+ SDAR T S AA P A VP +L
Sbjct 1 MLAPERRSRADLVVAAGIAVAVVVALTVVWFRSDARGTTSITAAEPPPALVTALTVPETL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+W +AS AT P+VVGG V T +G V GRD +G LW YARD DLCGVT + V
Sbjct 61 SPIWDSASSATTAPLVVGGAVVTAEGGDVVGRDRMSGTELWRYARDLDLCGVTASWEKVV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGC QV+ +DG TG R A R+ ADP V L +DGT V S GD RLELWRSD+VR
Sbjct 121 AVYRDDRGCSQVTELDGGTGERLAQRNSDADPGVTLTADGTYVASLGDRRLELWRSDLVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
+ YG +DA V P + +SGCTL +SS+ +SVLE C + RL +L P +D++
Sbjct 181 TVEYGRVDAPVNPKKQP-RSGCTLIDVGSSSSKLSVLERCPGEVADRLTVLNPSPKDNQE 239
Query 241 IQRIVPE--PGVRPG-SGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAK 297
+ GV G GAR+L VS + TAVYLPA PR+ + D TG VS LA
Sbjct 240 PEEYGSSVLAGVDAGVEGARILGVSGDTTAVYLPAGRTTGPRIGLFDGTGNAVSEYALAL 299
Query 298 PPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPV 357
V S + ++VTWWTG ++ A +L R+T +GPG +MAG LLVPV
Sbjct 300 RVGPDPVTSASSSVVTWWTGSDVVSLGASDLVPRWTFPGA-----LGPGAVMAGNLLVPV 354
Query 358 TGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
GI V D +GA R IPV R T + V+G V+EQRGD++VAL
Sbjct 355 ASGIAVLDLPTGALLRTIPVARDSVTGPITTTVAGDVVLEQRGDSVVAL 403
>gi|325675726|ref|ZP_08155410.1| hypothetical protein HMPREF0724_13192 [Rhodococcus equi ATCC
33707]
gi|325553697|gb|EGD23375.1| hypothetical protein HMPREF0724_13192 [Rhodococcus equi ATCC
33707]
Length=404
Score = 283 bits (723), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 188/412 (46%), Positives = 244/412 (60%), Gaps = 15/412 (3%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPA---AVAVPTPAPAREVP 57
M+ PERRT+ D+A AATIA+VVAVAA++IW SDAR T S A A AV TP P
Sbjct 1 MLAPERRTRLDVAVAATIAIVVAVAAAVIWIRSDARGTTSVTADAPASAVDTPL---SPP 57
Query 58 TSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYH 117
SL ++W AASP T P+V GTVAT DG V GRDP TG W Y RD LCG ++
Sbjct 58 ASLTEIWRAASPETAAPIVADGTVATADGGTVLGRDPVTGTERWRYQRDMPLCGAIGAWN 117
Query 118 YAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
VAVYR RGC QV+ +DG+TG+R A RS AD V L DGT V+S G R+ELWRSD
Sbjct 118 TVVAVYRDQRGCSQVTQLDGATGQREAQRSSDADDTVTLSYDGTYVVSRGSERMELWRSD 177
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
+VR L +G +DA + P + + C+L+SAAA V+VL C +A R+ +L +D
Sbjct 178 LVRTLEFGRVDAPINPGKQ-PRPDCSLKSAAAGGTRVTVLMNCPGEAGDRISVLDAAPKD 236
Query 238 DEPIQRIVPEPGVRPG---SGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTL 294
++ Q PG SGAR++ S + TAVYLPA ++ R+ V+D G ++S
Sbjct 237 NQEPQEAGSTLLTGPGADTSGARLIAASGDRTAVYLPAGPISEARIAVVDGEGTEIASHP 296
Query 295 LAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLL 354
P + A A+R G++ TWWTG L+ L ++T+ T +GPG +MAG LL
Sbjct 297 ATTPVTDGATAARNGSVFTWWTGTELIALSTSELAPKWTM-----TGALGPGAIMAGSLL 351
Query 355 VPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
VPV G I V DP +G IPVTR + +V G V+EQRGD +VAL
Sbjct 352 VPVPGAIAVLDPATGTERARIPVTRDSDAGPIATSVLGDIVLEQRGDEVVAL 403
>gi|312140645|ref|YP_004007981.1| hypothetical protein REQ_33050 [Rhodococcus equi 103S]
gi|311889984|emb|CBH49302.1| putative secreted protein [Rhodococcus equi 103S]
Length=404
Score = 281 bits (719), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 187/412 (46%), Positives = 244/412 (60%), Gaps = 15/412 (3%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPA---AVAVPTPAPAREVP 57
M+ PERRT+ D+A AATIA+VVAVAA++IW SDAR T S A A AV TP P
Sbjct 1 MLAPERRTRLDVAVAATIAIVVAVAAAVIWIRSDARGTTSVTADAPASAVDTPL---SPP 57
Query 58 TSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYH 117
SL ++W AASP + P+V GTVAT DG V GRDP TG W Y RD LCG ++
Sbjct 58 ASLTEIWRAASPESAAPIVADGTVATADGGTVLGRDPVTGTERWRYQRDMPLCGAIGAWN 117
Query 118 YAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
VAVYR RGC QV+ +DG+TG+R A RS AD V L DGT V+S G R+ELWRSD
Sbjct 118 TVVAVYRDQRGCSQVTQLDGATGQREAQRSSDADDTVTLSYDGTYVVSRGSERMELWRSD 177
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
+VR L +G +DA + P + + C+L+SAAA V+VL C +A R+ +L +D
Sbjct 178 LVRTLEFGRVDAPINPGKQ-PRPDCSLKSAAAGGTRVTVLMNCPGEAGDRISVLDAAPKD 236
Query 238 DEPIQRIVPEPGVRPG---SGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTL 294
++ Q PG SGAR++ S + TAVYLPA ++ R+ V+D G ++S
Sbjct 237 NQEPQEAGSTLLTGPGADTSGARLIAASGDRTAVYLPAGPISEARIAVVDGEGTEIASHP 296
Query 295 LAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLL 354
P + A A+R G++ TWWTG L+ L ++T+ T +GPG +MAG LL
Sbjct 297 ATTPVTDGATAARNGSVFTWWTGTELIALSTSELAPKWTM-----TGALGPGAIMAGSLL 351
Query 355 VPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
VPV G I V DP +G IPVTR + +V G V+EQRGD +VAL
Sbjct 352 VPVPGAIAVLDPATGTERARIPVTRDSDAGPIATSVLGDIVLEQRGDEVVAL 403
>gi|226305683|ref|YP_002765643.1| hypothetical protein RER_21960 [Rhodococcus erythropolis PR4]
gi|226184800|dbj|BAH32904.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=400
Score = 280 bits (715), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 165/407 (41%), Positives = 231/407 (57%), Gaps = 9/407 (2%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERRT++D+ A IA+ V VA +++W SDAR T S A +P + + +P L
Sbjct 1 MLAPERRTRSDVIGALAIALAVIVAVTVVWLRSDARGTTSVTAESPLPPLSTSVLLPDVL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
KQ W + S + P+ VG V T DG V GRDPATGE +W YAR+ LCG + V
Sbjct 61 KQTWESPSADSTAPITVGNAVVTADGSDVIGRDPATGEEVWRYARNIPLCGAIGSWDRVV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
+VY+ +RGC QV+++ TG R R+ AD V L +DGT ++S G R+ELWRSD+VR
Sbjct 121 SVYQDNRGCSQVTSLVADTGARKDQRNSDADSAVTLSADGTYLISRGSERMELWRSDLVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED-DE 239
L YG +DA+V P N+ ++GCTL AA+S+ VSVLE C +A RL ++ P +D E
Sbjct 181 TLEYGRVDAKVNP-NKQPRTGCTLLDAASSANRVSVLERCPGEASNRLTVMNPAPKDAQE 239
Query 240 PIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPP 299
P + + GAR+L V+ TAVYLPA +G R+ + D TG + +
Sbjct 240 PEE--YGSSILLDSEGARLLAVAGEKTAVYLPAANGTASRITIFDGTGNPIVDYPIEGAV 297
Query 300 STSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTG 359
+ A +TWWTG + +L + + GPG M+AGQ+L+PV G
Sbjct 298 TEGARTHADKASITWWTGSQTVNLRLSDLAPTWIV-----NGTSGPGSMVAGQMLIPVPG 352
Query 360 GIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
GI P GA R I V R +T+ ++ A +G ++EQRGDTL AL
Sbjct 353 GIAAVSPSDGAVQRMIAVDRGETTTPIVLASAGETILEQRGDTLFAL 399
>gi|111023318|ref|YP_706290.1| hypothetical protein RHA1_ro06355 [Rhodococcus jostii RHA1]
gi|110822848|gb|ABG98132.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=404
Score = 269 bits (688), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 188/409 (46%), Positives = 240/409 (59%), Gaps = 9/409 (2%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERR++ D AA IAV V VA +++W+ SDAR T S AA P A VP +L
Sbjct 1 MLAPERRSRADHVVAAGIAVAVVVALTVVWFRSDARGTTSVTAAEPPPALVTALMVPETL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+ +W +AS AT P+VVGG V T DG V GRD +G LW Y RD DLCGVT + V
Sbjct 61 RPIWDSASSATSAPLVVGGAVVTADGGDVVGRDRMSGTELWRYERDLDLCGVTASWGKVV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR DRGC QV+ +DG TG R A RS AD V L +DGT V S GD RLELWRSD+VR
Sbjct 121 AVYRDDRGCSQVTELDGGTGERLAQRSSDADSDVTLKADGTYVASLGDRRLELWRSDLVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
+ YG +DA V P + +SGCTL +SS+ +SVLE C + RL ++ P +D++
Sbjct 181 TVEYGRVDAPVNPRKQ-PRSGCTLIDVGSSSSRLSVLERCPGEVADRLTVMNPSPKDNQE 239
Query 241 IQRIVPE--PGVRPG-SGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAK 297
+ GV G GAR+L VS TAVYLPA PR+ + D TG VS L
Sbjct 240 PEEYGSSVLAGVDAGVEGARILGVSGETTAVYLPAGPSHGPRIGLFDGTGNAVSEYALTL 299
Query 298 PPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPV 357
+ V S + ++VTWWTG ++ A +L + +GPG +MAG LLVPV
Sbjct 300 GVGPNPVTSTSSSVVTWWTGSDVVSLGASDLAPHWIFPGA-----LGPGAVMAGNLLVPV 354
Query 358 TGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
GI V D +GA R IPVTR + ++ V+G V+EQRGD +VAL
Sbjct 355 ESGIAVLDLSTGALLRTIPVTRDAAAGPILTTVAGDVVLEQRGDAVVAL 403
>gi|229489582|ref|ZP_04383445.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229323679|gb|EEN89437.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=389
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/396 (41%), Positives = 222/396 (57%), Gaps = 15/396 (3%)
Query 15 AATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVP 74
A IA+ V VA +++W SDAR T S A +P + + +P LKQ W + S + P
Sbjct 4 ALAIALAVIVAVTVVWLRSDARGTTSVTAESPLPPLSTSVLLPDILKQTWESPSADSTAP 63
Query 75 VVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVST 134
+ VG V T DG +V GRDPATGE +W YAR+ LCG + V+VY+ DRGC QV++
Sbjct 64 ITVGNAVVTADGSEVIGRDPATGEEVWRYARNIPLCGAIGSWDRVVSVYQDDRGCSQVTS 123
Query 135 IDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPS 194
+ TG R R+ AD V L +DGT ++S G R+ELWRSD+VR L YG +DA+V P
Sbjct 124 LVADTGARKDQRNSDADSAVSLSADGTYLISRGSERMELWRSDLVRTLEYGRVDAKVNP- 182
Query 195 NRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED-DEPIQRIVPEPG---V 250
N+ ++GCTL AA+S+ VSVLE C +A RL ++ P +D EP E G +
Sbjct 183 NKQPRTGCTLLDAASSANRVSVLERCPGEASNRLTVMNPAPKDAQEP-----EEYGSAIL 237
Query 251 RPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGN 310
GAR+L V+ TAVYLPA +G R+ + D +G + + + A
Sbjct 238 LDSEGARLLAVAGEKTAVYLPAANGTVSRITIFDGSGNPIVDYPIEGTVTEGARTHADKA 297
Query 311 LVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGA 370
+TWWTG + +L + + GPG M+AGQ+L+PV GGI P GA
Sbjct 298 SITWWTGSQTVNLRLSDLAPTWIV-----NGTSGPGSMVAGQMLIPVPGGIAAVSPSDGA 352
Query 371 NNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
R I V R +T+ ++ A +G ++EQRGDTL AL
Sbjct 353 VQRMIAVDRGETTTPIVLASAGETILEQRGDTLFAL 388
>gi|343927605|ref|ZP_08767073.1| hypothetical protein GOALK_097_00250 [Gordonia alkanivorans NBRC
16433]
gi|343762246|dbj|GAA13999.1| hypothetical protein GOALK_097_00250 [Gordonia alkanivorans NBRC
16433]
Length=424
Score = 229 bits (585), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 163/425 (39%), Positives = 217/425 (52%), Gaps = 25/425 (5%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLK 61
VKPERR D+A ATI V+VAVA + W S R T S A P VP +
Sbjct 4 VKPERRRPIDLAITATIIVLVAVAGLIAWLVSPVRGTTSVQAQSTPPEVEQPAAVPDAFA 63
Query 62 QLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVY----H 117
W AAS AT VP + V TGDG V GRDPA+G+ LWSY RD DLC V +
Sbjct 64 PRWQAASDATVVPAIADSVVVTGDGGTVVGRDPASGDELWSYRRDLDLCVVDTAWTASTD 123
Query 118 YAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
A+AVYR RGC +V+ +D TG R +R+ AD +RL SD V++ G RLE W S+
Sbjct 124 LALAVYRNSRGCSEVTALDAKTGARKGSRTSDADDELRLVSDYGYVVAQGSGRLETWGSN 183
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
+VR + YG ++A V+P + + C + S+A + +SV+E C + RL +L +D
Sbjct 184 LVRGIEYGRVEAPVRPEDEAKRKDCVIYSSAITGDRLSVVERCADDPGYRLTVLGALLDD 243
Query 238 DEPIQRIVPE--PGVRPGSGARVLVVSQNNTAVYLPARSGAQ---------PRVDVIDET 286
DE +++ G G V+ +S AVY S A+ P + D
Sbjct 244 DERVEQYGSTLITGDVSGPPPVVIAMSSTGIAVYDGGASPAEPPALGGDSGPSIRRFDSE 303
Query 287 GATVSSTLLAK---PPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPV 343
G S +A PP S G +VT+WTG A +V DA +L Y + A +
Sbjct 304 GVATGSNTVAGDAIPPKDSVPLGGDG-VVTYWTGKATVVLDAQSLKPIYQVP-----ATL 357
Query 344 GPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPA-VSGSRVIEQRGDT 402
GPG +MAGQLL+P GI V D G R IP+TR + V+ V G V+EQRG T
Sbjct 358 GPGEVMAGQLLLPSASGISVRDVAGGREIRSIPLTRSTAPDGVVSLRVIGEMVVEQRGTT 417
Query 403 LVALG 407
+ A G
Sbjct 418 VEAFG 422
>gi|54026547|ref|YP_120789.1| hypothetical protein nfa45740 [Nocardia farcinica IFM 10152]
gi|54018055|dbj|BAD59425.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=409
Score = 222 bits (566), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 177/420 (43%), Positives = 231/420 (55%), Gaps = 24/420 (5%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERRT+ DI AA IAV VA+A ++W DA T S A+ TP A ++P +L
Sbjct 1 MLAPERRTRADIIAAVAIAVAVALAGVVVWARGDATGTESVTASQPATTPPAAEQLPAAL 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
++LW A AT +V GGTV T V GRDPATG +W Y RD LCGV + V
Sbjct 61 RELWHAPDDATGRALVAGGTVVTAADGTVTGRDPATGAEVWRYRRDMPLCGVESQFGMVV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR RGC Q + + G +G R ARS Y D VRL DGT VL+ GD RLE+WRSD+VR
Sbjct 121 AVYRDQRGCSQATMLAGDSGARRTARSSYMDDSVRLSVDGTYVLAQGDRRLEVWRSDLVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
L YG +DA V + + C+L SAA+SS+ ++VLE C A RL +L P +D+
Sbjct 181 TLEYGYVDAPVNVKTQP-RKDCSLLSAASSSSRLAVLERCPEDAGARLTVLNPAPKDN-- 237
Query 241 IQRIVPEP-GVRPGSGA-------RVLVVSQNNTAVYLP-ARSGAQ---PRVDVIDETGA 288
VPE G R +GA RV+ VS +Y P A GA PR+ + D +G
Sbjct 238 ---TVPEEYGSRVLTGADADAPGTRVIAVSDTKIVLYQPGATLGADTVPPRLSIFDGSGN 294
Query 289 TVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVM 348
+ L+ P S A A R G+ +TG +L AG+L +T+ +G
Sbjct 295 PLLVHQLSAPLSEHAQAVRIGSAYLVFTGTEVLALQAGSLQPMWTVPGA-----LGTPAS 349
Query 349 MAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPP-STSAVIPAVSGSRVIEQRGDTLVALG 407
MAG+LL+PV I DP +GA +PV RP S + AV G+ V+EQRG T+ ALG
Sbjct 350 MAGRLLIPVADAIVAVDPGTGAELARLPVERPGYDNSPISLAVLGTTVLEQRGTTMHALG 409
>gi|134097655|ref|YP_001103316.1| hypothetical protein SACE_1059 [Saccharopolyspora erythraea NRRL
2338]
gi|133910278|emb|CAM00391.1| conserved hypothetical alanine valine rich protein [Saccharopolyspora
erythraea NRRL 2338]
Length=403
Score = 219 bits (559), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 171/413 (42%), Positives = 224/413 (55%), Gaps = 18/413 (4%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPT-PAPAR-EVPT 58
MV+PERRTK D+AA IAV V A+ ++W SDARATIS PAA + P P P VP
Sbjct 1 MVRPERRTKADLAAVVLIAVAVLAASVVVWAGSDARATISEPAAQSAPDLPDPESVRVPE 60
Query 59 SLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHY 118
SL++ W SPAT VPVV G V T G +V GRDPATG+ W YARD LC V +
Sbjct 61 SLREAWREPSPATPVPVVAGPAVVTAGGNEVVGRDPATGQVRWRYARDIPLCTVGESWKR 120
Query 119 AVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDM 178
A+AVYR D C +V+++ GS G RG R+G A+ RL SDGT V ++G +E WRSD+
Sbjct 121 AIAVYRKDHNCSEVTSLRGSNGVRGPQRNGDAEFGTRLLSDGTYVTASGRRAVESWRSDL 180
Query 179 VRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDD 238
VR YG A P N + C S A ++E C + R+ LL+ EDD
Sbjct 181 VRTQQYGIPPAMKNPDNNLKRPDCQYGSVAVGDQRFGLVEECPGEPGGRITLLKTKPEDD 240
Query 239 EPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKP 298
E + + P GA V+ VS+++ AV P RS ++ V D A V S +
Sbjct 241 EKPEELFSTGLGLP--GASVVAVSKDHAAVLAPDRS----QLLVYDGNAAVVGSFPVRIG 294
Query 299 PSTSAVASRT-----GNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQL 353
P + SR V W+TG + +LT +T A +T +GPG ++ G+L
Sbjct 295 PRDTTANSRIEATTRDGQVYWFTGTDTVALHPDDLTPLWT--APDT---LGPGTVVGGKL 349
Query 354 LVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
LVPV G+ V DP +GA R IPV R V +G ++EQRG+TLVAL
Sbjct 350 LVPVRDGLAVLDPATGARERVIPVDRQGYAGPVGLDSAGDVLVEQRGETLVAL 402
>gi|257054730|ref|YP_003132562.1| hypothetical protein Svir_06640 [Saccharomonospora viridis DSM
43017]
gi|256584602|gb|ACU95735.1| hypothetical protein Svir_06640 [Saccharomonospora viridis DSM
43017]
Length=451
Score = 212 bits (540), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 149/389 (39%), Positives = 201/389 (52%), Gaps = 28/389 (7%)
Query 30 WWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQV 89
W SD+RAT R A+V P P VP SL + W A S AT PV GG V T V
Sbjct 78 WLFSDSRATEHRTASVPPPRLDPPAAVPGSLAERWRAPSEATPEPVAEGGFVITASAGTV 137
Query 90 DGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGY 149
GRDP TGE WSY+RD +LC V + AVAVYR D GC +V+ +D TGRR A R+G
Sbjct 138 AGRDPLTGEIRWSYSRDLELCTVAGAWSKAVAVYRKDLGCSEVTQLDPGTGRRTAQRNGD 197
Query 150 ADPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAA 209
A+ RL D V + G+ L WRSD+V+ + YG + A P R +SGC + +A
Sbjct 198 AEAPTRLVVDDAHVTTTGERLLNTWRSDLVQTMEYGRVPAPENP-QRQPRSGCRYSNVSA 256
Query 210 SSAAVSVLEACTNQADLRLVLLR-PGKEDDEPIQ---RIVPEPGVRPGSGARVLVVSQNN 265
+ V V+E C + RL +L+ G+E DEP Q ++ EP A+++ +S+
Sbjct 257 GADKVGVIEHCPKEVGARLTVLKAAGEESDEPEQVFSTLLAEP------TAQLVAMSEER 310
Query 266 TAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTG--------NLVTWWTG 317
AV LP Q R+ + D G SS +A P + A +G V W+TG
Sbjct 311 VAVALP----EQRRLLLFDYEGEQRSSHDIAVPTAELATVPDSGVVPVSSGRENVYWFTG 366
Query 318 DALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPV 377
+ AG L R+ E T +GPG + AG+ +VP+ GG+ V D +GA R + V
Sbjct 367 SHTVALSAGELRPRWI---AENT--LGPGTVFAGEYVVPIPGGLAVLDERTGATTRTVAV 421
Query 378 TRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
R ++ G +IEQRG TLVAL
Sbjct 422 DRRGHDGSIELEAIGPVLIEQRGRTLVAL 450
>gi|291008800|ref|ZP_06566773.1| hypothetical protein SeryN2_30118 [Saccharopolyspora erythraea
NRRL 2338]
Length=388
Score = 207 bits (528), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 154/381 (41%), Positives = 202/381 (54%), Gaps = 18/381 (4%)
Query 33 SDARATISRPAAVAVPT-PAPAR-EVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVD 90
SDARATIS PAA + P P P VP SL++ W SPAT VPVV G V T G +V
Sbjct 18 SDARATISEPAAQSAPDLPDPESVRVPESLREAWREPSPATPVPVVAGPAVVTAGGNEVV 77
Query 91 GRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYA 150
GRDPATG+ W YARD LC V + A+AVYR D C +V+++ GS G RG R+G A
Sbjct 78 GRDPATGQVRWRYARDIPLCTVGESWKRAIAVYRKDHNCSEVTSLRGSNGVRGPQRNGDA 137
Query 151 DPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAAS 210
+ RL SDGT V ++G +E WRSD+VR YG A P N + C S A
Sbjct 138 EFGTRLLSDGTYVTASGRRAVESWRSDLVRTQQYGIPPAMKNPDNNLKRPDCQYGSVAVG 197
Query 211 SAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYL 270
++E C + R+ LL+ EDDE + + P GA V+ VS+++ AV
Sbjct 198 DQRFGLVEECPGEPGGRITLLKTKPEDDEKPEELFSTGLGLP--GASVVAVSKDHAAVLA 255
Query 271 PARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRT-----GNLVTWWTGDALLVFDA 325
P RS ++ V D A V S + P + SR V W+TG +
Sbjct 256 PDRS----QLLVYDGNAAVVGSFPVRIGPRDTTANSRIEATTRDGQVYWFTGTDTVALHP 311
Query 326 GNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSA 385
+LT +T A +T +GPG ++ G+LLVPV G+ V DP +GA R IPV R
Sbjct 312 DDLTPLWT--APDT---LGPGTVVGGKLLVPVRDGLAVLDPATGARERVIPVDRQGYAGP 366
Query 386 VIPAVSGSRVIEQRGDTLVAL 406
V +G ++EQRG+TLVAL
Sbjct 367 VGLDSAGDVLVEQRGETLVAL 387
>gi|333921426|ref|YP_004495007.1| hypothetical protein AS9A_3769 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483647|gb|AEF42207.1| hypothetical protein AS9A_3769 [Amycolicicoccus subflavus DQS3-9A1]
Length=428
Score = 206 bits (524), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 154/443 (35%), Positives = 217/443 (49%), Gaps = 53/443 (11%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M+ PERR++ D+ A I ++V VA IWW SD R T S PA + A PTS+
Sbjct 1 MIAPERRSRADVVATVAIIMLVVVATFTIWWRSDYRQTQSAPAERPIEAVTGALATPTSV 60
Query 61 KQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+++W A S A+ VP+ GGTV + +V G D ATG W Y R +LC VT H V
Sbjct 61 REVWRAPSEASTVPITSGGTVVSTRAGEVAGHDVATGTLRWRYERQRELCAVTGQEHQVV 120
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
AVYR RGC QV+ ++ +TG RG R+ AD V L +DG ++S G RLE+WRSD+VR
Sbjct 121 AVYRTPRGCTQVTALESATGVRGEQRTSDADTSVTLTADGPHIMSQGPQRLEVWRSDLVR 180
Query 181 MLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEP 240
L +G + A V P +R + GC + A + + V+ + C +A RLVLL D P
Sbjct 181 TLEFGRVAAPVNP-DRQQRGGCDILDAHITRSRVAAVLQCPEEAGERLVLL-----DSTP 234
Query 241 IQRIVPEP-------------GVRPGS-GARVLVVSQNNTAVYLPARSGAQPRVDVIDET 286
+ PE G PG+ G + +S TA+++P G V V +
Sbjct 235 ERNNEPEEHGSALITTNDAGLGEIPGAEGTSLAAISAQLTAIFVP---GDTDEVAVYGQE 291
Query 287 GATVSSTLLAKPP-------------------STSAVASRTGNLVTWWTGDALLVFDAGN 327
G + LA P ST V++ T +L W+TG ++ +
Sbjct 292 GTAIHRWSLADVPAAPGISDDDPPIGFIPERRSTGTVSAPT-DLAYWFTGTDVVSLSGRD 350
Query 328 LTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVI 387
+ + A GP +MAGQLLVP GI V++ +GA R I + R +
Sbjct 351 FRPLWRVPAA------GPPAVMAGQLLVPTAAGISVHNARTGALERTIALDRNGEDVTGL 404
Query 388 PAVSGSRVIEQRG---DTLVALG 407
AV G ++EQR T+V LG
Sbjct 405 -AVLGDFILEQRNGSTGTVVVLG 426
>gi|326383297|ref|ZP_08204985.1| hypothetical protein SCNU_10184 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198047|gb|EGD55233.1| hypothetical protein SCNU_10184 [Gordonia neofelifaecis NRRL
B-59395]
Length=419
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 154/426 (37%), Positives = 212/426 (50%), Gaps = 30/426 (7%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLK 61
+KPERRT+TD+ A IAV+V A ++WWTS AR T S AA P PA VP +
Sbjct 4 IKPERRTRTDLIVTALIAVIVIAAGVVVWWTSSARRTESVTAAEPAPIQLPAENVPRGVT 63
Query 62 QLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVY----H 117
W+A S T VP + G+V TG V R PA G +LWSY RD LC +
Sbjct 64 VKWSARSDVTTVPQITRGSVVTGVDGTVTARRPADGSALWSYRRDLPLCATAAAWSGGDD 123
Query 118 YAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
+AVYR RGC +V+++ + G R AAR+ ADP V L +D L G TRLE W S+
Sbjct 124 DVLAVYRNARGCSEVTSLRAADGVRHAARTSDADPTVHLSADTGYALLYGPTRLETWGSN 183
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
+VR + YG + A V P ++GC L SA + + V+++E C A RL +L
Sbjct 184 LVRGIEYGRVSAPVNPDVAPDRTGCRLYSAVSGADRVAIIERCDGDAGYRLTVLSSNLTS 243
Query 238 DEPIQR----IVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATV--- 290
+E I++ ++ G RV+ + + VY G P V TG +
Sbjct 244 EEKIRQWGSTLITTSAT--GPAPRVVAATDSTITVY---DGGGDPSGTVGAPTGPRIRLF 298
Query 291 ---SSTLLAKPPSTSAVASRTGN------LVTWWTGDALLVFDAGNLTQRYTIAAGETTA 341
+ + P + A A + +V++WTG A +V DA N T + +A
Sbjct 299 TPQAESRSEHPVAGDAQAPERSDPIVDQGVVSFWTGRATVVLDASNGTPLFQVADA---- 354
Query 342 PVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGD 401
+GPG +M+GQLLVP+ G I V G + IPV R V V G +IEQRG
Sbjct 355 -IGPGAVMSGQLLVPMPGAISVRRAYDGHDEFRIPVDRGQVVGPVSLRVLGESIIEQRGT 413
Query 402 TLVALG 407
+VALG
Sbjct 414 EVVALG 419
>gi|262203394|ref|YP_003274602.1| hypothetical protein Gbro_3516 [Gordonia bronchialis DSM 43247]
gi|262086741|gb|ACY22709.1| hypothetical protein Gbro_3516 [Gordonia bronchialis DSM 43247]
Length=419
Score = 197 bits (500), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 149/426 (35%), Positives = 204/426 (48%), Gaps = 34/426 (7%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLK 61
V PERR D+ A I ++V + W S R T S A A A VP +
Sbjct 4 VTPERRRPVDLIITAVIVLIVVAVGIVAWVVSPVRNTDSAQAGTPAVAVAEATTVPERMV 63
Query 62 QLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYA-- 119
W A S AT P + VATGDG V GRDP +G +W Y RD LC V + ++
Sbjct 64 SRWHARSSATDTPAIGDAVVATGDGGSVIGRDPQSGREIWGYRRDLPLCTVAAAWQHSTD 123
Query 120 --VAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
+AVYR RGC +V+ +DG +GRR AARS ADP + L SD V++ G TRLE W S+
Sbjct 124 SVLAVYRNSRGCSEVTALDGKSGRRTAARSSDADPTLHLVSDSGYVVAQGTTRLETWGSN 183
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKED 237
+VR + YG +DA V P + + C L S+A S V+V+E C ++ RL +L
Sbjct 184 LVRGIEYGRVDAPVNPDVQPGRVDCRLFSSAISGDRVAVVEHCGEESGYRLTVLGALLSS 243
Query 238 DEPIQRI------VPEPGVRP------GSGARVLVVSQNNTAVYLPARSGAQPRVDVIDE 285
+E + + G P SG + +N A PA R+ + +
Sbjct 244 EEKVTEYGSSLITIDTAGPPPVLVAMSSSGIAIYDGGTDNPAAPTPA------RIRLFNP 297
Query 286 TGATVS----STLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTA 341
G + L PP +A G+L+T+WTG + +V DA RY +
Sbjct 298 DGVPGALRDVDGLRTPPPDNPPIAD--GDLITYWTGKSTVVLDAATGEPRYQVPGA---- 351
Query 342 PVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPA-VSGSRVIEQRG 400
+GPG MAG+LL+P GI V DP +G IP++R I V G V+EQRG
Sbjct 352 -LGPGAQMAGKLLLPSPSGISVRDPATGRELHSIPLSRTDYHGGPISLRVIGGVVVEQRG 410
Query 401 DTLVAL 406
D + A
Sbjct 411 DLVEAF 416
>gi|256374970|ref|YP_003098630.1| hypothetical protein Amir_0823 [Actinosynnema mirum DSM 43827]
gi|255919273|gb|ACU34784.1| hypothetical protein Amir_0823 [Actinosynnema mirum DSM 43827]
Length=492
Score = 196 bits (499), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 150/415 (37%), Positives = 214/415 (52%), Gaps = 33/415 (7%)
Query 7 RTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTA 66
RT+ D AA A + V + L+W TSD RAT S+ +P A +P SL ++W A
Sbjct 95 RTRRDYAAVALVVVGALLGGLLVWLTSDVRATTSQTGPSELPALPEATALPPSLSEVWRA 154
Query 67 ASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYD 126
ASPAT PVV G V TG+G +V GRDP TG+ W Y RD LC V ++ AV+
Sbjct 155 ASPATPGPVVANGVVVTGEGGEVVGRDPLTGDVRWRYGRDLPLCTVGTAWNRPFAVHTKG 214
Query 127 RGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELW-RSDMVRMLAYG 185
C +V+++D ++G RG R G A+ RL ++G+ V+ G+ E++ R D+VR + YG
Sbjct 215 TNCSEVTSLDAASGARGPQRDGDAELGTRLLNEGSHVVVTGERYFEVYRRDDLVRSMEYG 274
Query 186 EIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTN-QADLRLVLLRPGKEDDEPIQRI 244
++ A V P N+ ++ CT S A + V+VLE C + ++ R+ +L+P E +
Sbjct 275 QLRAIVNP-NKQPRTNCTYGSFAVTGGKVAVLERCPDLESGDRVTVLKPNPEKSD----- 328
Query 245 VPEPGVRP-----GSGARVLVVSQNNTA-------VYLPARSGAQPRVDVIDETGATVSS 292
EPGV +GARV+ V+ N A V A++G +I E VS
Sbjct 329 --EPGVITTVSVGATGARVIAVTANRVAVAAAGKLVLFDAQTGG-----LISEIPVEVSE 381
Query 293 TLLA-KPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAG 351
LA PP A + V W+TG + +LT R+T VG G +AG
Sbjct 382 AELAGDPPGRVAATYHSTANVYWYTGSRTIALSLDDLTPRWT-----REGTVGAGTSLAG 436
Query 352 QLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
+ L+PV G+ V D V+G IPV R V+ A +G V+EQRG LVAL
Sbjct 437 KALLPVAEGLVVLDQVTGEEIGSIPVNRGGYGGDVVMATNGPVVLEQRGGELVAL 491
>gi|300783034|ref|YP_003763325.1| hypothetical protein AMED_1107 [Amycolatopsis mediterranei U32]
gi|299792548|gb|ADJ42923.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524412|gb|AEK39617.1| hypothetical protein RAM_05625 [Amycolatopsis mediterranei S699]
Length=391
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 141/386 (37%), Positives = 201/386 (53%), Gaps = 26/386 (6%)
Query 32 TSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDG 91
TSD+ AT AA PA +VP SL ++W+A S AT VPVV G +VAT DG +V
Sbjct 20 TSDSAATDRTEAAPPPALPAAPDKVPGSLSEIWSAPSGATPVPVVAGESVATADGGEVAV 79
Query 92 RDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYAD 151
RDP TG+ W Y RD LC + V+ AVY R C +V+ +D +TGR A R+G A+
Sbjct 80 RDPLTGQIRWHYTRDLPLCTLDQVWGRLNAVYHKSRNCSEVTQLDPATGRITAQRNGNAE 139
Query 152 PRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQ--SGCTLESAAA 209
+L DG+ V + G L WR D+V+ YGE+ A V N G Q +GCT + AA
Sbjct 140 LGTQLIDDGSHVTATGKKLLTTWRDDLVQSAEYGEVPALV---NAGKQPRTGCTYGTLAA 196
Query 210 SSAAVSVLEACTNQADLRLVLLRPGKE-DDEPIQRIVPEPGVRPGSGARVLVVSQNNTAV 268
+S + V+E C RL + + E DDEP V V G ARV+ +S + TAV
Sbjct 197 ASGKIGVIERCPGDPADRLTVYKAAPEKDDEP---QVSFSSVLAGKRARVIAMSGDLTAV 253
Query 269 YLPARS--------GAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDAL 320
LP + G Q +D + L A PP+ + ++T + W++G +
Sbjct 254 VLPDQKLLVVYNGDGTQRTAYPLD----VPPADLAADPPTGAEATTQTAANMYWFSGSKV 309
Query 321 LVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRP 380
+ +L+ R+T+++ +GPGV A QL+VP+ GG+ V + G+ R + V R
Sbjct 310 VALSRDDLSPRWTLSSA-----LGPGVTYAQQLVVPIKGGLAVLNENDGSTIRTVGVERR 364
Query 381 PSTSAVIPAVSGSRVIEQRGDTLVAL 406
V A +G ++EQRG TL AL
Sbjct 365 GYPGVVRLAAAGPVLLEQRGPTLTAL 390
>gi|296138843|ref|YP_003646086.1| hypothetical protein Tpau_1115 [Tsukamurella paurometabola DSM
20162]
gi|296026977|gb|ADG77747.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=412
Score = 186 bits (471), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 146/425 (35%), Positives = 219/425 (52%), Gaps = 37/425 (8%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPA----PAREV 56
MVKPERRT+ D+A A I V V +A +W S AR T+S VP P+ P E+
Sbjct 1 MVKPERRTRVDLAVTAAIVVAVLIAGLAVWNFSSARKTVSE----TVPAPSAEATPLAEL 56
Query 57 PTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVY 116
P +L+ WT A R + + G V G +V G DPATG W Y RDTD CG++
Sbjct 57 PAALRPTWTRT--ADRPALSMAGGVVLASGGEVSGHDPATGARTWRYQRDTDTCGLSTNA 114
Query 117 HYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRS 176
+A Y RGCG+++ +D TG+R RSG DP + SDG+ ++ G R++ +RS
Sbjct 115 GLVLAYYPDARGCGEITALDPGTGQRKYTRSGQQDPDITTASDGSYAVAQGPRRVDAFRS 174
Query 177 DMVRMLAYGEIDARVKPSNRGLQ--SGCTLESAAASSAAVSVLEACTNQADLRLVLLRPG 234
D+V + YG D P N G+Q SGCTL SA +S ++ VLE+C + + RL ++
Sbjct 175 DLVSTVQYGRPDV---PVNPGVQPRSGCTLGSALPASPSLVVLESCPTEPNPRLTVVGIA 231
Query 235 KED-DEPIQR---IVPEPGVRPGSG---ARVLVVSQNNTAVYLPARSGAQPRVDVIDETG 287
+D D P + + P G S R+L +++ A Y+PA G RV + G
Sbjct 232 PKDADRPQETSSVVAPALGTASTSEDDRPRLLAATRDGAAAYVPAHDGQPARVVTVGYRG 291
Query 288 AT-----VSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAP 342
+++T P S VA +V+++TG + +V DA L + I
Sbjct 292 EVRQSVDITTTPGGAPRSVPVVAE--AGVVSFFTGTSTVVIDASTLIAQMVIPGT----- 344
Query 343 VGPGVMMAGQLLVPVTGGIGVYDPVSGANN-RYIPVTRPPSTSA-VIPAVSGSRVIEQRG 400
+GPG ++ GQ++VP + Y+ ++G N R P+ RP T V+ A++G ++ Q G
Sbjct 345 LGPGTLIGGQIVVPGPTSLAAYN-LAGRNQVRLAPIPRPGYTGGPVLLALAGETIVAQWG 403
Query 401 DTLVA 405
T+ A
Sbjct 404 KTVQA 408
>gi|331694933|ref|YP_004331172.1| hypothetical protein Psed_1068 [Pseudonocardia dioxanivorans
CB1190]
gi|326949622|gb|AEA23319.1| hypothetical protein Psed_1068 [Pseudonocardia dioxanivorans
CB1190]
Length=417
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 149/418 (36%), Positives = 208/418 (50%), Gaps = 25/418 (5%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAV--PTPAPAREVPTS 59
V+PERRT+ D+ A +AVVV L W +S A T S AA P+ A VP
Sbjct 8 VRPERRTRADVIVAVVLAVVVLGGGVLYWRSSAAATTESVTAAPGTFPQPPSSAATVPAG 67
Query 60 LKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYA 119
Q W S AT P+VVG V T DG V GRD TG + WSY R LC +
Sbjct 68 FTQAWREPSAATSAPLVVGPAVVTADGGAVTGRDATTGTAHWSYTRTAQLCTAGSGFGEV 127
Query 120 VAVYRYD--RGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSD 177
+A+YR C +++ + S+G R A + A P RL GT V G +++ RSD
Sbjct 128 MALYRNHDATACSELTVLAPSSGARRAQSNPDALPGTRLLDTGTLVAITGANYVQVVRSD 187
Query 178 MVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRP-GKE 236
+V+ YG + +P R + C S A + ++VLE C ++D RL ++ P G
Sbjct 188 LVKTTEYGTVATPDQP-GRQPRPDCDFGSFAVTQGRLAVLERCPGESDDRLTVVAPDGGS 246
Query 237 DDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSS---- 292
D + +P +PG +V+ S + AV PA + R++V+D G VS+
Sbjct 247 DATTPSVVFSQP--QPGPHGQVVAASGDRVAVARPAPA----RLEVVDGQGNLVSTFAVP 300
Query 293 ----TLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVM 348
L A PP A + + WWTG A + D G+LT +T+ +T+ GP V
Sbjct 301 VPDADLAADPPGGVARTTSDNQHIYWWTGSATVAIDRGDLTPAWTLP--DTS---GPAVR 355
Query 349 MAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
+LVPV GG+ V +P +GA R IPV R T V+P V+GS ++EQRG LVAL
Sbjct 356 YGDSVLVPVRGGLQVVNPATGAVGRTIPVDRGSWTGPVVPGVAGSVLLEQRGPELVAL 413
>gi|324998439|ref|ZP_08119551.1| hypothetical protein PseP1_06707 [Pseudonocardia sp. P1]
Length=351
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 127/359 (36%), Positives = 176/359 (50%), Gaps = 21/359 (5%)
Query 56 VPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWV 115
+P S +LW AASP T VP+V G V +G ++ GRD TG+ WSY RD LC
Sbjct 1 MPASFTELWRAASPGTPVPLVAGDGVVVAEGSRISGRDATTGQERWSYTRDLPLCTTGLA 60
Query 116 YHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWR 175
+A+YR D C ++ST+ TG RG R+ P RL VL+ G E +R
Sbjct 61 DGRVLALYRNDEYCSELSTLGPDTGLRGPTRTLDTRPGTRLIGQ-DPVLATGQDYAETFR 119
Query 176 SDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRP-G 234
SD+VR YG + A+ +P ++ ++GCT S AA+ VLE C QA RL ++RP G
Sbjct 120 SDLVRTAEYGTVRAQEEPGDQ-PRAGCTYLSFAAARDRAGVLERCPGQATERLSVIRPSG 178
Query 235 KEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTL 294
+ D P+ E G GA+++ VS +AV LP PR+ + D G
Sbjct 179 TDGDRPVFDSSAEIGT---DGAQLVAVSPERSAVLLP----GVPRLALYDRGGRKTGEFP 231
Query 295 LAKPPSTSAVASRTGNLVT------WWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVM 348
+A P +A A + WWTG A + D G L +++ +GPG
Sbjct 232 VATGPRITAPADGVARTSSDDARRYWWTGSATVALDTGTLQPLWSVP-----GTLGPGTR 286
Query 349 MAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
+LLVPV GG+ DP +G+ R +PV R T V G + E RGD +V LG
Sbjct 287 YGDRLLVPVPGGLADVDPGTGSAGRTVPVDRAGWTGPVQVTAQGGLLAELRGDQVVLLG 345
>gi|317507499|ref|ZP_07965224.1| hypothetical protein HMPREF9336_01596 [Segniliparus rugosus ATCC
BAA-974]
gi|316254212|gb|EFV13557.1| hypothetical protein HMPREF9336_01596 [Segniliparus rugosus ATCC
BAA-974]
Length=428
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 137/424 (33%), Positives = 200/424 (48%), Gaps = 42/424 (9%)
Query 15 AATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVP 74
A ++A+ V V A+++WW + R +SR A+ P P A +VP + W+ A P
Sbjct 3 ATSLALTVLVVAAVLWWRAPDRQAVSRTASRPAPAPRSATDVPEGFQLAWSKPDGAGTYP 62
Query 75 VVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVST 134
+VVGGT+ T DG + G + ATG+ LW YA+ +C T + AVYR RGC V +
Sbjct 63 MVVGGTLVTADGGVLTGWEVATGKELWRYAQPAPICAATVAWGKVYAVYREQRGCSLVVS 122
Query 135 IDGSTGRRGA-----ARSGYADPRVRLF----SDGT----------------TVLSAGDT 169
+D +TG R ARS AD RVRL S+G +L+ G
Sbjct 123 LDATTGARAKQLHADARSSAADQRVRLITTVPSEGAAPQDDSESSDSQWYSHMILAVGPR 182
Query 170 RLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLV 229
R+ELW D++R + YG + +P + GC L SA +VLE C LRL
Sbjct 183 RVELWHEDLLRSVEYGYVATPFEPKQQP-HPGCALRSAGIGEETFAVLERCPRDTALRLS 241
Query 230 LLR-PGKEDDEPIQR---IVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDE 285
L+ K+D +P Q ++P+ G R+L V + +Y+PA ++ +D
Sbjct 242 FLKITPKKDTKPEQTYSAVIPQLGS--SKDVRLLAVRGSRADLYVPASDSDGAKILEVDG 299
Query 286 TGATVSSTLLAKP---PSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAP 342
G S L+ P P T + T +L+TWWTG ++ +L+ + I
Sbjct 300 RGVVEKSFDLSLPVTNPDTRPWRT-TPDLITWWTGAGVVGISPLHLSPLFQIPDL----- 353
Query 343 VGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRP-PSTSAVIPAVSGSRVIEQRGD 401
VGP MA QL+ P GI V D +GA R I PS + AV GS V+ Q+
Sbjct 354 VGPVTEMAHQLIAPTATGIAVLDASTGAKLREIATEGAGPSQAGARLAVCGSTVLRQQDG 413
Query 402 TLVA 405
++A
Sbjct 414 RVLA 417
>gi|172040178|ref|YP_001799892.1| putative secreted protein [Corynebacterium urealyticum DSM 7109]
gi|171851482|emb|CAQ04458.1| putative secreted protein [Corynebacterium urealyticum DSM 7109]
Length=425
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 139/429 (33%), Positives = 203/429 (48%), Gaps = 46/429 (10%)
Query 3 KPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQ 62
+PER ++ D A I + + V WW SDAR + R A + PA A PT+L++
Sbjct 18 RPERASRKDYLAVGAIILFLVVVLVTSWWGSDAR-RVDRTQAATLSAPAAAEAAPTTLRE 76
Query 63 LWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAV 122
LW SP T P+++ G V T + + D +GE W+Y R LC VT+ VA+
Sbjct 77 LWRGTSPQTVAPILLSGGVLTAEDDVLSMHDLRSGEVAWTYDRGLPLCDVTFSNERIVAI 136
Query 123 YRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDT-------RLELWR 175
YR +GCG V ++ STG YA R L D T L + D+ R+ELWR
Sbjct 137 YRGSKGCGDVVSLAASTG-------DYAHTRSALAQDAATALRSNDSTGIVSPHRVELWR 189
Query 176 SDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQAD--LRLVLLRP 233
SD+VR G + VK + + C SA + ++ + C + LRL+ +P
Sbjct 190 SDLVRTTEVGRQETPVKKEEQRY-ADCPFTSALTRTELLATTQHCEGEDKILLRLLKTKP 248
Query 234 GKED--DEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLP--ARSGAQPRVDVIDETGAT 289
+ D +E VP G +++ ++Q++ AVY P A +P + V ++G
Sbjct 249 ERSDVPEELHSFYVPRDG-------QLVAIAQHHAAVYAPTGASGDGKPELIVASDSGEV 301
Query 290 VSSTLLAKP---------PSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETT 340
+ A P P A N+ TWWTGDA++ F +L +T+
Sbjct 302 NHYPMPATPALAGTAQDTPVLPVTADLPHNM-TWWTGDAVVGFHPTSLAPEFTVRGA--- 357
Query 341 APVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPP--STSAVIPAVSGSRVIEQ 398
+G G MA +LLVPV GG+ V+D R IPV R + S V VSG V+EQ
Sbjct 358 --LGAGANMADRLLVPVEGGVQVFDTKHKKKERTIPVRRDGLVAGSPVHLRVSGGFVVEQ 415
Query 399 RGDTLVALG 407
RG+ +V LG
Sbjct 416 RGNEVVVLG 424
>gi|237786052|ref|YP_002906757.1| hypothetical protein ckrop_1485 [Corynebacterium kroppenstedtii
DSM 44385]
gi|237758964|gb|ACR18214.1| putative secreted protein [Corynebacterium kroppenstedtii DSM
44385]
Length=448
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 133/446 (30%), Positives = 213/446 (48%), Gaps = 62/446 (13%)
Query 5 ERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLW 64
ER T+TD A A I ++V V +W TS+AR + A A VP L + W
Sbjct 11 ERSTRTDRIAVAVITLIVVVWLLGVWVTSEARHSHLSTADSAPQAEDQLTSVPAQLHRTW 70
Query 65 TA--ASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAV 122
T + PA PVV G TV + ++V+G DPATG WSY R+ +LCG+T + + V
Sbjct 71 THEISGPAPEGPVVSGPTVVATEEKKVEGLDPATGSVRWSYTRNQNLCGITASFSQIIPV 130
Query 123 YRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVRML 182
++ GCG+VS++D +TG +R + + V TR+ELWR+D+VR +
Sbjct 131 FKGPGGCGEVSSLDDATGEYSHSRESANSGPISMVRSNDNVGVVTPTRMELWRNDLVRTV 190
Query 183 AYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLR-LVLLRPGKEDDEPI 241
+G++D + +P + CT+ SA S ++V+E C + D +LR +
Sbjct 191 EFGDVDDQAEPDMQPFPR-CTIRSALTRSDLLAVVENCPDDKDTEGHAMLR--------L 241
Query 242 QRIVPEPGVRP---------GSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSS 292
+ VP+ +P A+++ +S++ AVY+ A + PR+DV+D+ G SS
Sbjct 242 MKAVPDDSRKPEMIKSYNLGADTAQIVAISESKAAVYVDAPT---PRIDVVDKKGHVTSS 298
Query 293 TLL-------------------AKPPSTSAVASRTG-NLVTWWTGDALLVFDAGNLTQRY 332
++ +P +T A G + + WW G+ L FD +L+ ++
Sbjct 299 QVVEPSPLIKAHKRANADAAGKGEPQNTFEPAINDGPHNMFWWDGERLYAFDPEDLSVQF 358
Query 333 TIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPP----------S 382
+A T V+ LLVPV GG+ V D G + IPV R
Sbjct 359 VVADALGTG----DVVGDNHLLVPVKGGVSVVDTKKGEAGKTIPVNRASRGDAVSTGGDD 414
Query 383 TSAVIPAVS----GSRVIEQRGDTLV 404
++A+ AVS G ++E++G+ LV
Sbjct 415 STAIADAVSLRVAGKTIVERQGNALV 440
>gi|337290207|ref|YP_004629228.1| hypothetical protein CULC22_00593 [Corynebacterium ulcerans BR-AD22]
gi|334698513|gb|AEG83309.1| putative secreted protein [Corynebacterium ulcerans BR-AD22]
Length=401
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 126/414 (31%), Positives = 195/414 (48%), Gaps = 24/414 (5%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSL 60
M K RRT+ D+ AA IA + S++W T+ PAA ++ T A SL
Sbjct 1 MTKSLRRTRKDLIAATIIAGISITGVSIVWATAPINKVTHSPAAQSMRTSAVPPVPVHSL 60
Query 61 KQLW-TAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYA 119
Q W T A T P++ G + DGR ++ +PATG +WSYAR LC + +
Sbjct 61 TQQWETPADQLTTKPIIAAGLAISYDGRSINAINPATGTPVWSYARPEPLCSLGQAWSSV 120
Query 120 VAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMV 179
V + RGCG V ++D ++G A RS A V S V + R+ELWRSD+V
Sbjct 121 VVTFHTGRGCGDVVSLDAASGTYKATRSASASDAVVPLSSNDNVGTVSSERVELWRSDLV 180
Query 180 RMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDE 239
R + YG++ + + +N+ C + SA + + + V E C LR P +D
Sbjct 181 RTVEYGDVPIK-QEANQQPHEDCEISSALSRKSLLGVTEKCEKSWWLRFQKTVP---EDS 236
Query 240 PIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPP 299
+ I + + G AR++ + Q + AV++ S +PR+D + +G +S + P
Sbjct 237 RVPEITHDIHIS-GDNARIIAIGQESAAVFV---STPKPRIDSFNNSGERTASI---EVP 289
Query 300 STSAVASRTG-------NLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQ 352
+ + A G + +TW+ G+ L +F+ +L I +G GV + +
Sbjct 290 AVNDAAWEHGPAVADLPHHITWFDGERLYLFNPSSLHVERVI-----DNVLGTGVAVNNK 344
Query 353 LLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
L VPV GI V + +G V R T V ++G VIE+RGD LV L
Sbjct 345 LYVPVLDGIAVINWDTGETESVFAVDRNRYTGLVSLGLAGDHVIEKRGDHLVGL 398
>gi|302524384|ref|ZP_07276726.1| predicted protein [Streptomyces sp. AA4]
gi|302433279|gb|EFL05095.1| predicted protein [Streptomyces sp. AA4]
Length=392
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/367 (33%), Positives = 184/367 (51%), Gaps = 23/367 (6%)
Query 50 PAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDT-D 108
PAP VP S+ QLW A S AT +P+ TV T DG +V GRDP TG+ W Y R+
Sbjct 38 PAPPTSVPGSMTQLWQAPSAATPIPIGQSDTVTTADGSEVAGRDPLTGQIRWHYTRENLQ 97
Query 109 LCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGD 168
LC V + AVY GC +V+ +D +TGR A R+G A+ RL SDG+ V + G
Sbjct 98 LCTVDAAWGRVNAVYHKSMGCSEVTQLDPATGRITAQRNGDAELGTRLVSDGSHVTTTGK 157
Query 169 TRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRL 228
L+ WR D+V+ + YG++ ++ +N+ + CT + AA++ + V+E C RL
Sbjct 158 HLLDTWRDDLVKSMEYGKV-PFLQNANKQPRPNCTYGTVAAAADKIGVIERCPGDRTDRL 216
Query 229 VLLRPGKE-DDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARS--------GAQPR 279
+ + E +DEP V + G ARV+ +S + T V LP + G+Q
Sbjct 217 TVYKATAEHEDEP---KVTYTALLAGKRARVVAMSGDLTGVLLPDQKLYVVYGADGSQKA 273
Query 280 VDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGET 339
+D A ++ P + +RT + W+TG + +L+ R+T+
Sbjct 274 AYPMDLPAADTAN----DPVGGTEATTRTAAGMYWFTGSKTVALSRDDLSPRWTL----- 324
Query 340 TAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQR 399
+GPG+ A QL+VP+ GG+ V + +GA R + + R V G ++EQR
Sbjct 325 DGTLGPGITFATQLVVPIRGGLAVLNETNGATLRTVGIDRGAYAGPVRLTALGPVLLEQR 384
Query 400 GDTLVAL 406
G + AL
Sbjct 385 GQNVAAL 391
>gi|334696328|gb|AEG81125.1| putative secreted protein [Corynebacterium ulcerans 809]
Length=401
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 129/419 (31%), Positives = 195/419 (47%), Gaps = 34/419 (8%)
Query 1 MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAA-----VAVPTPAPARE 55
M K RRT+ D+ AA IA + S++W T+ PAA AVP P P
Sbjct 1 MTKSLRRTRKDLIAATIIAGISITGVSIVWATAPINKVTHSPAAQSMRSSAVP-PVPVH- 58
Query 56 VPTSLKQLW-TAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTW 114
SL Q W T A T P++ G + DGR ++ +PATG +WSYAR LC +
Sbjct 59 ---SLTQQWETPADQLTTKPIIAAGLAVSYDGRSINAINPATGTPVWSYARPEPLCSLGQ 115
Query 115 VYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELW 174
+ V + RGCG V ++D ++G A RS A V S V + R+ELW
Sbjct 116 AWSSVVVTFHTGRGCGDVVSLDAASGTYKATRSASASDAVVPLSSNDNVGTVSSERVELW 175
Query 175 RSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPG 234
RSD+VR + YG++ + + +N+ C + SA + + + V E C LR P
Sbjct 176 RSDLVRTVEYGDVPIK-QEANQQPHEDCEISSALSRKSLLGVTEKCEKSWWLRFQKTVP- 233
Query 235 KEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTL 294
+D + I + + G AR++ + Q + AV++ S +PR+D +G +S
Sbjct 234 --EDSRVPEITHDIHIS-GDNARIIAIGQESAAVFV---STPKPRIDSFSNSGERTASI- 286
Query 295 LAKPPSTSAVASRTG-------NLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGV 347
+ P+ + A G + +TW+ G+ L +F+ +L I +G GV
Sbjct 287 --EVPAVNDAAWEHGPAVADLPHHITWFDGERLYLFNPSSLHVERVI-----DNVLGTGV 339
Query 348 MMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVAL 406
+ +L VPV GI V + +G V R T V ++G VIE+RGD LV L
Sbjct 340 AINNKLYVPVLDGIAVINWDTGETESVFAVDRNRYTGLVSLGLAGDHVIEKRGDHLVGL 398
>gi|19551996|ref|NP_599998.1| hypothetical protein NCgl0736 [Corynebacterium glutamicum ATCC
13032]
gi|62389659|ref|YP_225061.1| hypothetical protein cg0880 [Corynebacterium glutamicum ATCC
13032]
gi|21323536|dbj|BAB98163.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032]
gi|41324994|emb|CAF19475.1| secreted protein [Corynebacterium glutamicum ATCC 13032]
Length=400
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 123/415 (30%), Positives = 192/415 (47%), Gaps = 28/415 (6%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLK 61
++P +RTK D+ A I + + +W T+ R + PA +P +L
Sbjct 5 LQPLKRTKKDLIATGVITALAVIGVGTVWATAPIRGSELTPADEPFIASTTLDAIPETLS 64
Query 62 QLWTAASPAT-RVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+ W A +T P++ GG ++T DG + P G LWSY RD +LC ++ + AV
Sbjct 65 EHWRATDTSTNHKPLITGGVISTADGNTIKTYTP-DGALLWSYERDKELCSLSVGFDAAV 123
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
A Y+ GCG V+ I+ + G+ A RS + V S + G RLELWRSD+VR
Sbjct 124 ATYKTGIGCGDVTAINANDGQYQATRSAISSDHVAPISSNDRIGVLGTERLELWRSDLVR 183
Query 181 MLAYGEIDARVKPSNRGLQSG--CTLESAAASSAAVSVLEACTNQAD-LRLVLLRPGKED 237
+ YG+++A P G Q C++ SA +++ E C + + LR + P D
Sbjct 184 TIEYGDVEA---PQESGQQPHPECSITSAMTRKDLLAITEDCPDGSSYLRFMGTTP---D 237
Query 238 DEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGA-----TVSS 292
D I + + G R++ + Q+ AVY + PR+ ++ G V
Sbjct 238 DSRTPEITQDIEITDG---RIVAIGQSVAAVY---TNDPSPRIVSYNDDGELVGEQAVDE 291
Query 293 TLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQ 352
PP SA A ++ +W+ GD+L++F L R + +G G+ + G
Sbjct 292 VEFPDPPFQSATADLPHHM-SWFNGDSLVLFSPTQLNVRQSF-----NDALGTGIALNGS 345
Query 353 LLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
LL P GI V + +G R IPV R V V G ++E+RG +VALG
Sbjct 346 LLYPTAEGITVANWDTGEVQRTIPVDRAGYDGEVALGVVGQVIVEKRGSEIVALG 400
>gi|344043729|gb|EGV39417.1| hypothetical protein CgS9114_13161 [Corynebacterium glutamicum
S9114]
Length=400
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 123/415 (30%), Positives = 192/415 (47%), Gaps = 28/415 (6%)
Query 2 VKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLK 61
++P +RTK D+ A I + + +W T+ R + PA +P L
Sbjct 5 LQPLKRTKKDLIATGVITALAVIGVGTVWATAPIRGSELTPADEPFIASTTLDAIPEKLS 64
Query 62 QLWTAASPAT-RVPVVVGGTVATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAV 120
+ W A +T P++ GG ++T DG + P G LWSY RD +LC ++ + AV
Sbjct 65 EHWRATDTSTNHKPLITGGVISTADGNTIKTYTP-DGVLLWSYERDKELCSLSVGFDAAV 123
Query 121 AVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDGTTVLSAGDTRLELWRSDMVR 180
A Y+ GCG V+ I+ + G+ A RS + V S + G RLELWRSD+VR
Sbjct 124 ATYKTGIGCGDVTAINANDGQYKATRSAISSDHVAPISSNDRIGVLGTERLELWRSDLVR 183
Query 181 MLAYGEIDARVKPSNRGLQSG--CTLESAAASSAAVSVLEACTNQAD-LRLVLLRPGKED 237
+ YG+++A P G Q C++ SA +++ E C + + LR + P D
Sbjct 184 TIEYGDVEA---PQESGQQPHPECSITSAMTRKDLLAITEDCPDGSSYLRFMGTTP---D 237
Query 238 DEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGA-----TVSS 292
D I + + G R++ + Q+ AVY + PR+ ++ G V+
Sbjct 238 DSRTPEITQDIEITDG---RIVAIGQSAAAVY---TNDPSPRIVSYNDDGELVGEQAVNE 291
Query 293 TLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQ 352
PP SA A ++ +W+ GD+L++F L R + +G G+ + G
Sbjct 292 VEFPDPPFQSATADLPHHM-SWFNGDSLVLFSPTQLNVRQSFDDA-----LGTGIALNGS 345
Query 353 LLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG 407
LL P GI V + +G R IPV R V V G ++E+RG +VALG
Sbjct 346 LLYPTAEGITVANWDTGEVQRTIPVDRAGYGGEVALGVVGQVIVEKRGSEIVALG 400
Lambda K H
0.315 0.130 0.386
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 810175551480
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40