BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3529c
Length=384
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610665|ref|NP_218046.1| hypothetical protein Rv3529c [Mycob... 796 0.0
gi|306791122|ref|ZP_07429424.1| hypothetical protein TMEG_00017 ... 793 0.0
gi|260099866|pdb|2ZQ5|A Chain A, Crystal Structure Of Sulfotrans... 793 0.0
gi|308232492|ref|ZP_07416218.2| hypothetical protein TMAG_00019 ... 776 0.0
gi|339299949|gb|AEJ52059.1| hypothetical protein CCDC5180_3222 [... 773 0.0
gi|308369155|ref|ZP_07416748.2| hypothetical protein TMBG_02064 ... 773 0.0
gi|41406635|ref|NP_959471.1| hypothetical protein MAP0537 [Mycob... 708 0.0
gi|240172363|ref|ZP_04751022.1| hypothetical protein MkanA1_2381... 707 0.0
gi|254773588|ref|ZP_05215104.1| hypothetical protein MaviaA2_027... 706 0.0
gi|336458424|gb|EGO37398.1| sulfotransferase family protein [Myc... 705 0.0
gi|183984985|ref|YP_001853276.1| hypothetical protein MMAR_5017 ... 699 0.0
gi|296166559|ref|ZP_06848989.1| conserved hypothetical protein [... 698 0.0
gi|118619276|ref|YP_907608.1| hypothetical protein MUL_4091 [Myc... 685 0.0
gi|254822612|ref|ZP_05227613.1| hypothetical protein MintA_21964... 676 0.0
gi|342862262|ref|ZP_08718904.1| hypothetical protein MCOL_25351 ... 667 0.0
gi|333992310|ref|YP_004524924.1| hypothetical protein JDM601_367... 611 7e-173
gi|108801608|ref|YP_641805.1| hypothetical protein Mmcs_4645 [My... 609 2e-172
gi|120406175|ref|YP_956004.1| hypothetical protein Mvan_5227 [My... 608 3e-172
gi|126437592|ref|YP_001073283.1| hypothetical protein Mjls_5028 ... 608 5e-172
gi|315442562|ref|YP_004075441.1| hypothetical protein Mspyr1_091... 602 4e-170
gi|289763707|ref|ZP_06523085.1| conserved hypothetical protein [... 602 5e-170
gi|118467577|ref|YP_890156.1| hypothetical protein MSMEG_5930 [M... 599 2e-169
gi|145222123|ref|YP_001132801.1| hypothetical protein Mflv_1531 ... 597 1e-168
gi|169631255|ref|YP_001704904.1| hypothetical protein MAB_4177c ... 577 2e-162
gi|312139145|ref|YP_004006481.1| hypothetical protein REQ_17280 ... 531 1e-148
gi|343925876|ref|ZP_08765391.1| hypothetical protein GOALK_050_0... 530 2e-148
gi|111018523|ref|YP_701495.1| hypothetical protein RHA1_ro01523 ... 526 3e-147
gi|262200922|ref|YP_003272130.1| hypothetical protein Gbro_0925 ... 520 1e-145
gi|226360642|ref|YP_002778420.1| hypothetical protein ROP_12280 ... 516 4e-144
gi|296141275|ref|YP_003648518.1| hypothetical protein Tpau_3601 ... 513 2e-143
gi|54024428|ref|YP_118670.1| hypothetical protein nfa24590 [Noca... 513 3e-143
gi|226307428|ref|YP_002767388.1| hypothetical protein RER_39410 ... 506 3e-141
gi|229490062|ref|ZP_04383915.1| conserved hypothetical protein [... 504 7e-141
gi|326384571|ref|ZP_08206250.1| hypothetical protein SCNU_16603 ... 499 4e-139
gi|300784755|ref|YP_003765046.1| hypothetical protein AMED_2850 ... 498 8e-139
gi|302527707|ref|ZP_07280049.1| conserved hypothetical protein [... 478 1e-132
gi|159038405|ref|YP_001537658.1| hypothetical protein Sare_2832 ... 461 1e-127
gi|269126972|ref|YP_003300342.1| hypothetical protein Tcur_2758 ... 457 1e-126
gi|319948611|ref|ZP_08022735.1| hypothetical protein ES5_04493 [... 456 3e-126
gi|326382882|ref|ZP_08204572.1| hypothetical protein SCNU_08083 ... 448 7e-124
gi|145595160|ref|YP_001159457.1| hypothetical protein Strop_2635... 421 7e-116
gi|326331627|ref|ZP_08197915.1| hypothetical protein NBCG_03066 ... 421 1e-115
gi|119718592|ref|YP_925557.1| hypothetical protein Noca_4373 [No... 405 7e-111
gi|325675119|ref|ZP_08154805.1| sulfotransferase [Rhodococcus eq... 389 5e-106
gi|312137729|ref|YP_004005065.1| hypothetical protein REQ_02280 ... 389 6e-106
gi|229490662|ref|ZP_04384500.1| conserved hypothetical protein [... 382 6e-104
gi|226305146|ref|YP_002765104.1| hypothetical protein RER_16570 ... 381 1e-103
gi|312196476|ref|YP_004016537.1| hypothetical protein FraEuI1c_2... 373 3e-101
gi|86740720|ref|YP_481120.1| hypothetical protein Francci3_2017 ... 365 8e-99
gi|148553234|ref|YP_001260816.1| hypothetical protein Swit_0307 ... 268 1e-69
>gi|15610665|ref|NP_218046.1| hypothetical protein Rv3529c [Mycobacterium tuberculosis H37Rv]
gi|15843142|ref|NP_338179.1| hypothetical protein MT3632 [Mycobacterium tuberculosis CDC1551]
gi|31794705|ref|NP_857198.1| hypothetical protein Mb3559c [Mycobacterium bovis AF2122/97]
50 more sequence titles
Length=384
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/384 (100%), Positives = 384/384 (100%), Gaps = 0/384 (0%)
Query 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM
Sbjct 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
Query 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL
Sbjct 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
Query 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ
Sbjct 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
Query 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD
Sbjct 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
Query 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF
Sbjct 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
Query 301 NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA
Sbjct 301 NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
Query 361 PKHSYSLADYGLTVEMVKERFAGL 384
PKHSYSLADYGLTVEMVKERFAGL
Sbjct 361 PKHSYSLADYGLTVEMVKERFAGL 384
>gi|306791122|ref|ZP_07429424.1| hypothetical protein TMEG_00017 [Mycobacterium tuberculosis SUMu005]
gi|308340313|gb|EFP29164.1| hypothetical protein TMEG_00017 [Mycobacterium tuberculosis SUMu005]
Length=384
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/384 (99%), Positives = 383/384 (99%), Gaps = 0/384 (0%)
Query 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM
Sbjct 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
Query 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL
Sbjct 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
Query 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ
Sbjct 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
Query 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD
Sbjct 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
Query 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF
Sbjct 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
Query 301 NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
NAARAKYDSAQFYDVDYHDLIADPLG VADIYRHFGLTLSDEARQAMTTVHAESQSGARA
Sbjct 301 NAARAKYDSAQFYDVDYHDLIADPLGRVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
Query 361 PKHSYSLADYGLTVEMVKERFAGL 384
PKHSYSLADYGLTVEMVKERFAGL
Sbjct 361 PKHSYSLADYGLTVEMVKERFAGL 384
>gi|260099866|pdb|2ZQ5|A Chain A, Crystal Structure Of Sulfotransferase Stf1 From Mycobacterium
Tuberculosis H37rv (Type1 Form)
Length=384
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/384 (99%), Positives = 383/384 (99%), Gaps = 0/384 (0%)
Query 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM
Sbjct 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
Query 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL
Sbjct 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
Query 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
HMWLAEYPQPRPPRETWESNPLYRQLDA FTQHHAENPGYTGLHFMAAYELEECWQLLRQ
Sbjct 121 HMWLAEYPQPRPPRETWESNPLYRQLDADFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
Query 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD
Sbjct 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
Query 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF
Sbjct 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF 300
Query 301 NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA
Sbjct 301 NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
Query 361 PKHSYSLADYGLTVEMVKERFAGL 384
PKHSYSLADYGLTVEMVKERFAGL
Sbjct 361 PKHSYSLADYGLTVEMVKERFAGL 384
>gi|308232492|ref|ZP_07416218.2| hypothetical protein TMAG_00019 [Mycobacterium tuberculosis SUMu001]
gi|308379539|ref|ZP_07486659.2| hypothetical protein TMJG_00774 [Mycobacterium tuberculosis SUMu010]
gi|308380726|ref|ZP_07490878.2| hypothetical protein TMKG_00766 [Mycobacterium tuberculosis SUMu011]
6 more sequence titles
Length=375
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/375 (99%), Positives = 375/375 (100%), Gaps = 0/375 (0%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL
Sbjct 1 MATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 60
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ
Sbjct 61 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 120
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA
Sbjct 121 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 180
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA
Sbjct 181 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 240
Query 250 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 309
LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS
Sbjct 241 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 300
Query 310 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 369
AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD
Sbjct 301 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 360
Query 370 YGLTVEMVKERFAGL 384
YGLTVEMVKERFAGL
Sbjct 361 YGLTVEMVKERFAGL 375
>gi|339299949|gb|AEJ52059.1| hypothetical protein CCDC5180_3222 [Mycobacterium tuberculosis
CCDC5180]
Length=375
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/375 (99%), Positives = 374/375 (99%), Gaps = 0/375 (0%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL
Sbjct 1 MATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 60
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ
Sbjct 61 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 120
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PR PRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA
Sbjct 121 PRSPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 180
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA
Sbjct 181 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 240
Query 250 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 309
LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS
Sbjct 241 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 300
Query 310 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 369
AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD
Sbjct 301 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 360
Query 370 YGLTVEMVKERFAGL 384
YGLTVEMVKERFAGL
Sbjct 361 YGLTVEMVKERFAGL 375
>gi|308369155|ref|ZP_07416748.2| hypothetical protein TMBG_02064 [Mycobacterium tuberculosis SUMu002]
gi|308371380|ref|ZP_07424756.2| hypothetical protein TMCG_03652 [Mycobacterium tuberculosis SUMu003]
gi|308372574|ref|ZP_07429120.2| hypothetical protein TMDG_01259 [Mycobacterium tuberculosis SUMu004]
11 more sequence titles
Length=375
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/375 (99%), Positives = 374/375 (99%), Gaps = 0/375 (0%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL
Sbjct 1 MATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 60
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ
Sbjct 61 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 120
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA
Sbjct 121 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 180
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA
Sbjct 181 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 240
Query 250 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 309
LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS
Sbjct 241 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 300
Query 310 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 369
AQFYDVDYHDLIADPLG VADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD
Sbjct 301 AQFYDVDYHDLIADPLGRVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 360
Query 370 YGLTVEMVKERFAGL 384
YGLTVEMVKERFAGL
Sbjct 361 YGLTVEMVKERFAGL 375
>gi|41406635|ref|NP_959471.1| hypothetical protein MAP0537 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118465104|ref|YP_879911.1| hypothetical protein MAV_0631 [Mycobacterium avium 104]
gi|41394984|gb|AAS02854.1| hypothetical protein MAP_0537 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118166391|gb|ABK67288.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=381
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/379 (89%), Positives = 356/379 (94%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR D+ TV+ELHASATKL GLDDFGTDDDNY +AL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDIGTVEELHASATKLTGLDDFGTDDDNYLQALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+SAWKQYP++ DVAI+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSESAWKQYPQYADVAIQRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWESNPLYRQLDAQFTQHH +NPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWESNPLYRQLDAQFTQHHRDNPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAHVPSYA WLS QDWTPSY RHRRNLQLIGLNDA+KRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHVPSYAQWLSEQDWTPSYQRHRRNLQLIGLNDADKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT EGWST FVGAQIGADAMDTWSRGLERFN ARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTAEGWSTTFVGAQIGADAMDTWSRGLERFNTARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KY+ AQFYDVDY +LIADPLGTVADIYRHFGLTL++EA+ AM HA+SQSG RAPKHSY
Sbjct 303 KYNPAQFYDVDYKELIADPLGTVADIYRHFGLTLTEEAKAAMAKTHADSQSGERAPKHSY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGL+VE VKERFAGL
Sbjct 363 SLADYGLSVETVKERFAGL 381
>gi|240172363|ref|ZP_04751022.1| hypothetical protein MkanA1_23813 [Mycobacterium kansasii ATCC
12478]
Length=381
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/379 (89%), Positives = 360/379 (95%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR DV TV+ELHASATKLVGLDDFG+DDDNYREALGVLLD+Y+ +AGLTVLGSKMNRFFL
Sbjct 3 DRTDVGTVEELHASATKLVGLDDFGSDDDNYREALGVLLDSYRRDAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQYP++ DVAI+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSEAAWKQYPQYADVAIERPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPR+TWESNPLYRQLD QFT+HH ENPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRDTWESNPLYRQLDDQFTRHHKENPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+P YA WLS+QDWTP+Y RHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMA+
Sbjct 183 SYETLAHLPGYASWLSQQDWTPAYRRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAS 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT+EGWST FVGAQIGADAMDTWSRGLERFNAARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTSEGWSTVFVGAQIGADAMDTWSRGLERFNAARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+YD AQFYDVDY DLIADPLGTVA IYRHFGLTL++EARQAM +HAESQ+G RAPKH+Y
Sbjct 303 QYDPAQFYDVDYRDLIADPLGTVAAIYRHFGLTLTEEARQAMAKIHAESQTGERAPKHTY 362
Query 366 SLADYGLTVEMVKERFAGL 384
+LADYGLT E VKERFAGL
Sbjct 363 ALADYGLTAEAVKERFAGL 381
>gi|254773588|ref|ZP_05215104.1| hypothetical protein MaviaA2_02775 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=381
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/379 (89%), Positives = 355/379 (94%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR D+ TV+ELHASATKL GLDDFGTDDDNY +AL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDIGTVEELHASATKLTGLDDFGTDDDNYLQALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+SAWKQYP++ DVAI+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSESAWKQYPQYADVAIQRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWESNPLYRQLD QFTQHH +NPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWESNPLYRQLDTQFTQHHRDNPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAHVPSYA WLS QDWTPSY RHRRNLQLIGLNDA+KRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHVPSYAQWLSEQDWTPSYQRHRRNLQLIGLNDADKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT EGWST FVGAQIGADAMDTWSRGLERFN ARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTAEGWSTTFVGAQIGADAMDTWSRGLERFNTARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KY+ AQFYDVDY +LIADPLGTVADIYRHFGLTL++EA+ AM HA+SQSG RAPKHSY
Sbjct 303 KYNPAQFYDVDYKELIADPLGTVADIYRHFGLTLTEEAKAAMAKTHADSQSGERAPKHSY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGL+VE VKERFAGL
Sbjct 363 SLADYGLSVETVKERFAGL 381
>gi|336458424|gb|EGO37398.1| sulfotransferase family protein [Mycobacterium avium subsp. paratuberculosis
S397]
Length=381
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/379 (89%), Positives = 355/379 (94%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR D+ TV+ELHASATKL GLDDFGTDDDNY +AL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDIGTVEELHASATKLTGLDDFGTDDDNYLQALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+SAWKQYP++ DVAI+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSESAWKQYPQYADVAIQRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWESNPLYRQLDAQFTQHH +NPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWESNPLYRQLDAQFTQHHRDNPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAHVPSYA WLS QDWTPSY RHRRNLQLIGLNDA+KRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHVPSYAQWLSEQDWTPSYQRHRRNLQLIGLNDADKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQ T EGWST FVGAQIGADAMDTWSRGLERFN ARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQDTAEGWSTTFVGAQIGADAMDTWSRGLERFNTARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KY+ AQFYDVDY +LIADPLGTVADIYRHFGLTL++EA+ AM HA+SQSG RAPKHSY
Sbjct 303 KYNPAQFYDVDYKELIADPLGTVADIYRHFGLTLTEEAKAAMAKTHADSQSGERAPKHSY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGL+VE VKERFAGL
Sbjct 363 SLADYGLSVETVKERFAGL 381
>gi|183984985|ref|YP_001853276.1| hypothetical protein MMAR_5017 [Mycobacterium marinum M]
gi|183178311|gb|ACC43421.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=381
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/379 (87%), Positives = 354/379 (94%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR DV TVDELHASATKLVGLDDFG+D DNYREAL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDVGTVDELHASATKLVGLDDFGSDQDNYREALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQYP++ +V I+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSEAAWKQYPQYAEVPIQRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWE+NP YRQLDAQFTQHH +NPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWETNPFYRQLDAQFTQHHKDNPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+PSYA WL++QDWTPSY RHR+NLQLIGLNDAEKRWVLKNPSHLFALDALMA+
Sbjct 183 SYETLAHLPSYAQWLAKQDWTPSYQRHRKNLQLIGLNDAEKRWVLKNPSHLFALDALMAS 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT+EGWST FVGAQIGADAM+TWSRGL+RF++AR
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTSEGWSTNFVGAQIGADAMETWSRGLQRFDSART 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
YD AQFYDVDY DLIADP+GTVADIYRHFGLTL+DEAR AM +HAESQ+G RAPKH Y
Sbjct 303 NYDPAQFYDVDYRDLIADPMGTVADIYRHFGLTLTDEARAAMAKIHAESQTGERAPKHRY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGLT E VKERFAG
Sbjct 363 SLADYGLTAEAVKERFAGF 381
>gi|296166559|ref|ZP_06848989.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898045|gb|EFG77621.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=381
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/379 (88%), Positives = 348/379 (92%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR D+ TVDELHASATKL GLDDFG DDDNYREAL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDIGTVDELHASATKLTGLDDFGADDDNYREALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+++WKQYP+H DV I+RPIFVTGLVRTGTTALHRLLGADP HQGLHMWLA
Sbjct 63 RGALVARLLSEASWKQYPQHADVVIERPIFVTGLVRTGTTALHRLLGADPTHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWES+PLY+QLDAQFT+HH ENPGYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWESHPLYQQLDAQFTRHHQENPGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+PSYA WLS QDWTPSY RHR+NLQLIGLND EKRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHLPSYAHWLSEQDWTPSYQRHRKNLQLIGLNDTEKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT EGWST F GAQIGADAMDTWSRGLERFN ARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTAEGWSTTFDGAQIGADAMDTWSRGLERFNTARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KY AQFYDVDY DLIADPLGTV DIYRHFGLTL+DEAR AM HA SQSG RAPKH Y
Sbjct 303 KYSPAQFYDVDYKDLIADPLGTVTDIYRHFGLTLTDEARTAMEKTHAASQSGERAPKHRY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGLTVE VKERFAGL
Sbjct 363 SLADYGLTVETVKERFAGL 381
>gi|118619276|ref|YP_907608.1| hypothetical protein MUL_4091 [Mycobacterium ulcerans Agy99]
gi|118571386|gb|ABL06137.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=381
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/379 (85%), Positives = 349/379 (93%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR DV TVDELHASATKLVGLDDFG+D D YREAL VLLD+Y+ EAGLTVLGSKMNRFFL
Sbjct 3 DRTDVGTVDELHASATKLVGLDDFGSDQDTYREALEVLLDSYRREAGLTVLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQYP++ +V I+RPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA
Sbjct 63 RGALVARLLSEAAWKQYPQYAEVPIQRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETWE+NP YRQLDAQ TQH +N GYTGLHFMAAYELEECWQLLRQSLHSV
Sbjct 123 EFPQPRPPRETWETNPFYRQLDAQLTQHRKDNTGYTGLHFMAAYELEECWQLLRQSLHSV 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+PSYA WL++QDWTPSY RHR+NLQLIGLNDAEKRWVLKNPSHLFALDALMA+
Sbjct 183 SYETLAHLPSYAQWLAKQDWTPSYQRHRKNLQLIGLNDAEKRWVLKNPSHLFALDALMAS 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT+EGWST FVGAQIGADAM+TWSRGL+ F++AR
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTSEGWSTNFVGAQIGADAMETWSRGLQPFDSART 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
YD AQFYDVDY DLIADP+GTVADIYRHFGLTL+DEAR AM +HAESQ+G RAPKH Y
Sbjct 303 NYDPAQFYDVDYRDLIADPMGTVADIYRHFGLTLTDEARAAMAKIHAESQTGERAPKHRY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGLT E VKERFAG
Sbjct 363 SLADYGLTAEAVKERFAGF 381
>gi|254822612|ref|ZP_05227613.1| hypothetical protein MintA_21964 [Mycobacterium intracellulare
ATCC 13950]
Length=381
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/379 (84%), Positives = 349/379 (93%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R DV TVD+L ASA+K++GLDDFG++DDNY EAL VLLD+Y+ +A LT LGSKMNRFFL
Sbjct 3 ERTDVGTVDDLKASASKMIGLDDFGSNDDNYLEALEVLLDSYRRDADLTPLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQYP+H DV I+RPIFVTGLVRTGTTALHRLLGADPAHQGLH+WLA
Sbjct 63 RGALVARLLSEAAWKQYPQHADVVIERPIFVTGLVRTGTTALHRLLGADPAHQGLHLWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETW+SNP Y QL+AQF +HHAENP YTGLHFMAAYELEECWQLLRQSLHS
Sbjct 123 EFPQPRPPRETWDSNPYYSQLNAQFEKHHAENPDYTGLHFMAAYELEECWQLLRQSLHSA 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+P+Y+ WLSRQDWTPSY RHRRNLQLIGLNDA+KRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHLPTYSQWLSRQDWTPSYQRHRRNLQLIGLNDADKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT EGWST F GAQIGADAM+TWSRGLERFN ARA
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTAEGWSTSFTGAQIGADAMETWSRGLERFNTARA 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KY +QFYDVDY +LIADP+GTVADIYRHFG+TL++EA+ AM HA+SQSGARAPKHSY
Sbjct 303 KYSPSQFYDVDYKELIADPMGTVADIYRHFGMTLTEEAKAAMEKTHADSQSGARAPKHSY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGL+VE VKERFAGL
Sbjct 363 SLADYGLSVETVKERFAGL 381
>gi|342862262|ref|ZP_08718904.1| hypothetical protein MCOL_25351 [Mycobacterium colombiense CECT
3035]
gi|342130340|gb|EGT83660.1| hypothetical protein MCOL_25351 [Mycobacterium colombiense CECT
3035]
Length=381
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/379 (82%), Positives = 344/379 (91%), Gaps = 0/379 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R DV TVD+L ASA+K++GLDDFG++ DNY EAL VLLD+Y+ +A LT LGSKMNRFFL
Sbjct 3 ERTDVGTVDDLKASASKMIGLDDFGSNGDNYLEALEVLLDSYRRDADLTPLGSKMNRFFL 62
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQYP+H DV I+RPIFVTGLVRTGTTALHRLLGADPAHQGLH+WLA
Sbjct 63 RGALVARLLSEAAWKQYPQHADVVIERPIFVTGLVRTGTTALHRLLGADPAHQGLHLWLA 122
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPRETW+SNP Y QL+AQF +HHAENP YTGLHFMAAYELEECWQLLRQSLHS
Sbjct 123 EFPQPRPPRETWDSNPFYSQLNAQFNKHHAENPDYTGLHFMAAYELEECWQLLRQSLHSA 182
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LAH+P+Y+ WLSRQDWTPSY RHRRNLQLIGLNDA+KRWVLKNPSHLFALDALMAT
Sbjct 183 SYETLAHLPTYSQWLSRQDWTPSYQRHRRNLQLIGLNDADKRWVLKNPSHLFALDALMAT 242
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRPVETIMASMCSLAQHT EGWS F GAQIGADAM+TWSRGLERFN AR
Sbjct 243 YPDALVIQTHRPVETIMASMCSLAQHTAEGWSNTFTGAQIGADAMETWSRGLERFNTARV 302
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+Y +QFYDVDY +LIADP+GTVADIYRHFGLTL++EA+ AM HAESQSG RAPKH+Y
Sbjct 303 QYSPSQFYDVDYKELIADPMGTVADIYRHFGLTLTEEAKAAMEKTHAESQSGPRAPKHTY 362
Query 366 SLADYGLTVEMVKERFAGL 384
SLADYGL+ E VKERFAGL
Sbjct 363 SLADYGLSTETVKERFAGL 381
>gi|333992310|ref|YP_004524924.1| hypothetical protein JDM601_3670 [Mycobacterium sp. JDM601]
gi|333488278|gb|AEF37670.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=381
Score = 611 bits (1575), Expect = 7e-173, Method: Compositional matrix adjust.
Identities = 289/378 (77%), Positives = 323/378 (86%), Gaps = 0/378 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASATK+VGLDDFG DDDNYREALGVLL++Y+ EA LT LGSKMNRFFLR
Sbjct 4 RTDVGTVEDLHASATKMVGLDDFGPDDDNYREALGVLLESYRTEADLTELGSKMNRFFLR 63
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
GALVARLL+Q+ WKQ+PE+ +VA++RPIFVTGL RTGTTALHRLLGADPAHQGL MWLAE
Sbjct 64 GALVARLLAQAGWKQHPEYAEVAVERPIFVTGLPRTGTTALHRLLGADPAHQGLEMWLAE 123
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPRETW+SNP++ Q+ AQF +HH ENP YTGLHFM A LEECWQLLRQSLHSVS
Sbjct 124 FPQPRPPRETWDSNPVFAQMQAQFARHHDENPDYTGLHFMTADGLEECWQLLRQSLHSVS 183
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LAH+PSY+ WLS QDW PSY RHRRNLQLIGLND KRWVLKNPSHLFALDA+MA Y
Sbjct 184 YETLAHLPSYSRWLSEQDWIPSYRRHRRNLQLIGLNDPGKRWVLKNPSHLFALDAIMAVY 243
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDAL+VQ HRPVETI+ASMCSLAQHTTEG S FVGAQIG D M+TW+RGLE FN+ R +
Sbjct 244 PDALIVQCHRPVETILASMCSLAQHTTEGQSNTFVGAQIGIDEMETWARGLELFNSQRPR 303
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
YD AQF DVDY + +ADPL T A IY FGL LSD ARQAM +A S++G RAPKH YS
Sbjct 304 YDQAQFCDVDYREFVADPLATAAGIYERFGLPLSDAARQAMADDYAASKTGPRAPKHQYS 363
Query 367 LADYGLTVEMVKERFAGL 384
L DYGLT E V+ERFAGL
Sbjct 364 LEDYGLTTEQVRERFAGL 381
>gi|108801608|ref|YP_641805.1| hypothetical protein Mmcs_4645 [Mycobacterium sp. MCS]
gi|119870762|ref|YP_940714.1| hypothetical protein Mkms_4733 [Mycobacterium sp. KMS]
gi|108772027|gb|ABG10749.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696851|gb|ABL93924.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=383
Score = 609 bits (1571), Expect = 2e-172, Method: Compositional matrix adjust.
Identities = 288/378 (77%), Positives = 320/378 (85%), Gaps = 0/378 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASA K GLDDFG+DDDNYREALGVLL++Y+ +A LT GSKM RFF+R
Sbjct 6 RTDVGTVEDLHASAVKACGLDDFGSDDDNYREALGVLLESYRRDADLTEFGSKMQRFFVR 65
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
ALVARL+S++A+KQYPEH VAI+RPIFVTGL RTGTTA+HRLL ADP HQGL +WLAE
Sbjct 66 NALVARLVSEAAFKQYPEHAAVAIERPIFVTGLPRTGTTAVHRLLAADPRHQGLELWLAE 125
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPRETW NP++RQLDAQFT+ H ENP YTGLHFM A E+EECWQLLRQSLHSVS
Sbjct 126 FPQPRPPRETWSDNPVFRQLDAQFTKAHEENPDYTGLHFMTADEVEECWQLLRQSLHSVS 185
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LAHVP+Y+ WL+RQDWT Y RHRRNLQLIGLND EKRWVLKNPSHLFALDAL ATY
Sbjct 186 YETLAHVPTYSQWLARQDWTKPYQRHRRNLQLIGLNDREKRWVLKNPSHLFALDALFATY 245
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALVVQ HRP ETIMASMCSLAQHTTEGWS FVG IGAD+M+TWSRGLE FNA RAK
Sbjct 246 PDALVVQCHRPAETIMASMCSLAQHTTEGWSNTFVGDVIGADSMETWSRGLELFNAERAK 305
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
+D AQFYD+DY LI DP+ V DIYR FG+ +D AR+AM H ESQ G RAPKH+YS
Sbjct 306 HDPAQFYDLDYFALIKDPISVVEDIYRTFGIEFTDGAREAMARTHEESQRGPRAPKHTYS 365
Query 367 LADYGLTVEMVKERFAGL 384
LADYGLT E VKERFAGL
Sbjct 366 LADYGLTAEQVKERFAGL 383
>gi|120406175|ref|YP_956004.1| hypothetical protein Mvan_5227 [Mycobacterium vanbaalenii PYR-1]
gi|119958993|gb|ABM15998.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=381
Score = 608 bits (1569), Expect = 3e-172, Method: Compositional matrix adjust.
Identities = 288/378 (77%), Positives = 321/378 (85%), Gaps = 0/378 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASA K GLDDFG+DDDNYREALGVLL++Y+ +A LT LGSKM RFF+R
Sbjct 4 RTDVGTVEDLHASAVKACGLDDFGSDDDNYREALGVLLESYRRDADLTELGSKMQRFFVR 63
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
ALVARL+S++A+KQYPEH DVAI+RPIFVTGL RTGTTA+HRLL ADP HQGL +WLAE
Sbjct 64 NALVARLVSEAAFKQYPEHADVAIERPIFVTGLPRTGTTAVHRLLAADPRHQGLELWLAE 123
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPRETW NP+++ LDAQFT+ H ENP YTGLHFM A E+EECWQLLRQSLHSVS
Sbjct 124 FPQPRPPRETWSQNPVFQALDAQFTKAHEENPDYTGLHFMTADEVEECWQLLRQSLHSVS 183
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LAHVP+Y+ WL+RQDWT SY RHRRNLQLIGLND EKRWVLKNPSHLFALDALMATY
Sbjct 184 YETLAHVPTYSQWLARQDWTKSYQRHRRNLQLIGLNDREKRWVLKNPSHLFALDALMATY 243
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALVVQ HRP ETIMASMCSLAQHTTEGWS FVG IGAD+M+TWSRGLE FNA RAK
Sbjct 244 PDALVVQCHRPAETIMASMCSLAQHTTEGWSNTFVGDVIGADSMETWSRGLELFNAERAK 303
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
+D AQFYD+DY LI DP+G V DIYR FG+ AR AMT H ES+ G RAPKH+YS
Sbjct 304 HDPAQFYDLDYFALIKDPVGAVGDIYRSFGIDFPHAARAAMTATHEESKKGPRAPKHTYS 363
Query 367 LADYGLTVEMVKERFAGL 384
L+DYGLT E VKERF GL
Sbjct 364 LSDYGLTDEQVKERFKGL 381
>gi|126437592|ref|YP_001073283.1| hypothetical protein Mjls_5028 [Mycobacterium sp. JLS]
gi|126237392|gb|ABO00793.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=383
Score = 608 bits (1567), Expect = 5e-172, Method: Compositional matrix adjust.
Identities = 288/381 (76%), Positives = 320/381 (84%), Gaps = 0/381 (0%)
Query 4 RPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRF 63
R R DV TV++LHASA K GLDDFG+DDDNYREAL VLL++Y+ +A LT GSKM RF
Sbjct 3 RSGRTDVGTVEDLHASAVKACGLDDFGSDDDNYREALDVLLESYRRDADLTEFGSKMQRF 62
Query 64 FLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMW 123
F+R ALVARL+S++A+KQYPEH VAI+RPIFVTGL RTGTTA+HRLL ADP HQGL +W
Sbjct 63 FVRNALVARLVSEAAFKQYPEHAAVAIERPIFVTGLPRTGTTAVHRLLAADPRHQGLELW 122
Query 124 LAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLH 183
LAE+PQPRPPRETW NP++RQLDAQFT+ H ENP YTGLHFM A E+EECWQLLRQSLH
Sbjct 123 LAEFPQPRPPRETWSDNPVFRQLDAQFTKAHEENPDYTGLHFMTADEVEECWQLLRQSLH 182
Query 184 SVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALM 243
SVSYE LAHVP+Y+ WL+RQDWT Y RHRRNLQLIGLND EKRWVLKNPSHLFALDAL
Sbjct 183 SVSYETLAHVPTYSQWLARQDWTKPYQRHRRNLQLIGLNDREKRWVLKNPSHLFALDALF 242
Query 244 ATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAA 303
ATYPDALVVQ HRP ETIMASMCSLAQHTTEGWS FVG IGAD+M+TWSRGLE FNA
Sbjct 243 ATYPDALVVQCHRPAETIMASMCSLAQHTTEGWSNTFVGDVIGADSMETWSRGLELFNAE 302
Query 304 RAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKH 363
RAK+D AQFYD+DY LI DP+ V DIYR FG+ +D AR+AM H ESQ G RAPKH
Sbjct 303 RAKHDPAQFYDLDYFALIKDPISVVEDIYRTFGIEFTDGAREAMARTHEESQRGPRAPKH 362
Query 364 SYSLADYGLTVEMVKERFAGL 384
+YSLADYGLT E VKERFAGL
Sbjct 363 TYSLADYGLTAEQVKERFAGL 383
>gi|315442562|ref|YP_004075441.1| hypothetical protein Mspyr1_09150 [Mycobacterium sp. Spyr1]
gi|315260865|gb|ADT97606.1| hypothetical protein Mspyr1_09150 [Mycobacterium sp. Spyr1]
Length=381
Score = 602 bits (1551), Expect = 4e-170, Method: Compositional matrix adjust.
Identities = 284/378 (76%), Positives = 319/378 (85%), Gaps = 0/378 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASA K GLDDFG+DDDNYREAL VLL++YQ +A LT LGSKM RFF R
Sbjct 4 RTDVGTVEDLHASAVKACGLDDFGSDDDNYREALAVLLESYQRDADLTELGSKMQRFFAR 63
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
ALV+RL+S++A+KQYPEH DVAI+RPIFVTGL RTGTTA+HRLL ADP +QGL +WLAE
Sbjct 64 NALVSRLVSEAAFKQYPEHADVAIERPIFVTGLPRTGTTAVHRLLAADPRNQGLELWLAE 123
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPRETW NP+++QLDAQFT+ H ENP YTGLHFM A E+EECWQLLRQSLHSVS
Sbjct 124 FPQPRPPRETWSENPVFQQLDAQFTKAHEENPDYTGLHFMTADEVEECWQLLRQSLHSVS 183
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LAHVP+Y+ WL+RQDWT SY RHRRNLQLIGLND EKRWVLKNPSHLFALDAL ATY
Sbjct 184 YETLAHVPTYSQWLARQDWTKSYQRHRRNLQLIGLNDREKRWVLKNPSHLFALDALFATY 243
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALVVQ HRP ETIMASMCSLAQHTTEGWS F G IGAD+M+TWSRGLE FNA RAK
Sbjct 244 PDALVVQCHRPAETIMASMCSLAQHTTEGWSNTFTGEVIGADSMETWSRGLELFNAERAK 303
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
+D AQFYD+DY LI DP+G V DIYR FG+ +D AR A+ H ES+ G RAPKH+YS
Sbjct 304 HDPAQFYDLDYFALINDPVGAVDDIYRAFGIEFTDAARAAVADTHEESKKGPRAPKHTYS 363
Query 367 LADYGLTVEMVKERFAGL 384
LADYGLT E V+ERF GL
Sbjct 364 LADYGLTDEQVRERFRGL 381
>gi|289763707|ref|ZP_06523085.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289711213|gb|EFD75229.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=319
Score = 602 bits (1551), Expect = 5e-170, Method: Compositional matrix adjust.
Identities = 297/318 (94%), Positives = 300/318 (95%), Gaps = 6/318 (1%)
Query 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM
Sbjct 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKM 60
Query 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL
Sbjct 61 NRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGL 120
Query 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ
Sbjct 121 HMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQ 180
Query 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD
Sbjct 181 SLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALD 240
Query 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGA--QIGADA-MDTWSRGL 297
ALMATYPDALVVQTHRPVETIMASMCSLAQHTTE + G + G D + W L
Sbjct 241 ALMATYPDALVVQTHRPVETIMASMCSLAQHTTERVVDEVCGRPDRCGRDGHLVAW---L 297
Query 298 ERFNAARAKYDSAQFYDV 315
ERFNAARAKYDSAQFYDV
Sbjct 298 ERFNAARAKYDSAQFYDV 315
>gi|118467577|ref|YP_890156.1| hypothetical protein MSMEG_5930 [Mycobacterium smegmatis str.
MC2 155]
gi|118168864|gb|ABK69760.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=375
Score = 599 bits (1545), Expect = 2e-169, Method: Compositional matrix adjust.
Identities = 284/375 (76%), Positives = 319/375 (86%), Gaps = 0/375 (0%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ TV++LHASATK GLDDFGTDDDNYREALGVLL++YQ +A LT LGSKM+RFFLR AL
Sbjct 1 MGTVEDLHASATKATGLDDFGTDDDNYREALGVLLESYQRDAHLTELGSKMSRFFLRNAL 60
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
VARLLS+++WK P++ DV I+RPIFVTGL RTGTT LHRLL ADPAHQGL MWLAE+PQ
Sbjct 61 VARLLSEASWKANPQYADVEIERPIFVTGLPRTGTTVLHRLLTADPAHQGLEMWLAEFPQ 120
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PRPPRETW NP+Y+QL A F+QHH ENP YTGLHFM A E+EECWQLLRQSLHSVSYE
Sbjct 121 PRPPRETWPDNPVYQQLAASFSQHHQENPDYTGLHFMTADEVEECWQLLRQSLHSVSYET 180
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
LAH+P+YA+WL++QDWT Y RHRRNLQLIGLNDAEKRWVLKNPSHLFALDAL ATYPDA
Sbjct 181 LAHLPTYANWLAQQDWTRPYQRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALFATYPDA 240
Query 250 LVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDS 309
LV+Q HRP ETIMASMCSL+ HTT GWS FVGAQIGADAMDTW+RGLE F A RAK+D
Sbjct 241 LVIQCHRPAETIMASMCSLSAHTTAGWSNTFVGAQIGADAMDTWARGLEAFTAERAKHDP 300
Query 310 AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLAD 369
AQF DVDY D +ADPL TV +YRHFG+ +D AR A+T V+ S+ G RAPKH+YSLAD
Sbjct 301 AQFLDVDYDDFVADPLATVESVYRHFGMPYTDAARAAVTEVYEASRRGPRAPKHTYSLAD 360
Query 370 YGLTVEMVKERFAGL 384
YGLT E VKERF GL
Sbjct 361 YGLTSEAVKERFTGL 375
>gi|145222123|ref|YP_001132801.1| hypothetical protein Mflv_1531 [Mycobacterium gilvum PYR-GCK]
gi|145214609|gb|ABP44013.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=381
Score = 597 bits (1539), Expect = 1e-168, Method: Compositional matrix adjust.
Identities = 281/378 (75%), Positives = 317/378 (84%), Gaps = 0/378 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASA K GLDDFG+DDDNYREAL VLL++YQ +A LT LGSKM RFF R
Sbjct 4 RTDVGTVEDLHASAVKACGLDDFGSDDDNYREALAVLLESYQRDADLTELGSKMQRFFAR 63
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
ALVARL+S++A+KQYPEH DVAI+RPIFVTGL RTGTTA+HRLL ADP +QGL +WLAE
Sbjct 64 NALVARLVSEAAFKQYPEHADVAIERPIFVTGLPRTGTTAVHRLLAADPRNQGLELWLAE 123
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPRETW NP+++QLDAQFT+ H ENP YTGLHFM A E+EECWQLLRQSLHSVS
Sbjct 124 FPQPRPPRETWSQNPVFQQLDAQFTKAHEENPDYTGLHFMTADEVEECWQLLRQSLHSVS 183
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LAHVP+Y+ WL++QDW Y RHRRNLQLIGLND EKRWVLKNPSHLFALDAL ATY
Sbjct 184 YETLAHVPTYSRWLAQQDWAKPYQRHRRNLQLIGLNDREKRWVLKNPSHLFALDALFATY 243
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALVVQ HRP ETIMASMCSLAQHTTEGWS F G IGAD+M+TWSRGLE FNA RAK
Sbjct 244 PDALVVQCHRPAETIMASMCSLAQHTTEGWSNTFTGEVIGADSMETWSRGLELFNAERAK 303
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
+D AQFYD+DY LI DP+G V DIYR FG+ +D AR A+ H +S+ G RAPKH+YS
Sbjct 304 HDPAQFYDLDYFALINDPVGAVDDIYRAFGIEFTDAARDAVVNTHEQSKKGPRAPKHTYS 363
Query 367 LADYGLTVEMVKERFAGL 384
LADYGLT E V+ERF GL
Sbjct 364 LADYGLTAEQVQERFRGL 381
>gi|169631255|ref|YP_001704904.1| hypothetical protein MAB_4177c [Mycobacterium abscessus ATCC
19977]
gi|169243222|emb|CAM64250.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=387
Score = 577 bits (1486), Expect = 2e-162, Method: Compositional matrix adjust.
Identities = 272/380 (72%), Positives = 316/380 (84%), Gaps = 1/380 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDD-DNYREALGVLLDAYQGEAGLTVLGSKMNRFF 64
+R V TVD+LH SAT+L+GLDDFG DNYREALGVLLD+YQGEAGLT LGSKM+R F
Sbjct 8 ERTSVGTVDDLHESATRLIGLDDFGDGSVDNYREALGVLLDSYQGEAGLTPLGSKMSRVF 67
Query 65 LRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWL 124
LRGAL ARLLS++A+K +P++ VAI RPIFVTGL RTGTTALHRLL ADP HQGL MWL
Sbjct 68 LRGALGARLLSEAAFKAHPDYAQVAIDRPIFVTGLPRTGTTALHRLLNADPMHQGLEMWL 127
Query 125 AEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHS 184
A++PQPRPPR+TW++NP+Y+QL+ QF++HH ENP + GLH+M+A E+EECWQLLRQS+HS
Sbjct 128 ADFPQPRPPRDTWDANPVYQQLEGQFSKHHVENPEFMGLHYMSASEVEECWQLLRQSVHS 187
Query 185 VSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMA 244
VSYE LAHVPSYA WLS QDWTP+Y R++ NLQLIGLND EKRWVLKNPSHLFALDALM
Sbjct 188 VSYECLAHVPSYARWLSGQDWTPAYRRYKANLQLIGLNDIEKRWVLKNPSHLFALDALME 247
Query 245 TYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAAR 304
YPDALV+QTHRP ETI+AS+CSL +H T GWS F GA +GAD +DTW+RGLE F +AR
Sbjct 248 VYPDALVIQTHRPAETIIASVCSLNEHATAGWSETFTGATLGADQLDTWARGLESFKSAR 307
Query 305 AKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHS 364
AKYD +QF DVDY D I+DP+GTV IYRHFGL LS A M ++ ESQ G RAPKH
Sbjct 308 AKYDESQFCDVDYFDFISDPIGTVESIYRHFGLELSASALVEMQKMNDESQKGPRAPKHV 367
Query 365 YSLADYGLTVEMVKERFAGL 384
YSLADYGL+ E V ERFAGL
Sbjct 368 YSLADYGLSKEAVMERFAGL 387
>gi|312139145|ref|YP_004006481.1| hypothetical protein REQ_17280 [Rhodococcus equi 103S]
gi|325673550|ref|ZP_08153241.1| hypothetical protein HMPREF0724_11023 [Rhodococcus equi ATCC
33707]
gi|311888484|emb|CBH47796.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325555571|gb|EGD25242.1| hypothetical protein HMPREF0724_11023 [Rhodococcus equi ATCC
33707]
Length=382
Score = 531 bits (1367), Expect = 1e-148, Method: Compositional matrix adjust.
Identities = 244/377 (65%), Positives = 300/377 (80%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R V TV++LHASAT++ GLDDFGTDD Y EALGVLLD+Y + LT GSK++R FL
Sbjct 3 ERTSVGTVEDLHASATRMTGLDDFGTDD--YTEALGVLLDSYARDEDLTPFGSKISRVFL 60
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQ+PEH DV I+RPIFVTGL RTGTTALHRLL DP HQGL MWL
Sbjct 61 RGALVARLLSEAAWKQFPEHADVPIERPIFVTGLPRTGTTALHRLLTVDPGHQGLEMWLT 120
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRPPR+TWESNP+++ ++ QF QHH ++P + G+H+M+A E+EECWQLLRQ+ SV
Sbjct 121 EMPQPRPPRDTWESNPVFQAIEQQFGQHHIDHPEFMGVHYMSAGEVEECWQLLRQTFKSV 180
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA++P+Y+ WL QDWT +Y RH++NLQLIGL D ++RWVLKNPSHLFALD LMA
Sbjct 181 SYECLANLPTYSTWLEGQDWTNAYARHKKNLQLIGLPDQDRRWVLKNPSHLFALDELMAV 240
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHRP TI+ S+CSLA+ TEGWS KF G IG +D W+RGLE F AAR
Sbjct 241 YPDALVIQTHRPPRTIVPSVCSLAEQATEGWSNKFRGEVIGRSQLDLWARGLEDFTAARG 300
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
KYD AQF DVDY+D + DPLGTV +Y HF + ++++AR+AM +HAES+SG+R P H Y
Sbjct 301 KYDPAQFVDVDYNDFVGDPLGTVEKVYSHFSIPMTEQARRAMEDMHAESRSGSRKPAHKY 360
Query 366 SLADYGLTVEMVKERFA 382
+L +YGLT E V ERF
Sbjct 361 TLEEYGLTAEEVDERFG 377
>gi|343925876|ref|ZP_08765391.1| hypothetical protein GOALK_050_01710 [Gordonia alkanivorans NBRC
16433]
gi|343764227|dbj|GAA12317.1| hypothetical protein GOALK_050_01710 [Gordonia alkanivorans NBRC
16433]
Length=380
Score = 530 bits (1364), Expect = 2e-148, Method: Compositional matrix adjust.
Identities = 249/376 (67%), Positives = 298/376 (80%), Gaps = 2/376 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV T+ +LHASAT+ GL+DFG D +Y E LG+LLD+Y+ EAGLT LGSKM RFFL+
Sbjct 6 RTDVGTIADLHASATRATGLEDFG--DADYLEPLGILLDSYRSEAGLTELGSKMFRFFLK 63
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
GALVARLLS+++WK PEH +V I RPIFVTGL RTGTTALHRLL ADPAHQGL MWLAE
Sbjct 64 GALVARLLSEASWKANPEHAEVEITRPIFVTGLPRTGTTALHRLLAADPAHQGLEMWLAE 123
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPR+TW NP+Y+Q+ A F QHH ENP + GLH+M A E+EECWQLLRQS+ S+S
Sbjct 124 FPQPRPPRDTWADNPVYQQIQAGFEQHHVENPEFMGLHYMDAGEVEECWQLLRQSVTSIS 183
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE+LAH+P+Y+ WL+ QDWTP+Y RHR+NLQLIGLND KRWVLKNPSHLFALDALMA Y
Sbjct 184 YESLAHIPTYSRWLAEQDWTPAYLRHRKNLQLIGLNDPGKRWVLKNPSHLFALDALMAAY 243
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALV+QTHR TI+ASMCSLA+H T GWST F G QIG D ++ WSRGL F+ AR K
Sbjct 244 PDALVIQTHRAPSTIIASMCSLAEHATPGWSTTFTGDQIGQDQLELWSRGLREFSRAREK 303
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
YD AQF D+D+ DL +DP+GTV +Y +S+ AR A+TT+ ES+SGAR P+H Y
Sbjct 304 YDPAQFLDIDFADLRSDPMGTVERVYAALDTPMSEAARAAVTTLDEESRSGARKPQHRYQ 363
Query 367 LADYGLTVEMVKERFA 382
LADYGL +V+ F
Sbjct 364 LADYGLDEAVVEAAFG 379
>gi|111018523|ref|YP_701495.1| hypothetical protein RHA1_ro01523 [Rhodococcus jostii RHA1]
gi|110818053|gb|ABG93337.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=385
Score = 526 bits (1355), Expect = 3e-147, Method: Compositional matrix adjust.
Identities = 246/378 (66%), Positives = 300/378 (80%), Gaps = 2/378 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R V TV++LHASAT+L GL DFG DD Y EALGVLLD+Y + LT LGSK++R FL
Sbjct 3 ERTTVGTVEDLHASATRLTGLTDFGVDD--YTEALGVLLDSYHVDEKLTPLGSKVSRVFL 60
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AWKQ PEH DV I+RPIFVTGL RTGTTALHRLL ADP+HQGL MWL
Sbjct 61 RGALVARLLSEAAWKQNPEHADVRIERPIFVTGLPRTGTTALHRLLTADPSHQGLEMWLT 120
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRPPRETWESNP++++++ F++HH E P + G+H+M+A E+EECWQLLRQ+ S+
Sbjct 121 EMPQPRPPRETWESNPVFQKIEEGFSRHHIERPEFMGVHYMSASEVEECWQLLRQTFKSI 180
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA +P+Y+ WL QDWT +Y RH +NLQLIGL D ++RWVLKNPSHLFALD L+A
Sbjct 181 SYECLASLPTYSRWLEGQDWTNAYQRHMKNLQLIGLPDRDRRWVLKNPSHLFALDELLAV 240
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDAL+VQTHRP TI+ S+CSLA+ TEGWS KF G+ IG ++ W+RGLE+F AARA
Sbjct 241 YPDALIVQTHRPPCTIVPSVCSLAEQATEGWSEKFRGSVIGESQLELWARGLEQFTAARA 300
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
++D AQF DVDYHD +ADPLGTV +Y HFGL LS A+ AM +HAES+SG R P H Y
Sbjct 301 RHDPAQFIDVDYHDFVADPLGTVEGVYTHFGLDLSSSAQSAMEAMHAESRSGDRRPSHKY 360
Query 366 SLADYGLTVEMVKERFAG 383
+L ++GLT E V ERFA
Sbjct 361 TLEEFGLTAEQVDERFAN 378
>gi|262200922|ref|YP_003272130.1| hypothetical protein Gbro_0925 [Gordonia bronchialis DSM 43247]
gi|262084269|gb|ACY20237.1| conserved hypothetical protein [Gordonia bronchialis DSM 43247]
Length=386
Score = 520 bits (1340), Expect = 1e-145, Method: Compositional matrix adjust.
Identities = 244/375 (66%), Positives = 294/375 (79%), Gaps = 2/375 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R V TV++LHASAT+ GLDDFG DD Y E L +LLD+Y+ EAGLT LGSKM RFFL+
Sbjct 13 RTSVGTVEDLHASATRATGLDDFG--DDAYLEPLAILLDSYKNEAGLTKLGSKMFRFFLK 70
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
GAL+ARLLS++AWK P DV I+RPIFVTGL RTGTTALHRLL ADPAHQGL MWLAE
Sbjct 71 GALIARLLSEAAWKANPGQTDVEIRRPIFVTGLPRTGTTALHRLLTADPAHQGLEMWLAE 130
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPR+ W NP+Y+Q+DA QHH ENP + G+H+M A E+EECWQLLRQS+ S+S
Sbjct 131 FPQPRPPRDAWADNPVYQQIDAGLAQHHVENPEFMGVHYMDAAEVEECWQLLRQSVMSIS 190
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE+LA++P+Y+ WLS QDWTP+Y RH+RNLQ+IGLND +KRWVLKNPSHLFALDALMA Y
Sbjct 191 YESLAYLPTYSRWLSEQDWTPAYLRHKRNLQMIGLNDPDKRWVLKNPSHLFALDALMAAY 250
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALV+QTHR TI+ASMCSLA+ T GWST FVG IG ++ WSRGL F++ARA+
Sbjct 251 PDALVIQTHRAPSTIIASMCSLAEQATPGWSTTFVGDTIGDTQLELWSRGLREFSSARAR 310
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
YD +QF DVD+ DL DP+GTV +Y G +SD+AR A+T + ES++GAR P+H Y
Sbjct 311 YDQSQFVDVDFADLRNDPMGTVERVYSALGEPMSDDARAAVTALDEESKTGARKPQHRYQ 370
Query 367 LADYGLTVEMVKERF 381
LADYGL V F
Sbjct 371 LADYGLDEARVVAAF 385
>gi|226360642|ref|YP_002778420.1| hypothetical protein ROP_12280 [Rhodococcus opacus B4]
gi|226239127|dbj|BAH49475.1| hypothetical protein [Rhodococcus opacus B4]
Length=385
Score = 516 bits (1328), Expect = 4e-144, Method: Compositional matrix adjust.
Identities = 240/377 (64%), Positives = 298/377 (80%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R V TV++LHASAT+L GL DFG DD Y EALGVLLD+Y + LT LGSK++R FL
Sbjct 3 ERTTVGTVEDLHASATRLTGLTDFGVDD--YTEALGVLLDSYHTDEQLTPLGSKVSRVFL 60
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS++AW+Q PEH DV I+RPIFVTGL RTGTTALHRLL ADP+HQGL MWL
Sbjct 61 RGALVARLLSEAAWQQNPEHADVRIERPIFVTGLPRTGTTALHRLLTADPSHQGLEMWLT 120
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRPPR+TWESNP++++++ F++HH E P + G+H+M+A E+EECWQLLRQ+ S+
Sbjct 121 EMPQPRPPRDTWESNPVFQKIEEGFSRHHIERPEFMGVHYMSASEVEECWQLLRQTFKSI 180
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA +P+Y+ WL QDWT +Y RH++NLQLIGL D ++RWVLKNPSHLFALD L+A
Sbjct 181 SYECLASLPTYSHWLEGQDWTNAYQRHKKNLQLIGLPDRDRRWVLKNPSHLFALDELLAV 240
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDAL+VQTHRP TI+ S+CSLA+ T+GWS KF G+ IG ++ W+RGLE+F AAR
Sbjct 241 YPDALIVQTHRPPRTIVPSVCSLAEQATDGWSEKFRGSVIGESQLELWARGLEQFTAART 300
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
++D AQF DVDY D +ADPLGTV +Y +FGL L AR AM +HAES+SG R P H Y
Sbjct 301 RHDPAQFIDVDYRDFVADPLGTVEGVYTYFGLDLGGPARAAMEAMHAESRSGDRRPSHKY 360
Query 366 SLADYGLTVEMVKERFA 382
+L ++GLT E V ERFA
Sbjct 361 TLEEFGLTAEQVDERFA 377
>gi|296141275|ref|YP_003648518.1| hypothetical protein Tpau_3601 [Tsukamurella paurometabola DSM
20162]
gi|296029409|gb|ADG80179.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=385
Score = 513 bits (1320), Expect = 2e-143, Method: Compositional matrix adjust.
Identities = 248/377 (66%), Positives = 293/377 (78%), Gaps = 4/377 (1%)
Query 9 DVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGA 68
DV TVD+LH SA + GL DFG D YR+AL VLLD+Y+ EA LT GSK++R FLRGA
Sbjct 9 DVGTVDDLHESAMRRTGLSDFGDSSDGYRDALQVLLDSYRDEARLTPEGSKISRVFLRGA 68
Query 69 LVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYP 128
L ARL+S++A+ +PEH +V I RPIFVTGL R+GTTALHRLL ADP +QGL MWLAE P
Sbjct 69 LSARLISEAAFTAHPEHAEVTIDRPIFVTGLPRSGTTALHRLLDADPGNQGLQMWLAEVP 128
Query 129 QPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYE 188
Q RPPRETW +P+YR LD Q+ QHH E+P + GLH+M+A E+EECWQLLRQSLHSVSYE
Sbjct 129 QARPPRETWAEDPVYRLLDEQYAQHHTEHPEFMGLHYMSASEVEECWQLLRQSLHSVSYE 188
Query 189 ALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPD 248
LAHVPSYA WL+RQDW P+Y RH+RNLQLIG ++ +KRWVLKNPSHLFALDAL YPD
Sbjct 189 CLAHVPSYARWLARQDWAPAYRRHKRNLQLIGSSEPDKRWVLKNPSHLFALDALFEVYPD 248
Query 249 ALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYD 308
ALVVQTHR ETI+AS+CSLAQH T GWST F IGAD +DTW+RGL F +RA+
Sbjct 249 ALVVQTHRAPETIIASVCSLAQHATAGWSTAFTAETIGADQLDTWARGLTAFEDSRARQT 308
Query 309 SA----QFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHS 364
+A QF DVDY DL+ DPLGTVA IY FGL L+D AR +M+ +H S++GAR P H+
Sbjct 309 AAGRGDQFVDVDYRDLVGDPLGTVAGIYDAFGLDLTDAARDSMSAMHDASRTGARRPNHT 368
Query 365 YSLADYGLTVEMVKERF 381
YSLADYGLT V+ RF
Sbjct 369 YSLADYGLTDAGVRARF 385
>gi|54024428|ref|YP_118670.1| hypothetical protein nfa24590 [Nocardia farcinica IFM 10152]
gi|54015936|dbj|BAD57306.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=393
Score = 513 bits (1320), Expect = 3e-143, Method: Compositional matrix adjust.
Identities = 240/376 (64%), Positives = 296/376 (79%), Gaps = 2/376 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV TV++LHASA+K+VGLDDFGTDD YRE LGVLLD+Y +A LT G+K+NR FLR
Sbjct 10 RDDVGTVEDLHASASKVVGLDDFGTDD--YREGLGVLLDSYHRDAELTPFGNKVNRAFLR 67
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
GAL+ARLLS++AW+++PEH +VA++RP+FVTGL R+GTTA+HRLL ADPAHQGL MWL E
Sbjct 68 GALIARLLSENAWQRHPEHAEVAVERPVFVTGLPRSGTTAVHRLLEADPAHQGLEMWLTE 127
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
PQPRPPRETW NP+Y++++A F +HH E+P + G+H ++A ++EECWQLLRQS SVS
Sbjct 128 MPQPRPPRETWAENPVYQRIEAAFAKHHVEHPEFMGVHHISADQVEECWQLLRQSAMSVS 187
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE LA++P+Y+ WL QDWTP+Y RH+RNLQLIGL DAEKRWVLKNPSHLFALDALMA Y
Sbjct 188 YECLAYLPTYSAWLREQDWTPAYRRHKRNLQLIGLPDAEKRWVLKNPSHLFALDALMAVY 247
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDALVVQ HR TI+AS+CSL + TEGWS KF G +G +D W+RG RF R +
Sbjct 248 PDALVVQMHRDPRTIIASVCSLNEKATEGWSEKFRGPVVGETQLDLWARGAHRFQEDRKR 307
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
YD AQF DV Y D +ADP+GT+ IY FG+T + EA AM +H ES SGA P H Y+
Sbjct 308 YDQAQFADVYYDDFVADPIGTIGGIYDRFGMTFTAEAEAAMRALHGESTSGAARPAHRYT 367
Query 367 LADYGLTVEMVKERFA 382
LA++GLT + V ERFA
Sbjct 368 LAEFGLTADQVDERFA 383
>gi|226307428|ref|YP_002767388.1| hypothetical protein RER_39410 [Rhodococcus erythropolis PR4]
gi|226186545|dbj|BAH34649.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=379
Score = 506 bits (1302), Expect = 3e-141, Method: Compositional matrix adjust.
Identities = 231/377 (62%), Positives = 293/377 (78%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R V TV++LHASAT++ GL DFG DD Y EAL VLL++Y + LT GSK++R FL
Sbjct 3 ERTHVGTVEDLHASATRMTGLTDFGVDD--YTEALSVLLESYDRDEDLTPFGSKISRVFL 60
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+SAWK++PEH DV I+RPIFVTGL RTGTTALHRLL DPAHQGL MWL
Sbjct 61 RGALVARLLSESAWKEHPEHADVKIERPIFVTGLPRTGTTALHRLLTVDPAHQGLEMWLT 120
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPR+TWESNP++ +++ F QHH E+P + G+H+M+A E+EECWQLLRQ+ SV
Sbjct 121 EFPQPRPPRDTWESNPVFAKIEETFGQHHVEHPEFMGVHYMSASEVEECWQLLRQTFKSV 180
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA++P+Y+ WL Q+W+ +Y RH++NLQLIGL D ++RWVLKNPSHLFALD LM
Sbjct 181 SYECLANLPTYSAWLKDQEWSNAYARHKKNLQLIGLPDQDRRWVLKNPSHLFALDELMEA 240
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHR TI+ S+CSLA TEGWS F IG ++ W+RGLE+F++ARA
Sbjct 241 YPDALVIQTHRSPTTIIPSVCSLAAQATEGWSNTFTDKVIGESQLELWARGLEQFDSARA 300
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
++ AQF DVDY D + DPLGTV +IY HF + L+ A+Q+M +H ES+SGAR P H Y
Sbjct 301 HHNPAQFIDVDYQDFVTDPLGTVENIYTHFDIPLTSTAQQSMEAMHEESRSGARKPSHKY 360
Query 366 SLADYGLTVEMVKERFA 382
+L ++GLT E V+ERF
Sbjct 361 TLEEFGLTKEQVEERFG 377
>gi|229490062|ref|ZP_04383915.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229323163|gb|EEN88931.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=379
Score = 504 bits (1299), Expect = 7e-141, Method: Compositional matrix adjust.
Identities = 231/377 (62%), Positives = 293/377 (78%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
+R V TV++LHASAT++ GL DFG DD Y EAL VLL++Y + LT GSK++R FL
Sbjct 3 ERTHVGTVEDLHASATRMTGLTDFGVDD--YTEALSVLLESYDRDEDLTPFGSKISRVFL 60
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
RGALVARLLS+SAWK++PEH DV I+RPIFVTGL RTGTTALHRLL DPAHQGL MWL
Sbjct 61 RGALVARLLSESAWKEHPEHADVKIERPIFVTGLPRTGTTALHRLLTVDPAHQGLEMWLT 120
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E+PQPRPPR+TWESNP++ +++ F QHH E+P + G+H+M+A E+EECWQLLRQ+ SV
Sbjct 121 EFPQPRPPRDTWESNPVFAKIEETFGQHHVEHPEFMGVHYMSASEVEECWQLLRQTFKSV 180
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA++P+Y+ WL Q+W+ +Y RH++NLQLIGL D ++RWVLKNPSHLFALD LM
Sbjct 181 SYECLANLPTYSAWLKDQEWSNAYARHKKNLQLIGLPDQDRRWVLKNPSHLFALDELMEA 240
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDALV+QTHR TI+ S+CSLA TEGWS F IG ++ W+RGLE+F++ARA
Sbjct 241 YPDALVIQTHRSPTTIIPSVCSLAAQATEGWSNTFTDKVIGESQLELWARGLEQFDSARA 300
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
++ AQF DVDY D + DPLGTV +IY HF + L+ A+Q+M +H ES+SGAR P H Y
Sbjct 301 HHNPAQFIDVDYQDFVTDPLGTVENIYTHFDIPLTAAAQQSMEAMHEESRSGARKPSHKY 360
Query 366 SLADYGLTVEMVKERFA 382
+L ++GLT E V+ERF
Sbjct 361 TLEEFGLTKEQVEERFG 377
>gi|326384571|ref|ZP_08206250.1| hypothetical protein SCNU_16603 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196705|gb|EGD53900.1| hypothetical protein SCNU_16603 [Gordonia neofelifaecis NRRL
B-59395]
Length=378
Score = 499 bits (1284), Expect = 4e-139, Method: Compositional matrix adjust.
Identities = 232/376 (62%), Positives = 285/376 (76%), Gaps = 2/376 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R + T DELH +A + VGLDDFG DD YRE L VLL +Y A L LGSKM R+FL+
Sbjct 5 RTSIGTADELHEAAIRTVGLDDFGGDD--YREGLEVLLSSYASSAELEPLGSKMFRYFLK 62
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
GALVARLLS++ WK P + DV ++RP+FVTGL RTGTTALHRLL ADPA+QGL MWL E
Sbjct 63 GALVARLLSEAGWKANPGYTDVPVERPVFVTGLPRTGTTALHRLLAADPANQGLEMWLTE 122
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
+PQPRPPR+ W SNP+++Q+DA +QHH ENP + GLH+M A E+EECWQLLRQSL S+S
Sbjct 123 FPQPRPPRDQWSSNPVFQQIDAGLSQHHIENPEFMGLHYMGAAEVEECWQLLRQSLMSIS 182
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
YE+LAH+P Y++WLS+QDWTP+Y RH+RNLQLIG ND +RWVLKNPSHLFALDA+M Y
Sbjct 183 YESLAHIPEYSEWLSQQDWTPAYARHKRNLQLIGSNDVGRRWVLKNPSHLFALDAIMEVY 242
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDA++VQTHR ETI+ SMCSLA+ T G+S F +IGA +D WSRGL F+ AR K
Sbjct 243 PDAIIVQTHRAPETIIGSMCSLAEQATAGYSRAFTNERIGATQLDLWSRGLRSFSQARRK 302
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYS 366
YD AQF DVD+ DL +DP GTVA +Y G + +AR AM + +S+SG R P+H Y+
Sbjct 303 YDPAQFVDVDFADLRSDPFGTVARVYDAIGTEYTGQARAAMVALDEDSKSGDRRPQHKYA 362
Query 367 LADYGLTVEMVKERFA 382
L DYGL+ + VK FA
Sbjct 363 LEDYGLSPDQVKAAFA 378
>gi|300784755|ref|YP_003765046.1| hypothetical protein AMED_2850 [Amycolatopsis mediterranei U32]
gi|299794269|gb|ADJ44644.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526179|gb|AEK41384.1| hypothetical protein RAM_14480 [Amycolatopsis mediterranei S699]
Length=389
Score = 498 bits (1281), Expect = 8e-139, Method: Compositional matrix adjust.
Identities = 234/378 (62%), Positives = 294/378 (78%), Gaps = 2/378 (0%)
Query 5 PDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFF 64
P R+DV TV++LHASA+KL GL DFG D+ Y E L VLL++Y+ + LT G+K++R
Sbjct 3 PGREDVGTVEDLHASASKLTGLGDFGADE--YVEGLRVLLESYEADEELTPYGNKVHRAM 60
Query 65 LRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWL 124
LRGALVARLLS+++WKQ P + DV ++RPIFVTGL RTGTTALHRLL DPAHQGL +WL
Sbjct 61 LRGALVARLLSEASWKQNPGYADVRLERPIFVTGLPRTGTTALHRLLAEDPAHQGLEVWL 120
Query 125 AEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHS 184
AE PQPRPPR +W NP+++ + A + +HH E+P + G+H M+A ++EECWQLLRQS+ S
Sbjct 121 AEVPQPRPPRSSWADNPIFQGIQASYDRHHVEHPEFMGVHHMSADQVEECWQLLRQSMRS 180
Query 185 VSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMA 244
VS+E LAH+P Y+ WL++QDWT +Y RH+RNLQLIGL DA +RWVLKNPSHLFALDALMA
Sbjct 181 VSFECLAHLPRYSRWLAKQDWTDAYARHKRNLQLIGLPDAGRRWVLKNPSHLFALDALMA 240
Query 245 TYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAAR 304
YPDALV+QTHR TIMASMCSLA+ +GWS+KF G IG +D W+RG + F AR
Sbjct 241 NYPDALVIQTHRAPSTIMASMCSLAEKAADGWSSKFRGEVIGRGQLDLWARGADEFGWAR 300
Query 305 AKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHS 364
A+++ AQF+DV Y D +ADP+GTV+ +Y HFGL + EAR AMT VH S++G R P H
Sbjct 301 ARHNPAQFFDVRYEDFVADPIGTVSTVYDHFGLEFTPEARAAMTAVHEASRTGERKPVHR 360
Query 365 YSLADYGLTVEMVKERFA 382
YSLAD+GLT E V ERFA
Sbjct 361 YSLADFGLTSEEVDERFA 378
>gi|302527707|ref|ZP_07280049.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302436602|gb|EFL08418.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=383
Score = 478 bits (1229), Expect = 1e-132, Method: Compositional matrix adjust.
Identities = 240/379 (64%), Positives = 290/379 (77%), Gaps = 2/379 (0%)
Query 4 RPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRF 63
RP R V TV++LHASA KL GLDDFG D+ + E L VLLD+Y EA LT G+K++R
Sbjct 2 RPGRDSVGTVEDLHASAAKLTGLDDFGGDE--HLEGLRVLLDSYTHEADLTPYGNKVHRA 59
Query 64 FLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMW 123
FLRGALVARLLS+++WKQ+P++ DV I+RPIFVTGL RTGTTALHRLL DPAHQGL +W
Sbjct 60 FLRGALVARLLSEASWKQHPQYADVPIERPIFVTGLPRTGTTALHRLLTEDPAHQGLEVW 119
Query 124 LAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLH 183
L E PQPRPPRETW NP+++ + A + QHH E+P + GLH M+A ++EECWQLLRQS+
Sbjct 120 LTEMPQPRPPRETWPENPVFQAIQAGYEQHHVEHPEFMGLHHMSADQVEECWQLLRQSMK 179
Query 184 SVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALM 243
SVSYE LAHVP Y+ WL QDWT +Y RHRRNLQLIGL DA +RWVLKNPSHLFALDAL+
Sbjct 180 SVSYECLAHVPGYSRWLDGQDWTDAYRRHRRNLQLIGLPDAGRRWVLKNPSHLFALDALL 239
Query 244 ATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAA 303
YPDALVVQTHR TI+AS+CSL + +EGWS F G +G +D W+RG ERF AA
Sbjct 240 EVYPDALVVQTHRAPSTIIASVCSLTEQASEGWSDTFRGEVVGRSQLDLWARGAERFAAA 299
Query 304 RAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKH 363
RA+++ AQF DV Y D +ADP+GTV +YRHFGL L+ AR AMT +H S++G P+H
Sbjct 300 RARHNPAQFCDVRYEDFVADPIGTVEGVYRHFGLGLTPRARDAMTVLHERSRTGDAKPRH 359
Query 364 SYSLADYGLTVEMVKERFA 382
Y LAD+GLT E V ERF
Sbjct 360 RYDLADFGLTAEEVDERFG 378
>gi|159038405|ref|YP_001537658.1| hypothetical protein Sare_2832 [Salinispora arenicola CNS-205]
gi|157917240|gb|ABV98667.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=374
Score = 461 bits (1185), Expect = 1e-127, Method: Compositional matrix adjust.
Identities = 227/379 (60%), Positives = 275/379 (73%), Gaps = 6/379 (1%)
Query 4 RPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRF 63
R R DV TVD+LHASAT+L GLDDFG DD+YRE +G LL AY+ EA LT GSK++R
Sbjct 2 RSTRTDVGTVDDLHASATRLTGLDDFG--DDDYREGMGELLSAYRNEAALTPTGSKVSRA 59
Query 64 FLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMW 123
LR ALV+RLLS++AW+++PE+V+V + RP+FVTGL RTGTTALHRLL ADPAHQGL +W
Sbjct 60 LLRAALVSRLLSEAAWRRFPEYVEVPVPRPVFVTGLPRTGTTALHRLLTADPAHQGLELW 119
Query 124 LAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLH 183
L E PQPRPPR TWESNP+Y L A + QHHA NP + H AA ++EECW+LLRQS+
Sbjct 120 LTEAPQPRPPRSTWESNPVYATLQAGYAQHHATNPSFVEAHHTAADQVEECWRLLRQSMM 179
Query 184 SVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALM 243
SVS+E LAHVPSY+ WLS QDWT +Y RHRRNLQLIGL+D ++RWVLKNPSHLFALDAL+
Sbjct 180 SVSFECLAHVPSYSRWLSAQDWTGAYRRHRRNLQLIGLHDQDRRWVLKNPSHLFALDALL 239
Query 244 ATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAA 303
A YPDA+V+QTHR + ++AS+CSL GWS F G +GA WSRGL F A
Sbjct 240 AVYPDAVVIQTHRAPQDVIASVCSLNAQACAGWSELFHGEVLGAAQSRLWSRGLRTFMAD 299
Query 304 RAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKH 363
R ++D A+F DVDY D +ADP+ V IY G L+ AR AMT + + Q P H
Sbjct 300 RERHDPARFVDVDYDDFVADPIRVVEMIYERLGTRLTTVARSAMTAWYRQRQR----PAH 355
Query 364 SYSLADYGLTVEMVKERFA 382
Y LAD+GLT V FA
Sbjct 356 HYRLADFGLTAAEVDAAFA 374
>gi|269126972|ref|YP_003300342.1| hypothetical protein Tcur_2758 [Thermomonospora curvata DSM 43183]
gi|268311930|gb|ACY98304.1| conserved hypothetical protein [Thermomonospora curvata DSM 43183]
Length=382
Score = 457 bits (1176), Expect = 1e-126, Method: Compositional matrix adjust.
Identities = 230/376 (62%), Positives = 277/376 (74%), Gaps = 2/376 (0%)
Query 8 KDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRG 67
+ + T +ELH +A K+ GL DFG +D + + L VLLD+Y EA LT G K R LR
Sbjct 4 EGIGTAEELHEAACKITGLSDFGGED--HLDGLRVLLDSYAEEAALTPRGVKAARAMLRA 61
Query 68 ALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEY 127
AL ARL +Q AWK++PEH V I+RPIFVTGL RTGTTALHRLL ADPAHQGL +WLAE
Sbjct 62 ALAARLFAQDAWKRHPEHAKVRIERPIFVTGLPRTGTTALHRLLTADPAHQGLEVWLAEV 121
Query 128 PQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSY 187
PQPRPPRETW NP+++ + A + +HH +P + G+H+M+A +EECWQLLRQS+ SVS+
Sbjct 122 PQPRPPRETWADNPVFQAIQAGYQRHHVAHPEFMGVHYMSADMVEECWQLLRQSMRSVSF 181
Query 188 EALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYP 247
E LAH+PSY+ WL+ QDW P+Y RHRRNLQLIGLND +RWVLKNPSHLFALDAL+ YP
Sbjct 182 ECLAHLPSYSAWLAEQDWRPAYRRHRRNLQLIGLNDPGRRWVLKNPSHLFALDALLEVYP 241
Query 248 DALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKY 307
DAL+VQTHR T MASMCSLA H T+GWS F G IG D ++ WSRGL F A RAK+
Sbjct 242 DALIVQTHRDPRTAMASMCSLAAHATDGWSRVFTGKVIGRDQLELWSRGLALFRAERAKH 301
Query 308 DSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSL 367
D A+F+DV Y D DPLGTV IY HFGL + +AR AM + ES++GA P H Y L
Sbjct 302 DPARFFDVRYEDFTGDPLGTVEAIYAHFGLPFTGQARAAMARLLEESRTGAARPAHRYDL 361
Query 368 ADYGLTVEMVKERFAG 383
AD+GLT E V ERFAG
Sbjct 362 ADFGLTGEEVTERFAG 377
>gi|319948611|ref|ZP_08022735.1| hypothetical protein ES5_04493 [Dietzia cinnamea P4]
gi|319437692|gb|EFV92688.1| hypothetical protein ES5_04493 [Dietzia cinnamea P4]
Length=393
Score = 456 bits (1173), Expect = 3e-126, Method: Compositional matrix adjust.
Identities = 228/383 (60%), Positives = 278/383 (73%), Gaps = 5/383 (1%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
DR V TVD+LHASA++ VGL+DFG +D +REALGVLLD+ +AGLT GSK R L
Sbjct 5 DRVHVGTVDDLHASASRTVGLEDFGDGEDRHREALGVLLDSLHIDAGLTPAGSKYWRSVL 64
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
+GAL ARLLS SA P +VAI+RP+ VTGL RTGTTALHRLLGADPA+QGL +WL
Sbjct 65 KGALTARLLSTSALASDPARAEVAIERPVVVTGLPRTGTTALHRLLGADPANQGLELWLT 124
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRPPRETWE +P Y L ++ AENP Y G+H+++A +LEECWQLLRQSL SV
Sbjct 125 EVPQPRPPRETWEDDPAYVGLRDLYSGFMAENPDYGGVHYISADDLEECWQLLRQSLTSV 184
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
SYE LA + Y+ WL+ DW P+Y RH+RNLQLIG ND ++RWVLKNPSHLFALDAL+
Sbjct 185 SYECLARLDGYSQWLAGVDWVPAYRRHKRNLQLIGANDPDRRWVLKNPSHLFALDALLEV 244
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDA+VVQTHR MASMCSLA T WST+F IG+ +D W+RG+E F+AARA
Sbjct 245 YPDAVVVQTHRDPRKSMASMCSLAHRTAADWSTRFTPEYIGSSQLDLWARGVETFDAARA 304
Query 306 KYDS-----AQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARA 360
++++ A F DVD+H+L+ DP G VA +Y G LSDE R A+ + S SG RA
Sbjct 305 RHEADPASGATFVDVDHHELLDDPAGVVARVYAAAGTELSDEVRAAVVAENERSLSGDRA 364
Query 361 PKHSYSLADYGLTVEMVKERFAG 383
P H Y+LADYGL+ E + ERFAG
Sbjct 365 PAHRYTLADYGLSEERIAERFAG 387
>gi|326382882|ref|ZP_08204572.1| hypothetical protein SCNU_08083 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198472|gb|EGD55656.1| hypothetical protein SCNU_08083 [Gordonia neofelifaecis NRRL
B-59395]
Length=380
Score = 448 bits (1153), Expect = 7e-124, Method: Compositional matrix adjust.
Identities = 208/373 (56%), Positives = 268/373 (72%), Gaps = 2/373 (0%)
Query 12 TVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGALVA 71
++DE+H +A+ GL DFG D Y E L VL+D+Y EAGLT LG R + G L+A
Sbjct 6 SIDEVHEAASARTGLSDFGETD--YLEGLRVLIDSYAREAGLTGLGVASTREIVIGGLIA 63
Query 72 RLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQPR 131
RL S++A + +P+H+DV I RPIF+TGL RTGTTALHRLL DP HQG+ MWLAE PQPR
Sbjct 64 RLKSEAALRDHPQHLDVPIDRPIFLTGLPRTGTTALHRLLSVDPGHQGMEMWLAERPQPR 123
Query 132 PPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEALA 191
PPR+ W +NP YR++D F NP G+H+M A +EECW++L+QS+ S++YE
Sbjct 124 PPRDQWAANPDYREIDDAFAAQREANPDLMGMHYMDADVVEECWRVLQQSMRSIAYECQC 183
Query 192 HVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDALV 251
HVPSY++WL +DWTP+Y RHRRNLQLIG ND ++RWVLKNPSH+FALD +M+ YPDALV
Sbjct 184 HVPSYSEWLRTEDWTPAYRRHRRNLQLIGANDQDRRWVLKNPSHMFALDEIMSVYPDALV 243
Query 252 VQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDSAQ 311
+ THR +T++ S+ SL + + GWS F Q+GA +D W+RGLE+FN ARA Y S Q
Sbjct 244 IVTHRDPKTVIGSISSLNRQSAIGWSESFSAEQLGAAQLDLWARGLEQFNEARASYSSDQ 303
Query 312 FYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLADYG 371
F DVDY D + DP+GT A +Y HFGL LSDEAR AM S+SG RAP H+Y LA++G
Sbjct 304 FLDVDYRDFVGDPIGTAAGVYAHFGLDLSDEARSAMEAEVVASRSGDRAPSHTYDLAEFG 363
Query 372 LTVEMVKERFAGL 384
LT + V +RFA +
Sbjct 364 LTEQQVDDRFAEI 376
>gi|145595160|ref|YP_001159457.1| hypothetical protein Strop_2635 [Salinispora tropica CNB-440]
gi|145304497|gb|ABP55079.1| hypothetical protein Strop_2635 [Salinispora tropica CNB-440]
Length=334
Score = 421 bits (1083), Expect = 7e-116, Method: Compositional matrix adjust.
Identities = 202/330 (62%), Positives = 247/330 (75%), Gaps = 2/330 (0%)
Query 7 RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLR 66
R DV T++ELHASAT+L GLDDFG DD+YRE + LL AY+ EA LT GSK++R LR
Sbjct 5 RTDVGTIEELHASATRLTGLDDFG--DDDYREGMSELLAAYRNEAALTPTGSKVSRALLR 62
Query 67 GALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAE 126
ALV+RLLS++AW+Q+PE+ +V + RPIFVTGL RTGTTALHRLL ADP HQGL +WL E
Sbjct 63 AALVSRLLSEAAWRQFPEYAEVPVARPIFVTGLPRTGTTALHRLLTADPVHQGLELWLTE 122
Query 127 YPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVS 186
PQPRPPR TWESN +Y L A + Q+H NP G H+ AA ++EECW+LLRQS+ SVS
Sbjct 123 APQPRPPRATWESNLVYAGLRAGYEQYHETNPSLRGAHYTAADQVEECWRLLRQSMMSVS 182
Query 187 YEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY 246
+E LA++PSY+ WLS QDWT +Y RHRRNLQLIGL+D ++RWVLKNPSHLFALDAL+A Y
Sbjct 183 FECLAYLPSYSRWLSEQDWTAAYRRHRRNLQLIGLHDRDRRWVLKNPSHLFALDALLAVY 242
Query 247 PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAK 306
PDA+V+QTHR ++AS+CSL EGWS F GA +G + WSRGL RF A R +
Sbjct 243 PDAVVIQTHRAPREVVASVCSLNAQACEGWSELFRGAVLGGEQAKLWSRGLRRFVADRER 302
Query 307 YDSAQFYDVDYHDLIADPLGTVADIYRHFG 336
+D A F DV Y D +ADP+ V IY G
Sbjct 303 HDPAHFIDVYYDDFVADPIRVVEVIYDRLG 332
>gi|326331627|ref|ZP_08197915.1| hypothetical protein NBCG_03066 [Nocardioidaceae bacterium Broad-1]
gi|325950426|gb|EGD42478.1| hypothetical protein NBCG_03066 [Nocardioidaceae bacterium Broad-1]
Length=389
Score = 421 bits (1082), Expect = 1e-115, Method: Compositional matrix adjust.
Identities = 207/377 (55%), Positives = 266/377 (71%), Gaps = 3/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLL-DAYQGEAGLTVLGSKMNRFF 64
+R DV T +++ A+AT+ GL DFG D + E L +L+ D EAGLT +G+ +R
Sbjct 13 ERADVGTYEDICAAATRTTGLSDFGGTD--HEEGLRLLVEDLASPEAGLTPVGNYFHRAQ 70
Query 65 LRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWL 124
++ ALV RL++Q+ ++P+H DV I+RPIFVTGLVRTGTTALHRLL ADPAHQGL WL
Sbjct 71 VKSALVGRLMTQARLAEFPQHQDVRIERPIFVTGLVRTGTTALHRLLAADPAHQGLETWL 130
Query 125 AEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHS 184
E+PQPRPPRETWE +P++ L + QHH NP + G+H+M A +EECW++LRQS S
Sbjct 131 TEFPQPRPPRETWEDDPVFDALQNAYRQHHVTNPEFMGIHYMDATSVEECWRVLRQSGKS 190
Query 185 VSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMA 244
+S+E+LA+VP Y+ WL++Q W +Y HRR+LQLIGLND +KRWVLKNPSHL ALDALM
Sbjct 191 ISFESLANVPRYSAWLAKQHWRDAYELHRRSLQLIGLNDTDKRWVLKNPSHLVALDALME 250
Query 245 TYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAAR 304
YPDALVV THR +AS CSL+ T G ST FVG IGA ++ SR F AR
Sbjct 251 VYPDALVVVTHRDPVVSVASGCSLSAEATAGMSTTFVGETIGATQLEMLSRSWRSFGEAR 310
Query 305 AKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHS 364
+YD AQF DVDY + DP+GTV IY HF + SD AR ++ + AES+SG+ P+H
Sbjct 311 RRYDQAQFLDVDYRGFVQDPVGTVEGIYSHFDIPWSDAARAEVSRIDAESRSGSARPRHD 370
Query 365 YSLADYGLTVEMVKERF 381
YSLADYGLT + V++ F
Sbjct 371 YSLADYGLTEDEVRQAF 387
>gi|119718592|ref|YP_925557.1| hypothetical protein Noca_4373 [Nocardioides sp. JS614]
gi|119539253|gb|ABL83870.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=386
Score = 405 bits (1040), Expect = 7e-111, Method: Compositional matrix adjust.
Identities = 202/382 (53%), Positives = 257/382 (68%), Gaps = 5/382 (1%)
Query 1 MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQG-EAGLTVLGSK 59
MTR +R DV + +++ A+A + GL DFG + E L VL+D EAGLT G+
Sbjct 6 MTR--ERVDVGSYEDIAAAAMRTTGLSDFGAG--LHEEGLRVLVDDLASPEAGLTPRGNY 61
Query 60 MNRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQG 119
R ++ ALV LL+Q+ + +PEH DV I+RP+FV GL RTGTTALHRLL ADP QG
Sbjct 62 FQRSEVKSALVGVLLTQAQFATHPEHRDVPIERPVFVLGLPRTGTTALHRLLHADPMAQG 121
Query 120 LHMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLR 179
L MWL +YPQPRPPRETWE++P++ + F+ HH E+P + G+H+M A +EECW+LLR
Sbjct 122 LEMWLTQYPQPRPPRETWEADPIFTAMQQAFSAHHVESPEFMGIHYMDATTVEECWRLLR 181
Query 180 QSLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFAL 239
Q+ S SYE+LA+VP Y+ WL RQDWT +Y RH+ NLQL+GLND EKRWVLKNPSHL AL
Sbjct 182 QTGKSSSYESLANVPRYSAWLRRQDWTDAYARHKENLQLVGLNDPEKRWVLKNPSHLTAL 241
Query 240 DALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLER 299
DALM YPDAL+V THR +AS CSL+ TT G ST +VG IG +D WSR
Sbjct 242 DALMTVYPDALIVYTHRDPVVCIASSCSLSAETTAGHSTTYVGRTIGETQLDLWSRAFHA 301
Query 300 FNAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGAR 359
F+ AR +YD AQF DV + DL+ADPLG IY FGL + A+ A+ + ES+ G
Sbjct 302 FHDARGRYDQAQFADVAFRDLVADPLGVTRGIYEQFGLDWTPAAQAAIEEIDQESKQGKA 361
Query 360 APKHSYSLADYGLTVEMVKERF 381
P H+Y+L DYGL V+ F
Sbjct 362 KPSHTYTLEDYGLAEAEVRTAF 383
>gi|325675119|ref|ZP_08154805.1| sulfotransferase [Rhodococcus equi ATCC 33707]
gi|325554080|gb|EGD23756.1| sulfotransferase [Rhodococcus equi ATCC 33707]
Length=380
Score = 389 bits (998), Expect = 5e-106, Method: Compositional matrix adjust.
Identities = 185/377 (50%), Positives = 253/377 (68%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
D + ++++LHA A + GLDDFG D+ + E L VLLD++ EA LT G + R +
Sbjct 4 DYDGIGSIEDLHAQACEETGLDDFGGDE--HLEGLRVLLDSFANEADLTPQGRVVARKMI 61
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
AL RL+S++A+ + P HVDVAI+RPIF+ GL RTGTTALHRLL ADPA+QG+ MWLA
Sbjct 62 VSALRGRLISEAAFARNPGHVDVAIERPIFMCGLTRTGTTALHRLLSADPANQGVEMWLA 121
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRP RETW NP + + DA + A +HFM A E+EECW+LL+Q++ S
Sbjct 122 EAPQPRPARETWSENPDFLRCDAFYRARQANEADLMKVHFMGAEEVEECWRLLQQTMLST 181
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
S++ +A+VPSY +WL++QDWT +Y R+++NLQLIG+ND ++RWVLK+PSH+FA+D ++
Sbjct 182 SFDTIAYVPSYTEWLAKQDWTDTYARYKKNLQLIGMNDRDRRWVLKSPSHVFAIDDILKV 241
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
+PDAL V+T R T MAS SLA+ G S F IG +D W+RG F ARA
Sbjct 242 FPDALFVRTFRDPHTSMASTFSLAEQGGHGMSKAFDRKTIGRTQLDLWARGNANFQEARA 301
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+++ QF D+DY D +ADP+GT +Y F + SDEAR+A+ H S + R P H Y
Sbjct 302 RHNPEQFIDIDYRDFVADPIGTAEKVYTQFAMPFSDEARRAIADAHEASLADHRRPSHKY 361
Query 366 SLADYGLTVEMVKERFA 382
SL D+G+T V +FA
Sbjct 362 SLEDFGITAAEVDAKFA 378
>gi|312137729|ref|YP_004005065.1| hypothetical protein REQ_02280 [Rhodococcus equi 103S]
gi|311887068|emb|CBH46377.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=380
Score = 389 bits (998), Expect = 6e-106, Method: Compositional matrix adjust.
Identities = 185/377 (50%), Positives = 253/377 (68%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
D + ++++LHA A + GLDDFG D+ + E L VLLD++ EA LT G + R +
Sbjct 4 DYDGIGSIEDLHAQACEETGLDDFGGDE--HLEGLRVLLDSFANEADLTPQGRVVARKMI 61
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
AL RL+S++A+ + P HVDVAI+RPIF+ GL RTGTTALHRLL ADPA+QG+ MWLA
Sbjct 62 VSALRGRLISEAAFARNPGHVDVAIERPIFMCGLTRTGTTALHRLLSADPANQGVEMWLA 121
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRP RETW NP + + DA + A +HFM A E+EECW+LL+Q++ S
Sbjct 122 EAPQPRPARETWSENPDFLRCDAFYRARQANEADLMKVHFMGAEEVEECWRLLQQTMLST 181
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
S++ +A+VPSY +WL++QDWT +Y R+++NLQLIG+ND ++RWVLK+PSH+FA+D ++
Sbjct 182 SFDTIAYVPSYTEWLAKQDWTDTYARYKKNLQLIGMNDRDRRWVLKSPSHVFAIDDILKV 241
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
+PDAL V+T R T MAS SLA+ G S F IG +D W+RG F ARA
Sbjct 242 FPDALFVRTFRDPHTSMASTFSLAEQGGHGMSKAFDRKTIGRTQLDLWARGNANFQEARA 301
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+++ QF D+DY D +ADP+GT +Y F + SDEAR+A+ H S + R P H Y
Sbjct 302 RHNPEQFIDIDYRDFVADPIGTAEKVYTQFAMPFSDEARRAIADAHQASLADHRRPSHKY 361
Query 366 SLADYGLTVEMVKERFA 382
SL D+G+T V +FA
Sbjct 362 SLEDFGITAAEVDAKFA 378
>gi|229490662|ref|ZP_04384500.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229322482|gb|EEN88265.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=381
Score = 382 bits (981), Expect = 6e-104, Method: Compositional matrix adjust.
Identities = 187/377 (50%), Positives = 250/377 (67%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
D + ++D+LH +A + G ++FG++ +Y E L VLL+++Q EA LT G + R +
Sbjct 4 DYDGIGSIDDLHQAAREAAGYENFGSE--SYLEGLRVLLESFQNEADLTPHGKVIARKMI 61
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
GAL RL S++ + +YPEHVDV I+RPIFV GL RTG+TALHRLLGADPAHQG MWLA
Sbjct 62 VGALAGRLTSEAGFAKYPEHVDVPIERPIFVVGLTRTGSTALHRLLGADPAHQGAEMWLA 121
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRP R+ W N Y + DA + +HFM A E+EECW+LL+Q+L S
Sbjct 122 ETPQPRPERDKWSENEDYVRSDAFYRARQRNEADLMKVHFMGAEEVEECWRLLQQTLLST 181
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
++E +A+VPSY WL+ QDWT +Y RH++NLQLIGL+D ++RWVLK+PSH+FA+D +MA
Sbjct 182 AFETVAYVPSYTQWLAEQDWTETYARHKKNLQLIGLHDQDRRWVLKSPSHVFAIDEIMAV 241
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDAL V+T R T MAS SLA+ S F IG ++ W+RG FN AR+
Sbjct 242 YPDALFVRTFRDPLTSMASTFSLAEQGGHDMSKAFDRPAIGRTQLELWARGNANFNNARS 301
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+Y+ QF DVDY D IAD +GTV IY F L +D+AR A+ H S + R P H Y
Sbjct 302 RYNPDQFIDVDYKDFIADAVGTVEKIYAQFALPFTDDARAAVEASHQASLAEHRRPSHRY 361
Query 366 SLADYGLTVEMVKERFA 382
SL D+G++ V+ +FA
Sbjct 362 SLEDFGVSAADVEAKFA 378
>gi|226305146|ref|YP_002765104.1| hypothetical protein RER_16570 [Rhodococcus erythropolis PR4]
gi|226184261|dbj|BAH32365.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=381
Score = 381 bits (978), Expect = 1e-103, Method: Compositional matrix adjust.
Identities = 187/377 (50%), Positives = 248/377 (66%), Gaps = 2/377 (0%)
Query 6 DRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFL 65
D + ++D+LH +A + G +FG+D +Y E L VLL+++Q EA LT G + R +
Sbjct 4 DYDGIGSIDDLHQAAREAAGYKNFGSD--SYLEGLRVLLESFQNEADLTPHGKVIARKMI 61
Query 66 RGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLA 125
GAL RL S++ + +YPEHVDV I+RPIFV GL RTG+TALHRLLGADPAHQG MWLA
Sbjct 62 VGALAGRLTSEAGFAKYPEHVDVPIERPIFVVGLTRTGSTALHRLLGADPAHQGAEMWLA 121
Query 126 EYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSV 185
E PQPRP R+ W N Y + DA + +HFM A E+EECW+LL+Q++ S
Sbjct 122 ETPQPRPERDKWSENEDYVRSDAFYRARQRNEADLMKVHFMGAEEVEECWRLLQQTMLST 181
Query 186 SYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMAT 245
++E +A+VPSY WL+ QDWT +Y RH++NLQLIGL+D ++RWVLK+PSH+FA+D +MA
Sbjct 182 AFETVAYVPSYTRWLAEQDWTETYARHKKNLQLIGLHDQDRRWVLKSPSHVFAIDEIMAV 241
Query 246 YPDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARA 305
YPDAL V+T R T MAS SL + S F IG +D W+RG FN AR+
Sbjct 242 YPDALFVRTFRDPLTSMASTFSLVEQGGHDMSKAFDRPAIGRTQLDLWARGNANFNNARS 301
Query 306 KYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSY 365
+Y+ QF DVDY D IAD +GTV IY F L +DEAR A+ H S + R P H Y
Sbjct 302 RYNPDQFIDVDYKDFIADAVGTVEKIYAQFDLPFTDEARAAVEASHQASLAEHRRPSHRY 361
Query 366 SLADYGLTVEMVKERFA 382
SL ++G++ V+ +FA
Sbjct 362 SLEEFGVSAADVEAKFA 378
>gi|312196476|ref|YP_004016537.1| hypothetical protein FraEuI1c_2634 [Frankia sp. EuI1c]
gi|311227812|gb|ADP80667.1| hypothetical protein FraEuI1c_2634 [Frankia sp. EuI1c]
Length=375
Score = 373 bits (957), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 188/376 (50%), Positives = 245/376 (66%), Gaps = 6/376 (1%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ TV+ELHA+A ++ GL DFG D Y E L V+L AY+ EAGLT G+++ R L G L
Sbjct 4 IGTVEELHATAREITGLSDFGPSD--YLEGLKVVLAAYEREAGLTPDGARLIRDELCGIL 61
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
VARL S++ W+QYP++ ++RP+F+ GL RTGTT LHRLL ADPA+QGL +WL PQ
Sbjct 62 VARLFSEAGWRQYPDYAQNPVERPVFIIGLPRTGTTTLHRLLTADPANQGLELWLTYAPQ 121
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PRPPR TW NP++R ++ A P Y G+H A +EECW L RQS+ S +E
Sbjct 122 PRPPRSTWPDNPVFRAVEQGVDGFFARQPDYRGIHDRTADGVEECWLLTRQSMLSAYFEF 181
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
+VPSY+DWL+ QDWT +Y RHRRNLQLIGL D +RWVLK+ SHL LDAL+A YPDA
Sbjct 182 TGYVPSYSDWLAGQDWTEAYLRHRRNLQLIGLRDPGRRWVLKSSSHLPCLDALVAAYPDA 241
Query 250 LVVQTH-RPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYD 308
+++QTH RP ++ S CS+A G S+ F GA IG +D R L RF A RA++D
Sbjct 242 MIIQTHRRPAGAVLGSACSMASRLAGGTSSTFQGAAIGPVLLDLAERTLARFAADRARHD 301
Query 309 SAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLA 368
A+F+DV++ + ADPL VA IYRH G L D+ R AM V A+ S H Y LA
Sbjct 302 PARFHDVEFAEFTADPLAVVAGIYRHLGWELPDDVRPAMAAVLAQDAS---LRSHRYDLA 358
Query 369 DYGLTVEMVKERFAGL 384
D+G++ + R L
Sbjct 359 DFGVSAQEADARLGAL 374
>gi|86740720|ref|YP_481120.1| hypothetical protein Francci3_2017 [Frankia sp. CcI3]
gi|86567582|gb|ABD11391.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=375
Score = 365 bits (937), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 185/376 (50%), Positives = 248/376 (66%), Gaps = 6/376 (1%)
Query 10 VATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGAL 69
+ T++ELH +A+ L GL DFG D Y E L VLL +YQ EA LT G ++ + L G L
Sbjct 4 IQTIEELHTTASDLTGLTDFGPAD--YLEGLEVLLASYQEEASLTPHGVQLVQDELCGIL 61
Query 70 VARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQ 129
+ARL S++ W+++PEH V I+RP+F+ G+ RTGTT LHRLL AD A+QGL +WL PQ
Sbjct 62 MARLFSEAGWQRHPEHAQVPIERPVFIVGMPRTGTTTLHRLLTADSANQGLELWLGYAPQ 121
Query 130 PRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEA 189
PRP R TW +NP+++ + + ++PGY G+H A E+EECW L RQS+ S +E
Sbjct 122 PRPARSTWPTNPIFQMVQGGVDKFVEQHPGYLGIHNRKAGEVEECWLLTRQSMVSPYFEF 181
Query 190 LAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDA 249
+VP+Y+ WL+ +D T +Y RHRRNLQLIGL+D +RWVLK+ SH+ LDAL+ATYPDA
Sbjct 182 TGYVPTYSAWLAGRDSTEAYRRHRRNLQLIGLHDPGRRWVLKSSSHMPCLDALLATYPDA 241
Query 250 LVVQTH-RPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYD 308
+V+QTH RP T++ S CS+A G S+ F G IG + +R L RF RAK+D
Sbjct 242 MVIQTHRRPASTVLGSACSMASKLAAGMSSVFQGEVIGPTLLALATRTLARFATERAKHD 301
Query 309 SAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSLA 368
A+FYDV++ + ADPL VADIYRH G L++E R AM+ V AE AR H Y LA
Sbjct 302 QARFYDVEFDEFTADPLAVVADIYRHLGWDLANEVRPAMSAVLAED---ARLRSHRYDLA 358
Query 369 DYGLTVEMVKERFAGL 384
+G++ E V R L
Sbjct 359 QFGISAEEVDSRLGTL 374
>gi|148553234|ref|YP_001260816.1| hypothetical protein Swit_0307 [Sphingomonas wittichii RW1]
gi|148498424|gb|ABQ66678.1| hypothetical protein Swit_0307 [Sphingomonas wittichii RW1]
Length=379
Score = 268 bits (684), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/375 (41%), Positives = 207/375 (56%), Gaps = 10/375 (2%)
Query 9 DVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGA 68
D D LH A G DFG DD YRE LGVL+DA + + + +
Sbjct 5 DPTDADALHEEAIARTGRSDFG--DDGYREGLGVLIDAIRASPRHDRIAPRFGAMAV-NL 61
Query 69 LVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYP 128
LV RL SQ+ W +PE +D + P+ +TGL R+GTT LH L+ DP Q W+ E P
Sbjct 62 LVGRLASQAGWNAHPELLDDPVPAPLIITGLPRSGTTILHFLMSVDPQFQWTPRWVGEAP 121
Query 129 QPRPPRETWESNPLYRQLDAQFTQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYE 188
RPPRE WES+P YRQ+ + A NPG H M A +EC ++ QS + ++
Sbjct 122 LIRPPREEWESHPQYRQVHDRLEATFAANPGLRAAHDMGAALADECITVMSQSFMTNTFN 181
Query 189 ALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPD 248
+ +P Y W D PSY R++ NL+L+G ++ W+LKNPSH + +DA++ +PD
Sbjct 182 STLPLPDYRRWWYEADEEPSYRRYKDNLRLMGARARDRTWLLKNPSHSYGMDAMLRVFPD 241
Query 249 ALVVQTHR-PVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKY 307
A VV HR PVETI AS SL + F A+ G +D ++R +ER AR ++
Sbjct 242 ARVVVLHRNPVETI-ASGASLIWRNGQ----LFEKAETGPIRLDIFARAVERMREARERH 296
Query 308 DSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTVHAESQSGARAPKHSYSL 367
A DV Y DLIAD LGTV IYRHFGLTLS EA AM ++ G +H YS
Sbjct 297 PGAAVLDVHYRDLIADKLGTVRRIYRHFGLTLSAEAEAAMQAFIGDNPQGKHG-RHDYSS 355
Query 368 ADYGLTVEMVKERFA 382
++G+T + V++RFA
Sbjct 356 GEFGITDDQVRDRFA 370
Lambda K H
0.320 0.133 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 746616418650
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40