BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1343c
Length=126
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608483|ref|NP_215859.1| lipoprotein LprD [Mycobacterium tub... 256 7e-67
gi|340626359|ref|YP_004744811.1| putative lipoprotein LPRD [Myco... 254 3e-66
gi|289749898|ref|ZP_06509276.1| LOW QUALITY PROTEIN: lipoprotein... 254 3e-66
gi|31792539|ref|NP_855032.1| lipoprotein LprD [Mycobacterium bov... 254 3e-66
gi|289446938|ref|ZP_06436682.1| lipoprotein lprD [Mycobacterium ... 252 1e-65
gi|296170285|ref|ZP_06851876.1| lipoprotein LprD [Mycobacterium ... 217 4e-55
gi|342858000|ref|ZP_08714656.1| lipoprotein lprD [Mycobacterium ... 208 2e-52
gi|15827596|ref|NP_301859.1| lipoprotein [Mycobacterium leprae T... 205 2e-51
gi|41408516|ref|NP_961352.1| hypothetical protein MAP2418 [Mycob... 198 2e-49
gi|240170900|ref|ZP_04749559.1| lipoprotein lprD [Mycobacterium ... 197 6e-49
gi|183984009|ref|YP_001852300.1| lipoprotein LprD [Mycobacterium... 196 7e-49
gi|118619114|ref|YP_907446.1| lipoprotein LprD [Mycobacterium ul... 196 1e-48
gi|336460953|gb|EGO39834.1| glucitol operon activator [Mycobacte... 189 2e-46
gi|254774381|ref|ZP_05215897.1| hypothetical protein MaviaA2_068... 186 1e-45
gi|333989987|ref|YP_004522601.1| hypothetical protein JDM601_134... 171 3e-41
gi|254819173|ref|ZP_05224174.1| hypothetical protein MintA_04564... 169 1e-40
gi|118470214|ref|YP_889153.1| hypothetical protein MSMEG_4898 [M... 164 4e-39
gi|118465051|ref|YP_880795.1| hypothetical protein MAV_1560 [Myc... 159 1e-37
gi|120405237|ref|YP_955066.1| putative lipoprotein [Mycobacteriu... 157 3e-37
gi|145222950|ref|YP_001133628.1| hypothetical protein Mflv_2363 ... 139 1e-31
gi|108800804|ref|YP_641001.1| hypothetical protein Mmcs_3840 [My... 138 3e-31
gi|126436402|ref|YP_001072093.1| hypothetical protein Mjls_3826 ... 138 3e-31
gi|119869944|ref|YP_939896.1| hypothetical protein Mkms_3914 [My... 136 8e-31
gi|325673483|ref|ZP_08153174.1| lipoprotein LprD [Rhodococcus eq... 134 6e-30
gi|312139211|ref|YP_004006547.1| integral membrane protein [Rhod... 133 7e-30
gi|54023065|ref|YP_117307.1| hypothetical protein nfa10980 [Noca... 130 9e-29
gi|226307363|ref|YP_002767323.1| hypothetical protein RER_38760 ... 126 9e-28
gi|226360552|ref|YP_002778330.1| hypothetical protein ROP_11380 ... 125 2e-27
gi|229494028|ref|ZP_04387797.1| conserved lipoprotein LprD [Rhod... 124 5e-27
gi|333921278|ref|YP_004494859.1| hypothetical protein AS9A_3621 ... 120 6e-26
gi|169628579|ref|YP_001702228.1| hypothetical protein MAB_1488c ... 117 7e-25
gi|326385024|ref|ZP_08206696.1| glucitol operon activator-like p... 117 7e-25
gi|111018434|ref|YP_701406.1| hypothetical protein RHA1_ro01430 ... 110 9e-23
gi|343924695|ref|ZP_08764237.1| hypothetical protein GOALK_023_0... 100 7e-20
gi|296139025|ref|YP_003646268.1| hypothetical protein Tpau_1300 ... 96.7 1e-18
gi|296393947|ref|YP_003658831.1| hypothetical protein Srot_1537 ... 94.7 4e-18
gi|317508774|ref|ZP_07966424.1| acetolactate synthase [Segnilipa... 92.4 2e-17
gi|262201914|ref|YP_003273122.1| glucitol operon activator-like ... 91.7 3e-17
gi|334563356|ref|ZP_08516347.1| hypothetical protein CbovD2_0218... 89.0 2e-16
gi|25028954|ref|NP_739008.1| hypothetical protein CE2398 [Coryne... 87.0 9e-16
gi|259508019|ref|ZP_05750919.1| membrane protein [Corynebacteriu... 86.7 1e-15
gi|19553698|ref|NP_601700.1| hypothetical protein NCgl2412 [Cory... 85.9 2e-15
gi|306836766|ref|ZP_07469727.1| conserved hypothetical protein [... 84.3 5e-15
gi|227503001|ref|ZP_03933050.1| lipoprotein LprD [Corynebacteriu... 84.3 5e-15
gi|260579676|ref|ZP_05847539.1| conserved hypothetical protein [... 83.2 1e-14
gi|319949718|ref|ZP_08023747.1| putative lipoprotein [Dietzia ci... 83.2 1e-14
gi|337291434|ref|YP_004630455.1| hypothetical protein CULC22_018... 82.8 2e-14
gi|334697459|gb|AEG82256.1| hypothetical protein CULC809_01725 [... 82.8 2e-14
gi|300859071|ref|YP_003784054.1| hypothetical protein cpfrc_0165... 82.4 2e-14
gi|38234418|ref|NP_940185.1| hypothetical protein DIP1848 [Coryn... 82.4 2e-14
>gi|15608483|ref|NP_215859.1| lipoprotein LprD [Mycobacterium tuberculosis H37Rv]
gi|15840798|ref|NP_335835.1| putative lipoprotein [Mycobacterium tuberculosis CDC1551]
gi|148661132|ref|YP_001282655.1| putative lipoprotein LprD [Mycobacterium tuberculosis H37Ra]
64 more sequence titles
Length=126
Score = 256 bits (654), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 125/126 (99%), Positives = 126/126 (100%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY
Sbjct 1 MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK
Sbjct 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
Query 121 QNRTTA 126
QNRTTA
Sbjct 121 QNRTTA 126
>gi|340626359|ref|YP_004744811.1| putative lipoprotein LPRD [Mycobacterium canettii CIPT 140010059]
gi|340004549|emb|CCC43693.1| putative conserved lipoprotein LPRD [Mycobacterium canettii CIPT
140010059]
Length=126
Score = 254 bits (649), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 124/126 (99%), Positives = 125/126 (99%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY
Sbjct 1 MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
RNFVRYEETPPQPPTG AAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK
Sbjct 61 RNFVRYEETPPQPPTGSAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
Query 121 QNRTTA 126
QNRTTA
Sbjct 121 QNRTTA 126
>gi|289749898|ref|ZP_06509276.1| LOW QUALITY PROTEIN: lipoprotein lprD [Mycobacterium tuberculosis
T92]
gi|289690485|gb|EFD57914.1| LOW QUALITY PROTEIN: lipoprotein lprD [Mycobacterium tuberculosis
T92]
Length=127
Score = 254 bits (649), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 124/126 (99%), Positives = 125/126 (99%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY
Sbjct 1 MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREY AYLAELAKDDARK
Sbjct 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYKAYLAELAKDDARK 120
Query 121 QNRTTA 126
QNRTTA
Sbjct 121 QNRTTA 126
>gi|31792539|ref|NP_855032.1| lipoprotein LprD [Mycobacterium bovis AF2122/97]
gi|121637274|ref|YP_977497.1| putative lipoprotein lprD [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224989748|ref|YP_002644435.1| putative lipoprotein [Mycobacterium bovis BCG str. Tokyo 172]
8 more sequence titles
Length=126
Score = 254 bits (649), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 124/126 (99%), Positives = 125/126 (99%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY
Sbjct 1 MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
RNFVRYEETPPQPPTGGAAA IPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK
Sbjct 61 RNFVRYEETPPQPPTGGAAAAIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
Query 121 QNRTTA 126
QNRTTA
Sbjct 121 QNRTTA 126
>gi|289446938|ref|ZP_06436682.1| lipoprotein lprD [Mycobacterium tuberculosis CPHL_A]
gi|289419896|gb|EFD17097.1| lipoprotein lprD [Mycobacterium tuberculosis CPHL_A]
Length=127
Score = 252 bits (643), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 125/127 (99%), Positives = 126/127 (99%), Gaps = 1/127 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY
Sbjct 1 MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDAR- 119
RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDAR
Sbjct 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
Query 120 KQNRTTA 126
KQNRTTA
Sbjct 121 KQNRTTA 127
>gi|296170285|ref|ZP_06851876.1| lipoprotein LprD [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295895008|gb|EFG74727.1| lipoprotein LprD [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=153
Score = 217 bits (553), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 103/126 (82%), Positives = 108/126 (86%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
VSTTRRRRPALIALVI+A CGCLALGWWQWTRFQS SGTFQNLGYALQWPLFAWFCVYAY
Sbjct 28 VSTTRRRRPALIALVIVAACGCLALGWWQWTRFQSVSGTFQNLGYALQWPLFAWFCVYAY 87
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
R +VRYEE PP+P A EIPAGLLPERP PA QP DDP L +YNAYLAELA+ D K
Sbjct 88 RKYVRYEEMPPEPEQDAAQTEIPAGLLPERPAPAHQPSDDPALAQYNAYLAELARQDTEK 147
Query 121 QNRTTA 126
QNRTTA
Sbjct 148 QNRTTA 153
>gi|342858000|ref|ZP_08714656.1| lipoprotein lprD [Mycobacterium colombiense CECT 3035]
gi|342135333|gb|EGT88499.1| lipoprotein lprD [Mycobacterium colombiense CECT 3035]
Length=126
Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 101/126 (81%), Positives = 104/126 (83%), Gaps = 0/126 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STT RRRPALIALVI+A CGCLALGWWQWTRF S SG+FQNLGYALQWPLFA FCVYAY
Sbjct 1 MSTTHRRRPALIALVIVAACGCLALGWWQWTRFNSVSGSFQNLGYALQWPLFAGFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
R FVRYEE PPQP GA EIP GLLPERPK A Q PDDP LREYN YLAELA DA K
Sbjct 61 RKFVRYEEEPPQPRDTGAMTEIPDGLLPERPKRASQAPDDPALREYNTYLAELAAQDAEK 120
Query 121 QNRTTA 126
Q RTTA
Sbjct 121 QKRTTA 126
>gi|15827596|ref|NP_301859.1| lipoprotein [Mycobacterium leprae TN]
gi|221230073|ref|YP_002503489.1| putative lipoprotein [Mycobacterium leprae Br4923]
gi|1723047|sp|P54134.1|Y1177_MYCLE RecName: Full=Uncharacterized protein ML1177
gi|466928|gb|AAA50903.1| B1549_F3_106 [Mycobacterium leprae]
gi|4493784|emb|CAB39143.1| putative lipoprotein [Mycobacterium leprae]
gi|13093147|emb|CAC31558.1| possible lipoprotein [Mycobacterium leprae]
gi|219933180|emb|CAR71272.1| possible lipoprotein [Mycobacterium leprae Br4923]
Length=126
Score = 205 bits (522), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/125 (79%), Positives = 103/125 (83%), Gaps = 0/125 (0%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPAL+ALV IA CGCLALGWWQWTRFQS SGTFQNLGYALQWPLFA FC+Y Y
Sbjct 1 MSTTRRRRPALVALVTIAACGCLALGWWQWTRFQSASGTFQNLGYALQWPLFAGFCLYTY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
NFVRYEE+PPQP AEIP LLP RPKP QQPPDDP LR+YN YLAELAK DA
Sbjct 61 HNFVRYEESPPQPRHMNCIAEIPPELLPARPKPEQQPPDDPALRKYNTYLAELAKQDAEN 120
Query 121 QNRTT 125
NRTT
Sbjct 121 HNRTT 125
>gi|41408516|ref|NP_961352.1| hypothetical protein MAP2418 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396872|gb|AAS04735.1| hypothetical protein MAP_2418 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=130
Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 95/116 (82%), Positives = 99/116 (86%), Gaps = 0/116 (0%)
Query 10 ALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEET 69
ALIALVIIA CGCLALGWWQWTRFQS SG+FQNLGYALQWPLFAWFCVYAYR FVRYEE
Sbjct 14 ALIALVIIAACGCLALGWWQWTRFQSVSGSFQNLGYALQWPLFAWFCVYAYRKFVRYEEE 73
Query 70 PPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTT 125
PP+P A EIPAGLLPERP+P QPPDDP LREYN YLAELA DA +QNRTT
Sbjct 74 PPRPRDTAAVTEIPAGLLPERPRPTPQPPDDPALREYNRYLAELAAQDAAQQNRTT 129
>gi|240170900|ref|ZP_04749559.1| lipoprotein lprD [Mycobacterium kansasii ATCC 12478]
Length=124
Score = 197 bits (500), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 101/126 (81%), Positives = 108/126 (86%), Gaps = 2/126 (1%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STT RRRPALIAL I A CGCLALGWWQWTRFQS SGTFQNLGYALQWP+FAWFC+YAY
Sbjct 1 MSTTHRRRPALIALAISAACGCLALGWWQWTRFQSVSGTFQNLGYALQWPMFAWFCIYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
R FVRYEE PP+PP A EIPAGLLPERPKPA QP DDP L+EYNAYLAELAK DA +
Sbjct 61 RKFVRYEEVPPEPPN--AMTEIPAGLLPERPKPAPQPSDDPALQEYNAYLAELAKHDAER 118
Query 121 QNRTTA 126
QNR +A
Sbjct 119 QNRNSA 124
>gi|183984009|ref|YP_001852300.1| lipoprotein LprD [Mycobacterium marinum M]
gi|183177335|gb|ACC42445.1| conserved lipoprotein LprD [Mycobacterium marinum M]
Length=130
Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 94/116 (82%), Positives = 99/116 (86%), Gaps = 0/116 (0%)
Query 11 LIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETP 70
LIALVIIA GCLALGWWQWTRF+STSG+FQNLGYALQWP+FAWFCVYAYR FVRYEETP
Sbjct 15 LIALVIIAASGCLALGWWQWTRFESTSGSFQNLGYALQWPMFAWFCVYAYRKFVRYEETP 74
Query 71 PQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTTA 126
PQP A EIPAGLLP+R P Q PDDP LREYNAYLA LA+DDA KQNRTTA
Sbjct 75 PQPRDPAAVTEIPAGLLPQRTTPDQAAPDDPALREYNAYLAALAEDDAEKQNRTTA 130
>gi|118619114|ref|YP_907446.1| lipoprotein LprD [Mycobacterium ulcerans Agy99]
gi|118571224|gb|ABL05975.1| conserved lipoprotein LprD [Mycobacterium ulcerans Agy99]
Length=130
Score = 196 bits (498), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 94/116 (82%), Positives = 99/116 (86%), Gaps = 0/116 (0%)
Query 11 LIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETP 70
LIALVIIA GCLALGWWQWTRF+STSG+FQNLGYALQWP+FAWFCVYAYR FVRYEETP
Sbjct 15 LIALVIIAASGCLALGWWQWTRFESTSGSFQNLGYALQWPMFAWFCVYAYRKFVRYEETP 74
Query 71 PQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTTA 126
PQP AA EIPAGLLP+R P Q PDDP LREYNAYL LA+DDA KQNRTTA
Sbjct 75 PQPRDPAAATEIPAGLLPQRTTPDQAAPDDPALREYNAYLDALAEDDAEKQNRTTA 130
>gi|336460953|gb|EGO39834.1| glucitol operon activator [Mycobacterium avium subsp. paratuberculosis
S397]
Length=112
Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/111 (81%), Positives = 94/111 (85%), Gaps = 0/111 (0%)
Query 15 VIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQPP 74
+IIA CGCLALGWWQWTRFQS SG+FQNLGYALQWPLFAWFCVYAYR FVRYEE PP+P
Sbjct 1 MIIAACGCLALGWWQWTRFQSVSGSFQNLGYALQWPLFAWFCVYAYRKFVRYEEEPPRPR 60
Query 75 TGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTT 125
A EIPAGLLPERP+P QPPDDP LREYN YLAELA DA +QNRTT
Sbjct 61 DTAAVTEIPAGLLPERPRPTPQPPDDPALREYNRYLAELAAQDAAQQNRTT 111
>gi|254774381|ref|ZP_05215897.1| hypothetical protein MaviaA2_06890 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=117
Score = 186 bits (471), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/110 (80%), Positives = 92/110 (84%), Gaps = 0/110 (0%)
Query 15 VIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQPP 74
+IIA CGCLALGWWQWTRFQS SG+FQNLGYALQWPLFAWFCVYAYR FVRYEE PP+P
Sbjct 1 MIIAACGCLALGWWQWTRFQSVSGSFQNLGYALQWPLFAWFCVYAYRKFVRYEEEPPRPR 60
Query 75 TGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRT 124
A EIPAGLLPERP+P QPPDDP LREYN YLAELA DA +QNR
Sbjct 61 DTAAVTEIPAGLLPERPRPTPQPPDDPALREYNRYLAELAAQDAAQQNRN 110
>gi|333989987|ref|YP_004522601.1| hypothetical protein JDM601_1347 [Mycobacterium sp. JDM601]
gi|333485955|gb|AEF35347.1| lipoprotein LprD [Mycobacterium sp. JDM601]
Length=112
Score = 171 bits (433), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 81/113 (72%), Positives = 89/113 (79%), Gaps = 2/113 (1%)
Query 14 LVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQP 73
LVI+A C CLALGWWQWTRFQ +G FQNLGYALQWPLFAWFCVYAYR FVRYEE PP+P
Sbjct 2 LVIVAACACLALGWWQWTRFQEINGNFQNLGYALQWPLFAWFCVYAYRKFVRYEEAPPEP 61
Query 74 PTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTTA 126
EIPAGLLPERP PA P DD LR+YNAYLAEL ++D ++NR TA
Sbjct 62 HRPDTVTEIPAGLLPERPGPAATPTDDAALRDYNAYLAELTEND--RKNRNTA 112
>gi|254819173|ref|ZP_05224174.1| hypothetical protein MintA_04564 [Mycobacterium intracellulare
ATCC 13950]
Length=113
Score = 169 bits (428), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/113 (79%), Positives = 93/113 (83%), Gaps = 0/113 (0%)
Query 14 LVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQP 73
+VIIA CGCLALGWWQWTRFQS SG+FQNLGYALQWPLFAWFCVYAYR FVRYEE PP P
Sbjct 1 MVIIAACGCLALGWWQWTRFQSVSGSFQNLGYALQWPLFAWFCVYAYRKFVRYEEEPPPP 60
Query 74 PTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTTA 126
+ A EIPAGLLP+RP P Q PDDP LREYN YLAELA DA KQ RTTA
Sbjct 61 RSTEAVTEIPAGLLPDRPTPPPQAPDDPALREYNKYLAELAAQDAEKQKRTTA 113
>gi|118470214|ref|YP_889153.1| hypothetical protein MSMEG_4898 [Mycobacterium smegmatis str.
MC2 155]
gi|118171501|gb|ABK72397.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=124
Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 83/127 (66%), Positives = 94/127 (75%), Gaps = 7/127 (5%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALI LV +A GC+AL WWQWTRF+S SGTFQNLGYALQWP+FA FC YAY
Sbjct 1 MSTTRRRRPALILLVFVAAAGCMALAWWQWTRFESNSGTFQNLGYALQWPMFAGFCFYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQP----PDDPVLREYNAYLAELAKD 116
FVRYEE PP+ P EIP GLLPERP+ + DDP L EYNAYLAELAK
Sbjct 61 YKFVRYEEAPPERPK-DTVTEIPEGLLPERPRHVARERTADDDDPALAEYNAYLAELAKK 119
Query 117 DARKQNR 123
++Q+R
Sbjct 120 --KEQDR 124
>gi|118465051|ref|YP_880795.1| hypothetical protein MAV_1560 [Mycobacterium avium 104]
gi|118166338|gb|ABK67235.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=112
Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/111 (79%), Positives = 94/111 (85%), Gaps = 0/111 (0%)
Query 15 VIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQPP 74
+IIA CGCLALGWWQWTRFQS SG+FQNLGYALQWPLFAWFCVYAYR FVRYEE PP+P
Sbjct 1 MIIAACGCLALGWWQWTRFQSVSGSFQNLGYALQWPLFAWFCVYAYRKFVRYEEEPPRPR 60
Query 75 TGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTT 125
A EIPAGLLPERP+P +PPDDP LREYN YLAELA DA +Q+RTT
Sbjct 61 DTAAVTEIPAGLLPERPRPTPRPPDDPALREYNRYLAELAAQDAAQQDRTT 111
>gi|120405237|ref|YP_955066.1| putative lipoprotein [Mycobacterium vanbaalenii PYR-1]
gi|119958055|gb|ABM15060.1| lipoprotein, putative [Mycobacterium vanbaalenii PYR-1]
Length=141
Score = 157 bits (398), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/137 (65%), Positives = 99/137 (73%), Gaps = 13/137 (9%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALI LVI A GCLAL WWQWTR++STSG+FQNLGYA+QWPLFA FCVYAY
Sbjct 1 MSTTRRRRPALILLVIAAAAGCLALAWWQWTRYESTSGSFQNLGYAMQWPLFAGFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDD-------------PVLREYN 107
FVRYEE PP+P A EIPAGLLPERP+PA++ + LREYN
Sbjct 61 FKFVRYEEAPPEPHPHDAVTEIPAGLLPERPQPARRGNANRDADPDNDDPDDDTTLREYN 120
Query 108 AYLAELAKDDARKQNRT 124
AYLAELAK DA Q T
Sbjct 121 AYLAELAKADAEDQKMT 137
>gi|145222950|ref|YP_001133628.1| hypothetical protein Mflv_2363 [Mycobacterium gilvum PYR-GCK]
gi|315443412|ref|YP_004076291.1| glucitol operon activator [Mycobacterium sp. Spyr1]
gi|145215436|gb|ABP44840.1| lipoprotein, putative [Mycobacterium gilvum PYR-GCK]
gi|315261715|gb|ADT98456.1| glucitol operon activator [Mycobacterium sp. Spyr1]
Length=140
Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/130 (65%), Positives = 93/130 (72%), Gaps = 13/130 (10%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALI L A GCLAL WWQWTR++STSG+FQNLGYA+QWPLFA FC YAY
Sbjct 1 MSTTRRRRPALILLATAAALGCLALAWWQWTRYESTSGSFQNLGYAMQWPLFAGFCFYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQ----QPPD---------DPVLREYN 107
FVRYEE PP+ A EIPAGLLPERP+P +P D DP LREYN
Sbjct 61 FKFVRYEEAPPELHPQDAVTEIPAGLLPERPRPTTPRGVEPDDAVGTDPDDADPTLREYN 120
Query 108 AYLAELAKDD 117
AYLAELA+ D
Sbjct 121 AYLAELARTD 130
>gi|108800804|ref|YP_641001.1| hypothetical protein Mmcs_3840 [Mycobacterium sp. MCS]
gi|108771223|gb|ABG09945.1| lipoprotein, putative [Mycobacterium sp. MCS]
Length=140
Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/127 (68%), Positives = 96/127 (76%), Gaps = 4/127 (3%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
VSTTRRRRP LI LV+ A GCLAL WWQWTR++ST+G+FQNLGYALQWP+FA FCVYAY
Sbjct 15 VSTTRRRRPELILLVVAAAGGCLALAWWQWTRWESTTGSFQNLGYALQWPMFAGFCVYAY 74
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPER--PKPAQQPPDDPVLREYNAYLAELAKDDA 118
FVRYEE PP+ GA EIPA LLPER A P DDP LREYNAYLA+LA+ D
Sbjct 75 YKFVRYEEAPPEQNPRGALTEIPADLLPERGTTAAAPAPDDDPALREYNAYLADLAEAD- 133
Query 119 RKQNRTT 125
+ RTT
Sbjct 134 -NEGRTT 139
>gi|126436402|ref|YP_001072093.1| hypothetical protein Mjls_3826 [Mycobacterium sp. JLS]
gi|126236202|gb|ABN99602.1| lipoprotein, putative [Mycobacterium sp. JLS]
Length=126
Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/127 (68%), Positives = 97/127 (77%), Gaps = 4/127 (3%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRPALI LV+ A GCLAL WWQWTR++ST+G+FQNLGYALQWP+FA FCVYAY
Sbjct 1 MSTTRRRRPALILLVVAAAAGCLALAWWQWTRWESTTGSFQNLGYALQWPMFAGFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPER--PKPAQQPPDDPVLREYNAYLAELAKDDA 118
FVRYEE PP+ GA EIPA LLPER A P DDP LREYNAYLA+LA+ D
Sbjct 61 YKFVRYEEAPPEQNPRGALTEIPADLLPERGTTAAAPAPDDDPALREYNAYLADLAEAD- 119
Query 119 RKQNRTT 125
+ RTT
Sbjct 120 -NEGRTT 125
>gi|119869944|ref|YP_939896.1| hypothetical protein Mkms_3914 [Mycobacterium sp. KMS]
gi|119696033|gb|ABL93106.1| lipoprotein, putative [Mycobacterium sp. KMS]
Length=126
Score = 136 bits (343), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 85/127 (67%), Positives = 96/127 (76%), Gaps = 4/127 (3%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+STTRRRRP LI LV+ A GCLAL WWQWTR++ST+G+FQNLGYALQWP+FA FCVYAY
Sbjct 1 MSTTRRRRPELILLVVAAAGGCLALAWWQWTRWESTTGSFQNLGYALQWPMFAGFCVYAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPER--PKPAQQPPDDPVLREYNAYLAELAKDDA 118
FVRYEE PP+ GA EIPA LLPER A P DDP LREYNAYLA+LA+ D
Sbjct 61 YKFVRYEEAPPEQNPRGALTEIPADLLPERGTTAAAPAPDDDPALREYNAYLADLAEAD- 119
Query 119 RKQNRTT 125
+ RTT
Sbjct 120 -NEGRTT 125
>gi|325673483|ref|ZP_08153174.1| lipoprotein LprD [Rhodococcus equi ATCC 33707]
gi|325555504|gb|EGD25175.1| lipoprotein LprD [Rhodococcus equi ATCC 33707]
Length=135
Score = 134 bits (336), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/133 (57%), Positives = 87/133 (66%), Gaps = 14/133 (10%)
Query 3 TTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
T R+ RPALI LV++A+ GCLALGWWQW RF+S GT QNLGYA QWPLFA F VYAYR
Sbjct 4 TRRKSRPALIVLVVVASLGCLALGWWQWMRFESVGGTGQNLGYAFQWPLFAAFVVYAYRR 63
Query 63 FVRYEETPPQPPTGGAA----AEIPAGLLPERPK-PAQQPP---DDPVLR---EYNAYLA 111
FV+ E+ Q G A E+P GLLPERPK P P DD R EYN YLA
Sbjct 64 FVQLEDGDAQVGGAGTADAEPTEVPEGLLPERPKAPTTTDPSDLDDSDARQMLEYNDYLA 123
Query 112 ELAKDDARKQNRT 124
EL+ AR+ +R+
Sbjct 124 ELS---AREPDRS 133
>gi|312139211|ref|YP_004006547.1| integral membrane protein [Rhodococcus equi 103S]
gi|311888550|emb|CBH47862.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=135
Score = 133 bits (335), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/133 (57%), Positives = 87/133 (66%), Gaps = 14/133 (10%)
Query 3 TTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
T R+ RPALI LV++A+ GCLALGWWQW RF+S GT QNLGYA QWPLFA F VYAYR
Sbjct 4 TRRKSRPALIVLVVVASLGCLALGWWQWMRFESVGGTGQNLGYAFQWPLFAAFVVYAYRR 63
Query 63 FVRYEETPPQPPTGGAA----AEIPAGLLPERPK-PAQQPP---DDPVLR---EYNAYLA 111
FV+ E+ Q G A E+P GLLPERPK P P DD R EYN YLA
Sbjct 64 FVQLEDGDAQVGGAGTADGEPTEVPEGLLPERPKAPTTTDPSDFDDSDARQMLEYNDYLA 123
Query 112 ELAKDDARKQNRT 124
EL+ AR+ +R+
Sbjct 124 ELS---AREPDRS 133
>gi|54023065|ref|YP_117307.1| hypothetical protein nfa10980 [Nocardia farcinica IFM 10152]
gi|54014573|dbj|BAD55943.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=166
Score = 130 bits (326), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 73/127 (58%), Positives = 81/127 (64%), Gaps = 13/127 (10%)
Query 8 RPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYE 67
RPALIALV++A CLALGWWQW RF+S SGT QNLGYALQWPLFA F V+AY FVR E
Sbjct 28 RPALIALVLVAAVICLALGWWQWERFESASGTGQNLGYALQWPLFAAFVVFAYFRFVRLE 87
Query 68 ETPPQPPTGGAAA-----------EIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKD 116
+ P A EIPAG+LPERP A+ DPVL EYN YLAEL
Sbjct 88 SESTEGPAPAEPAAARRPRKDRPREIPAGILPERPTAARAA--DPVLAEYNRYLAELHAA 145
Query 117 DARKQNR 123
D +Q R
Sbjct 146 DVEEQVR 152
>gi|226307363|ref|YP_002767323.1| hypothetical protein RER_38760 [Rhodococcus erythropolis PR4]
gi|226186480|dbj|BAH34584.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=152
Score = 126 bits (317), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/131 (55%), Positives = 86/131 (66%), Gaps = 14/131 (10%)
Query 6 RRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVR 65
+RRPALI LVI++ GCLALGWWQW RF++ G+ QNLGYA QWPLFA F VYAYR FV+
Sbjct 24 KRRPALILLVIVSAAGCLALGWWQWERFEAVGGSGQNLGYAFQWPLFAAFVVYAYRKFVQ 83
Query 66 YEETPPQPPTGGAAA----EIPAGLLPERPK----PAQQPPDDPVLR---EYNAYLAELA 114
E+ P+ AA EIPA LLP+RP+ DDP R EYN YLA+LA
Sbjct 84 LEDVDPEEAEAEEAAAEPREIPADLLPQRPQAEFDDVNLDVDDPEARQMLEYNDYLAQLA 143
Query 115 KDDARKQNRTT 125
AR+ +R+T
Sbjct 144 ---ARESDRST 151
>gi|226360552|ref|YP_002778330.1| hypothetical protein ROP_11380 [Rhodococcus opacus B4]
gi|226239037|dbj|BAH49385.1| hypothetical protein [Rhodococcus opacus B4]
Length=133
Score = 125 bits (314), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/130 (54%), Positives = 84/130 (65%), Gaps = 12/130 (9%)
Query 5 RRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFV 64
++RRPALI V + GCLALGWWQWTRF++ GT QNLGYA QWPLFA F VYAYR FV
Sbjct 6 KQRRPALILFVAVGAVGCLALGWWQWTRFEAVGGTGQNLGYAFQWPLFAAFLVYAYRRFV 65
Query 65 RYEETPPQ-----PPTGGAAAEIPAGLLPERPKPAQ-QPPDDPV---LREYNAYLAELAK 115
+ E+ P P T EIPA +LP RP A+ PDD + +YN YLAEL+
Sbjct 66 QLEDAEPADTADVPDTPAELREIPADVLPPRPTAAEDDDPDDLESLQISQYNQYLAELS- 124
Query 116 DDARKQNRTT 125
AR+ +R+T
Sbjct 125 --ARETDRST 132
>gi|229494028|ref|ZP_04387797.1| conserved lipoprotein LprD [Rhodococcus erythropolis SK121]
gi|229319097|gb|EEN84949.1| conserved lipoprotein LprD [Rhodococcus erythropolis SK121]
Length=214
Score = 124 bits (311), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 71/131 (55%), Positives = 86/131 (66%), Gaps = 14/131 (10%)
Query 6 RRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVR 65
+RRPALI LVI++ GCLALGWWQW RF++ G+ QNLGYA QWPLFA F VYAYR FV+
Sbjct 86 KRRPALILLVIVSAAGCLALGWWQWERFEAVGGSGQNLGYAFQWPLFAAFVVYAYRKFVQ 145
Query 66 YEETPPQPPTGGAAA----EIPAGLLPERPK----PAQQPPDDPVLR---EYNAYLAELA 114
E+ P+ AA EIPA LLP+RP+ DDP R EYN YLA+LA
Sbjct 146 LEDVDPEEAEAEEAAAEPREIPADLLPQRPRAEFDDVNLDVDDPEARQMLEYNDYLAQLA 205
Query 115 KDDARKQNRTT 125
AR+ +R+T
Sbjct 206 ---ARESDRST 213
>gi|333921278|ref|YP_004494859.1| hypothetical protein AS9A_3621 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483499|gb|AEF42059.1| hypothetical protein AS9A_3621 [Amycolicicoccus subflavus DQS3-9A1]
Length=125
Score = 120 bits (301), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/124 (51%), Positives = 79/124 (64%), Gaps = 4/124 (3%)
Query 4 TRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNF 63
+R R ALI VI+ CL LGWWQ R++S +G+ QNLGYALQWPLFA F VYAY F
Sbjct 5 SRSPRLALIIFVIVGATACLVLGWWQLQRYESAAGSAQNLGYALQWPLFAGFLVYAYYRF 64
Query 64 VRYE-ETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQN 122
VR E E P G E+P +LP RP PA +DP + YNAYLAEL++ D ++
Sbjct 65 VRLEKEEEKGEPVQGPPREVPEDVLPPRPAPAGHADEDPEITAYNAYLAELSQHD---RD 121
Query 123 RTTA 126
R+T+
Sbjct 122 RSTS 125
>gi|169628579|ref|YP_001702228.1| hypothetical protein MAB_1488c [Mycobacterium abscessus ATCC
19977]
gi|169240546|emb|CAM61574.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=110
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 58/106 (55%), Positives = 72/106 (68%), Gaps = 0/106 (0%)
Query 14 LVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQP 73
LV ++ L L +WQ+ RF+S +G QNLGYALQWPLFA F ++AYR FV+YE+ P P
Sbjct 2 LVGVSVVAFLGLAYWQFQRFESVTGDGQNLGYALQWPLFAAFVIWAYRRFVQYEDEGPPP 61
Query 74 PTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDAR 119
P EIP GLLPERP A+ P D L EYNAYLA LA++D +
Sbjct 62 PPTDRVTEIPEGLLPERPAAAKPDPADRTLTEYNAYLAALAEEDRK 107
>gi|326385024|ref|ZP_08206696.1| glucitol operon activator-like protein [Gordonia neofelifaecis
NRRL B-59395]
gi|326196238|gb|EGD53440.1| glucitol operon activator-like protein [Gordonia neofelifaecis
NRRL B-59395]
Length=134
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/123 (53%), Positives = 73/123 (60%), Gaps = 10/123 (8%)
Query 2 STTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYR 61
+ RR RPALI LV +A CLAL WWQW R++S+SGT QNLGYALQWP FA VYAYR
Sbjct 8 TAKRRHRPALIVLVAVAAIACLALAWWQWGRYESSSGTGQNLGYALQWPAFAVAVVYAYR 67
Query 62 NFVRYEETPP-QPPTGGAAAEIPAGLLPERPKPA--------QQPPDDPVLREYNAYLAE 112
FV E P + EIP G+LP+RP PDD L EYN YLAE
Sbjct 68 RFVVMEADPDAERRDRDEPTEIPEGILPDRPTKNDPSVSAILDAAPDDD-LAEYNKYLAE 126
Query 113 LAK 115
L K
Sbjct 127 LDK 129
>gi|111018434|ref|YP_701406.1| hypothetical protein RHA1_ro01430 [Rhodococcus jostii RHA1]
gi|110817964|gb|ABG93248.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=133
Score = 110 bits (274), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/130 (54%), Positives = 86/130 (67%), Gaps = 12/130 (9%)
Query 5 RRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFV 64
++RRPAL+ LV++ GCLALGWWQWTRF++ GT QNLGYA QWPLFA F VYAYR FV
Sbjct 6 KQRRPALVLLVVVGAVGCLALGWWQWTRFEAVGGTGQNLGYAFQWPLFAAFLVYAYRRFV 65
Query 65 RYEETPPQPPTGGAAA-----EIPAGLLPERPKPAQ-QPPDDPVLR---EYNAYLAELAK 115
+ E+ P T A EIPA +LP RP A+ PDD R +YN YLAEL+
Sbjct 66 QLEDAEPADTTDVPDAPTELREIPADVLPPRPSVAEDDDPDDLESRQISQYNQYLAELS- 124
Query 116 DDARKQNRTT 125
AR+ +R+T
Sbjct 125 --ARETDRST 132
>gi|343924695|ref|ZP_08764237.1| hypothetical protein GOALK_023_00050 [Gordonia alkanivorans NBRC
16433]
gi|343765375|dbj|GAA11163.1| hypothetical protein GOALK_023_00050 [Gordonia alkanivorans NBRC
16433]
Length=129
Score = 100 bits (249), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/118 (50%), Positives = 66/118 (56%), Gaps = 12/118 (10%)
Query 12 IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETP- 70
+ L ++A CL L WWQW+RF+S SG+ QNLGYALQWP FA +YAYR FV E P
Sbjct 1 MVLTVVAAVACLVLAWWQWSRFESASGSGQNLGYALQWPAFAVAVIYAYRRFVVLESDPE 60
Query 71 ---PQPPTGGAAAEIPAGLLPERPKP--------AQQPPDDPVLREYNAYLAELAKDD 117
AAAEIP G+LPERP P D L EYN YLAEL D
Sbjct 61 EARKAAADARAAAEIPEGILPERPTTPSAASLINDHDPVKDAALIEYNRYLAELRSAD 118
>gi|296139025|ref|YP_003646268.1| hypothetical protein Tpau_1300 [Tsukamurella paurometabola DSM
20162]
gi|296027159|gb|ADG77929.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=146
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/127 (44%), Positives = 66/127 (52%), Gaps = 30/127 (23%)
Query 22 CLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETP----------- 70
CLALGWWQW RF+ST+G+ +NLGYALQWPLFA ++ YR FV+ E+
Sbjct 10 CLALGWWQWGRFESTAGSARNLGYALQWPLFALAFLWGYRRFVQLEDEAELQERLAADEA 69
Query 71 ----------------PQPPTGGAAAEIPAGLLPER---PKPAQQPPDDPVLREYNAYLA 111
P G+ IP G LP R P P + PDDP EYN LA
Sbjct 70 AAEPAAQADAAAARPAPVDRGAGSMTAIPEGFLPSRRAAPSPDESAPDDPAHAEYNRLLA 129
Query 112 ELAKDDA 118
ELA +DA
Sbjct 130 ELAANDA 136
>gi|296393947|ref|YP_003658831.1| hypothetical protein Srot_1537 [Segniliparus rotundus DSM 44985]
gi|296181094|gb|ADG98000.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=150
Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/124 (48%), Positives = 72/124 (59%), Gaps = 13/124 (10%)
Query 5 RRRR----PALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
RR+R PAL+ L +A C CL LG WQW RF + GTFQNLGYALQWPLFA F VY+Y
Sbjct 23 RRQRGGHHPALLMLTFLACCVCLLLGLWQWRRFHESGGTFQNLGYALQWPLFAVFTVYSY 82
Query 61 RNFVRYEETPPQPPTGGAAAEIP--AGLLPERPKPA--QQPPD--DPVLREYNAYLAELA 114
+R+ Q G + +P + +P P+PA Q D P L YNA LAELA
Sbjct 83 ---LRFFGLVRQSREGRDVSAVPRESSAMPPLPEPAPSNQAADRTHPGLAAYNALLAELA 139
Query 115 KDDA 118
+ A
Sbjct 140 EHTA 143
>gi|317508774|ref|ZP_07966424.1| acetolactate synthase [Segniliparus rugosus ATCC BAA-974]
gi|316252937|gb|EFV12357.1| acetolactate synthase [Segniliparus rugosus ATCC BAA-974]
Length=135
Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/114 (45%), Positives = 61/114 (54%), Gaps = 3/114 (2%)
Query 7 RRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRY 66
PA++ L A C CL LGWWQW RF GTFQNLGYALQWPLFA F VY Y F
Sbjct 14 HHPAMLVLTFAAACACLLLGWWQWNRFHQAGGTFQNLGYALQWPLFAVFTVYGYVRFFGL 73
Query 67 EETPPQPPTGGAAAEIPAGL--LPERPKPAQ-QPPDDPVLREYNAYLAELAKDD 117
+ + AA+ + + LPER + P L YNA LA+LA+
Sbjct 74 LKQSREGRDVTAASRESSAMPPLPEREGSDELAGPRHRGLAAYNAMLADLAEQQ 127
>gi|262201914|ref|YP_003273122.1| glucitol operon activator-like protein [Gordonia bronchialis
DSM 43247]
gi|262085261|gb|ACY21229.1| Glucitol operon activator-like protein [Gordonia bronchialis
DSM 43247]
Length=147
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/131 (52%), Positives = 78/131 (60%), Gaps = 14/131 (10%)
Query 1 VSTTRRR--RPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVY 58
+S+T RR RPALI LV +A CLAL WWQW R++S+SGT QNLGYALQWP FA +Y
Sbjct 1 MSSTPRRTHRPALIVLVAVAALACLALAWWQWDRYESSSGTAQNLGYALQWPAFAVAVIY 60
Query 59 AYRNFVRYEETPP----QPPTGGAAAEIPAGLLPERPKP------AQQPP--DDPVLREY 106
AYR FV E P Q EIP GLLPERP A +P +D L +Y
Sbjct 61 AYRRFVVLERAPEEQRAQAVRAKGVTEIPDGLLPERPTTPSASSLAARPDAAEDSALADY 120
Query 107 NAYLAELAKDD 117
N YLAEL D
Sbjct 121 NRYLAELDAQD 131
>gi|334563356|ref|ZP_08516347.1| hypothetical protein CbovD2_02188 [Corynebacterium bovis DSM
20582]
Length=176
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/94 (49%), Positives = 58/94 (62%), Gaps = 11/94 (11%)
Query 11 LIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYE--- 67
+AL ++AT L L WWQWTRF+S SG+FQNLGYALQWP+F F + AYR ++ YE
Sbjct 24 FLALAVVAT---LLLAWWQWTRFRSGSGSFQNLGYALQWPVFGVFFIIAYRKYIEYERDR 80
Query 68 ----ETPPQPPTGGAAAEIPAGLLPER-PKPAQQ 96
ETP P T +P LLP R P+P +
Sbjct 81 MLGDETPAAPSTDDEMRALPEDLLPTRVPEPTVE 114
>gi|25028954|ref|NP_739008.1| hypothetical protein CE2398 [Corynebacterium efficiens YS-314]
gi|23494241|dbj|BAC19208.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
Length=162
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 43/98 (44%), Positives = 56/98 (58%), Gaps = 4/98 (4%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+ T ++ R + + +II LAL WWQWTRFQS SGTFQNLGYA QWP+ F ++AY
Sbjct 12 LKTKKKVRLSHVVFLIICFAVALALAWWQWTRFQSGSGTFQNLGYAFQWPVIGGFLIFAY 71
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPP 98
R ++ YE AAE+ P+ KP PP
Sbjct 72 RKYLEYENQSID--RENLAAELARH--PDADKPRDTPP 105
>gi|259508019|ref|ZP_05750919.1| membrane protein [Corynebacterium efficiens YS-314]
gi|259164360|gb|EEW48914.1| membrane protein [Corynebacterium efficiens YS-314]
Length=151
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/98 (44%), Positives = 56/98 (58%), Gaps = 4/98 (4%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
+ T ++ R + + +II LAL WWQWTRFQS SGTFQNLGYA QWP+ F ++AY
Sbjct 1 MKTKKKVRLSHVVFLIICFAVALALAWWQWTRFQSGSGTFQNLGYAFQWPVIGGFLIFAY 60
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPP 98
R ++ YE AAE+ P+ KP PP
Sbjct 61 RKYLEYENQSID--RENLAAELARH--PDADKPRDTPP 94
>gi|19553698|ref|NP_601700.1| hypothetical protein NCgl2412 [Corynebacterium glutamicum ATCC
13032]
gi|62391338|ref|YP_226740.1| hypothetical protein cg2748 [Corynebacterium glutamicum ATCC
13032]
gi|145296500|ref|YP_001139321.1| hypothetical protein cgR_2411 [Corynebacterium glutamicum R]
gi|21325269|dbj|BAB99891.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC
13032]
gi|41326679|emb|CAF21161.1| putative membrane protein [Corynebacterium glutamicum ATCC 13032]
gi|140846420|dbj|BAF55419.1| hypothetical protein [Corynebacterium glutamicum R]
gi|344043860|gb|EGV39547.1| hypothetical protein CgS9114_12797 [Corynebacterium glutamicum
S9114]
Length=154
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/121 (38%), Positives = 64/121 (53%), Gaps = 6/121 (4%)
Query 1 VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAY 60
V ++ + + I ++I AL WWQW+RFQS SGTFQNLGYA QWPL F VYAY
Sbjct 9 VKEKKKVKASHIVFLLICFIAACALAWWQWSRFQSGSGTFQNLGYAFQWPLIGAFFVYAY 68
Query 61 RNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARK 120
R +++YE + A G P AQ +D ++ ++ L +DD+ K
Sbjct 69 RKYLQYENESIELENMEAKMMAEQGKTP----VAQSEQEDSFVQ--LSHRPSLVEDDSVK 122
Query 121 Q 121
+
Sbjct 123 E 123
>gi|306836766|ref|ZP_07469727.1| conserved hypothetical protein [Corynebacterium accolens ATCC
49726]
gi|304567353|gb|EFM42957.1| conserved hypothetical protein [Corynebacterium accolens ATCC
49726]
Length=158
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/56 (70%), Positives = 42/56 (75%), Gaps = 0/56 (0%)
Query 12 IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYE 67
IALVI L L +WQWTRF+S SGTFQNLGYA QWPLFA F VYAYR +RYE
Sbjct 17 IALVIFFVVLFLFLAYWQWTRFRSGSGTFQNLGYAFQWPLFAVFVVYAYRTTLRYE 72
>gi|227503001|ref|ZP_03933050.1| lipoprotein LprD [Corynebacterium accolens ATCC 49725]
gi|227076062|gb|EEI14025.1| lipoprotein LprD [Corynebacterium accolens ATCC 49725]
Length=158
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/56 (70%), Positives = 42/56 (75%), Gaps = 0/56 (0%)
Query 12 IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYE 67
IALVI L L +WQWTRF+S SGTFQNLGYA QWPLFA F VYAYR +RYE
Sbjct 17 IALVIFFVVLFLFLAYWQWTRFRSGSGTFQNLGYAFQWPLFAVFVVYAYRTTLRYE 72
>gi|260579676|ref|ZP_05847539.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC
43734]
gi|258602195|gb|EEW15509.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC
43734]
Length=147
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/121 (37%), Positives = 63/121 (53%), Gaps = 6/121 (4%)
Query 6 RRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVR 65
R R I +++A L L WWQW R+QS G+FQNLGYA+QWP+F F V AYR +++
Sbjct 19 RVRIIQIIFLVLAVIATLLLAWWQWDRWQSNGGSFQNLGYAIQWPIFGIFFVVAYRKYIQ 78
Query 66 YEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTT 125
YE+ AA +P + E P+ Q E+ ++ DD R++ R
Sbjct 79 YEKERLLGDESPAAPTLPTDEMTEVPEDFLQA------EEHAEETEDIWVDDRRERARMR 132
Query 126 A 126
A
Sbjct 133 A 133
>gi|319949718|ref|ZP_08023747.1| putative lipoprotein [Dietzia cinnamea P4]
gi|319436608|gb|EFV91699.1| putative lipoprotein [Dietzia cinnamea P4]
Length=187
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/153 (35%), Positives = 64/153 (42%), Gaps = 38/153 (24%)
Query 3 TTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
++ + RPA IA V++ C+ +G WQ R+Q SGT QNLGY WP F +YAY
Sbjct 22 SSVKHRPAWIAFVVLGAIVCIGMGLWQLARYQEASGTVQNLGYTFMWPFLGGFLIYAYLK 81
Query 63 FVRYEETPPQ-----------------------------PPTG---------GAAAEIPA 84
+VR E P G G EIPA
Sbjct 82 YVRMEAAEDDWASASDGTAGARGTDDDDDDEADPRAADLPAAGKNGSGRDRFGVLTEIPA 141
Query 85 GLLPERPKPAQQPPDDPVLREYNAYLAELAKDD 117
LLP R A D L YNAYLAELA+ D
Sbjct 142 DLLPTRRPAADARARDEGLDAYNAYLAELARKD 174
>gi|337291434|ref|YP_004630455.1| hypothetical protein CULC22_01827 [Corynebacterium ulcerans BR-AD22]
gi|334699740|gb|AEG84536.1| hypothetical protein CULC22_01827 [Corynebacterium ulcerans BR-AD22]
Length=148
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/65 (56%), Positives = 44/65 (68%), Gaps = 2/65 (3%)
Query 5 RRRRPAL--IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
RR R L I +++ L+L WWQWTRF+S SG+FQNLGYALQWP F F +YAYR
Sbjct 18 RRFRVRLWHIVFLVVVVSATLSLAWWQWTRFRSGSGSFQNLGYALQWPFFGGFFIYAYRK 77
Query 63 FVRYE 67
+ YE
Sbjct 78 LMEYE 82
>gi|334697459|gb|AEG82256.1| hypothetical protein CULC809_01725 [Corynebacterium ulcerans
809]
Length=148
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/65 (56%), Positives = 44/65 (68%), Gaps = 2/65 (3%)
Query 5 RRRRPAL--IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
RR R L I +++ L+L WWQWTRF+S SG+FQNLGYALQWP F F +YAYR
Sbjct 18 RRFRVRLWHIVFLVVVVAATLSLAWWQWTRFRSGSGSFQNLGYALQWPFFGGFFIYAYRK 77
Query 63 FVRYE 67
+ YE
Sbjct 78 LMEYE 82
>gi|300859071|ref|YP_003784054.1| hypothetical protein cpfrc_01654 [Corynebacterium pseudotuberculosis
FRC41]
gi|300686525|gb|ADK29447.1| hypothetical protein cpfrc_01654 [Corynebacterium pseudotuberculosis
FRC41]
gi|302206774|gb|ADL11116.1| Hypothetical protein CpC231_1654 [Corynebacterium pseudotuberculosis
C231]
gi|302331327|gb|ADL21521.1| Hypothetical protein Cp1002_1653 [Corynebacterium pseudotuberculosis
1002]
gi|308277019|gb|ADO26918.1| Glucitol operon activator [Corynebacterium pseudotuberculosis
I19]
gi|341825457|gb|AEK92978.1| Hypothetical protein CpPAT10_1653 [Corynebacterium pseudotuberculosis
PAT10]
Length=148
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/65 (57%), Positives = 44/65 (68%), Gaps = 2/65 (3%)
Query 5 RRRRPAL--IALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRN 62
RR R L I +++ L+L WWQWTRF+S SG+FQNLGYALQWP F F VYAYR
Sbjct 18 RRFRVRLWHIVFLVLVIVATLSLAWWQWTRFRSGSGSFQNLGYALQWPFFGGFFVYAYRK 77
Query 63 FVRYE 67
+ YE
Sbjct 78 LMEYE 82
>gi|38234418|ref|NP_940185.1| hypothetical protein DIP1848 [Corynebacterium diphtheriae NCTC
13129]
gi|38200681|emb|CAE50377.1| Putative membrane protein [Corynebacterium diphtheriae]
Length=125
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/87 (46%), Positives = 50/87 (58%), Gaps = 8/87 (9%)
Query 11 LIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYE--- 67
++ L ++ C L WWQWTRFQS +GTFQNLGY++QWP F F VYAYR FV YE
Sbjct 21 VLFLAVLVAC-TFGLAWWQWTRFQSGTGTFQNLGYSIQWPFFGGFFVYAYRKFVEYENQR 79
Query 68 ----ETPPQPPTGGAAAEIPAGLLPER 90
+ P + EI LP+R
Sbjct 80 LLGDDELFGPKDSESPTEIDESFLPQR 106
Lambda K H
0.321 0.136 0.454
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129836892488
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40