BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0332
Length=261
Sequences producing significant alignments: Score (Bits) E Value
gi|15607473|ref|NP_214846.1| hypothetical protein Rv0332 [Mycoba... 518 5e-145
gi|340625363|ref|YP_004743815.1| hypothetical protein MCAN_03341... 516 2e-144
gi|31791510|ref|NP_854003.1| hypothetical protein Mb0339 [Mycoba... 514 4e-144
gi|308231523|ref|ZP_07412762.2| hypothetical protein TMAG_03700 ... 498 4e-139
gi|308370375|ref|ZP_07421280.2| hypothetical protein TMCG_03015 ... 495 2e-138
gi|339293388|gb|AEJ45499.1| hypothetical protein CCDC5079_0309 [... 444 7e-123
gi|240172292|ref|ZP_04750951.1| hypothetical protein MkanA1_2345... 395 3e-108
gi|183980630|ref|YP_001848921.1| hypothetical protein MMAR_0604 ... 385 2e-105
gi|254820174|ref|ZP_05225175.1| hypothetical protein MintA_09616... 383 1e-104
gi|118463349|ref|YP_883947.1| hypothetical protein MAV_4822 [Myc... 379 2e-103
gi|41409924|ref|NP_962760.1| hypothetical protein MAP3826 [Mycob... 379 3e-103
gi|336460287|gb|EGO39189.1| hypothetical protein MAPs_42340 [Myc... 376 2e-102
gi|342859172|ref|ZP_08715826.1| hypothetical protein MCOL_09848 ... 375 3e-102
gi|118616387|ref|YP_904719.1| hypothetical protein MUL_0566 [Myc... 375 3e-102
gi|296167785|ref|ZP_06849973.1| conserved hypothetical protein [... 360 1e-97
gi|333988965|ref|YP_004521579.1| hypothetical protein JDM601_032... 300 1e-79
gi|118471045|ref|YP_885090.1| hypothetical protein MSMEG_0682 [M... 283 2e-74
gi|126433033|ref|YP_001068724.1| hypothetical protein Mjls_0421 ... 282 3e-74
gi|108797414|ref|YP_637611.1| hypothetical protein Mmcs_0434 [My... 281 9e-74
gi|145220891|ref|YP_001131569.1| hypothetical protein Mflv_0287 ... 275 3e-72
gi|120401617|ref|YP_951446.1| hypothetical protein Mvan_0601 [My... 258 5e-67
gi|324998628|ref|ZP_08119740.1| hypothetical protein PseP1_07672... 229 4e-58
gi|158316801|ref|YP_001509309.1| hypothetical protein Franean1_5... 219 3e-55
gi|312197499|ref|YP_004017560.1| hypothetical protein FraEuI1c_3... 217 1e-54
gi|312197979|ref|YP_004018040.1| hypothetical protein FraEuI1c_4... 211 1e-52
gi|331698178|ref|YP_004334417.1| hypothetical protein Psed_4407 ... 210 1e-52
gi|288918071|ref|ZP_06412429.1| protein of unknown function DUF1... 181 1e-43
gi|337765417|emb|CCB74126.1| conserved protein of unknown functi... 156 3e-36
gi|182438985|ref|YP_001826704.1| hypothetical protein SGR_5192 [... 151 1e-34
gi|256390592|ref|YP_003112156.1| hypothetical protein Caci_1392 ... 145 5e-33
gi|239987303|ref|ZP_04707967.1| hypothetical protein SrosN1_0836... 144 2e-32
gi|134098435|ref|YP_001104096.1| hypothetical protein SACE_1859 ... 140 1e-31
gi|271970204|ref|YP_003344400.1| hypothetical protein Sros_9026 ... 139 4e-31
gi|297156087|gb|ADI05799.1| hypothetical protein SBI_02678 [Stre... 137 1e-30
gi|302543297|ref|ZP_07295639.1| conserved hypothetical protein [... 134 1e-29
gi|134099833|ref|YP_001105494.1| hypothetical protein SACE_3294 ... 132 4e-29
gi|302558119|ref|ZP_07310461.1| conserved hypothetical protein [... 127 1e-27
gi|297204434|ref|ZP_06921831.1| conserved hypothetical protein [... 122 4e-26
gi|300783549|ref|YP_003763840.1| hypothetical protein AMED_1626 ... 119 3e-25
gi|289768817|ref|ZP_06528195.1| conserved hypothetical protein [... 111 9e-23
gi|21224000|ref|NP_629779.1| hypothetical protein SCO5649 [Strep... 111 9e-23
gi|312138927|ref|YP_004006263.1| hypothetical protein REQ_15000 ... 111 1e-22
gi|325676650|ref|ZP_08156326.1| hypothetical protein HMPREF0724_... 111 1e-22
gi|302530682|ref|ZP_07283024.1| predicted protein [Streptomyces ... 104 1e-20
gi|333921310|ref|YP_004494891.1| hypothetical protein AS9A_3653 ... 103 2e-20
gi|239985866|ref|ZP_04706530.1| hypothetical protein SrosN1_0103... 101 1e-19
gi|254380896|ref|ZP_04996262.1| conserved hypothetical protein [... 99.8 3e-19
gi|29827917|ref|NP_822551.1| hypothetical protein SAV_1376 [Stre... 99.4 4e-19
gi|297195727|ref|ZP_06913125.1| conserved hypothetical protein [... 98.2 1e-18
gi|291454435|ref|ZP_06593825.1| conserved hypothetical protein [... 97.4 2e-18
>gi|15607473|ref|NP_214846.1| hypothetical protein Rv0332 [Mycobacterium tuberculosis H37Rv]
gi|148660098|ref|YP_001281621.1| hypothetical protein MRA_0341 [Mycobacterium tuberculosis H37Ra]
gi|148821528|ref|YP_001286282.1| hypothetical protein TBFG_10337 [Mycobacterium tuberculosis F11]
42 more sequence titles
Length=261
Score = 518 bits (1333), Expect = 5e-145, Method: Compositional matrix adjust.
Identities = 260/261 (99%), Positives = 261/261 (100%), Gaps = 0/261 (0%)
Query 1 LRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
+RKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA
Sbjct 1 MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
Query 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP
Sbjct 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
Query 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED
Sbjct 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
Query 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD
Sbjct 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
Query 241 TGIELLGDAGVWQKWLDRTPL 261
TGIELLGDAGVWQKWLDRTPL
Sbjct 241 TGIELLGDAGVWQKWLDRTPL 261
>gi|340625363|ref|YP_004743815.1| hypothetical protein MCAN_03341 [Mycobacterium canettii CIPT
140010059]
gi|340003553|emb|CCC42674.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=261
Score = 516 bits (1328), Expect = 2e-144, Method: Compositional matrix adjust.
Identities = 259/261 (99%), Positives = 260/261 (99%), Gaps = 0/261 (0%)
Query 1 LRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
+RKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA
Sbjct 1 MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
Query 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP
Sbjct 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
Query 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGG PLPLED
Sbjct 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGAPLPLED 180
Query 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD
Sbjct 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
Query 241 TGIELLGDAGVWQKWLDRTPL 261
TGIELLGDAGVWQKWLDRTPL
Sbjct 241 TGIELLGDAGVWQKWLDRTPL 261
>gi|31791510|ref|NP_854003.1| hypothetical protein Mb0339 [Mycobacterium bovis AF2122/97]
gi|121636246|ref|YP_976469.1| hypothetical protein BCG_0371 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224988719|ref|YP_002643406.1| hypothetical protein JTY_0341 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31617096|emb|CAD93203.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121491893|emb|CAL70356.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224771832|dbj|BAH24638.1| hypothetical protein JTY_0341 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341600262|emb|CCC62932.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=261
Score = 514 bits (1325), Expect = 4e-144, Method: Compositional matrix adjust.
Identities = 259/261 (99%), Positives = 260/261 (99%), Gaps = 0/261 (0%)
Query 1 LRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
+RKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA
Sbjct 1 MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
Query 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP
Sbjct 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
Query 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED
Sbjct 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
Query 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
DDTLHLHATDPGLLEAG WTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD
Sbjct 181 DDTLHLHATDPGLLEAGGWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
Query 241 TGIELLGDAGVWQKWLDRTPL 261
TGIELLGDAGVWQKWLDRTPL
Sbjct 241 TGIELLGDAGVWQKWLDRTPL 261
>gi|308231523|ref|ZP_07412762.2| hypothetical protein TMAG_03700 [Mycobacterium tuberculosis SUMu001]
gi|308369365|ref|ZP_07417508.2| hypothetical protein TMBG_03561 [Mycobacterium tuberculosis SUMu002]
gi|308371643|ref|ZP_07425648.2| hypothetical protein TMDG_02526 [Mycobacterium tuberculosis SUMu004]
21 more sequence titles
Length=251
Score = 498 bits (1282), Expect = 4e-139, Method: Compositional matrix adjust.
Identities = 250/251 (99%), Positives = 251/251 (100%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF
Sbjct 1 MDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH
Sbjct 61 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD
Sbjct 121 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG
Sbjct 181 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 240
Query 251 VWQKWLDRTPL 261
VWQKWLDRTPL
Sbjct 241 VWQKWLDRTPL 251
>gi|308370375|ref|ZP_07421280.2| hypothetical protein TMCG_03015 [Mycobacterium tuberculosis SUMu003]
gi|308332238|gb|EFP21089.1| hypothetical protein TMCG_03015 [Mycobacterium tuberculosis SUMu003]
Length=251
Score = 495 bits (1275), Expect = 2e-138, Method: Compositional matrix adjust.
Identities = 249/251 (99%), Positives = 250/251 (99%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF
Sbjct 1 MDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH
Sbjct 61 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD
Sbjct 121 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGLLEAGEWTVRRDERGVTWSHRHGKGAVAL GGATELLLAMVRRLSVADTGIELLGDAG
Sbjct 181 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALCGGATELLLAMVRRLSVADTGIELLGDAG 240
Query 251 VWQKWLDRTPL 261
VWQKWLDRTPL
Sbjct 241 VWQKWLDRTPL 251
>gi|339293388|gb|AEJ45499.1| hypothetical protein CCDC5079_0309 [Mycobacterium tuberculosis
CCDC5079]
Length=225
Score = 444 bits (1141), Expect = 7e-123, Method: Compositional matrix adjust.
Identities = 224/225 (99%), Positives = 225/225 (100%), Gaps = 0/225 (0%)
Query 37 VPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGA 96
+PTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGA
Sbjct 1 MPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGA 60
Query 97 RLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAAD 156
RLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAAD
Sbjct 61 RLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAAD 120
Query 157 GISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGK 216
GISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGK
Sbjct 121 GISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGK 180
Query 217 GAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL 261
GAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL
Sbjct 181 GAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL 225
>gi|240172292|ref|ZP_04750951.1| hypothetical protein MkanA1_23456 [Mycobacterium kansasii ATCC
12478]
Length=251
Score = 395 bits (1015), Expect = 3e-108, Method: Compositional matrix adjust.
Identities = 196/251 (79%), Positives = 214/251 (86%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY+SAYL+QT FG+LIRN DQSTPVP+CPGW+LGQLFRHVGRGDRWAAQIVRDRLD F
Sbjct 1 MDYTSAYLDQTREFGDLIRNADQSTPVPSCPGWNLGQLFRHVGRGDRWAAQIVRDRLDSF 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPRSVE GKPPPD DDAI+WL GGA+ LVDAVE+TG ETPVWTFLG RPAGWW+RRRLH
Sbjct 61 LDPRSVEEGKPPPDMDDAITWLRGGAQRLVDAVERTGTETPVWTFLGARPAGWWIRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHR D AI +G EFTLEP++AADGISEFLERIA QAG LPL+ DTLHLHATD
Sbjct 121 EVAVHRVDAAIAIGSEFTLEPDIAADGISEFLERIATQAGRDDADLPLQAGDTLHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGL AGEWTV DE +TWSH HGKG VALRG A ELLLAMVRR+ VADTGIE+ GD
Sbjct 181 PGLGAAGEWTVGVDEGRITWSHEHGKGTVALRGSAAELLLAMVRRVPVADTGIEVFGDPA 240
Query 251 VWQKWLDRTPL 261
VW+KWLD TPL
Sbjct 241 VWRKWLDGTPL 251
>gi|183980630|ref|YP_001848921.1| hypothetical protein MMAR_0604 [Mycobacterium marinum M]
gi|183173956|gb|ACC39066.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=251
Score = 385 bits (990), Expect = 2e-105, Method: Compositional matrix adjust.
Identities = 188/251 (75%), Positives = 211/251 (85%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D +AYL+QT AFGEL+ N DQSTPVP+CPGW+LGQLFRHVGRGDRWAAQIVRDRLD F
Sbjct 1 MDQVAAYLDQTRAFGELVGNNDQSTPVPSCPGWNLGQLFRHVGRGDRWAAQIVRDRLDSF 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR+V GGKPP DDAI+WL GA+L+VDAVEQ G ETPVWTFLGPRPA WWVRRRLH
Sbjct 61 LDPRNVAGGKPPAAVDDAIAWLQDGAQLMVDAVEQAGAETPVWTFLGPRPAHWWVRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EV VHRAD AI +G +F LEP +AADGISEFLERIAVQAG G PLP+ED DT+HLHATD
Sbjct 121 EVVVHRADAAIALGQQFVLEPEIAADGISEFLERIAVQAGRDGAPLPIEDGDTVHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGL + GEWT ++ +TWSH HGKG VA+RGGA ELLLAM RR+SV DTGIE+ GD
Sbjct 181 PGLGDVGEWTAAVEDGHITWSHEHGKGTVAVRGGAAELLLAMTRRVSVPDTGIEVFGDQA 240
Query 251 VWQKWLDRTPL 261
VWQKWL+RTPL
Sbjct 241 VWQKWLERTPL 251
>gi|254820174|ref|ZP_05225175.1| hypothetical protein MintA_09616 [Mycobacterium intracellulare
ATCC 13950]
Length=251
Score = 383 bits (984), Expect = 1e-104, Method: Compositional matrix adjust.
Identities = 186/251 (75%), Positives = 215/251 (86%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY+ A+L+Q AF EL D+STPVPTCP W+L QLFRHVGRGDRWAAQIVRDRL+ +
Sbjct 1 MDYAGAFLDQNRAFAELFDGADESTPVPTCPEWTLRQLFRHVGRGDRWAAQIVRDRLESY 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR+VEGGKPPPDP DAISWL+GGA+ LVDAVE TGVETPVWTFLGPRPA WW+RRRLH
Sbjct 61 LDPRTVEGGKPPPDPADAISWLHGGAQRLVDAVELTGVETPVWTFLGPRPANWWIRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRAD AI +G +F LEP++AADGI+E+LER+A+QAG G PLPLED TLHLHATD
Sbjct 121 EVAVHRADAAIALGTDFALEPDIAADGITEWLERVAIQAGGQGAPLPLEDGTTLHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGL EAGEWT D+ VTWSH HGKG+VALRGGATELLLA++RR +ADTG++L GD
Sbjct 181 PGLGEAGEWTAAVDQGRVTWSHEHGKGSVALRGGATELLLAILRRRPLADTGVQLFGDEA 240
Query 251 VWQKWLDRTPL 261
VW++WLDRTPL
Sbjct 241 VWERWLDRTPL 251
>gi|118463349|ref|YP_883947.1| hypothetical protein MAV_4822 [Mycobacterium avium 104]
gi|254777257|ref|ZP_05218773.1| hypothetical protein MaviaA2_21669 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118164636|gb|ABK65533.1| conserved hypothetical protein, putative [Mycobacterium avium
104]
Length=264
Score = 379 bits (974), Expect = 2e-103, Method: Compositional matrix adjust.
Identities = 185/255 (73%), Positives = 214/255 (84%), Gaps = 0/255 (0%)
Query 7 SLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDR 66
SL VDY+ A+L++ AF EL R+ D+S PVPTCP W+L QLFRHVGRGDRWAAQIVRDR
Sbjct 10 SLTGVDYAGAFLDENRAFAELFRDADESMPVPTCPDWTLRQLFRHVGRGDRWAAQIVRDR 69
Query 67 LDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVR 126
LD +LDPR VEGGKPPPDP DAISWL+GGA+ LVDAVE TGV+TPVWTFLGPRPA WW+R
Sbjct 70 LDSYLDPRMVEGGKPPPDPADAISWLHGGAQRLVDAVELTGVQTPVWTFLGPRPANWWIR 129
Query 127 RRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHL 186
RRLHE AVHRAD AI +G EFTL P +AAD I+E+LER+AVQAG G PLPL++ DTLHL
Sbjct 130 RRLHETAVHRADAAIALGREFTLRPELAADAITEWLERVAVQAGGQGAPLPLDNADTLHL 189
Query 187 HATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELL 246
HATDPGL +AGEWTV ++ + WSH HGKG+VALRGGAT+LLLA++RR +ADTG EL
Sbjct 190 HATDPGLGDAGEWTVAVEQGRIAWSHEHGKGSVALRGGATDLLLAILRRRPLADTGAELF 249
Query 247 GDAGVWQKWLDRTPL 261
GD VWQ+WLDRTPL
Sbjct 250 GDDAVWQRWLDRTPL 264
>gi|41409924|ref|NP_962760.1| hypothetical protein MAP3826 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398757|gb|AAS06376.1| hypothetical protein MAP_3826 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=274
Score = 379 bits (972), Expect = 3e-103, Method: Compositional matrix adjust.
Identities = 185/255 (73%), Positives = 214/255 (84%), Gaps = 0/255 (0%)
Query 7 SLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDR 66
SL VDY+ A+L++ AF EL R+ D+STPVPTCP W+L QLFRHVGRGDRWAAQIVRDR
Sbjct 20 SLTGVDYAGAFLDENRAFAELFRDADESTPVPTCPDWTLRQLFRHVGRGDRWAAQIVRDR 79
Query 67 LDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVR 126
LD +LDPR VEGGKPPPDP DAISWL+GGA+ LVDAVE TGV+TPVWTFLGPRPA WW+R
Sbjct 80 LDSYLDPRMVEGGKPPPDPADAISWLHGGAQRLVDAVELTGVQTPVWTFLGPRPANWWIR 139
Query 127 RRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHL 186
RRLHE AVH AD AI +G EFTL P +AAD I+E+LER+AVQAG G PLPL++ DTLHL
Sbjct 140 RRLHETAVHLADAAIALGREFTLRPELAADAITEWLERVAVQAGGQGAPLPLDNADTLHL 199
Query 187 HATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELL 246
HATDPGL +AGEWTV ++ + WSH HGKG+VALRGGAT+LLLA++RR +ADTG EL
Sbjct 200 HATDPGLGDAGEWTVAVEQGRIAWSHEHGKGSVALRGGATDLLLAILRRRPLADTGAELF 259
Query 247 GDAGVWQKWLDRTPL 261
GD VWQ+WLDRTPL
Sbjct 260 GDDAVWQRWLDRTPL 274
>gi|336460287|gb|EGO39189.1| hypothetical protein MAPs_42340 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=274
Score = 376 bits (965), Expect = 2e-102, Method: Compositional matrix adjust.
Identities = 184/255 (73%), Positives = 213/255 (84%), Gaps = 0/255 (0%)
Query 7 SLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDR 66
SL VDY+ A+L++ AF EL R+ D+STPVPTCP W+L QLFRHVG GDRWAAQIVRDR
Sbjct 20 SLTGVDYAGAFLDENRAFAELFRDADESTPVPTCPDWTLRQLFRHVGPGDRWAAQIVRDR 79
Query 67 LDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVR 126
LD +LDPR VEGGKPPPDP DAISWL+GGA+ LVDAVE TGV+TPVWTFLGPRPA WW+R
Sbjct 80 LDSYLDPRMVEGGKPPPDPADAISWLHGGAQRLVDAVELTGVQTPVWTFLGPRPANWWIR 139
Query 127 RRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHL 186
RRLHE AVH AD AI +G EFTL P +AAD I+E+LER+AVQAG G PLPL++ DTLHL
Sbjct 140 RRLHETAVHLADAAIALGREFTLRPELAADAITEWLERVAVQAGGQGAPLPLDNADTLHL 199
Query 187 HATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELL 246
HATDPGL +AGEWTV ++ + WSH HGKG+VALRGGAT+LLLA++RR +ADTG EL
Sbjct 200 HATDPGLGDAGEWTVTVEQGRIAWSHEHGKGSVALRGGATDLLLAILRRRPLADTGAELF 259
Query 247 GDAGVWQKWLDRTPL 261
GD VWQ+WLDRTPL
Sbjct 260 GDDAVWQRWLDRTPL 274
>gi|342859172|ref|ZP_08715826.1| hypothetical protein MCOL_09848 [Mycobacterium colombiense CECT
3035]
gi|342133413|gb|EGT86616.1| hypothetical protein MCOL_09848 [Mycobacterium colombiense CECT
3035]
Length=286
Score = 375 bits (964), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 181/261 (70%), Positives = 217/261 (84%), Gaps = 0/261 (0%)
Query 1 LRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAA 60
L PA +L +VDY+ A+L++ AF EL + D+STPVPTCP W+L QLFRHVGRGDRWAA
Sbjct 26 LTIPAGNLTRVDYAGAFLDENRAFAELFEDADESTPVPTCPDWTLRQLFRHVGRGDRWAA 85
Query 61 QIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRP 120
QIVRD+LD +LDPR+VE GKPPPDP AI+WL GGA+ L+DAVE TGVETPVWTFLG RP
Sbjct 86 QIVRDKLDSYLDPRTVEAGKPPPDPTGAIAWLRGGAQRLIDAVELTGVETPVWTFLGSRP 145
Query 121 AGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
A WWVRRRLHEVAVHRAD AI +G EFTL +VAADGI+E+LER+A+QAG G PLPLE+
Sbjct 146 ANWWVRRRLHEVAVHRADAAIALGSEFTLAADVAADGITEWLERVAIQAGGQGAPLPLEE 205
Query 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
D+LHLHATDPGL EAGEWT+ + + WSH+HGKG+ ALRGG+TELLLA++RR +AD
Sbjct 206 GDSLHLHATDPGLGEAGEWTIAVEGGRIVWSHQHGKGSAALRGGSTELLLAILRRRPLAD 265
Query 241 TGIELLGDAGVWQKWLDRTPL 261
TG++L GD VW++WLDRTPL
Sbjct 266 TGVQLFGDDVVWERWLDRTPL 286
>gi|118616387|ref|YP_904719.1| hypothetical protein MUL_0566 [Mycobacterium ulcerans Agy99]
gi|118568497|gb|ABL03248.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=251
Score = 375 bits (964), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 184/251 (74%), Positives = 208/251 (83%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D +AYL+QT AFG+LI DQS PVP+CPGW+LGQLFRHVGRGDRWAAQIVRDRL+ F
Sbjct 1 MDQVAAYLDQTRAFGKLIGGNDQSAPVPSCPGWNLGQLFRHVGRGDRWAAQIVRDRLNSF 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR+V GGKPP DDAI+WL GA+L+VDAVEQ G ETPVWTFLGPRPA WWVRRRLH
Sbjct 61 LDPRNVAGGKPPAAVDDAIAWLQDGAQLMVDAVEQAGAETPVWTFLGPRPAHWWVRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EV VHRAD AI +G +F LEP +AADGISEFLERIAVQAG G PLP+ED DT+HLHATD
Sbjct 121 EVVVHRADAAIALGQQFVLEPEIAADGISEFLERIAVQAGHDGAPLPIEDGDTVHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGL + GEWT ++ +TWS HGKG VA+RGGA ELLLAM RR+SV DTGIE+ GD
Sbjct 181 PGLGDVGEWTAAVEDGHITWSPEHGKGTVAVRGGAAELLLAMTRRVSVPDTGIEVFGDQA 240
Query 251 VWQKWLDRTPL 261
VWQKWL+RTPL
Sbjct 241 VWQKWLERTPL 251
>gi|296167785|ref|ZP_06849973.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897058|gb|EFG76676.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=251
Score = 360 bits (924), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 176/251 (71%), Positives = 205/251 (82%), Gaps = 0/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY++A+L + AF EL+ + D+STPVPTCPGW+L QL RHVGRG+RWAAQIVRD+LD
Sbjct 1 MDYAAAFLAENRAFAELVGDADESTPVPTCPGWTLKQLLRHVGRGERWAAQIVRDKLDQP 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPRSVEGGKPP DP D ISWL+GGA+ LVDAVE TG ETPVWTFLGPRPA WW+RR +H
Sbjct 61 LDPRSVEGGKPPSDPADVISWLHGGAQRLVDAVELTGAETPVWTFLGPRPASWWLRRWVH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRAD AI + EF+L AADGI+E+LER+A+QAG G LPLED +TLHLHATD
Sbjct 121 EVAVHRADAAIALKAEFSLPAEQAADGITEWLERVAIQAGREGAALPLEDGNTLHLHATD 180
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
PGL EAGEWT+ + VTWSH HGKG ALRGGATELLLA++RR+ +ADTG+ L GD
Sbjct 181 PGLGEAGEWTIGVEAGHVTWSHEHGKGTAALRGGATELLLAILRRVPLADTGVALFGDEA 240
Query 251 VWQKWLDRTPL 261
VWQ WLDRTPL
Sbjct 241 VWQNWLDRTPL 251
>gi|333988965|ref|YP_004521579.1| hypothetical protein JDM601_0325 [Mycobacterium sp. JDM601]
gi|333484933|gb|AEF34325.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=252
Score = 300 bits (769), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/253 (61%), Positives = 183/253 (73%), Gaps = 3/253 (1%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY++A L++T AFGELIR+ D VP+CP W+L QLFRHVGRG RWAAQIV DRLDH
Sbjct 1 MDYAAALLDETRAFGELIRSGDPGLAVPSCPEWNLTQLFRHVGRGHRWAAQIVADRLDHA 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V GKPP DPD AI WL GA+ ++D V Q G + P WTFLGPRPA WWVRRRLH
Sbjct 61 LDPRDVVDGKPPADPDAAIGWLNDGAQRVLDGVAQVGPDNPAWTFLGPRPASWWVRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
E VHRAD A+ +GG+F L +AADGISEFL+ I + G P PL D T+HLHATD
Sbjct 121 EATVHRADAALALGGDFALPAELAADGISEFLDLITARVARDGQP-PLADGQTVHLHATD 179
Query 191 PGLLEAGEWTVRR--DERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGD 248
GL +AGEWT+ R D + W+H HGKG+VALRG A +L LA++ R+ VADT I L GD
Sbjct 180 DGLGQAGEWTISRSADNAALVWAHEHGKGSVALRGPARDLFLAIMGRVPVADTDIVLFGD 239
Query 249 AGVWQKWLDRTPL 261
A VWQ+W++ T
Sbjct 240 AAVWQEWVEHTAF 252
>gi|118471045|ref|YP_885090.1| hypothetical protein MSMEG_0682 [Mycobacterium smegmatis str.
MC2 155]
gi|118172332|gb|ABK73228.1| conserved hypothetical protein, putative [Mycobacterium smegmatis
str. MC2 155]
Length=250
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 148/251 (59%), Positives = 176/251 (71%), Gaps = 1/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+ +A L+QT AFGELI + D TPVPTCP W+L QL RHVGRG+RWAAQI+ DRL
Sbjct 1 MDFRAALLDQTRAFGELIASGDPDTPVPTCPDWTLRQLLRHVGRGNRWAAQIISDRLSQE 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V GKPP DP AI WL GA L+V AV+Q G E VWTFLGPRPAGWW+RRR +
Sbjct 61 LDPRQVRDGKPPDDPQGAIEWLNAGAALIVKAVDQVGSEARVWTFLGPRPAGWWIRRRAN 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRAD AI +G ++ L +AAD ISE+LER V+A +PL ++HLHATD
Sbjct 121 EVAVHRADAAIALGADYDLPLELAADAISEWLERTCVEAKRHHR-VPLAFGQSVHLHATD 179
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
GL GEWT+ DE GV WSH HGKG+VALRG A +LLLA+ R + AD G+E+ GD
Sbjct 180 DGLGPTGEWTLVNDEDGVGWSHDHGKGSVALRGPAKDLLLAITGRRTPADLGLEVFGDTE 239
Query 251 VWQKWLDRTPL 261
VW K L P
Sbjct 240 VWDKMLAAAPF 250
>gi|126433033|ref|YP_001068724.1| hypothetical protein Mjls_0421 [Mycobacterium sp. JLS]
gi|126232833|gb|ABN96233.1| protein of unknown function DUF1503 [Mycobacterium sp. JLS]
Length=249
Score = 282 bits (722), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 142/251 (57%), Positives = 179/251 (72%), Gaps = 2/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+ +A LEQT++FG+LI D TPV TC W+L QLFRHVGRG+RWAAQIV +R
Sbjct 1 MDFRAALLEQTNSFGDLIATGDPETPVTTCGDWTLRQLFRHVGRGNRWAAQIVAERRHEP 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V G+PP DPD AI WL GARLL+ AV+ G T VWTFLGPRPAGWW+RRR+H
Sbjct 61 LDPREVRDGRPPEDPDGAIQWLRDGARLLIHAVDSVGSGTKVWTFLGPRPAGWWIRRRVH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRAD A+ +G + L P++AAD +SE++E AVQAG G LP+E TLHLHATD
Sbjct 121 EVAVHRADAALALGQPYDLPPDLAADALSEWIELAAVQAGRRG--LPIERGHTLHLHATD 178
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
L GEW + E G+ W+H HGKG VA+RG +LLLA+ RR ++ D+G+E G
Sbjct 179 ESLGGVGEWMITSTEDGIDWTHEHGKGDVAVRGPVADLLLAVTRRRTLTDSGLEAFGKTE 238
Query 251 VWQKWLDRTPL 261
+W +WL++TP
Sbjct 239 IWDRWLEQTPF 249
>gi|108797414|ref|YP_637611.1| hypothetical protein Mmcs_0434 [Mycobacterium sp. MCS]
gi|119866498|ref|YP_936450.1| hypothetical protein Mkms_0444 [Mycobacterium sp. KMS]
gi|108767833|gb|ABG06555.1| protein of unknown function DUF1503 [Mycobacterium sp. MCS]
gi|119692587|gb|ABL89660.1| protein of unknown function DUF1503 [Mycobacterium sp. KMS]
Length=249
Score = 281 bits (718), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 142/251 (57%), Positives = 179/251 (72%), Gaps = 2/251 (0%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+ +A LEQT++FG+LI D TPV TC W+L QLFRHVGRG+RWAAQIV +R
Sbjct 1 MDFRAALLEQTNSFGDLIATGDPETPVTTCGDWTLRQLFRHVGRGNRWAAQIVAERRHEP 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V G+PP DPD AI WL GARLL+ AV+ G T VWTFLG RPAGWW+RRR+H
Sbjct 61 LDPREVRDGRPPEDPDGAIQWLREGARLLIHAVDSVGSGTKVWTFLGTRPAGWWIRRRVH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EVAVHRAD A+ +G + L P++AAD +SE++E AVQAG G LP+E TLHLHATD
Sbjct 121 EVAVHRADAALALGQPYDLPPDLAADALSEWIELAAVQAGRRG--LPIERGHTLHLHATD 178
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
L GEW + E G+ W+H HGKG VA+RG +LLLA+ RR ++AD+G+E G
Sbjct 179 ESLGGVGEWMITSTEDGIDWTHEHGKGDVAVRGPVADLLLAVTRRRTLADSGLEAFGKTE 238
Query 251 VWQKWLDRTPL 261
+W +WL++TP
Sbjct 239 IWDRWLEQTPF 249
>gi|145220891|ref|YP_001131569.1| hypothetical protein Mflv_0287 [Mycobacterium gilvum PYR-GCK]
gi|315442153|ref|YP_004075032.1| hypothetical protein Mspyr1_04880 [Mycobacterium sp. Spyr1]
gi|145213377|gb|ABP42781.1| protein of unknown function DUF1503 [Mycobacterium gilvum PYR-GCK]
gi|315260456|gb|ADT97197.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=248
Score = 275 bits (704), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 147/251 (59%), Positives = 181/251 (73%), Gaps = 3/251 (1%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+ +A LEQT AFGELIR+ D +TPVPTC W+L QLFRHVGRG+RWAAQIV +R
Sbjct 1 MDFRAALLEQTRAFGELIRSADPATPVPTCGDWTLKQLFRHVGRGNRWAAQIVSERRTEP 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V G+PP DPD AI WL GA++L+DAV++ +T VWTF GPRP GWW+RRRLH
Sbjct 61 LDPRDVRDGRPPEDPDGAIEWLNAGAQVLIDAVDRARPDTKVWTFTGPRPGGWWLRRRLH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
EV VHRAD A+ +G + LEP +AADGISE++E + A + P PL+ ++LHLHATD
Sbjct 121 EVVVHRADAALALGADLRLEPEMAADGISEWIE---LAANNRRGPAPLDRGESLHLHATD 177
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
L GEWTV DE GV WSH HGK VAL+G AT LLLA+ RR++ G+E+ GD
Sbjct 178 DKLGPTGEWTVVHDEDGVWWSHNHGKAGVALKGPATGLLLAITRRVTAEQAGLEMFGDTA 237
Query 251 VWQKWLDRTPL 261
VW WL+RTP
Sbjct 238 VWDAWLERTPF 248
>gi|120401617|ref|YP_951446.1| hypothetical protein Mvan_0601 [Mycobacterium vanbaalenii PYR-1]
gi|119954435|gb|ABM11440.1| protein of unknown function DUF1503 [Mycobacterium vanbaalenii
PYR-1]
Length=248
Score = 258 bits (660), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 137/251 (55%), Positives = 175/251 (70%), Gaps = 3/251 (1%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+ +A LEQT AFG+LIR D +TPVPTC W+L QL+RHVGRG+RWAAQI+ +R +
Sbjct 1 MDFRAALLEQTRAFGDLIRPADPATPVPTCGEWTLKQLYRHVGRGNRWAAQIISERRNQP 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR V GKPP D D AI W GA++++DAV+ G + VWTF+GPRPAGWW+RRR+H
Sbjct 61 LDPREVRDGKPPDDHDAAIEWFQRGAQMVIDAVDHVGADARVWTFIGPRPAGWWIRRRVH 120
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
E AVHRAD A+ +G F L AAD +SE++E V P L+ ++HLHA++
Sbjct 121 ETAVHRADAALALGAPFELPDEFAADCLSEWIELATVDKRH---PPALDPGQSIHLHASE 177
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
L GEWT+ DE G++WSH H K +VALRG T LLLA VRR + AD G+E+LGDA
Sbjct 178 EKLGPTGEWTIAHDEDGLSWSHSHSKSSVALRGPVTGLLLAAVRRKTAADAGLEMLGDAA 237
Query 251 VWQKWLDRTPL 261
VW WL+RTP
Sbjct 238 VWDGWLERTPF 248
>gi|324998628|ref|ZP_08119740.1| hypothetical protein PseP1_07672 [Pseudonocardia sp. P1]
Length=260
Score = 229 bits (583), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 126/252 (50%), Positives = 155/252 (62%), Gaps = 11/252 (4%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
Y+ + + +L+ D + VPTCPGW+L QL RHVGRG RWAAQ+V LD
Sbjct 13 YAEVLVAENDRLADLLETADPTAEVPTCPGWTLLQLLRHVGRGHRWAAQMVASGATEGLD 72
Query 73 PRSVEGGKPPPD-PDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHE 131
PR V GGKPP P+ A WL GA L+DAV G + PVWTF GPRP+ WWVRRRLHE
Sbjct 73 PREVVGGKPPEGGPEVAAQWLRDGADELLDAVVAAGPQAPVWTFTGPRPSAWWVRRRLHE 132
Query 132 VAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLP----LEDDDTLHLH 187
VHRAD AI +G F + P +AADG+SE+L+ + + P P L TLHLH
Sbjct 133 ATVHRADAAIALGTPFEIAPALAADGLSEWLDLLTAR------PAPDEPALAPGATLHLH 186
Query 188 ATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLG 247
ATD GL AGEW VR + V W HGKGA A+RG A +LL ++RR+ D +++LG
Sbjct 187 ATDDGLGPAGEWLVRAESGRVVWEPGHGKGAAAVRGTAADLLQGVLRRIPADDARLDVLG 246
Query 248 DAGVWQKWLDRT 259
D VWQ WL RT
Sbjct 247 DRQVWQDWLART 258
>gi|158316801|ref|YP_001509309.1| hypothetical protein Franean1_5043 [Frankia sp. EAN1pec]
gi|158112206|gb|ABW14403.1| protein of unknown function DUF1503 [Frankia sp. EAN1pec]
Length=246
Score = 219 bits (559), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 124/249 (50%), Positives = 157/249 (64%), Gaps = 5/249 (2%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY++ L Q +L+ D S PVPTCPGW L QL RHVGR DRWAA +VR R
Sbjct 1 MDYAAGLLAQNRLLTDLLGEADLSRPVPTCPGWDLTQLMRHVGRFDRWAAAMVRTRATEV 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
LDPR++EGGKPP D A++WL +LL++AV + PVWTF GPRPA WWVRRR+H
Sbjct 61 LDPRTIEGGKPPADRGGALAWLQESPQLLLEAV-AVDPDVPVWTFTGPRPARWWVRRRMH 119
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
E +HR D A+ +G LE AADGISE+L +A + G+ P D T+HLHATD
Sbjct 120 EAMIHRVDAALALGVGHPLEAAFAADGISEWLCLLAARPGAAILP----DGATVHLHATD 175
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
GL GEW +R G+ W H H KG VA+RG A +LLLA++RR+ D +E+LG+
Sbjct 176 EGLGIEGEWAIRGGADGIGWEHAHEKGDVAVRGTAADLLLALLRRIPGGDGRLEVLGEQE 235
Query 251 VWQKWLDRT 259
W WL T
Sbjct 236 RWTNWLANT 244
>gi|312197499|ref|YP_004017560.1| hypothetical protein FraEuI1c_3683 [Frankia sp. EuI1c]
gi|311228835|gb|ADP81690.1| protein of unknown function DUF1503 [Frankia sp. EuI1c]
Length=244
Score = 217 bits (553), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/251 (52%), Positives = 164/251 (66%), Gaps = 7/251 (2%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+D+++A +EQ F +L+ + D +TPVPTCPGW L QL RHVGRG RWAA +V R
Sbjct 1 MDHAAALVEQNDLFADLLGDADLATPVPTCPGWDLTQLMRHVGRGHRWAAAMVEARAVDI 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
+DPR+V GGKPP D A++WL LL+DAV + PVWTF GPRPA WWVRRRL+
Sbjct 61 IDPRTVAGGKPPAD--GAVAWLRESPALLLDAV-AVDPDAPVWTFTGPRPAHWWVRRRLY 117
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATD 190
E VHR D A+ +G +T+EP +AADG+SE+L +A + L D TLHLHATD
Sbjct 118 EAVVHRVDAALALGTGYTVEPALAADGVSEWLGLLAARPDG----TALRDGATLHLHATD 173
Query 191 PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG 250
GL GEWTVR G+TW H HGKG A+R A +LLLA++RRL D +E++GD G
Sbjct 174 GGLGSDGEWTVRGGPGGITWDHGHGKGDTAVRAAAADLLLALLRRLPADDGSLEIVGDDG 233
Query 251 VWQKWLDRTPL 261
+W WL T
Sbjct 234 LWTGWLANTAF 244
>gi|312197979|ref|YP_004018040.1| hypothetical protein FraEuI1c_4169 [Frankia sp. EuI1c]
gi|311229315|gb|ADP82170.1| protein of unknown function DUF1503 [Frankia sp. EuI1c]
Length=247
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/250 (48%), Positives = 153/250 (62%), Gaps = 6/250 (2%)
Query 11 VDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHF 70
+DY + LEQ +L+ D STPVPTCPGW+L Q+ RHVGR RWAA IVR R
Sbjct 1 MDYGALLLEQNRLLADLLGEADWSTPVPTCPGWTLTQVMRHVGRAPRWAATIVRARAQEV 60
Query 71 LDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTF-LGPRPAGWWVRRRL 129
+DPR EGG+PP D D A++W G RLL++AV + VWT G +PA WWVRR L
Sbjct 61 VDPRGAEGGRPPGDRDGALAWFQQGPRLLLEAVADD-PDARVWTTAAGLQPARWWVRRML 119
Query 130 HEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHAT 189
HE +HR D AI +G + +EP +AADGISE+L+ + G GT + L D T+ LHAT
Sbjct 120 HEAVIHRVDAAIALGVDHPIEPVLAADGISEWLD---LMVGLSGTAM-LRDGSTMRLHAT 175
Query 190 DPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDA 249
D GL GEWT+R + W H HG G VAL G A +LLLA++RR+ D + + G+
Sbjct 176 DVGLGADGEWTIRGGLSRIEWEHGHGVGDVALSGNAADLLLAVMRRIPGDDGRLVIAGER 235
Query 250 GVWQKWLDRT 259
W WL T
Sbjct 236 EHWTTWLANT 245
>gi|331698178|ref|YP_004334417.1| hypothetical protein Psed_4407 [Pseudonocardia dioxanivorans
CB1190]
gi|326952867|gb|AEA26564.1| Conserved hypothetical protein CHP03083 [Pseudonocardia dioxanivorans
CB1190]
Length=261
Score = 210 bits (535), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 109/239 (46%), Positives = 152/239 (64%), Gaps = 3/239 (1%)
Query 24 FGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPP 83
F +L+R+ D PVPTCPGW++ L HV RGDRWAA IV R +DPR+V G+ P
Sbjct 25 FADLVRDADPELPVPTCPGWTMRTLGTHVARGDRWAAAIVATRATEPVDPRTVADGRAPK 84
Query 84 DPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITV 143
D+ +W+ GG L +AV+ G +TPVWTF GP+PA WW+RRRLHE VHRAD A+
Sbjct 85 PVDEFGAWMRGGVAALAEAVDSVGPDTPVWTFTGPKPAAWWLRRRLHEQTVHRADAALAT 144
Query 144 GGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDP-GLLEAGEWTVR 202
GG F ++P +AADG+SE+L+ + V P+ L + T+HLH+ D GL AGEW +R
Sbjct 145 GGSFDIDPAIAADGLSEWLD-LLVARTQREEPV-LGEGRTIHLHSHDADGLGSAGEWVIR 202
Query 203 RDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL 261
++W H H K VA+RG +L +AM+ R+ +D +E+LG+ V++ +L TP
Sbjct 203 PHGTAISWEHGHEKATVAVRGSVADLFIAMLGRIDPSDPRLEVLGERTVFESFLAATPF 261
>gi|288918071|ref|ZP_06412429.1| protein of unknown function DUF1503 [Frankia sp. EUN1f]
gi|288350589|gb|EFC84808.1| protein of unknown function DUF1503 [Frankia sp. EUN1f]
Length=260
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/247 (44%), Positives = 137/247 (56%), Gaps = 7/247 (2%)
Query 16 AYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRS 75
A L +T +L R+ D +TPVPTCPGW+L QL HVG RW A +V R +D +
Sbjct 20 ALLTETDLLADLYRDRDPTTPVPTCPGWTLAQLVAHVGGAHRWTATMVTHRSTENIDYAT 79
Query 76 VEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWT-FLGPRPAGWWVRRRLHEVAV 134
V + P D A+ WL AR ++ AV+ TG E PVWT F G RPA WW+RRRLHEV
Sbjct 80 VPDVRRPHDQQAAVEWLRDSARQIITAVDATGAEVPVWTPFAGLRPAQWWIRRRLHEVTG 139
Query 135 HRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLL 194
HRAD + +G + + P VAADG+SE L+ IA A TPL E+ TL T G
Sbjct 140 HRADALLALGRDVVMAPAVAADGLSELLDLIASGAPWFATPLDDENTLTLTATDTAAG-- 197
Query 195 EAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQK 254
W++ R VTW+ V + G A +L L +RR+S ADT + + GD V
Sbjct 198 ----WSITRSGDTVTWTGVPAAATVTVSGAAVDLYLLALRRISAADTRLTVSGDPKVLDT 253
Query 255 WLDRTPL 261
WLDRT
Sbjct 254 WLDRTAF 260
>gi|337765417|emb|CCB74126.1| conserved protein of unknown function [Streptomyces cattleya
NRRL 8057]
Length=260
Score = 156 bits (394), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/251 (39%), Positives = 129/251 (52%), Gaps = 7/251 (2%)
Query 16 AYLEQTHA----FGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFL 71
AYL Q A E++R+ D VPTCP WSL +L H+G RW Q+V R L
Sbjct 8 AYLSQLTAEADRLREVLRDADPGAHVPTCPDWSLAELIGHLGGVHRWVTQVVTTRAQEPL 67
Query 72 DPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHE 131
V G +PP D + WL G LV A+ + G +T VW++ G +W RR + E
Sbjct 68 RRDLVAGDEPPKDAEGLARWLGDGVTPLVAALREAGPDTRVWSWAGVPTTAFWSRRMVLE 127
Query 132 VAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---DDDTLHLHA 188
VHRAD AI + + +AAD I E+LE +A ++ P E D + LHLHA
Sbjct 128 TLVHRADAAIALQRPYDAPAELAADAIDEWLELMASESALRFRPQLAELRGDGERLHLHA 187
Query 189 TDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGD 248
TD EW V R +G+TW H KG VALR T+L LA RRL + +E++GD
Sbjct 188 TDAPAQLNAEWVVERTPQGITWRREHAKGDVALRAPLTDLFLAFHRRLPLDHERLEIIGD 247
Query 249 AGVWQKWLDRT 259
+ WL+ T
Sbjct 248 RALLDHWLEHT 258
>gi|182438985|ref|YP_001826704.1| hypothetical protein SGR_5192 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|326779639|ref|ZP_08238904.1| hypothetical protein CHP03083 [Streptomyces cf. griseus XylebKG-1]
gi|178467501|dbj|BAG22021.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|326659972|gb|EGE44818.1| hypothetical protein CHP03083 [Streptomyces griseus XylebKG-1]
Length=259
Score = 151 bits (381), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 100/255 (40%), Positives = 131/255 (52%), Gaps = 11/255 (4%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
Y L Q A ++ D + VPTCP W+L +L HVG RW +IVR R +
Sbjct 9 YCDEILTQNDALRAVLTGADLTATVPTCPDWTLRELAVHVGGAHRWVGEIVRTRAAEEVP 68
Query 73 PRSVEGGKPPPD--PDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLH 130
+V G + P P +WL GA V A+ + G + VW++ R A +W RR H
Sbjct 69 EETVPGFEGPDGDGPAALDAWLAEGAADTVAALREAGPDAEVWSWAWERRAAFWARRITH 128
Query 131 EVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---DDDTLHLH 187
EVAVHRAD A+ G +T++ +VAAD I E+L RI + G P E +LHLH
Sbjct 129 EVAVHRADAALAAGVPYTVDADVAADTIEEWL-RIVSFSQDDGDPEAAELRGGGRSLHLH 187
Query 188 ATD-PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELL 246
ATD PG EW + E TW H HGK VALR T+L+L RRL +E+L
Sbjct 188 ATDVPG----AEWLIEFGEERFTWRHAHGKATVALRAPLTDLMLVFNRRLEPTSPRVEVL 243
Query 247 GDAGVWQKWLDRTPL 261
GDA + WL R+
Sbjct 244 GDAALLDFWLARSSF 258
>gi|256390592|ref|YP_003112156.1| hypothetical protein Caci_1392 [Catenulispora acidiphila DSM
44928]
gi|256356818|gb|ACU70315.1| protein of unknown function DUF1503 [Catenulispora acidiphila
DSM 44928]
Length=272
Score = 145 bits (367), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 94/254 (38%), Positives = 131/254 (52%), Gaps = 6/254 (2%)
Query 8 LAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRL 67
L D +A E+ + + D + PVPTCPGW++ ++ RH+G RWAA IVR
Sbjct 15 LVHTDRFTAEAERVATLLDGLGTDDWTRPVPTCPGWTVRKVARHIGTAHRWAAAIVRSPG 74
Query 68 DHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRR 127
++PRS++ G P + + W+ GA L AV + G + PVW++ + A +W RR
Sbjct 75 SEAVNPRSLDLGFPESNAGYS-DWIRAGAAELAHAVREAGPDKPVWSWGPDQHARFWARR 133
Query 128 RLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE-DDDTLHL 186
LHE +H AD+ + +G +P VAADGI EFL + A L D +TLHL
Sbjct 134 MLHETTMHGADMIMALGRTPEFDPAVAADGIDEFLTVLPSAAAFSPKIRALTGDGETLHL 193
Query 187 HATD----PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTG 242
HATD E EW + + G W H KGA A+RG EL L + RR S
Sbjct 194 HATDADPASAAGERAEWLITLEPNGFRWRRAHAKGAAAVRGPVGELYLFLWRRRSPGAQE 253
Query 243 IELLGDAGVWQKWL 256
IE+LGD + W+
Sbjct 254 IEVLGDHVLVDHWV 267
>gi|239987303|ref|ZP_04707967.1| hypothetical protein SrosN1_08367 [Streptomyces roseosporus NRRL
11379]
gi|291444261|ref|ZP_06583651.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
gi|291347208|gb|EFE74112.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
Length=259
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/262 (39%), Positives = 134/262 (52%), Gaps = 11/262 (4%)
Query 6 SSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRD 65
+SL+ Y L QT A ++ D VP+CP W+L +L HVG RW +IVR
Sbjct 2 TSLSHDRYCDEILAQTDALRAVLTGADLGVTVPSCPDWTLRELAVHVGGAHRWVGEIVRT 61
Query 66 RLDHFLDPRSVEGGKPPPDPDDAI--SWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGW 123
R V G + P D A +WL GA + V A+ + G + VWT++ + +
Sbjct 62 RATEEFPEDKVPGFEGPDSEDPAALDAWLAEGAAVTVAALREAGPDAEVWTWVTEQRTAF 121
Query 124 WVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---D 180
W RR HE AVHRAD A+ + ++ VAAD I E+L +A+ A G P E
Sbjct 122 WARRMTHETAVHRADAALAARAPYEVDAEVAADTIEEWLGIVAL-AQEEGDPEAAELRGG 180
Query 181 DDTLHLHATD-PGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVA 239
+LHLHATD PG EW + + TW H H K VALRG T+L+L RRL
Sbjct 181 GRSLHLHATDVPG----AEWLIEFGDERFTWRHAHEKATVALRGTLTDLMLVFNRRLKPT 236
Query 240 DTGIELLGDAGVWQKWLDRTPL 261
D +E+LGDA + WLDR+
Sbjct 237 DPRVEVLGDAALLDFWLDRSSF 258
>gi|134098435|ref|YP_001104096.1| hypothetical protein SACE_1859 [Saccharopolyspora erythraea NRRL
2338]
gi|291003348|ref|ZP_06561321.1| hypothetical protein SeryN2_02344 [Saccharopolyspora erythraea
NRRL 2338]
gi|133911058|emb|CAM01171.1| protein of unknown function DUF1503 [Saccharopolyspora erythraea
NRRL 2338]
Length=267
Score = 140 bits (354), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/243 (38%), Positives = 122/243 (51%), Gaps = 6/243 (2%)
Query 20 QTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGG 79
QT + D + PVPTCPGW+LGQL RHVG RW ++VR R ++ R +
Sbjct 16 QTDLLRSAVAGADLTAPVPTCPGWNLGQLLRHVGAAHRWVEEVVRTRASEPVEER-INDL 74
Query 80 KPPPDPDDAI--SWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRA 137
D D A+ +WL GA L + + + G + VWT +W RR +HE AVHR
Sbjct 75 AGYTDEDAAVLDAWLADGAARLAETLREAGPDARVWTVAPGGTPVFWARRMVHETAVHRC 134
Query 138 DVAITVGGEFTLEPNVAADGISEFLE---RIAVQAGSGGTPLPLEDDDTLHLHATDPGLL 194
D A+ G EF ++ VA D + E+++ V S G L +LHLHATD
Sbjct 135 DAALVAGAEFDVDAEVAVDALDEWMDFGTLAQVFEESPGIRDLLGPGRSLHLHATDAPPE 194
Query 195 EAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQK 254
EW V VTW H K AVA+RG + LLL + R +E++GDA ++
Sbjct 195 AGAEWLVDLSGEPVTWRRAHEKAAVAVRGPLSGLLLTIYGRKPAPGAEVEIVGDAELFHA 254
Query 255 WLD 257
WLD
Sbjct 255 WLD 257
>gi|271970204|ref|YP_003344400.1| hypothetical protein Sros_9026 [Streptosporangium roseum DSM
43021]
gi|270513379|gb|ACZ91657.1| hypothetical protein Sros_9026 [Streptosporangium roseum DSM
43021]
Length=262
Score = 139 bits (350), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/257 (39%), Positives = 129/257 (51%), Gaps = 12/257 (4%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
Y + QT EL++ D S VPTCPGW+L L RH+G R VR + D
Sbjct 9 YCDEIITQTDLLRELLKGADLSADVPTCPGWTLAGLVRHIGGNLRTGETAVRTG-ETIDD 67
Query 73 PRSVEGGKPPPDPDDAI---SWLYGGARLLVDAVEQTG--VETPVWTFLGPRPAGWWVRR 127
P G PD DD +WL GA + + G E +WTF G +WVRR
Sbjct 68 PGKQVPGVAGPDGDDPAELDAWLAEGAARYAGTLREAGPDAEARIWTFQGS--TAFWVRR 125
Query 128 RLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---DDDTL 184
LH++A+HRAD A VG +TL P VAAD + E LE Q +GG+P E ++
Sbjct 126 GLHDLAIHRADAAAAVGAGYTLAPEVAADAVDELLELFRGQQ-AGGSPGLAELRGPGRSI 184
Query 185 HLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIE 244
HLHATD G EW + G TW H K VALRG T++L + RRL +E
Sbjct 185 HLHATDTGAELDAEWLIEFGADGFTWRRGHAKATVALRGPLTDVLRVLYRRLPADSERVE 244
Query 245 LLGDAGVWQKWLDRTPL 261
+LG+A + WL+R L
Sbjct 245 VLGEAALLDFWLERASL 261
>gi|297156087|gb|ADI05799.1| hypothetical protein SBI_02678 [Streptomyces bingchenggensis
BCW-1]
Length=241
Score = 137 bits (346), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 90/259 (35%), Positives = 130/259 (51%), Gaps = 28/259 (10%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
+S L Q AF + + D PVPTCP W L L H+G+ RWAA IVR
Sbjct 1 MASGLLAQIAAFADAVDGADWDAPVPTCPEWPLRVLVGHLGQAPRWAAGIVR-------- 52
Query 73 PRSVEGGKPP--PDPDDAI------SWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWW 124
GG P PDP +A+ +WL GA LV+AV G TPVWT GP PA +W
Sbjct 53 -----GGSPDGIPDPREAVPPQNWRAWLLAGASELVEAVRAIGPGTPVWTLTGPGPASFW 107
Query 125 VRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTP--LPLEDDD 182
+R+ H+ +VH D A+ G + LEP++AAD +++ LE ++ P L
Sbjct 108 LRQAAHDTSVHAVDAALLAGVPYALEPDLAADAVTQCLELLSSPVAEALKPAVAALRGAG 167
Query 183 TLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTG 242
++ L ++ G +E W + R + GV+W G+ V + G +LLL ++RRL
Sbjct 168 SIGLRPSE-GAIEG--WVITRTQTGVSWRRGPGRADVTVTGAVEDLLLVLMRRLPPQHVA 224
Query 243 IELLGDAGVWQKWLDRTPL 261
I+ GD ++ WL + L
Sbjct 225 ID--GDGQLFDHWLAHSAL 241
>gi|302543297|ref|ZP_07295639.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC
53653]
gi|302460915|gb|EFL24008.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC
53653]
Length=266
Score = 134 bits (338), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/259 (36%), Positives = 128/259 (50%), Gaps = 15/259 (5%)
Query 6 SSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRD 65
++LA Y + L QT + D + PVP+CPGW+LGQL RH+G WA +VR
Sbjct 13 TTLAFDRYRTEILHQTALLRSYLTEADPTAPVPSCPGWNLGQLVRHLGGAHGWAEMVVRT 72
Query 66 RLDHFL--DPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAG- 122
R + DP + + DP + L GA L DA+ + G + PVWT P P G
Sbjct 73 RSTEPVPDDPVNDVPLRTGEDPATLSTRLGDGAGRLADALHKAGPDRPVWT---PGPGGT 129
Query 123 --WWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED 180
+W RR HE +HRAD A+ VG F L +A D + E+L + GTP L
Sbjct 130 AMFWARRMTHETVIHRADAALAVGASFQLAEEIALDALDEWLTYSTLPEAYEGTPALLGP 189
Query 181 DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVAD 240
T+ LHATD G +W + T H + A+ LRG T+LLL + RR + +
Sbjct 190 GRTVCLHATDTG----SDWLIDLTGEAPTLHHTAQEAAIELRGTLTDLLLLVYRRPAPS- 244
Query 241 TGIELLGDAGVWQKWLDRT 259
+++ GD + WL R+
Sbjct 245 --VKVTGDTALLDLWLTRS 261
>gi|134099833|ref|YP_001105494.1| hypothetical protein SACE_3294 [Saccharopolyspora erythraea NRRL
2338]
gi|291006132|ref|ZP_06564105.1| hypothetical protein SeryN2_16558 [Saccharopolyspora erythraea
NRRL 2338]
gi|133912456|emb|CAM02569.1| protein of unknown function DUF1503 [Saccharopolyspora erythraea
NRRL 2338]
Length=263
Score = 132 bits (333), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/264 (38%), Positives = 126/264 (48%), Gaps = 11/264 (4%)
Query 6 SSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRD 65
S L Y + QT D +TPVP+CPGW+LGQL RH+G RW +IVR
Sbjct 2 SGLDYQRYCDEIVAQTDLLRTTTAKADMTTPVPSCPGWNLGQLLRHLGGCHRWVERIVRT 61
Query 66 RLDHFLDPRSVEGGKPPPDPDDAI--SWLYGGARLLVDAVEQTGVETPVWTFLGPRPAG- 122
R FL D D A+ WL GA LL DA+ G VW+ P P G
Sbjct 62 RSAEFLPDDDFRDLTQYTDEDAAVLDGWLAEGAALLADALRAAGPRAQVWS---PVPGGG 118
Query 123 --WWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE- 179
++ RR HE VHRAD + VG F + VA D + E++E ++ P E
Sbjct 119 TPFFARRMAHETVVHRADATLAVGNSFEVREQVALDCLDEWMELGSLPQMFEFHPEQREL 178
Query 180 --DDDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLS 237
TLHLHATD GEW V +TW H K AVA+RG T+LLL + +R S
Sbjct 179 LGPGRTLHLHATDTAPEARGEWVVDLTGDAITWRRAHEKCAVAVRGPLTDLLLVVYKRQS 238
Query 238 VADTGIELLGDAGVWQKWLDRTPL 261
+E++GD + WL+R
Sbjct 239 PRAGSVEVIGDTELLDFWLERVSF 262
>gi|302558119|ref|ZP_07310461.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
gi|302475737|gb|EFL38830.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=265
Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/239 (40%), Positives = 122/239 (52%), Gaps = 18/239 (7%)
Query 32 DQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDA--- 88
D S VPT P WSL QL RHVG RW +IV + V G P + DA
Sbjct 29 DLSGTVPTTPDWSLEQLVRHVGGALRWVERIVATGAREEIPEDRVPGFAGPAERGDAGAL 88
Query 89 ISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFT 148
+WL L+V A+ + G + VW++ G G+W RR HEV VHRAD + G +
Sbjct 89 DAWLAESGELVVGALRRAGPDAQVWSWAGIHNTGFWARRVTHEVTVHRADATLAAGLPYE 148
Query 149 LEPNVAADGISEFLERIAVQAGSGGTPLPLEDDD---------TLHLHATDPGLLEAGEW 199
+ P+ AAD I E+LE + + L DD TLHLHATD G EW
Sbjct 149 VAPDAAADAIDEWLEIVEWAQRT------LPDDTVHGLRGPRRTLHLHATDAGPGIDAEW 202
Query 200 TVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDR 258
+ DE GV+W H K VALRG T +LLA RRL + G+E+LGD + + WL+R
Sbjct 203 LIELDEDGVSWRRGHEKATVALRGPLTSVLLAFYRRLPLDAPGLEVLGDRKLLELWLER 261
>gi|297204434|ref|ZP_06921831.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197715824|gb|EDY59858.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=260
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/257 (36%), Positives = 121/257 (48%), Gaps = 17/257 (6%)
Query 12 DYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFL 71
+Y A + QT ++ D + VPTCPGW LG+L RHVG RWA +IVR R +
Sbjct 6 EYCDAIVAQTDLLTRHVKGADPAAQVPTCPGWDLGRLLRHVGGDHRWAEEIVRTRATGPI 65
Query 72 DPRSVEGGKPPPDPDDAI--SWLYGGARLLVDAVEQTGVETPVWT----FLGPRPAGWWV 125
D V DD WL GA L + G + PVWT L + A +W
Sbjct 66 DDDPVNDPAAYAGLDDCAIGGWLVEGATRLAGTLRAAGPDVPVWTPADEQLVQQSAMFWA 125
Query 126 RRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSG---GTPLPLEDDD 182
RR +E +HRAD A+ G EF +E ++A D + E+LE V G P L +
Sbjct 126 RRMTYETLLHRADAALVTGAEFVVEESLAVDAVEEWLEFSTVPEAYDPLPGLPELLGNGR 185
Query 183 TLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTG 242
TL L A AG+W + W G AV++RG T+LLL + R + G
Sbjct 186 TLGLDAG-----AAGQWLLDLGGDRPVWRRGTGAAAVSVRGPVTDLLLFLYARPA---PG 237
Query 243 IELLGDAGVWQKWLDRT 259
+E GD+ + WL RT
Sbjct 238 VETRGDSELLDLWLRRT 254
>gi|300783549|ref|YP_003763840.1| hypothetical protein AMED_1626 [Amycolatopsis mediterranei U32]
gi|299793063|gb|ADJ43438.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524936|gb|AEK40141.1| hypothetical protein RAM_08255 [Amycolatopsis mediterranei S699]
Length=256
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/261 (35%), Positives = 126/261 (49%), Gaps = 17/261 (6%)
Query 7 SLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDR 66
SL+ ++A + FG I VPTCP W+L L HVG +A I+ R
Sbjct 3 SLSHERLAAALGTEAERFGMAIAGAAPDLRVPTCPEWTLRDLTCHVGIAYYKSAAIIASR 62
Query 67 LDHFLDPRSVEGGKPPPDPDDAIS-WLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWV 125
++ +V +PP +A+ WL GA+ LV V + G ETP T+ R AG+W
Sbjct 63 STGYVPFEAVTIDEPPAF--EALGGWLRDGAQRLVATVAEVGPETPTSTWSPDRRAGFWT 120
Query 126 RRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLED----- 180
RR HE VHRAD A G + ++ ++AADGISE L + A P D
Sbjct 121 RRLTHETVVHRADAAFATGTPYDVDADLAADGISEGL---GLAAAFSRLQHPALDRTSLR 177
Query 181 --DDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSV 238
+TL HAT+P + W VRR GV + + V + G A +LLLA+ RL+
Sbjct 178 GTGETLLFHATEPDV----HWLVRRTPSGVEVAQEAAEADVVVEGRAADLLLALTERLAA 233
Query 239 ADTGIELLGDAGVWQKWLDRT 259
D + + GDA ++ W + T
Sbjct 234 DDARLTVSGDAALFHHWRENT 254
>gi|289768817|ref|ZP_06528195.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289699016|gb|EFD66445.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length=266
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 90/251 (36%), Positives = 121/251 (49%), Gaps = 11/251 (4%)
Query 19 EQTHAFGEL----IRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPR 74
E H G L + + VPTCP W+L L RHVGR RW IV R + +
Sbjct 12 EIVHQVGRLRAVVTSGAELTATVPTCPDWTLEDLVRHVGRALRWTGLIVGTRAEQDVPVD 71
Query 75 SVEGGKPPPDPDDAISWLYG---GARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHE 131
G P DA + ++V A+ + G + W++ G AG+W RR HE
Sbjct 72 RAPGAGGPAASGDAAALDAWLAESGEVVVGALREAGPDARAWSWAGVGTAGFWARRMTHE 131
Query 132 VAVHRADVAITVGGEF-TLEPNVAADGISEFLE--RIAVQAGSGGTPLPLED-DDTLHLH 187
+ VH AD A+ G + P VAAD I E+L+ R +A G L +LHLH
Sbjct 132 LVVHGADAALAAGLPHRAVAPEVAADAIDEWLDIVRFVQRALPGAAANELRAPGSSLHLH 191
Query 188 ATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLG 247
ATD EW V + G+TW H K VALRG T++LLA RLS G+E+LG
Sbjct 192 ATDTAAELNAEWIVELPDDGITWRRGHEKATVALRGPLTDVLLAFYGRLSPDAPGLEVLG 251
Query 248 DAGVWQKWLDR 258
D + + WL++
Sbjct 252 DRKLLELWLEK 262
>gi|21224000|ref|NP_629779.1| hypothetical protein SCO5649 [Streptomyces coelicolor A3(2)]
gi|3319737|emb|CAA19903.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
Length=266
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 91/254 (36%), Positives = 123/254 (49%), Gaps = 12/254 (4%)
Query 17 YLEQ-THAFGEL----IRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFL 71
Y E+ H G L + + VPTCP W+L L RHVGR RW IV R + +
Sbjct 9 YCEEIVHQVGRLRAVVTSGAELTATVPTCPDWTLEDLVRHVGRALRWTGLIVGTRAEQDV 68
Query 72 DPRSVEGGKPPPDPDDAISWLYG---GARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRR 128
G P DA + ++V A+ + G + W++ G AG+W RR
Sbjct 69 PVDRAPGAGGPAASGDAAALDAWLAESGEVVVGALREAGPDARAWSWAGVGTAGFWARRM 128
Query 129 LHEVAVHRADVAITVGGEF-TLEPNVAADGISEFLE--RIAVQAGSGGTPLPLED-DDTL 184
HE+ VH AD A+ G + P VAAD I E+L+ R +A G L +L
Sbjct 129 THELVVHGADAALAAGLPHRAVAPEVAADAIDEWLDIVRFVQRALPGAAANELRAPGSSL 188
Query 185 HLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIE 244
HLHATD EW V + G+TW H K VALRG T++LLA RLS G+E
Sbjct 189 HLHATDTAAELNAEWIVELPDDGITWRRGHEKATVALRGPLTDVLLAFYGRLSPDAPGLE 248
Query 245 LLGDAGVWQKWLDR 258
+LGD + + WL++
Sbjct 249 VLGDRKLLELWLEK 262
>gi|312138927|ref|YP_004006263.1| hypothetical protein REQ_15000 [Rhodococcus equi 103S]
gi|311888266|emb|CBH47578.1| hypothetical protein REQ_15000 [Rhodococcus equi 103S]
Length=251
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/232 (35%), Positives = 117/232 (51%), Gaps = 15/232 (6%)
Query 25 GELIRNVDQST---PVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKP 81
G+L+ + T P+PT P W++ + RH G+ W A +R D P +
Sbjct 14 GDLLADTPTETLAEPIPTVPEWTVEHVLRHTGKVHLWVAAALRS--DPQTPPSEIRRIGD 71
Query 82 PPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADV-- 139
P + ++ L++ ++ G + V T +GP P WWVRR+ HEVAVHR DV
Sbjct 72 MPRGPECVAAYRAALDLVLAEFDRLGADRIVPTMVGPAPVAWWVRRQAHEVAVHRIDVSD 131
Query 140 AITVGG---EFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEA 196
AI+ GG +L+P VAADG+ E++ + G + ++HLH TD A
Sbjct 132 AISAGGGPDVPSLDPQVAADGVDEWVSVFLARLADAGRMPETVNGHSIHLHGTD---AVA 188
Query 197 GEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGD 248
EW + D V + H KG VALRG A ELLL + RR + G++++GD
Sbjct 189 AEWYLEFDGGTVAVTREHRKGDVALRGSAQELLLTLWRRRPL--DGLDIVGD 238
>gi|325676650|ref|ZP_08156326.1| hypothetical protein HMPREF0724_14109 [Rhodococcus equi ATCC
33707]
gi|325552540|gb|EGD22226.1| hypothetical protein HMPREF0724_14109 [Rhodococcus equi ATCC
33707]
Length=251
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/232 (35%), Positives = 117/232 (51%), Gaps = 15/232 (6%)
Query 25 GELIRNVDQST---PVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKP 81
G+L+ + T PVPT P W++ + RH G+ W A +R D P +
Sbjct 14 GDLLADTPTETLAEPVPTVPEWTVEHVLRHTGKVHLWVAAALRS--DPQTPPSEIRRIGD 71
Query 82 PPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADV-- 139
P + ++ L++ ++ G + V T +GP P WWVRR+ HEVAVHR DV
Sbjct 72 MPRGPECVAAYRAALDLVLAEFDRLGADRIVPTMVGPAPVAWWVRRQAHEVAVHRIDVSD 131
Query 140 AITVGG---EFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEA 196
AI+ GG +L+P VAADG+ E++ + G + ++HLH TD A
Sbjct 132 AISAGGGPDVPSLDPQVAADGVDEWVSVFLARLADAGRMPETVNGHSIHLHGTD---AVA 188
Query 197 GEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGD 248
EW + D V + H KG VALRG A ELLL + RR + G++++GD
Sbjct 189 AEWYLEFDGGTVAVTREHRKGDVALRGSAQELLLTLWRRRPL--DGLDVVGD 238
>gi|302530682|ref|ZP_07283024.1| predicted protein [Streptomyces sp. AA4]
gi|302439577|gb|EFL11393.1| predicted protein [Streptomyces sp. AA4]
Length=259
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/195 (36%), Positives = 96/195 (50%), Gaps = 12/195 (6%)
Query 12 DYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFL 71
+ + L QT + D++ V CP W+LGQL HV G RWA + VR R H+L
Sbjct 12 ERCAEILRQTELLAAAVEGADRTARVAACPEWNLGQLLEHVSTGHRWAEETVRTRARHWL 71
Query 72 DPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFL--GPRPAGWWVRRRL 129
+ P P SWL GA+ LV + + G + V+T + GP A ++ RR +
Sbjct 72 PDDELRNPVDTPRP---ASWLVDGAKALVATLREAGPDAEVFTPVPNGPPRAAFYARRFM 128
Query 130 HEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---DDDTLHL 186
+E +HRAD + GGEFT+ P VA D + E+LE ++ P E D T+HL
Sbjct 129 NETLIHRADATLAAGGEFTVTPEVAHDAMEEWLELGSLPQLLEFVPERRELLGPDRTIHL 188
Query 187 HATDPGLLEAGEWTV 201
TD A WTV
Sbjct 189 APTD----HAASWTV 199
>gi|333921310|ref|YP_004494891.1| hypothetical protein AS9A_3653 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483531|gb|AEF42091.1| hypothetical protein AS9A_3653 [Amycolicicoccus subflavus DQS3-9A1]
Length=247
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/259 (34%), Positives = 125/259 (49%), Gaps = 33/259 (12%)
Query 12 DYSSAYLEQTHAFGELIRNVDQST---PVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLD 68
DY +A + + GEL+ + PVPTCPGW+L +L H+GR RWAA + D +
Sbjct 5 DYRAAIVRE----GELMAAQPSDSLDVPVPTCPGWNLERLVGHLGRVHRWAAAYLADGTE 60
Query 69 HFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRR 128
+ G PP D + W +LV+ + +T +TP TF GP A +W RR+
Sbjct 61 AAA---GLSSGNRPPRGADVLPWYKESLEILVEELARTDPDTPADTFAGPGTAAFWFRRQ 117
Query 129 LHEVAVHR--ADVAITVGGEFTLEPNVAADGISE----FLERIAVQAGSG---GTPLPLE 179
HE AVHR A+ A++ G ++ +AADG E F+ RI G G+ L LE
Sbjct 118 AHETAVHRWDAENAVSPGQAGRIDATLAADGSEEWLTVFVPRILSARADGRGSGSSLRLE 177
Query 180 DDDTLHLHATDPGLLEAGEWTVRRDERGVTWSH-RHGKGAVALRGGATELLLAMVRRLSV 238
+T E+ WT+ + G + R G+ LRG A++LLL + RR +
Sbjct 178 CSET-----------ESARWTLTLGDAGPSVRRGRGGEAQAVLRGPASDLLLTVWRRTPL 226
Query 239 ADTGIELLGDAGVWQKWLD 257
+EL GD + LD
Sbjct 227 --DSVELTGDRACAAQILD 243
>gi|239985866|ref|ZP_04706530.1| hypothetical protein SrosN1_01032 [Streptomyces roseosporus NRRL
11379]
gi|291442823|ref|ZP_06582213.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
gi|291345770|gb|EFE72674.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
Length=250
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/233 (34%), Positives = 107/233 (46%), Gaps = 26/233 (11%)
Query 37 VPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGA 96
VPTCPGW + L RH G RWAA + + + + G+P D + ++W G
Sbjct 30 VPTCPGWQIRHLLRHTGMVHRWAAAFIAEGYTAY----HPDSGEPDLDGAELLAWFREGH 85
Query 97 RLLVDAVEQTGVETPVWTFL-GPRPAGWWVRRRLHEVAVHRADVAITVGGEFT-LEPNVA 154
RLLV ++E+ + WTFL P P +W RR+L+E VHR D +GG T + + A
Sbjct 86 RLLVRSLEEAPADLECWTFLPAPSPLAFWSRRQLNETTVHRVDAESALGGPLTPVSADRA 145
Query 155 ADGISEFLERIAVQAGSGGTPLPLEDDD---TLHLHATDPGLLEAGEWTVRRDERGVTWS 211
ADGI E L AG P D TL + A D A WTVR +
Sbjct 146 ADGIDELL------AGFHARPKSRVRSDKPRTLRVRAVD----TAATWTVRISDEPPQAV 195
Query 212 HRHGKGA-----VALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRT 259
G+G+ L G A L L + RL + T + L GD V + W D +
Sbjct 196 RTAGEGSAEDVDCELSGTAEGLYLTLWNRLPL--TAVTLRGDRAVARLWTDNS 246
>gi|254380896|ref|ZP_04996262.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194339807|gb|EDX20773.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=258
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/256 (33%), Positives = 117/256 (46%), Gaps = 14/256 (5%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
+ +A +T F + D STPVPTCPGWSL L RHVG RW +++R R+ H
Sbjct 6 HGAAVAAETAEFVATVTAADLSTPVPTCPGWSLADLTRHVGSVHRWFTELLRQRIQHPPT 65
Query 73 PRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEV 132
R V+ + P D WL A + T ++ P+W + + A +WVRR L E
Sbjct 66 SRVVD-LRLPEHTDALPDWLAMSAAEAAEVFAATDLDAPMWAWGVDQHARFWVRRMLFET 124
Query 133 AVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE---DDDTLHLHAT 189
VHR D + +G ++ +A DGI EFL + QA S PL + D T+ T
Sbjct 125 LVHRVDAQLALGLSPRIDRALAVDGIDEFLTNLP-QAASFA-PLTAQLRAPDRTVRFSCT 182
Query 190 DPGLLEAGEWTVRRDERGVTW-----SHRHGKGAVA-LRGGATELLLAMVRRLSVADTGI 243
D G+W V G H + A A +RG A +LLL + RL
Sbjct 183 DAD--ADGDWLVELRPDGFALVAEVADHSEPRPADATVRGTAADLLLLLYGRLDHRSDAF 240
Query 244 ELLGDAGVWQKWLDRT 259
+LLGD + W +
Sbjct 241 QLLGDTSLLAHWFSHS 256
>gi|29827917|ref|NP_822551.1| hypothetical protein SAV_1376 [Streptomyces avermitilis MA-4680]
gi|29605018|dbj|BAC69086.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=265
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/255 (31%), Positives = 114/255 (45%), Gaps = 9/255 (3%)
Query 6 SSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRD 65
S A VD+ +A +T F ++++ D +T VP CPGW+L L +H G RW + ++R
Sbjct 11 SGFAPVDHRTAVAAETARFVAVVKDADLATAVPGCPGWTLADLVKHTGSVQRWFSVLLRA 70
Query 66 RLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWV 125
R+ R V+ + P + WL A + +A T P+W + + A +W
Sbjct 71 RIQEPPQKREVD-LRFPDEEGGYADWLAESATVAAEAFAATDPNLPMWAWGVDQHARFWA 129
Query 126 RRRLHEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLE-DDDTL 184
RR L E +HRAD + +G T++ +A DGI EFL + A L D T+
Sbjct 130 RRMLFETLLHRADAELALGLRPTIDRPLAVDGIDEFLVNLPFAAFFAPKVANLRGPDRTI 189
Query 185 HLHATDPGLLEAGEWTVRRDERGVTWSHRH---GKGAVALRGGATELLLAMVRRLSVADT 241
ATD +W VR G H A +RG AT+LLL RL
Sbjct 190 RFRATDGD----DDWLVRLRPDGFGLDTTHPTEDTAAATVRGTATDLLLLAYGRLPYDAE 245
Query 242 GIELLGDAGVWQKWL 256
+ GD G+ W
Sbjct 246 ALAHEGDEGLLAHWF 260
>gi|297195727|ref|ZP_06913125.1| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
gi|297152920|gb|EDY62838.2| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
Length=260
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/247 (30%), Positives = 106/247 (43%), Gaps = 6/247 (2%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
Y + F +R+ D +TPV TCPGW+L L H G RWA +VR R +
Sbjct 18 YCESIAHVVADFTAAVRDADPATPVSTCPGWTLADLVEHHGTTHRWAEHVVRTRATEPVL 77
Query 73 PRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETPVWTFLGPRPAGWWVRRRLHEV 132
R V P DP WL GA + + + P+W++ + ++ RR L E
Sbjct 78 AREVPLDLPD-DPSAYPQWLARGAESCLRTLRTVDPDLPMWSYGADQRVAFYPRRLLFEA 136
Query 133 AVHRADVAITVGGEFTLEPNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPG 192
+H AD + +G E +EP AADGI+EFLE + + + PL ++ L A D G
Sbjct 137 VIHCADAQLALGQEPRVEPGTAADGIAEFLENLPRRTRTTERQAPLA-GGSVRLLARDTG 195
Query 193 LLEAGEWTVRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAGVW 252
WT+ G +W+ V + +LLL + R + G V
Sbjct 196 ----AAWTITFGAAGFSWTATAEAADVTVTADVADLLLLLYGRRRPEADRFTVRGGTAVL 251
Query 253 QKWLDRT 259
WL T
Sbjct 252 DAWLSTT 258
>gi|291454435|ref|ZP_06593825.1| conserved hypothetical protein [Streptomyces albus J1074]
gi|291357384|gb|EFE84286.1| conserved hypothetical protein [Streptomyces albus J1074]
Length=267
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/254 (34%), Positives = 112/254 (45%), Gaps = 9/254 (3%)
Query 13 YSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFRHVGRGDRWAAQIVRDRLDHFLD 72
Y QT + + D VPTCP W+L L HVG RW +IVR R +
Sbjct 11 YCDEVTVQTGLLRQALAGADLQARVPTCPEWTLRDLAVHVGGATRWMNEIVRTRASAEVP 70
Query 73 PRSVEGGKPPPDPDDAISWLYGGARLLVDAVEQTGVETP---VWTFLGPRPAGWWVRRRL 129
+V PP D + A A E P +WT+ + + +W RR
Sbjct 71 DEAVPEFAGPPVEDGPGALDAWLAEGAEAAAEALREAGPGRKIWTWSWEQSSSFWARRLT 130
Query 130 HEVAVHRADVAITVGGEFTLEPNVAADGISEFLERIA----VQAGSGGTPLPLEDDDTLH 185
E+ VHRAD I G F + +AAD + E+LE +A VQ L TLH
Sbjct 131 QELLVHRADACIAAGVPFAADAELAADAVDEWLEIVAYVQRVQPADPAGEL-RGGGRTLH 189
Query 186 LHATDPGLLEAGEWTVRRDERGVTWSHRHGKGA-VALRGGATELLLAMVRRLSVADTGIE 244
LHATD GEW + + G H GA V LRG TEL+LA RRL + +E
Sbjct 190 LHATDAAPGVHGEWLIELTDDGFAVRPEHTDGATVELRGPMTELMLAFYRRLPLTSDEVE 249
Query 245 LLGDAGVWQKWLDR 258
+ GD + WL+R
Sbjct 250 VRGDRSFLEFWLER 263
Lambda K H
0.319 0.138 0.442
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 388543189928
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40