BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2229c
Length=245
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609366|ref|NP_216745.1| hypothetical protein Rv2229c [Mycob... 471 3e-131
gi|340627234|ref|YP_004745686.1| hypothetical protein MCAN_22521... 470 8e-131
gi|31793410|ref|NP_855903.1| hypothetical protein Mb2254c [Mycob... 470 1e-130
gi|339295145|gb|AEJ47256.1| hypothetical protein CCDC5079_2066 [... 461 3e-128
gi|183983295|ref|YP_001851586.1| hypothetical protein MMAR_3305 ... 364 5e-99
gi|118617020|ref|YP_905352.1| hypothetical protein MUL_1323 [Myc... 362 2e-98
gi|254822753|ref|ZP_05227754.1| hypothetical protein MintA_22679... 337 8e-91
gi|240172443|ref|ZP_04751102.1| hypothetical protein MkanA1_2422... 330 8e-89
gi|41408079|ref|NP_960915.1| hypothetical protein MAP1981c [Myco... 317 1e-84
gi|118463992|ref|YP_881418.1| hypothetical protein MAV_2210 [Myc... 312 3e-83
gi|296166120|ref|ZP_06848565.1| conserved hypothetical protein [... 303 2e-80
gi|118468416|ref|YP_888583.1| hypothetical protein MSMEG_4306 [M... 281 5e-74
gi|342857251|ref|ZP_08713907.1| hypothetical protein MCOL_00195 ... 275 3e-72
gi|333990305|ref|YP_004522919.1| hypothetical protein JDM601_166... 266 3e-69
gi|120404587|ref|YP_954416.1| hypothetical protein Mvan_3619 [My... 266 3e-69
gi|108800309|ref|YP_640506.1| hypothetical protein Mmcs_3343 [My... 251 5e-65
gi|145223478|ref|YP_001134156.1| hypothetical protein Mflv_2891 ... 243 2e-62
gi|15827865|ref|NP_302128.1| hypothetical protein ML1638 [Mycoba... 239 2e-61
gi|3150238|emb|CAA19218.1| hypothetical protein MLCB1243.37 [Myc... 230 1e-58
gi|169628992|ref|YP_001702641.1| hypothetical protein MAB_1904 [... 213 2e-53
gi|54023607|ref|YP_117849.1| hypothetical protein nfa16390 [Noca... 205 5e-51
gi|312140238|ref|YP_004007574.1| hypothetical protein REQ_28760 ... 196 3e-48
gi|226307146|ref|YP_002767106.1| hypothetical protein RER_36590 ... 192 3e-47
gi|229490271|ref|ZP_04384113.1| zinc ribbon domain protein [Rhod... 192 3e-47
gi|296140405|ref|YP_003647648.1| hypothetical protein Tpau_2711 ... 188 7e-46
gi|134098169|ref|YP_001103830.1| hypothetical protein SACE_1583 ... 188 7e-46
gi|111018195|ref|YP_701167.1| hypothetical protein RHA1_ro01182 ... 185 5e-45
gi|226360321|ref|YP_002778099.1| hypothetical protein ROP_09070 ... 184 1e-44
gi|257054784|ref|YP_003132616.1| Zn-ribbon protein, possibly nuc... 182 4e-44
gi|333918914|ref|YP_004492495.1| hypothetical protein AS9A_1243 ... 182 6e-44
gi|262202986|ref|YP_003274194.1| hypothetical protein Gbro_3095 ... 180 1e-43
gi|300783099|ref|YP_003763390.1| Zn-ribbon protein [Amycolatopsi... 175 5e-42
gi|256375027|ref|YP_003098687.1| hypothetical protein Amir_0880 ... 174 7e-42
gi|302524446|ref|ZP_07276788.1| conserved hypothetical protein [... 174 1e-41
gi|343924132|ref|ZP_08763695.1| hypothetical protein GOALK_002_0... 169 3e-40
gi|302869111|ref|YP_003837748.1| hypothetical protein Micau_4661... 156 2e-36
gi|315504417|ref|YP_004083304.1| hypothetical protein ML5_3639 [... 156 3e-36
gi|296393053|ref|YP_003657937.1| hypothetical protein Srot_0624 ... 154 1e-35
gi|331696747|ref|YP_004332986.1| hypothetical protein Psed_2933 ... 152 3e-35
gi|145595870|ref|YP_001160167.1| hypothetical protein Strop_3356... 152 4e-35
gi|291302272|ref|YP_003513550.1| hypothetical protein Snas_4816 ... 150 1e-34
gi|330469453|ref|YP_004407196.1| hypothetical protein VAB18032_2... 147 1e-33
gi|317506106|ref|ZP_07963931.1| hypothetical protein HMPREF9336_... 147 1e-33
gi|324998689|ref|ZP_08119801.1| Zn-ribbon protein, possibly nucl... 144 8e-33
gi|258653439|ref|YP_003202595.1| hypothetical protein Namu_3275 ... 144 9e-33
gi|238060394|ref|ZP_04605103.1| hypothetical protein MCAG_01360 ... 143 2e-32
gi|271968070|ref|YP_003342266.1| Zn-ribbon domain-containing pro... 142 3e-32
gi|84496761|ref|ZP_00995615.1| Zn-ribbon protein-like protein [J... 139 3e-31
gi|297561846|ref|YP_003680820.1| hypothetical protein Ndas_2904 ... 139 4e-31
gi|302545891|ref|ZP_07298233.1| putative zinc ribbon domain prot... 138 6e-31
>gi|15609366|ref|NP_216745.1| hypothetical protein Rv2229c [Mycobacterium tuberculosis H37Rv]
gi|148662049|ref|YP_001283572.1| hypothetical protein MRA_2248 [Mycobacterium tuberculosis H37Ra]
gi|148823437|ref|YP_001288191.1| hypothetical protein TBFG_12258 [Mycobacterium tuberculosis F11]
67 more sequence titles
Length=245
Score = 471 bits (1213), Expect = 3e-131, Method: Compositional matrix adjust.
Identities = 245/245 (100%), Positives = 245/245 (100%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED
Sbjct 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV
Sbjct 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE
Sbjct 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL
Sbjct 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
Query 241 EGFEE 245
EGFEE
Sbjct 241 EGFEE 245
>gi|340627234|ref|YP_004745686.1| hypothetical protein MCAN_22521 [Mycobacterium canettii CIPT
140010059]
gi|340005424|emb|CCC44584.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=245
Score = 470 bits (1210), Expect = 8e-131, Method: Compositional matrix adjust.
Identities = 244/245 (99%), Positives = 245/245 (100%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED
Sbjct 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV
Sbjct 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
LERREELQAQQTAESRAL+ALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE
Sbjct 121 LERREELQAQQTAESRALEALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL
Sbjct 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
Query 241 EGFEE 245
EGFEE
Sbjct 241 EGFEE 245
>gi|31793410|ref|NP_855903.1| hypothetical protein Mb2254c [Mycobacterium bovis AF2122/97]
gi|121638112|ref|YP_978336.1| hypothetical protein BCG_2247c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990606|ref|YP_002645293.1| hypothetical protein JTY_2241 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31619002|emb|CAD97107.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121493760|emb|CAL72235.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773719|dbj|BAH26525.1| hypothetical protein JTY_2241 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341602150|emb|CCC64824.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=245
Score = 470 bits (1209), Expect = 1e-130, Method: Compositional matrix adjust.
Identities = 244/245 (99%), Positives = 245/245 (100%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED
Sbjct 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV
Sbjct 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE
Sbjct 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILL+L
Sbjct 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLQL 240
Query 241 EGFEE 245
EGFEE
Sbjct 241 EGFEE 245
>gi|339295145|gb|AEJ47256.1| hypothetical protein CCDC5079_2066 [Mycobacterium tuberculosis
CCDC5079]
gi|339298766|gb|AEJ50876.1| hypothetical protein CCDC5180_2039 [Mycobacterium tuberculosis
CCDC5180]
Length=241
Score = 461 bits (1187), Expect = 3e-128, Method: Compositional matrix adjust.
Identities = 240/241 (99%), Positives = 241/241 (100%), Gaps = 0/241 (0%)
Query 5 VAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQ 64
+AQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQ
Sbjct 1 MAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQ 60
Query 65 VSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERR 124
VSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERR
Sbjct 61 VSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERR 120
Query 125 EELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGL 184
EELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGL
Sbjct 121 EELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGL 180
Query 185 YERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGFE 244
YERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGFE
Sbjct 181 YERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGFE 240
Query 245 E 245
E
Sbjct 241 E 241
>gi|183983295|ref|YP_001851586.1| hypothetical protein MMAR_3305 [Mycobacterium marinum M]
gi|183176621|gb|ACC41731.1| conserved protein [Mycobacterium marinum M]
Length=245
Score = 364 bits (935), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 185/244 (76%), Positives = 217/244 (89%), Gaps = 0/244 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA AQQRSLLELAKLDAEL+RIAHR+THLP+ +A++QV+ +H A +DR+AA+RIA ED
Sbjct 1 MKAEAAQQRSLLELAKLDAELSRIAHRSTHLPEGSAFEQVRVQHEAVSDRLAAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QVSR E EIDAVRKR DRDRSLLTSGA DAKQLADLQHEL++L+RRQASLED+LLEV
Sbjct 61 LDAQVSRLEDEIDAVRKREDRDRSLLTSGAVDAKQLADLQHELETLERRQASLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQAQQ E AL+ L+A+L AAQQ++D ALAE+DQ+R +HSS+RD L A+L+P+
Sbjct 121 MERREELQAQQNTEIAALEVLQAELTAAQQSVDAALAELDQSRQEHSSRRDTLAASLNPD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
LA LYER RAGGGPGAG+LQGHRCGACRIEIGR ELA+ISAAAEDEVVRCPEC AILLR+
Sbjct 181 LAALYERLRAGGGPGAGQLQGHRCGACRIEIGRSELARISAAAEDEVVRCPECAAILLRI 240
Query 241 EGFE 244
+G E
Sbjct 241 KGPE 244
>gi|118617020|ref|YP_905352.1| hypothetical protein MUL_1323 [Mycobacterium ulcerans Agy99]
gi|118569130|gb|ABL03881.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=245
Score = 362 bits (930), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 184/244 (76%), Positives = 216/244 (89%), Gaps = 0/244 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA AQQRSLLELAKLDAEL+RIAHR+THLP+ +A++QV+ +H A +DR+ A+RIA ED
Sbjct 1 MKAEAAQQRSLLELAKLDAELSRIAHRSTHLPEGSAFEQVRVQHEAVSDRLGAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QVSR E EIDAVRKR DRDRSLLTSGA DAKQLADLQHEL++L+RRQASLED+LLEV
Sbjct 61 LDAQVSRLEDEIDAVRKREDRDRSLLTSGAVDAKQLADLQHELETLERRQASLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQAQQ E AL+ L+A+L AAQQ++D ALAE+DQ+R +HSS+RD L A+L+P+
Sbjct 121 MERREELQAQQNTEIAALEVLQAELTAAQQSVDAALAELDQSRQEHSSRRDTLAASLNPD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
LA LYER RAGGGPGAG+LQGHRCGACRIEIGR ELA+ISAAAEDEVVRCPEC AILLR+
Sbjct 181 LAALYERLRAGGGPGAGQLQGHRCGACRIEIGRSELARISAAAEDEVVRCPECAAILLRI 240
Query 241 EGFE 244
+G E
Sbjct 241 KGPE 244
>gi|254822753|ref|ZP_05227754.1| hypothetical protein MintA_22679 [Mycobacterium intracellulare
ATCC 13950]
Length=245
Score = 337 bits (865), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 173/245 (71%), Positives = 210/245 (86%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQRSLLEL+KLDAEL+R+ HRA HLP++ A +++Q E++AA DR+ A+RIA ED
Sbjct 1 MKAEVAQQRSLLELSKLDAELSRLTHRAAHLPEQEACRRMQEEYDAAGDRIGAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D V R ESEIDAVR+R DRDR+LL SGATDAKQLADLQHEL++LQRRQ SLED+LLEV
Sbjct 61 IDAHVKRLESEIDAVRQREDRDRALLQSGATDAKQLADLQHELETLQRRQTSLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQ+Q E +AL+ L A++A A+QALD ALAE+ +AR HSSQRD L+A LDP
Sbjct 121 MERREELQSQLDTEQKALETLEAEMAGARQALDAALAELTEARELHSSQRDSLSAALDPA 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L+ LYERQRAGGGPGA +L G RCGACR+EI RGELA+ISAAAED+VVRCPECGAILLR+
Sbjct 181 LSALYERQRAGGGPGAAQLLGKRCGACRLEIDRGELARISAAAEDDVVRCPECGAILLRV 240
Query 241 EGFEE 245
+GF++
Sbjct 241 KGFDQ 245
>gi|240172443|ref|ZP_04751102.1| hypothetical protein MkanA1_24225 [Mycobacterium kansasii ATCC
12478]
Length=245
Score = 330 bits (847), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 185/245 (76%), Positives = 220/245 (90%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA AQQRSLLELAKLDAEL+RIAHR+THLPQR A ++VQ EHNAA DR+AA+RIA ED
Sbjct 1 MKAEAAQQRSLLELAKLDAELSRIAHRSTHLPQREACERVQIEHNAAGDRLAAVRIAVED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QVSRFE+EIDAVRKR +RDRSLL SGATDAKQL+DLQHEL++L+RRQASLED+LLEV
Sbjct 61 LDAQVSRFEAEIDAVRKREERDRSLLKSGATDAKQLSDLQHELETLERRQASLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQA+Q AE+ AL L +L A+QALD ALAE++Q+R + SS+R+ L+A+L+P+
Sbjct 121 MERREELQARQAAETAALAKLHTELDGARQALDAALAELEQSRRERSSRREELSASLNPD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRAGGGPGAG LQGHRCGACRIEIGRGELA+ISAAA+D+V+RCPECGAILLR+
Sbjct 181 LVALYERQRAGGGPGAGPLQGHRCGACRIEIGRGELARISAAADDDVLRCPECGAILLRV 240
Query 241 EGFEE 245
+GFE+
Sbjct 241 KGFEQ 245
>gi|41408079|ref|NP_960915.1| hypothetical protein MAP1981c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396434|gb|AAS04298.1| hypothetical protein MAP_1981c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336461840|gb|EGO40696.1| Zn-ribbon protein, possibly binds nucleic acid [Mycobacterium
avium subsp. paratuberculosis S397]
Length=245
Score = 317 bits (812), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/242 (69%), Positives = 203/242 (84%), Gaps = 0/242 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQRSLLELA +DAEL+R+AHRA HLP++ A +++Q E++AA DR+ A+RIA ED
Sbjct 1 MKADVAQQRSLLELANVDAELSRLAHRAEHLPEQQACERMQQEYDAAGDRLGAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D V R E+E+DAVR+R DRDRSLL SGA DAKQLADLQHEL++LQRRQ SLED+LLEV
Sbjct 61 IDAHVLRLEAEVDAVRQREDRDRSLLQSGAIDAKQLADLQHELETLQRRQTSLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQAQ E +AL+ L A++A A++ LD A EI ++R HSS+RD L+A LDPE
Sbjct 121 MERREELQAQLDGEQQALKELEAEMATARRDLDAARGEISESRALHSSRRDALSAELDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRA GGPGAG+L G RCGACR+EI RGEL++ISAAAED+VVRCPECGAILLR+
Sbjct 181 LFALYERQRARGGPGAGQLLGRRCGACRLEIDRGELSRISAAAEDDVVRCPECGAILLRV 240
Query 241 EG 242
+G
Sbjct 241 KG 242
>gi|118463992|ref|YP_881418.1| hypothetical protein MAV_2210 [Mycobacterium avium 104]
gi|254774919|ref|ZP_05216435.1| hypothetical protein MaviaA2_09630 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118165279|gb|ABK66176.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=245
Score = 312 bits (799), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 164/242 (68%), Positives = 201/242 (84%), Gaps = 0/242 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQRSLLELA +DAEL+R+AHRA HLP++ A +++Q E++AA DR+ A+RIA ED
Sbjct 1 MKADVAQQRSLLELANVDAELSRLAHRAEHLPEQQACERMQQEYDAAGDRLGAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D V R E+E+DAVR+R DRDRSLL SGA DAKQLADLQHEL++LQRRQ SLED+LLEV
Sbjct 61 IDAHVRRLEAEVDAVRQREDRDRSLLQSGAIDAKQLADLQHELETLQRRQTSLEDSLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQAQ E + L+ L A +A A++ LD A EI ++R HSS+R+ L+A LDPE
Sbjct 121 MERREELQAQLDGEQQTLKELEAAMATARRDLDAARGEISESRALHSSRREALSAELDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRA GGPGAG+L G RCGACR+EI RGEL++ISAAAED+VVRCPECGAILLR+
Sbjct 181 LFALYERQRARGGPGAGQLLGRRCGACRLEIDRGELSRISAAAEDDVVRCPECGAILLRV 240
Query 241 EG 242
+G
Sbjct 241 KG 242
>gi|296166120|ref|ZP_06848565.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898529|gb|EFG78090.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=245
Score = 303 bits (775), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 169/245 (69%), Positives = 210/245 (86%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQRSLLEL++LDAEL+RI HRA HL ++ AY++V+ E AA DR+ A+RIA ED
Sbjct 1 MKAEVAQQRSLLELSQLDAELSRITHRAGHLAEQQAYERVRDELTAAADRVGAVRIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+DGQVSRFESEI++VR+R DRDRSLL SGA DAKQL+DLQHEL++L RRQASLED+LL+V
Sbjct 61 IDGQVSRFESEIESVRQREDRDRSLLESGAADAKQLSDLQHELETLTRRQASLEDSLLDV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
LERREELQ+Q ++AL A+LA A++ALD+AL EID+AR HS++R+ L+A LDP
Sbjct 121 LERREELQSQLDDAQGKVEALEAELAGARKALDDALTEIDEARQAHSARRETLSAALDPA 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L+ LYERQRAGGGPGAG L G RCGACR+EI RGE+++ISAAAED+VVRCPECGAILLR+
Sbjct 181 LSALYERQRAGGGPGAGPLLGRRCGACRLEIDRGEMSRISAAAEDDVVRCPECGAILLRV 240
Query 241 EGFEE 245
+GF++
Sbjct 241 KGFDQ 245
>gi|118468416|ref|YP_888583.1| hypothetical protein MSMEG_4306 [Mycobacterium smegmatis str.
MC2 155]
gi|118169703|gb|ABK70599.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=242
Score = 281 bits (720), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 146/241 (61%), Positives = 191/241 (80%), Gaps = 0/241 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA V+QQRSLL L+++DAEL RIAHR +L ++ ++ A+ NDR+AAL IA ED
Sbjct 1 MKAEVSQQRSLLTLSEVDAELARIAHRGKNLAEQKRLDELTAQRGEVNDRLAALGIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QV+++ESEID+VR+R DRDR+LL G+ AKQ+ ++QHEL++LQRRQASLE+ LLEV
Sbjct 61 LDAQVAKYESEIDSVRQREDRDRALLEGGSVGAKQVTEIQHELETLQRRQASLEEQLLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREEL A+++ E R + L+ +L AQQA D AL E+DQARHQ +++RD L +D +
Sbjct 121 MERREELMAERSEELRRVDELQTELTEAQQARDAALVELDQARHQCATRRDALVNAIDDQ 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYE+QRA GG GAG LQG RCGACRIEI RGE+A+I+AAA+D+VVRCPECGAILLR+
Sbjct 181 LVELYEKQRARGGAGAGPLQGRRCGACRIEIDRGEIARITAAADDDVVRCPECGAILLRV 240
Query 241 E 241
+
Sbjct 241 K 241
>gi|342857251|ref|ZP_08713907.1| hypothetical protein MCOL_00195 [Mycobacterium colombiense CECT
3035]
gi|342134584|gb|EGT87750.1| hypothetical protein MCOL_00195 [Mycobacterium colombiense CECT
3035]
Length=241
Score = 275 bits (704), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 166/241 (69%), Positives = 206/241 (86%), Gaps = 0/241 (0%)
Query 5 VAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQ 64
+AQQRSLLELAKLDAEL+RI HRATHLP++ A +++QAE AA DR+A LRIA ED+D Q
Sbjct 1 MAQQRSLLELAKLDAELSRITHRATHLPEQEACERLQAESEAAGDRVATLRIALEDIDAQ 60
Query 65 VSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERR 124
V+R E+EI+ VR+R DRDRSLL SGATDAKQL+DLQHEL++LQRRQ SLED+LLEV+ERR
Sbjct 61 VARLETEIEGVRRREDRDRSLLQSGATDAKQLSDLQHELETLQRRQTSLEDSLLEVMERR 120
Query 125 EELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGL 184
EELQ+Q E R L +L ++AAA++AL+ A+AE+ QAR +S+RD L+A LDP L+ L
Sbjct 121 EELQSQLDDEQRTLTSLETEMAAAREALEAAVAELSQARELQASRRDSLSAALDPALSAL 180
Query 185 YERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGFE 244
YERQRAGGG GAG+L G RCGACR+EI RGEL++ISAAAED+VVRCPECGAILLR++GF+
Sbjct 181 YERQRAGGGAGAGQLLGRRCGACRLEIDRGELSRISAAAEDDVVRCPECGAILLRVKGFD 240
Query 245 E 245
+
Sbjct 241 Q 241
>gi|333990305|ref|YP_004522919.1| hypothetical protein JDM601_1665 [Mycobacterium sp. JDM601]
gi|333486273|gb|AEF35665.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=244
Score = 266 bits (679), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 154/243 (64%), Positives = 190/243 (79%), Gaps = 0/243 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQRSLLEL++LDAEL RIAHR+ HLP++ ++ AEH A DR+AAL +A ED
Sbjct 1 MKAEVAQQRSLLELSELDAELARIAHRSGHLPEQQERDRILAEHTTAADRLAALELALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LDGQ +RFESEIDAVR+R DRDR+LL SG T AKQ+ADLQHEL++LQRRQASLED+LLE+
Sbjct 61 LDGQAARFESEIDAVRQRADRDRALLDSGQTSAKQVADLQHELETLQRRQASLEDSLLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
LE+RE+L AQ TAES + L +LA ++ L A AE++ R Q ++ R L T+D E
Sbjct 121 LEQREQLHAQATAESGVVDELATELARVEETLQTASAELETTRAQRAATRAELAGTIDGE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRA GG GAG L+G +CGACRIEI RGELA+ISAA +EV+RCPEC A+LLR+
Sbjct 181 LLALYERQRASGGVGAGPLRGGQCGACRIEIDRGELARISAAPPEEVLRCPECSAVLLRV 240
Query 241 EGF 243
+ F
Sbjct 241 KDF 243
>gi|120404587|ref|YP_954416.1| hypothetical protein Mvan_3619 [Mycobacterium vanbaalenii PYR-1]
gi|119957405|gb|ABM14410.1| protein of unknown function DUF164 [Mycobacterium vanbaalenii
PYR-1]
Length=245
Score = 266 bits (679), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/242 (61%), Positives = 188/242 (78%), Gaps = 0/242 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQ+ L +LA+LDAE++R HR +LP++ A ++ QA H A+DR+AAL++A D
Sbjct 1 MKAPVAQQQLLADLAELDAEVSRNEHRTKNLPEQKAVEEAQAAHREASDRLAALQLALAD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D QV++ ESEID VR+R DRDR+LL G D KQL DLQHELD+LQRRQASLED+ LEV
Sbjct 61 IDAQVAKLESEIDGVRQREDRDRALLDGGTVDPKQLTDLQHELDTLQRRQASLEDSQLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREEL A+QT E A++ L+ L AQ+A D+A AE+ QAR + +++R L + +D E
Sbjct 121 MERREELAAEQTREQAAIEELQTALTGAQRACDDARAELAQAREKSAARRAELVSAIDGE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRA GG GAG LQG RCGACRIEI +GE A+I+AAAEDEV+RCPEC AILLR+
Sbjct 181 LVALYERQRARGGVGAGVLQGRRCGACRIEIDQGESARIAAAAEDEVLRCPECSAILLRV 240
Query 241 EG 242
+G
Sbjct 241 KG 242
>gi|108800309|ref|YP_640506.1| hypothetical protein Mmcs_3343 [Mycobacterium sp. MCS]
gi|119869437|ref|YP_939389.1| hypothetical protein Mkms_3405 [Mycobacterium sp. KMS]
gi|126435932|ref|YP_001071623.1| hypothetical protein Mjls_3354 [Mycobacterium sp. JLS]
gi|108770728|gb|ABG09450.1| protein of unknown function DUF164 [Mycobacterium sp. MCS]
gi|119695526|gb|ABL92599.1| protein of unknown function DUF164 [Mycobacterium sp. KMS]
gi|126235732|gb|ABN99132.1| protein of unknown function DUF164 [Mycobacterium sp. JLS]
Length=245
Score = 251 bits (642), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 145/245 (60%), Positives = 193/245 (79%), Gaps = 0/245 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA V QQ SLL+LA++DA L RI HR LP++ +V+AEH AA D++A L IA +D
Sbjct 1 MKADVWQQHSLLQLAEVDAGLARIEHRVRKLPEQDELDRVRAEHGAATDKVAVLGIAMDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QV++FESEIDAVR+R DRDR+LL + AKQ+A+LQHEL++L+RRQASLED+LLE+
Sbjct 61 LDEQVAKFESEIDAVRQREDRDRALLEGDSVGAKQVAELQHELETLERRQASLEDSLLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREEL AQ+ AE + L+ L+AAQ+A+ +A+AE+D +R ++ S+R+ L +L +
Sbjct 121 MERREELAAQRAAELARVDELQITLSAAQRAVADAVAELDGSRQENLSRREELLGSLQSD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQRA GG GAG+LQG RCGACR+EI RGE+A+ISAA +DEV+RCPEC AIL+R
Sbjct 181 LVDLYERQRARGGAGAGQLQGRRCGACRLEIDRGEMARISAAPDDEVLRCPECNAILVRA 240
Query 241 EGFEE 245
EGF++
Sbjct 241 EGFKK 245
>gi|145223478|ref|YP_001134156.1| hypothetical protein Mflv_2891 [Mycobacterium gilvum PYR-GCK]
gi|315443839|ref|YP_004076718.1| Zn-ribbon protein, possibly nucleic acid-binding protein [Mycobacterium
sp. Spyr1]
gi|145215964|gb|ABP45368.1| protein of unknown function DUF164 [Mycobacterium gilvum PYR-GCK]
gi|315262142|gb|ADT98883.1| Zn-ribbon protein, possibly nucleic acid-binding protein [Mycobacterium
sp. Spyr1]
Length=245
Score = 243 bits (619), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/242 (56%), Positives = 179/242 (74%), Gaps = 0/242 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQ+ LLELA++DAE++R+ HR +LP++ A ++ QA DR+A+LR+A D
Sbjct 1 MKAAVAQQQLLLELAEVDAEISRVEHRTKNLPEQKAVEEAQAALREVGDRVASLRLALAD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D QV++FE+EID VR+R DRD++LL G D KQL DLQHEL++LQRRQASLED+ LE+
Sbjct 61 IDAQVAKFETEIDGVRQREDRDKALLEGGTVDPKQLTDLQHELETLQRRQASLEDSQLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREEL ++ E+ A + L AQ+ D+A AE+ Q R + +++R L T+D E
Sbjct 121 MERREELATREAEEASAAGQAQTALDDAQRVCDDARAELVQTRERATARRAELADTIDGE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYERQR+ G GA LQG RCGACRIEI RGE A+I+AAA+D+VVRCPEC AILLR+
Sbjct 181 LVSLYERQRSRSGVGAAPLQGRRCGACRIEIDRGESARIAAAADDDVVRCPECSAILLRV 240
Query 241 EG 242
+
Sbjct 241 KA 242
>gi|15827865|ref|NP_302128.1| hypothetical protein ML1638 [Mycobacterium leprae TN]
gi|221230342|ref|YP_002503758.1| hypothetical protein MLBr_01638 [Mycobacterium leprae Br4923]
gi|18202759|sp|Q9CBS9.1|Y1638_MYCLE RecName: Full=Uncharacterized protein ML1638
gi|13093417|emb|CAC30589.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933449|emb|CAR71733.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=232
Score = 239 bits (611), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/230 (61%), Positives = 171/230 (75%), Gaps = 0/230 (0%)
Query 14 LAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSRFESEID 73
++KLD EL+RIAHRA +LPQR AY++++ E ANDR+ A++IA ED+D QV ESEID
Sbjct 1 MSKLDDELSRIAHRANYLPQREAYERMRVERTGANDRLVAVQIALEDVDTQVFLLESEID 60
Query 74 AVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREELQAQQTA 133
A+R+R DRDR LL SGATDAKQL+DLQ E + QRR+ SLED+L EV++RR ELQ Q TA
Sbjct 61 AMRQREDRDRLLLNSGATDAKQLSDLQPEFGTWQRRKNSLEDSLREVMKRRGELQDQLTA 120
Query 134 ESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYERQRAGGG 193
E A++ ++ DL A+Q LD A AEIDQ HSSQ D+L A L P L+ YER AGGG
Sbjct 121 ELGAIERMQTDLVGARQTLDVAFAEIDQVGQPHSSQCDVLIAELAPALSAPYERLCAGGG 180
Query 194 PGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGF 243
G G+LQGHRCGACR EIGRGEL+ IS +DEVV+ PE GAI L +GF
Sbjct 181 LGVGQLQGHRCGACRSEIGRGELSCISVDVDDEVVKYPESGAIQLLDKGF 230
>gi|3150238|emb|CAA19218.1| hypothetical protein MLCB1243.37 [Mycobacterium leprae]
Length=225
Score = 230 bits (587), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 134/223 (61%), Positives = 165/223 (74%), Gaps = 0/223 (0%)
Query 21 LTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSRFESEIDAVRKRGD 80
++RIAHRA +LPQR AY++++ E ANDR+ A++IA ED+D QV ESEIDA+R+R D
Sbjct 1 MSRIAHRANYLPQREAYERMRVERTGANDRLVAVQIALEDVDTQVFLLESEIDAMRQRED 60
Query 81 RDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREELQAQQTAESRALQA 140
RDR LL SGATDAKQL+DLQ E + QRR+ SLED+L EV++RR ELQ Q TAE A++
Sbjct 61 RDRLLLNSGATDAKQLSDLQPEFGTWQRRKNSLEDSLREVMKRRGELQDQLTAELGAIER 120
Query 141 LRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYERQRAGGGPGAGRLQ 200
++ DL A+Q LD A AEIDQ HSSQ D+L A L P L+ YER AGGG G G+LQ
Sbjct 121 MQTDLVGARQTLDVAFAEIDQVGQPHSSQCDVLIAELAPALSAPYERLCAGGGLGVGQLQ 180
Query 201 GHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGF 243
GHRCGACR EIGRGEL+ IS +DEVV+ PE GAI L +GF
Sbjct 181 GHRCGACRSEIGRGELSCISVDVDDEVVKYPESGAIQLLDKGF 223
>gi|169628992|ref|YP_001702641.1| hypothetical protein MAB_1904 [Mycobacterium abscessus ATCC 19977]
gi|169240959|emb|CAM61987.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=240
Score = 213 bits (543), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/239 (53%), Positives = 171/239 (72%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA VAQQR L++LA +DAELTR+AHR + P+R + ++QA+ D + AL IA ED
Sbjct 1 MKADVAQQRLLVDLASVDAELTRVAHRRANPPERQEHGELQAQQRTILDEVGALAIALED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD QV++ ++E+ AVR+R DRDRSLL SG+T+AK+L ++QHELD+L+RRQ+SLED+ LE+
Sbjct 61 LDEQVAKLDAEVTAVRQREDRDRSLLASGSTNAKELTEIQHELDTLERRQSSLEDSELEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERREELQ QQ + + L+ ++ + D +Q +RD L +++D
Sbjct 121 MERREELQKQQASAQAKADTIAERLSEIERIQRAVAIDTDAEENQVRQRRDGLASSIDGL 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L YERQR GG GAG LQG++CGACRIE+ RGELA+ISAA DEV+RCPEC AIL+R
Sbjct 181 LLETYERQRRSGGAGAGFLQGNKCGACRIELDRGELARISAADADEVLRCPECSAILVR 239
>gi|54023607|ref|YP_117849.1| hypothetical protein nfa16390 [Nocardia farcinica IFM 10152]
gi|54015115|dbj|BAD56485.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=286
Score = 205 bits (521), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 117/232 (51%), Positives = 153/232 (66%), Gaps = 0/232 (0%)
Query 8 QRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSR 67
Q LL+LA +DAELTRIAHR T LP++ +++A N D + I +DLD + +
Sbjct 49 QAKLLQLAAVDAELTRIAHRRTVLPEQQEVARLEARRNEHKDAAVKVEIVLDDLDRDIKK 108
Query 68 FESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREEL 127
E EI+AVRKR +RDR +LTSG+ AKQL+++QHEL SL+RR+ LED LLEV+ERRE
Sbjct 109 LEGEIEAVRKREERDRGMLTSGSVGAKQLSEIQHELGSLERRRGVLEDELLEVMERREAS 168
Query 128 QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYER 187
+ L +LA AQ+ DEALA++D A+ + + R L EL +Y+R
Sbjct 169 ASDHDHAGAQLTRTEQELADAQRQRDEALADLDVAQARCENDRGELVGLFPDELLAVYDR 228
Query 188 QRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
QRA G GA LQ RCGACRIE+ RGE+A+I+ A DEVVRCPECGAIL+R
Sbjct 229 QRAQRGVGAALLQARRCGACRIELDRGEIARIAKTAADEVVRCPECGAILVR 280
>gi|312140238|ref|YP_004007574.1| hypothetical protein REQ_28760 [Rhodococcus equi 103S]
gi|325677015|ref|ZP_08156686.1| hypothetical protein HMPREF0724_14469 [Rhodococcus equi ATCC
33707]
gi|311889577|emb|CBH48894.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325552177|gb|EGD21868.1| hypothetical protein HMPREF0724_14469 [Rhodococcus equi ATCC
33707]
Length=245
Score = 196 bits (498), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 111/239 (47%), Positives = 152/239 (64%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
M + Q LL+LA +D EL R+AHR + LP++ ++++AE + D A+ I +D
Sbjct 1 MNVDPSVQSKLLQLAGVDTELARLAHRRSALPEQQEVERLEAERLSRKDAAVAVEIVLDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD + + E E+DAVR+R RDR LL G KQL++LQHEL SL+RRQ+ LED LLEV
Sbjct 61 LDRDIKKLEGEVDAVRQRETRDRKLLEGGTLAPKQLSELQHELGSLERRQSVLEDELLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERRE QA L + +L A++ D+A+A++D+A+ + + R L E
Sbjct 121 MERREASQADHDHAGARLTQVEDELIDAERRRDDAVADLDKAQERCDADRSGLVGLFPDE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L +YE+QRA G GA LQ RCGACRIEI RGELA+I+A D VVRCPEC AI++R
Sbjct 181 LLAIYEKQRAERGVGAALLQARRCGACRIEIDRGELARIAATPADVVVRCPECSAIMVR 239
>gi|226307146|ref|YP_002767106.1| hypothetical protein RER_36590 [Rhodococcus erythropolis PR4]
gi|226186263|dbj|BAH34367.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=245
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 110/232 (48%), Positives = 154/232 (67%), Gaps = 0/232 (0%)
Query 8 QRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSR 67
Q LL+LA +DAELTRI HR LP++ ++++AE + D A+ I +D+D + +
Sbjct 8 QSKLLDLAGVDAELTRITHRRGALPEQKEVERLEAERISRKDASVAVEIQIDDIDRDIRK 67
Query 68 FESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREEL 127
E E+DAVR+R DRDR+LL SG+ AKQL +L+HEL SL RRQ LED LLEV+E+RE L
Sbjct 68 LEGEVDAVRQREDRDRTLLQSGSVGAKQLTELEHELGSLVRRQGLLEDELLEVMEQREAL 127
Query 128 QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYER 187
QA L +L A++ D+A+A++D+A+ + ++ R+ L + +YE+
Sbjct 128 QADHDHAGAQLSQAEEELIDAKRRRDDAVADLDKAQERCAADRERLVGEFPADFLAVYEK 187
Query 188 QRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
QR GGPGA LQ RCGACRIEI RGE+++I+A D VVRCPEC AI++R
Sbjct 188 QRTLGGPGAALLQARRCGACRIEIDRGEISRIAATPADVVVRCPECNAIMVR 239
>gi|229490271|ref|ZP_04384113.1| zinc ribbon domain protein [Rhodococcus erythropolis SK121]
gi|229322803|gb|EEN88582.1| zinc ribbon domain protein [Rhodococcus erythropolis SK121]
Length=245
Score = 192 bits (488), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 110/232 (48%), Positives = 155/232 (67%), Gaps = 0/232 (0%)
Query 8 QRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSR 67
Q LL+LA +DAELTRI HR LP++ ++++AE + D A+ I +D+D + +
Sbjct 8 QSKLLDLAGVDAELTRITHRRGALPEQQEVERLEAERISRKDASVAVEIQIDDIDRDIRK 67
Query 68 FESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREEL 127
E E+DAVR+R DRDR+LL SG+ AKQL +L+HEL SL RRQ LED LLEV+E+RE L
Sbjct 68 LEGEVDAVRQREDRDRTLLQSGSVGAKQLTELEHELGSLVRRQGLLEDELLEVMEQREAL 127
Query 128 QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYER 187
QA L +L A++ D+A+A++D+A+ + ++ R+ L + +YE+
Sbjct 128 QADHDHAGAQLSQAEEELIDAKRRRDDAVADLDKAQERCAADRERLVGEFPADFLAVYEK 187
Query 188 QRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
QR+ GGPGA LQ RCGACRIEI RGE+++I+A D VVRCPEC AI++R
Sbjct 188 QRSLGGPGAALLQARRCGACRIEIDRGEISRIAATPADVVVRCPECNAIMVR 239
>gi|296140405|ref|YP_003647648.1| hypothetical protein Tpau_2711 [Tsukamurella paurometabola DSM
20162]
gi|296028539|gb|ADG79309.1| protein of unknown function DUF164 [Tsukamurella paurometabola
DSM 20162]
Length=246
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 104/239 (44%), Positives = 151/239 (64%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
M A QRSLL+LA++DAE++R+AHR THLP+ A +++ + + D + I ED
Sbjct 1 MNADPGAQRSLLDLAEVDAEISRLAHRVTHLPEDAEIAELEKQASTERDDSVRVSILVED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD +++ E+E++ R R ++DR L+ SG AKQL +L+HEL L+RRQ+ LED LE+
Sbjct 61 LDRDIAKLETEVNQTRLREEKDRELMASGRVAAKQLTELEHELKGLERRQSVLEDEQLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERRE ++ Q + A + AAQ+ +EAL +I AR + +++RD + +TL +
Sbjct 121 MERREAVELDQQRAEATVNATAEKITAAQRRREEALKDIGVARTRTAARRDEVVSTLPDD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L YER R G GAG LQ RCGACR+E+ RG L ++ D VV C ECGAIL+R
Sbjct 181 LYAEYERCRQASGVGAGLLQARRCGACRLELDRGFLDTVARTTSDVVVHCDECGAILVR 239
>gi|134098169|ref|YP_001103830.1| hypothetical protein SACE_1583 [Saccharopolyspora erythraea NRRL
2338]
gi|291007552|ref|ZP_06565525.1| hypothetical protein SeryN2_23759 [Saccharopolyspora erythraea
NRRL 2338]
gi|133910792|emb|CAM00905.1| hypothetical protein SACE_1583 [Saccharopolyspora erythraea NRRL
2338]
Length=244
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 113/239 (48%), Positives = 147/239 (62%), Gaps = 1/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A QR LL+LA+ DAEL R+ HR LP+ + + A D + A + A D
Sbjct 1 MKADPAVQRRLLDLAECDAELNRVNHRRRTLPELEEIGTAERDAQAKRDSLVAAQTAFGD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D R E EI+ VR R DRDR L+ +G + +KQL DLQHEL +L RRQ LED LLEV
Sbjct 61 IDRDAKRLEGEIEQVRAREDRDRKLMEAGGS-SKQLEDLQHELQTLARRQGILEDELLEV 119
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERRE L+ + A LA AQ+ DEAL ++D A + +++R+++ L
Sbjct 120 MERREALETDVARAREEVSASEEKLADAQRRRDEALVDLDTAEARRTAERELMVKGLPEN 179
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LY+R RA G GAG LQG RCGACRIE+ R LA++ A D+VVRC ECGAIL+R
Sbjct 180 LVVLYDRIRAQKGVGAGLLQGTRCGACRIELDRSALAEVRDADADDVVRCEECGAILVR 238
>gi|111018195|ref|YP_701167.1| hypothetical protein RHA1_ro01182 [Rhodococcus jostii RHA1]
gi|110817725|gb|ABG93009.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=245
Score = 185 bits (469), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 112/232 (49%), Positives = 149/232 (65%), Gaps = 0/232 (0%)
Query 8 QRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSR 67
Q LL+LA +DAEL+RIAHR T LP+R ++++AE D A+ I +DLD + +
Sbjct 8 QSKLLDLAGVDAELSRIAHRRTALPERQEVERLEAERVTRKDAAVAVEIVLDDLDRDIRK 67
Query 68 FESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREEL 127
E E+DAVR+R DRDR+LL SG +KQL +L+HEL SL RRQ LED LLEV+ERRE
Sbjct 68 LEGEVDAVRQREDRDRTLLQSGTVGSKQLTELEHELGSLVRRQGLLEDELLEVMERREAS 127
Query 128 QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYER 187
Q+ L + DL A + D+A+A++D A + R L +L +YE+
Sbjct 128 QSDHDHAGAQLSQIEEDLIDAGRRRDDAVADLDAAEQRCIRDRTALADQFPADLIAVYEK 187
Query 188 QRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
QR+ G GA LQ RCGACRIE+ RGE+++I+A A D VVRC ECGAIL+R
Sbjct 188 QRSQNGVGAALLQSRRCGACRIELDRGEISRITATAPDVVVRCSECGAILVR 239
>gi|226360321|ref|YP_002778099.1| hypothetical protein ROP_09070 [Rhodococcus opacus B4]
gi|226238806|dbj|BAH49154.1| hypothetical protein [Rhodococcus opacus B4]
Length=245
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/232 (48%), Positives = 148/232 (64%), Gaps = 0/232 (0%)
Query 8 QRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSR 67
Q LL+LA +DAEL+RIAHR T LP+R ++++AE D A+ I +DLD + +
Sbjct 8 QSKLLDLAGVDAELSRIAHRRTALPERQEVERLEAERVTRKDAAVAVEIILDDLDRDIRK 67
Query 68 FESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREEL 127
E E+DAVR+R DRDR LL SG +KQL +L+HEL SL RRQ LED LLEV+ERRE
Sbjct 68 LEGEVDAVRQREDRDRKLLESGTVGSKQLTELEHELGSLVRRQGLLEDELLEVMERREAS 127
Query 128 QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYER 187
Q+ L + DL A + D+A+A++D A + + R L +L +YE+
Sbjct 128 QSDHDHAGAQLSQIEEDLIDASRRRDDAVADLDAAEQRCTRDRAALAEQFPADLITVYEK 187
Query 188 QRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
QR+ G GA LQ RCGACRIE+ RGE+++I+ A D VVRC ECGAIL+R
Sbjct 188 QRSQNGVGAALLQARRCGACRIELDRGEISRITGTAPDVVVRCSECGAILVR 239
>gi|257054784|ref|YP_003132616.1| Zn-ribbon protein, possibly nucleic acid-binding [Saccharomonospora
viridis DSM 43017]
gi|256584656|gb|ACU95789.1| Zn-ribbon protein, possibly nucleic acid-binding [Saccharomonospora
viridis DSM 43017]
Length=245
Score = 182 bits (462), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 107/239 (45%), Positives = 151/239 (64%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A QR LL+LA++DAEL R+AHR +LP+ A + + D + A + A D
Sbjct 1 MKAEPAVQRQLLDLAEVDAELARVAHRRRNLPELAEITEAEKRLRERRDALVAAQTTASD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
L+ +VS+ E EI++VR R +RDR L+ SG+ AKQLADL+ EL++L RRQ+ LED LE+
Sbjct 61 LEREVSKQEREIESVRARAERDRKLMESGSVSAKQLADLERELETLARRQSVLEDDQLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ER+E + A + + +LA AQ+ DEALA++D + + + R L L
Sbjct 121 MERKEAVDADVQRTAAEVDKAEQELADAQRRRDEALADLDTTQARREADRKNLVPKLPEN 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LYER RA G GA L+ RCGAC++E+ R +A+I AA +DEVV+C C AIL+R
Sbjct 181 LLALYERVRAHKGIGAALLKSRRCGACQLELDRSSIAEIKAAPDDEVVQCENCDAILVR 239
>gi|333918914|ref|YP_004492495.1| hypothetical protein AS9A_1243 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481135|gb|AEF39695.1| hypothetical protein AS9A_1243 [Amycolicicoccus subflavus DQS3-9A1]
Length=242
Score = 182 bits (461), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 106/239 (45%), Positives = 144/239 (61%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
M V Q+ LL+LA +DAEL RI HR +LP+ ++++ E A D A IA +D
Sbjct 1 MDVDVKVQQKLLDLADVDAELLRIRHRRINLPEDKEIERLKKERQARKDDAVAAEIALDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD + R E E+D V +R RD +L+ SG AKQL++LQHEL +L RR++ LED LL++
Sbjct 61 LDRDIKRLEREVDQVGQREKRDNALMQSGTVAAKQLSELQHELGTLGRRRSLLEDELLDI 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE + L +L AA +A A++D A + + R+ LT E
Sbjct 121 MEQREAAEENYKHAGARLSHAEEELDAAGSKRGDATADLDVAEKRCDTDREKLTQLFPEE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LYE +R GAGRLQG RCGACRIE+ RGEL +I A + VV+CPECGAIL+R
Sbjct 181 LLSLYENERRSHSVGAGRLQGSRCGACRIELDRGELERIKATPPERVVQCPECGAILVR 239
>gi|262202986|ref|YP_003274194.1| hypothetical protein Gbro_3095 [Gordonia bronchialis DSM 43247]
gi|262086333|gb|ACY22301.1| protein of unknown function DUF164 [Gordonia bronchialis DSM
43247]
Length=245
Score = 180 bits (457), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 105/239 (44%), Positives = 151/239 (64%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MK QR +L+LA DAE+ R+ HR + LP+ A +V + AA D + +A ED
Sbjct 1 MKVDAGAQRLVLDLADADAEIARLQHRRSKLPEDAEIAEVTSALEAARDDLVRSEMAGED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
L + R +SE+ + R +D +LLT+G K L++LQHEL L RR+A+LED LL V
Sbjct 61 LGREYRRIDSEVTGMAAREQKDSALLTAGGLAPKALSELQHELAGLGRRRAALEDDLLAV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ER+E +A++T + + L LA + ++++A ID+ +R L AT+DPE
Sbjct 121 MERQEATEAERTRAAATIDHLEGRLAELRAGREKSIAVIDEDLDGVRERRAGLAATIDPE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L Y+RQR+ G GAG+LQ RCGACR+E+ RG +A+I+AAA DEV+RC ECGAIL+R
Sbjct 181 LLATYDRQRSAGRIGAGKLQARRCGACRMELDRGTIARIAAAAPDEVIRCDECGAILVR 239
>gi|300783099|ref|YP_003763390.1| Zn-ribbon protein [Amycolatopsis mediterranei U32]
gi|299792613|gb|ADJ42988.1| Zn-ribbon protein [Amycolatopsis mediterranei U32]
gi|340524478|gb|AEK39683.1| Zn-ribbon protein [Amycolatopsis mediterranei S699]
Length=245
Score = 175 bits (443), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 109/241 (46%), Positives = 150/241 (63%), Gaps = 4/241 (1%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A QR LLELAK+DAEL+R AHR LP+ A + D + ++ AA D
Sbjct 1 MKAEPAVQRQLLELAKVDAELSRTAHRRRTLPELAEIDAGEKTVRERRDALVSVETAASD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD +++R E EI++VR R DRDR LL SG+ ++KQ+ D++HEL SL+RRQ++LED LLE+
Sbjct 61 LDREIARQEKEIESVRAREDRDRKLLASGSVNSKQMTDIEHELQSLERRQSALEDDLLEL 120
Query 121 LERREEL--QAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLD 178
+E+RE L AQ+T + A++AAA DEA ++D R + R L
Sbjct 121 MEQREALGLDAQRTGAE--VDKAEAEVAAAIARRDEAFKDLDTTRARRDEDRVKLLPRFP 178
Query 179 PELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILL 238
L LYER R G GA L+ RCGAC++E+ R + +I AA ED+V++C CGAIL+
Sbjct 179 EPLLKLYERVREHKGIGAALLRARRCGACQLELDRNTVNEIKAAPEDDVIQCENCGAILV 238
Query 239 R 239
R
Sbjct 239 R 239
>gi|256375027|ref|YP_003098687.1| hypothetical protein Amir_0880 [Actinosynnema mirum DSM 43827]
gi|255919330|gb|ACU34841.1| protein of unknown function DUF164 [Actinosynnema mirum DSM 43827]
Length=245
Score = 174 bits (442), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 100/239 (42%), Positives = 143/239 (60%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+LA++D EL R+ HR LP+ + + + A D + A+ D
Sbjct 1 MKADPVVQRRLLDLARVDTELARVEHRRRTLPEIVEIAEAEKQVRAKQDALTAVETTLGD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD V R E+EID VR R +RDR LL G+ AKQL DL+HEL +L RR+ +LED LLE+
Sbjct 61 LDRDVKRQETEIDQVRAREERDRGLLAGGSVGAKQLTDLEHELATLGRRRGALEDDLLEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ERRE ++ S + LA A + D ALA+++ + +++R + +T +
Sbjct 121 MERREAVEVDSQHASAQFANAQETLADAARRRDSALADLESTEAKRTAERKTIASTFEAP 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L +Y+R R G GA LQ RCGACRIE+ R +A++ A D+VV+C ECGAI++R
Sbjct 181 LLAVYDRVRLHKGTGAALLQSRRCGACRIELDRSAIAKVKEALADDVVQCEECGAIMVR 239
>gi|302524446|ref|ZP_07276788.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302433341|gb|EFL05157.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=245
Score = 174 bits (440), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 105/243 (44%), Positives = 149/243 (62%), Gaps = 8/243 (3%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A QR LLELAK+DAEL+R+AHR LP+ A + + D + ++ A D
Sbjct 1 MKADPAVQRQLLELAKVDAELSRVAHRRRTLPELAEIEAGEKTVREKRDALVSVETATSD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD +++R E E+++VR R DRDR L+ SG+ AKQ+ D++HEL +L RR+ +LED LLE+
Sbjct 61 LDREIARQEKEVESVRAREDRDRKLMESGSVGAKQMTDIEHELQTLGRRKGALEDDLLEL 120
Query 121 LERREEL--QAQQTAE--SRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTAT 176
+E+RE L AQ+T+ +A+Q ++ AAQ DEA ++D + R L
Sbjct 121 MEQREALGLDAQRTSAEVDKAVQ----EVQAAQARRDEAFKDLDTTEARRKEDRAKLVPR 176
Query 177 LDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAI 236
L LY R G GA L+ RCGAC++E+ R +++I AAAED VV+C CGAI
Sbjct 177 FPEPLLKLYTRVYEHKGIGAALLRARRCGACQLELDRNTISEIKAAAEDSVVQCDNCGAI 236
Query 237 LLR 239
L+R
Sbjct 237 LVR 239
>gi|343924132|ref|ZP_08763695.1| hypothetical protein GOALK_002_00860 [Gordonia alkanivorans NBRC
16433]
gi|343765937|dbj|GAA10621.1| hypothetical protein GOALK_002_00860 [Gordonia alkanivorans NBRC
16433]
Length=245
Score = 169 bits (428), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 99/239 (42%), Positives = 145/239 (61%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MK QR LL+LA DAE+ R+ HR +LP+ A + + +AA D + I+AED
Sbjct 1 MKVDAGVQRLLLDLADSDAEINRLEHRRKNLPENAEIAEQEKAIDAARDDLVRAEISAED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
L + R ESE+ + R +D L +G K L++LQHEL L RR+A ED LL +
Sbjct 61 LGREYRRIESEVTGMANREAKDSKQLAAGGLAPKALSELQHELAGLGRRRAVFEDELLTL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E++E ++A++ + + L +A A+ DE+LA I++ + +RD L +D +
Sbjct 121 MEQQEAVEAERDRAAATISHLEEKVADARVRRDESLATIEEDMTRAREKRDTLAGEIDAD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L +Y++QRA G GAG L+ RCGACR+E+ RG +A ISAA DEV+RC ECGAIL+R
Sbjct 181 LLAVYDKQRANGRIGAGLLRARRCGACRMELDRGTIASISAALSDEVIRCEECGAILVR 239
>gi|302869111|ref|YP_003837748.1| hypothetical protein Micau_4661 [Micromonospora aurantiaca ATCC
27029]
gi|302571970|gb|ADL48172.1| protein of unknown function DUF164 [Micromonospora aurantiaca
ATCC 27029]
Length=245
Score = 156 bits (395), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 101/240 (43%), Positives = 140/240 (59%), Gaps = 2/240 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+L +D L ++AHR LP+RA + + E +A D ++A +D
Sbjct 1 MKADPKVQRRLLDLQAIDTALAQLAHRRRTLPERAELEALARELSALEDERVRAQVAVDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD ++R E ++D VR R +D + L SG+ A++L +QHEL SL RRQ+ LEDA LE+
Sbjct 61 LDRDIARIEKDVDQVRARKSKDEARLASGSGPARELEAIQHELVSLNRRQSDLEDAELEL 120
Query 121 LERREELQAQ-QTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDP 179
+E+RE Q ESR +A R AA +Q DE LAEI + + R L L
Sbjct 121 MEQRETAQGVLDGIESRLAEA-RERRAATEQRRDETLAEISKEEEFKRTARQPLAGDLPA 179
Query 180 ELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
+L GLY++ R G GA L G RCG CR+E+ +LA+I A D+VVRC EC I++R
Sbjct 180 DLIGLYDKIREDTGMGAALLTGGRCGGCRLEMSGADLARIRKADPDDVVRCEECRRIMVR 239
>gi|315504417|ref|YP_004083304.1| hypothetical protein ML5_3639 [Micromonospora sp. L5]
gi|315411036|gb|ADU09153.1| protein of unknown function DUF164 [Micromonospora sp. L5]
Length=245
Score = 156 bits (394), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/240 (43%), Positives = 140/240 (59%), Gaps = 2/240 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+L +D L ++AHR LP+RA + + E +A D ++A +D
Sbjct 1 MKADPKVQRRLLDLQAIDTALAQLAHRRRTLPERAELEALARELSALEDERVRAQVAVDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD ++R E ++D VR R +D + L SG+ A++L +QHEL SL RRQ+ LEDA LE+
Sbjct 61 LDRDIARIEKDVDQVRARKSKDEARLASGSGPARELEAIQHELVSLNRRQSDLEDAELEL 120
Query 121 LERREELQAQ-QTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDP 179
+E+RE Q ESR +A R AA +Q DE LAEI + + R L L
Sbjct 121 MEQRETAQGVLDGIESRLAEA-RERRAATEQRRDETLAEIAKEEEFKRTSRQPLAGDLPA 179
Query 180 ELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
+L GLY++ R G GA L G RCG CR+E+ +LA+I A D+VVRC EC I++R
Sbjct 180 DLIGLYDKIREDTGMGAALLTGGRCGGCRLELSGADLARIRKADPDDVVRCEECRRIMVR 239
>gi|296393053|ref|YP_003657937.1| hypothetical protein Srot_0624 [Segniliparus rotundus DSM 44985]
gi|296180200|gb|ADG97106.1| protein of unknown function DUF164 [Segniliparus rotundus DSM
44985]
Length=241
Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/240 (42%), Positives = 145/240 (61%), Gaps = 1/240 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LLELA LD ++ ++ HR LP+ + A D + + + ED
Sbjct 1 MKADPQAQRGLLELAALDTQVRQLTHRLRALPEAERLAAATSAARTAADEASVIAMRIED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
++ ++++ E ++DAVR+R +RD LL SGA D++ D+QHEL SL++RQA+ ED LL
Sbjct 61 VEREIAKREGDVDAVRQREERDEQLLKSGALDSRVQNDVQHELGSLRKRQAAFEDELLGF 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE+ A +A AA+ AL EA +++ + S R+ A + E
Sbjct 121 MEQREQELAVLARAQERRAEAQAAAQAARAALSEAESQVKEQLQAAESSREARAAQIPGE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYER RA G AGRL G +CGACR+E+ ++A++SAAA DEV+RCPEC AIL+R+
Sbjct 181 LLALYERLRA-RGTAAGRLDGRKCGACRLELTVSQMAEMSAAAPDEVLRCPECDAILIRV 239
>gi|331696747|ref|YP_004332986.1| hypothetical protein Psed_2933 [Pseudonocardia dioxanivorans
CB1190]
gi|326951436|gb|AEA25133.1| protein of unknown function DUF164 [Pseudonocardia dioxanivorans
CB1190]
Length=246
Score = 152 bits (385), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/247 (42%), Positives = 143/247 (58%), Gaps = 15/247 (6%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A QR LL+LA++DAEL R+AHR LP+ A + Q + +A D++ + +A D
Sbjct 1 MKADPADQRRLLDLAEVDAELARLAHRRRSLPEDAEHAQAETAVRSAKDKLVEVETSAGD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD + R E +++AVR R RD+ LL AKQ +DLQHEL++L RRQ LED LEV
Sbjct 61 LDRDIRRLERDVEAVRARTVRDQQLLAGAGIGAKQASDLQHELETLARRQGVLEDEQLEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQ--------ARHQHSSQRDM 172
+E+RE + L R DLA A+Q L + A D + ++ ++
Sbjct 121 MEQREAVGID-------LDHARGDLARAEQTLADVGARRDSALADIAAAEAGRERARAEV 173
Query 173 LTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPE 232
+ +L YE +R+ G GAG L+ RCGACR+E+ R + Q+ AAA D+VV C E
Sbjct 174 VAQMGAADLLAAYEARRSQGKVGAGLLRERRCGACRLELDRTFIGQLRAAAADDVVPCEE 233
Query 233 CGAILLR 239
CGAIL+R
Sbjct 234 CGAILVR 240
>gi|145595870|ref|YP_001160167.1| hypothetical protein Strop_3356 [Salinispora tropica CNB-440]
gi|145305207|gb|ABP55789.1| protein of unknown function DUF164 [Salinispora tropica CNB-440]
Length=245
Score = 152 bits (384), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/239 (42%), Positives = 133/239 (56%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+LA +D L ++AHR LP+RA + E +A D A ++A D
Sbjct 1 MKADPLVQRRLLDLAGIDTNLAQLAHRRRTLPERAELDALARELSALEDERARAQVAIAD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD + R E ++D VR R +D L +G A++L LQHEL SL RRQ LEDA LE+
Sbjct 61 LDRDIDRLEQDVDQVRARKRKDEDRLAAGVGPARELEALQHELASLNRRQGDLEDAELEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE Q + L R AA +Q D++L+EI + R L A L +
Sbjct 121 MEQRETAQGVLAGIEKRLTEAREKRAAVEQRRDDSLSEITKEEEFKRGARQPLAADLPGD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LY+R RA G GA L RCG CR+E+ + A+I AA D+VVRC EC I++R
Sbjct 181 LVALYDRIRAESGLGAALLTAGRCGGCRLELSGADRARIRAADPDDVVRCEECRRIMIR 239
>gi|291302272|ref|YP_003513550.1| hypothetical protein Snas_4816 [Stackebrandtia nassauensis DSM
44728]
gi|290571492|gb|ADD44457.1| protein of unknown function DUF164 [Stackebrandtia nassauensis
DSM 44728]
Length=245
Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/241 (41%), Positives = 138/241 (58%), Gaps = 0/241 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
M+A A QR LL+L + D LT++AHR +LP+ A ++ + N DR + D
Sbjct 1 MRANPADQRRLLDLQQADTSLTQLAHRRANLPEEAEIVTLRQQVNELADRAGSNEATVGD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD +++ E EID VR+R D DR SG K+L + HEL++L RRQ+ LED L++
Sbjct 61 LDRDIAKVEREIDQVRRRADTDRERQASGKLGPKELEGIAHELETLARRQSELEDQELDL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE+ QA A++R L R LA ++ DEALA ID S R+ ++ + +
Sbjct 121 MEQREQKQAAAEADARDLADKRTALAEIEKRRDEALAAIDAELAAERSTREGISVDIPED 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL 240
L LYE+ R A L+ RC +CR+E ELA + AA E +VVRC CGAIL+R
Sbjct 181 LRKLYEKIRRTKPIAAALLRQRRCESCRLEQSGAELADLRAADESDVVRCDNCGAILVRT 240
Query 241 E 241
E
Sbjct 241 E 241
>gi|330469453|ref|YP_004407196.1| hypothetical protein VAB18032_27616 [Verrucosispora maris AB-18-032]
gi|328812424|gb|AEB46596.1| hypothetical protein VAB18032_27616 [Verrucosispora maris AB-18-032]
Length=245
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/239 (40%), Positives = 133/239 (56%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+L +D L ++AHR LP+RA + + E +A D ++A +D
Sbjct 1 MKADPQVQRRLLDLQAIDTALAQLAHRRRSLPERAELEALARELSALEDERVRAQVAVDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD ++R E ++D VR R ++ L +G A++L LQHEL SL RRQ LEDA LE+
Sbjct 61 LDRDIARLEKDVDQVRARKAKNEDRLAAGTGPARELEALQHELVSLNRRQGDLEDAELEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE QA + L +R AA +Q DE+L EI + S R L L +
Sbjct 121 MEQRETAQAVLDGVEQRLTEVRERRAATEQRRDESLGEIGREEEFKRSARQPLANDLPAD 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LY+R R G GA L RCG CR+E+ +LA+I +EVVRC +C I++R
Sbjct 181 LVQLYDRIRTDTGLGAALLYAGRCGGCRLELSGADLARIRKTDPEEVVRCEDCRRIMVR 239
>gi|317506106|ref|ZP_07963931.1| hypothetical protein HMPREF9336_00300 [Segniliparus rugosus ATCC
BAA-974]
gi|316255605|gb|EFV14850.1| hypothetical protein HMPREF9336_00300 [Segniliparus rugosus ATCC
BAA-974]
Length=246
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/247 (43%), Positives = 153/247 (62%), Gaps = 12/247 (4%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL++ LD ++ ++ HR LP+ +A A D A + + ED
Sbjct 1 MKADPKAQRELLDVVALDTQVRQLTHRLQSLPEAGQLASAKAAAQTAADEAAVVAMRIED 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+D ++++ E ++DAVR+R +RD LL SG+ D++ D+QHEL SL++RQA+ ED LL +
Sbjct 61 VDREIAKREGDVDAVRQREERDEQLLKSGSLDSRVQNDVQHELSSLRKRQAAFEDELLVL 120
Query 121 LERREELQA-------QQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQ-RDM 172
+ERREE +A +++A A +A A LA + Q L E L + R + +++ +D
Sbjct 121 MERREEERAVLAEAQARRSAADAAAEAAAAALAESDQRLKEQLRAAEATRAERTARLKDQ 180
Query 173 LTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPE 232
A EL GLYER RA G AGRL G RCGACR+E+ E++QIS++A D+VVRCPE
Sbjct 181 FGAA---ELLGLYERLRA-RGTAAGRLDGRRCGACRLELTPVEMSQISSSAPDDVVRCPE 236
Query 233 CGAILLR 239
C AIL+R
Sbjct 237 CEAILVR 243
>gi|324998689|ref|ZP_08119801.1| Zn-ribbon protein, possibly nucleic acid-binding [Pseudonocardia
sp. P1]
Length=253
Score = 144 bits (364), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 99/248 (40%), Positives = 139/248 (57%), Gaps = 18/248 (7%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALR--IAA 58
MKA Q +LL+LA++DAE+ R+AHR +LP+ Q AE + R A +R A
Sbjct 13 MKADPTAQATLLQLAEVDAEIGRLAHRRKNLPE--LQQLADAEQRVRDARDAVVRAETRA 70
Query 59 EDLDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALL 118
DLD ++R E +++ VR R ++D++LL AKQ +LQHELD+L RRQ +LE+ L
Sbjct 71 GDLDRDIARLERDVEGVRARTEKDKALLAGSGIGAKQATELQHELDTLARRQGTLEEEQL 130
Query 119 EVLERREELQAQQTAESRALQALRADLAAAQQAL-------DEALAEIDQARHQHSSQRD 171
++E RE + + L A+LA A++A+ D A A+ID +R R
Sbjct 131 GIMEEREAVGVE-------LDHGSAELATAEEAVTEVTGRRDTAEADIDASRGGRDRART 183
Query 172 MLTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCP 231
L L +L YER R+ G AG L RCGACR+E+ R L Q+ D+VV C
Sbjct 184 ELVGALPEDLLADYERIRSSGRVAAGGLSESRCGACRLELDRTFLTQVRGRPADDVVHCE 243
Query 232 ECGAILLR 239
ECGAIL+R
Sbjct 244 ECGAILVR 251
>gi|258653439|ref|YP_003202595.1| hypothetical protein Namu_3275 [Nakamurella multipartita DSM
44233]
gi|258556664|gb|ACV79606.1| protein of unknown function DUF164 [Nakamurella multipartita
DSM 44233]
Length=242
Score = 144 bits (364), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 106/242 (44%), Positives = 141/242 (59%), Gaps = 8/242 (3%)
Query 2 KAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRM--AALRIAAE 59
KA QR LL+LA++D + HR LP+ A A+ A +R+ A +R AE
Sbjct 4 KADPFIQRRLLDLARIDQAVAAAEHRRRTLPELAQI----ADGTATVERLRGALVRGQAE 59
Query 60 --DLDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDAL 117
DLD + + + EIDAVR R RD L +G A+ L ++QHEL SL RRQ++LED
Sbjct 60 IGDLDRESRKLDQEIDAVRARAKRDSDRLAAGVAPARDLENMQHELVSLARRQSTLEDEA 119
Query 118 LEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATL 177
LE++ERRE AQ T L RADL AA+Q D+A A+ID + +++R LT +
Sbjct 120 LELMERRETADAQVTQVDAELATARADLQAAEQRRDDAFADIDDEIARVTAERAGLTDGM 179
Query 178 DPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAIL 237
+L LYE+ RA G A L G RC ACR+++ R L I AA D+VVRC ECGAIL
Sbjct 180 PADLLALYEQIRARGRTAAAALNGPRCEACRMDLDRSALNDIWAAGVDQVVRCTECGAIL 239
Query 238 LR 239
+R
Sbjct 240 IR 241
>gi|238060394|ref|ZP_04605103.1| hypothetical protein MCAG_01360 [Micromonospora sp. ATCC 39149]
gi|237882205|gb|EEP71033.1| hypothetical protein MCAG_01360 [Micromonospora sp. ATCC 39149]
Length=245
Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/239 (41%), Positives = 134/239 (57%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QR LL+L +D L ++AHR LP+ A + + E +A D ++A +D
Sbjct 1 MKADPKVQRRLLDLQAIDTALAQLAHRRRTLPEWAELEALARELSALEDERVRAQVAVDD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
LD ++R E +++ VR R +D + L +G A++L LQHEL SL RRQ+ LEDA LE+
Sbjct 61 LDRDIARIEKDVEQVRARKGKDEARLAAGTGPARELEALQHELVSLNRRQSDLEDAELEL 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+E+RE Q R R AA++ DEALAEI + R L L E
Sbjct 121 MEQRETAQGVLDGIERRAADARERRVAAERRRDEALAEIAKEEEFKRQARQPLAGDLPAE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LY++ RA G GA L G RCG CR+E+ ++ +I AA D+VVRC EC I++R
Sbjct 181 LVTLYDKIRADTGLGAALLTGARCGGCRLELYGADMGRIRKAAPDDVVRCEECRRIMVR 239
>gi|271968070|ref|YP_003342266.1| Zn-ribbon domain-containing protein [Streptosporangium roseum
DSM 43021]
gi|270511245|gb|ACZ89523.1| Zn-ribbon protein possibly nucleic acid-binding- like protein
[Streptosporangium roseum DSM 43021]
Length=246
Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 97/244 (40%), Positives = 137/244 (57%), Gaps = 9/244 (3%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAE- 59
MKA A Q+ LL+LA+LD+ + R+AHR LP+ A ++ A R+A I+AE
Sbjct 1 MKAAPAAQKRLLDLAELDSVIDRLAHRRRTLPELAEIDEISARVA----RLATQVISAET 56
Query 60 ---DLDGQVSRFESEIDAVRKRGDRDRSLLTSG-ATDAKQLADLQHELDSLQRRQASLED 115
DL + S+ E+++D+VR R +RD+ L SG + K LA LQ E+ SL RRQ LE+
Sbjct 57 EAGDLAREQSKAEADVDSVRIRAERDQKRLDSGQVSSPKDLASLQSEIASLNRRQGDLEE 116
Query 116 ALLEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTA 175
+LE++ERRE AQ T L A ++ D A AEID+ + +R +
Sbjct 117 VVLEIMERRESADAQVTKTVAERDGLAAARGVSEDRRDAAFAEIDKESAEVRGKRAEVVT 176
Query 176 TLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGA 235
+ +L LYE+ R G GA LQG RC CR + E+ +I AA+ DEV+RC EC
Sbjct 177 DIPADLLALYEKLRDQFGVGAAMLQGGRCLGCRTSLSIAEINRIKAASHDEVIRCEECRR 236
Query 236 ILLR 239
IL+R
Sbjct 237 ILVR 240
>gi|84496761|ref|ZP_00995615.1| Zn-ribbon protein-like protein [Janibacter sp. HTCC2649]
gi|84383529|gb|EAP99410.1| Zn-ribbon protein-like protein [Janibacter sp. HTCC2649]
Length=245
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/239 (39%), Positives = 132/239 (56%), Gaps = 0/239 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA QQ LL+L LD L++IAH LPQ A ++ + + +D++ R D
Sbjct 1 MKAEPGQQSKLLDLQALDTRLSQIAHARKTLPQLAEIADLEGKASLLDDQLVRSRTELSD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEV 120
+ +V + + ++ VR R RD+ L SG AK L +QHEL+SL RRQ+ LED LEV
Sbjct 61 IQREVIKADGDVQQVRDRATRDQQRLDSGTGSAKDLTAIQHELESLARRQSELEDVELEV 120
Query 121 LERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPE 180
+ER E +Q+ + R + L + A D+ LAE+D + ++ RD L E
Sbjct 121 MERAEAVQSDVSELERGRGEITDRLTELEAARDKRLAELDADEAEVAAPRDGLVHAAGVE 180
Query 181 LAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLR 239
L LY++ RA G GA L+ RCG C++E+ L +I +A +DEV RC EC IL+R
Sbjct 181 LVALYDKIRATSGTGAAPLRQRRCGGCQLELNPVALREIKSAPQDEVHRCEECRRILVR 239
>gi|297561846|ref|YP_003680820.1| hypothetical protein Ndas_2904 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296846294|gb|ADH68314.1| protein of unknown function DUF164 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=253
Score = 139 bits (350), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/246 (41%), Positives = 140/246 (57%), Gaps = 8/246 (3%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
MKA A Q LL L LD +L R+ HRA LP+ A +++ + + + + D
Sbjct 1 MKAEPAAQARLLTLQDLDTDLQRLDHRARTLPEVAEAARLKERVDQIDSELITAQTGVSD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSG-ATDAKQLADLQHELDSLQRRQASLEDALLE 119
++ Q + ES++D VR R DRD L SG T+AK+L +LQ E+ SL RRQA LE+ +LE
Sbjct 61 VERQQRKAESDVDQVRTRADRDAKRLESGQITNAKELQNLQSEITSLGRRQAELEEIVLE 120
Query 120 VLERREELQ---AQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTAT 176
V+ER EEL A+ TAE L A ++ A + D A AEI R + + R+ +
Sbjct 121 VMERAEELNATVARLTAERERLVAEHTEVVARR---DTAAAEIQWDRTRTTQDRERVAGE 177
Query 177 LDPELAGLYERQRA-GGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGA 235
+ +L LY++ RA GG GA L+ RCG C++ + EL +I A A DEVVRC +C
Sbjct 178 IPADLLALYDKMRAQYGGVGAAPLRYGRCGGCKLALSTVELNEIRAQAADEVVRCEDCRR 237
Query 236 ILLRLE 241
IL+R E
Sbjct 238 ILVRTE 243
>gi|302545891|ref|ZP_07298233.1| putative zinc ribbon domain protein [Streptomyces hygroscopicus
ATCC 53653]
gi|302463509|gb|EFL26602.1| putative zinc ribbon domain protein [Streptomyces himastatinicus
ATCC 53653]
Length=247
Score = 138 bits (348), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 90/241 (38%), Positives = 136/241 (57%), Gaps = 2/241 (0%)
Query 1 MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAED 60
M A A Q LL++ LD L+++AH+ LP+ A + A+H D + A + D
Sbjct 1 MNAAPADQIRLLDVQGLDVRLSQLAHKRRTLPEHAELDTLTADHTQLRDLLVASQTEESD 60
Query 61 LDGQVSRFESEIDAVRKRGDRDRSLLTSGA-TDAKQLADLQHELDSLQRRQASLEDALLE 119
+ ++ E ++D VR+R RD+ L SGA T K L +LQHE+ SL RRQ+ LED +LE
Sbjct 61 TAREQTKAEQDVDQVRQRAARDQKRLDSGAVTSPKDLENLQHEIASLARRQSDLEDVVLE 120
Query 120 VLERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDP 179
V+ERRE Q + T + +++L++ ++ A D A+ EID + +R ++ T+
Sbjct 121 VMERREAAQERATELTGRVESLQSKISDATARRDAAVEEIDNEIATVTKERAVIAGTIPA 180
Query 180 ELAGLYERQRA-GGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILL 238
+L LY++ R GG GA RL RC CR E+ EL ++ +A D VVRC C IL+
Sbjct 181 DLLKLYDKLREQQGGVGAARLYQRRCDGCRQELAITELNEVRSAPADTVVRCENCRRILV 240
Query 239 R 239
R
Sbjct 241 R 241
Lambda K H
0.316 0.129 0.349
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 343201993260
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40