BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1716
Length=276
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608854|ref|NP_216232.1| hypothetical protein Rv1716 [Mycoba... 564 6e-159
gi|307079722|ref|ZP_07488892.1| hypothetical protein TMKG_02223 ... 563 1e-158
gi|15841176|ref|NP_336213.1| hypothetical protein MT1755 [Mycoba... 561 5e-158
gi|289443175|ref|ZP_06432919.1| conserved hypothetical protein [... 560 1e-157
gi|289574384|ref|ZP_06454611.1| conserved hypothetical protein [... 558 2e-157
gi|148822922|ref|YP_001287676.1| hypothetical protein TBFG_11731... 558 5e-157
gi|340626723|ref|YP_004745175.1| hypothetical protein MCAN_17271... 556 9e-157
gi|308231908|ref|ZP_07414239.2| putative cyclase superfamily [My... 525 2e-147
gi|240169425|ref|ZP_04748084.1| hypothetical protein MkanA1_0893... 524 8e-147
gi|308369505|ref|ZP_07418057.2| putative cyclase superfamily [My... 523 1e-146
gi|296164611|ref|ZP_06847178.1| conserved hypothetical protein [... 511 3e-143
gi|289750273|ref|ZP_06509651.1| conserved hypothetical protein [... 431 5e-119
gi|148551008|ref|YP_001260438.1| cyclase family protein [Sphingo... 412 3e-113
gi|169631566|ref|YP_001705215.1| hypothetical protein MAB_4492c ... 409 2e-112
gi|332306827|ref|YP_004434678.1| cyclase family protein [Glaciec... 385 5e-105
gi|109898636|ref|YP_661891.1| putative cyclase [Pseudoalteromona... 385 5e-105
gi|11498799|ref|NP_070028.1| hypothetical protein AF1200 [Archae... 321 6e-86
gi|118592789|ref|ZP_01550178.1| hypothetical protein SIAM614_290... 289 2e-76
gi|339442508|ref|YP_004708513.1| hypothetical protein CXIVA_1445... 233 3e-59
gi|87122102|ref|ZP_01077986.1| hypothetical protein MED121_04133... 199 4e-49
gi|310657511|ref|YP_003935232.1| hypothetical protein CLOST_0197... 198 9e-49
gi|339441306|ref|YP_004707311.1| hypothetical protein CXIVA_0242... 168 9e-40
gi|313902911|ref|ZP_07836307.1| cyclase family protein [Thermaer... 157 2e-36
gi|317122308|ref|YP_004102311.1| cyclase family protein [Thermae... 153 2e-35
gi|300855541|ref|YP_003780525.1| putative cyclase [Clostridium l... 145 5e-33
gi|307298666|ref|ZP_07578469.1| cyclase family protein [Thermoto... 144 1e-32
gi|269792029|ref|YP_003316933.1| cyclase family protein [Therman... 142 7e-32
gi|289524300|ref|ZP_06441154.1| putative cyclase [Anaerobaculum ... 140 3e-31
gi|312880791|ref|ZP_07740591.1| cyclase family protein [Aminomon... 137 2e-30
gi|221632605|ref|YP_002521826.1| putative polyketide cyclase [Th... 131 9e-29
gi|221632604|ref|YP_002521825.1| putative polyketide cyclase [Th... 131 1e-28
gi|338813078|ref|ZP_08625218.1| putative cyclase [Acetonema long... 129 4e-28
gi|150388525|ref|YP_001318574.1| cyclase family protein [Alkalip... 127 3e-27
gi|284046452|ref|YP_003396792.1| cyclase family protein [Conexib... 126 3e-27
gi|167770182|ref|ZP_02442235.1| hypothetical protein ANACOL_0152... 123 3e-26
gi|325972419|ref|YP_004248610.1| cyclase family protein [Spiroch... 119 3e-25
gi|345004382|ref|YP_004807235.1| cyclase family protein [halophi... 119 5e-25
gi|110667888|ref|YP_657699.1| cyclase [Haloquadratum walsbyi DSM... 119 7e-25
gi|339728827|emb|CCC40003.1| cyclase family protein [Haloquadrat... 118 1e-24
gi|158319910|ref|YP_001512417.1| cyclase family protein [Alkalip... 117 2e-24
gi|315425928|dbj|BAJ47578.1| cyclase family protein [Candidatus ... 117 3e-24
gi|320159994|ref|YP_004173218.1| hypothetical protein ANT_05840 ... 115 6e-24
gi|218961867|ref|YP_001741642.1| putative cyclase [Candidatus Cl... 110 3e-22
gi|78187925|ref|YP_375968.1| hypothetical protein Plut_2083 [Chl... 101 1e-19
gi|331696441|ref|YP_004332680.1| cyclase family protein [Pseudon... 100 2e-19
gi|20808973|ref|NP_624144.1| hypothetical protein TTE2628 [Therm... 97.8 1e-18
gi|193213662|ref|YP_001999615.1| cyclase family protein [Chlorob... 97.8 1e-18
gi|325291131|ref|YP_004267312.1| cyclase family protein [Syntrop... 97.4 2e-18
gi|254478157|ref|ZP_05091539.1| Putative cyclase superfamily pro... 96.7 3e-18
gi|310780252|ref|YP_003968584.1| cyclase family protein [Ilyobac... 95.1 1e-17
>gi|15608854|ref|NP_216232.1| hypothetical protein Rv1716 [Mycobacterium tuberculosis H37Rv]
gi|148661514|ref|YP_001283037.1| hypothetical protein MRA_1726 [Mycobacterium tuberculosis H37Ra]
gi|167968671|ref|ZP_02550948.1| hypothetical protein MtubH3_11798 [Mycobacterium tuberculosis
H37Ra]
gi|307084301|ref|ZP_07493414.1| hypothetical protein TMLG_00699 [Mycobacterium tuberculosis SUMu012]
gi|3261548|emb|CAA17613.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium tuberculosis H37Rv]
gi|148505666|gb|ABQ73475.1| hypothetical protein MRA_1726 [Mycobacterium tuberculosis H37Ra]
gi|308366090|gb|EFP54941.1| hypothetical protein TMLG_00699 [Mycobacterium tuberculosis SUMu012]
Length=276
Score = 564 bits (1453), Expect = 6e-159, Method: Compositional matrix adjust.
Identities = 276/276 (100%), Positives = 276/276 (100%), Gaps = 0/276 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKAV 276
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKAV
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKAV 276
>gi|307079722|ref|ZP_07488892.1| hypothetical protein TMKG_02223 [Mycobacterium tuberculosis SUMu011]
gi|308362471|gb|EFP51322.1| hypothetical protein TMKG_02223 [Mycobacterium tuberculosis SUMu011]
Length=276
Score = 563 bits (1450), Expect = 1e-158, Method: Compositional matrix adjust.
Identities = 275/275 (100%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|15841176|ref|NP_336213.1| hypothetical protein MT1755 [Mycobacterium tuberculosis CDC1551]
gi|31792904|ref|NP_855397.1| hypothetical protein Mb1744 [Mycobacterium bovis AF2122/97]
gi|121637624|ref|YP_977847.1| hypothetical protein BCG_1755 [Mycobacterium bovis BCG str. Pasteur
1173P2]
27 more sequence titles
Length=276
Score = 561 bits (1445), Expect = 5e-158, Method: Compositional matrix adjust.
Identities = 274/275 (99%), Positives = 274/275 (99%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|289443175|ref|ZP_06432919.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289569767|ref|ZP_06449994.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289416094|gb|EFD13334.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289543521|gb|EFD47169.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=276
Score = 560 bits (1442), Expect = 1e-157, Method: Compositional matrix adjust.
Identities = 273/275 (99%), Positives = 274/275 (99%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYA+SPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAFSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|289574384|ref|ZP_06454611.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289538815|gb|EFD43393.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=276
Score = 558 bits (1439), Expect = 2e-157, Method: Compositional matrix adjust.
Identities = 273/275 (99%), Positives = 273/275 (99%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GT IDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTQIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|148822922|ref|YP_001287676.1| hypothetical protein TBFG_11731 [Mycobacterium tuberculosis F11]
gi|253799245|ref|YP_003032246.1| hypothetical protein TBMG_02279 [Mycobacterium tuberculosis KZN
1435]
gi|254550727|ref|ZP_05141174.1| hypothetical protein Mtube_09759 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
11 more sequence titles
Length=276
Score = 558 bits (1437), Expect = 5e-157, Method: Compositional matrix adjust.
Identities = 273/275 (99%), Positives = 273/275 (99%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAW LGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWLLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|340626723|ref|YP_004745175.1| hypothetical protein MCAN_17271 [Mycobacterium canettii CIPT
140010059]
gi|340004913|emb|CCC44059.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=276
Score = 556 bits (1434), Expect = 9e-157, Method: Compositional matrix adjust.
Identities = 272/275 (99%), Positives = 273/275 (99%), Gaps = 0/275 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAV EYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVCEYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
F+AFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA
Sbjct 241 FSAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
>gi|308231908|ref|ZP_07414239.2| putative cyclase superfamily [Mycobacterium tuberculosis SUMu001]
gi|308379010|ref|ZP_07484672.2| putative cyclase superfamily [Mycobacterium tuberculosis SUMu010]
gi|308215652|gb|EFO75051.1| putative cyclase superfamily [Mycobacterium tuberculosis SUMu001]
gi|308358531|gb|EFP47382.1| putative cyclase superfamily [Mycobacterium tuberculosis SUMu010]
Length=258
Score = 525 bits (1353), Expect = 2e-147, Method: Compositional matrix adjust.
Identities = 256/257 (99%), Positives = 257/257 (100%), Gaps = 0/257 (0%)
Query 19 LSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD 78
+SHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD
Sbjct 1 MSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD 60
Query 79 EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA 138
EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA
Sbjct 61 EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA 120
Query 139 YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQT 198
YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQT
Sbjct 121 YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQT 180
Query 199 GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL 258
GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL
Sbjct 181 GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL 240
Query 259 VAIVDPTGSYRIETGKA 275
VAIVDPTGSYRIETGKA
Sbjct 241 VAIVDPTGSYRIETGKA 257
>gi|240169425|ref|ZP_04748084.1| hypothetical protein MkanA1_08939 [Mycobacterium kansasii ATCC
12478]
Length=274
Score = 524 bits (1349), Expect = 8e-147, Method: Compositional matrix adjust.
Identities = 253/273 (93%), Positives = 262/273 (96%), Gaps = 0/273 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
M F WPLG AES LEFYDLSHPWGHGAPAWPYFEDV+IERLHGMAKSRVLTQKITTVMHS
Sbjct 1 MAFTWPLGDAESKLEFYDLSHPWGHGAPAWPYFEDVKIERLHGMAKSRVLTQKITTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPK KWG+VTAEDL+ A+P+IRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKTKWGVVTAEDLEQASPEIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
+VNTGWHHKYADSAEYYAYSPGF K+AGEWFAAKGVKAVGTDTQALDHPLATAIAPH PA
Sbjct 121 IVNTGWHHKYADSAEYYAYSPGFYKEAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
EAQGGLLPWAV EYE QTGRKVLDDFP+WEPCHRAILS+GIYGFENVGGDLDKVTGKRVT
Sbjct 181 EAQGGLLPWAVSEYEQQTGRKVLDDFPEWEPCHRAILSKGIYGFENVGGDLDKVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETG 273
FAAFPWRWVGGDGCIVRLVAI DPTGSYRIETG
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIADPTGSYRIETG 273
>gi|308369505|ref|ZP_07418057.2| putative cyclase superfamily [Mycobacterium tuberculosis SUMu002]
gi|308370799|ref|ZP_07422776.2| putative cyclase superfamily [Mycobacterium tuberculosis SUMu003]
gi|308372035|ref|ZP_07427142.2| putative cyclase superfamily [Mycobacterium tuberculosis SUMu004]
11 more sequence titles
Length=258
Score = 523 bits (1348), Expect = 1e-146, Method: Compositional matrix adjust.
Identities = 255/257 (99%), Positives = 256/257 (99%), Gaps = 0/257 (0%)
Query 19 LSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD 78
+SHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD
Sbjct 1 MSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLD 60
Query 79 EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA 138
EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA
Sbjct 61 EIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA 120
Query 139 YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQT 198
YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPH PAEAQGGLLPWAVREYEAQT
Sbjct 121 YSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHGPAEAQGGLLPWAVREYEAQT 180
Query 199 GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL 258
GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL
Sbjct 181 GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRL 240
Query 259 VAIVDPTGSYRIETGKA 275
VAIVDPTGSYRIETGKA
Sbjct 241 VAIVDPTGSYRIETGKA 257
>gi|296164611|ref|ZP_06847178.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900030|gb|EFG79469.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=274
Score = 511 bits (1317), Expect = 3e-143, Method: Compositional matrix adjust.
Identities = 245/273 (90%), Positives = 259/273 (95%), Gaps = 0/273 (0%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
M+ WPLG AE+ LEFYDLSHPWGHGAPAWPYFEDV+IERLH MA+SRVLTQK+TTVMHS
Sbjct 1 MSITWPLGEAEAALEFYDLSHPWGHGAPAWPYFEDVKIERLHNMARSRVLTQKVTTVMHS 60
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIV 120
GTHIDAPAHVVEGTPFL EIPLSAFFGTGVVVSIPK KWG+VTAEDL+ A P+IRPGDIV
Sbjct 61 GTHIDAPAHVVEGTPFLHEIPLSAFFGTGVVVSIPKDKWGVVTAEDLEKAAPEIRPGDIV 120
Query 121 VVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPA 180
VVNTGWHHKYADSAEYYAYSPGF K+AGEWFAAKGVKAVGTDTQALDHPLAT+IAPH PA
Sbjct 121 VVNTGWHHKYADSAEYYAYSPGFYKEAGEWFAAKGVKAVGTDTQALDHPLATSIAPHGPA 180
Query 181 EAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVT 240
E GGLLPWAVREYEAQTGR+VLDDFP+WEPCHRAILS+GIYGFENVGGDLD+VTGKRVT
Sbjct 181 EHNGGLLPWAVREYEAQTGRRVLDDFPEWEPCHRAILSKGIYGFENVGGDLDQVTGKRVT 240
Query 241 FAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETG 273
FAAFPWRWVGGDGCIVRLVAIVDPTG YRIETG
Sbjct 241 FAAFPWRWVGGDGCIVRLVAIVDPTGGYRIETG 273
>gi|289750273|ref|ZP_06509651.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690860|gb|EFD58289.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=246
Score = 431 bits (1109), Expect = 5e-119, Method: Compositional matrix adjust.
Identities = 218/245 (89%), Positives = 222/245 (91%), Gaps = 5/245 (2%)
Query 36 VQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGT-----PFLDEIPLSAFFGTGV 90
+QIERLHGMAKSRVLTQ+ H + P V T FLDEIPLSAFFGTGV
Sbjct 1 MQIERLHGMAKSRVLTQRSPKDHHPSCNSRHPHRRVGVTWWKEHRFLDEIPLSAFFGTGV 60
Query 91 VVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYAYSPGFDKKAGEW 150
VVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYA+SPGFDKKAGEW
Sbjct 61 VVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYAFSPGFDKKAGEW 120
Query 151 FAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQTGRKVLDDFPDWE 210
FAAKGVKAVGTDTQALDHPLATAIAPH PAEAQGGLLPWAVREYEAQTGRKVLDDFPDWE
Sbjct 121 FAAKGVKAVGTDTQALDHPLATAIAPHGPAEAQGGLLPWAVREYEAQTGRKVLDDFPDWE 180
Query 211 PCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVDPTGSYRI 270
PCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVDPTGSYRI
Sbjct 181 PCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVDPTGSYRI 240
Query 271 ETGKA 275
ETGKA
Sbjct 241 ETGKA 245
>gi|148551008|ref|YP_001260438.1| cyclase family protein [Sphingomonas wittichii RW1]
gi|148503419|gb|ABQ71671.1| cyclase family protein [Sphingomonas wittichii RW1]
Length=290
Score = 412 bits (1059), Expect = 3e-113, Method: Compositional matrix adjust.
Identities = 199/261 (77%), Positives = 219/261 (84%), Gaps = 1/261 (0%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
LEFYDLSHPWG G P WPYFEDV+IERLHGM++S VLTQKITTVMHSGTHI+APAHVV G
Sbjct 28 LEFYDLSHPWGLGQPCWPYFEDVKIERLHGMSRSGVLTQKITTVMHSGTHINAPAHVVPG 87
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
TPF+DE+PL FFGTGVVVSIPK KW ++TAEDL+NA P IR GDIV+VNTGWH Y D+
Sbjct 88 TPFMDEVPLPYFFGTGVVVSIPKKKWEVITAEDLENARPQIREGDIVIVNTGWHKYYGDN 147
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
YYAYSPGF K+AGEWF K VK G+DTQALDHPL TAI PH A GL+P E
Sbjct 148 RHYYAYSPGFYKEAGEWFVQKKVKMCGSDTQALDHPLGTAIGPHGTG-APHGLIPQVNIE 206
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
YE TGRKV++DFP+WEPCH AILS GI GFENVGGD+DKVTGKRVTFAAFPWRW GDG
Sbjct 207 YEQLTGRKVIEDFPEWEPCHNAILSAGICGFENVGGDIDKVTGKRVTFAAFPWRWKKGDG 266
Query 254 CIVRLVAIVDPTGSYRIETGK 274
CIVRLVAIVDPTG++RIETGK
Sbjct 267 CIVRLVAIVDPTGNFRIETGK 287
>gi|169631566|ref|YP_001705215.1| hypothetical protein MAB_4492c [Mycobacterium abscessus ATCC
19977]
gi|169243533|emb|CAM64561.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=264
Score = 409 bits (1051), Expect = 2e-112, Method: Compositional matrix adjust.
Identities = 198/262 (76%), Positives = 215/262 (83%), Gaps = 1/262 (0%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
L+FYDLSHPWG G P WPYFEDV+IERLH MA+S VLTQKITTVMHSGTHIDAPAHVV G
Sbjct 3 LQFYDLSHPWGLGTPCWPYFEDVKIERLHNMARSGVLTQKITTVMHSGTHIDAPAHVVPG 62
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
TPFL+E+PL FFGTGVVVSIPK KW ++TAEDL+NA P IR GDIV+VNTGWH Y D+
Sbjct 63 TPFLEEVPLPNFFGTGVVVSIPKKKWEVITAEDLENARPQIREGDIVIVNTGWHRYYGDN 122
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
YYAY+PGF K AGEWF + VK VG+DTQALDHPL TAI PH A GL+P E
Sbjct 123 RHYYAYAPGFYKDAGEWFVERKVKMVGSDTQALDHPLGTAIGPHGTG-APNGLIPQVNLE 181
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
YE TGRKV++DFP WEPCH AILS GI GFENVGGD+DKVTGKRVTFAAFPWRW GDG
Sbjct 182 YEQLTGRKVIEDFPYWEPCHNAILSNGILGFENVGGDIDKVTGKRVTFAAFPWRWKKGDG 241
Query 254 CIVRLVAIVDPTGSYRIETGKA 275
CIVRLVAI DPTG YRIETG A
Sbjct 242 CIVRLVAITDPTGEYRIETGTA 263
>gi|332306827|ref|YP_004434678.1| cyclase family protein [Glaciecola agarilytica 4H-3-7+YE-5]
gi|332174156|gb|AEE23410.1| cyclase family protein [Glaciecola sp. 4H-3-7+YE-5]
Length=291
Score = 385 bits (988), Expect = 5e-105, Method: Compositional matrix adjust.
Identities = 184/266 (70%), Positives = 211/266 (80%), Gaps = 5/266 (1%)
Query 9 AAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPA 68
+ ++ ++FYDLSH WG G P WPYFEDV+IERLHG ++S VLTQKITTVMHSGTHIDAPA
Sbjct 27 SNDTGMQFYDLSHEWGLGQPCWPYFEDVKIERLHGHSRSGVLTQKITTVMHSGTHIDAPA 86
Query 69 HVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHH 128
HVVEGTPF+D++PL FFG GVVVSIPK KW ++TAEDL+N P IR GDIV++NTGWHH
Sbjct 87 HVVEGTPFMDQMPLPRFFGAGVVVSIPKKKWEVITAEDLENVRPKIREGDIVIINTGWHH 146
Query 129 KYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLP 188
YADSAEYY Y PG ++AGEW A K VK VG D QALDHPL TAI PH G L+P
Sbjct 147 TYADSAEYYHYGPGLYREAGEWLAKKKVKMVGIDVQALDHPLGTAIGPH----GTGPLIP 202
Query 189 WAVREYEAQT-GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWR 247
EY T GR + +DFPDWEPCHRA+L+ GI G ENVGG+LDKVTGKR T AAFPWR
Sbjct 203 HLEDEYREFTGGRGIKEDFPDWEPCHRALLNAGICGIENVGGELDKVTGKRCTLAAFPWR 262
Query 248 WVGGDGCIVRLVAIVDPTGSYRIETG 273
W GGDGC+VRLVA+VDP+G YRIE G
Sbjct 263 WKGGDGCMVRLVAMVDPSGEYRIEQG 288
>gi|109898636|ref|YP_661891.1| putative cyclase [Pseudoalteromonas atlantica T6c]
gi|109700917|gb|ABG40837.1| putative cyclase [Pseudoalteromonas atlantica T6c]
Length=291
Score = 385 bits (988), Expect = 5e-105, Method: Compositional matrix adjust.
Identities = 184/266 (70%), Positives = 210/266 (79%), Gaps = 5/266 (1%)
Query 9 AAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPA 68
+ ++ ++FYDLSH WG G P WPYFEDV+IERLHG ++S VLTQKITTVMHSGTHIDAPA
Sbjct 27 SNDTGMQFYDLSHEWGLGQPCWPYFEDVKIERLHGHSRSGVLTQKITTVMHSGTHIDAPA 86
Query 69 HVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHH 128
HVVEGTPF+D++PL FFG GVVVSIPK KW ++TAEDL+N P IR GDIV++NTGWHH
Sbjct 87 HVVEGTPFMDQMPLPRFFGAGVVVSIPKKKWEVITAEDLENVRPKIREGDIVIINTGWHH 146
Query 129 KYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLP 188
YADSAEYY Y PG ++AGEW A K VK VG D QALDHPL TAI PH G L+P
Sbjct 147 TYADSAEYYHYGPGLYREAGEWLAKKKVKMVGIDVQALDHPLGTAIGPH----GTGPLIP 202
Query 189 WAVREYEAQT-GRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWR 247
EY T GR + +DFPDWEPCHRA+L+ GI G ENVGG+LDKVTGKR T AAFPWR
Sbjct 203 HLEDEYREFTGGRGIKEDFPDWEPCHRALLNSGICGIENVGGELDKVTGKRCTIAAFPWR 262
Query 248 WVGGDGCIVRLVAIVDPTGSYRIETG 273
W GGDGC+VRLVA+VDP G YRIE G
Sbjct 263 WKGGDGCMVRLVAMVDPKGEYRIEQG 288
>gi|11498799|ref|NP_070028.1| hypothetical protein AF1200 [Archaeoglobus fulgidus DSM 4304]
gi|2649387|gb|AAB90047.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length=278
Score = 321 bits (823), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 155/251 (62%), Positives = 187/251 (75%), Gaps = 1/251 (0%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+E YDLS+P+G+G P WPYF DV I+R H AKSRVL+Q ITT MH TH DAP HV EG
Sbjct 26 VEVYDLSNPFGYGVPLWPYFNDVIIDRYHYHAKSRVLSQIITTTMHVSTHADAPIHVEEG 85
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
P +DE+P+ + G GVVVSIPK KW ++TAEDL+ A P I GDIV+VNTGWH ++DS
Sbjct 86 FPSIDEVPIERYMGEGVVVSIPKKKWEVITAEDLEKADPPIEKGDIVIVNTGWHRYWSDS 145
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSP-AEAQGGLLPWAVR 192
+Y+ Y+PGF K+AGEWF K VKAVG DTQALDHPLAT A H P A+ + +LPW
Sbjct 146 VKYFCYAPGFYKEAGEWFVKKKVKAVGIDTQALDHPLATREAWHHPGADERNSILPWLKE 205
Query 193 EYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGD 252
EY+ TGR V +DFP WEPCHR +L+ GI G+ENVGGD+DKVTG+R T P RWV GD
Sbjct 206 EYKQLTGRDVSEDFPYWEPCHRLLLTHGIMGWENVGGDIDKVTGQRCTIIGLPIRWVKGD 265
Query 253 GCIVRLVAIVD 263
G IVRLVA+V+
Sbjct 266 GSIVRLVALVE 276
>gi|118592789|ref|ZP_01550178.1| hypothetical protein SIAM614_29031 [Stappia aggregata IAM 12614]
gi|118434559|gb|EAV41211.1| hypothetical protein SIAM614_29031 [Stappia aggregata IAM 12614]
Length=279
Score = 289 bits (740), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/264 (58%), Positives = 188/264 (72%), Gaps = 7/264 (2%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+EFY+LSH +G P WPYF+DVQI+R H MAKS VL+Q ITT MH THIDAPAHVV+G
Sbjct 20 VEFYNLSHRYGFQCPNWPYFQDVQIDRKHYMAKSGVLSQTITTTMHVTTHIDAPAHVVQG 79
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPD-IRPGDIVVVNTGWHHKYAD 132
TPF+DE+PL FFG+G+VVSIPK KW +T +DL+ A IR D++++NTGWH +Y D
Sbjct 80 TPFIDEVPLPHFFGSGLVVSIPKKKWEQITGDDLEKACGHAIRKNDVLIINTGWHKQYED 139
Query 133 SAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVR 192
+Y+AY PG A +W KG+K VG DTQA DHPLATAI P + G +LP
Sbjct 140 -GDYFAYCPGLVPSAADWIVEKGIKVVGHDTQANDHPLATAIGP----QRNGPILPHLEA 194
Query 193 EY-EAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGG 251
EY E GR +DFP+WEP H+ + S GI G ENVGGDLD+VTGKR TFA FPW W G
Sbjct 195 EYKEWSGGRGWEEDFPEWEPVHQKLFSNGILGIENVGGDLDEVTGKRCTFAFFPWNWDRG 254
Query 252 DGCIVRLVAIVDPTGSYRIETGKA 275
DGCI+RLVA++D YRIE G++
Sbjct 255 DGCIIRLVAMIDKGQQYRIEAGES 278
>gi|339442508|ref|YP_004708513.1| hypothetical protein CXIVA_14450 [Clostridium sp. SY8519]
gi|338901909|dbj|BAK47411.1| hypothetical protein CXIVA_14450 [Clostridium sp. SY8519]
Length=305
Score = 233 bits (594), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 114/257 (45%), Positives = 155/257 (61%), Gaps = 9/257 (3%)
Query 11 ESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTV-MHSGTHIDAPAH 69
+ E DLS+P+G G P WP D I+R+ M L Q + MH+ TH D+P+H
Sbjct 34 DGKFELVDLSNPFGRGNPLWPSNGDFHIDRVQHMPMHYRLLQTFNSFHMHNSTHADSPSH 93
Query 70 VVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPD----IRPGDIVVVNTG 125
V+ +PF E+P+ +FG V + IPKGKW +++ ED++NA I+ GD V++NTG
Sbjct 94 VIPESPFTHELPIENYFGEAVCLDIPKGKWELISVEDIENAAKKVPGGIKEGDWVLLNTG 153
Query 126 WHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGG 185
H ++ ++ +Y+AYSPG + WF V+ VG D QA+DH L T A H P G
Sbjct 154 THRRWGENDDYFAYSPGLSIEGAHWFVDHHVRGVGFDMQAIDHILYTYAAEHGP----GP 209
Query 186 LLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFP 245
+P AV EYE + G +DFP+WEPCH +LS + G EN+GGDLDK+T +R F AFP
Sbjct 210 YVPRAVEEYEEEFGHPAKEDFPEWEPCHDILLSNNVMGIENLGGDLDKLTNQRFLFCAFP 269
Query 246 WRWVGGDGCIVRLVAIV 262
RW GDG IVR VA V
Sbjct 270 LRWYMGDGTIVRAVAFV 286
>gi|87122102|ref|ZP_01077986.1| hypothetical protein MED121_04133 [Marinomonas sp. MED121]
gi|86162649|gb|EAQ63930.1| hypothetical protein MED121_04133 [Marinomonas sp. MED121]
Length=311
Score = 199 bits (505), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 112/287 (40%), Positives = 154/287 (54%), Gaps = 41/287 (14%)
Query 18 DLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVE----- 72
DLSHP+G P WPYF+ +IE +HG++K+ VLTQK+ VMH GTH D+P HV+E
Sbjct 8 DLSHPFGSNMPVWPYFKKPKIETMHGLSKAGVLTQKVDFVMHCGTHADSPRHVLEHEFDG 67
Query 73 -GTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNA------TPDIRPGDIVVVNTG 125
+ E+ L ++G V + + +WG+++A DL +A T + G I+++ TG
Sbjct 68 KRARYTHEMELDEYYGDAVCLDVKIHQWGLISAADLDDAVERSAVTHEELEGMIIIIRTG 127
Query 126 WHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGG 185
H K+ DS +Y+ Y G +AG WF VK VG D QALDHPL T++ + G
Sbjct 128 MHLKWDDSKDYFHYCAGTGVEAGHWFVKHKVKTVGLDQQALDHPLHTSMGLNGTNMNLVG 187
Query 186 LLPWAVRE-------------YEAQT-----GRKVLDDF-----------PDWEPCHRAI 216
+ E + +T G+ D+ WEPCH+ +
Sbjct 188 RSSKPITEEYIEKFGEEAYAWFHKETFIKLHGQAAYDEMYGLIEEKAGCVGTWEPCHKLM 247
Query 217 LSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVD 263
L GI G+ENVGGDLDKV GKR FP RWV GDG +VRLVA +D
Sbjct 248 LGNGITGWENVGGDLDKVVGKRFKIMGFPIRWVEGDGSMVRLVAEID 294
>gi|310657511|ref|YP_003935232.1| hypothetical protein CLOST_0197 [Clostridium sticklandii DSM
519]
gi|308824289|emb|CBH20327.1| conserved protein of unknown function [Clostridium sticklandii]
Length=312
Score = 198 bits (503), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 114/288 (40%), Positives = 151/288 (53%), Gaps = 42/288 (14%)
Query 18 DLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVE----- 72
DL+HP+ P WPYF +I+ +H +AKS VLTQKI VMH GTH DAP HV+E
Sbjct 8 DLTHPFHADIPVWPYFAKPKIDTMHNLAKSGVLTQKIDVVMHCGTHADAPRHVMEYEFDG 67
Query 73 -GTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATP--DIRP----GDIVVVNTG 125
+ E+PL A++G V + I +WG++TA+ L +A +I+P G ++ + TG
Sbjct 68 KRARYTHEMPLDAYYGDAVCLDIKVDRWGLITADHLYDACKRANIKPEELEGMVICLRTG 127
Query 126 WHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHS---PAEA 182
H K+ D+ EYY YS G ++AGEWFA K V D QALDHPL T + +
Sbjct 128 MHLKFDDTREYYHYSCGTGREAGEWFAKYKPKCVAMDMQALDHPLHTTMGKNGGYVGMNL 187
Query 183 QGGLLPWAVREYEAQTGRKVLDDFPD---------------------------WEPCHRA 215
G EY + G + +F WEPCH+
Sbjct 188 IGNSGKPITDEYIEKFGIEAYAEFNKDTFIDVFGKDKYMEAYGMLENHGLEGTWEPCHKL 247
Query 216 ILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVD 263
++ GI G EN+GGDLDKV KR F AFP RW GDG +VR VA +D
Sbjct 248 MMGNGIVGVENLGGDLDKVVNKRFKFMAFPIRWWLGDGSMVRCVAEID 295
>gi|339441306|ref|YP_004707311.1| hypothetical protein CXIVA_02420 [Clostridium sp. SY8519]
gi|338900707|dbj|BAK46209.1| uncharacterized ACR protein [Clostridium sp. SY8519]
Length=345
Score = 168 bits (425), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 110/310 (36%), Positives = 151/310 (49%), Gaps = 53/310 (17%)
Query 13 TLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVE 72
+ + DL+HP+G P WPYF+ I+ H MAK VLTQ I MH+GTH DAP HV+E
Sbjct 3 NMVYVDLTHPFGAEIPRWPYFDKPVIDSKHSMAKGGVLTQYIGCTMHTGTHCDAPRHVME 62
Query 73 ------GTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNAT------PDIRPGD-- 118
+ E+P+ A+ G V + I +WG++T + L +A PD G+
Sbjct 63 VEFDGKRARYTHEMPIDAYTGDAVALDIQIERWGLITGKHLDDACRRMGIDPDPAKGELE 122
Query 119 --IVVVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAP 176
+V + TG + + D+ EYY YS G +AG+WF VK V D QALDHPL TA+
Sbjct 123 NKVVCLVTGMNQLFDDTKEYYHYSCGTGVEAGQWFVDHKVKCVAMDMQALDHPLHTAMGN 182
Query 177 HSPAE-----AQGGLLPWAVREYEAQTGRKVLDDFP------------------------ 207
+ A G + +E + D F
Sbjct 183 NGMTRMNLLGASGKPITEEYKELFGEEAYAEFDKFEYIRLHGQEAYDKKFGALEAIGCWG 242
Query 208 DWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVD---- 263
WEPCH+ +L GI G EN+GGD++KV GK+ F FP RW GDG + R VA +D
Sbjct 243 TWEPCHKTMLGHGIVGVENLGGDIEKVKGKKFKFFCFPLRWYMGDGSMARCVAYIDEDDI 302
Query 264 ----PTGSYR 269
PT +Y+
Sbjct 303 NKDVPTRTYK 312
>gi|313902911|ref|ZP_07836307.1| cyclase family protein [Thermaerobacter subterraneus DSM 13965]
gi|313466846|gb|EFR62364.1| cyclase family protein [Thermaerobacter subterraneus DSM 13965]
Length=265
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/263 (35%), Positives = 137/263 (53%), Gaps = 19/263 (7%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+E DLSHPWG PA+ ++ ++ + A RV QKI T +H GTH+DAP H + G
Sbjct 13 VELIDLSHPWGVNTPAFAGYDGPVVKWIKRPAFDRVGGQKIETTLHVGTHLDAPIHFITG 72
Query 74 TPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYA 131
+ +PL FG GVVV I G + + T E + ++R GDI++++TG+HH Y
Sbjct 73 GKDIASLPLDRLFGPGVVVDISDEVGDYDIYTPEHITRKV-EVRKGDILIIHTGYHHYYN 131
Query 132 -----DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGL 186
D Y+ PG ++ EW + ++ +G D + DHP+ T I L
Sbjct 132 HGDRPDEERYFCKHPGPTREFAEWALSMELRWIGIDAGSADHPMNTVIR---------KL 182
Query 187 LPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAF 244
P RE E + GR + + FPD ++ H + + ENVGGD+D+V +RV F
Sbjct 183 RPDLAREAERKLGRPLDEIFPDHEFQLMHNFLFPHDLVHVENVGGDIDRVLNQRVWIGCF 242
Query 245 PWRWVGGDGCIVRLVAIVDPTGS 267
PW++ GG+ R+VA V G+
Sbjct 243 PWKFEGGEAAFCRVVAFVQKQGA 265
>gi|317122308|ref|YP_004102311.1| cyclase family protein [Thermaerobacter marianensis DSM 12885]
gi|315592288|gb|ADU51584.1| cyclase family protein [Thermaerobacter marianensis DSM 12885]
Length=275
Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/260 (35%), Positives = 135/260 (52%), Gaps = 19/260 (7%)
Query 12 STLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVV 71
S +E DLSHPW PA+ ++ ++ + A RV QKI T +H GTH+DAP H +
Sbjct 24 SHVELIDLSHPWSVHTPAFAGYDGPVVKWIKRPAFDRVGGQKIETTLHVGTHLDAPIHFI 83
Query 72 EGTPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHK 129
G + +PL FG GVVV I G + + T E + ++R GDI++++TG+HH
Sbjct 84 TGGKDIASLPLDRLFGPGVVVDISDEVGDYDIYTPEHITRKV-EVRKGDILIIHTGYHHF 142
Query 130 YA-----DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQG 184
Y D Y+ PG ++ EW + ++ +G D + DHP+ T I
Sbjct 143 YNHGDRPDEERYFCKHPGPTREFAEWALSMELRWIGVDAGSADHPMNTVIR--------- 193
Query 185 GLLPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFA 242
L P RE E + GR + + FPD ++ H + + ENVGGD+D+V +RV
Sbjct 194 KLRPDLAREAERKLGRPLDEIFPDHEFQLMHNFLFPHDLVHVENVGGDIDRVLNQRVWIG 253
Query 243 AFPWRWVGGDGCIVRLVAIV 262
FPW++ GG+ R+VA V
Sbjct 254 CFPWKFEGGEAAFCRVVAFV 273
>gi|300855541|ref|YP_003780525.1| putative cyclase [Clostridium ljungdahlii DSM 13528]
gi|300435656|gb|ADK15423.1| putative cyclase [Clostridium ljungdahlii DSM 13528]
Length=263
Score = 145 bits (367), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/260 (34%), Positives = 131/260 (51%), Gaps = 17/260 (6%)
Query 12 STLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVV 71
++ YDL+ H P WP +E +QI+ ++ + Q +T H GTH+D H
Sbjct 8 KNVQMYDLTQKISHLTPPWPTYEPLQIKFFKRLSSNGANGQVLTHSNHVGTHLDGSLHFC 67
Query 72 EGTPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHK 129
+ IPL GVVV I +G+ T++D+ D+R GDI++++TG+H K
Sbjct 68 THGRDISSIPLEELVAPGVVVDISDIAEDYGIYTSKDIMQRA-DVRKGDILIIHTGYH-K 125
Query 130 YA------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQ 183
YA D PG K+ +W K +G D + DHP+ T I +P EA+
Sbjct 126 YAWDEPEADEERVMMRHPGPTKEFSKWCRKMEFKWLGVDCGSADHPMNTKIREWAPKEAK 185
Query 184 GGLLPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTF 241
A + A+ G+ + D +PD ++ H + I EN+GGD+DKV GKR+
Sbjct 186 K-----AEKYLRAKYGKGITDFWPDEDYQLMHYDLFPYNIVHAENLGGDIDKVLGKRLVI 240
Query 242 AAFPWRWVGGDGCIVRLVAI 261
FPWR+VGG+ CI R+VA
Sbjct 241 GCFPWRFVGGESCICRIVAF 260
>gi|307298666|ref|ZP_07578469.1| cyclase family protein [Thermotogales bacterium MesG1.Ag.4.2]
gi|306915831|gb|EFN46215.1| cyclase family protein [Thermotogales bacterium MesG1.Ag.4.2]
Length=263
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/257 (33%), Positives = 131/257 (51%), Gaps = 17/257 (6%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+ Y+LS P H P WP +E +QI+ +A + Q +T H GTH+D H
Sbjct 9 IRIYELSQPISHLTPPWPTYEPLQIKFFKRLAPNGANGQLLTHSNHVGTHLDGSLHFCTH 68
Query 74 TPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYA 131
+ IPL+ GV+V + +G+ T++D++ +++ GDI+++NTG+H KYA
Sbjct 69 GRDIASIPLNELVAPGVIVDLSDIAEDYGIYTSKDIEERV-EVKEGDILIINTGYH-KYA 126
Query 132 ------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGG 185
D Y PG ++ EW K +K +G D + DHP+ T I P +A+
Sbjct 127 YDQPEADEVRYMIKHPGPTREFAEWCKMKKIKWIGVDCGSADHPMNTKIREWMPVQAKE- 185
Query 186 LLPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAA 243
+ G+ + + FPD ++ H + I ENVGG++DK+ KR+
Sbjct 186 ----CDHYMHEKYGKSLEEIFPDEDYQLMHVLLFPYDIIHAENVGGEIDKILDKRMVIGC 241
Query 244 FPWRWVGGDGCIVRLVA 260
FPWR+VGG+ CI R+VA
Sbjct 242 FPWRFVGGESCISRIVA 258
>gi|269792029|ref|YP_003316933.1| cyclase family protein [Thermanaerovibrio acidaminovorans DSM
6589]
gi|269099664|gb|ACZ18651.1| cyclase family protein [Thermanaerovibrio acidaminovorans DSM
6589]
Length=267
Score = 142 bits (357), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 85/261 (33%), Positives = 132/261 (51%), Gaps = 16/261 (6%)
Query 13 TLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVE 72
++ YDL+ P H PAWP +E +Q++ +A + Q +T H GTH+D P H
Sbjct 12 NIKVYDLTIPISHLTPAWPTYEPLQVKFFKRLAPNGANGQLLTHSNHVGTHLDGPLHFCT 71
Query 73 GTPFLDEIPLSAFF-GTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWH-- 127
+ + L F G GVVV I +G+ T++D++ +I GDI+++NTG+H
Sbjct 72 HGCDIASLELKDFLVGPGVVVDISDIAEDYGIYTSKDIEERA-EIHDGDILIINTGYHRY 130
Query 128 ---HKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQG 184
AD Y PG + +W + +K +G D + DHP+ T I P A+
Sbjct 131 GWDQPEADEVRYMVMHPGPTNEFAQWCKKRKIKWIGVDCGSADHPMNTKIREWMPYHAK- 189
Query 185 GLLPWAVREYEAQTGRKVLDDFP--DWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFA 242
A + + G+ + D FP D++ H + I E +GGD+D ++GKRVT
Sbjct 190 ----MADKHLREKYGKGLDDFFPPEDYQLMHIDLFPHNIIHAECLGGDIDLLSGKRVTIG 245
Query 243 AFPWRWVGGDGCIVRLVAIVD 263
FPWR+ GG+ CI R+VA +
Sbjct 246 CFPWRFEGGESCISRIVAFAE 266
>gi|289524300|ref|ZP_06441154.1| putative cyclase [Anaerobaculum hydrogeniformans ATCC BAA-1850]
gi|289502472|gb|EFD23636.1| putative cyclase [Anaerobaculum hydrogeniformans ATCC BAA-1850]
Length=268
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 89/261 (35%), Positives = 132/261 (51%), Gaps = 18/261 (6%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
++ YDLS P P WP +E +Q++ +A + Q +T H GTH+D P H
Sbjct 14 VKIYDLSIPISQLTPPWPTYEPLQVKFFKRLAPNGANGQLLTHSNHVGTHLDGPLHFCTH 73
Query 74 TPFLDEIPLSAFF-GTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWH--- 127
+ +PL F G GVVV + +G+ T++D++ D+ GDI+++NTG+H
Sbjct 74 GDDIASLPLQGFLVGPGVVVDLSDIAEDFGVYTSKDIEERA-DVHEGDILIINTGYHRYG 132
Query 128 --HKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGG 185
AD Y PG K+ EW K +K +G D + DHP+ T I P A+
Sbjct 133 FDQPEADEVRYMVMHPGPTKEFAEWCKKKKIKWIGVDCGSADHPMNTKIREWMPQYAK-- 190
Query 186 LLPWAVREYEAQTGRKVLDD-FP--DWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFA 242
EY + K LD+ FP D++ H A+ I E +GGD+DK+ GKRV
Sbjct 191 ----LADEYMRKKYNKTLDEMFPAEDYQIMHIALFPDNIIHAECLGGDIDKLLGKRVIVG 246
Query 243 AFPWRWVGGDGCIVRLVAIVD 263
FPW++ GG+ CI R+VA +
Sbjct 247 CFPWKFQGGESCISRIVAFTE 267
>gi|312880791|ref|ZP_07740591.1| cyclase family protein [Aminomonas paucivorans DSM 12260]
gi|310784082|gb|EFQ24480.1| cyclase family protein [Aminomonas paucivorans DSM 12260]
Length=267
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 89/274 (33%), Positives = 139/274 (51%), Gaps = 19/274 (6%)
Query 1 MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHS 60
M+ W L + L+ YDLS P H PAWP +E +QI+ +A + Q +T H
Sbjct 1 MSNNWEL-KNWNDLKVYDLSIPISHLTPAWPTYEPLQIKFFKRLAPNGANGQLLTHSNHV 59
Query 61 GTHIDAPAHVVEGTPFLDEIPLSAFF-GTGVVVSIPK--GKWGMVTAEDLQNATPDIRPG 117
GTH+D P H + + L + G GVVV + +G+ ++D+++ ++ G
Sbjct 60 GTHLDGPLHFCTHGGDIASLELKNYLVGPGVVVDLSDMAEDYGIYGSKDIEDRA-EVHDG 118
Query 118 DIVVVNTGWHHKY------ADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLA 171
DI+++NTG+H KY AD Y PG + W + +K +G D + DHP+
Sbjct 119 DILIINTGYH-KYGWDQPEADEIRYMIKHPGPTLEFAHWCEKRKIKWLGVDCGSADHPMN 177
Query 172 TAIAPHSPAEAQGGLLPWAVREYEAQTGRKVLDDFP--DWEPCHRAILSQGIYGFENVGG 229
T I P A+ A + + G+ + D FP D++ H + I E +GG
Sbjct 178 TKIREWMPQYAK-----LADAHLKGKYGKGLDDFFPPEDYQLMHIELFPHNIIHAECLGG 232
Query 230 DLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVD 263
D+D ++GKRVT FPW++VGG+ CI R+VA +
Sbjct 233 DIDLLSGKRVTIGCFPWKFVGGESCISRIVAFAE 266
>gi|221632605|ref|YP_002521826.1| putative polyketide cyclase [Thermomicrobium roseum DSM 5159]
gi|221156178|gb|ACM05305.1| putative polyketide cyclase [Thermomicrobium roseum DSM 5159]
Length=273
Score = 131 bits (330), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 79/266 (30%), Positives = 127/266 (48%), Gaps = 30/266 (11%)
Query 18 DLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFL 77
DLS W P + +E ++ + +A R Q I++ +H GTH+DAP H + G +
Sbjct 12 DLSQDWDIHTPGFALYEGPTVKWIKRVAFERAGGQWISSTLHVGTHLDAPLHFITGGQDI 71
Query 78 DEIPLSAFFGTGVVVSIPK---GKWGMVTAEDLQNATPD----IRPGDIVVVNTGWHHKY 130
IPL+ G +V + + G + + E + D I+PGDI+V++TG+HH Y
Sbjct 72 AAIPLNKLVGWACIVDLTRYGIGDYDIYGPEHFEQWERDTGIRIQPGDILVIHTGYHHYY 131
Query 131 A------------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHS 178
D Y+ PG + EW + + + D + DHP T I
Sbjct 132 PSDWATDPALRQPDETRYFIKHPGPTRAFAEWVLRRQISWLAVDCASADHPFNTVIR--- 188
Query 179 PAEAQGGLLPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTG 236
+ + L+P E+EA+ G+ + + PD ++ H A+ G+ EN GG++DKV
Sbjct 189 --KIRADLVP----EFEAKHGKSISELLPDSDYQVMHFALFPHGVIHIENAGGEIDKVLN 242
Query 237 KRVTFAAFPWRWVGGDGCIVRLVAIV 262
+R+ FPWR+ GG+ R VA V
Sbjct 243 RRIMVGCFPWRFKGGEAAFCRFVAFV 268
>gi|221632604|ref|YP_002521825.1| putative polyketide cyclase [Thermomicrobium roseum DSM 5159]
gi|221157014|gb|ACM06141.1| putative polyketide cyclase [Thermomicrobium roseum DSM 5159]
Length=264
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/265 (31%), Positives = 133/265 (51%), Gaps = 19/265 (7%)
Query 13 TLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVE 72
+++ YDLS P WP++ +++ + A+ V Q I T H GTH+DAP H +
Sbjct 3 SVKLYDLSQPLNQEVSFWPFYPPFEVKYIKRKAEHGVNAQYIMTSNHMGTHLDAPRHFIT 62
Query 73 GTPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKY 130
+D+IPL +G GV+V + + + T E ++ ++R GDI+ ++TGW H+Y
Sbjct 63 NGKTIDQIPLEWLYGPGVIVDLSDVLDELDIFTPEMIEQRV-EVREGDILFIHTGW-HRY 120
Query 131 A------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQG 184
A D +Y PG +W AK +K G D + DHP+ I +G
Sbjct 121 AQFGETPDEEKYLLRHPGPHPSIVDWLIAKKIKIWGVDMVSTDHPMNLPIGRFL---GRG 177
Query 185 GLLPWA-VREY-EAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDK--VTGKR 238
GL W VR E + G K+ + FP+ ++ H A+ EN+GG++ + + +R
Sbjct 178 GLEQWKRVRAICERKFGEKLTELFPEEHYQLTHNALFPHDCIHVENLGGEIGRRELHNRR 237
Query 239 VTFAAFPWRWVGGDGCIVRLVAIVD 263
+T FPW + GG+ R+VA V+
Sbjct 238 LTLGVFPWLFKGGEAAFCRVVAFVE 262
>gi|338813078|ref|ZP_08625218.1| putative cyclase [Acetonema longum DSM 6540]
gi|337274956|gb|EGO63453.1| putative cyclase [Acetonema longum DSM 6540]
Length=261
Score = 129 bits (325), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/255 (32%), Positives = 123/255 (49%), Gaps = 17/255 (6%)
Query 16 FYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTP 75
YDL+ H P WP +E +QI+ ++ + Q ITT H GTH+D H
Sbjct 10 MYDLTQKLSHLTPPWPTYEPLQIKFFKRLSSNGANGQVITTSNHVGTHLDGSLHFCTHGR 69
Query 76 FLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNT-----GWHH 128
+ IPL+ G GVVV + +G+ T++D+ + ++R GDI++++T GW
Sbjct 70 DIASIPLNDLIGPGVVVDLSDICEDYGVYTSKDITDRV-EVRKGDILLIHTGYMKYGWDQ 128
Query 129 KYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLP 188
AD Y PG ++ +W +K +G D + DHP+ T I P +A
Sbjct 129 PEADEVRYMVKHPGPTREFSQWCRKMEIKWLGVDAGSADHPMNTKIREWCPKQAA----- 183
Query 189 WAVREYEAQTGRKVLDDF--PD-WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFP 245
Y + K LD+ PD ++ H + I ENVGG+L KV KR+ +P
Sbjct 184 -ECDSYMQKKFGKSLDEMFPPDHYQLMHVDLFPYDIIHAENVGGELQKVLNKRLIIGCYP 242
Query 246 WRWVGGDGCIVRLVA 260
WR+ GG+ I R+VA
Sbjct 243 WRFEGGESSICRIVA 257
>gi|150388525|ref|YP_001318574.1| cyclase family protein [Alkaliphilus metalliredigens QYMF]
gi|149948387|gb|ABR46915.1| cyclase family protein [Alkaliphilus metalliredigens QYMF]
Length=267
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/258 (32%), Positives = 124/258 (49%), Gaps = 22/258 (8%)
Query 17 YDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPF 76
YDL+ H P WP +E +QI+ ++ Q ITT H GTH+D P H
Sbjct 13 YDLTQNLSHLTPPWPTYEPLQIKFFKRLSPHGANGQLITTSNHVGTHLDGPLHFDTAGRD 72
Query 77 LDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKY---- 130
+ +PL G GVVV + +G+ T +D+ + +++ GDI+++NTG+ HKY
Sbjct 73 IASLPLDKLVGPGVVVDLSDIAEDYGIYTPKDITDRV-EVKKGDILIINTGY-HKYGWDQ 130
Query 131 --ADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLP 188
AD Y PG +W +K +G D + DHP+ T I P EA
Sbjct 131 PEADERRYMLRHPGPSMDFIDWIKEMEIKWIGVDCGSADHPMNTKIREWEPGEA------ 184
Query 189 WAVREYEAQTGRKVLDDFPDWEPCHRAILSQ------GIYGFENVGGDLDKVTGKRVTFA 242
Y + K L++ W ++A+ ++ I ENVGG L++V +R+
Sbjct 185 LQADAYLQEKYGKALEEIYSWPQTYQAMHTKVFPKPYEIIHAENVGGQLNEVLNRRLIIG 244
Query 243 AFPWRWVGGDGCIVRLVA 260
FPW++VGG+ I R++A
Sbjct 245 CFPWKFVGGESSICRILA 262
>gi|284046452|ref|YP_003396792.1| cyclase family protein [Conexibacter woesei DSM 14684]
gi|283950673|gb|ADB53417.1| cyclase family protein [Conexibacter woesei DSM 14684]
Length=264
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/270 (29%), Positives = 128/270 (48%), Gaps = 53/270 (19%)
Query 13 TLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLT-----------QKITTVMHSG 61
T E YDLS P+ P++ ++E+ + + + T ++ + H+G
Sbjct 22 TREVYDLSLPFRRDMPSYYFYENRYQPPMFTVFSHKEGTPLGPETKDGYVTHVSFLTHTG 81
Query 62 THIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATP----DIRPG 117
TH+DAP H + +L E+P + G G ++S+PK + +TAEDL+ A D+RPG
Sbjct 82 THVDAPRHFRDDGQYLHEVPADRWLGEGPILSVPKEEMEPITAEDLERACAESGLDVRPG 141
Query 118 DIVVVNTGWHHKYA-------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPL 170
DIV +NTGWH ++ + EY +PG +++ EW AKG+ V D+ A+D
Sbjct 142 DIVGINTGWHRRFCGPGEDRDQAIEYMERNPGLSRESAEWLVAKGIVTVMIDSPAID--- 198
Query 171 ATAIAPHSPAEAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGD 230
+P+ +++ H A+ ++ + E +GG
Sbjct 199 ------------CARFMPYGDSSFQS----------------HYALFAENVPAVEGLGGQ 230
Query 231 LDKVTGKRVTFAAFPWRWVGGDGCIVRLVA 260
LD+VTGKR + P R+ GD +R++A
Sbjct 231 LDEVTGKRCLISCAPVRYENGDAFPLRVLA 260
>gi|167770182|ref|ZP_02442235.1| hypothetical protein ANACOL_01525 [Anaerotruncus colihominis
DSM 17241]
gi|167667504|gb|EDS11634.1| hypothetical protein ANACOL_01525 [Anaerotruncus colihominis
DSM 17241]
Length=266
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/266 (31%), Positives = 129/266 (49%), Gaps = 24/266 (9%)
Query 12 STLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVV 71
S ++ YDLS P G P WP +E +Q++ +A + Q +T H GTH+D P H
Sbjct 5 SDIKMYDLSIPIGILTPPWPTYEPMQMKFFKRLAPNGANGQLVTHSNHVGTHLDGPLHFD 64
Query 72 EGTPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHK 129
+ + L+ GVVV I +G+ T +DL + +IR GDI+++NTG+ HK
Sbjct 65 TAGRDIASLELTKLCAPGVVVDISDMGQDFGIYTPKDLMDRA-EIRKGDILIINTGY-HK 122
Query 130 Y------ADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQ 183
Y AD Y PG +W +K +G D + DHP+ T I P EA+
Sbjct 123 YGFDQPTADERRYMLRHPGPSMDFVQWIRDMEIKWIGVDCGSADHPMNTKIRDWEPMEAE 182
Query 184 GGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQ-------GIYGFENVGGDLDKVTG 236
A + + K L++ +W ++A+ Q I+ E +GG++DK++
Sbjct 183 ------ACDKIFMERYGKHLNEIYEWPKNYQAMHIQLFCQPYEAIHA-ECLGGEIDKLSN 235
Query 237 KRVTFAAFPWRWVGGDGCIVRLVAIV 262
+R FPW++ G+ CI R+VA
Sbjct 236 QRCVIGCFPWKFTEGESCISRIVAFT 261
>gi|325972419|ref|YP_004248610.1| cyclase family protein [Spirochaeta sp. Buddy]
gi|324027657|gb|ADY14416.1| cyclase family protein [Spirochaeta sp. Buddy]
Length=291
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 77/272 (29%), Positives = 131/272 (49%), Gaps = 17/272 (6%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
++ DL+ P G G P WP + +Q++ +A + Q +T H GTH+D H
Sbjct 1 MKVIDLTIPLGVGTPPWPTYIPLQVQYFKRLAPNGANGQVVTHSNHVGTHLDGEIHFYTP 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHK-- 129
+ ++ + G +V + G + + T++ ++ +++ GDI++++TG+HH
Sbjct 61 GKDIAQLDMDFLVHEGAIVDLSDVCGDYDVYTSKMVEERV-EVKEGDILLIHTGYHHYGW 119
Query 130 ---YADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGL 186
D Y PG D++ EW K ++ +G D + DHP+ T I P EA+
Sbjct 120 DQPTGDEIRYMIKHPGPDREFAEWAKKKKLRWIGVDCGSADHPMNTKIRDWMPKEAKQ-- 177
Query 187 LPWAVREYEAQTGRKVLDDF---PDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAA 243
++ + G K LD+F ++ H + + GI E VGGDLD + +R
Sbjct 178 ---CDAHFKEKYG-KPLDEFFSEDKYQLMHIEMFNHGIIHAECVGGDLDLLLNQRAVIGC 233
Query 244 FPWRWVGGDGCIVRLVAIVDPTGSYRIETGKA 275
+PWR+V G+ I R+VA VD R+ KA
Sbjct 234 YPWRFVDGESSIARIVAHVDDDRYERLMAKKA 265
>gi|345004382|ref|YP_004807235.1| cyclase family protein [halophilic archaeon DL31]
gi|344320008|gb|AEN04862.1| cyclase family protein [halophilic archaeon DL31]
Length=254
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 71/259 (28%), Positives = 128/259 (50%), Gaps = 21/259 (8%)
Query 15 EFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGT 74
E +DL+ PW PAWP +++ ++ + +V QKI + H+GTH+D H +
Sbjct 6 EMHDLTQPWCGDTPAWPTYDNPKVWYEKSLDTEKVNGQKIEFMNHTGTHLDGEKHFIGHG 65
Query 75 PFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTG-----WH 127
++ +PL G V+ I G + + T+E +++ D+R GDI+ ++TG WH
Sbjct 66 RDIESMPLDELVGDAVIADISDKVGDYDVFTSEMIEDVV-DVREGDILYIHTGYQKYAWH 124
Query 128 HKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATA---IAPHSPAEAQG 184
+AD +++ PG +++ +W KG+ + D + DHP+ T + P + AEA+
Sbjct 125 TDHADPHKFFIKHPGPNQEFADWCREKGLNYLIVDCGSADHPMNTVVRDVRPDAAAEARE 184
Query 185 GLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAF 244
L + E + G +++ H + +GI EN +++ +RV F
Sbjct 185 ALGVDDLDEIFPEEGYQLM---------HTELFPEGIIHVENAICP-EELLNERVQIGTF 234
Query 245 PWRWVGGDGCIVRLVAIVD 263
PWR+ GG+ + R VA +
Sbjct 235 PWRFRGGESSVCRCVAFTE 253
>gi|110667888|ref|YP_657699.1| cyclase [Haloquadratum walsbyi DSM 16790]
gi|109625635|emb|CAJ52066.1| probable cyclase [Haloquadratum walsbyi DSM 16790]
Length=252
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 75/256 (30%), Positives = 123/256 (49%), Gaps = 19/256 (7%)
Query 15 EFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGT 74
E YDL+ PW PAWP +++ +I + +V QKI + H+GTH+D H +
Sbjct 6 EMYDLTQPWSQETPAWPTYDNPKIWYEKSLDTEKVNGQKIEFMNHTGTHLDGEKHFIAHG 65
Query 75 PFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTG-----WH 127
+ ++ L+ G GV+ I G + + T+E ++ A D+R GDI+ ++TG WH
Sbjct 66 RDIADMSLNELVGDGVIADISDQVGDYDIYTSEMIEQAA-DVRKGDILFIHTGYQDHAWH 124
Query 128 HKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLL 187
+ AD +++ PG + + EW + + D + DHP+ T I P A+
Sbjct 125 REEADPHKFFCKHPGPNAEFAEWCKEMEINYLILDCGSADHPMNTVIRDIRPELARE--- 181
Query 188 PWAVREYEAQTGRKVLDDFP--DWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFP 245
A +E +V FP ++ H + +GI EN ++ G+RV FP
Sbjct 182 --AAEHFEVDDLDEV---FPPEGYQLMHTELFPEGIVHVENAQVPT-ELLGERVQIGTFP 235
Query 246 WRWVGGDGCIVRLVAI 261
WR+ GG+ + R VA
Sbjct 236 WRFRGGESSVSRCVAF 251
>gi|339728827|emb|CCC40003.1| cyclase family protein [Haloquadratum walsbyi C23]
Length=252
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/256 (29%), Positives = 123/256 (49%), Gaps = 19/256 (7%)
Query 15 EFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGT 74
E YDL+ PW PAWP +++ +I + +V QKI + H+GTH+D H +
Sbjct 6 EMYDLTQPWSQETPAWPTYDNPKIWYEKSLDTEKVNGQKIEFMNHTGTHLDGEKHFIAHG 65
Query 75 PFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTG-----WH 127
+ ++ L+ G GV+ I G + + T+E ++ A D+R GDI+ ++TG WH
Sbjct 66 RDIADMSLNELVGDGVIADISDQVGDYDIYTSEMIEQAA-DVRKGDILFIHTGYQDHAWH 124
Query 128 HKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLL 187
+ AD +++ PG + + EW + + D + DHP+ T I P A+
Sbjct 125 REEADPHKFFCKHPGPNAEFAEWCKEMEINYLILDCGSADHPMNTVIRDIRPELARE--- 181
Query 188 PWAVREYEAQTGRKVLDDFP--DWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFP 245
A +E ++ FP ++ H + +GI EN ++ G+RV FP
Sbjct 182 --AAEHFEVDDLDEI---FPPEGYQLMHTELFPEGIVHVENAQVPT-ELLGERVQIGTFP 235
Query 246 WRWVGGDGCIVRLVAI 261
WR+ GG+ + R VA
Sbjct 236 WRFRGGESSVSRCVAF 251
>gi|158319910|ref|YP_001512417.1| cyclase family protein [Alkaliphilus oremlandii OhILAs]
gi|158140109|gb|ABW18421.1| cyclase family protein [Alkaliphilus oremlandii OhILAs]
Length=264
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/264 (29%), Positives = 122/264 (47%), Gaps = 22/264 (8%)
Query 12 STLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVV 71
+ ++ YDL+ H P WP +E +Q++ ++ + Q IT H GTH+D P H
Sbjct 5 NNVKMYDLTQNTSHLTPPWPTYEPLQVKFFKRLSPNGANGQVITVSNHVGTHLDGPLHFD 64
Query 72 EGTPFLDEIPLSAFFGTGVVVSIP--KGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHK 129
+ + L G GVVV + + + T +D+ + +++ GDI+++NTG+ HK
Sbjct 65 TAGRDIASLELEKLVGPGVVVDLSDISEDFSIYTPQDIMDRV-EVKKGDILIINTGY-HK 122
Query 130 Y------ADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQ 183
Y AD Y PG +W ++ +G D + DHP+ T I P EA+
Sbjct 123 YGWDQPEADERRYMLRHPGPSLDFMDWVKEMEIRWIGVDCGSADHPMNTKIRDWEPGEAK 182
Query 184 GGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAI------LSQGIYGFENVGGDLDKVTGK 237
Y + K L++ W ++A+ I EN+GG +D+V K
Sbjct 183 ------RCDAYMREKYGKGLEEMYPWPDVYQAMHIHVFPKPHEIIHAENLGGQIDEVLNK 236
Query 238 RVTFAAFPWRWVGGDGCIVRLVAI 261
RV FPW++ GG+ R+VA
Sbjct 237 RVIVGCFPWKFQGGESAFCRIVAF 260
>gi|315425928|dbj|BAJ47578.1| cyclase family protein [Candidatus Caldiarchaeum subterraneum]
gi|343484722|dbj|BAJ50376.1| cyclase family protein [Candidatus Caldiarchaeum subterraneum]
Length=288
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/262 (30%), Positives = 125/262 (48%), Gaps = 23/262 (8%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
++ YDLS P AP + ++ + + +++ V I +H GTH D H + G
Sbjct 32 VKIYDLSQPTSTKAPPFMWYPPFKATWIKRLSEHNVNAMYIEGPLHHGTHFDGQLHFMTG 91
Query 74 TPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATP----DIRPGDIVVVNTG-- 125
+ I ++ F G GVVV I + G + + T E ++ A +IRP DI+++ TG
Sbjct 92 GKDIASIDINYFIGEGVVVDISRKVGDYDIYTPETIEGAAKEAGLEIRPDDILIIYTGYS 151
Query 126 ---WHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEA 182
W + D Y PG D + +W K ++ +G D + DHPL T I P
Sbjct 152 RYAWCGQEPDEVRYICKHPGPDVRFAKWCIEKKIRWLGVDAASQDHPLNTIIRRARPD-- 209
Query 183 QGGLLPWAVREYEAQTGRKVLDDFP---DWEPCHRAILSQGIYGFENVGGDLDKVTGKRV 239
V E E + G+K+ + P +++ H + + I EN+GGD+DKV KR
Sbjct 210 -------LVAEAEKKWGKKIDELMPWPENYQVMHTMLFPKMILHAENLGGDIDKVANKRC 262
Query 240 TFAAFPWRWVGGDGCIVRLVAI 261
A P+++ GG+ R++AI
Sbjct 263 LIMAPPFKFEGGESAYCRVIAI 284
>gi|320159994|ref|YP_004173218.1| hypothetical protein ANT_05840 [Anaerolinea thermophila UNI-1]
gi|319993847|dbj|BAJ62618.1| hypothetical protein ANT_05840 [Anaerolinea thermophila UNI-1]
Length=293
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 73/259 (29%), Positives = 125/259 (49%), Gaps = 15/259 (5%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
++ DL+ P G G PAWP +E +Q++ +A + Q +T H GTH+D H
Sbjct 1 MKLIDLTIPLGIGTPAWPTYEPLQVKYFKRLAPNGANGQLLTHSNHLGTHLDGEIHFYTP 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTGWHH--- 128
+ + + +V + G + + T++ ++ +++ GDI+V++TG+HH
Sbjct 61 GKDMASLTMDYLVHEAAIVDLSDVCGDYDVYTSKMIEERV-EVKEGDILVIHTGYHHFGW 119
Query 129 --KYADSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGL 186
AD Y PG D++ EW K ++ + D + DHP+ T I P +A+
Sbjct 120 DMPTADEVRYMVKHPGPDREFAEWAKKKKLRWIAVDCGSADHPMNTIIRTWMPRQAKE-- 177
Query 187 LPWAVREYEAQTGRKVLDDFPD--WEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAF 244
A ++ + G + + F D ++ H + I E GG++D + +RVT F
Sbjct 178 ---AEYVFKKKYGMSLEEFFSDDKYQLMHIEMFPHEIIHAECFGGEIDLLLNQRVTVGFF 234
Query 245 PWRWVGGDGCIVRLVAIVD 263
PWR+V G+ I R VA VD
Sbjct 235 PWRFVDGEASIGRAVAFVD 253
>gi|218961867|ref|YP_001741642.1| putative cyclase [Candidatus Cloacamonas acidaminovorans]
gi|167730524|emb|CAO81436.1| putative cyclase [Candidatus Cloacamonas acidaminovorans]
Length=291
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/267 (30%), Positives = 126/267 (48%), Gaps = 28/267 (10%)
Query 12 STLEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSR----VLTQKITTVMHSGTHIDAP 67
+L+ YDL+ P P WP + + I+ +A + Q ITT H GTHID
Sbjct 32 QSLKMYDLTQPLSIHTPPWPSYMPLGIQYFKRIAGAHSGQGANGQIITTSNHVGTHIDGE 91
Query 68 AHVVEGTPFLDEIPLSAFFGTGVVVSIPK--GKWGMVTAEDLQNATPDIRPGDIVVVNTG 125
H + ++P+ + G GVVV I + + + E L +I+ GDI+++NTG
Sbjct 92 IHFFASGRSIGQVPMEEWIGPGVVVDISDSVNDYDLYSPELLMQKA-EIKKGDILIINTG 150
Query 126 WHHKYA------DSAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIA---P 176
+ H+YA D Y+ PG D W +K +G D + DHP+ T I P
Sbjct 151 Y-HRYAWDQPESDELRYFVKHPGPDPSFHTWALDMHLKWIGVDCGSADHPMNTIIRNWHP 209
Query 177 HSPAEAQGGLLPWAVREYEAQTGRKVLDDFPD---WEPCHRAILSQGIYGFENVGGDLDK 233
S EA+ L+ A+ G+ + FP ++ H + + + EN+GGD+DK
Sbjct 210 GSFNEAEQKLI--------AKYGKTWDEMFPPEEYYQVMHLKLFPKKLVHAENLGGDIDK 261
Query 234 VTGKRVTFAAFPWRWVGGDGCIVRLVA 260
++ KRV FP R + + + R++A
Sbjct 262 ISNKRVWIGLFPLRGIELESSMCRIMA 288
>gi|78187925|ref|YP_375968.1| hypothetical protein Plut_2083 [Chlorobium luteolum DSM 273]
gi|78167827|gb|ABB24925.1| Kynurenine formamidase [Chlorobium luteolum DSM 273]
Length=217
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/244 (28%), Positives = 103/244 (43%), Gaps = 38/244 (15%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+ +DLSH G P WP ++ G L ++ + H+GTH+D P H++
Sbjct 1 MRIHDLSHSIAEGMPLWPASPVTRVRDAAGYGTEGYLEREYSFSSHAGTHVDLPLHMLPE 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
LD PL AF G G V+ G+VTA + P D ++++TGW ++ S
Sbjct 61 GRSLDACPLEAFAGRGFVLDAAPENGGVVTATVIAAGAPPEGSCDFLLIHTGWSSRWG-S 119
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
Y+ P ++A +KG+K +G D+ ++D PL
Sbjct 120 PSYFEACPYLQEEAALLLVSKGLKGIGIDSPSIDPPLG---------------------- 157
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
D P HR +L G+ EN+ G L + GKR F+AFP + G +
Sbjct 158 --------------DAYPSHRILLGHGLVVVENLTG-LFPLIGKRFLFSAFPLKIAGAEA 202
Query 254 CIVR 257
VR
Sbjct 203 SPVR 206
>gi|331696441|ref|YP_004332680.1| cyclase family protein [Pseudonocardia dioxanivorans CB1190]
gi|326951130|gb|AEA24827.1| cyclase family protein [Pseudonocardia dioxanivorans CB1190]
Length=224
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/258 (30%), Positives = 110/258 (43%), Gaps = 36/258 (13%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLT-QKITTVMHSGTHIDAPAHVVE 72
+ D+SH G P P +V+ ++ +A L IT +H GTHIDAPAH V+
Sbjct 1 MALVDVSHQLWPGMPKIPILPEVERHQVARIADGAPLNISAITLALHVGTHIDAPAHAVD 60
Query 73 GTPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYAD 132
G +DE+P+ F GTGVV + + +T +D+ P R G+ ++V TGW ++
Sbjct 61 GAKTIDELPIERFAGTGVVAKVDRKPGEEITVDDVLAGGPAPRRGEFLLVATGWSERFL- 119
Query 133 SAEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVR 192
+ YA P W +GV VG D I P P +G +
Sbjct 120 -SPDYADHPSLSPDLAAWCVEQGVPFVGVDM----------ITPDLPVHRRGEGFDY--- 165
Query 193 EYEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGD 252
P HR +L + EN+ DL+ + G+RV A+P GGD
Sbjct 166 ------------------PVHRTLLGAEVLIAENL-TDLEGLGGRRVHVHAYPLAIRGGD 206
Query 253 GCIVRLVAIVD-PTGSYR 269
R+V D P G R
Sbjct 207 AGPARVVVDTDLPDGETR 224
>gi|20808973|ref|NP_624144.1| hypothetical protein TTE2628 [Thermoanaerobacter tengcongensis
MB4]
gi|20517639|gb|AAM25748.1| uncharacterized ACR, predicted metal-dependent hydrolases [Thermoanaerobacter
tengcongensis MB4]
Length=210
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/250 (29%), Positives = 112/250 (45%), Gaps = 40/250 (16%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+ DLSH G P +P ++++ERL + K ++ V+H GTH DAPAH +E
Sbjct 1 MRMIDLSHFIEEGMPQYPGQPEIKVERLVEVEKDGYQLTELKYVVHLGTHCDAPAHFIEK 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
++++P+ + G V+V +P ++ E L+ D++ GDIV+ TG KY
Sbjct 61 GDTIEKLPVDFYSGEAVIVDVPHLPDRLMRPELLEGV--DLKEGDIVIFRTGM-SKYWRE 117
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
Y P ++ + K VKA+G DT I+P P E +
Sbjct 118 EAYIKEFPYLTEELAHFLVDKKVKAIGLDT----------ISP-DPVETE---------- 156
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
DF P H +L + EN+ +L+ + KR F A P + G DG
Sbjct 157 -----------DF----PVHHVLLGNKVGIIENL-TNLEAIDKKRFLFIALPLKIKGSDG 200
Query 254 CIVRLVAIVD 263
VR VAI++
Sbjct 201 SPVRAVAILE 210
>gi|193213662|ref|YP_001999615.1| cyclase family protein [Chlorobaculum parvum NCIB 8327]
gi|193087139|gb|ACF12415.1| cyclase family protein [Chlorobaculum parvum NCIB 8327]
Length=217
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/249 (26%), Positives = 102/249 (41%), Gaps = 38/249 (15%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+ DLSHP P WP + L + + + + H+GTH+DAPAH+ EG
Sbjct 1 MRIVDLSHPISPAMPVWPGTPAPEFSDLCTVGRDGFGERWMQLSSHTGTHLDAPAHLFEG 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
LD + + F G G ++ + G+V+ + L+ P I D ++++ GW ++ +
Sbjct 61 AASLDRMSVERFIGKGALLDLRGASSGLVSLDQLRVIQPSIEKADFLLLHVGW-SRFWGT 119
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
AEY P +A W A G+K VG D + D P + A+
Sbjct 120 AEYDRNYPVLSSEAATWLAGLGLKGVGIDAPSFDDPDSEAL------------------- 160
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
P HR +L G+ EN+ LD++ + P G +
Sbjct 161 -----------------PIHRCLLGSGLLLIENLTA-LDQLGDSDFLLSVLPLPISGAEA 202
Query 254 CIVRLVAIV 262
VR VA++
Sbjct 203 SPVRAVAVI 211
>gi|325291131|ref|YP_004267312.1| cyclase family protein [Syntrophobotulus glycolicus DSM 8271]
gi|324966532|gb|ADY57311.1| cyclase family protein [Syntrophobotulus glycolicus DSM 8271]
Length=209
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/248 (27%), Positives = 108/248 (44%), Gaps = 40/248 (16%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
+E DLSHP P +P E +IE + M ++ H+GTH+D P HV +
Sbjct 1 MEVVDLSHPIREDMPVFPGEEQPKIEIVADMEHCGYHEKRFLLNSHTGTHLDVPKHVFQD 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
L++ P+ + G +++++ G + E+L +R D ++VNTGW + S
Sbjct 61 GYSLEKYPVKKYIGQAIMITLIDS--GRIEIEELAPYENALRDCDFMLVNTGWSRHWG-S 117
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
A+YY P F ++A +W ++ +K +G D+ ++D + GL
Sbjct 118 AQYYGDPPYFSREAADWLSSFELKGIGIDSPSVDQ------------MSDQGL------- 158
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
P HRA+L + I EN+ + D++ T P G D
Sbjct 159 -----------------PVHRALLEKEIVIIENM-TNFDQLKKPVFTLYCLPLNIEGADA 200
Query 254 CIVRLVAI 261
C VR VA+
Sbjct 201 CPVRAVAV 208
>gi|254478157|ref|ZP_05091539.1| Putative cyclase superfamily protein [Carboxydibrachium pacificum
DSM 12653]
gi|214035886|gb|EEB76578.1| Putative cyclase superfamily protein [Carboxydibrachium pacificum
DSM 12653]
Length=208
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/246 (30%), Positives = 111/246 (46%), Gaps = 40/246 (16%)
Query 18 DLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFL 77
DLSH G P +P ++++ERL + K ++ V+H GTH DAPAH +E +
Sbjct 3 DLSHFIEEGMPQYPGQPEIKVERLAEVEKDGYQLTELKDVVHLGTHCDAPAHFIEKGDTI 62
Query 78 DEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYY 137
+++P+ + G V+V +P ++ E L+ D++ GDIV+ TG KY Y
Sbjct 63 EKLPVDFYSGEAVIVDVPHLPDRLMRPELLEGI--DLKVGDIVIFRTGM-SKYWREEAYI 119
Query 138 AYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQ 197
P ++ + K VKA+G DT I+P P E +
Sbjct 120 KEFPYLTEELAHFLVDKKVKAIGLDT----------ISP-DPVETE-------------- 154
Query 198 TGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVR 257
DF P H +L + EN+ +L+ + KR F A P + G DG VR
Sbjct 155 -------DF----PVHHILLGNKVGIIENL-TNLEAIDKKRFLFIALPLKIKGSDGSPVR 202
Query 258 LVAIVD 263
VAI++
Sbjct 203 AVAILE 208
>gi|310780252|ref|YP_003968584.1| cyclase family protein [Ilyobacter polytropus DSM 2926]
gi|309749575|gb|ADO84236.1| cyclase family protein [Ilyobacter polytropus DSM 2926]
Length=211
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/250 (28%), Positives = 106/250 (43%), Gaps = 40/250 (16%)
Query 14 LEFYDLSHPWGHGAPAWPYFEDVQIERLHGMAKSRVLTQKITTVMHSGTHIDAPAHVVEG 73
++ DL+H P +P E + E + + K +KIT H+GTH+DAP H++
Sbjct 1 MKIVDLTHEIRENMPVFPGSECPKFESIGILEKDGFEEKKITIYSHTGTHMDAPKHIIPY 60
Query 74 TPFLDEIPLSAFFGTGVVVSIPKGKWGMVTAEDLQNATPDIRPGDIVVVNTGWHHKYADS 133
LDE F G GVVV +G+ ++ + L I D +++NTGW +
Sbjct 61 GKGLDEFSADKFLGKGVVVD-ARGE-SSISLDLLIEYEEKIEKSDFILINTGWDRNWG-K 117
Query 134 AEYYAYSPGFDKKAGEWFAAKGVKAVGTDTQALDHPLATAIAPHSPAEAQGGLLPWAVRE 193
YY P KKA +W ++K +K +G D ++D V
Sbjct 118 ENYYNGFPCMTKKAAQWLSSKKIKGLGIDAISVD----------------------PVNS 155
Query 194 YEAQTGRKVLDDFPDWEPCHRAILSQGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDG 253
YE H L + I EN+ +K+ GK+ F+A P + DG
Sbjct 156 YELVN--------------HNIFLKKEIVIIENLKIP-EKLHGKKFLFSALPLKTENSDG 200
Query 254 CIVRLVAIVD 263
+R VAI+D
Sbjct 201 SPIRAVAILD 210
Lambda K H
0.319 0.137 0.447
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 435544382258
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40