BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1718
Length=272
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608856|ref|NP_216234.1| hypothetical protein Rv1718 [Mycoba... 543 9e-153
gi|289554509|ref|ZP_06443719.1| conserved hypothetical protein [... 543 1e-152
gi|340626725|ref|YP_004745177.1| hypothetical protein MCAN_17291... 540 1e-151
gi|240169427|ref|ZP_04748086.1| hypothetical protein MkanA1_0894... 491 4e-137
gi|289750275|ref|ZP_06509653.1| conserved hypothetical protein [... 445 3e-123
gi|289569769|ref|ZP_06449996.1| LOW QUALITY PROTEIN: conserved h... 444 7e-123
gi|289443177|ref|ZP_06432921.1| LOW QUALITY PROTEIN: conserved h... 444 8e-123
gi|54025237|ref|YP_119479.1| hypothetical protein nfa32680 [Noca... 415 4e-114
gi|31792906|ref|NP_855399.1| hypothetical protein Mb1746 [Mycoba... 393 2e-107
gi|225174973|ref|ZP_03728970.1| protein of unknown function DUF8... 227 2e-57
gi|319653634|ref|ZP_08007733.1| hypothetical protein HMPREF1013_... 221 6e-56
gi|157363093|ref|YP_001469860.1| hypothetical protein Tlet_0226 ... 215 7e-54
gi|160902539|ref|YP_001568120.1| hypothetical protein Pmob_1078 ... 211 9e-53
gi|158321617|ref|YP_001514124.1| hypothetical protein Clos_2597 ... 210 2e-52
gi|310777946|ref|YP_003966279.1| 3-keto-5-aminohexanoate cleavag... 209 3e-52
gi|150388158|ref|YP_001318207.1| hypothetical protein Amet_0318 ... 209 4e-52
gi|89902644|ref|YP_525115.1| hypothetical protein Rfer_3885 [Rho... 207 2e-51
gi|188586499|ref|YP_001918044.1| 3-keto-5-aminohexanoate cleavag... 205 8e-51
gi|239617355|ref|YP_002940677.1| protein of unknown function DUF... 204 1e-50
gi|340753474|ref|ZP_08690255.1| transposase [Fusobacterium sp. 2... 202 3e-50
gi|154248908|ref|YP_001409733.1| hypothetical protein Fnod_0209 ... 202 3e-50
gi|294783512|ref|ZP_06748836.1| conserved hypothetical protein [... 202 5e-50
gi|169633210|ref|YP_001706946.1| hypothetical protein ABSDF1527 ... 202 5e-50
gi|262066230|ref|ZP_06025842.1| conserved hypothetical protein [... 202 6e-50
gi|217076540|ref|YP_002334256.1| hypothetical protein THA_422 [T... 201 1e-49
gi|150020098|ref|YP_001305452.1| hypothetical protein Tmel_0190 ... 201 1e-49
gi|307298174|ref|ZP_07577978.1| protein of unknown function DUF8... 201 1e-49
gi|218778117|ref|YP_002429435.1| hypothetical protein Dalk_0258 ... 200 2e-49
gi|309389804|gb|ADO77684.1| 3-keto-5-aminohexanoate cleavage enz... 199 4e-49
gi|335428956|ref|ZP_08555866.1| hypothetical protein HLPCO_08299... 198 6e-49
gi|19705173|ref|NP_602668.1| cytoplasmic protein [Fusobacterium ... 197 1e-48
gi|296328272|ref|ZP_06870801.1| protein of hypothetical function... 197 2e-48
gi|237743423|ref|ZP_04573904.1| transposase [Fusobacterium sp. 7... 197 2e-48
gi|237741287|ref|ZP_04571768.1| transposase [Fusobacterium sp. 4... 197 2e-48
gi|34763496|ref|ZP_00144438.1| Transposase [Fusobacterium nuclea... 197 2e-48
gi|336419820|ref|ZP_08600074.1| hypothetical protein HMPREF0401_... 196 2e-48
gi|221633717|ref|YP_002522943.1| hypothetical protein trd_1744 [... 196 3e-48
gi|254303336|ref|ZP_04970694.1| hypothetical protein FNP_0982 [F... 196 3e-48
gi|338812065|ref|ZP_08624264.1| hypothetical protein ALO_08223 [... 196 4e-48
gi|229496573|ref|ZP_04390287.1| 3-keto-5-aminohexanoate cleavage... 196 4e-48
gi|338811235|ref|ZP_08623464.1| hypothetical protein ALO_04121 [... 196 4e-48
gi|339889625|gb|EGQ78895.1| protein of hypothetical function DUF... 196 4e-48
gi|339441730|ref|YP_004707735.1| hypothetical protein CXIVA_0666... 196 4e-48
gi|121535646|ref|ZP_01667451.1| protein of unknown function DUF8... 194 8e-48
gi|331004171|ref|ZP_08327651.1| hypothetical protein HMPREF0491_... 194 1e-47
gi|345017211|ref|YP_004819564.1| hypothetical protein Thewi_0850... 192 4e-47
gi|254478897|ref|ZP_05092260.1| conserved hypothetical protein [... 192 4e-47
gi|317063133|ref|ZP_07927618.1| conserved hypothetical protein [... 192 5e-47
gi|326391507|ref|ZP_08213040.1| protein of unknown function DUF8... 191 9e-47
gi|158321464|ref|YP_001513971.1| hypothetical protein Clos_2443 ... 191 1e-46
>gi|15608856|ref|NP_216234.1| hypothetical protein Rv1718 [Mycobacterium tuberculosis H37Rv]
gi|148661516|ref|YP_001283039.1| hypothetical protein MRA_1728 [Mycobacterium tuberculosis H37Ra]
gi|148822924|ref|YP_001287678.1| hypothetical protein TBFG_11733 [Mycobacterium tuberculosis F11]
55 more sequence titles
Length=272
Score = 543 bits (1400), Expect = 9e-153, Method: Compositional matrix adjust.
Identities = 272/272 (100%), Positives = 272/272 (100%), Gaps = 0/272 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA
Sbjct 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 272
LVSRTIRLAEALDLPIASVEEAEAALQLPGTS
Sbjct 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 272
>gi|289554509|ref|ZP_06443719.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289745844|ref|ZP_06505222.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289757827|ref|ZP_06517205.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|308376802|ref|ZP_07440086.2| hypothetical protein TMHG_00898 [Mycobacterium tuberculosis SUMu008]
gi|289439141|gb|EFD21634.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289686372|gb|EFD53860.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289713391|gb|EFD77403.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|308349935|gb|EFP38786.1| hypothetical protein TMHG_00898 [Mycobacterium tuberculosis SUMu008]
Length=274
Score = 543 bits (1398), Expect = 1e-152, Method: Compositional matrix adjust.
Identities = 272/272 (100%), Positives = 272/272 (100%), Gaps = 0/272 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 3 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 62
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 63 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 122
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 123 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 182
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA
Sbjct 183 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 242
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 272
LVSRTIRLAEALDLPIASVEEAEAALQLPGTS
Sbjct 243 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 274
>gi|340626725|ref|YP_004745177.1| hypothetical protein MCAN_17291 [Mycobacterium canettii CIPT
140010059]
gi|340004915|emb|CCC44061.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=272
Score = 540 bits (1391), Expect = 1e-151, Method: Compositional matrix adjust.
Identities = 271/272 (99%), Positives = 271/272 (99%), Gaps = 0/272 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
DNLLTMVRRLP GAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA
Sbjct 181 DNLLTMVRRLPHGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 272
LVSRTIRLAEALDLPIASVEEAEAALQLPGTS
Sbjct 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGTS 272
>gi|240169427|ref|ZP_04748086.1| hypothetical protein MkanA1_08949 [Mycobacterium kansasii ATCC
12478]
Length=273
Score = 491 bits (1264), Expect = 4e-137, Method: Compositional matrix adjust.
Identities = 244/271 (91%), Positives = 257/271 (95%), Gaps = 0/271 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MS+VITVAPTGPIATKADNPALPT+PEEIA AVEQAYHAGAAVAHIHLRDE ERPTAD
Sbjct 1 MSVVITVAPTGPIATKADNPALPTTPEEIAAAVEQAYHAGAAVAHIHLRDEKERPTADLA 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
ARRAMDLIGERCPILIQLSTGVGL+VPFE RE+LVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 TARRAMDLIGERCPILIQLSTGVGLSVPFEDREKLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
P AVRRLAARMRELDIKPELEIYDTGHLEACL+L EDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PDAVRRLAARMRELDIKPELEIYDTGHLEACLQLREEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
DNLLTMVRRLPP A+WQVIAIG+AN+ELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA
Sbjct 181 DNLLTMVRRLPPDAMWQVIAIGRANLELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQLPGT 271
LV+RT+RL +ALDLP+ASVEEAE L+LPG
Sbjct 241 LVTRTMRLVQALDLPVASVEEAEVLLRLPGV 271
>gi|289750275|ref|ZP_06509653.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690862|gb|EFD58291.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=535
Score = 445 bits (1145), Expect = 3e-123, Method: Compositional matrix adjust.
Identities = 219/219 (100%), Positives = 219/219 (100%), Gaps = 0/219 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA
Sbjct 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
>gi|289569769|ref|ZP_06449996.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
gi|289543523|gb|EFD47171.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
Length=219
Score = 444 bits (1142), Expect = 7e-123, Method: Compositional matrix adjust.
Identities = 219/219 (100%), Positives = 219/219 (100%), Gaps = 0/219 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA
Sbjct 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
>gi|289443177|ref|ZP_06432921.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
gi|289416096|gb|EFD13336.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
Length=271
Score = 444 bits (1141), Expect = 8e-123, Method: Compositional matrix adjust.
Identities = 219/219 (100%), Positives = 219/219 (100%), Gaps = 0/219 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA
Sbjct 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNA 219
>gi|54025237|ref|YP_119479.1| hypothetical protein nfa32680 [Nocardia farcinica IFM 10152]
gi|54016745|dbj|BAD58115.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=272
Score = 415 bits (1066), Expect = 4e-114, Method: Compositional matrix adjust.
Identities = 213/270 (79%), Positives = 228/270 (85%), Gaps = 1/270 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MS VITVAPTGPIA+ DNP LPT PEEIA AV AY AGAAVAHIHLRD ++RPTADP
Sbjct 1 MSAVITVAPTGPIASTTDNPHLPTQPEEIADAVADAYEAGAAVAHIHLRDADQRPTADPA 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARR MDLI +RCPILIQLSTGVGL VPF +R LVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRTMDLIAQRCPILIQLSTGVGLQVPFAERAALVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAE-PLQFSIVLGVRGGMAAT 179
P+ VR LAARM EL +KPELEIYDTGHLEACLRL + LL + PLQFSIVLGV GGMAAT
Sbjct 121 PEQVRELAARMLELGVKPELEIYDTGHLEACLRLRDQGLLGDGPLQFSIVLGVAGGMAAT 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
ADNLLTMVRRLP G+IWQVIAIG+ N+ LTAMGLALGGNAR GLEDTL+LRKGEL+P NL
Sbjct 181 ADNLLTMVRRLPEGSIWQVIAIGRNNLPLTAMGLALGGNARAGLEDTLHLRKGELSPGNL 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQLP 269
LV R +RLAE LD IA VEEAE L LP
Sbjct 241 PLVRRAVRLAEDLDRGIAGVEEAETLLGLP 270
>gi|31792906|ref|NP_855399.1| hypothetical protein Mb1746 [Mycobacterium bovis AF2122/97]
gi|121637626|ref|YP_977849.1| hypothetical protein BCG_1757 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224990101|ref|YP_002644788.1| hypothetical protein JTY_1732 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31618497|emb|CAD94449.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121493273|emb|CAL71744.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773214|dbj|BAH26020.1| hypothetical protein JTY_1732 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341601644|emb|CCC64317.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=207
Score = 393 bits (1009), Expect = 2e-107, Method: Compositional matrix adjust.
Identities = 193/193 (100%), Positives = 193/193 (100%), Gaps = 0/193 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN
Sbjct 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP
Sbjct 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA
Sbjct 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
Query 181 DNLLTMVRRLPPG 193
DNLLTMVRRLPPG
Sbjct 181 DNLLTMVRRLPPG 193
>gi|225174973|ref|ZP_03728970.1| protein of unknown function DUF849 [Dethiobacter alkaliphilus
AHT 1]
gi|225169613|gb|EEG78410.1| protein of unknown function DUF849 [Dethiobacter alkaliphilus
AHT 1]
Length=271
Score = 227 bits (578), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/268 (44%), Positives = 168/268 (63%), Gaps = 0/268 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVAP G AT+ DNP LP +P +I AV +++ AGAA+AH+H+RD PT DP+I
Sbjct 3 KVIITVAPVGAEATRDDNPNLPLTPTQIIEAVYESWQAGAAIAHLHVRDPQGNPTQDPDI 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
R+ ++ I ++C I+IQ+STG + +QR + L+P MATL +++FG+ F NP
Sbjct 63 FRQVIEGIKQKCDIIIQVSTGGSTDMTPQQRAAPLTLKPEMATLTTGTVNFGSEIFSNPF 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ A RMRE ++ PE+EI+DTG L+ L L +++++ PL F VLGV GGM+A+A
Sbjct 123 PLITDFANRMRENNVVPEIEIFDTGMLDTALVLIKKNIISLPLHFDFVLGVPGGMSASAR 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
NL + R+P W V IG+ + L M LA+GG+ RVG ED +Y KG A SN L
Sbjct 183 NLAYLADRIPENCTWSVAGIGRHELPLGTMALAMGGHVRVGFEDNVYYEKGVPAASNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQLP 269
V+R RLA+ L A+ +A L LP
Sbjct 243 VARITRLAQELGRTPATPNQARKILGLP 270
>gi|319653634|ref|ZP_08007733.1| hypothetical protein HMPREF1013_04350 [Bacillus sp. 2_A_57_CT2]
gi|317394833|gb|EFV75572.1| hypothetical protein HMPREF1013_04350 [Bacillus sp. 2_A_57_CT2]
Length=286
Score = 221 bits (564), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 106/267 (40%), Positives = 162/267 (61%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVAPTG T+ P +P SP+EIA +V +++ GAA+AHIH+RD T + +
Sbjct 3 KLIITVAPTGAQTTREHTPYVPLSPKEIADSVYESWKEGAAIAHIHVRDHRGENTLNLDT 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ +D + ++C I++ L+T G+ E R ++ EL P MAT + +M+FG+G F N P
Sbjct 63 YKEVIDRVQDKCDIILNLTTAGGIGNGDEDRLRVCELNPEMATFDAGTMNFGSGVFHNTP 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ RLAA +E IKPE+EI+D G + LR+ + L+ +P F VLGV GGM AT
Sbjct 123 DFLERLAAVTKERQIKPEIEIFDVGMIHNTLRIAKKGLIDDPFHFQFVLGVHGGMPATPK 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
NL+ ++ +P G+ W I K + + M + LGG+ RVG+ED++Y R+GELA +N
Sbjct 183 NLMFLIDSIPEGSTWSAIGASKDQLTINTMSILLGGHVRVGMEDSVYFRRGELAETNAQF 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V+R LA+ L +A+ EA L +
Sbjct 243 VNRIANLAQTLGREVATPAEARKILGI 269
>gi|157363093|ref|YP_001469860.1| hypothetical protein Tlet_0226 [Thermotoga lettingae TMO]
gi|157313697|gb|ABV32796.1| protein of unknown function DUF849 [Thermotoga lettingae TMO]
Length=275
Score = 215 bits (547), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 108/268 (41%), Positives = 160/268 (60%), Gaps = 1/268 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVA G TK P +P +PEEI +A+ GA++ H+H+RDEN PT + I
Sbjct 3 KLIITVAVCGAEVTKQHTPYIPVTPEEIVQQSYEAFLEGASIVHLHVRDENGNPTQNAEI 62
Query 62 ARRAMDLIGERC-PILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
++ + +I E+C +++Q+STG + + E+R Q +E P MATL +++FG F N
Sbjct 63 FKKVVTMIREKCRGMIVQVSTGGAVWMTAEERLQSLESDPDMATLTTGTVNFGNDVFMNS 122
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
+ R A M++ +I PE E +D GH+ L L + L+ L F V+GV GG+AA
Sbjct 123 IPMIERFAEEMKKRNIMPEFECFDMGHITNALNLVKKGLVHGHLHFDFVMGVPGGIAANG 182
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
NL+ MV LP GA W V IG+ + AM +A+GG+ RVGLED +Y++KGELA SN
Sbjct 183 RNLIAMVDNLPAGATWSVAGIGRHEFPMAAMAIAMGGHVRVGLEDNIYVKKGELAKSNAE 242
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +++A + IAS +EA L L
Sbjct 243 LVKKVVKIAREIGRDIASCQEARQILNL 270
>gi|160902539|ref|YP_001568120.1| hypothetical protein Pmob_1078 [Petrotoga mobilis SJ95]
gi|160360183|gb|ABX31797.1| protein of unknown function DUF849 [Petrotoga mobilis SJ95]
Length=273
Score = 211 bits (537), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 103/269 (39%), Positives = 153/269 (57%), Gaps = 0/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG T+ PALP SP+EIA A Y AGA++ H+H RD+ PT +
Sbjct 3 KLIITAALTGAEVTREQQPALPMSPQEIAQAAYDCYLAGASIVHVHARDQKGNPTQSIYV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ + I RC I+ Q STG + FE+R Q +EL P MATL+ + +FG F N
Sbjct 63 YKEIKEEIESRCNIIFQPSTGGAVYHTFEERRQPLELNPEMATLSAGTTNFGKDIFLNTE 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A M++ IKPE+E+++ GH+ LR+ + L+ P+ F V+GV G + D
Sbjct 123 EYIEKFAHEMKQRKIKPEIEVFERGHINNALRIEKKGLIDRPIHFDFVMGVPGAIPGEID 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ +V +PP + W V IGK + L + +GG+ RVG ED +Y +KGELA SN L
Sbjct 183 DLIYLVSHIPPNSTWTVAGIGKYELSLAVHAILMGGHVRVGFEDNIYFKKGELAKSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQLPG 270
V R +++ L +A EEA L + G
Sbjct 243 VERIAKISIELGREVAGPEEARKILNIGG 271
>gi|158321617|ref|YP_001514124.1| hypothetical protein Clos_2597 [Alkaliphilus oremlandii OhILAs]
gi|158141816|gb|ABW20128.1| protein of unknown function DUF849 [Alkaliphilus oremlandii OhILAs]
Length=269
Score = 210 bits (534), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/267 (40%), Positives = 160/267 (60%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG T+ + P LP +P+EIA A Q Y AGA++ H+H RD PT ++
Sbjct 3 KLIITAALTGAEVTRENQPNLPLTPDEIAEAAYQCYLAGASIVHVHARDAEGNPTQSYDV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ + I +C I+ Q STG + E+R Q V+L+P MATL+ + +FG F N
Sbjct 63 YKEIKEKIEAKCNIIFQPSTGGAVWHGPEERLQPVDLKPEMATLSAGTCNFGPDVFMNTE 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A +M+E+ +KPE+E+++ G +E +L + L+ PL F VLGV G AT +
Sbjct 123 EYIEKFATKMKEMGVKPEIEVFERGMIENAKKLVKQGLVETPLHFDFVLGVPGACPATPE 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+LL MVR +P G+ W V IG+ + L MG+ LGG+ RVG ED +Y KG+LA SN L
Sbjct 183 DLLYMVRNIPEGSTWTVAGIGRHELPLATMGIILGGHVRVGFEDNVYYGKGQLAQSNAEL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V R +R+A+ L +A+ +EA L +
Sbjct 243 VERVVRIAKELGREVATPDEARRILNI 269
>gi|310777946|ref|YP_003966279.1| 3-keto-5-aminohexanoate cleavage enzyme [Ilyobacter polytropus
DSM 2926]
gi|309747269|gb|ADO81931.1| 3-keto-5-aminohexanoate cleavage enzyme [Ilyobacter polytropus
DSM 2926]
Length=274
Score = 209 bits (533), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/270 (42%), Positives = 156/270 (58%), Gaps = 2/270 (0%)
Query 1 MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPN 60
M +ITVA TG TK DNP +P +P+EIA V Y AGAAVAH+H+RDE + T D
Sbjct 1 MKTIITVATTGAWPTKKDNPNVPLTPQEIANDVYDCYKAGAAVAHLHMRDEEGKGTMDKE 60
Query 61 IARRAMDLIGERCPILIQLSTGVGLTVPFEQRE-QLVELRPRMATLNPCSMSFG-AGEFR 118
+ LI E+C I+I ++T L E R+ L ELRP MA+ + SM++G +G F
Sbjct 61 KFKETAALIKEKCDIIINMTTSGDLNATDETRQAHLKELRPDMASYDCGSMNWGHSGLFI 120
Query 119 NPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAA 178
N PQ + L M+E ++KPE+EI+D G + L + +L P+ + VLG GG A
Sbjct 121 NSPQFLEELGTTMQECNVKPEIEIFDAGMVYNSLYYLKKGILKAPIHYQFVLGAAGGSTA 180
Query 179 TADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSN 238
T +NL+ + +P G+ W + IGK ++ + LA+GG+ RVGLED + K ELA SN
Sbjct 181 TVENLVYLKSLIPEGSTWSALGIGKGHLPILLTSLAMGGHVRVGLEDNVMYSKNELAKSN 240
Query 239 LALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R R+ E +A+ +EA L L
Sbjct 241 RQLVERAARIVEEFGNKVATPDEAREILGL 270
>gi|150388158|ref|YP_001318207.1| hypothetical protein Amet_0318 [Alkaliphilus metalliredigens
QYMF]
gi|149948020|gb|ABR46548.1| protein of unknown function DUF849 [Alkaliphilus metalliredigens
QYMF]
Length=270
Score = 209 bits (532), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/267 (42%), Positives = 155/267 (59%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG T+ P LP +P+EIA A + Y AGA++ H+H RDE +PT +
Sbjct 3 KLIITAALTGAEVTREQQPNLPLTPDEIAQAAYECYEAGASIVHVHARDEEGKPTQSYEV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
I +C I+ Q STG + E+R Q VEL+P MATL+ + +FG F N
Sbjct 63 YEEIKQKIQAKCDIIFQPSTGGAVWHTPEERLQPVELKPEMATLSCGTCNFGPDVFMNSQ 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A RM EL +KPE+EI++ G +E L + L PL F VLGV G T +
Sbjct 123 EYIEKFAKRMMELGVKPEIEIFERGMIENAKGLVKKGLAKTPLHFDFVLGVPGAAPGTVE 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+LL MVR +P G+ W V IG+A + L M + +GG+ RVG ED +Y KGELA SN L
Sbjct 183 DLLYMVRCIPEGSTWTVAGIGRAELPLATMAMIMGGHVRVGFEDNVYYGKGELAESNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V+R +R+A+ L IA+ EEA L L
Sbjct 243 VARILRIAKELGREIATPEEARHILGL 269
>gi|89902644|ref|YP_525115.1| hypothetical protein Rfer_3885 [Rhodoferax ferrireducens T118]
gi|89347381|gb|ABD71584.1| 3-keto-5-aminohexanoate cleavage enzyme [Rhodoferax ferrireducens
T118]
Length=273
Score = 207 bits (526), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/267 (42%), Positives = 156/267 (59%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG T+ ALP +PEEI A E+ AGA++ H+H R+ + PT D +
Sbjct 3 KLIITAALTGAEVTREQQAALPITPEEIGRAAEECCQAGASMVHVHARNADGSPTQDKEV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
R+ M + RC +++Q+STG + + +R V L P MATL+ S++FG F N P
Sbjct 63 YRQIMAAVRARCDVIVQVSTGGAVGMTPAERLAPVTLAPEMATLSMGSVNFGGDVFMNHP 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ M+E +KPELEI+D G L R + LL PL F VLG+ GGMA + +
Sbjct 123 ADMAVFLQAMQEHGVKPELEIFDAGMLTTAHRWLKKGLLTGPLHFDFVLGIPGGMAGSPE 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
L+ + +LP GA W V IG A + L + + LGG+ RVG ED +Y RKGELA SN L
Sbjct 183 ALMYLKAQLPEGASWTVAGIGAAQLPLGTLAIVLGGHVRVGFEDNVYYRKGELASSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V+R R++ LD P+AS +EA A L L
Sbjct 243 VARIARISRELDRPVASPDEARALLGL 269
>gi|188586499|ref|YP_001918044.1| 3-keto-5-aminohexanoate cleavage enzyme [Natranaerobius thermophilus
JW/NM-WN-LF]
gi|179351186|gb|ACB85456.1| 3-keto-5-aminohexanoate cleavage enzyme [Natranaerobius thermophilus
JW/NM-WN-LF]
Length=289
Score = 205 bits (521), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 111/268 (42%), Positives = 158/268 (59%), Gaps = 1/268 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK DNP LP + EE+A +A AGA++ H+H+RDE PT D +
Sbjct 21 KLIITAAICGAEVTKEDNPNLPITAEELAEDAVKAEKAGASIIHLHVRDEEGNPTQDGEV 80
Query 62 ARRAMDLIGER-CPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
++A+D + ER +IQ STG + FE+R Q +EL+P MATL+ + +FG F N
Sbjct 81 FKKAIDAMKERGVSAIIQPSTGGAAGMSFEERAQPIELKPEMATLDCGTTNFGDAIFVND 140
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
+R M+ L+I PELE ++ GH+ L+L E+LL L F +VLGV G M A+
Sbjct 141 LPMMREFGKEMKRLNILPELECFEPGHVYNALQLDKENLLPNHLHFDMVLGVPGAMKASL 200
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
NL+ MV LP G+ W V +G+ + L + +GG+ RVG ED +Y +KG LA SN
Sbjct 201 KNLMFMVDLLPEGSTWTVAGVGRHELPLATHAILMGGHVRVGFEDNIYYKKGVLAESNAQ 260
Query 241 LVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R RLAE L +A+ +EA L++
Sbjct 261 LVERIARLAEELGREVATPDEAREILKI 288
>gi|239617355|ref|YP_002940677.1| protein of unknown function DUF849 [Kosmotoga olearia TBF 19.5.1]
gi|239506186|gb|ACR79673.1| protein of unknown function DUF849 [Kosmotoga olearia TBF 19.5.1]
Length=274
Score = 204 bits (520), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 105/265 (40%), Positives = 154/265 (59%), Gaps = 0/265 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG TK P LP +P+EIA + Y +GA++ H+H RD +PT I
Sbjct 6 KLIITAAVTGAEVTKKQQPNLPITPDEIAEEAYRCYLSGASIVHVHARDPEGKPTQSLEI 65
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
R + I +C I++Q STG + E+R Q + L P MATL+ + +FG F NP
Sbjct 66 YREIKEKIEAKCNIIVQPSTGGAVWHTVEERIQPLYLNPEMATLSTGTCNFGKDIFANPE 125
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + R A M++ IKPE+E+++ G +E LRL + +L PL F V+GV G +
Sbjct 126 EYIERFALEMKKRGIKPEIEVFERGMIENALRLVKKGILEPPLHFDFVMGVPGAIPGNIQ 185
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ +V +PPG+ W V IG+ + L +A+GG+ RVG ED +Y RKGELA SN L
Sbjct 186 DLVYLVNCIPPGSTWSVAGIGRYELPLAVHAIAMGGHVRVGFEDNIYYRKGELAKSNAQL 245
Query 242 VSRTIRLAEALDLPIASVEEAEAAL 266
V R +R+A+ L IA+ +EA L
Sbjct 246 VERIVRIAKELGREIATPDEAREIL 270
>gi|340753474|ref|ZP_08690255.1| transposase [Fusobacterium sp. 2_1_31]
gi|229423047|gb|EEO38094.1| transposase [Fusobacterium sp. 2_1_31]
Length=271
Score = 202 bits (515), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 104/269 (39%), Positives = 161/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAIPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ M+ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCMEAIREKCPDVIIQPSTGGAVGMSDLERLQPTELHPEMATLDCGTCNFGGDEVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMVDYAIRFQKQGFIQKPMHFDFVLGVQ--MTAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ MV +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMVESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 241 ELVERVVRLAKELGREIATPDEARQILSL 269
>gi|154248908|ref|YP_001409733.1| hypothetical protein Fnod_0209 [Fervidobacterium nodosum Rt17-B1]
gi|154152844|gb|ABS60076.1| protein of unknown function DUF849 [Fervidobacterium nodosum
Rt17-B1]
Length=275
Score = 202 bits (515), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/266 (42%), Positives = 153/266 (58%), Gaps = 1/266 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVA TG TK P LP +P+EIA V + + AGA++AHIH R + PT +
Sbjct 3 KLIITVAVTGAEVTKQQQPNLPITPDEIAEDVYRCWKAGASIAHIHARLPDGTPTQSKEV 62
Query 62 ARRAMDLIGER-CPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
I E+ C I+IQ STG + E+R Q ++ P MATL+ S +FG F N
Sbjct 63 YAEIKRKIREKGCDIIIQFSTGGAVWHKPEERIQCLDAEPEMATLSAGSCNFGDDVFMNS 122
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
P + LA RM+E IKPE+E+++ G +E LRL + LL PL F VLGV G M
Sbjct 123 PSFMELLAMRMKEKGIKPEIEVFEPGMIENALRLVKKGLLELPLHFDFVLGVPGAMTGNI 182
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
++L+ +V +LP G W V IG+ + L + +GG+ RVG ED +Y RKGELA SN
Sbjct 183 EDLVFLVNKLPEGCTWSVAGIGRYELPLAVHAIVMGGHVRVGFEDNIYYRKGELATSNAQ 242
Query 241 LVSRTIRLAEALDLPIASVEEAEAAL 266
LV R +R+A + IA+ +EA L
Sbjct 243 LVERIVRIAHEVGREIATPDEARKIL 268
>gi|294783512|ref|ZP_06748836.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA]
gi|294480390|gb|EFG28167.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA]
Length=271
Score = 202 bits (514), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 104/269 (39%), Positives = 161/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + EEI E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAIPYTVEEIVREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ S +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMSDLERLQPTELHPEMATLDCGSCNFGGDEVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ MAA+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMVDYAIRFQKQGFIQKPMHFDFVLGVQ--MAAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ MV +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMVESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 241 ELVERVVRLAKELGREIATPDEARQILSL 269
>gi|169633210|ref|YP_001706946.1| hypothetical protein ABSDF1527 [Acinetobacter baumannii SDF]
gi|169152002|emb|CAP00868.1| conserved hypothetical protein [Acinetobacter baumannii]
Length=274
Score = 202 bits (513), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 104/270 (39%), Positives = 158/270 (59%), Gaps = 5/270 (1%)
Query 3 IVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNIA 62
++IT A G I ++ NPA+P +PEEIA AV + ++AGA+VAHIH R+ + P+ +
Sbjct 4 LIITAAVNGGITPRSKNPAVPYTPEEIANAVYEVWNAGASVAHIHARNLDGSPSYQQEVW 63
Query 63 RRAMDLIGERCPILIQLSTGVGLTVPFE----QREQLVELRPRMATLNPCSMSFGAGEFR 118
+D + RC I++ LST GL +P + Q + RP +A+ N S++ G+ F
Sbjct 64 GEIVDKVRARCDIILNLSTS-GLNLPLDAPKDQAWNHLVYRPEIASYNCGSVNHGSKPFI 122
Query 119 NPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAA 178
NPP LA + + +KPE+EIY +G + L + L P+ F+ +G+ GG+ A
Sbjct 123 NPPALAMELADAINQYGVKPEIEIYHSGVINEAETLHLKGYLKSPMLFAFAMGIHGGVTA 182
Query 179 TADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSN 238
T NL+ ++ LP G++W + IGKA + + + LGG+ R GLED +Y + GELA SN
Sbjct 183 TCKNLIHLIDSLPAGSLWSALGIGKAQLPINVHTILLGGHVRTGLEDNIYYKAGELATSN 242
Query 239 LALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RL+ LD P+AS +EA L L
Sbjct 243 AQLVERLVRLSHELDRPVASTQEARKILGL 272
>gi|262066230|ref|ZP_06025842.1| conserved hypothetical protein [Fusobacterium periodonticum ATCC
33693]
gi|291380086|gb|EFE87604.1| conserved hypothetical protein [Fusobacterium periodonticum ATCC
33693]
Length=271
Score = 202 bits (513), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 103/269 (39%), Positives = 162/269 (61%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAIPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMSDLERLQPTELHPEMATLDCGTCNFGGDEVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRFQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ MV +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMVESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 241 ELVERVVRLAKELGREIATPDEARQILSL 269
>gi|217076540|ref|YP_002334256.1| hypothetical protein THA_422 [Thermosipho africanus TCF52B]
gi|217036393|gb|ACJ74915.1| conserved hypothetical protein [Thermosipho africanus TCF52B]
Length=272
Score = 201 bits (511), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/267 (39%), Positives = 161/267 (61%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVA TG T+ P LP +P+EIA AV + Y AGA++AH+H R ++ PT +
Sbjct 3 KLIITVAVTGAEVTREKQPNLPITPDEIADAVYECYLAGASIAHVHARLDDGTPTQSYEV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ + I ++C I+ Q STG FE+R Q + P MATL+ + +FG F NP
Sbjct 63 YKEIKEKIEKKCDIIFQPSTGGATWHTFEERMQPLLTNPEMATLSAGTCNFGNDVFLNPM 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A M++ +IKPE+E+++ G +E ++L + LL PL F V+GV G + T +
Sbjct 123 EYIEKFAIEMKKRNIKPEIEVFERGMIETAIKLVDKGLLNPPLHFDFVMGVPGAIPGTIE 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ +V ++PPG+ W V IG+ + L + +GG+ RVG ED +Y +KGELA SN L
Sbjct 183 DLVYLVSKIPPGSTWSVAGIGRYELPLAVHAILMGGHVRVGFEDNIYYKKGELAKSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V R +R+A+ L IA+ +EA L +
Sbjct 243 VERIVRIAKELGREIATPDEARKILGI 269
>gi|150020098|ref|YP_001305452.1| hypothetical protein Tmel_0190 [Thermosipho melanesiensis BI429]
gi|149792619|gb|ABR30067.1| protein of unknown function DUF849 [Thermosipho melanesiensis
BI429]
Length=272
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 105/267 (40%), Positives = 158/267 (60%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVA TG T+ P LP +P+EIA AV + Y AGA++AHIH R E+ PT I
Sbjct 3 KLIITVAVTGAEVTREKQPNLPITPDEIADAVYECYLAGASIAHIHARKEDGTPTQSYEI 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ I ++C I+ Q STG FE+R Q + P MATL+ + +FG F NP
Sbjct 63 YIEIKEKIEKKCNIIFQPSTGGATWHTFEERMQPLLTNPEMATLSAGTCNFGNDVFLNPM 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A M++ IKPE+E+++ G +E L+L + LL PL F V+GV G + T D
Sbjct 123 EYIEKFAIEMKKRKIKPEIEVFERGMIETALKLVKKGLLEAPLHFDFVMGVPGAIPGTID 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ +V ++P G+ W V IG+ + L + +GG+ R+G ED +Y +KGELA SN L
Sbjct 183 DLVYLVSKIPEGSTWSVAGIGRYELPLAVHAILMGGHVRIGFEDNIYYKKGELAKSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V R +R+A+ + IA+ +EA L +
Sbjct 243 VERIVRIAKEVGREIATPDEARKILGI 269
>gi|307298174|ref|ZP_07577978.1| protein of unknown function DUF849 [Thermotogales bacterium MesG1.Ag.4.2]
gi|306916260|gb|EFN46643.1| protein of unknown function DUF849 [Thermotogales bacterium MesG1.Ag.4.2]
Length=274
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 104/265 (40%), Positives = 150/265 (57%), Gaps = 0/265 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG K PALP SPEEIA A + +GA++ HIH RD + +PT + +
Sbjct 3 KLIITAALTGAEVMKDQQPALPISPEEIADAAYDCFLSGASIVHIHARDSSGKPTQNLEV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
R + I E+C ++ Q STG + +R Q +EL P MATL+ + +FG F N
Sbjct 63 YREIKERIAEKCDLIFQPSTGGAVWHKVRERAQPLELNPEMATLSAGTCNFGEDVFFNSQ 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ A M+ IKPE+E+++ G +E L+L + L+ P+ F VLGV G +
Sbjct 123 DTMETFAQEMKARGIKPEIEVFERGMIENALKLLKKGLIDSPIHFDFVLGVPGACPGNIE 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ MVR +P G+ W V IG+ + L + LGG+ RVG ED +Y +KGELA SN L
Sbjct 183 DLIHMVRAIPQGSTWTVAGIGRNELVLATAAILLGGHVRVGFEDNIYYKKGELAISNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAAL 266
V R +R++ L +AS EEA L
Sbjct 243 VERVVRISNELGRDVASPEEAREIL 267
>gi|218778117|ref|YP_002429435.1| hypothetical protein Dalk_0258 [Desulfatibacillum alkenivorans
AK-01]
gi|218759501|gb|ACL01967.1| protein of unknown function DUF849 [Desulfatibacillum alkenivorans
AK-01]
Length=284
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/279 (41%), Positives = 153/279 (55%), Gaps = 13/279 (4%)
Query 3 IVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNIA 62
++I+VA TG + K+ NPALP P+EIA + Y+AGA+V HIH+RD+ R TAD N+
Sbjct 4 LIISVAQTGGLHGKSSNPALPEQPDEIAQSAYDCYNAGASVCHIHVRDKQGRTTADLNVY 63
Query 63 RRAMDLIGERCPILIQLSTGVGLTV---------PFEQREQLVEL--RPRMATLNPCSMS 111
+ I +CPI+ Q+ G+G V E++ L L +P M T+N +
Sbjct 64 SDVLTKIQSKCPIITQVGGGIGTIVEPDGRSRGATLEEKMALTALAPKPDMLTINAGTFD 123
Query 112 FG--AGEFRNPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIV 169
FG A F NP A R + I E E YD H+E L L P+ FS+V
Sbjct 124 FGWIAEPFINPMDWNEDFARRCNQRKIAVECECYDISHIENVKELIRRGALNSPVHFSLV 183
Query 170 LGVRGGMAATADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYL 229
LGV+GG+ ++ + MV +P G+ WQVI IGK + T M + G N R GLED +Y
Sbjct 184 LGVKGGIPSSPKMISAMVDMIPEGSTWQVITIGKHQLTSTVMAMCQGANIRTGLEDNVYY 243
Query 230 RKGELAPSNLALVSRTIRLAEALDLPIASVEEAEAALQL 268
+GELA SN LV R +R+A L IA+VEEA AL +
Sbjct 244 SRGELAKSNAQLVERMVRIARELGRNIATVEEAVGALGI 282
>gi|309389804|gb|ADO77684.1| 3-keto-5-aminohexanoate cleavage enzyme [Halanaerobium praevalens
DSM 2228]
Length=270
Score = 199 bits (506), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 106/267 (40%), Positives = 151/267 (57%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A TG T+ P LP + EEIA E+AY AGAA+ H+H R+E+ PT
Sbjct 3 KLIITAALTGAEVTQDIQPNLPITAEEIAIEAEKAYEAGAAIVHVHAREEDGSPTQAKEA 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ + I RCP++ Q STG E+R Q VEL P MATL+ + +FG F N
Sbjct 63 YQEIKEKIEARCPVIFQPSTGGATWHTAEERLQPVELSPEMATLSTGTCNFGEDVFMNTQ 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ + + A +M+E +KPE+E+++ G + L + L+ PL F VLGV G M A+A
Sbjct 123 EYMIKFAKKMKEKGVKPEIEVFEAGMIANAQYLVKKGLIDTPLHFDFVLGVPGAMPASAR 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
NL+ M +P G+ W V IG+ L M +A+GG+ RVG ED +Y +KGELA SN L
Sbjct 183 NLVYMAETIPAGSTWTVAGIGRHETPLAMMAIAMGGHVRVGFEDNIYYKKGELAKSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V R R+A +A+ +EA L +
Sbjct 243 VERIARMAAEAGREVATPDEARKILSI 269
>gi|335428956|ref|ZP_08555866.1| hypothetical protein HLPCO_08299 [Haloplasma contractile SSD-17B]
gi|335430542|ref|ZP_08557432.1| hypothetical protein HLPCO_16211 [Haloplasma contractile SSD-17B]
gi|334887945|gb|EGM26260.1| hypothetical protein HLPCO_16211 [Haloplasma contractile SSD-17B]
gi|334891897|gb|EGM30143.1| hypothetical protein HLPCO_08299 [Haloplasma contractile SSD-17B]
Length=279
Score = 198 bits (504), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 102/269 (38%), Positives = 158/269 (59%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK P +P + +EI E AY+AGA++ H+H+R+++ PT + +
Sbjct 3 KLIITAAICGAEVTKEHTPYIPYTIDEIVREAELAYNAGASIIHLHVREDDGTPTQNKDR 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
+ A++ I ERC ++IQ STG + + ++R Q +EL P MATL+ +++FG E F N
Sbjct 63 FKEAINRIKERCKDVIIQPSTGGAVGMTTDERLQPIELNPEMATLDCGTLNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ A R++E IKPELE +D GH++ +RL+ + + PL FS VLGV GGM+
Sbjct 123 TENDIKEFAKRIQERHIKPELECFDKGHIDLVIRLYKKGFIKGPLHFSFVLGVNGGMSGD 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+ + M LP + + V IG+ L + GG+ RVG ED +Y+ KG+LA SN
Sbjct 183 LRDFVYMNESLPCNSTFSVAGIGRYEFPLAVASIVSGGHVRVGFEDNIYIEKGQLAKSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +RLA L IA+ +EA L +
Sbjct 243 ELVEKVVRLANELGRDIATPDEARKILGI 271
>gi|19705173|ref|NP_602668.1| cytoplasmic protein [Fusobacterium nucleatum subsp. nucleatum
ATCC 25586]
gi|19713114|gb|AAL93967.1| Hypothetical cytosolic protein [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
Length=272
Score = 197 bits (501), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 102/269 (38%), Positives = 160/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 4 KLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 63
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 64 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVN 123
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 124 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQ--MSAS 181
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ M +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 182 ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGILAKSNG 241
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 242 ELVERVVRLAKELGREIATPDEARQILSL 270
>gi|296328272|ref|ZP_06870801.1| protein of hypothetical function DUF849 [Fusobacterium nucleatum
subsp. nucleatum ATCC 23726]
gi|296154576|gb|EFG95364.1| protein of hypothetical function DUF849 [Fusobacterium nucleatum
subsp. nucleatum ATCC 23726]
Length=271
Score = 197 bits (501), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 102/269 (38%), Positives = 160/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ M +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGILAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 241 ELVERVVRLAKELGREIATPDEARQILSL 269
>gi|237743423|ref|ZP_04573904.1| transposase [Fusobacterium sp. 7_1]
gi|260494969|ref|ZP_05815098.1| transposase [Fusobacterium sp. 3_1_33]
gi|289764964|ref|ZP_06524342.1| transposase [Fusobacterium sp. D11]
gi|336401613|ref|ZP_08582375.1| hypothetical protein HMPREF0404_01666 [Fusobacterium sp. 21_1A]
gi|229433202|gb|EEO43414.1| transposase [Fusobacterium sp. 7_1]
gi|260197412|gb|EEW94930.1| transposase [Fusobacterium sp. 3_1_33]
gi|289716519|gb|EFD80531.1| transposase [Fusobacterium sp. D11]
gi|336160714|gb|EGN63746.1| hypothetical protein HMPREF0404_01666 [Fusobacterium sp. 21_1A]
Length=271
Score = 197 bits (500), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/269 (38%), Positives = 161/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILMERGVKPEIEVFDKGMVDYAIRFQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ + +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFISESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +R+A+ L IA+ +EA L L
Sbjct 241 ELVERVVRMAKELGREIATPDEARQILSL 269
>gi|237741287|ref|ZP_04571768.1| transposase [Fusobacterium sp. 4_1_13]
gi|229430819|gb|EEO41031.1| transposase [Fusobacterium sp. 4_1_13]
Length=271
Score = 197 bits (500), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 101/269 (38%), Positives = 160/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ M +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +R+A+ L IA+ +EA L L
Sbjct 241 ELVERVVRMAKELGREIATPDEARQILSL 269
>gi|34763496|ref|ZP_00144438.1| Transposase [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
gi|256846430|ref|ZP_05551887.1| transposase [Fusobacterium sp. 3_1_36A2]
gi|294784505|ref|ZP_06749794.1| conserved hypothetical protein [Fusobacterium sp. 3_1_27]
gi|27886825|gb|EAA23956.1| Transposase [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
gi|256718199|gb|EEU31755.1| transposase [Fusobacterium sp. 3_1_36A2]
gi|294487721|gb|EFG35080.1| conserved hypothetical protein [Fusobacterium sp. 3_1_27]
Length=271
Score = 197 bits (500), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 101/269 (38%), Positives = 160/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ M +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +R+A+ L IA+ +EA L L
Sbjct 241 ELVERVVRMAKELGREIATPDEARQILSL 269
>gi|336419820|ref|ZP_08600074.1| hypothetical protein HMPREF0401_02094 [Fusobacterium sp. 11_3_2]
gi|336162834|gb|EGN65780.1| hypothetical protein HMPREF0401_02094 [Fusobacterium sp. 11_3_2]
Length=271
Score = 196 bits (499), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/269 (38%), Positives = 161/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFIN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILMERGVKPEIEVFDKGMVDYAIRFQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ + +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFISESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +R+A+ L IA+ +EA L L
Sbjct 241 ELVERVVRMAKELGREIATPDEARQILSL 269
>gi|221633717|ref|YP_002522943.1| hypothetical protein trd_1744 [Thermomicrobium roseum DSM 5159]
gi|221156629|gb|ACM05756.1| Prokaryotic protein of unknown function (DUF849) [Thermomicrobium
roseum DSM 5159]
Length=276
Score = 196 bits (499), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 109/270 (41%), Positives = 153/270 (57%), Gaps = 2/270 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++++VA TG T+ P +P + EEIA + + GAA+ HIH+RDE R T+DP
Sbjct 3 KVIVSVATTGSWTTREQTPYVPITEEEIAAEAIRCWREGAAIVHIHVRDEQGRVTSDPAR 62
Query 62 ARRAMDLI-GERCPILIQLSTGVGL-TVPFEQREQLVELRPRMATLNPCSMSFGAGEFRN 119
R DLI + C I++ STG G VP E+R V LRP +A+ + S++FG F N
Sbjct 63 YARVRDLIRSQGCDIILNFSTGGGAGIVPDEERIAPVRLRPEIASFDAGSLNFGDRVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
P + LA M+ +KPE+E +++G +E R L+ P F +VLGVRGG AT
Sbjct 123 SPAFLEALAHEMQAHGVKPEIECFESGFIETAKRFIERGLIQPPYWFQMVLGVRGGAPAT 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
D L+ MVR+LP G++W V AIG+ + + L +GG+ R GLED +Y LA N
Sbjct 183 VDQLVHMVRQLPAGSLWSVCAIGRHQLPMNVAALVMGGHVRTGLEDNIYYSYRVLAEGNA 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQLP 269
LV+R +R+A L AS EA L LP
Sbjct 243 PLVARIVRIARELGREPASPSEARTLLGLP 272
>gi|254303336|ref|ZP_04970694.1| hypothetical protein FNP_0982 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
gi|148323528|gb|EDK88778.1| hypothetical protein FNP_0982 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length=271
Score = 196 bits (499), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 101/269 (38%), Positives = 160/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEVFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ M +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ +G LA SN
Sbjct 181 ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDRGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +RLA+ L IA+ +EA L L
Sbjct 241 ELVERVVRLAKELGREIATPDEARQILSL 269
>gi|338812065|ref|ZP_08624264.1| hypothetical protein ALO_08223 [Acetonema longum DSM 6540]
gi|337276034|gb|EGO64472.1| hypothetical protein ALO_08223 [Acetonema longum DSM 6540]
Length=271
Score = 196 bits (498), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 114/272 (42%), Positives = 161/272 (60%), Gaps = 6/272 (2%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT+APTG + TKA P +P + EIA + + AGAAVAHIH RD PTA
Sbjct 3 KLIITIAPTGNVPTKAMTPHVPVTAAEIAADIVTCHQAGAAVAHIHARDHAGLPTAGLEC 62
Query 62 AR---RAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFR 118
R +A+D G CP++ Q+STG E R + + L P A+L S +F
Sbjct 63 FREIWQALDQTG--CPVIRQISTGARAGNSAEARAEALSLDPESASLTTGSTNFPNKANL 120
Query 119 NPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAA 178
N P + LA M E +IKPE+EI+D + + L + LLA PLQF++V+GV+G + A
Sbjct 121 NDPDLIHFLAQTMHERNIKPEIEIFDLAMINNAVELQKKGLLASPLQFNLVMGVKGAIPA 180
Query 179 TADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSN 238
TA NL +V LPPG++W + AIG ++ L+ + +ALGG+ RVG+ED +Y KG LA +N
Sbjct 181 TAKNLFFLVDSLPPGSVWTLSAIGPQHLPLSMIAMALGGHIRVGVEDNIYYSKGVLA-TN 239
Query 239 LALVSRTIRLAEALDLPIASVEEAEAALQLPG 270
+ LV R + LA+A+ +AS EA L L G
Sbjct 240 IMLVERIVALAKAMGRELASPAEARRILGLAG 271
>gi|229496573|ref|ZP_04390287.1| 3-keto-5-aminohexanoate cleavage enzyme [Porphyromonas endodontalis
ATCC 35406]
gi|229316470|gb|EEN82389.1| 3-keto-5-aminohexanoate cleavage enzyme [Porphyromonas endodontalis
ATCC 35406]
Length=275
Score = 196 bits (498), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 112/269 (42%), Positives = 158/269 (59%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NPA+P + EEI + AY AGAAV HIH+R+++ PT +
Sbjct 3 KLIITAAICGAEVTKEQNPAVPYTVEEIVREAKSAYDAGAAVVHIHVREDDGTPTQSRDR 62
Query 62 ARRAMDLIGERCPILIQL-STGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
+ MD + E CP +I + STG + + E+R Q EL P MATL+ + +FG F N
Sbjct 63 FKVCMDAVREACPDVILIPSTGGAVGMTAEERLQPTELFPEMATLDCGTCNFGDEVFENT 122
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAE-DLLAEPLQFSIVLGVRGGMAAT 179
+R RM E +IKPE E ++ GHL+ LR+ A+ ++ +P+QF+ VLGV G AT
Sbjct 123 MPMMRTFGKRMLENNIKPEYECFEMGHLDTILRMAAKGEVPGDPMQFNFVLGVPGCTPAT 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+NL+ +V R+P G+ W IG++ L A + +GGN RVG ED L + +G LA SN
Sbjct 183 VENLVWLVNRIPAGSTWTATGIGRSAFTLAAPTIVMGGNVRVGFEDNLNISRGVLARSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +RL+ L IAS EA A L L
Sbjct 243 ELVEKVVRLSRELGREIASPAEARAILSL 271
>gi|338811235|ref|ZP_08623464.1| hypothetical protein ALO_04121 [Acetonema longum DSM 6540]
gi|337276788|gb|EGO65196.1| hypothetical protein ALO_04121 [Acetonema longum DSM 6540]
Length=271
Score = 196 bits (498), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/267 (40%), Positives = 154/267 (58%), Gaps = 0/267 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++ITVAP G AT+ DNP LP +P EIA A + GA++ H+H+RD + T +
Sbjct 3 KLIITVAPVGAEATRQDNPNLPLTPVEIAAAALRCVEKGASIIHLHVRDAEGQATQSKEV 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNPP 121
+ M LI ++ ++IQ STG + +R Q +EL P MATL +++FG F NP
Sbjct 63 FQETMALIRKQSNVIIQTSTGGAAWMTAAERMQPLELNPEMATLTTGTVNFGDDIFSNPM 122
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
V A M + +KPE+E+++ G ++ L L + +L PL F V+GV GG+A
Sbjct 123 PMVTEFAKEMVKRSVKPEIEVFEAGMIQTALNLVKQGILRLPLHFDFVMGVPGGIAGEPR 182
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
+L+ +V LP G W V IG++ + L + +A+GGN RVG ED +Y +G LA SN L
Sbjct 183 HLVHLVDSLPAGCTWTVAGIGRSELPLATVAIAMGGNVRVGFEDNVYYSRGVLADSNAQL 242
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V R R+A L P+A+ +EA A L L
Sbjct 243 VERIARIAGELGRPVATPDEARAILGL 269
>gi|339889625|gb|EGQ78895.1| protein of hypothetical function DUF849 [Fusobacterium nucleatum
subsp. animalis ATCC 51191]
Length=271
Score = 196 bits (497), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 99/269 (37%), Positives = 161/269 (60%), Gaps = 4/269 (1%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK +NPA+P + +EIA E AY AGA++ H+H+R+++ PT D
Sbjct 3 KLIITAAICGAEVTKENNPAVPYTVDEIAREAESAYKAGASIIHLHVREDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R+ ++ I E+CP ++IQ STG + + +R Q EL P MATL+ + +FG E F N
Sbjct 63 FRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
++ + E +KPE+E++D G ++ +R + + +P+ F VLGV+ M+A+
Sbjct 123 TENTIKNFGKILMERGVKPEIEVFDKGMVDYAIRFQKQGFIQKPMHFDFVLGVQ--MSAS 180
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L+ + +P G+ W V +G+ ++ A+ + +GG+ RVG ED +Y+ KG LA SN
Sbjct 181 ARDLVFISESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGVLAKSNG 240
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R +R+A+ L IA+ +EA L L
Sbjct 241 ELVERVVRMAKELGREIATPDEARQILSL 269
>gi|339441730|ref|YP_004707735.1| hypothetical protein CXIVA_06660 [Clostridium sp. SY8519]
gi|338901131|dbj|BAK46633.1| hypothetical protein CXIVA_06660 [Clostridium sp. SY8519]
Length=274
Score = 196 bits (497), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/269 (40%), Positives = 154/269 (58%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT G TK NPA+P + EEIA + AY AGA++ H+H+R+++ PT
Sbjct 3 KLIITACICGAEVTKEHNPAVPYTVEEIAREAKSAYDAGASIIHLHVREDDGTPTQSRER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
M+ I CP ++IQ STG + + E+R LRP MATL+ + +FG E F N
Sbjct 63 FAECMEAIRGLCPDVIIQPSTGGAVGMSNEERLAPTALRPEMATLDCGTCNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
+R A M+E IKPELE++D G ++ +RL + + P+ F V+GV GG++
Sbjct 123 TENMIRAFAENMKEYGIKPELEVFDKGMVDMAIRLHRKGFIQAPMHFDFVMGVNGGISGE 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+LL M +P G+ W V +GKA + MG+ +GG+ RVG ED +YL KG LA SN
Sbjct 183 PRDLLFMAESIPAGSTWTVSGVGKAEYPMITMGILMGGHVRVGFEDNVYLEKGVLAESNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
A+V + +R+A+ L +AS EA L L
Sbjct 243 AMVEKVVRIAKELGRAVASPAEAREILGL 271
>gi|121535646|ref|ZP_01667451.1| protein of unknown function DUF849 [Thermosinus carboxydivorans
Nor1]
gi|121305750|gb|EAX46687.1| protein of unknown function DUF849 [Thermosinus carboxydivorans
Nor1]
Length=270
Score = 194 bits (494), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 107/263 (41%), Positives = 157/263 (60%), Gaps = 2/263 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
+++TVAPTG + TKA P +P +PEEIA + Y GAAVAHIH R+E RPT +
Sbjct 3 KLIVTVAPTGNVPTKAMTPFVPVTPEEIAEDIAACYEKGAAVAHIHARNEEGRPTHEIKF 62
Query 62 ARRAMDLIGER-CPILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGEFRNP 120
+ + E+ CPI+ Q+STG + R + + L P A+L S +F N
Sbjct 63 FAEILRRLDEKGCPIIRQISTGARAGKTAQDRAEALALNPASASLATGSSNFPTSANVND 122
Query 121 PQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATA 180
P + LA M E +IKPELEI+DT + ++L LL EPL F++VLGV+G + AT
Sbjct 123 PALIEYLAKIMLERNIKPELEIFDTAMINNAVQLHKAGLLKEPLLFNLVLGVKGSLPATP 182
Query 181 DNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLA 240
NL +V LPP ++W V IG ++ L+ + +ALGG+ RVG+ED +Y KG LA +N+
Sbjct 183 KNLFFLVESLPPNSVWSVSVIGPQHVPLSMIAMALGGHVRVGVEDNIYYSKGVLA-TNVT 241
Query 241 LVSRTIRLAEALDLPIASVEEAE 263
LV R + +A+A+ IA+ ++ +
Sbjct 242 LVERIVNIAKAMGREIATPDDVK 264
>gi|331004171|ref|ZP_08327651.1| hypothetical protein HMPREF0491_02513 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330411581|gb|EGG90991.1| hypothetical protein HMPREF0491_02513 [Lachnospiraceae oral taxon
107 str. F0167]
Length=272
Score = 194 bits (494), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/269 (38%), Positives = 155/269 (58%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT G TK NP +P + EEI + AY AGAA+ H+H+R+++ PT
Sbjct 3 KLIITACICGAEVTKEQNPNIPYTVEEIVREAKSAYDAGAAIIHLHVREDDGTPTQSEKR 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
+ +D I + P ++IQ STG + + E+R L+P MATL+ + +FG + F N
Sbjct 63 FKECIDAIKKEIPDVIIQPSTGGAVGMSNEERLAPTVLKPEMATLDCGTCNFGGDDIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
+ A RM EL IKPE+E++D G ++ +RL + ++ P+ F V+GV GG++ T
Sbjct 123 TENTIIEFANRMNELGIKPEVEVFDKGMIDMAIRLNKKGIIKSPMHFDFVMGVNGGISGT 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
A +L MV +P G+ W +G+A + M + +GG+ARVG ED +YL KG +A SN
Sbjct 183 ARDLNFMVESIPAGSTWTASGVGRAEFPMVTMAILMGGHARVGFEDNIYLSKGVMAKSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +RLA+ L IAS +EA L L
Sbjct 243 ELVEKVVRLAKELGREIASPDEAREILGL 271
>gi|345017211|ref|YP_004819564.1| hypothetical protein Thewi_0850 [Thermoanaerobacter wiegelii
Rt8.B1]
gi|344032554|gb|AEM78280.1| protein of unknown function DUF849 [Thermoanaerobacter wiegelii
Rt8.B1]
Length=278
Score = 192 bits (489), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/269 (38%), Positives = 155/269 (58%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NP +P + EEIA E AY+AGA++ H+H+R ++ PT D
Sbjct 3 KLIITAAICGAEVTKKHNPNVPYTVEEIAREAESAYNAGASIIHLHVRYDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R ++ I RCP ++IQ STG + + E+R Q + L+P MA+L+ +++FG E F N
Sbjct 63 FRECIEAIKARCPDVIIQPSTGGAVGMTSEERLQPIYLQPEMASLDCGTLNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
+ A +M EL IKPELE++D G ++ +RL + + P+ F+ V+GV GG++
Sbjct 123 TENMIIEFALKMNELSIKPELEVFDKGMIDTAIRLHKKGYIKAPMHFNFVMGVNGGISGE 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+ L + +P G+ + IG+ + M + GG+ RVG ED +Y+ KG LA SN
Sbjct 183 MRDFLFLKESIPEGSTFTATGIGRYEFPVATMAILTGGHVRVGFEDNVYISKGVLAKSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +R+A L IA+ +EA L L
Sbjct 243 ELVEKVVRIARELGKEIATPDEARKILGL 271
>gi|254478897|ref|ZP_05092260.1| conserved hypothetical protein [Carboxydibrachium pacificum DSM
12653]
gi|214035163|gb|EEB75874.1| conserved hypothetical protein [Carboxydibrachium pacificum DSM
12653]
Length=275
Score = 192 bits (488), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/269 (38%), Positives = 157/269 (59%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NP +P + EE+ AY+AGA++ H+H+R ++ PT D
Sbjct 3 KLIITAAICGAEVTKKHNPNVPYTVEEMVREALSAYNAGASIIHLHVRYDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R ++ I +CP ++IQ STG + + E+R Q V L+P MA+L+ +M+FG E F N
Sbjct 63 FREVIEAIKAKCPDVIIQPSTGGAVGMTPEERLQPVYLKPEMASLDCGTMNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
+ A +M EL +KPELE++D G ++A +RL + + P+ F+ V+GV GG++A
Sbjct 123 TENMIIEFATKMNELGVKPELEVFDKGMIDAAIRLHKKGYIKAPMHFNFVMGVNGGISAE 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+ + ++ +PPG+ + IG+ + M + GG+ RVG ED +YL KG LA SN
Sbjct 183 MRDFVFLMESIPPGSTFTATGIGRYEFPVATMAILAGGHVRVGFEDNVYLEKGVLAKSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +R+A L IA+ +EA L L
Sbjct 243 ELVEKVVRIARELGREIATPDEARKILGL 271
>gi|317063133|ref|ZP_07927618.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185]
gi|313688809|gb|EFS25644.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185]
Length=276
Score = 192 bits (487), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 102/269 (38%), Positives = 155/269 (58%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
I+ITVAPTG +K DNP +P +PEEIA V + Y AGA++AH+H+RD+ + T D
Sbjct 3 KIIITVAPTGAWPSKKDNPNIPLTPEEIANDVYECYKAGASIAHLHMRDDMGKGTMDTKK 62
Query 62 ARRAMDLIGERCPILIQLSTGVGLTVPFEQRE-QLVELRPRMATLNPCSMSFGAGE-FRN 119
+ LI E+C I+I L+T L E R+ L ++P +A+ + SM++ F N
Sbjct 63 FEETVKLIKEKCDIVINLTTSGDLNATDETRQAHLKSIKPDLASYDCGSMNWMHNSLFIN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
P+ + L M+E ++KPE+EI+D G + L + +L EP+ + VLG GG AAT
Sbjct 123 HPKFLEELGYTMQENNVKPEIEIFDAGMIYNSLYYIKKGVLKEPVHYQFVLGAAGGTAAT 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+NL+ + +P G+ W + IG+ ++ + +A+GG+ RVG+ED +Y ELA SN
Sbjct 183 VENLVYLKSLIPEGSTWSALGIGRGHIPILMTAIAMGGHVRVGMEDNVYYGPAELAVSNA 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV R RL + +A+ EA L L
Sbjct 243 QLVERAARLIKNSMNEVATPAEAREILGL 271
>gi|326391507|ref|ZP_08213040.1| protein of unknown function DUF849 [Thermoanaerobacter ethanolicus
JW 200]
gi|325992436|gb|EGD50895.1| protein of unknown function DUF849 [Thermoanaerobacter ethanolicus
JW 200]
Length=278
Score = 191 bits (485), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 99/269 (37%), Positives = 154/269 (58%), Gaps = 2/269 (0%)
Query 2 SIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNI 61
++IT A G TK NP +P + EEIA E AY+AGA++ H+H+R ++ PT D
Sbjct 3 KLIITAAICGAEVTKKHNPNVPYTVEEIAREAESAYNAGASIIHLHVRYDDGTPTQDKER 62
Query 62 ARRAMDLIGERCP-ILIQLSTGVGLTVPFEQREQLVELRPRMATLNPCSMSFGAGE-FRN 119
R ++ I RCP ++IQ STG + + E+R Q + L+P MA+L+ +++FG E F N
Sbjct 63 FRECIEAIKARCPDVIIQPSTGGAVGMTSEERLQPIYLQPEMASLDCGTLNFGGDEIFVN 122
Query 120 PPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAAT 179
+ +M EL IKPELE++D G ++ +RL + + P+ F+ V+GV GG++
Sbjct 123 TENMIIEFTLKMNELSIKPELEVFDKGMIDTAIRLHKKGYIKAPMHFNFVMGVNGGISGE 182
Query 180 ADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNL 239
+ L + +P G+ + IG+ + M + GG+ RVG ED +Y+ KG LA SN
Sbjct 183 MRDFLFLKESIPEGSTFTATGIGRYEFPVATMAILTGGHVRVGFEDNVYISKGVLAKSNG 242
Query 240 ALVSRTIRLAEALDLPIASVEEAEAALQL 268
LV + +R+A L IA+ +EA L L
Sbjct 243 ELVEKVVRIARELGKEIATPDEARKILGL 271
>gi|158321464|ref|YP_001513971.1| hypothetical protein Clos_2443 [Alkaliphilus oremlandii OhILAs]
gi|158141663|gb|ABW19975.1| protein of unknown function DUF849 [Alkaliphilus oremlandii OhILAs]
Length=272
Score = 191 bits (485), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/267 (39%), Positives = 152/267 (57%), Gaps = 2/267 (0%)
Query 4 VITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVAHIHLRDENERPTADPNIAR 63
+ITVA TG TK DNP +P +PEEIA V Q Y AGAA+AH+H+RD+ + T D
Sbjct 5 IITVATTGAWPTKKDNPNIPLTPEEIAEDVYQCYKAGAAIAHLHMRDDEGQGTMDKERFE 64
Query 64 RAMDLIGERCPILIQLSTGVGLTVPFEQRE-QLVELRPRMATLNPCSMSF-GAGEFRNPP 121
+ + LI E+C I++ L+T L E R+ L ++P +A+ + SM++ F N P
Sbjct 65 KTVQLIREKCDIVLNLTTSGDLNATDETRQAHLKSIKPELASYDCGSMNWMHQTVFLNTP 124
Query 122 QAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLLAEPLQFSIVLGVRGGMAATAD 181
+ L M+ D+KPE+EI+D G + L + +L PL + VLG GGMAAT +
Sbjct 125 SFLEELGHTMQAYDVKPEIEIFDGGMVYNSLYYLKKGVLKGPLHYQFVLGAAGGMAATIE 184
Query 182 NLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGNARVGLEDTLYLRKGELAPSNLAL 241
NL+ + +P G+ W + IGK ++ + +A+GG+ RVG+ED + KGELA SN
Sbjct 185 NLVFLKSLIPEGSTWSALGIGKGHVPIMLAAIAMGGHIRVGMEDNVMFNKGELAESNAQF 244
Query 242 VSRTIRLAEALDLPIASVEEAEAALQL 268
V+R + +A+ +EA L L
Sbjct 245 VTRAANIIRESGNEVATPQEAREILGL 271
Lambda K H
0.319 0.134 0.386
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 423010730970
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40