BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2819c
Length=375
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609956|ref|NP_217335.1| hypothetical protein Rv2819c [Mycob... 778 0.0
gi|298526288|ref|ZP_07013697.1| conserved hypothetical protein [... 776 0.0
gi|254232914|ref|ZP_04926241.1| hypothetical protein TBCG_02755 ... 775 0.0
gi|289762994|ref|ZP_06522372.1| hypothetical protein TBIG_02177 ... 775 0.0
gi|340627815|ref|YP_004746267.1| hypothetical protein MCAN_28431... 582 2e-164
gi|224543482|ref|ZP_03684021.1| hypothetical protein CATMIT_0269... 154 2e-35
gi|315925057|ref|ZP_07921274.1| conserved hypothetical protein [... 153 4e-35
gi|114567267|ref|YP_754421.1| hypothetical protein Swol_1752 [Sy... 142 6e-32
gi|345284418|gb|AEN78271.1| CRISPR-associated Csm5 family protei... 139 7e-31
gi|116627770|ref|YP_820389.1| hypothetical protein STER_0977 [St... 137 3e-30
gi|327469968|gb|EGF15432.1| hypothetical protein HMPREF9386_0579... 136 4e-30
gi|312278325|gb|ADQ62982.1| CRISPR-associated protein, Csm5 fami... 136 7e-30
gi|339278117|emb|CCC19865.1| hypothetical protein STH8232_1166 [... 135 8e-30
gi|325687526|gb|EGD29547.1| hypothetical protein HMPREF9381_1060... 135 1e-29
gi|55822918|ref|YP_141359.1| hypothetical protein str0964 [Strep... 135 1e-29
gi|55820999|ref|YP_139441.1| hypothetical protein stu0964 [Strep... 135 1e-29
gi|240143670|ref|ZP_04742271.1| CRISPR-associated RAMP protein, ... 134 3e-29
gi|125718067|ref|YP_001035200.1| hypothetical protein SSA_1247 [... 132 8e-29
gi|331004039|ref|ZP_08327521.1| csm5 family CRISPR-associated ra... 127 4e-27
gi|322387547|ref|ZP_08061156.1| hypothetical protein HMPREF9423_... 126 6e-27
gi|229826462|ref|ZP_04452531.1| hypothetical protein GCWU000182_... 124 2e-26
gi|270292490|ref|ZP_06198701.1| conserved hypothetical protein [... 122 1e-25
gi|322375482|ref|ZP_08049995.1| CRISPR-associated RAMP protein [... 119 7e-25
gi|225018979|ref|ZP_03708171.1| hypothetical protein CLOSTMETH_0... 117 3e-24
gi|253578036|ref|ZP_04855308.1| CRISPR-associated protein [Rumin... 115 1e-23
gi|334126727|ref|ZP_08500675.1| csm5 family CRISPR-associated ra... 111 2e-22
gi|295105101|emb|CBL02645.1| CRISPR-associated RAMP protein, Csm... 103 7e-20
gi|323141259|ref|ZP_08076155.1| CRISPR-associated RAMP protein, ... 102 7e-20
gi|121533436|ref|ZP_01665264.1| CRISPR-associated RAMP protein, ... 97.4 4e-18
gi|227890795|ref|ZP_04008600.1| conserved hypothetical protein [... 92.8 8e-17
gi|334308473|gb|EGL99459.1| CRISPR-associated protein, Csm5 fami... 89.4 8e-16
gi|291460037|ref|ZP_06599427.1| CRISPR-associated RAMP protein, ... 87.0 5e-15
gi|313894850|ref|ZP_07828410.1| CRISPR-associated RAMP protein, ... 82.4 1e-13
gi|312899098|ref|ZP_07758476.1| CRISPR-associated RAMP protein, ... 79.3 9e-13
gi|341822665|emb|CCC73589.1| CRISPR-associated RAMP protein [Meg... 79.0 1e-12
gi|296133517|ref|YP_003640764.1| CRISPR-associated RAMP protein,... 78.6 2e-12
gi|303231949|ref|ZP_07318657.1| CRISPR-associated RAMP protein, ... 74.3 3e-11
gi|339893267|emb|CCB52454.1| CRISPR associated RAMP family prote... 74.3 3e-11
gi|333976281|gb|EGL77150.1| CRISPR-associated RAMP protein, Csm5... 74.3 3e-11
gi|341656686|gb|EGS80395.1| CRISPR-associated RAMP protein, Csm5... 74.3 3e-11
gi|269798857|ref|YP_003312757.1| CRISPR-associated RAMP protein,... 73.6 5e-11
gi|342213932|ref|ZP_08706645.1| CRISPR type III-A/MTUBE-associat... 73.2 7e-11
gi|57865880|ref|YP_190000.1| CRISPR-associated Csm5 family prote... 73.2 7e-11
gi|289549403|ref|YP_003470307.1| CRISPR-associated protein, Csm5... 72.4 1e-10
gi|301299525|ref|ZP_07205794.1| conserved domain protein [Lactob... 72.4 1e-10
gi|292669138|ref|ZP_06602564.1| Csm5 family CRISPR-associated RA... 72.0 2e-10
gi|258645683|ref|ZP_05733152.1| CRISPR-associated RAMP protein, ... 70.1 6e-10
gi|238018270|ref|ZP_04598696.1| hypothetical protein VEIDISOL_00... 69.7 8e-10
gi|315641549|ref|ZP_07896618.1| csm5 family CRISPR-associated ra... 68.9 1e-09
gi|15669863|ref|NP_248677.1| hypothetical protein MJ_1667 [Metha... 65.5 2e-08
>gi|15609956|ref|NP_217335.1| hypothetical protein Rv2819c [Mycobacterium tuberculosis H37Rv]
gi|15842360|ref|NP_337397.1| hypothetical protein MT2886 [Mycobacterium tuberculosis CDC1551]
gi|31793995|ref|NP_856488.1| hypothetical protein Mb2843c [Mycobacterium bovis AF2122/97]
64 more sequence titles
Length=375
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/375 (100%), Positives = 375/375 (100%), Gaps = 0/375 (0%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA
Sbjct 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL
Sbjct 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE
Sbjct 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
Query 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE
Sbjct 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
Sbjct 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
Query 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI
Sbjct 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
Query 361 CYEMGQCELSIRRAE 375
CYEMGQCELSIRRAE
Sbjct 361 CYEMGQCELSIRRAE 375
>gi|298526288|ref|ZP_07013697.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298496082|gb|EFI31376.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=375
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/375 (99%), Positives = 375/375 (100%), Gaps = 0/375 (0%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEY+VEGDRVYFPDMELLYADIPAHKRKSFEA
Sbjct 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYNVEGDRVYFPDMELLYADIPAHKRKSFEA 60
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL
Sbjct 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE
Sbjct 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
Query 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE
Sbjct 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
Sbjct 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
Query 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI
Sbjct 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
Query 361 CYEMGQCELSIRRAE 375
CYEMGQCELSIRRAE
Sbjct 361 CYEMGQCELSIRRAE 375
>gi|254232914|ref|ZP_04926241.1| hypothetical protein TBCG_02755 [Mycobacterium tuberculosis C]
gi|124601973|gb|EAY60983.1| hypothetical protein TBCG_02755 [Mycobacterium tuberculosis C]
Length=375
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/375 (99%), Positives = 374/375 (99%), Gaps = 0/375 (0%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA
Sbjct 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL
Sbjct 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE
Sbjct 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
Query 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGK DGLPLFRE
Sbjct 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKHDGLPLFRE 240
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
Sbjct 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
Query 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI
Sbjct 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
Query 361 CYEMGQCELSIRRAE 375
CYEMGQCELSIRRAE
Sbjct 361 CYEMGQCELSIRRAE 375
>gi|289762994|ref|ZP_06522372.1| hypothetical protein TBIG_02177 [Mycobacterium tuberculosis GM
1503]
gi|289710500|gb|EFD74516.1| hypothetical protein TBIG_02177 [Mycobacterium tuberculosis GM
1503]
Length=375
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/375 (99%), Positives = 374/375 (99%), Gaps = 0/375 (0%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
MNTYLKPFE TLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA
Sbjct 1 MNTYLKPFERTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA 60
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL
Sbjct 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE
Sbjct 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
Query 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
RK+LRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE
Sbjct 181 RKQLRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
Sbjct 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV 300
Query 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI
Sbjct 301 GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNI 360
Query 361 CYEMGQCELSIRRAE 375
CYEMGQCELSIRRAE
Sbjct 361 CYEMGQCELSIRRAE 375
>gi|340627815|ref|YP_004746267.1| hypothetical protein MCAN_28431 [Mycobacterium canettii CIPT
140010059]
gi|340006005|emb|CCC45174.1| hypothetical protein MCAN_28431 [Mycobacterium canettii CIPT
140010059]
Length=375
Score = 582 bits (1501), Expect = 2e-164, Method: Compositional matrix adjust.
Identities = 292/376 (78%), Positives = 314/376 (84%), Gaps = 2/376 (0%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAH-KRKSFE 59
M+ YLKPFELTLRCLGPVFIGSGEKRT KEY VYFPDME LYAD+ A K +SFE
Sbjct 1 MSQYLKPFELTLRCLGPVFIGSGEKRTPKEYVASTSMVYFPDMERLYADVAAQGKSESFE 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLT 119
F+MNT AQ EW+ N VK+ P H GY VKIGSI P RA RGR G+M +++
Sbjct 61 EFMMNTGKAQPDERFNEWIAENGVKVSPKNHGGYGVKIGSIVPGRAHRGRDGQMIQEQRQ 120
Query 120 LNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERF 179
LN+IH+FIKD LG PYVPGS+VKGMLRSIYLQSLVH+RTAQPVRVPGHQTREHRQYGERF
Sbjct 121 LNDIHSFIKDVLGNPYVPGSSVKGMLRSIYLQSLVHQRTAQPVRVPGHQTREHRQYGERF 180
Query 180 ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFR 239
ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSP LRTSDLLICQKMD+NVHGKPDGLPLFR
Sbjct 181 ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPGLRTSDLLICQKMDVNVHGKPDGLPLFR 240
Query 240 ECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAI 299
ECLAPGTSIS RVVVDTSPTARGGW GERFLETL++T A VN+ARYAEY A Y +
Sbjct 241 ECLAPGTSISLRVVVDTSPTARGGWPAGERFLETLSDTVAFVNKARYAEYAAKYWDDDPQ 300
Query 300 VGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDN 359
GPIVYLGGGAGYRSKTFVT QDDMAKVLDAQF K +KHV KTR+L VSPLVLK TKI +
Sbjct 301 FGPIVYLGGGAGYRSKTFVTQQDDMAKVLDAQFPK-IKHVAKTRDLGVSPLVLKLTKIGD 359
Query 360 ICYEMGQCELSIRRAE 375
YEMGQCELSIRRAE
Sbjct 360 KYYEMGQCELSIRRAE 375
>gi|224543482|ref|ZP_03684021.1| hypothetical protein CATMIT_02691 [Catenibacterium mitsuokai
DSM 15897]
gi|224523609|gb|EEF92714.1| hypothetical protein CATMIT_02691 [Catenibacterium mitsuokai
DSM 15897]
Length=380
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 123/403 (31%), Positives = 184/403 (46%), Gaps = 62/403 (15%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEY-HVEGDRVYFPDMELLYADIPAHKR-KSF 58
M YLK + + L+ LGPVFIGSG++ + KEY + +++ D+ Y ++ K+ SF
Sbjct 1 MKNYLKSYRIHLKVLGPVFIGSGKELSKKEYLFYKNNQIAIIDIAKFYLELKKIKKLDSF 60
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPA-KHRGYEVKIGSIEPRRASRGRGGRMTRKK 117
EAF+++ T W+ N V + Y + G IE + R+
Sbjct 61 EAFMLDEHEHLGT-----WIRKNNVNNSIVDRCIKYTLDKGDIEETK----------RRN 105
Query 118 LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL--------QSLVHKRTAQPVRVPGHQT 169
+ L F+KDP G PYVPGS++KGM R+I+ + +R +V +T
Sbjct 106 VML----EFVKDPYGNPYVPGSSLKGMFRTIFFADRLINHSKDYTIQRKQFKEKVFEKET 161
Query 170 REHRQYG---ERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDM 226
+ R + E K R RP+T+ DAVND+ V+DS L DL + QK+D+
Sbjct 162 NKKRYLSRNIQDIEAKTFRTLHRPDTKVDDAVNDIMAGFIVSDSEPLSVEDLTLAQKVDV 221
Query 227 NVHGKPDGLPLFRECLAPGTSISHRVVVDTS--PTARGGWREGERFLETLAETAASVNQA 284
+V LP RECL PGT I V +DTS P A+ E + + N
Sbjct 222 HVKKGAKNLPSVRECLKPGTDIVFTVTIDTSICPYAKQDIIESINYFDD--------NYN 273
Query 285 RYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFV------TDQDDMAKVLD------AQF 332
+Y + + V I V+LGGG+GY SKT V D + +++ F
Sbjct 274 KY--FVEPFTAVEYIDDGSVFLGGGSGYASKTAVYPLFDGEDSEQTVRIVQQIMVNTTTF 331
Query 333 GK-----VVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELS 370
K + KH D R +SP +K T + +MG C+L
Sbjct 332 DKKTRRNLHKHEDDLRLYGISPHTIKCTYYHDQLLQMGLCQLD 374
>gi|315925057|ref|ZP_07921274.1| conserved hypothetical protein [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621956|gb|EFV01920.1| conserved hypothetical protein [Pseudoramibacter alactolyticus
ATCC 23263]
Length=363
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 120/388 (31%), Positives = 179/388 (47%), Gaps = 55/388 (14%)
Query 8 FELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPD--------MELLYADIPAHKRKSF- 58
F +TL GPV IGSGE+ + KEY V+FP+ M LYA SF
Sbjct 6 FRMTLTAQGPVSIGSGEEISKKEY------VFFPEKRRIVVMAMPKLYALAQKKNLGSFF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAV---KLDPAKHRGYEVKIGSIEPRRASRGRGGRMTR 115
E F+ G + L W+ N + +LD Y + G+IE R
Sbjct 60 EDFLCPPPGRRRNQDLGSWICKNRISGKELDTCVR--YILPTGAIETSRNY--------- 108
Query 116 KKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQS-LVHKRTAQPVRVP------GHQ 168
+ AF+KDP G+PYVPGS+VKGMLR++ L + L A + G++
Sbjct 109 ------NVWAFVKDPYGKPYVPGSSVKGMLRTVLLTARLWQNHRAWQAEIDALKSGRGNR 162
Query 169 TREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNV 228
++ + E + GRP TR +DAVND + V+DS L DL++CQ+++
Sbjct 163 NNYLKREIDTLEARAFHTLGRPGTRREDAVNDELSGLIVSDSEPLTLKDLILCQRLEHKP 222
Query 229 HGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAE 288
GK LP+ +E L PGT I + +D S A E L E A ++
Sbjct 223 DGKEKTLPILKESLRPGTRIRFSLTIDPSRCALTK--------EALLEAVARFDEVYQKC 274
Query 289 YRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTD----QDDMAKVLDA-QFGKVVKHVDKTR 343
+ + + G++ + G VYLGG AG+ +KT + Q + + Q KV ++ R
Sbjct 275 FLSAFLGMDRLTGSEVYLGGNAGFATKTVIYAALGRQAGIQTIRQIFQQTKVPRNHHHER 334
Query 344 ELRVSPLVLKRTKIDNICYEMGQCELSI 371
+VSP ++K + YE G+C L+I
Sbjct 335 NQKVSPHIVKCARYGGKLYEFGKCRLAI 362
>gi|114567267|ref|YP_754421.1| hypothetical protein Swol_1752 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338202|gb|ABI69050.1| CRISPR-associated protein, Csm5 family [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
Length=380
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 123/399 (31%), Positives = 176/399 (45%), Gaps = 50/399 (12%)
Query 3 TYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDR--VYFPDMELLYADIPAHKRKSFEA 60
+L+ LTLR L PVFIGSGE+ KEY + +YFPD L A + K +S A
Sbjct 4 AHLERLNLTLRALAPVFIGSGEQLGKKEYIFDSPNALIYFPDFPRLVAFL---KERSLLA 60
Query 61 FVMNTDGAQATAPLKEWVEPNAVKL-DPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLT 119
++ ++E N + D Y + G R
Sbjct 61 EYEKFLSTPRLKDIRVFLEENGISAADYPSFVRYSIAAGEAAHIENFR------------ 108
Query 120 LNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVR---------VPG--HQ 168
E+ FIKD G PY+PGS++KG +R+ L+ + + R VP +
Sbjct 109 --EVLTFIKDSKGYPYIPGSSLKGAIRTALATYLLKRGDWERDRRNIEGSDSSVPARKYL 166
Query 169 TREHRQYGER-FERKELR--KSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMD 225
RE ++ F + ++R K G+ + P +NDL Q IR++DS AL +L + K D
Sbjct 167 ARESSTVEKKVFYQLDIRNPKDGKEISSP---INDLMQGIRISDSAALSFENLTLTGKYD 223
Query 226 MNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQAR 285
G + LP+FRECL PG+ ++ +D AR G G +E A + A
Sbjct 224 RKPDGTVNLLPIFRECLTPGSEAHLQLTLDLPMLARVGLNAG--IIEEALHDFADEHYAH 281
Query 286 YAEYRAMYP---GVNAIVGPIVYLGGGAGYRSKTFVTDQ--------DDMAKVLDAQFGK 334
+ +Y A P V A G ++LGGG GY SKT + AK+L QF
Sbjct 282 FEQYFAELPEDASVAAKEGVDIFLGGGVGYVSKTLTYNLFPQRENAVSLAAKILTKQFSP 341
Query 335 VVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIRR 373
H + +VSP +LK T Y MG+CEL I R
Sbjct 342 KHGHSKDASQYKVSPHILKTTMYAGEYYHMGKCELIITR 380
>gi|345284418|gb|AEN78271.1| CRISPR-associated Csm5 family protein [Lactobacillus ruminis
ATCC 27782]
Length=347
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 112/391 (29%), Positives = 178/391 (46%), Gaps = 62/391 (15%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHK--RKSF 58
M Y F+ TL LGPV IGSGEK T KEY E ++ YFPDM LY I +F
Sbjct 1 MKDYHTKFDFTLLVLGPVHIGSGEKYTKKEYVYENNKYYFPDMGRLYLRIKDEPGLNSAF 60
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKK- 117
AF+ + T L E++ N++ LD GY + E ++S GR ++
Sbjct 61 TAFMTEINDGSRTTTLGEFLSANSI-LD-RDFGGYSISESGYEFEKSS-GRSWNSRNREP 117
Query 118 ---LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQ 174
LNEI AF+KD GRPY+PGS++KG +R+I +
Sbjct 118 GAGRNLNEISAFVKDSYGRPYIPGSSLKGAIRTILIN----------------------- 154
Query 175 YGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGK-PD 233
E+F+ ++ P +++F IR++DS + +L + QK D N +
Sbjct 155 --EKFKTDDV---------PWKDGDNIFNEIRISDSKPISVDNLTLVQKWDYNAKKNCSN 203
Query 234 GLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMY 293
LP++RE L P T I + +S AR + +E L+ A S Q ++ + Y
Sbjct 204 SLPIWRESLKPLTRIEFTITT-SSERAR-------KLIENLSHYAKSFYQRYKNKFLSAY 255
Query 294 PG--VNAIVGPIVYLGGGAGYRSKT------FVTDQDDMAKVLDAQFGKVVKHVDKTREL 345
P + + +YLG G+G +K Q+ K + + V+K + + +++
Sbjct 256 PDRVIQKNIDCPIYLGAGSGLWTKVDYHHVRIDKIQEKSYKKMKMKGNGVLK-LARYKKV 314
Query 346 RVSPLVLKRTKIDN-ICYEMGQCELSIRRAE 375
++ K + N + YEMG+C SI+ +
Sbjct 315 KIKTKDGKSIHLTNDVFYEMGKCGFSIKEVD 345
>gi|116627770|ref|YP_820389.1| hypothetical protein STER_0977 [Streptococcus thermophilus LMD-9]
gi|116101047|gb|ABJ66193.1| CRISPR-associated protein, Csm5 family [Streptococcus thermophilus
LMD-9]
Length=357
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 111/394 (29%), Positives = 174/394 (45%), Gaps = 57/394 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKR--KSF 58
M + F+L+L L P+ IG+GEK TS+E+ E + YFPDM Y + KR + F
Sbjct 1 MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKM-VEKRLAEKF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
EAF++ T L ++ N ++ GY + +E R G
Sbjct 60 EAFLIQTRPNARNNRLISFLNDN--RIAERSFGGYSISETGLESDRNPNSAGA------- 110
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+NE++ FI+D G PY+PGS++KG +R+I + + V G +E+
Sbjct 111 -INEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFPKEN------ 163
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH-GKPDGLPL 237
K L G + D DLF AIRV+DS L++ QK D + K LPL
Sbjct 164 ---KNLIPWGPKKGKEYD---DLFNAIRVSDSKPFDNKSLILVQKWDYSAKTNKAKPLPL 217
Query 238 FRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG-- 295
+RE ++P T I + T E R +E L + A QA Y +Y+A +
Sbjct 218 YRESISPLTKIEFEITTTTD--------EAGRLIEELGKRA----QAFYKDYKAFFLSEF 265
Query 296 ----VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLV 351
+ A + +YLG G+G +KT D +L ++ ++ + K L+++
Sbjct 266 PDDKIQANLQYPIYLGAGSGAWTKTLFKQADG---ILQRRYSRMKTKMVKKGVLKLTKAP 322
Query 352 LKRTKIDN----------ICYEMGQCELSIRRAE 375
LK KI + YEMG+ I+ +
Sbjct 323 LKTVKIPSGNHSLVKNHESFYEMGKANFMIKEID 356
>gi|327469968|gb|EGF15432.1| hypothetical protein HMPREF9386_0579 [Streptococcus sanguinis
SK330]
Length=378
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 121/398 (31%), Positives = 185/398 (47%), Gaps = 53/398 (13%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD-IPAHKRKSFE 59
M T + F+LTL LGPV IGSG+ T++EY +EGD YFPDM LLY + I + F+
Sbjct 1 MKTKYRKFKLTLWTLGPVHIGSGQLHTAREYILEGDEYYFPDMTLLYDELIKRGIDEKFQ 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLT 119
F++++D T + +++ + + GY +K +E + + T
Sbjct 61 KFLIDSD--NKTNRISDFLAEHGIT--KRNFGGYRLKATGLEKPKGENVPRNQETTDPGE 116
Query 120 LNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERF 179
+N +H F++D G PYVPGS++KG +R+I + + H + G +
Sbjct 117 INGVHQFMRDCYGNPYVPGSSLKGAIRTILMNTHWHSTDFKQENKKGKIVENKKAIPWGP 176
Query 180 ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMN-VHGKPDGLPLF 238
R++ + +P +D+F IRV+DS L DL++ QK D H KP L ++
Sbjct 177 TRRQRHEKIKP-------FDDIFNEIRVSDSQPLTNDDLILVQKWDFTPDHTKPHSLSIY 229
Query 239 RECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETA-----ASVNQARYAEYRAM- 292
RE L PGT + ++ TS + G R GE + +L E A Y Y+
Sbjct 230 REALRPGTKMEFEII--TSLGFKDG-RAGE-LISSLGEYAQKFYFGMTEDEGYEGYKDFF 285
Query 293 ---YPG---VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRE-- 344
+P N + P+ YLGGG+G +KT D G+V K K E
Sbjct 286 LKKFPNHLIQNNLSYPL-YLGGGSGAWTKTVFRQAD----------GEVQKRHKKMSERG 334
Query 345 ---LRVSP-LVLKRTK-----IDNI--CYEMGQCELSI 371
L +P V+K TK I+N YEMG+ +I
Sbjct 335 ALKLTKAPQQVIKTTKGEKSLINNAQNFYEMGKTCFTI 372
>gi|312278325|gb|ADQ62982.1| CRISPR-associated protein, Csm5 family [Streptococcus thermophilus
ND03]
Length=357
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/394 (28%), Positives = 174/394 (45%), Gaps = 57/394 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKR--KSF 58
M + F+L+L L P+ IG+GEK TS+E+ E + YFPDM Y + KR + F
Sbjct 1 MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKM-VEKRLAEKF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
EAF++ T L ++ N ++ GY + +E + G
Sbjct 60 EAFLIQTRPNARNNRLISFLNDN--RIAERSFGGYSISETGLESDKNPNSAGA------- 110
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+NE++ FI+D G PY+PGS++KG +R+I + + V G +E+
Sbjct 111 -INEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFPKEN------ 163
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH-GKPDGLPL 237
K L G + D DLF AIRV+DS L++ QK D + K LPL
Sbjct 164 ---KNLIPWGPKKGKEYD---DLFNAIRVSDSKPFDNKSLILVQKWDYSAKTNKAKPLPL 217
Query 238 FRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG-- 295
+RE ++P T I + T E R +E L + A QA Y +Y+A +
Sbjct 218 YRESISPLTKIEFEITTTTD--------EAGRLIEELGKRA----QAFYKDYKAFFLSEF 265
Query 296 ----VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLV 351
+ A + +YLG G+G +KT D +L ++ ++ + K L+++
Sbjct 266 PDDKIQANLQYPIYLGAGSGAWTKTLFKQADG---ILQRRYSRMKTKMVKKGVLKLTKAP 322
Query 352 LKRTKIDN----------ICYEMGQCELSIRRAE 375
LK KI + YEMG+ I+ +
Sbjct 323 LKTVKIPSGNHSLVKNHESFYEMGKANFMIKEID 356
>gi|339278117|emb|CCC19865.1| hypothetical protein STH8232_1166 [Streptococcus thermophilus
JIM 8232]
Length=357
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 111/394 (29%), Positives = 174/394 (45%), Gaps = 57/394 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKR--KSF 58
M + F+L+L L P+ IG+GEK TS+E+ E + YFPDM Y + KR + F
Sbjct 1 MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKM-VEKRLAEKF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
EAF++ T L ++ N ++ GY + +E R G
Sbjct 60 EAFLIQTRPNARNNRLISFLNDN--RIAERSFGGYSISETGLESDRNPNSAGA------- 110
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+NE++ FI+D G PY+PGS++KG +R+I + + V G +E+
Sbjct 111 -INEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFPKEN------ 163
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH-GKPDGLPL 237
K L G + D DLF AIRV+DS L++ QK D + K LPL
Sbjct 164 ---KNLIPWGPKKGKEYD---DLFNAIRVSDSKPFDNKRLILVQKWDYSAKTNKAKPLPL 217
Query 238 FRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG-- 295
+RE ++P T I + T E R +E L + A QA Y +Y+A +
Sbjct 218 YRESISPLTKIEFEITTTTD--------EAGRLIEELGKRA----QAFYKDYKAFFLSEF 265
Query 296 ----VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLV 351
+ A + +YLG G+G +KT D +L ++ ++ + K L+++
Sbjct 266 PDDKIQANLQYPIYLGAGSGAWTKTLFKQADG---ILQRRYSRMKTKMVKKGVLKLTKAP 322
Query 352 LKRTKIDN----------ICYEMGQCELSIRRAE 375
LK KI + YEMG+ I+ +
Sbjct 323 LKIVKIPSGNHSLIKNHESFYEMGKANFMIKEID 356
>gi|325687526|gb|EGD29547.1| hypothetical protein HMPREF9381_1060 [Streptococcus sanguinis
SK72]
Length=378
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/392 (30%), Positives = 187/392 (48%), Gaps = 41/392 (10%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD-IPAHKRKSFE 59
M T + F+LTL LGPV IGSG+ T++EY +EGD YFPDM LLY + I + F+
Sbjct 1 MKTKYRKFKLTLWTLGPVHIGSGQLHTAREYILEGDEYYFPDMTLLYDELIKRGIDEKFQ 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAV-KLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
F++ D T +++++ + + K D GY +K +E + + T
Sbjct 61 KFLI--DSENKTNRIRDFLAEHGITKRDFG---GYRLKATGLENPKEENATRNQETTNPG 115
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+N +H F++D G PYVPGS++KG +R+I + + H ++ + GE
Sbjct 116 EINGVHQFMRDCYGNPYVPGSSLKGAIRTILMNTHWHST----------DFKQENKKGEI 165
Query 179 FERKELRKSG---RPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHG-KPDG 234
E K+ G R + + +D+F IRV+DS L DL++ QK D KP
Sbjct 166 VENKKAIPWGPTRRQRYKELEPFDDIFNEIRVSDSQPLTNDDLILVQKWDFTPDDTKPHS 225
Query 235 LPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAAS-----VNQARYAEY 289
L ++RE L PGT + ++ T+ + G R GE + +L E A Y Y
Sbjct 226 LSIYREALRPGTKMEFEII--TALGFKDG-RAGE-LVASLGEYAQKFYFGVTEDEGYEGY 281
Query 290 RAM----YPG---VNAIVGPIVYLGGGAGYRSKTFVTDQD-DMAKVLDAQFGKVVKHVDK 341
+ +P N + P+ YLGGG+G +KT D ++ + + G+ + K
Sbjct 282 KDFFLKKFPNHLIQNNLSYPL-YLGGGSGAWTKTVFRQADGEVQQRHEKMSGRGALKLTK 340
Query 342 TRELRVSPLVLKRTKIDNI--CYEMGQCELSI 371
+ + +++ I+N YEMG+ +I
Sbjct 341 APQQVIKTTKGEKSLINNAQNFYEMGKTCFTI 372
>gi|55822918|ref|YP_141359.1| hypothetical protein str0964 [Streptococcus thermophilus CNRZ1066]
gi|55738903|gb|AAV62544.1| hypothetical protein str0964 [Streptococcus thermophilus CNRZ1066]
Length=357
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/394 (28%), Positives = 174/394 (45%), Gaps = 57/394 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKR--KSF 58
M + F+L+L L P+ IG+GEK TS+E+ E + YFPDM Y + KR + F
Sbjct 1 MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKM-VEKRLAEKF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
EAF++ T L ++ N ++ GY + +E + G
Sbjct 60 EAFLIQTRPNARNNRLISFLNDN--RIAERSFGGYSISETGLESDKNPDSTGA------- 110
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+NE++ FI+D G PY+PGS++KG +R+I + + V G +E+
Sbjct 111 -INEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFPKEN------ 163
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH-GKPDGLPL 237
K L G + D DLF AIRV+DS L++ QK D + K LPL
Sbjct 164 ---KNLIPWGPKKGKEYD---DLFNAIRVSDSKPFDNKSLILVQKWDYSAKTNKAKPLPL 217
Query 238 FRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG-- 295
+RE ++P T I + T E R +E L + A QA Y +Y+A +
Sbjct 218 YRESISPLTKIEFEITTTTD--------EAGRLIEELGKRA----QAFYKDYKAFFLSEF 265
Query 296 ----VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLV 351
+ A + +YLG G+G +KT D +L ++ ++ + K L+++
Sbjct 266 PDDKIQANLQYPIYLGAGSGAWTKTLFKQADG---ILQRRYSRMKTKMVKKGVLKLTKAP 322
Query 352 LKRTKIDN----------ICYEMGQCELSIRRAE 375
LK KI + YEMG+ I+ +
Sbjct 323 LKTVKIPSGNHSLVKNHESFYEMGKANFMIKEID 356
>gi|55820999|ref|YP_139441.1| hypothetical protein stu0964 [Streptococcus thermophilus LMG
18311]
gi|55736984|gb|AAV60626.1| hypothetical protein stu0964 [Streptococcus thermophilus LMG
18311]
Length=357
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/394 (28%), Positives = 174/394 (45%), Gaps = 57/394 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKR--KSF 58
M + F+L+L L P+ IG+GEK TS+E+ E + YFPDM Y + KR + F
Sbjct 1 MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKM-VEKRLAEKF 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
EAF++ T L ++ N ++ GY + +E + G
Sbjct 60 EAFLIQTRPNARNNRLISFLNDN--RIAERSFGGYSISETGLESDKNPDSAGA------- 110
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
+NE++ FI+D G PY+PGS++KG +R+I + + V G +E+
Sbjct 111 -INEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFPKEN------ 163
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH-GKPDGLPL 237
K L G + D DLF AIRV+DS L++ QK D + K LPL
Sbjct 164 ---KNLIPWGPKKGKEYD---DLFNAIRVSDSKPFDNKSLILVQKWDYSAKTNKAKPLPL 217
Query 238 FRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG-- 295
+RE ++P T I + T E R +E L + A QA Y +Y+A +
Sbjct 218 YRESISPLTKIEFEITTTTD--------EAGRLIEELGKRA----QAFYKDYKAFFLSEF 265
Query 296 ----VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLV 351
+ A + +YLG G+G +KT D +L ++ ++ + K L+++
Sbjct 266 PDDKIQANLQYPIYLGAGSGAWTKTLFKQADG---ILQRRYSRMKTKMVKKGVLKLTKAP 322
Query 352 LKRTKIDN----------ICYEMGQCELSIRRAE 375
LK KI + YEMG+ I+ +
Sbjct 323 LKTVKIPSGNHSLVKNHESFYEMGKANFMIKEID 356
>gi|240143670|ref|ZP_04742271.1| CRISPR-associated RAMP protein, Csm5 family [Roseburia intestinalis
L1-82]
gi|257204347|gb|EEV02632.1| CRISPR-associated RAMP protein, Csm5 family [Roseburia intestinalis
L1-82]
Length=373
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/387 (30%), Positives = 173/387 (45%), Gaps = 45/387 (11%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKE--YHVEGDRVYFPDMELLYADIPAHKR-KS 57
M YLK + + + L PV+IGSGEK KE Y V P++E +Y D+ K
Sbjct 1 MRDYLKHYRVKICVLSPVYIGSGEKIGKKEHIYMPWNHHVIIPNVEKMYMDLQKKGLGKE 60
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAV-KLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRK 116
F ++M DG L +W+ + + + D + + YE+ G + +R +
Sbjct 61 FADYMM--DGRPKEPSLSQWLGQHKMQREDYERWKLYEMDAGEAFVSQTARPK------- 111
Query 117 KLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHK------RTAQPVRVPGHQTR 170
EI AF+KD G PYVPGST+KGM R+ + + K RT + ++ +
Sbjct 112 -----EIEAFVKDAYGMPYVPGSTLKGMFRTALIADEIQKCPEKYERTGREIQSASAERA 166
Query 171 EHRQY----GERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDM 226
+Q +R E++ R +P +AVND + V DS + L + QK+D+
Sbjct 167 SRKQCLARETKRLEQQIFYTLNRDEKKPANAVNDNLSGLHVGDSQPISVDQLTLSQKIDV 226
Query 227 NVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARY 286
+ G L + RE L PGT I V +DT+ + E +E L N+ Y
Sbjct 227 TLDGTEKPLNVLRETLIPGTEICFDVSIDTTICP----YQMEDIIEALNIFQNICNRYFY 282
Query 287 AEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQ--DDMAKVLDAQF----GK--VVKH 338
A + N V+LGGG G+ SKT + + KV+D F GK +V
Sbjct 283 ARFHWEAKEKNT-----VWLGGGCGFLSKTVLYPLLGSNAVKVVDNVFKNTLGKNYIVHK 337
Query 339 VDKTRELRVSPLVLKRTKIDNICYEMG 365
K +L+++P K TK Y MG
Sbjct 338 HTKDLQLKLAPHACKCTKYQGKLYHMG 364
>gi|125718067|ref|YP_001035200.1| hypothetical protein SSA_1247 [Streptococcus sanguinis SK36]
gi|125497984|gb|ABN44650.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
gi|327474442|gb|EGF19848.1| hypothetical protein HMPREF9391_0568 [Streptococcus sanguinis
SK408]
Length=378
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 116/398 (30%), Positives = 182/398 (46%), Gaps = 53/398 (13%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD-IPAHKRKSFE 59
M T + F+LTL LGPV IGSG+ T++EY +EGD YFPDM LLY + I + F+
Sbjct 1 MKTKYRKFKLTLWTLGPVHIGSGQLHTAREYILEGDEYYFPDMTLLYDELIKRGIDEKFQ 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLT 119
F++++D T + +++ + + GY +K +E + + T
Sbjct 61 KFLIDSD--NKTNRISDFLAEHGIT--KRNFGGYRLKATGLEKPKGENVPRNQETTDPGE 116
Query 120 LNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERF 179
+N +H F++D G PYVPGS++KG +R+I + + H + G +
Sbjct 117 INGVHQFMRDCYGNPYVPGSSLKGAIRTILMNTHWHSTDFKQENKKGKIVENKKAIPWGP 176
Query 180 ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHG-KPDGLPLF 238
R++ + +P +D+F IRV+DS L DL++ QK D KP L ++
Sbjct 177 TRRQRHEKIKP-------FDDIFNEIRVSDSQPLTNDDLILVQKWDFTPDDTKPHSLSIY 229
Query 239 RECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASV--------NQARYAEYR 290
RE L PGT + ++ T+ +GG R GE + +L E A Y Y+
Sbjct 230 REALRPGTKMEFEII--TALGFKGG-RAGE-LVASLGEYAQKFYFGVTEDEGYEGYEGYK 285
Query 291 AM----YPG---VNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTR 343
+P N + P+ YLGGG+G +KT D G+V K K
Sbjct 286 DFFLKKFPNHLIQNNLSYPL-YLGGGSGAWTKTVFRQAD----------GEVQKRHKKMS 334
Query 344 E-----LRVSPLVLKRTKIDNI-----CYEMGQCELSI 371
E L +P + +I+ I YEMG+ +I
Sbjct 335 ERGALKLTKAPFLTVDGEIELINNAENFYEMGKTCFTI 372
>gi|331004039|ref|ZP_08327521.1| csm5 family CRISPR-associated ramp protein [Lachnospiraceae oral
taxon 107 str. F0167]
gi|330411625|gb|EGG91033.1| csm5 family CRISPR-associated ramp protein [Lachnospiraceae oral
taxon 107 str. F0167]
Length=381
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 100/407 (25%), Positives = 183/407 (45%), Gaps = 75/407 (18%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYA-DIPAHKRKSFE 59
M +LK + + L+ +GPVFIGSGE KE + D+V D +L++ + + K ++
Sbjct 1 MGDFLKKYNIELKTVGPVFIGSGETINKKEALFKKDKVVIIDTKLMFEYFLKRNLLKQYQ 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRAS-----RGRGGRMT 114
++++T + L + + N + D ++ + +K S+ +++ +GRG
Sbjct 61 EYMLDT-----SKDLAVFFKDNNI--DEKIYKTWNIKELSLGDTKSTGDGDVKGRG---- 109
Query 115 RKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQ 174
I F++D GR YVPGS++KGMLR+I + + + ++ G +
Sbjct 110 --------IVRFVRDGNGRVYVPGSSLKGMLRTILAGEYIIQNSNCGIK--GKLDETAWE 159
Query 175 YGERFERKELRKSGR----------------PNT----RPQDAVNDLFQAIRVTDSPALR 214
G+ RKE K+ + PN+ + + +ND + V+DS +
Sbjct 160 LGKNPRRKEFEKAYKNTLTDIDVNIFHKDLFPNSDGKNKLDNKINDTLRGFMVSDSEYIS 219
Query 215 TSDLLICQKMDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPT---------ARGGWR 265
D+ +CQK+D++ G LPL+REC+ P T+I + +D+S + G +
Sbjct 220 DEDMCVCQKVDISTDGTEIALPLYRECIKPDTTIRFSITIDSSFCDYTKPDIIESIGSFY 279
Query 266 EGERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQDDMA 325
E + A +++ RY +LGGGAG+ SKT + +
Sbjct 280 ENYWNKVSKQFKKAPISKDRYT----------------CFLGGGAGFESKTIIYSSFERT 323
Query 326 KVLDAQ---FGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
K +D + + +++ VSP V+ TK ++ + G C L
Sbjct 324 KAVDFTSNILSVMFPNAKHDKDMEVSPRVINCTKFEHTKHLFGACSL 370
>gi|322387547|ref|ZP_08061156.1| hypothetical protein HMPREF9423_0554 [Streptococcus infantis
ATCC 700779]
gi|321141414|gb|EFX36910.1| hypothetical protein HMPREF9423_0554 [Streptococcus infantis
ATCC 700779]
Length=358
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 105/385 (28%), Positives = 171/385 (45%), Gaps = 57/385 (14%)
Query 8 FELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD-IPAHKRKSFEAFVMNTD 66
F+ +L + P+ IG+GEK TS+E+ E YFPDM Y + + FE F+ T
Sbjct 8 FQFSLLAMAPIHIGNGEKYTSREFIYENGYFYFPDMGKFYNRMVEKGYDQKFERFLQETK 67
Query 67 GAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIHAF 126
L +++ N ++ GY + +E + +R K T+NE+ F
Sbjct 68 PNARNNRLISFLDDN--RISNRDFGGYRIVETGLEIEKNNR------DSKLGTINEVAKF 119
Query 127 IKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQ---YGERFERKE 183
I+DP G PY+PGS++KG +R+I + + + G +E+++ +G
Sbjct 120 IRDPFGSPYIPGSSLKGAIRTILMNTNPDWNNKNAIDFRGRGPKENKKMIPWGA------ 173
Query 184 LRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDM---NVHGKPDGLPLFRE 240
K G+ NDLF AIRV+DS +++ QK D ++ KP LPL+RE
Sbjct 174 --KKGQ-------EFNDLFNAIRVSDSKPFNNEQIILVQKWDYSAKSLTAKP--LPLYRE 222
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG---VN 297
+ P T I+ + T +E +E L + A QA Y EY+ + N
Sbjct 223 AIVPLTRINFTITTTT--------KEAGILIEELGQRA----QAFYKEYKEFFLSDFPEN 270
Query 298 AIVGPI---VYLGGGAGYRSKTFVTDQDDM-----AKVLDAQFGKVVKHVDKT--RELRV 347
I + +YLG G+G +KT D + +++ GK V + K + ++
Sbjct 271 KIQPNLQYPIYLGAGSGAWTKTLFQQADGILQKRYSRMKTKMVGKGVLKLTKAPMKSVKT 330
Query 348 SPLVLKRTKIDNICYEMGQCELSIR 372
+ K D YEMG+ I+
Sbjct 331 TQATRKLIMNDESFYEMGKANFIIK 355
>gi|229826462|ref|ZP_04452531.1| hypothetical protein GCWU000182_01835 [Abiotrophia defectiva
ATCC 49176]
gi|229789332|gb|EEP25446.1| hypothetical protein GCWU000182_01835 [Abiotrophia defectiva
ATCC 49176]
Length=381
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/402 (29%), Positives = 185/402 (47%), Gaps = 56/402 (13%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDR--VYFPDMELLYADIPAHKRKS- 57
M YL +EL ++ L PV+IGSG +EY + + V F D+E L+ I + S
Sbjct 1 MEDYLINYELKIKILTPVYIGSGYTVGKREYIHDKSKNLVSFLDLEKLFKGILDNGLYSE 60
Query 58 FEA-FVMNTDGAQATAPLKEWVEPNAVKLDP-AKHRGYEVKIGSIEPRRASRGRGGRMTR 115
+E F + + LK+++E + D ++ Y +GS
Sbjct 61 YEKYFTADNKNREMNVELKQFLERAGIGEDKYSEWITYSEYMGS---------------- 104
Query 116 KKLTL---NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREH 172
L+L +EI FIKD G PY+PGS++KG +R+I + + K+ + R +E
Sbjct 105 SNLSLQNTHEIQTFIKDAYGNPYIPGSSLKGAIRTILESNYIRKKYNEFDRSRAEVKKEG 164
Query 173 RQYGERF---------ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQK 223
+ R+ E+ R+ + ++ ND+ + I V DS ++ + L ICQK
Sbjct 165 MKGKTRYMSVPQNHLKEKVFHRQITDERVKLENMQNDIMRGIIVGDSLSIDKNSLCICQK 224
Query 224 MDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQ 283
+D++ G L + RECL PGT ++ + +D S R + G++F L + A +N
Sbjct 225 IDLSTKGNKKSLNVLRECLKPGTVVTVPLTID-SKIVRNMY--GKKF--DLEDIKADINM 279
Query 284 ARYAEYRAMYPGVNAIVGPIV------YLGGGAGYRSKTF---VTDQDDMAKVLDAQFGK 334
Y Y+ Y I+ YLGGG+GY SKT + ++D+ KV+ +
Sbjct 280 F-YKNYKDEYITKFKNFPQIIEEKNAFYLGGGSGYVSKTVTHSLFNEDNATKVVSEILNE 338
Query 335 VVKHVDK------TRELRVSPLVLKRT-KIDNICYEMGQCEL 369
V K L VSP LK T ++N+C +MG C +
Sbjct 339 VFTQKSKPSANKDDEVLGVSPHTLKCTYYLENLC-QMGLCRI 379
>gi|270292490|ref|ZP_06198701.1| conserved hypothetical protein [Streptococcus sp. M143]
gi|270278469|gb|EFA24315.1| conserved hypothetical protein [Streptococcus sp. M143]
Length=362
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/329 (32%), Positives = 147/329 (45%), Gaps = 56/329 (17%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPA----HKRK 56
M T + F+ TL + P+ IG+GEK TS+E+ E YFPDM Y + HK
Sbjct 1 MKTEYRTFQFTLLAMAPIHIGNGEKYTSREFIYENGYFYFPDMGKFYNRMVEKGYDHK-- 58
Query 57 SFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRK 116
FE F+ T L ++E N ++ GY + +E + RGG
Sbjct 59 -FERFLQETKPNARNNRLISFLEDN--RISDRNFGGYRIIETKLETNN-NYLRGG----- 109
Query 117 KLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYG 176
LN++ FI+DP G PY+PGS++KG +R+I + + P + Q
Sbjct 110 --ALNQVSKFIRDPFGNPYIPGSSLKGAIRTILMNT-----------NPDWNNKNVLQCK 156
Query 177 ERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVH---GKPD 233
+ E K L G + D DLF AIRV+DS L++ QK D KP
Sbjct 157 K--ENKSLIPWGAKKGQDYD---DLFNAIRVSDSKPFSNKSLILVQKWDHKAKPPLAKP- 210
Query 234 GLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMY 293
LPL+RE +AP T I+ + T +E +E L + A QA Y EY+ +
Sbjct 211 -LPLYREAIAPSTKINFTITTTT--------KEAGILIEELGKRA----QAFYKEYKNFF 257
Query 294 PG---VNAIVGPI---VYLGGGAGYRSKT 316
N I I +YLG G+G +KT
Sbjct 258 LSDFPENKIQPNIQYPIYLGAGSGAWTKT 286
>gi|322375482|ref|ZP_08049995.1| CRISPR-associated RAMP protein [Streptococcus sp. C300]
gi|321279745|gb|EFX56785.1| CRISPR-associated RAMP protein [Streptococcus sp. C300]
Length=364
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/393 (27%), Positives = 166/393 (43%), Gaps = 62/393 (15%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD-IPAHKRKSFE 59
M T + F+ TL + P+ GSG+K TS+E+ E YFPDM Y + + FE
Sbjct 1 MKTEYRTFQFTLLAMAPIHTGSGDKYTSREFIYEDGYFYFPDMGKFYNRMVEKGYDQKFE 60
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLT 119
F+ + + L ++E N ++ GY +K E + + K T
Sbjct 61 RFLQERKASASNNRLISFLEDN--RISDRDFGGYRIKETGFETEK------NNIDSKLGT 112
Query 120 LNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERF 179
+NE+ F++D G PY+PGS++KG +R+I + + V+ ++
Sbjct 113 INEVSKFMRDSYGNPYIPGSSLKGAIRTILMNTNPDWNNENVVK-------------DKK 159
Query 180 ERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHG---KPDGLP 236
E K L G + D DLF IRV+DS R L++ QK D KP LP
Sbjct 160 ENKSLIPWGAKKGQNYD---DLFNTIRVSDSKPFRNDSLILVQKWDHKATTPLVKP--LP 214
Query 237 LFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG- 295
L+RE L PG I+ ++ T +E +E L E A Y +Y+ +
Sbjct 215 LYREALTPGKIINFKITTTT--------KEAGELIEKLGEKAFEF----YNDYKIFFLKD 262
Query 296 --VNAIVGPI---VYLGGGAGYRSKTFVTDQDDMAKVLDAQF--GKVVKHVDK------- 341
N I I +YLG G+G +KT D +L ++ + + V+K
Sbjct 263 FPENKIQPNIQYPIYLGAGSGAWTKTIFKQAKD---ILQERYENSRTTRMVEKGVLKLTK 319
Query 342 --TRELRVSPLVLKRTKIDNICYEMGQCELSIR 372
+ ++ + K + YEMG+ I+
Sbjct 320 APMKSVKTTQATRKLIMNNESFYEMGKANFMIK 352
>gi|225018979|ref|ZP_03708171.1| hypothetical protein CLOSTMETH_02930 [Clostridium methylpentosum
DSM 5476]
gi|224948259|gb|EEG29468.1| hypothetical protein CLOSTMETH_02930 [Clostridium methylpentosum
DSM 5476]
Length=373
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/397 (26%), Positives = 179/397 (46%), Gaps = 58/397 (14%)
Query 5 LKPFELTLRCLGPVFIGSGEKRTSKEY--HVEGDRVYFPDMELLYADIPAHKR-KSFEAF 61
++ +E+ L PV IG G K + KEY + ++V D+ + + K ++ F
Sbjct 2 IQRYEVVLTTQSPVHIGCGTKISKKEYVYYQNSNQVKIIDLVKFFRFLDEKKLVDDYQLF 61
Query 62 VMNTDGAQATAPLKEWVEPNAVKLDPAKH-RGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
A + L +W + V L+ ++ Y VK K
Sbjct 62 -----AADSYQSLGKWFKEKKVNLNQVENLTAYTVK---------------NHANNKDDE 101
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
EIH F+KD G PY+PGS++KG LR+ L LV R+ + + R++ + +++
Sbjct 102 KEIHMFLKDVYGNPYIPGSSLKGALRTAILSGLVKNRSQE--LFSAQKFRDNFKVSKKYR 159
Query 181 RKELRKSGRP------NT-----------RPQDAVNDLFQAIRVTDSPALRTSDLLICQK 223
+KE+ KS + NT +A+ + + I ++DS + S L IC K
Sbjct 160 KKEMNKSSQWIENRVLNTLQLKNRKGNWINHSNALTSILRGISISDSAPIDKSRLAICPK 219
Query 224 MDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQ 283
+D ++ P + L REC+ P T + + +D ++ G +E L ++ S
Sbjct 220 IDYSIQQNPSKVMLLRECIVPQTEVRFYMSLDPVYLSKAGVD-----IEFLQKSIQSFYM 274
Query 284 ARYAEYRAMYPGVNAIVGPI--VYLGGGAGYRSKTFVT----DQ--DDMAKVLDAQFGKV 335
+ + + +P P ++LGGG G++SKT DQ + ++++L +F +
Sbjct 275 LQRDCWLSKFPSWKENGEPHCRLFLGGGTGFQSKTITQSLYGDQALNLISELLQNRFNEH 334
Query 336 VKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIR 372
++D + VSP LK TK D Y MG+CE++ R
Sbjct 335 KHNLDVGQG--VSPRKLKCTKYDGETYLMGECEVAFR 369
>gi|253578036|ref|ZP_04855308.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850354|gb|EES78312.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39BFAA]
Length=374
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/399 (27%), Positives = 175/399 (44%), Gaps = 64/399 (16%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYH---------VEGDRVYFPDMELLYADIP 51
M LK +++ L+ GPVF+G G + KEY ++G + Y +L
Sbjct 11 MERKLKTYQIHLKVNGPVFVGDGNEIQKKEYMFLNRNTIGVIDGAKFYMLAKKL------ 64
Query 52 AHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGG 111
H + FE F+++ LK W N V + K+ V+ ++ R +G+
Sbjct 65 -HLQNDFERFMID----DTREDLKHWCFRNHVSQNDLKNCMKYVE--NVGDRSEEKGKLQ 117
Query 112 RMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL-QSLVHKRTAQPVRVPGHQTR 170
MT I DP G PY+PGS++KGMLR+I L + ++ R + R Q R
Sbjct 118 VMT-----------CITDPYGNPYIPGSSLKGMLRTILLGRDILQHR--EKYRTDTRQIR 164
Query 171 EHRQYGERFERK-------ELRKSGRPNTRP--QDAVN-DLFQAIRVTDSPALRTSDLLI 220
+ R R+ ++ K+ + R ++ V+ D+ + V DS L D+++
Sbjct 165 SDLEVN-RINRRILNNNIVKIEKNAFNSVRSSGKETVDFDIMSGVIVGDSEPLSREDIIL 223
Query 221 CQKMDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAAS 280
CQK + + G L L REC+ PGT I + +D + + + E
Sbjct 224 CQKWEQHTDGTYKTLNLLRECIKPGTVIKSTLTIDETLCNIKK--------KDILEAVQL 275
Query 281 VNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFV-------TDQDDMAKVLD-AQF 332
+ Y ++ +P + V+LGGG+G+ SKT + + + + D
Sbjct 276 FYEQYYQNFQKKFPRSDRRKPNTVFLGGGSGFVSKTVIYPLFGEKEGIETVKNIFDRTNV 335
Query 333 GKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSI 371
K +H TR + VSP +LK T+ Y MG+CEL+I
Sbjct 336 PKTHQHYKDTR-MGVSPHILKCTRYQGKEYMMGECELNI 373
>gi|334126727|ref|ZP_08500675.1| csm5 family CRISPR-associated ramp protein [Centipeda periodontii
DSM 2778]
gi|333391137|gb|EGK62258.1| csm5 family CRISPR-associated ramp protein [Centipeda periodontii
DSM 2778]
Length=404
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/403 (29%), Positives = 171/403 (43%), Gaps = 58/403 (14%)
Query 9 ELTLRCLGPVFIGSGEKRTSKEYHVEG--DRVYFPDMELLYADIPAHKRKSFEAFVMNTD 66
E L C+ PV GSGEKR + EY + + V FP+ E + + A +
Sbjct 9 EYELTCIAPVHTGSGEKRRAFEYLYDSRKNEVAFPN-ESKWIVLLAQCGLMDDFARAIEH 67
Query 67 GAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIHAF 126
GA L+EW+ N VK E + SI R+A+ R + +LN+I
Sbjct 68 GAFREKSLREWLLANGVK---------EGALRSIVLRKAATPDLMTTARGRRSLNDIVCQ 118
Query 127 IKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYG---------- 176
GRPY+PGST+KG LR+ L V + P+R R + G
Sbjct 119 TTHADGRPYIPGSTIKGALRTGLLYGAVRR---DPMRFRSFWARIRAEAGALRDKKKAWS 175
Query 177 ---ERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDM----NVH 229
E ER L P + DAV+ + +RV+D+ D ++ QK+D N
Sbjct 176 RIIEEMERTTLHTLALPGAKASDAVSSALRGLRVSDAVGTGAMDTIVLQKVDATTKRNKA 235
Query 230 GKPDG-LPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAE 288
GK + LPLFREC+ G ++ + D + G ++ +E+L + + + +
Sbjct 236 GKNESRLPLFRECIPAGRTLRFSITADLAMLETAGIMSLDQVMESLRDYTSDGLRLQKQV 295
Query 289 YRAMYPGVNAIVGPI--------VYLGGGAGYRSKTFV---TDQDD-----MAKVLDAQF 332
+ M P P+ + LGGG G+ +KT V D D+ +A LD F
Sbjct 296 FLPMNP---RFYQPLFEEAETADMLLGGGTGFLAKTLVYALADSDEEAREFIAAYLDEAF 352
Query 333 -----GKVV-KHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
G+V KH K + +SP LKR + + MG C L
Sbjct 353 TERKGGRVEPKHRHKQFDRTLSPRTLKRAVMGQDDWIMGLCAL 395
>gi|295105101|emb|CBL02645.1| CRISPR-associated RAMP protein, Csm5 family [Faecalibacterium
prausnitzii SL3/3]
Length=383
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 99/390 (26%), Positives = 169/390 (44%), Gaps = 35/390 (8%)
Query 4 YLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDR--VYFPDMELLYADIPAHKR-KSFEA 60
+L+ F+LTL+ P+F+GSG K +EY ++ V +MEL + + H + FE
Sbjct 6 HLQVFDLTLKTQSPLFVGSGRKIGKREYIYSQNQGCVKILNMELFFDYMLRHDLVRQFEK 65
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
F+++++ ++ L W + D +H K+ +P R
Sbjct 66 FMLSSN----SSLLDFWTRDCHLAEDWLEHP----KLMGDKPLVQYRLAVTEDVAGYNGT 117
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
EIH F +D GR Y+PGS++KG LR+ +L L+ T P + + E + F
Sbjct 118 KEIHQFQRDAYGRAYIPGSSLKGALRTAWLVHLLLHETLAPGKKRTLEAFE-VNHDYVFP 176
Query 181 RKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMN---------VHGK 231
R D ++ +F+ ++V+DS + L++ + ++ G
Sbjct 177 EGSYANRLRSGAAADDILDSIFRGVQVSDSAPIDNDKLILTGRTLISPLSAARVEAFDGD 236
Query 232 PDGLPLFRECLAPGTSISHRVVVDTSPTARGGW-REGERFLETLAETAASVNQARYAEYR 290
LPL++EC+ PG +I R+ +D S R + LE +AE + + +
Sbjct 237 AKDLPLYQECVRPGETIRFRLTLDQSILNRYAHPITKDALLEAIAEFSRFYQDTFLSHFP 296
Query 291 AMYPGVNAIVGPIVYLGGGAGYRSKT--FVTDQDDMA-------KVLDAQFGKVVKHVDK 341
+P N P + LGGG G+ SKT + +DD A ++L Q G+ +
Sbjct 297 QGHPVANIPDTPHLILGGGTGFFSKTVGYPYLKDDYAAALKWTQRILQTQHGRHEADI-- 354
Query 342 TRELRVSPLVLKRTKIDNICYEMGQCELSI 371
L VSP + Y G CE++I
Sbjct 355 --SLGVSPHRARYVTYAGKRYPAGFCEVNI 382
>gi|323141259|ref|ZP_08076155.1| CRISPR-associated RAMP protein, Csm5 family [Phascolarctobacterium
sp. YIT 12067]
gi|322414216|gb|EFY05039.1| CRISPR-associated RAMP protein, Csm5 family [Phascolarctobacterium
sp. YIT 12067]
Length=387
Score = 102 bits (255), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 107/404 (27%), Positives = 181/404 (45%), Gaps = 55/404 (13%)
Query 1 MNTYLKPFE---LTLRCLGPVFIGSGEKRTSKEYHVEGDR--VYFPDMELLYADIPAHKR 55
MN+ K FE + L+ + P+ I G +K+Y + R V+F ++ + I H
Sbjct 1 MNS--KQFETAKMCLKVVTPINISDGIVLGAKDYLYDSRRQKVFFLNLHQWHMFIYKHML 58
Query 56 -KSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMT 114
+ +E+++ N Q+ L EW++ +D + ++ A
Sbjct 59 LEKYESYLANFRDKQS---LLEWLQMQGYDIDDVR---------TVITSEAQATVNLMDN 106
Query 115 RKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPV--------RVPG 166
KK TLN+I+ I+ P G YVPGS++KG+ R+ L SL+ KR V ++
Sbjct 107 EKKKTLNDINRHIQQPEGSLYVPGSSIKGVFRTAILYSLLQKRQDIKVKYWRQIQEKISS 166
Query 167 HQTREHRQYGERFE--RKELRKSGRP---NTRPQDAVNDLFQAIRVTDSPALRTSDLLIC 221
+ + +R + + E + R N R +AV + ++V+D+ A R I
Sbjct 167 NYFKPYRDFNKLISDLENEFLHTLRLVDGNIRSNNAVCSAMRGLQVSDTYASRNMQTAIL 226
Query 222 QKMD--MNVHGK--PDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAET 277
QK+D + GK P LP+FREC+ P + V ++ + + G + L+
Sbjct 227 QKVDGGFDKFGKASPKKLPIFRECMLPKAELFFDVKIEKAVMSTIGINTVDDLLKATHSF 286
Query 278 AASV----NQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFV--------TDQDDMA 325
A+V QA EY+ + GV A G + +LGG G+ SKT + T ++ +
Sbjct 287 FAAVTDLLQQAFEKEYQEAFQGVAA--GNM-FLGGNTGFLSKTLLAMLAPDKDTAKNTIK 343
Query 326 KVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
+LD F K KH+ R+ ++P LK T + MG E+
Sbjct 344 VLLDKSF-KTHKHL--LRDKVIAPRTLKCTNYNGKLMLMGVAEV 384
>gi|121533436|ref|ZP_01665264.1| CRISPR-associated RAMP protein, Csm5 family [Thermosinus carboxydivorans
Nor1]
gi|121307995|gb|EAX48909.1| CRISPR-associated RAMP protein, Csm5 family [Thermosinus carboxydivorans
Nor1]
Length=394
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 101/409 (25%), Positives = 165/409 (41%), Gaps = 59/409 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEY--HVEGDRVYFPDMELLYADIPAHKR--K 56
MN +L+ + L CLGPV +GSG+K T +Y + R YF + E + + + KR
Sbjct 1 MNKHLETVTIKLTCLGPVHVGSGDKLTKLQYIYDTKQRRAYFLN-ETAWIGLLSQKRLLS 59
Query 57 SFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRK 116
SF + A + + L W N + PA+ ++ P + R
Sbjct 60 SFSDRI----AAGSISDLYRWCTDNWIT--PAEIERVASGWATVAPA---------IERD 104
Query 117 KLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGH--------Q 168
LN I ++ GRPY+PGS++KG LR+ L L+ T + + +
Sbjct 105 SRLLNSITPLMRGADGRPYIPGSSIKGALRTAILHHLLTSNTLSAINKHAYWQQLGDLVR 164
Query 169 TREHRQY--------------GERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALR 214
TR + R EL +DAV + + IRV+D+
Sbjct 165 TRHMSDKDKLKKIEKLTAQIESDLLHRLELFDENNKKVPAKDAVTSVMKGIRVSDAFCTA 224
Query 215 TSDLLICQKMDMNVHGKPDG------LPLFRECLAPGTSISHRVVVDTSPT-ARGGWREG 267
C ++ P G + L REC PGT+++ + V+ + T A G
Sbjct 225 PKAPPTCLLRKVDWQDAPGGRDPENYIALVRECFTPGTTLTFTLTVEPALTRAIGIASPA 284
Query 268 ERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQDDM--- 324
+ + A +N + A + + + + + LGGG+G+ KT + +
Sbjct 285 DVLAAARSHAAHMLNIEKAAFGQRLGSLFSRMASANLILGGGSGFLDKTLLYSLATVNEA 344
Query 325 ----AKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
A +LD +F KH R+ R++P LK +N Y MG C+L
Sbjct 345 RALTAALLDLRFA---KHRHVQRDSRLAPRTLKLGIYNNERYLMGVCKL 390
>gi|227890795|ref|ZP_04008600.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
gi|227867204|gb|EEJ74625.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
Length=328
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 93/383 (25%), Positives = 160/383 (42%), Gaps = 82/383 (21%)
Query 8 FELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKS---FEAFVMN 64
+E L L PV+IGSG K TSKE+ E YFP+M+ LY + + +S FE ++++
Sbjct 9 YEFMLHTLAPVYIGSGVKATSKEFIQENGEYYFPEMDKLYLFLEKNYPESLPAFEQYLLD 68
Query 65 TDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIH 124
+ + N K++ G+++K ++ L E+
Sbjct 69 SGNKTNKRKSRLIDFLNDQKIEERDFGGFKIKQNNLVK----------------NLGEVS 112
Query 125 AFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFERKEL 184
FI+D LG Y+PGS++KG +R+I R ++P +G
Sbjct 113 LFIRDGLGNRYIPGSSLKGAIRTILESEYFRGR-----QIP---------WGA------- 151
Query 185 RKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRECLAP 244
K G+ A ND+F IRV+DS ++ S + +K D P L ++RE L P
Sbjct 152 -KKGK-------AFNDIFNNIRVSDSSSIEESLFSVVEKWDYAKGKAPKNLNIYREALLP 203
Query 245 GTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPG--------V 296
VV + S GE+ + L ++ + Y Y+ +
Sbjct 204 ----EQDVVFNISAI-------GEKAI-FLMNNLENIAEKHYLFYKGFFLDNGFDKKYVQ 251
Query 297 NAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTK 356
N I PI YLG G+G +K + + +D + ++ ++ ++ V L T
Sbjct 252 NNINAPI-YLGAGSGIWTKINI-------RQMDKKKIDKIQIKNRMKDKGVMKLTKYPTN 303
Query 357 IDNIC------YEMGQCELSIRR 373
+++ YEMG+C +++
Sbjct 304 VNSKIVKTKDFYEMGKCNFEVKK 326
>gi|334308473|gb|EGL99459.1| CRISPR-associated protein, Csm5 family [Lactobacillus salivarius
NIAS840]
Length=326
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 96/379 (26%), Positives = 159/379 (42%), Gaps = 74/379 (19%)
Query 8 FELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKS---FEAFVMN 64
+E L L PV IGSG K TSKE+ E YFP+M+ LY + + +S FE ++++
Sbjct 7 YEFVLHTLAPVHIGSGVKATSKEFIQENGEYYFPEMDKLYLFLEKNYPESLPTFEQYLLD 66
Query 65 TDGAQATAPLKEWVE-PNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEI 123
+ G++ ++ N ++ G+++K ++ L E+
Sbjct 67 S-GSKTNKRKSRLIDFLNDQRIKKRDFGGFKIKQNNLVK----------------NLGEV 109
Query 124 HAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFERKE 183
FI+D LG Y+PGS++KG +R+I L+S E F K+
Sbjct 110 SLFIRDGLGNRYIPGSSLKGAIRTI-LES------------------------EYFRGKQ 144
Query 184 L---RKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE 240
+ KSGR +D+F IRV+DS ++ + I Q+ + P + ++RE
Sbjct 145 IPWGAKSGR-------QFDDIFNNIRVSDSSSIEEMNFSIVQRWNHAKGKDPKRMNIYRE 197
Query 241 CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAE------YRAMYP 294
L P VV + S E FL E A + Y E + Y
Sbjct 198 ALLP----EQDVVFNISVIG-----EEAIFLMDNLENMAEKHYLFYKEFFLDKGFDKKYI 248
Query 295 GVNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKR 354
N PI YLG G+G +KT + Q + K+ Q +K+ + + ++ +
Sbjct 249 QDNT-EAPI-YLGAGSGIWTKTNIR-QMNKEKIDRIQMKNKMKNQGVMKLTKYPTNIISK 305
Query 355 TKIDNICYEMGQCELSIRR 373
YEMG+C +++
Sbjct 306 IVKTKDFYEMGKCNFEVKK 324
>gi|291460037|ref|ZP_06599427.1| CRISPR-associated RAMP protein, Csm5 family [Oribacterium sp.
oral taxon 078 str. F0262]
gi|291417378|gb|EFE91097.1| CRISPR-associated RAMP protein, Csm5 family [Oribacterium sp.
oral taxon 078 str. F0262]
Length=383
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 77/281 (28%), Positives = 120/281 (43%), Gaps = 39/281 (13%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDR------VYFPD-MELLYADIPAH 53
M YLK + + + L P+++G G+ KEY R V PD ++L
Sbjct 1 MKDYLKYYRIRITALSPIYVGDGKLIGKKEYIRRNRRSRGWGTVEIPDPRKMLTCLRLLS 60
Query 54 KRKSFEAFVMNTDGAQATAPLKEWVEPNAV-KLDPAKHRGYEVKIGS--IEPRRASRGRG 110
+ FE ++++ G L +W++ + + + Y + G I PR R +G
Sbjct 61 CVQDFENYMLDQGGN--VPDLYQWLQAQGISEATISSWIRYSMDAGDVFIGPRNG-RNKG 117
Query 111 GRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL----------QSLVHKRTAQ 160
I +F KD G+PY+PGS++KGMLR+ L S + KR +
Sbjct 118 ------------IESFQKDAYGKPYIPGSSIKGMLRTALLAWELGKQRESNSGIEKRVRR 165
Query 161 PVRVP-GHQTREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLL 219
V G R E E K + R +DAVN + + V DS + LL
Sbjct 166 AVEGGRGKGDAFLRNQAEDLEVKVFHRPERNKENLKDAVNSVMAGLIVGDSDTISEKQLL 225
Query 220 ICQKMDMNVHG---KPDGLPLFRECLAPGTSISHRVVVDTS 257
+CQK+D + G K P+ RE L PGT + + +D +
Sbjct 226 LCQKIDYSCIGDKKKERAFPILREALKPGTEVFFDLSIDET 266
>gi|313894850|ref|ZP_07828410.1| CRISPR-associated RAMP protein, Csm5 family [Selenomonas sp.
oral taxon 137 str. F0430]
gi|312976531|gb|EFR41986.1| CRISPR-associated RAMP protein, Csm5 family [Selenomonas sp.
oral taxon 137 str. F0430]
Length=394
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/400 (24%), Positives = 168/400 (42%), Gaps = 58/400 (14%)
Query 9 ELTLRCLGPVFIGSGEKRTSKEYHVEGDR--VYFPDMELLYADIPAHKRKSFEAFVMNTD 66
++ L C+ PV IGSG K EY + + V+F D E ++++ R + FV D
Sbjct 8 QIELNCISPVHIGSGVKLLPFEYLYDRRKRDVFFVD-EGKFSELLMRHR-LIDNFV--AD 63
Query 67 GAQATAP-LKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIHA 125
Q P L W+ + ++ + +G V+ + R+ R +LN++
Sbjct 64 MRQRRPPYLLNWLTDH--RISEREMQGITVRRAKVHIRQNERS----------SLNDVAC 111
Query 126 FIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVR-------VPGHQTREHRQYGER 178
G PY+PGS++KG +R+ + L+ + + +R R+ + +
Sbjct 112 LETAAGGIPYIPGSSLKGAIRTAVIYHLLRQSAHENLRRKYWGKLQDAMSARDIKAEIGK 171
Query 179 FERK-------ELRKSGRPNTRPQDAVNDLFQAIRVTDS-PALRTSDLLICQKMDMNVHG 230
+K +L+ A+ D+ + +RV D+ P + D +I QK+D + H
Sbjct 172 LAKKLEEELLCQLKYVDEKGKYSDAAIQDVMRGLRVGDAMPTAKRLDTVILQKIDCSTHA 231
Query 231 KPDG-----LPLFRECLAPGTSISHRVVVDTSPTARGGWREGE---RFLETLAETAASVN 282
G + LFREC+ G+ R+ + A+ G R+ + R T + ++
Sbjct 232 NKSGRKEHSISLFRECIPIGSKFRFRITFEKEILAQIGIRDIDALIRMCRTYTASGLAMQ 291
Query 283 QARYA-EYRAMYPGVNAIVGPIVYLGGGAGYRSKTFV--------TDQDDMAKVLDAQFG 333
+ + +YRA + V LGGG G+ SKT + +A +LD F
Sbjct 292 EHAFGRDYRAEFVEAG---DADVMLGGGTGFLSKTIFYALAPGEEIGRKAVAALLDELFF 348
Query 334 ----KVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
+ +H + ++ +SP LK T D MG C L
Sbjct 349 DRRRRQPQHFHRQKDTVLSPRTLKLTWTDTDSSIMGLCAL 388
>gi|312899098|ref|ZP_07758476.1| CRISPR-associated RAMP protein, Csm5 family [Megasphaera micronuciformis
F0359]
gi|310619765|gb|EFQ03347.1| CRISPR-associated RAMP protein, Csm5 family [Megasphaera micronuciformis
F0359]
Length=408
Score = 79.3 bits (194), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 101/409 (25%), Positives = 169/409 (42%), Gaps = 67/409 (16%)
Query 12 LRCLGPVFIGSGEKRTSKEYHVE--GDRVYFPD----MELLY-----ADIPAHKRKSFEA 60
+ CL PV IGSG+K T+ EY + +VYF D ++ LY + H R++ E
Sbjct 13 IECLSPVHIGSGDKLTAVEYIFDEKARQVYFLDQARWLQFLYRKRLTDEFLRHIRRTAEQ 72
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
+ + L +W+ ++ P + R IG + R ++
Sbjct 73 --LKSKDPFCGQLLWDWLTQKGIR--PDEIRNLAGTIGHVHTNNPLIDRR--------SV 120
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
N+I + D G Y+PGS++KG LR+ L S++ K + + E + R
Sbjct 121 NDIARNVTDAFGSVYIPGSSIKGALRTGLLSSIILKNKEK--YTTSWKEIESTIFNAR-G 177
Query 181 RKELRKSGRPNTR----------PQD--------AVNDLFQAIRVTDSPALR-TSDLLIC 221
R +L+ G+ ++ QD +VND+ + + V+D+ + + +I
Sbjct 178 RSDLKCLGKVQSKLEGLVFQRLGLQDEHGRACGGSVNDVLRGLIVSDAACVEPVCNTVIV 237
Query 222 QKMDMNVHGKPDG-----LPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAE 276
QK+D ++ K +G LPLFREC+ GT + V D G + E
Sbjct 238 QKLDGSL-AKTEGMNPCRLPLFRECIPAGTRLRFSVTADLEMLKVIGIGSIDDIFSVTRE 296
Query 277 TAA-SVNQARYAEYRAM------YPGVNAIVGPIVYLGGGAGYRSKTFVTD--------Q 321
++ +A RA N ++LGGG G++ KT + D +
Sbjct 297 YVMRNLKFQEHAFTRAFGRQFFAAQAFNEAKQADLFLGGGTGFQYKTVIYDLAPDEEIGR 356
Query 322 DDMAKVLDAQF-GKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
+AK LD F K K K+++ +SP +K T MG C +
Sbjct 357 AAVAKYLDLVFTNKDSKPQHKSKDKDISPRTVKLTDQGREYQLMGLCRV 405
>gi|341822665|emb|CCC73589.1| CRISPR-associated RAMP protein [Megasphaera elsdenii DSM 20460]
Length=393
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/401 (25%), Positives = 162/401 (41%), Gaps = 68/401 (16%)
Query 6 KPFELTLRCLGPVFIGSGEKRTSKEYHVEGDR----VYFPD----MELLYADIPAHKRKS 57
K +ELT C+ P+ +G+GE EY DR VYF D M L I H
Sbjct 10 KTYELT--CISPIHVGNGEVLKQYEYIFTKDRNQQRVYFLDKAKWMNFL---IRHHLIDD 64
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKK 117
+ + V + L+ W++ + R + + R + R
Sbjct 65 YASQVFS-----GKMNLRGWLQAQRLGSLSTIIREICISSADVYLVRDVKQR-------- 111
Query 118 LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHK------------RTAQPVRVP 165
LN+IH +K P G PY+PGST+KG +RS L + + + A R
Sbjct 112 --LNDIHRQVKTPDGTPYIPGSTLKGAIRSAILFHDIRQHPDDYRLFWSRIKAAMKARER 169
Query 166 GHQTREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPAL-RTSDLLICQKM 224
++ + ERK + + +P DA+ + + + V+D+ + D +I QK
Sbjct 170 DRYDKQMGHLVQAIERKAFARLKQYKNQPDDALQSVMKGLSVSDAMLVGHERDTVILQKY 229
Query 225 DMNVHGKP--DG--LPLFRECLAPGTSISHRVVVDTSPTARGG-------WREGERFLET 273
D++ + DG L LFREC+ G + +D R G W+ +L
Sbjct 230 DVSAVCREGLDGHSLALFRECIPAGRKFRFSMTLDRDIAKRIGITTLDDIWQWVRDYLAF 289
Query 274 -LAETAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTF---VTDQDDMAKVLD 329
LA+ A EY+ + + LGGG G+ +KT + +++ VL
Sbjct 290 GLAQEKAVFGH----EYKGKFEESKL---ADIRLGGGTGFLTKTVYYALAPKEEGRTVLA 342
Query 330 AQFGKVV-----KHVDKTRELRVSPLVLKRTKIDNICYEMG 365
F KV+ H T++ +++P LK + + C +G
Sbjct 343 EFFDKVLFTRRSCHHHMTKDDKLTPRTLKLAWVHDDCQILG 383
>gi|296133517|ref|YP_003640764.1| CRISPR-associated RAMP protein, Csm5 family [Thermincola sp.
JR]
gi|296032095|gb|ADG82863.1| CRISPR-associated RAMP protein, Csm5 family [Thermincola potens
JR]
Length=429
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 119/445 (27%), Positives = 179/445 (41%), Gaps = 105/445 (23%)
Query 5 LKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEAFVMN 64
+K + + L+ L P+FIG GE L YA +P K+ V
Sbjct 6 MKTYRVKLKVLTPLFIGGGESTVISR--------------LDYAYVPNEKK------VYV 45
Query 65 TDGAQATAPLKE---------WVEPNAVKLDPAKHRGYE--------------------V 95
DG Q L E ++ A + P K G E +
Sbjct 46 LDGRQWIGWLAEKGLLDLYQQYIRQQAEQSSPHKKAGREKGKKENGVNNFAWLQEKEHLL 105
Query 96 KIGSIEP-RRASRGRGGRMTRKK----LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL 150
K + E R+ SR + +K N+IH FI++ G PY+PGS++KG LR+ L
Sbjct 106 KFRAAEVFRQVSRAAYSTVDAEKNGQRFNTNDIHGFIRNAEGLPYIPGSSIKGALRTAVL 165
Query 151 QSLVH---KRTAQPVRVPG------------HQTREHRQYGERFERKE----LRKSGRPN 191
+L+ T Q R G +RE++Q + + E L +
Sbjct 166 AALLQGDAAGTGQYCRKLGEILQSRNKDRYNQGSRENKQKDAKHKVNELYSILERDYLDY 225
Query 192 TRPQDAVNDLFQ---AIRVTDSPALRTSDLLICQKMDMN-VHGK----PDGLPLFRECLA 243
TR + LF+ I V+DS +L++ +K D + V GK + LPL+REC
Sbjct 226 TRQINGETHLFRGMAGISVSDSTPFPPENLMLVRKCDFSLVDGKLKKSAEKLPLYRECAR 285
Query 244 PGTSISHRVVVDTSPTARG-GWREGERFLETLAETAASVNQAR-----YAEYRAMYP-GV 296
PGT + + +D G R +E L + +V + A+ + P G
Sbjct 286 PGTEVEFTLTIDEFKIKNAYGIRSFADIVEVLQKQYDAVFGEKGVIGVEAQSKKYLPAGA 345
Query 297 NAIVGPIVYLGGGAGYRSKTFVTD-------QDDMAK-VLDAQFGKVVKHVDKTRELRVS 348
I+ LGGG GY SKT V+ +D+A+ +L ++ K KH +K R L S
Sbjct 346 LQDSRGIMLLGGGVGYHSKTVVSSLADSPRQANDLAREILKFRYSK-HKH-EKDRPL--S 401
Query 349 PLVLKRT---KIDNICYEMGQCELS 370
P LK + D + MG C LS
Sbjct 402 PRALKLAVAGRGDKVF--MGLCRLS 424
>gi|303231949|ref|ZP_07318657.1| CRISPR-associated RAMP protein, Csm5 family [Veillonella atypica
ACS-049-V-Sch6]
gi|302513378|gb|EFL55412.1| CRISPR-associated RAMP protein, Csm5 family [Veillonella atypica
ACS-049-V-Sch6]
Length=391
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/417 (24%), Positives = 169/417 (41%), Gaps = 76/417 (18%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGD--RVYFPDMELLYADIPAH-KRKS 57
M+ + +L+L + P IG E T+K+Y D VY + + + H K
Sbjct 1 MSNRIDHVQLSLTIVSPTNIGGSETLTTKDYMYNYDAGEVYLLNNYEWFRFLARHNKLAE 60
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAVKL-DPAKHRGYEVKIGSIEPRRASRGRG-GRMTR 115
FE ++ N E V PN + D AK+ IGS + + G G + +
Sbjct 61 FEIYMQN-----------EMVRPNGRTMYDWAKN-----TIGSSQLTKDVLGPAIGSIIK 104
Query 116 -------KKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQ 168
+K +LN+I I+ G Y+PGS++KG++ S + ++ A V
Sbjct 105 SSIYNEGRKNSLNDITPQIRGANGEVYIPGSSIKGVIDSAIISHMLRNNKAFRSNVQ--- 161
Query 169 TREHRQYGERFERKE-----------------------LRKSGRPNTRPQDAVNDLFQAI 205
RE R+ + ++RK G+P + + F+ I
Sbjct 162 -RELRKVLDVYKRKNAGSLFKDIFKMVNQAIIKHIHVLTNNDGKP---LKGILASAFRGI 217
Query 206 RVTDSPALRTSDLLICQKMDMNVHGKPDG---LPLFRECLAPGTSISHRVVVDTSPTARG 262
V+D+ + + +K D V DG + + REC+ P + +DT+ T
Sbjct 218 SVSDAMPMSAIQTEVLKKEDSCV--DEDGTHEISVHRECILPNQKFFFTLTLDTAITKEI 275
Query 263 GWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPI-VYLGGGAGYRSKTFV--- 318
G ++ LE L E + ++ ++++ + P + + P Y+G G+ KT +
Sbjct 276 GITSVDQVLEILQEDFDATHELLSSKFKKVSPAIFKALEPANAYIGSNTGFVQKTIIMAA 335
Query 319 ------TDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
T D + +LD +F K KH +K + +P +K K + YEMG +
Sbjct 336 FTDNEETGIDIIRAILDVKFHK-AKHANKDHFM--APRAIKLVKWNGHYYEMGGIHI 389
>gi|339893267|emb|CCB52454.1| CRISPR associated RAMP family protein [Staphylococcus lugdunensis
N920143]
Length=336
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/405 (23%), Positives = 171/405 (43%), Gaps = 105/405 (25%)
Query 5 LKPFELTLRCLGPVFIGSGE--KRTSKEYHVEGDRVYFPDMELLYADIPAHKRKS----F 58
+K F+ ++ +GP+ IGSG+ K+ Y +V+ + L + KRK+ +
Sbjct 3 IKTFDAIIQTIGPIHIGSGQVLKKQDYIYDFHKSKVHMINGNQL---VKVLKRKNLLNMY 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVK-------LDPAKHRGYEVKIGSIEPRRASRGRGG 111
+ F+ LK ++E + + + ++ K G+I+P+
Sbjct 60 QEFLRYPPKNPRENGLKNFLEAHKITQSEWKEFISYSESVNQGKKYGNIKPK-------- 111
Query 112 RMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTRE 171
LN++H I+D + Y+PGS++KG +++ +LV K
Sbjct 112 -------PLNDLHLMIRDGQNKVYIPGSSIKGAIKT----ALVSKY-------------- 146
Query 172 HRQYGERFERKELRKSGRPNTRPQDAVND--LFQAIRVTDSPALRTSDLLICQKMDMNVH 229
D ND +F I+++DS + S+L I QK+D+N
Sbjct 147 ------------------------DNENDKSVFSRIKISDSEPVDESNLAIYQKIDINKD 182
Query 230 GKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEY 289
KP +PL+REC+ T I ++ ++ + + + E ++ + + +R+
Sbjct 183 EKP--MPLYRECIDVNTQIKFKITIEDNQYS---IEDIENCIQDFYKNYYNQWLSRFKNT 237
Query 290 RA-----MYPGVNAIVGP-IVYLGGGAGYRSKT--FVTDQDDMAK-----VLDAQF---- 332
R + G+ + G I+YLGGG G+ SKT + T + AK +L +F
Sbjct 238 RGGQKFILEGGMPEVKGQNILYLGGGVGFSSKTTHYQTKSHEQAKHDTFEILRKRFRGTY 297
Query 333 GKVVKHVDKTRELRVSPLVLKRT--KIDNICYEMGQCELSIRRAE 375
GK+ R + P+ LK T N Y+ G C+++ ++ +
Sbjct 298 GKM------KRIPQNVPVALKGTLNYSKNQSYQQGMCQITFKKND 336
>gi|333976281|gb|EGL77150.1| CRISPR-associated RAMP protein, Csm5 family [Veillonella parvula
ACS-068-V-Sch12]
Length=391
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/416 (23%), Positives = 170/416 (41%), Gaps = 74/416 (17%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGD--RVYF-PDMELLYADIPAHKRKS 57
M+ + +L+L + P IG E T+K+Y D VY + E +K
Sbjct 1 MSNRIDHAQLSLTIVSPTNIGGPETLTTKDYMYNYDAGEVYLLNNYEWFRFLAQLNKLAE 60
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRG-GRMTR- 115
FE ++ N E V PN + + + IG+ + +A GR G + +
Sbjct 61 FEEYMQN-----------EMVRPNGRTM----YGWAKNTIGTSQLTKAKLGRAIGSIMKS 105
Query 116 ------KKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQT 169
+K +LN+I I+ G Y+PGS++KG++ S + ++ A V
Sbjct 106 SIYNKGRKNSLNDITPQIRGANGDVYIPGSSIKGVIDSAIISHMLRNNKAFRSNVQ---- 161
Query 170 REHRQYGERFERKELR-----------------------KSGRPNTRPQDAVNDLFQAIR 206
RE R+ + ++RK R G+P + + F+ I
Sbjct 162 RELRKVLDVYKRKNARSLFKDIFKMVNLAILKHIHVLTNNEGKP---FKGILASAFRGIS 218
Query 207 VTDSPALRTSDLLICQKMDMNVHGKPDG---LPLFRECLAPGTSISHRVVVDTSPTARGG 263
V+D+ + + +K D V + DG + + REC+ P S + +DT+ T G
Sbjct 219 VSDAMPMSVIQTEVLKKEDSCV--EEDGTHDISVHRECILPNQQFSFTLTLDTAMTKEIG 276
Query 264 WREGERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPI-VYLGGGAGYRSKTFV---- 318
++ L+ L E + ++ ++++ + P + + P Y+G G+ KT +
Sbjct 277 ITSIDQVLDILQEDFDATHKLLASKFKKVSPSIFKALEPANAYIGSNTGFIQKTIIMAAF 336
Query 319 -----TDQDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
T D + +LD F K KH K + + +P +K K + YEMG +
Sbjct 337 TDDEKTGIDIIRAILDVNFQK-AKHDSKDKFM--APRAIKLVKWNGNYYEMGGIHI 389
>gi|341656686|gb|EGS80395.1| CRISPR-associated RAMP protein, Csm5 family [Staphylococcus epidermidis
VCU037]
Length=340
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/384 (23%), Positives = 164/384 (43%), Gaps = 69/384 (17%)
Query 5 LKPFELTLRCLGPVFIGSGE--KRTSKEYHVEGDRVYFPDMELLYADIPAHKRK----SF 58
+K +E+ ++ LGPV IGSG+ K+ Y +VY + L + KRK ++
Sbjct 3 IKNYEVVVKTLGPVHIGSGQVMKKQDYIYDFYNSKVYMINGNKL---VKFLKRKNLLHTY 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
+ F+ LK++++ VK +E + E + ++G+ R K
Sbjct 60 QNFLRYPPKNPRENGLKDYLDAQNVK-----QSEWEAFVSYSE--KVNQGKKYGNVRPK- 111
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
LN++H ++D + Y+PGS++KG +++ +LV K + +
Sbjct 112 PLNDLHLMVRDGQNKVYLPGSSIKGAIKT----TLVSKYNNEKNK--------------- 152
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLF 238
D++ I+V+DS + S+L I QK+D+N KP +PL+
Sbjct 153 ---------------------DIYSKIKVSDSKPIDESNLAIYQKIDINKSEKP--MPLY 189
Query 239 RECLAPGTSISHRVVVDTSPTARGGWREGER--FLETLAETAASVNQARYAEYRAMYPGV 296
REC+ T I ++ ++ + + R + + + + A+ G+
Sbjct 190 RECVDVNTEIKFKLTIEDEIYSINEIEQSIRDFYKNYYDKWLVGFKETKGGRRFALEGGI 249
Query 297 NAIVGP-IVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHV----DKTRELRVS-PL 350
++ I++LG G G+ SKT + + F + K K +E+ + P+
Sbjct 250 PDVLNQNILFLGAGTGFVSKTTHYQLKNRKQAKQDSFEILTKKFRGTYGKMKEIPSNVPV 309
Query 351 VLKRT--KIDNICYEMGQCELSIR 372
LK T + + Y+ G C++S +
Sbjct 310 ALKGTTNQSRHTSYQQGMCKVSFQ 333
>gi|269798857|ref|YP_003312757.1| CRISPR-associated RAMP protein, Csm5 family [Veillonella parvula
DSM 2008]
gi|269095486|gb|ACZ25477.1| CRISPR-associated RAMP protein, Csm5 family [Veillonella parvula
DSM 2008]
Length=391
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 93/408 (23%), Positives = 169/408 (42%), Gaps = 58/408 (14%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGD--RVYFPDMELLYADIPAH-KRKS 57
M+ + +L+L + P IG E T+K+Y D VY + + + H K +
Sbjct 1 MSNRIDHAQLSLTIVSPTNIGGPENLTTKDYMYNYDAGEVYLLNNYEWFRFLAHHNKLEE 60
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKK 117
FE ++ + + +W + NA+ IGSI ++S GR K
Sbjct 61 FELYMQDEMIRPNGRTMYDWAK-NAIGASQLTKDTLRSAIGSI--MKSSIYNKGR----K 113
Query 118 LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGE 177
+LN+I I+ G Y+PGS++KG++ S + ++ A V RE R+ +
Sbjct 114 NSLNDITPQIRGANGDVYIPGSSIKGVIDSAIISHMLRNNKAFRSNVQ----RELRKVLD 169
Query 178 RFERKELR-----------------------KSGRPNTRPQDAVNDLFQAIRVTDSPALR 214
++RK R G+P + + F+ I ++D+ +
Sbjct 170 VYKRKNARSLFKDIFKMVNLAILKHIHVLTNNEGKP---FKGILASAFRGISISDAMPMG 226
Query 215 TSDLLICQKMDMNVHGKPDG---LPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFL 271
+ +K D V + DG + + REC+ P S + +DT+ T G ++ L
Sbjct 227 VIKTEVLKKEDSCV--EEDGTHDISVHRECILPNQQFSFTLTLDTAMTKEIGITSIDQVL 284
Query 272 ETLAETAASVNQARYAEYRAMYPGV-NAIVGPIVYLGGGAGYRSKTFV---------TDQ 321
+ L E + ++ ++++ + P V A+ Y+G G+ KT + T
Sbjct 285 DILQEDFDATHKLLASKFKKVSPSVFKALDSANAYIGSNTGFIQKTIIMAAFTDDEKTGI 344
Query 322 DDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
D + +LD F K KH K + + +P +K K + YE+G +
Sbjct 345 DIIRAILDVNFQK-AKHDSKDKFM--APRAIKLVKWNGNYYEVGGIHI 389
>gi|342213932|ref|ZP_08706645.1| CRISPR type III-A/MTUBE-associated RAMP protein Csm5 [Veillonella
sp. oral taxon 780 str. F0422]
gi|341596430|gb|EGS39032.1| CRISPR type III-A/MTUBE-associated RAMP protein Csm5 [Veillonella
sp. oral taxon 780 str. F0422]
Length=389
Score = 73.2 bits (178), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 92/396 (24%), Positives = 157/396 (40%), Gaps = 55/396 (13%)
Query 7 PFELTLRCLGPVFIGSGEKRTSKEYHVE--GDRVYFPD----MELLYADIPAHKRKSFEA 60
PF TL + PV IGSG+ +Y ++ VY + + LY+ +K +E
Sbjct 11 PF--TLEVITPVSIGSGQGLKVLDYILDTANHDVYILNQKKWFQYLYS---INKLSEYEL 65
Query 61 FVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTL 120
F+ + EW+E N LD E + SI R R + K TL
Sbjct 66 FIKKYATGNTKDTIFEWMERNIGILD-------ESILKSISTRHV---RCVKSAISKRTL 115
Query 121 NEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFE 180
N+I + G PY+PGS++KG++ + + ++ ++ Q R + H R
Sbjct 116 NDIKLCMSLSDGSPYIPGSSLKGVIIASVIAYIIEQK--QSFRNEWSRRFLHTMNDTREL 173
Query 181 RKELRKSGRP-------------NTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMN 227
+K +R G T +D+ LF I V+D + + I + D +
Sbjct 174 QKCIRDYGNALDKLISSYIADNTGTIEKDSTKKLFHGISVSDVMPVSKLNTFILPRYD-S 232
Query 228 VHGKPD--GLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQAR 285
V GK + LPL+REC+ P T + + D + G + ++ + Q
Sbjct 233 VVGKYERKSLPLYRECIVPNTKLKGTLSADIRELQKVGVQSMSELIQIIERHT----QRI 288
Query 286 YAEYRAMYPG------VNAIVGPIVYLGGGAGYRSKTFVT----DQDDMAKVLDA--QFG 333
+ ++ ++ G + + LG G+ KT + DQ D V+ +
Sbjct 289 VSRWKQVFTGDVERTCLADLENTTCLLGSSIGFLHKTLLLPLFDDQRDEVDVIKSVLNLQ 348
Query 334 KVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
+ K + ++ +SP LK TK Y G +L
Sbjct 349 RAFKKHNHWKDRSISPRTLKLTKYRGKDYIFGGVKL 384
>gi|57865880|ref|YP_190000.1| CRISPR-associated Csm5 family protein [Staphylococcus epidermidis
RP62A]
gi|57636538|gb|AAW53326.1| CRISPR-associated protein, TM1807 family [Staphylococcus epidermidis
RP62A]
Length=340
Score = 73.2 bits (178), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 86/387 (23%), Positives = 167/387 (44%), Gaps = 75/387 (19%)
Query 5 LKPFELTLRCLGPVFIGSGE--KRTSKEYHVEGDRVYFPDMELLYADIPAHKRK----SF 58
+K +E+ ++ LGP+ IGSG+ K+ Y +VY + L + KRK ++
Sbjct 3 IKNYEVVIKTLGPIHIGSGQVMKKQDYIYDFYNSKVYMINGNKL---VKFLKRKNLLYTY 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKL 118
+ F+ LK++++ VK +E + E + ++G+ TR K
Sbjct 60 QNFLRYPPKNPRENGLKDYLDAQNVK-----QSEWEAFVSYSE--KVNQGKKYGNTRPK- 111
Query 119 TLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGER 178
LN++H ++D + Y+PGS++KG +++ +LV K + +
Sbjct 112 PLNDLHLMVRDGQNKVYLPGSSIKGAIKT----TLVSKYNNEKNK--------------- 152
Query 179 FERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLF 238
D++ I+V+DS + S+L I QK+D+N K +PL+
Sbjct 153 ---------------------DIYSKIKVSDSKPIDESNLAIYQKIDINKSEK--SMPLY 189
Query 239 RECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYR-----AMY 293
REC+ T I ++ ++ + E E+ ++ + + E + A+
Sbjct 190 RECIDVNTEIKFKLTIEDEIYS---INEIEQSIQDFYKNYYDKWLVGFKETKGGRRFALE 246
Query 294 PGVNAIVGP-IVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHV----DKTRELRVS 348
G+ ++ I++LG G G+ SKT + + F + K K +E+ +
Sbjct 247 GGIPDVLNQNILFLGAGTGFVSKTTHYQLKNRKQAKQDSFEILTKKFRGTYGKMKEIPSN 306
Query 349 -PLVLKRT--KIDNICYEMGQCELSIR 372
P+ LK T + + Y+ G C++S +
Sbjct 307 VPVALKGTTNQSRHTSYQQGMCKVSFQ 333
>gi|289549403|ref|YP_003470307.1| CRISPR-associated protein, Csm5 family [Staphylococcus lugdunensis
HKU09-01]
gi|289178935|gb|ADC86180.1| CRISPR-associated protein, Csm5 family [Staphylococcus lugdunensis
HKU09-01]
Length=336
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/399 (22%), Positives = 165/399 (42%), Gaps = 93/399 (23%)
Query 5 LKPFELTLRCLGPVFIGSGE--KRTSKEYHVEGDRVYFPDMELLYADIPAHKRKS----F 58
+K F+ ++ +GP+ IGSG+ K+ Y +V+ + L + KRK+ +
Sbjct 3 IKTFDAIIQTIGPIHIGSGQVLKKQDYIYDFHKSKVHMINGNQL---VKVLKRKNLLNMY 59
Query 59 EAFVMNTDGAQATAPLKEWVEPNAVK-------LDPAKHRGYEVKIGSIEPRRASRGRGG 111
+ F+ LK ++E + + + ++ K G+I+P+
Sbjct 60 QEFLRYPPKNPRENGLKNFLEAHKITQSEWKEFISYSESVNQGKKYGNIKPK-------- 111
Query 112 RMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTRE 171
LN++H I+D + Y+PGS++KG +++ +LV K
Sbjct 112 -------PLNDLHLMIRDGQNKVYIPGSSIKGAIKT----ALVSKY-------------- 146
Query 172 HRQYGERFERKELRKSGRPNTRPQDAVND--LFQAIRVTDSPALRTSDLLICQKMDMNVH 229
D ND +F I+++DS + S+L I QK+D+N
Sbjct 147 ------------------------DNENDKSVFSRIKISDSEPVDESNLAIYQKIDINKD 182
Query 230 GKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEY 289
KP +PL+REC+ T I ++ ++ + + + E ++ + + +R+
Sbjct 183 EKP--MPLYRECIDVNTQIKFKITIEDNQYS---IEDIENCIQDFYKNYYNQWLSRFKNT 237
Query 290 RA-----MYPGVNAIVGP-IVYLGGGAGYRSKT--FVTDQDDMAK-----VLDAQFGKVV 336
R + G+ + G I+YLGGG G+ SKT + T + AK +L +F
Sbjct 238 RGGQKFILEGGMPEVKGQNILYLGGGVGFSSKTTHYQTKSHEQAKHDTFEILRKRFRGTY 297
Query 337 KHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIRRAE 375
+ + + L N Y+ G C+++ ++ +
Sbjct 298 GKMKRIPQNVSVALKGTLNYSKNQSYQQGMCQITFKKND 336
>gi|301299525|ref|ZP_07205794.1| conserved domain protein [Lactobacillus salivarius ACS-116-V-Col5a]
gi|300852872|gb|EFK80487.1| conserved domain protein [Lactobacillus salivarius ACS-116-V-Col5a]
Length=179
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/220 (28%), Positives = 99/220 (45%), Gaps = 50/220 (22%)
Query 6 KPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKS---FEAFV 62
+ +E L L PV IGSG K TSKE E YFP+M+ LY + + +S FE ++
Sbjct 6 QDYEFVLYTLAPVHIGSGVKVTSKESIQENGEYYFPEMDKLYLFLEKNHPESLPAFEQYL 65
Query 63 MNTDGAQATAPLKEWVE-PNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLN 121
+++ G++ ++ N K+ G+++K ++ R LN
Sbjct 66 LDS-GSKTNKSKSRLIDFLNDQKIKERDFGGFKIKQNNLVER----------------LN 108
Query 122 EIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFER 181
E+ F +D LGR Y+PGS++KG +R+I L+S + G Q + G++F+
Sbjct 109 EVSLFARDGLGRRYIPGSSLKGAIRTI-LESEYFR---------GKQISWGAKSGQQFD- 157
Query 182 KELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLIC 221
D+F IRV DS + S+ I
Sbjct 158 ------------------DIFNNIRVGDSNTIGESNFSIV 179
>gi|292669138|ref|ZP_06602564.1| Csm5 family CRISPR-associated RAMP protein [Selenomonas noxia
ATCC 43541]
gi|292649190|gb|EFF67162.1| Csm5 family CRISPR-associated RAMP protein [Selenomonas noxia
ATCC 43541]
Length=410
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/415 (24%), Positives = 165/415 (40%), Gaps = 80/415 (19%)
Query 12 LRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYAD---IPAHKRKSFEAFV--MNTD 66
++C+ PV IGSGE+ + EY + D++ +M LL+ R +AF+ + +
Sbjct 12 IKCIAPVHIGSGEELRTFEYLYDRDKL---EMSLLHESKWLAFLDARGLTDAFIKYIEIE 68
Query 67 G-AQATAPLKEWVEPNAVKLDPAKHRGY---EVKIGSIEPRRASRGRGGRMTRKKLTLNE 122
G + L EW+ N V + G V + + ++ +K LN+
Sbjct 69 GQGNRSRNLLEWLTANRVTEADLRKAGVIRRRVPVAMLSEKK--------YRNRKPNLNK 120
Query 123 IHA-FIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTA-------QPVRVPGHQTREHRQ 174
+ ++ PY+PGST+KG LR+ L L+ K A + V G + +
Sbjct 121 VVCHLVRADNAHPYIPGSTIKGALRTGILYHLIRKDPARFRAYWQEISSVKGSLKEKEHK 180
Query 175 YGE---RFERKELRKSGRPNTRPQDAVNDLFQAIRVTDS---PALRTSDLLICQKMDMNV 228
+ E R E++ L +T DA + + V+D+ + + +I QK+D
Sbjct 181 WNEIILRLEQELLHTLTYEDTERGDAAASALRGLSVSDAMLVGCVAKAPTVIVQKIDATT 240
Query 229 HGKP------DGLPLFRECLAPGTS-----------ISHRVVVDTSPTARG--------G 263
KP + LFREC+ P S + H +D+ P+ G
Sbjct 241 LIKPGEKRGESPIVLFRECI-PADSRLRFTITANLPMLHAAGIDSLPSVLNMLRAYTLDG 299
Query 264 WREGERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQDD 323
+R E + A +Y+A NA+ LGGG G+ SKT D
Sbjct 300 LTRQQRVFEAIDAKYYGDLFADIGKYKA-----NAL------LGGGTGFLSKTLTYALAD 348
Query 324 --------MAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELS 370
A D QF H + + +++P LKR + D + MG C ++
Sbjct 349 KETDARRFAAAYFDEQFTN-PSHKHRETDTQLTPRTLKRAQTDGADWLMGLCSIT 402
>gi|258645683|ref|ZP_05733152.1| CRISPR-associated RAMP protein, Csm5 family [Dialister invisus
DSM 15470]
gi|260403051|gb|EEW96598.1| CRISPR-associated RAMP protein, Csm5 family [Dialister invisus
DSM 15470]
Length=388
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 69/288 (24%), Positives = 122/288 (43%), Gaps = 60/288 (20%)
Query 5 LKPFELTLRCLGPVFIGSGEKRTSKEYHVEGD----RVYFPDMELLYADIPAHKRKSFEA 60
+K +++ L C PV IGSG+ +Y E + +YF + E +A+ K K ++
Sbjct 2 MKYWKMKLTCQSPVHIGSGDIYQKNQYVYEDNGKKAHIYFLN-ESKWAEF-LEKEKLLDS 59
Query 61 FV---------------MNTDGAQATAP------LKEWVEPNAVKLDPAKHRGYEVKIGS 99
FV +NT P +++ V+ + Y S
Sbjct 60 FVSEIHRKFKHFSIYDFLNTCKRNDRQPESLKRLIRDLVDSGVLSKPETADVPY-----S 114
Query 100 IEPRRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLV----- 154
PR A LN++H FIKD GR Y+PGS++KG R+ + +++
Sbjct 115 KNPRNA--------------LNDVHTFIKDSKGRMYIPGSSLKGAFRTAIIAAMIRKDRE 160
Query 155 --HKRTAQPVRVPGHQTREHRQYG---ERFERKELRKSGRPNTRPQDAVNDLFQAIRVTD 209
K + + +R G ++ E++ G R + VN F+A+ V D
Sbjct 161 RYEKYWNEIFNIAKRANYLNRNIGNVLDKLEKEIFIPIGTDGKR--NMVNSCFRALTVGD 218
Query 210 SPALRTSDLLICQKMDM-NVHGKPDGLPLFRECLAPGTSISHRVVVDT 256
S + +++ QK D P + L+REC+APG +++ ++ +D+
Sbjct 219 SSTASKAGIIV-QKADFGEKEDNPHTISLWRECMAPGDTVNFKLGIDS 265
>gi|238018270|ref|ZP_04598696.1| hypothetical protein VEIDISOL_00094 [Veillonella dispar ATCC
17748]
gi|237864741|gb|EEP66031.1| hypothetical protein VEIDISOL_00094 [Veillonella dispar ATCC
17748]
Length=391
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 94/403 (24%), Positives = 166/403 (42%), Gaps = 48/403 (11%)
Query 1 MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGD--RVYFPDMELLYADIPAH-KRKS 57
M+ + +L L + P IG EK T+K+Y D VY + + + H K
Sbjct 1 MSNRIDHAQLLLTVVSPTNIGGPEKLTTKDYMYNYDAGEVYLLNNYEWFRFLARHNKLAE 60
Query 58 FEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKK 117
FE ++ + + +W + N V IGSI ++S GR K
Sbjct 61 FELYMQDEMVRPNGRTMYDWAK-NTVGAAQLTKDALGPVIGSI--MKSSIYNKGR----K 113
Query 118 LTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGE 177
+LN+I I+ G Y+PGS++KG++ S + ++ A V RE ++ +
Sbjct 114 NSLNDITPQIRGANGDVYIPGSSIKGVIDSAIISHMLRNNKAFRSTVQ----RELKKVLD 169
Query 178 RFERKELRKSGRPNTRPQDAV---------NDL---FQAIRVTDSPALRTSDLLICQKMD 225
++RK R + + + V N+ F+AI + L SD + +
Sbjct 170 VYKRKNARNLFKDIFKMVNLVILKHIHVLTNNEGKPFKAILASAFRGLSVSDAMPMGAIQ 229
Query 226 MNVHGKPD---------GLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAE 276
V K D + + REC+ P S V +DT+ T G + L+ L E
Sbjct 230 TEVLKKEDSCIDEDGTHAISVHRECILPNQKFSFTVTLDTAMTKEIGITSINQVLDILQE 289
Query 277 TAASVNQARYAEYRAMYPGV-NAIVGPIVYLGGGAGYRSKT-----FVTDQ----DDMAK 326
+ ++ ++++ + P + A+ Y+G G+ KT F+ D+ D +
Sbjct 290 DFDATHKLLASKFKKVSPSIFKALELANAYIGSNTGFVQKTIIMAAFIDDEKTGIDIIKA 349
Query 327 VLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCEL 369
+LD F K +H ++ ++P +K K + YEMG +
Sbjct 350 ILDVNFQK-AEH--DRKDTIMAPRAIKLVKWNGNYYEMGGIHI 389
>gi|315641549|ref|ZP_07896618.1| csm5 family CRISPR-associated ramp protein [Enterococcus italicus
DSM 15952]
gi|315482686|gb|EFU73213.1| csm5 family CRISPR-associated ramp protein [Enterococcus italicus
DSM 15952]
Length=349
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 95/409 (24%), Positives = 163/409 (40%), Gaps = 110/409 (26%)
Query 6 KPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLY-ADIP-----AHKRKSFE 59
K +++ L+ GPV IGSG+ +EY +Y L + D P +K+ F
Sbjct 4 KVYQVKLKVYGPVHIGSGKIIRKQEY------IYDRRKSLAHIVDGPNLVKFLNKKGKFT 57
Query 60 AFVMNTDGAQATAPLKEWVEPNAVKLDPAK------HRGYEVKIGSIEPRRASRGRGGRM 113
A++ + + A L ++ + + K R + KI + SR R
Sbjct 58 AYLQYLNTTKERADLYTFLRQEQIDTNDWKTFVLYTERVNQGKIDMKDHNPYSRTSTNRR 117
Query 114 TRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHR 173
K +N++H F++D G Y+PGS++KG LR++ L+ +Q+ E
Sbjct 118 QVDK-GMNDLHLFVRDGRGDLYIPGSSLKGALRTV-LEG-------------ANQSAE-- 160
Query 174 QYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPD 233
F ++ ++DS + +L I QK+D+N KP
Sbjct 161 ---------------------------AFHSLSISDSLPIDPKNLAIYQKIDINKELKP- 192
Query 234 GLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMY 293
+PL+REC+ GT++ + +++ W T+ + + QA Y +Y +
Sbjct 193 -MPLYRECVNVGTTVEFTMKINSD-----DW--------TIEKIEKQIQQA-YLQYWNKW 237
Query 294 -------PGVNAIVG--------------PIVYLGGGAGYRSKTF-------VTDQDDMA 325
PG A + +++LGGG G+ SKT Q D+
Sbjct 238 FVGMVTTPGGKAFIKGGGLPSVLHAKHRPTVLFLGGGTGFPSKTTHYLQKPKEQAQKDIF 297
Query 326 KVLDAQFGKVVKHVDKTRELRVSPLVLKRTKID--NICYEMGQCELSIR 372
+L +F V + + P+VLK T D N Y+ G C L +
Sbjct 298 AILQRRFRNVYGKMATVP--KNVPMVLKGTVNDSTNKWYQQGVCLLEFQ 344
>gi|15669863|ref|NP_248677.1| hypothetical protein MJ_1667 [Methanocaldococcus jannaschii DSM
2661]
gi|41688762|sp|Q59061.1|Y1667_METJA RecName: Full=Uncharacterized protein MJ1667
gi|1500570|gb|AAB99692.1| hypothetical protein MJ_1667 [Methanocaldococcus jannaschii DSM
2661]
Length=418
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/336 (22%), Positives = 138/336 (42%), Gaps = 63/336 (18%)
Query 9 ELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEA--FVMNTD 66
E+ + P+FIG GE+ + +Y +E + D+E +D+ ++ + + V N D
Sbjct 47 EVKCELITPIFIGCGEEYSQLDYFIEDGLAHIIDLEKAVSDLDDLEKVDYISGLIVSNID 106
Query 67 GAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIHAF 126
+ K+ +E +V L+P Y+ I IE S + TR K +N+ + +
Sbjct 107 NNRLNLTAKDILE--SVGLNP-----YDYVIRKIESEIFSNKK----TRVKKFINQNNTY 155
Query 127 IKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQPVRVPGHQTREHRQYGERFERKELRK 186
Y+PGS++KG +R+ Y+ + K + +++ + + G+ E+
Sbjct 156 --------YIPGSSIKGAIRTAYIFNYYDKNLPELLKILDDRNIKLHDKGKELEK----- 202
Query 187 SGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRECLAPGT 246
N +D D F+ ++++DS L I K N K +P+ E + GT
Sbjct 203 ----NAISKDIPKDFFKYLKISDSLNLEGEFKFIHTKR-WNYRKKKFDVPINMEGMTKGT 257
Query 247 -------------SISHRVVVDTSPTARGGWREGER-----------FLETLAETAASVN 282
+I+ R+ + +P ++ E+ F +T+ E N
Sbjct 258 FSINIKIEDEFFKNINKRLKTNYNP------KDDEKKFDILKNLCNNFSKTVVEFELKKN 311
Query 283 QARYAE--YRAMYPGVNAIVGPIVYLGGGAGYRSKT 316
Y E Y + +N + LG G G+ +KT
Sbjct 312 NPVYVEKSYEKLLADINKDDAIYLNLGFGGGFLNKT 347
Lambda K H
0.319 0.136 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 718963958700
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
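
Note on the statistics above: the bit scores in this report can be reproduced from the raw scores (the numbers in parentheses on each "Score =" line) and the gapped Lambda and K values, using the standard Karlin-Altschul relations. The sketch below is not part of the BLAST output. The function names bit_score and expect_value are illustrative, and the Expect calculation assumes the single database-wide effective search space printed above. Because these hits use "Compositional matrix adjust", the reported Expect values may differ slightly from this simple estimate.

```python
import math

# Values copied from the "Gapped" statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 718963958700


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Estimate the Expect value: K * search_space * exp(-lambda*S)."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores taken from three "Score =" lines in this report.
    for raw in (181, 179, 158):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E~{expect_value(raw):.0e}")
```

For the raw score 181 this gives 74.3 bits, matching the "Score = 74.3 bits (181)" lines, and an Expect estimate close to the reported 3e-11.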