BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2820c
Length=302
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609957|ref|NP_217336.1| hypothetical protein Rv2820c [Mycob... 607 5e-172
gi|289751479|ref|ZP_06510857.1| hypothetical protein TBDG_02577 ... 606 1e-171
gi|340627816|ref|YP_004746268.1| hypothetical protein MCAN_28441... 538 3e-151
gi|323718577|gb|EGB27744.1| csm4 family CRISPR-associated ramp p... 518 4e-145
gi|289571007|ref|ZP_06451234.1| hypothetical protein TBJG_02426 ... 460 1e-127
gi|289746619|ref|ZP_06505997.1| predicted protein [Mycobacterium... 236 4e-60
gi|224543483|ref|ZP_03684022.1| hypothetical protein CATMIT_0269... 225 6e-57
gi|331004040|ref|ZP_08327522.1| csm4 family CRISPR-associated ra... 202 5e-50
gi|253578037|ref|ZP_04855309.1| CRISPR-associated protein [Rumin... 196 3e-48
gi|289549402|ref|YP_003470306.1| CRISPR-associated RAMP protein,... 184 2e-44
gi|240143671|ref|ZP_04742272.1| CRISPR-associated RAMP protein, ... 182 8e-44
gi|291539923|emb|CBL13034.1| CRISPR-associated RAMP protein, Csm... 179 4e-43
gi|229826461|ref|ZP_04452530.1| hypothetical protein GCWU000182_... 178 1e-42
gi|329736407|gb|EGG72676.1| CRISPR-associated RAMP protein, Csm4... 177 2e-42
gi|57865881|ref|YP_190001.1| CRISPR-associated Csm4 family prote... 177 2e-42
gi|291460038|ref|ZP_06599428.1| CRISPR-associated RAMP protein, ... 177 3e-42
gi|315641550|ref|ZP_07896619.1| csm4 family CRISPR-associated ra... 171 1e-40
gi|315925058|ref|ZP_07921275.1| CRISPR-associated Csm4 family pr... 171 1e-40
gi|345284419|gb|AEN78272.1| CRISPR-associated protein, Csm4 fami... 170 3e-40
gi|295105102|emb|CBL02646.1| CRISPR-associated RAMP protein, Csm... 166 4e-39
gi|322387546|ref|ZP_08061155.1| csm4 family CRISPR-associated ra... 165 7e-39
gi|270292489|ref|ZP_06198700.1| conserved hypothetical protein [... 161 1e-37
gi|322375483|ref|ZP_08049996.1| CRISPR-associated RAMP protein, ... 161 1e-37
gi|339278116|emb|CCC19864.1| hypothetical protein STH8232_1165 [... 158 1e-36
gi|312278324|gb|ADQ62981.1| CRISPR-associated protein, Csm4 fami... 158 1e-36
gi|55820998|ref|YP_139440.1| hypothetical protein stu0963 [Strep... 157 2e-36
gi|55822917|ref|YP_141358.1| hypothetical protein str0963 [Strep... 156 3e-36
gi|334308472|gb|EGL99458.1| CRISPR-associated RAMP protein, Csm4... 151 1e-34
gi|227890794|ref|ZP_04008599.1| CRISPR-associated Csm4 family pr... 142 7e-32
gi|125718068|ref|YP_001035201.1| hypothetical protein SSA_1248 [... 140 2e-31
gi|114567268|ref|YP_754422.1| hypothetical protein Swol_1753 [Sy... 140 3e-31
gi|325687527|gb|EGD29548.1| hypothetical protein HMPREF9381_1061... 138 1e-30
gi|327469967|gb|EGF15431.1| hypothetical protein HMPREF9386_0578... 138 1e-30
gi|296133518|ref|YP_003640765.1| CRISPR-associated RAMP protein,... 137 2e-30
gi|225018978|ref|ZP_03708170.1| hypothetical protein CLOSTMETH_0... 134 2e-29
gi|237741577|ref|ZP_04572058.1| CRISPR-associated protein [Fusob... 132 5e-29
gi|294782690|ref|ZP_06748016.1| CRISPR-associated RAMP protein, ... 126 5e-27
gi|340752432|ref|ZP_08689231.1| csm4 family CRISPR-associated ra... 121 1e-25
gi|301299524|ref|ZP_07205793.1| putative CRISPR-associated RAMP ... 120 3e-25
gi|258645684|ref|ZP_05733153.1| CRISPR-associated RAMP protein, ... 115 1e-23
gi|121533437|ref|ZP_01665265.1| CRISPR-associated RAMP protein, ... 103 3e-20
gi|323141260|ref|ZP_08076156.1| CRISPR-associated RAMP protein, ... 103 4e-20
gi|341822664|emb|CCC73588.1| CRISPR-associated RAMP protein [Meg... 100 4e-19
gi|334126728|ref|ZP_08500676.1| Csm4 family CRISPR-associated RA... 97.8 2e-18
gi|313894815|ref|ZP_07828375.1| CRISPR-associated RAMP protein, ... 93.2 4e-17
gi|292669139|ref|ZP_06602565.1| csm4 family CRISPR-associated ra... 92.4 8e-17
gi|312899097|ref|ZP_07758475.1| CRISPR-associated RAMP protein, ... 90.9 2e-16
gi|344997572|ref|YP_004799915.1| CRISPR-associated RAMP protein,... 88.2 1e-15
gi|312794662|ref|YP_004027585.1| crispr-associated ramp protein,... 87.8 2e-15
gi|345303032|ref|YP_004824934.1| CRISPR-associated RAMP protein,... 87.4 3e-15
>gi|15609957|ref|NP_217336.1| hypothetical protein Rv2820c [Mycobacterium tuberculosis H37Rv]
gi|15842361|ref|NP_337398.1| hypothetical protein MT2887 [Mycobacterium tuberculosis CDC1551]
gi|31793996|ref|NP_856489.1| hypothetical protein Mb2844c [Mycobacterium bovis AF2122/97]
63 more sequence titles
Length=302
Score = 607 bits (1566), Expect = 5e-172, Method: Compositional matrix adjust.
Identities = 302/302 (100%), Positives = 302/302 (100%), Gaps = 0/302 (0%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL
Sbjct 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
Query 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ 120
TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ
Sbjct 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ 120
Query 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISA 180
TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISA
Sbjct 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISA 180
Query 181 LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS 240
LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS
Sbjct 181 LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS 240
Query 241 GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES
Sbjct 241 GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
Query 301 AA 302
AA
Sbjct 301 AA 302
>gi|289751479|ref|ZP_06510857.1| hypothetical protein TBDG_02577 [Mycobacterium tuberculosis T92]
gi|289692066|gb|EFD59495.1| hypothetical protein TBDG_02577 [Mycobacterium tuberculosis T92]
Length=302
Score = 606 bits (1563), Expect = 1e-171, Method: Compositional matrix adjust.
Identities = 301/302 (99%), Positives = 302/302 (100%), Gaps = 0/302 (0%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL
Sbjct 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
Query 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ 120
TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ
Sbjct 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ 120
Query 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISA 180
TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLL+GISA
Sbjct 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLRGISA 180
Query 181 LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS 240
LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS
Sbjct 181 LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS 240
Query 241 GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES
Sbjct 241 GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
Query 301 AA 302
AA
Sbjct 301 AA 302
>gi|340627816|ref|YP_004746268.1| hypothetical protein MCAN_28441 [Mycobacterium canettii CIPT
140010059]
gi|340006006|emb|CCC45175.1| hypothetical protein MCAN_28441 [Mycobacterium canettii CIPT
140010059]
Length=302
Score = 538 bits (1387), Expect = 3e-151, Method: Compositional matrix adjust.
Identities = 281/302 (94%), Positives = 285/302 (95%), Gaps = 0/302 (0%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
MNSRLFRFD RTHFGDHGLESSTI CPADTLYSALCVEALRMGGQQLL ELVACSTLRL
Sbjct 1 MNSRLFRFDVVRTHFGDHGLESSTIGCPADTLYSALCVEALRMGGQQLLDELVACSTLRL 60
Query 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQ 120
TDLLPYVGPDYLVPKPL SVRSD SS+QKKLAKKIGFLPAAQLGSFLDGTADL +LAARQ
Sbjct 61 TDLLPYVGPDYLVPKPLKSVRSDSSSLQKKLAKKIGFLPAAQLGSFLDGTADLDDLAARQ 120
Query 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISA 180
TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFE DAGLWLLATGSESELGLLTRLLKGISA
Sbjct 121 TKIGVHAVSAKAAIHNGKKDADPYRVGYFRFEPDAGLWLLATGSESELGLLTRLLKGISA 180
Query 181 LGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS 240
LGGERTSGFGAF TESE PAALTPT AASLMTLTTSLPTDDELE ALAGATYRLVKRS
Sbjct 181 LGGERTSGFGAFIPTESEVPAALTPTTAAASLMTLTTSLPTDDELEQALAGATYRLVKRS 240
Query 241 GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
GFVAS+TYA+ P RKRDIYK AAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES
Sbjct 241 GFVASATYAETPRRKRDIYKLAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES 300
Query 301 AA 302
AA
Sbjct 301 AA 302
>gi|323718577|gb|EGB27744.1| csm4 family CRISPR-associated ramp protein [Mycobacterium tuberculosis
CDC1551A]
Length=260
Score = 518 bits (1335), Expect = 4e-145, Method: Compositional matrix adjust.
Identities = 260/260 (100%), Positives = 260/260 (100%), Gaps = 0/260 (0%)
Query 43 MGGQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQ 102
MGGQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQ
Sbjct 1 MGGQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQ 60
Query 103 LGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLAT 162
LGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLAT
Sbjct 61 LGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLAT 120
Query 163 GSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTD 222
GSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTD
Sbjct 121 GSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTD 180
Query 223 DELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG 282
DELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG
Sbjct 181 DELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG 240
Query 283 NHPVYSYARPLFLALPESAA 302
NHPVYSYARPLFLALPESAA
Sbjct 241 NHPVYSYARPLFLALPESAA 260
>gi|289571007|ref|ZP_06451234.1| hypothetical protein TBJG_02426 [Mycobacterium tuberculosis T17]
gi|289544761|gb|EFD48409.1| hypothetical protein TBJG_02426 [Mycobacterium tuberculosis T17]
Length=237
Score = 460 bits (1184), Expect = 1e-127, Method: Compositional matrix adjust.
Identities = 232/235 (99%), Positives = 232/235 (99%), Gaps = 0/235 (0%)
Query 68 GPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHA 127
GP VPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHA
Sbjct 3 GPITWVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHA 62
Query 128 VSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISALGGERTS 187
VSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISALGGERTS
Sbjct 63 VSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGISALGGERTS 122
Query 188 GFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASST 247
GFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASST
Sbjct 123 GFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASST 182
Query 248 YADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPESAA 302
YADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPESAA
Sbjct 183 YADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPESAA 237
>gi|289746619|ref|ZP_06505997.1| predicted protein [Mycobacterium tuberculosis 02_1987]
gi|289758938|ref|ZP_06518316.1| predicted protein [Mycobacterium tuberculosis T85]
gi|294994090|ref|ZP_06799781.1| CRISPR-associated RAMP protein, Csm4 family [Mycobacterium tuberculosis
210]
gi|289687147|gb|EFD54635.1| predicted protein [Mycobacterium tuberculosis 02_1987]
gi|289714502|gb|EFD78514.1| predicted protein [Mycobacterium tuberculosis T85]
gi|326904434|gb|EGE51367.1| hypothetical protein TBPG_02338 [Mycobacterium tuberculosis W-148]
Length=118
Score = 236 bits (601), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 114/115 (99%), Positives = 114/115 (99%), Gaps = 0/115 (0%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL
Sbjct 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
Query 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKE 115
TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADL E
Sbjct 61 TDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLNE 115
>gi|224543483|ref|ZP_03684022.1| hypothetical protein CATMIT_02692 [Catenibacterium mitsuokai
DSM 15897]
gi|224523610|gb|EEF92715.1| hypothetical protein CATMIT_02692 [Catenibacterium mitsuokai
DSM 15897]
Length=305
Score = 225 bits (574), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 123/306 (41%), Positives = 186/306 (61%), Gaps = 12/306 (3%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
MN ++++ F + HFG+H LE S I ADTL+SALC+EAL++ + L + V + L
Sbjct 1 MNYKIYKMIFTQGVHFGEHSLEKSEIIFQADTLFSALCIEALKIDKLETLLKSVKENHLV 60
Query 60 LTDLLPYVGPDYLVPKPLHSV----RSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKE 115
+D PY+ ++ VPKP+ + +S+ +++KK KK+ ++ + L +L G + +
Sbjct 61 FSDAFPYMNQEFFVPKPMKKIEQVTQSEDMTIRKKF-KKLEYVQVSLLDQYLKGQYPIDK 119
Query 116 LAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLA-TGSESELGLLTRL 174
+ K+GVHA+ A+I G ++A PYRVG +RFE GL+++ S+ L L +L
Sbjct 120 -GSDIKKLGVHALKTSASI-RGNEEALPYRVGIYRFEKHNGLYIIVGYDSKETLHLFDKL 177
Query 175 LK--GISALGGERTSGFGAFNLTESEAPAALTPTVDAA-SLMTLTTSLPTDDELEAALAG 231
K +S +GG++ SG G F E P L ++ +MTL+ SLPTD E+E L
Sbjct 178 FKMLSLSGIGGKKNSGLGHFRYNTVELPKELNNRLNTKGEVMTLSVSLPTDKEIEDILDD 237
Query 232 ATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYAR 291
+ Y LVKRSGFV S TY+ RK+DIY F AGS F++ +QG I +VS GG+HPVY YA+
Sbjct 238 SRYLLVKRSGFVDSYTYSKEQRRKKDIYLFKAGSCFNKTYQGDIYNVSSGGSHPVYKYAK 297
Query 292 PLFLAL 297
PLF+ +
Sbjct 298 PLFMGV 303
>gi|331004040|ref|ZP_08327522.1| csm4 family CRISPR-associated ramp protein [Lachnospiraceae oral
taxon 107 str. F0167]
gi|330411626|gb|EGG91034.1| csm4 family CRISPR-associated ramp protein [Lachnospiraceae oral
taxon 107 str. F0167]
Length=313
Score = 202 bits (514), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/312 (36%), Positives = 179/312 (58%), Gaps = 15/312 (4%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
MN ++ + DF HFG GLE ADT++SAL +EAL+ G L EL L+
Sbjct 1 MNYKILKLDFTTAVHFGSGGLEKGQNVLNADTIFSALFIEALKYGKSDRLLELCKNCKLK 60
Query 60 LTDLLPYVGPDYLVPKPLHSVRSD--GSSMQKKLAKKIGFLPAAQLGSFLDGTADLK-EL 116
+++ PY+G DY +PKP+ + +D G S+ KK KK+ ++P ++ F++G D+K E
Sbjct 61 ISNAFPYIGCDYYLPKPIIRLNNDADGDSIIKKALKKLRYIPVSRFDDFINGKLDIKAEA 120
Query 117 AARQTKIGVHAVSAKAAIHNGKKDA--DPYRVGYFRFELDAGLWL-LATGSESELGLLTR 173
+G + K ++ +D + Y V F+++ D+GL+ + G++ + ++++
Sbjct 121 DLFGENLGESVLMEKVSMETASEDEVNNLYAVEIFKYKKDSGLYFFVGYGADEDFEMISK 180
Query 174 LLKGIS--ALGGERTSGFGAFNLTE----SEAPAALTPTVDAASLMTLTTSLPTDDELEA 227
L++ IS +GG+ +SG+G F L+ E + +M+L+ LP +DELE
Sbjct 181 LMESISYTGIGGKVSSGYGKFKLSVDIPVKEIVKKFEDIDNYKDIMSLSLCLPNEDELEQ 240
Query 228 ALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS--LGGNHP 285
+L GA+Y +VKRSGF+ASS YAD +K+D+Y AGSV+ F+G I DVS G HP
Sbjct 241 SLEGASYTVVKRSGFIASSKYADTFRKKKDLYMIEAGSVYKNAFEGDIYDVSDKNNGTHP 300
Query 286 VYSYARPLFLAL 297
VY Y +P F+ +
Sbjct 301 VYRYGKPFFMGV 312
>gi|253578037|ref|ZP_04855309.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850355|gb|EES78313.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39BFAA]
Length=306
Score = 196 bits (498), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 111/308 (37%), Positives = 180/308 (59%), Gaps = 13/308 (4%)
Query 1 MNSRLFRFDF-DRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
MN L+ +F + HFG L+SS I+ ADTL+SAL EAL+M Q + V+ +
Sbjct 1 MNYTLYCLEFINGVHFGRGNLDSSEITFHADTLFSALFQEALKMKKQDRFLKQVSDGKII 60
Query 60 LTDLLPYVGPDYLVPKPLHSVRSD-----GSSMQKKLAKKIGFLPAAQLGSFLDGTADLK 114
+D PY+G +Y +PKP+ +V+++ G S QKK K + ++P + L F++GT +
Sbjct 61 FSDAFPYIGKNYYIPKPMIAVQTEDESKQGDSRQKKKFKNLNYIPVSTLADFVNGTFP-E 119
Query 115 ELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLA-TGSESELGLLTR 173
E +G++ + IH G ++ +P+RV + F GL++LA +ES+ LL
Sbjct 120 EHMEDMKYLGLYDMKVSVGIH-GMEEPEPFRVNTWHFNTGTGLYVLAGYENESDRELLDE 178
Query 174 LLKGI--SALGGERTSGFGAFNLTESEAPAALTPTVDAASL--MTLTTSLPTDDELEAAL 229
L + + + +GG+++SG G F + P + + S + L+T+LP D+ELE +
Sbjct 179 LFESLQYTGIGGKKSSGLGRFVYKVCKVPEDMMGYLKKKSEKNILLSTALPEDEELEQVM 238
Query 230 AGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSY 289
G++Y L+KRSGFV SS YA +RKRD+Y F++GS F+ F+G I++ GG HPV+ Y
Sbjct 239 KGSSYLLIKRSGFVDSSEYALQQMRKRDLYVFSSGSCFAHTFRGRIIEERNGGKHPVFRY 298
Query 290 ARPLFLAL 297
A+ F+ +
Sbjct 299 AKAFFMGV 306
>gi|289549402|ref|YP_003470306.1| CRISPR-associated RAMP protein, Csm4 family [Staphylococcus lugdunensis
HKU09-01]
gi|289178934|gb|ADC86179.1| CRISPR-associated RAMP protein, Csm4 family [Staphylococcus lugdunensis
HKU09-01]
gi|339893266|emb|CCB52452.1| CRISPR associated RAMP family protein [Staphylococcus lugdunensis
N920143]
Length=302
Score = 184 bits (466), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 111/315 (36%), Positives = 172/315 (55%), Gaps = 31/315 (9%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQ--QLLGELVACST 57
M +++F+ F HFG L ++ ADTL+SAL E L +G Q L+ +L+
Sbjct 1 MITKIFKLSFKTPVHFGKKRLSDGEMTIKADTLFSALYTETLNLGKQTNWLMNDLI---- 56
Query 58 LRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGT---ADLK 114
++D PY Y +PKPL + S+ K KK+ ++P ++ G D++
Sbjct 57 --ISDTFPYESELYYLPKPLIKIESNSEGNHKDF-KKLKYVPVYNYNDYIKGQLSEEDVR 113
Query 115 ELA--------ARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSES 166
+L + QTK VS KA + +D++PY VG F FE DAGL+ +A GSE
Sbjct 114 DLNDIFRVGQFSLQTK-----VSLKAQEQSPNEDSEPYSVGTFSFEKDAGLYFIAKGSER 168
Query 167 ELGLLTRLLKGI--SALGGERTSGFGAFNLT--ESEAPAALTPTVDAASLMTLTTSLPTD 222
+ L ++ + S +GG+R++G+G F T +E + L D + + L+TS+
Sbjct 169 TIQRLNEVMYALQFSGIGGKRSAGYGRFEYTCVSNENISNLL-NQDGDNFILLSTSMAKR 227
Query 223 DELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG 282
+ELEA+L A Y L KR+GF+ S+TY+D ++K D Y F+ GSVF F+G I +V G
Sbjct 228 EELEASLNKARYILSKRTGFIQSTTYSDRLVKKNDFYSFSVGSVFKTIFKGDIFNVGNQG 287
Query 283 NHPVYSYARPLFLAL 297
HPVY YA+PL++ +
Sbjct 288 QHPVYRYAKPLWMEV 302
>gi|240143671|ref|ZP_04742272.1| CRISPR-associated RAMP protein, Csm4 family [Roseburia intestinalis
L1-82]
gi|257204348|gb|EEV02633.1| CRISPR-associated RAMP protein, Csm4 family [Roseburia intestinalis
L1-82]
Length=308
Score = 182 bits (461), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 120/305 (40%), Positives = 173/305 (57%), Gaps = 11/305 (3%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
M +++ +F HFG L S + AD ++SAL +EAL++ QQ L + V L
Sbjct 5 MEYTIYQLEFKTGVHFGTGMLNESACTFKADQIFSALYIEALKLNLQQQLYDAVKKGNLL 64
Query 60 LTDLLPYVGPDYLVPKPLHSVR--SDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA 117
++D PY+G Y++PKP+ V + G S QKK KK+ F+P L FL+G DL
Sbjct 65 ISDAFPYIGQQYMIPKPMIYVEPVNRGESKQKKAYKKMKFIPVECLIDFLNGKMDLSNDP 124
Query 118 ARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLAT-GSESELGLLTRLLK 176
G + AA+ G++ PYRVG F +E GL+++A SE+E L+ +LL
Sbjct 125 MEH--YGHYFQQTMAAVRTGEETL-PYRVGTFYYEEGCGLYIIAAYQSENEKCLMEKLLT 181
Query 177 GIS--ALGGERTSGFGAFNLTESEAPAALTPTVDAAS--LMTLTTSLPTDDELEAALAGA 232
+S +GG+++ G G + E + P L + S M L+ +LP D+ELE A+ A
Sbjct 182 ALSYTGIGGKKSGGLGKYRYNEGQIPEQLLECLQKKSDRYMLLSVALPADEELENAMENA 241
Query 233 TYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARP 292
+Y L KRSGFVASS YA+ +K+D+Y F AGS F F G I DVS GG HPVY YA+P
Sbjct 242 SYLLEKRSGFVASSDYAEEWRKKKDLYVFTAGSCFVNCFAGDIYDVSEGGKHPVYRYAKP 301
Query 293 LFLAL 297
+F+ +
Sbjct 302 IFMGV 306
>gi|291539923|emb|CBL13034.1| CRISPR-associated RAMP protein, Csm4 family [Roseburia intestinalis
XB6B4]
Length=304
Score = 179 bits (455), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 120/305 (40%), Positives = 172/305 (57%), Gaps = 11/305 (3%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
M +++ +F HFG L S + AD ++SAL +EAL++ QQ L + V L
Sbjct 1 MEYTIYQLEFKTGVHFGTGMLNESACTFKADQIFSALYIEALKLNLQQQLYDAVKKGNLL 60
Query 60 LTDLLPYVGPDYLVPKPLHSVR--SDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA 117
++D PY+G Y++PKP+ V + S QKK KK+ F+P L FL+G DL
Sbjct 61 ISDAFPYIGQQYMIPKPMIYVEPVNRAESKQKKAYKKMKFIPVECLIDFLNGKMDLSNDP 120
Query 118 ARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGS-ESELGLLTRLLK 176
G + AA+ G++ PYRVG F +E GL+++A ++E L+ +LL
Sbjct 121 MEH--YGHYFQQTMAAVRTGEETL-PYRVGTFYYEEGCGLYIIAAYQGKNEKCLMEKLLT 177
Query 177 GIS--ALGGERTSGFGAFNLTESEAPAALTPTVDAAS--LMTLTTSLPTDDELEAALAGA 232
+S +GG+++ G G F E + P L ++ S M L+ +LP DDELE AL A
Sbjct 178 ALSYTGIGGKKSGGLGKFQYDEKQIPGKLLESLQKKSDRYMLLSVALPADDELENALENA 237
Query 233 TYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARP 292
+Y L KRSGFVASS YA+ +K+D+Y F AGS F F G I DVS GG HPVY YA+P
Sbjct 238 SYLLEKRSGFVASSDYAEEWRKKKDLYVFTAGSCFVNCFAGDIYDVSEGGKHPVYRYAKP 297
Query 293 LFLAL 297
+F+ +
Sbjct 298 IFMGV 302
>gi|229826461|ref|ZP_04452530.1| hypothetical protein GCWU000182_01834 [Abiotrophia defectiva
ATCC 49176]
gi|229789331|gb|EEP25445.1| hypothetical protein GCWU000182_01834 [Abiotrophia defectiva
ATCC 49176]
Length=309
Score = 178 bits (451), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 108/296 (37%), Positives = 163/296 (56%), Gaps = 12/296 (4%)
Query 13 THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVAC--STLRLTDLLPYVGPD 70
H G L + ADT++SALC EAL++G +L + AC ++LTD LP++
Sbjct 14 VHIGAGALTRGKYTLYADTVFSALCKEALKLGENKLSRLVEACRDDKIKLTDGLPFIEDR 73
Query 71 YLVPKPLHS--VRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLK-ELAARQTKIGVHA 127
Y VPKP+ + ++ KK AKK+ ++ ++ +L G+ D+K E IG H
Sbjct 74 YYVPKPMLELDISTESDRTVKKAAKKLEYISIEKMDDYLSGSLDVKQENEYFSNNIGRHN 133
Query 128 VSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLL---KGISALGGE 184
+ KA I G+ DA+PY V F + +GL++ EL + + S +GG+
Sbjct 134 LVGKAKIIIGE-DANPYVVDTFVYGEKSGLYVCVGVDNDELEAFVKEIFTSLSYSGVGGK 192
Query 185 RTSGFGAFNLTESEAPAALTPTVDAASL---MTLTTSLPTDDELEAALAGATYRLVKRSG 241
++G+G F L A +D + M+L+ SLP+DDEL+ L+ A Y L+KRSG
Sbjct 193 ISAGYGKFELEVVPATENFVKRLDNTAYKKYMSLSISLPSDDELKDVLSTADYLLIKRSG 252
Query 242 FVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLAL 297
FV+S TY++ +K+D Y AAG VF + F+G ILDVS GG H VY YA+P + +
Sbjct 253 FVSSETYSETLRKKKDKYCMAAGFVFEKEFKGSILDVSNGGTHLVYRYAKPFIMGV 308
>gi|329736407|gb|EGG72676.1| CRISPR-associated RAMP protein, Csm4 family [Staphylococcus epidermidis
VCU045]
gi|341656676|gb|EGS80385.1| CRISPR-associated RAMP protein, Csm4 family [Staphylococcus epidermidis
VCU037]
Length=302
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/315 (34%), Positives = 169/315 (54%), Gaps = 31/315 (9%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQ--QLLGELVACST 57
M +++F+ F HFG L ++ ADTL+SAL +E L++G LL +L+
Sbjct 1 MATKVFKLSFKTPVHFGKKRLSDGEMTITADTLFSALFIETLQLGKDTDWLLNDLI---- 56
Query 58 LRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA 117
++D PY Y +PKPL + S K KK+ ++P +L+G EL+
Sbjct 57 --ISDTFPYENELYYLPKPLIKIDSKEEDNHKAF-KKLKYVPVHHYNQYLNG-----ELS 108
Query 118 ARQT-------KIGVHAVSAKAAI----HNGKKDADPYRVGYFRFELDAGLWLLATGSES 166
A IG ++ K ++ + D++PY VG F FE +AGL+ +A GSE
Sbjct 109 AEDATDLNDIFNIGYFSLQTKVSLIAQETDSSADSEPYSVGTFTFEPEAGLYFIAKGSEE 168
Query 167 ELGLLTRLLKGI--SALGGERTSGFGAFN--LTESEAPAALTPTVDAASLMTLTTSLPTD 222
L L ++ + S LGG+R +G+G F + ++ + L S++ L+T++
Sbjct 169 TLDHLNNIMTALQYSGLGGKRNAGYGQFEYEIINNQQLSKLLNQNGKHSIL-LSTAMAKK 227
Query 223 DELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG 282
+E+E+AL A Y L KRSGF+ S+ Y++M ++K D Y F++GSVF F G I +V G
Sbjct 228 EEIESALKEARYILTKRSGFIQSTNYSEMLVKKSDFYSFSSGSVFKNIFDGDIFNVGHNG 287
Query 283 NHPVYSYARPLFLAL 297
HPVY YA+PL+L +
Sbjct 288 KHPVYRYAKPLWLEV 302
>gi|57865881|ref|YP_190001.1| CRISPR-associated Csm4 family protein [Staphylococcus epidermidis
RP62A]
gi|57636539|gb|AAW53327.1| CRISPR-associated protein, TM1808 family [Staphylococcus epidermidis
RP62A]
Length=304
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/313 (35%), Positives = 168/313 (54%), Gaps = 31/313 (9%)
Query 3 SRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRMGGQ--QLLGELVACSTLR 59
+++F+ F HFG L ++ ADTL+SAL +E L++G LL +L+
Sbjct 5 TKVFKLSFKTPVHFGKKRLSDGEMTITADTLFSALFIETLQLGKDTDWLLNDLI------ 58
Query 60 LTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAAR 119
++D PY Y +PKPL + S K KK+ ++P +L+G EL+A
Sbjct 59 ISDTFPYENELYYLPKPLIKIDSKEEDNHKAF-KKLKYVPVHHYNQYLNG-----ELSAE 112
Query 120 QT-------KIGVHAVSAKAAI----HNGKKDADPYRVGYFRFELDAGLWLLATGSESEL 168
IG ++ K ++ + D++PY VG F FE +AGL+ +A GSE L
Sbjct 113 DATDLNDIFNIGYFSLQTKVSLIAQETDSSADSEPYSVGTFTFEPEAGLYFIAKGSEETL 172
Query 169 GLLTRLLKGI--SALGGERTSGFGAFN--LTESEAPAALTPTVDAASLMTLTTSLPTDDE 224
L ++ + S LGG+R +G+G F + ++ + L S++ L+T++ +E
Sbjct 173 DHLNNIMTALQYSGLGGKRNAGYGQFEYEIINNQQLSKLLNQNGKHSIL-LSTAMAKKEE 231
Query 225 LEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNH 284
+E+AL A Y L KRSGFV S+ Y++M ++K D Y F++GSVF F G I +V G H
Sbjct 232 IESALKEARYILTKRSGFVQSTNYSEMLVKKSDFYSFSSGSVFKNIFNGDIFNVGHNGKH 291
Query 285 PVYSYARPLFLAL 297
PVY YA+PL+L +
Sbjct 292 PVYRYAKPLWLEV 304
>gi|291460038|ref|ZP_06599428.1| CRISPR-associated RAMP protein, Csm4 family [Oribacterium sp.
oral taxon 078 str. F0262]
gi|291417379|gb|EFE91098.1| CRISPR-associated RAMP protein, Csm4 family [Oribacterium sp.
oral taxon 078 str. F0262]
Length=305
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 115/308 (38%), Positives = 162/308 (53%), Gaps = 16/308 (5%)
Query 1 MNSRLFRFDFDRT-HFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLR 59
M ++R F + HFG L S ADTL+SA +EAL++G ++ L V L
Sbjct 1 MKHLIYRLRFSTSVHFGRGMLSESAFCFSADTLFSAFYIEALKLGREEELYSAVKSGALC 60
Query 60 LTDLLPYVGPDYLVPKPLHSVRSD--GSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA 117
+D P++ Y +PKP+ V S G S +KK KKI ++PA L F G EL
Sbjct 61 FSDAFPFLRERYFLPKPMFYVESKAPGDSKEKKRFKKIQYIPAELLEDFFRG-----ELH 115
Query 118 ARQT---KIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLL-ATGSESELGLLTR 173
A ++G + +AA+ D PY VG F F GL+L+ A S E L+
Sbjct 116 AEHCDLGELGEESSQMRAAVSRSGDDTRPYCVGDFFFREGNGLYLIFALESNREEKLIEI 175
Query 174 LLKGIS--ALGGERTSGFGAFNLTESEAPAALTPTV--DAASLMTLTTSLPTDDELEAAL 229
L + +S +GG+R+SG G F+ E L + + M L ++LP + ELE+AL
Sbjct 176 LFQSLSYSGIGGKRSSGKGRFSYERRELSEELQDAMMREGKRYMLLCSALPREVELESAL 235
Query 230 AGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSY 289
GA+Y L KRSGF+ S ++ +KRD+Y F +GS F FQG + V+ GG HPVY Y
Sbjct 236 EGASYLLQKRSGFILSENFSPEQQKKRDLYTFRSGSCFRHRFQGVVYLVNEGGAHPVYRY 295
Query 290 ARPLFLAL 297
AR LFL +
Sbjct 296 ARGLFLGV 303
>gi|315641550|ref|ZP_07896619.1| csm4 family CRISPR-associated ramp protein [Enterococcus italicus
DSM 15952]
gi|315482687|gb|EFU73214.1| csm4 family CRISPR-associated ramp protein [Enterococcus italicus
DSM 15952]
Length=307
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/296 (38%), Positives = 165/296 (56%), Gaps = 22/296 (7%)
Query 13 THFGDHGLESSTISCPADTLYSALCVEALRMGGQQL-LGELVACSTLRLTDLLPYVGPDY 71
HFG L S + ADTL+SAL +EAL+ QQL L L+ + L +TDL PY Y
Sbjct 17 VHFGMKRLSDSNHTIAADTLFSALIIEALQ---QQLELSHLL--NNLVITDLFPYNKTSY 71
Query 72 LVPKPLHSVR-SDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA--ARQTKIGVHAV 128
+PKPL + G K KK+ ++P +L G D E + A +G ++
Sbjct 72 FLPKPLIRIEGKKGDESGYKAFKKLTYIPVENYSEYLRGEIDSLEASKIAESLNLGKASL 131
Query 129 SAKAAI----HNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTRLLKGI--SALG 182
S K ++ HNG +++PY VG F F ++GL+ LA G+ +G L L+ + S +G
Sbjct 132 STKVSLQAVDHNG--ESEPYSVGNFTFYPESGLYFLAKGNADTIGQLEILMHALQYSGIG 189
Query 183 GERTSGFGAFNLTESEA---PAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKR 239
G+R++G+G F T ++ + L+ T + A L L++++ +D+EL L A Y L KR
Sbjct 190 GKRSAGYGQFRCTIEDSGKFDSLLSQTGNIAIL--LSSAMASDEELVDCLEDARYLLKKR 247
Query 240 SGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFL 295
+GFV S TYAD ++K+D Y F+AGS F + F G I DVS G H VY YA+ +L
Sbjct 248 TGFVQSKTYADQLVKKKDFYAFSAGSTFYQKFNGKIFDVSDNGRHSVYRYAKAFWL 303
>gi|315925058|ref|ZP_07921275.1| CRISPR-associated Csm4 family protein [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621957|gb|EFV01921.1| CRISPR-associated Csm4 family protein [Pseudoramibacter alactolyticus
ATCC 23263]
Length=314
Score = 171 bits (433), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/315 (40%), Positives = 175/315 (56%), Gaps = 21/315 (6%)
Query 1 MNSRLFRFDFDR-THFGDHGLESSTISCPADTLYSALCVEALRM--GGQQLLGELVACST 57
M + L +F+F HFGD LE+ + ADTL+SAL +EAL+ + L V
Sbjct 1 MKTALIQFEFSSGVHFGDQRLENGRSTFGADTLFSALFIEALKRDEACAEALLNAVRDDR 60
Query 58 LRLTDLLPYVGPDYLVPKPLHSVRSD---GSSMQKKLAKKIGFLPAAQLGSFLDG----- 109
L +D +P++ Y +PKP + + G S KK KK+ ++ +FL G
Sbjct 61 LNFSDGMPFMDERYYLPKPFIHIENKAEAGDSSVKKAYKKMAYVAVDCFDAFLKGEMPLA 120
Query 110 -TADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGS-ESE 167
T DL+ L K V + + +G+++ +PYRV +F F D GL++L + E E
Sbjct 121 RTRDLEALGRFDMKTQVCLQNER---DSGQQEPEPYRVKHFYFNPDCGLYILVRAAGEEE 177
Query 168 LGLLTRLLKGIS--ALGGERTSGFGAFNLTESEAPAALTPTVDAA-SL-MTLTTSLPTDD 223
L+ LL +S +GG R+SG G F E+ P A+ + A SL MTL+ +LP D
Sbjct 178 RQLMADLLTSLSFVGIGGRRSSGLGKFTWQEAPVPRAIVNRLAAGGSLHMTLSNALPADG 237
Query 224 ELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLG-G 282
L AALAGA+Y+LV+R GFVAS+ YA LRKRD+Y FAAGS F F G + DVS G G
Sbjct 238 ALSAALAGASYKLVRRGGFVASAAYAREQLRKRDLYVFAAGSCFKTRFAGQVADVSTGAG 297
Query 283 NHPVYSYARPLFLAL 297
+HPVY A+P ++ +
Sbjct 298 SHPVYRLAKPFWMEV 312
>gi|345284419|gb|AEN78272.1| CRISPR-associated protein, Csm4 family [Lactobacillus ruminis
ATCC 27782]
Length=306
Score = 170 bits (430), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/308 (39%), Positives = 161/308 (53%), Gaps = 24/308 (7%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +++R F R HFG L++S + A LYSALC+EA++ EL L
Sbjct 1 MTFKIYRMSFQRAHFGKGYLDTSDMLFDASRLYSALCLEAIKNDCLNEFTELAESDGFFL 60
Query 61 TDLLPYVGPDYLVPKPL-----HSVRSDG---SSMQKKLAKKIGFLPAAQLGSFLDGTAD 112
+D P+VG Y PKP+ VR +G + + KL +I +P +++ +F+ G AD
Sbjct 61 SDAFPHVGEPYF-PKPVCYPKKRMVRLEGLKEDNEKNKLTDRIVAVPMSEMSAFIRGEAD 119
Query 113 LKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLT 172
L Q ++ +K DPY VG F D +++LAT SE L+T
Sbjct 120 YSVLFEDQEGFCRSSIVV-------RKGEDPYEVGVTTFSQD--VYVLATQSELLDELMT 170
Query 173 RLLKGISALGGERTSGFGAFNLTESEAPAALTP--TVDAASLMTLTTSLPTDDELEAALA 230
L S LGG+R+SG+G F+L E P L D M LTT+LP D EL A+
Sbjct 171 SL--QYSGLGGKRSSGYGRFDLAIEELPEGLEEMLNTDGNEQMLLTTALPQDAELHQAMT 228
Query 231 GATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGN-HPVYSY 289
GA Y L K SGF S T A +RK+D+YKF AGSVF F+G I DV G HPVY++
Sbjct 229 GARYDLKKSSGFAYSET-AGQLVRKQDLYKFRAGSVFVNKFKGQIADVRPDGYPHPVYNF 287
Query 290 ARPLFLAL 297
A+ LFL L
Sbjct 288 AKGLFLDL 295
>gi|295105102|emb|CBL02646.1| CRISPR-associated RAMP protein, Csm4 family [Faecalibacterium
prausnitzii SL3/3]
Length=322
Score = 166 bits (420), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/327 (36%), Positives = 177/327 (55%), Gaps = 37/327 (11%)
Query 1 MNSRLFRFDFDR-THFGDH----GLESSTISCPADTLYSALCVEALRMGGQQLLGELVAC 55
MN L + FD HFG G +SS ++ ADT++SALC AL + G+ L EL+
Sbjct 1 MNYFLLKLAFDTAVHFGGSDSAVGSQSSALTLRADTIFSALCHTALEVYGEPALEELLVS 60
Query 56 S---TLRLTDLLPYVGPDYLVPKPLHSVRSDG--SSMQKKLAKKIGFLPAAQL----GSF 106
+ LR++D +P+ G + +PKP+ + S S++++K KK+ ++PA++ S
Sbjct 61 ADADALRISDAMPWRGDTFYLPKPIAASTSPAELSTVERKAVKKLAWIPASKFDRYTASL 120
Query 107 LDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLL-ATGSE 165
GT L EL G KA++ +G DA PY VG +R GL++L A
Sbjct 121 HTGTYPLDEL---DQSFGQAYEQTKASVTDGA-DAKPYFVGLYRLHAGCGLYVLCACEGN 176
Query 166 SELGLLTRL--LKGISALGGERTSGFGAFNLTESEAPAALTPTVDA-------------A 210
+L +L L L G+S +GG ++G+G F+L P L DA A
Sbjct 177 DQLKMLKELFTLLGLSGIGGRTSAGYGRFHLDGE--PICLNTAEDASLRWMLQALERGTA 234
Query 211 SLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRP 270
+ LT+SLPT +E++AAL GA ++L +R GF AS+ + + P++K+ Y AGSV
Sbjct 235 PYLLLTSSLPTGEEMDAALEGAAFQLARRGGF-ASTEWVETPVKKQTQYFLTAGSVLQHT 293
Query 271 FQGGILDVSLGGNHPVYSYARPLFLAL 297
+QG + DV +G HPVY Y++PLF+ +
Sbjct 294 YQGELCDVGIGVPHPVYRYSKPLFMGV 320
>gi|322387546|ref|ZP_08061155.1| csm4 family CRISPR-associated ramp protein [Streptococcus infantis
ATCC 700779]
gi|321141413|gb|EFX36909.1| csm4 family CRISPR-associated ramp protein [Streptococcus infantis
ATCC 700779]
Length=301
Score = 165 bits (418), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 115/313 (37%), Positives = 168/313 (54%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +++ +F HFG L+SS ++ AD L+SAL +EA +MG + L L
Sbjct 1 MTYKMYIMNFQSAHFGAGTLDSSKMTFAADRLFSALAIEAKKMGKMEEFVSLAGQDEFVL 60
Query 61 TDLLPY-VGPDYLVPKPLHSVRSDGSSM---------QKKLAKKIGFLPAAQLGSFLDGT 110
TD PY GP +PKP+ + D + Q K+AKK+ F+P + S+++GT
Sbjct 61 TDAFPYKSGP--FLPKPIGFPKFDQPDLTTDVKEVRRQAKMAKKLQFIPLDKFDSYVNGT 118
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
L E A HAV+ + D Y+V RF + L+++AT S+ L
Sbjct 119 --LFEDAE-------HAVTNIITKNQPHVDGHLYQVSTVRFADQSALYVIATESD----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALT---PTVDAASLMTLTTSLPTDDEL 225
L +L+ + + +GG+R+SG+G F+LT ++ P AL V +MTL TSLP + EL
Sbjct 166 LNQLMTSLQYTGIGGKRSSGYGRFDLTITDIPDALKNRLTKVHQGPVMTLATSLPVEKEL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ +Y L K SGF A ST + RK+D+YKFA+GS FS F G I+DV L H
Sbjct 226 EYAMETGSYLLSKSSGF-AFSTETNENYRKQDLYKFASGSTFSETFTGHIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V +YA+PLF +
Sbjct 285 EVLNYAKPLFFNM 297
>gi|270292489|ref|ZP_06198700.1| conserved hypothetical protein [Streptococcus sp. M143]
gi|270278468|gb|EFA24314.1| conserved hypothetical protein [Streptococcus sp. M143]
Length=301
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/314 (36%), Positives = 165/314 (53%), Gaps = 28/314 (8%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +++ +F HFG L+SS ++ AD L+SAL +EA +MG + + L
Sbjct 1 MTYKMYIMNFQSAHFGAGTLDSSKMTFAADRLFSALAIEAKKMGKMEEFVSIAGQDHFVL 60
Query 61 TDLLPYV-GPDYLVPKPLHSVRSDGSSM---------QKKLAKKIGFLPAAQLGSFLDGT 110
TD PY GP L+PKP+ + D + Q K+AKK+ F+P + S++ GT
Sbjct 61 TDAFPYQSGP--LLPKPIGFPKFDQPDLTTDVKEVRRQAKMAKKLQFIPLDKFDSYVKGT 118
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
E HAV+ + D + ++V RF D+ L+++A S+ L
Sbjct 119 LFEDE---------EHAVTNIITKNQPHVDGNLFQVSTVRFRDDSSLYVIANESDLLNEL 169
Query 171 LTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAAS---LMTLTTSLPTDDELEA 227
+T L + +GG+R+SG+G F+LT + P +L + +MTLTTSLP + ELE
Sbjct 170 MTSL--QYTGIGGKRSSGYGQFDLTILDLPDSLKNRLTKTHQEPVMTLTTSLPVEKELEY 227
Query 228 ALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNHPV 286
A+ +Y + K SGF A T + RK+D+YKFA+GS FS F G I+DV L H V
Sbjct 228 AMETGSYLISKSSGF-AFGTETNENYRKQDLYKFASGSTFSETFTGHIVDVRPLDFPHEV 286
Query 287 YSYARPLFLALPES 300
+YA+PLF + E
Sbjct 287 LNYAKPLFFKMEEE 300
>gi|322375483|ref|ZP_08049996.1| CRISPR-associated RAMP protein, Csm4 family [Streptococcus sp.
C300]
gi|321279746|gb|EFX56786.1| CRISPR-associated RAMP protein, Csm4 family [Streptococcus sp.
C300]
Length=301
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/313 (36%), Positives = 163/313 (53%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +++ +F HFG L+SS ++ AD L+SAL +EA +MG + L L
Sbjct 1 MTYKMYIMNFHTAHFGAGTLDSSKMTFAADRLFSALAIEAKKMGKMEEFVSLAGLDGFVL 60
Query 61 TDLLPYV-GPDYLVPKPLHSVRSDGSSM---------QKKLAKKIGFLPAAQLGSFLDGT 110
+D PY GP +PKP+ D + Q K+AKK+ F+P + S+++GT
Sbjct 61 SDAFPYQSGP--FLPKPIGFPTFDQPDLTTDVKEVRRQAKMAKKLQFIPLDKFDSYVNGT 118
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
K HAV+ + D Y+V R+ D+ L+++A SE L
Sbjct 119 L---------FKDAEHAVTNIVTKNQPHLDGALYQVSTVRYRDDSSLYVIANESE----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALT---PTVDAASLMTLTTSLPTDDEL 225
L L+ + + +GG+R+SG+G F+LT + P + V +MTLTTSLP + EL
Sbjct 166 LNELMASLQYTGIGGKRSSGYGQFDLTILDLPDSFKNRLTKVHQGPVMTLTTSLPVEKEL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ +Y L K SGF A ST + RK+D+YKFA+GS FS F G I+DV L H
Sbjct 226 EYAMETGSYLLSKSSGF-AFSTETNENYRKQDLYKFASGSTFSETFTGQIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V SYA+PLF +
Sbjct 285 EVLSYAKPLFFKM 297
>gi|339278116|emb|CCC19864.1| hypothetical protein STH8232_1165 [Streptococcus thermophilus
JIM 8232]
Length=299
Score = 158 bits (399), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/313 (37%), Positives = 159/313 (51%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L+ F HFG L+SS ++ AD ++SAL +EAL+MG L
Sbjct 1 MTYKLYIMTFQNAHFGSGTLDSSKLTFSADRIFSALVLEALKMGKLDAFLAEANQDKFTL 60
Query 61 TDLLPY-VGPDYLVPKPL---------HSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGT 110
TD P+ GP +PKP+ SV Q KL+KK+ FL + +L+G
Sbjct 61 TDAFPFQFGP--FLPKPIGYPKHDQIDQSVDVKEVRRQAKLSKKLQFLALENVDDYLNG- 117
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
EL + HAV + KD + Y+V RF D L+++A S+ L
Sbjct 118 ----ELFENED----HAVIDTVTKNQPHKDGNLYQVATTRFSNDTSLYVIANESD----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV---DAASLMTLTTSLPTDDEL 225
L L+ + S LGG+R+SGFG F L P L+ + + +M+LTT+LP D +L
Sbjct 166 LNELMSSLQYSGLGGKRSSGFGRFELDIQNIPLELSDRLTKNHSDKVMSLTTALPVDADL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ Y L K SGF A S + RK+D+YKFA+GS FS+ F+G I+DV L H
Sbjct 226 EEAMEDGHYLLTKSSGF-AFSHATNENYRKQDLYKFASGSTFSKTFEGQIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V +YA+PLF L
Sbjct 285 AVLNYAKPLFFKL 297
>gi|312278324|gb|ADQ62981.1| CRISPR-associated protein, Csm4 family [Streptococcus thermophilus
ND03]
Length=299
Score = 158 bits (399), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/313 (37%), Positives = 159/313 (51%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L+ F HFG L+SS ++ AD ++SAL +EAL+MG L
Sbjct 1 MTYKLYIMTFQNAHFGSGTLDSSKLTFSADRIFSALVLEALKMGKLDAFLAEANQDKFTL 60
Query 61 TDLLPY-VGPDYLVPKPL---------HSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGT 110
TD P+ GP +PKP+ SV Q KL+KK+ FL + +L+G
Sbjct 61 TDAFPFQFGP--FLPKPIGYPKHDQIDQSVDVKEVRRQAKLSKKLQFLALENVDDYLNG- 117
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
EL + HAV + KD + Y+V RF D L+++A S+ L
Sbjct 118 ----ELFENEE----HAVIDTVTKNQPHKDDNLYQVATTRFSNDTSLYVIANESD----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV---DAASLMTLTTSLPTDDEL 225
L L+ + S LGG+R+SGFG F L P L+ + + +M+LTT+LP D +L
Sbjct 166 LNELMSSLQYSGLGGKRSSGFGRFELDIQNIPLELSDRLTKNHSDKVMSLTTALPVDADL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ Y L K SGF A S + RK+D+YKFA+GS FS+ F+G I+DV L H
Sbjct 226 EEAMEDGHYLLTKSSGF-AFSHATNENYRKQDLYKFASGSTFSKTFEGQIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V +YA+PLF L
Sbjct 285 AVLNYAKPLFFKL 297
>gi|55820998|ref|YP_139440.1| hypothetical protein stu0963 [Streptococcus thermophilus LMG
18311]
gi|116627769|ref|YP_820388.1| hypothetical protein STER_0976 [Streptococcus thermophilus LMD-9]
gi|55736983|gb|AAV60625.1| conserved hypothetical protein [Streptococcus thermophilus LMG
18311]
gi|116101046|gb|ABJ66192.1| CRISPR-associated protein, Csm4 family [Streptococcus thermophilus
LMD-9]
Length=299
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/313 (36%), Positives = 159/313 (51%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L+ F HFG L+SS ++ AD ++SAL +E+L+MG L
Sbjct 1 MTYKLYIMTFQNAHFGSGTLDSSKLTFSADRIFSALVLESLKMGKLDAFLAEANQDKFTL 60
Query 61 TDLLPY-VGPDYLVPKPL---------HSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGT 110
TD P+ GP +PKP+ SV Q KL+KK+ FL + +L+G
Sbjct 61 TDAFPFQFGP--FLPKPIGYPKHDQIDQSVDVKEVRRQAKLSKKLQFLALENVDDYLNG- 117
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
EL + HAV + KD + Y+V RF D L+++A S+ L
Sbjct 118 ----ELFENEE----HAVIDTVTKNQPHKDGNLYQVATTRFSNDTSLYVIANESD----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV---DAASLMTLTTSLPTDDEL 225
L L+ + S LGG+R+SGFG F L P L+ + + +M+LTT+LP D +L
Sbjct 166 LNELMSSLQYSGLGGKRSSGFGRFELDIQNIPLELSDRLTKNHSDKVMSLTTALPVDADL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ Y L K SGF A S + RK+D+YKFA+GS FS+ F+G I+DV L H
Sbjct 226 EEAMEDGHYLLTKSSGF-AFSHATNENYRKQDLYKFASGSTFSKTFEGQIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V +YA+PLF L
Sbjct 285 AVLNYAKPLFFKL 297
>gi|55822917|ref|YP_141358.1| hypothetical protein str0963 [Streptococcus thermophilus CNRZ1066]
gi|55738902|gb|AAV62543.1| conserved hypothetical protein [Streptococcus thermophilus CNRZ1066]
Length=299
Score = 156 bits (395), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/313 (36%), Positives = 159/313 (51%), Gaps = 32/313 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L+ F HFG L+SS ++ AD ++SAL +E+L+MG L
Sbjct 1 MTYKLYIMTFQNAHFGSGTLDSSKLTFSADRIFSALVLESLKMGKLDAFLAEANQDKFTL 60
Query 61 TDLLPY-VGPDYLVPKPL---------HSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGT 110
TD P+ GP +PKP+ SV Q KL+KK+ FL + +++G
Sbjct 61 TDAFPFQFGP--FLPKPIGYPKHDQIDQSVDVKEVRRQAKLSKKLQFLALENVDDYING- 117
Query 111 ADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGL 170
EL + HAV + KD + Y+V RF D L+++A S+ L
Sbjct 118 ----ELFENEE----HAVIDTVTKNQPHKDGNLYQVATTRFSNDTSLYVIANESD----L 165
Query 171 LTRLLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV---DAASLMTLTTSLPTDDEL 225
L L+ + S LGG+R+SGFG F L P L+ + + +M+LTT+LP D +L
Sbjct 166 LNELMSSLQYSGLGGKRSSGFGRFELDIQNIPLELSDRLTKNHSDKVMSLTTALPVDADL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVS-LGGNH 284
E A+ Y L K SGF A S + RK+D+YKFA+GS FS+ F+G I+DV L H
Sbjct 226 EEAMEDGHYLLTKSSGF-AFSHATNENYRKQDLYKFASGSTFSKTFEGQIVDVRPLDFPH 284
Query 285 PVYSYARPLFLAL 297
V +YA+PLF L
Sbjct 285 AVLNYAKPLFFKL 297
>gi|334308472|gb|EGL99458.1| CRISPR-associated RAMP protein, Csm4 family [Lactobacillus salivarius
NIAS840]
Length=305
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/313 (31%), Positives = 160/313 (52%), Gaps = 29/313 (9%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M + ++ DF HFG+ L S S A LYS+L +E+L++ + L + L
Sbjct 1 MEVQAYKLDFQTVHFGNGNLNESIGSFNASRLYSSLFLESLKLNVDKEFLNLSKSANFFL 60
Query 61 TDLLPYVGPDYLVPK-------PLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADL 113
+D P ++ +PK PL+S + + + K +KK+ ++ + +++G D+
Sbjct 61 SDSFPLKDGEFYLPKPIGYPKIPLNSESTRETRRKAKRSKKLRYIKYTDMEDYVEGNCDV 120
Query 114 KELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTR 173
+L + V K I DPY VG F+ L++L + LL
Sbjct 121 DKLDGTDSFFSKSTVVTKKGI-------DPYEVGITNFK--TSLYILTIKHK----LLDM 167
Query 174 LLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV-----DAASLMTLTTSLPTDDELE 226
L+ + S +GG+R+SG+G F + + + P + + + MTL+TS+P +DEL+
Sbjct 168 LMNSLQYSGIGGKRSSGYGRFTVEKLDIPDEFSKNIVVNDSEYGVYMTLSTSIPNNDELD 227
Query 227 AALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG-NHP 285
+ L A Y L K SGF SST ++ LRK+D+YKFA G+ ++ + G I DV G +HP
Sbjct 228 SVLPTAEYLLEKSSGFAYSSTSRNL-LRKQDLYKFAVGTTLTKTYNGNIFDVRPDGFSHP 286
Query 286 VYSYARPLFLALP 298
V++YA+ LF LP
Sbjct 287 VWNYAKGLFYKLP 299
>gi|227890794|ref|ZP_04008599.1| CRISPR-associated Csm4 family protein [Lactobacillus salivarius
ATCC 11741]
gi|227867203|gb|EEJ74624.1| CRISPR-associated Csm4 family protein [Lactobacillus salivarius
ATCC 11741]
Length=305
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 99/313 (32%), Positives = 160/313 (52%), Gaps = 29/313 (9%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M + ++ DF HFG+ L S S A LYSAL +E+L++ + +L L
Sbjct 1 MEVQAYKLDFQTVHFGNGNLNESIGSFNASRLYSALFLESLKLNVDKEFLDLSKSDNFLL 60
Query 61 TDLLPYVGPDYLVPK-------PLHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADL 113
+D P ++ +PK PL+S + + + K +KK+ ++ + +++G D+
Sbjct 61 SDSFPLKDGEFYLPKPIGYPKMPLNSESTKETRRKTKKSKKLRYIKYTDIEDYVEGNCDV 120
Query 114 KELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLLTR 173
++L G+ + +K + KK DPY VG F+ L++L E LL
Sbjct 121 EKLD------GIDSFFSKNTVVT-KKGIDPYEVGITNFK--TSLYILTIKHE----LLDM 167
Query 174 LLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV-----DAASLMTLTTSLPTDDELE 226
L+ + S +GG+R+SG+G F + + + P + + + MTL TS+P +DEL+
Sbjct 168 LMNSLQYSGIGGKRSSGYGRFTVEKLDIPNEFSKNIVINDSEYGVYMTLNTSIPNNDELD 227
Query 227 AALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGG-NHP 285
L A Y L K SGF A ST + LRK+D+YKF G+ ++ + G I DV G HP
Sbjct 228 VVLPTAEYLLEKSSGF-AYSTASKSLLRKQDLYKFVVGTTLTKTYSGNIFDVRPDGFPHP 286
Query 286 VYSYARPLFLALP 298
V++YA+ LF LP
Sbjct 287 VWNYAKGLFYKLP 299
>gi|125718068|ref|YP_001035201.1| hypothetical protein SSA_1248 [Streptococcus sanguinis SK36]
gi|125497985|gb|ABN44651.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
Length=302
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/317 (34%), Positives = 159/317 (51%), Gaps = 33/317 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L++ DF HFG+ L+ S ++ A LYSAL +EA++ G L L
Sbjct 1 MAYQLYKLDFKSAHFGEGHLDDSVMTFTAARLYSALVLEAIKAGVLDEFENLSRQDEFVL 60
Query 61 TDLLPYVGPDYLVPKP-----LHSVRSDGSSMQK----KLAKKIGFLPAAQLGSFLDGTA 111
TD PY+G YL PKP L V + +++ K AKK+ F+ FL G +
Sbjct 61 TDAFPYMGAPYL-PKPIGYPLLDKVNRNEDIIKQREEAKKAKKLAFIKLGDFDCFLSGNS 119
Query 112 DLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLL 171
+ E + +++ K + +D + Y+V F+ + L+ +A S+ LL
Sbjct 120 IVGE------SLATKSINTK---NQPFQDGNLYQVASVHFD-KSSLYFIANQSD----LL 165
Query 172 TRLLKGI--SALGGERTSGFGAFNLTES-EAPAALTPTVDA---ASLMTLTTSLPTDDEL 225
RLL+ + S +GG+R+SG+G F L ++ + P + +M LTTSLP D EL
Sbjct 166 DRLLESLQFSGIGGKRSSGYGGFTLDKTVQQPLDFFKRLTVKYTGKVMALTTSLPVDSEL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGN-- 283
+ A+ Y L K SGF A S + RK+++Y F AGS FS+ + G I DV
Sbjct 226 KVAMNEGRYLLKKSSGF-AFSEETESNYRKQNLYTFKAGSTFSKTYNGQICDVKPSDEFP 284
Query 284 HPVYSYARPLFLALPES 300
H V+ YA+PLF L ES
Sbjct 285 HSVWHYAKPLFYILEES 301
>gi|114567268|ref|YP_754422.1| hypothetical protein Swol_1753 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338203|gb|ABI69051.1| CRISPR-associated protein, Csm4 family [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
Length=315
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/317 (32%), Positives = 157/317 (50%), Gaps = 24/317 (7%)
Query 1 MNSRLFRFDFDRT-HFG-DHG---LESSTISCPADTLYSALCVEALRMGGQQLLGELVAC 55
M L+R +F H G D G L+ + ADTL++ALC EA+R G L + A
Sbjct 1 MEHFLYRLNFSTALHIGKDAGGPSLDDGQMIIHADTLFAALCCEAVRGGRITQLVKYFAD 60
Query 56 STLRLTDLLPYVGPDYLVPKPL----HSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTA 111
L ++D LPY G + +P+P+ + R+ S++ K L K ++P + G +L
Sbjct 61 GILSISDALPYAGDEIFLPRPVLFTENRKRAGDSALAKALKNK-DYIPLSFFGDYLKSMQ 119
Query 112 DLK---ELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESEL 168
L E ++ G + AI G PY V +RF GL+++ + E
Sbjct 120 QLDFNLESLKSESDFGYLTTLTRVAI-KGNSPPLPYHVAAWRFADGCGLYIIVRSEQEEA 178
Query 169 -----GLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVD---AASLMTLTTSLP 220
LL L G+S +GG+++SG+G F + P L ++ A M + T+LP
Sbjct 179 RTMFASLLAEL--GLSGIGGKQSSGWGKFEVKPGPVPDELLRLLEDKQAEYQMLMGTALP 236
Query 221 TDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSL 280
D+EL+ L Y+L++R GF+ S +YA ++K+ IY A+GS F+G +LD+S
Sbjct 237 VDNELDTVLLNGWYKLLRRGGFIRSESYAARQMKKKTIYMLASGSCLRSRFKGAMLDLSD 296
Query 281 GGNHPVYSYARPLFLAL 297
G HPV+ LF+ +
Sbjct 297 NGAHPVWRCGNTLFVGV 313
>gi|325687527|gb|EGD29548.1| hypothetical protein HMPREF9381_1061 [Streptococcus sanguinis
SK72]
Length=302
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/317 (34%), Positives = 159/317 (51%), Gaps = 33/317 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L++ DF HFG+ L+ S ++ A LYSAL +EA++ G L L
Sbjct 1 MAYQLYKLDFKSAHFGEGHLDDSVMTFTAARLYSALVLEAIKAGVLDEFENLSRQDEFVL 60
Query 61 TDLLPYVGPDYLVPKP-----LHSVRSDGSSMQK----KLAKKIGFLPAAQLGSFLDGTA 111
TD PY+ YL PKP L V + +++ K AKK+ F+ FL G +
Sbjct 61 TDAFPYMEVPYL-PKPIGYPLLDKVNRNEDIIKQREEAKKAKKLAFIKLGDFDCFLSGNS 119
Query 112 DLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLL 171
+ E + +++ K + +D + Y+V F+ + L+ +A S+ LL
Sbjct 120 IVGE------SLATKSINTK---NQPFQDGNLYQVASVHFD-KSSLYFIANQSD----LL 165
Query 172 TRLLKGI--SALGGERTSGFGAFNLTES-EAPAALTPTVDA---ASLMTLTTSLPTDDEL 225
RLL+ + S +GG+R+SG+G F L ++ + P + +M LTTSLP D EL
Sbjct 166 DRLLESLQFSGIGGKRSSGYGGFTLDKTVQQPLDFFKRLTVKYTGKVMALTTSLPVDSEL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGN-- 283
+ A+ Y L K SGF A S + RK+++Y F AGS FS+ + G I DV +
Sbjct 226 KVAMNEGRYLLKKSSGF-AFSEETESNYRKQNLYTFKAGSTFSKTYNGQICDVKPSDDFP 284
Query 284 HPVYSYARPLFLALPES 300
H V+ YA+PLF L ES
Sbjct 285 HSVWHYAKPLFYILEES 301
>gi|327469967|gb|EGF15431.1| hypothetical protein HMPREF9386_0578 [Streptococcus sanguinis
SK330]
Length=302
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/317 (34%), Positives = 159/317 (51%), Gaps = 33/317 (10%)
Query 1 MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRL 60
M +L++ DF HFG+ L+ S ++ A LYSAL +EA++ G L L
Sbjct 1 MAYQLYKLDFKSAHFGEGHLDDSVMTFTAARLYSALVLEAIKAGVLDEFENLSLQDEFVL 60
Query 61 TDLLPYVGPDYLVPKP-----LHSVRSDGSSMQK----KLAKKIGFLPAAQLGSFLDGTA 111
TD PY+ YL PKP L V + +++ K AKK+ F+ FL G +
Sbjct 61 TDAFPYMEAPYL-PKPIGYPLLDKVNRNEDIIKQREEAKKAKKLAFIKLGDFDYFLSGNS 119
Query 112 DLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESELGLL 171
+ E + +++ K + ++ + Y+V F+ ++ L+ +A S+ LL
Sbjct 120 IVGE------SLATKSINTK---NQPFQEGNLYQVASVHFDKNS-LYFIANQSD----LL 165
Query 172 TRLLKGI--SALGGERTSGFGAFNLTES-EAPAALTPTVDA---ASLMTLTTSLPTDDEL 225
RLL+ + S +GG+R+SG+G F L ++ + P + +M LTTSLP D EL
Sbjct 166 DRLLESLQFSGIGGKRSSGYGGFTLDKTVQQPLDFFKRLTVKYTGKVMALTTSLPVDSEL 225
Query 226 EAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGN-- 283
+ A+ Y L K SGF A S + RK+++Y F AGS FS + G I DV
Sbjct 226 KVAMNEGRYLLKKSSGF-AFSEETESNYRKQNLYTFKAGSTFSETYNGQICDVKPSDEFP 284
Query 284 HPVYSYARPLFLALPES 300
HPV+ YA+PLF L ES
Sbjct 285 HPVWHYAKPLFYMLEES 301
>gi|296133518|ref|YP_003640765.1| CRISPR-associated RAMP protein, Csm4 family [Thermincola sp.
JR]
gi|296032096|gb|ADG82864.1| CRISPR-associated RAMP protein, Csm4 family [Thermincola potens
JR]
Length=326
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 99/302 (33%), Positives = 157/302 (52%), Gaps = 27/302 (8%)
Query 20 LESSTISCPADTLYSALCVEALRMGGQQLLGELVAC---STLRLTDLLPYVGPDYLVPKP 76
L ++C ADTL+SALC E + + GQ+ EL+A + + L+D+ P+ G + +PKP
Sbjct 26 LAQGKMACTADTLFSALCQEWIAVFGQKGFDELIAAVQGNQIFLSDMFPWCGLELYLPKP 85
Query 77 LH--SVR--SDGSSMQ-KKLAKKIGFLPAAQLG---SFLDGTADLKELAARQTKIGVHAV 128
SVR +D ++ +K KK+ ++P ++ FL DL L + G +
Sbjct 86 AMPPSVRRNTDVEGLKDRKALKKLVYIPVSRFADYIKFLHNGGDLPWLE-EIVEPGYEQL 144
Query 129 SAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSE---SELGLLTRLLKGISALGGER 185
+A I ++D PY V +RF+ ++GL+ + +E ++ L G++ +GG+R
Sbjct 145 LYRANIAR-EQDTVPYPVMVYRFKENSGLYFILRSTEYWRERFDIVVESL-GLTGIGGKR 202
Query 186 TSGFGAFNLTESEAPAALTPT---------VDAASLMTLTTSLPTDDELE-AALAGATYR 235
+SG G F L E L + DA M L+ P +EL A A + Y
Sbjct 203 SSGLGKFELAEESFETGLYDSDIMLEKMLLEDANLYMLLSVLSPGQEELGIAKQANSYYN 262
Query 236 LVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFL 295
LVKR+G+VAS YAD L+K+ + F AGS F +G +LDVS G HPVY Y + +++
Sbjct 263 LVKRTGYVASPDYADTWLKKKPVVMFGAGSCFPEKIRGRVLDVSDCGGHPVYRYGKGMYV 322
Query 296 AL 297
+
Sbjct 323 GV 324
>gi|225018978|ref|ZP_03708170.1| hypothetical protein CLOSTMETH_02929 [Clostridium methylpentosum
DSM 5476]
gi|224948258|gb|EEG29467.1| hypothetical protein CLOSTMETH_02929 [Clostridium methylpentosum
DSM 5476]
Length=319
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/309 (33%), Positives = 151/309 (49%), Gaps = 30/309 (9%)
Query 13 THFG-DHG---LESSTISCPADTLYSALCVEALRMGGQQLLGELV---ACSTLRLTDLLP 65
H G DHG LESS + +D+ +SALCVEA R GG +++ LV S + L+DLLP
Sbjct 15 VHIGPDHGQSPLESSMTTMHSDSFFSALCVEAARYGGSEMVENLVDDARQSRMVLSDLLP 74
Query 66 YVGPDYLVPKP---LHSVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTK 122
G + +PKP R + S +K KK+ F+P + + + K ++ K
Sbjct 75 CRGEELFLPKPNLPFVQARVEADSENRKEFKKMRFVPISMWTDYCKFCFEEKPFPLKKCK 134
Query 123 IGVHAVS-----AKAAIHNGKKDADPYRVGYFRFELDAGLW-LLATGSESELGLLTRLLK 176
+ S + A+ G +D +PY + F D+ L+ +L S +LL
Sbjct 135 EAMQGFSYPDQRQRVAV-EGSEDPEPYFLQATSFPPDSSLYCILGFDSGQTRSKWEQLLN 193
Query 177 GI--SALGGERTSGFGAFN------LTESEAPAALTPTVDAASLMTLTTSLPTDDELEAA 228
+ S +GG+R+SG+G F L + A DA +TL TSLPTD+EL +
Sbjct 194 SLAWSGIGGKRSSGWGKFEVSSPDPLENHQVLHAGLSAKDAPMWITLNTSLPTDEELRSI 253
Query 229 LAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYS 288
++Y L +R GFV Y +KR +Y F+AGS F+G +LD++ G VY
Sbjct 254 AQQSSYLLCRRGGFVDGEAY-----KKRTVYAFSAGSCVHTRFEGDVLDLAPKGRRAVYR 308
Query 289 YARPLFLAL 297
+P+ L +
Sbjct 309 MLKPILLGV 317
>gi|237741577|ref|ZP_04572058.1| CRISPR-associated protein [Fusobacterium sp. 4_1_13]
gi|229429225|gb|EEO39437.1| CRISPR-associated protein [Fusobacterium sp. 4_1_13]
Length=336
Score = 132 bits (333), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 84/317 (27%), Positives = 151/317 (48%), Gaps = 33/317 (10%)
Query 13 THFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRLTDLLPY-----V 67
T FG + LE + IS +DT YSAL E +++ L ++ L+DLLP+ +
Sbjct 18 TAFG-NTLEETMISVYSDTFYSALFNEYMKIYNNDELYKISESGDFLLSDLLPFKEKEDM 76
Query 68 GPDYLVPKPLHSV------RSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQT 121
D+ +PKP ++ + D + +K K F+PA +LG + + K
Sbjct 77 STDFYLPKPFINIERKEIKKDDEEKIDRKKVKATNFIPADKLGEYFSFLKNGKNFPEIDD 136
Query 122 KIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLAT--GSESELGLLTRLLKGIS 179
G + K + +D Y + F+F +GL+ + E + +L+ +S
Sbjct 137 NFGKKQLYTKNKVSLENEDTKLYNIEIFKFNEKSGLYFIVKLPKDEKWQRIFENILESLS 196
Query 180 --ALGGERTSGFGAFNLTE--------------SEAPAALTPTV--DAASLMTLTTSLPT 221
+GG++ SGFG F + E SE+ A + + + + +++ P
Sbjct 197 LTGIGGKKNSGFGQFTIKEDAMNFDGLDFEKFESESDAYINKALYSNEEKFLIISSYSPR 256
Query 222 DDELEAALAGATY-RLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSL 280
+E+E G Y +L+KRSGFV SS+Y++ +++ +Y ++GSV + +G ILD++L
Sbjct 257 IEEIEKLKDGNNYYQLIKRSGFVNSSSYSEQAEKRKQVYMLSSGSVLNFKPEGKILDLNL 316
Query 281 GGNHPVYSYARPLFLAL 297
G H +Y +P+ L +
Sbjct 317 HGKHSIYRMGKPIVLGV 333
>gi|294782690|ref|ZP_06748016.1| CRISPR-associated RAMP protein, Csm4 family [Fusobacterium sp.
1_1_41FAA]
gi|294481331|gb|EFG29106.1| CRISPR-associated RAMP protein, Csm4 family [Fusobacterium sp.
1_1_41FAA]
Length=334
Score = 126 bits (316), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 82/311 (27%), Positives = 148/311 (48%), Gaps = 35/311 (11%)
Query 20 LESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRLTDLLPY-----VGPDYLVP 74
LE + +S +DT YSA+ E +++ L ++ ++DLLP+ + D+ +P
Sbjct 24 LEETMMSVYSDTFYSAVFNEYMKIYNDDELYKISEAGEFLVSDLLPFKEKEDMSTDFYLP 83
Query 75 KPLHSV------RSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHAV 128
KP SV +++ + +K K F+PA +LG +L K G +
Sbjct 84 KPFISVQRQEIEKNEEEVVDRKKVKATNFIPADKLGEYLTFLKTGKNFPEIDDDFGKKEL 143
Query 129 SAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESE------LGLLTRLLKGISALG 182
K + +D Y + F+F +GL+ + E G+L L ++ +G
Sbjct 144 YTKNKVSLQNEDTKLYNIEVFKFNEKSGLYFIVKIPEDNRWQEIFQGVLDSL--ALTGIG 201
Query 183 GERTSGFG-------------AFNLTESEAPAALTPTV--DAASLMTLTTSLPTDDELEA 227
G+R SGFG F+ ESE+ A + + D + ++L++ P +E++
Sbjct 202 GKRNSGFGQFRREEPMFFDGETFDAIESESDAYINRGLYSDEKNFLSLSSYSPKKEEIDK 261
Query 228 ALAGATY-RLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPV 286
Y +L+KRSGFV SS Y++ +++ +Y ++GSV + +G ILD++L G H +
Sbjct 262 IKESENYYQLIKRSGFVNSSLYSEQAEKRKQVYMLSSGSVLTFKPEGKILDLNLHGKHSI 321
Query 287 YSYARPLFLAL 297
Y +P+ L +
Sbjct 322 YRMGKPIVLGV 332
>gi|340752432|ref|ZP_08689231.1| csm4 family CRISPR-associated ramp protein [Fusobacterium sp.
2_1_31]
gi|229422231|gb|EEO37278.1| csm4 family CRISPR-associated ramp protein [Fusobacterium sp.
2_1_31]
Length=334
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/309 (27%), Positives = 147/309 (48%), Gaps = 31/309 (10%)
Query 20 LESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRLTDLLPY-----VGPDYLVP 74
LE + +S +DT YSA+ E +++ L ++ ++DLLP+ + D+ +P
Sbjct 24 LEETMMSVYSDTFYSAIFNEYMKIYNDDELYKISEAGEFLVSDLLPFKEKEDMSTDFYLP 83
Query 75 KPLHSV------RSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHAV 128
KP SV +++ + +K K F+PA +LG +L K G +
Sbjct 84 KPFISVQRQEMGKNEEEVVDRKKVKATNFIPADKLGEYLTFLKTGKNFPEIDDDFGKKEL 143
Query 129 SAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGSESE--LGLLTRLLKGIS--ALGGE 184
K + +D Y + F+F +GL+ + E + +L+ +S +GG+
Sbjct 144 YTKNKVSLQNEDTKLYNIEVFKFNEKSGLYFIVKLPEDNEWQEIFENILESLSLTGIGGK 203
Query 185 RTSGFGAF-------------NLTESEAPAALTPTV--DAASLMTLTTSLPTDDELEAAL 229
R SGFG F + ESE+ A + + D ++L++ P +E+E
Sbjct 204 RNSGFGQFISEDPMFFDGEDFDAIESESDAYINKALYSDEEKYLSLSSYSPKIEEIEKIK 263
Query 230 AGATY-RLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYS 288
Y +L+KRSGFV SS Y++ +++ +Y ++GSV S +G ILD++L G H +Y
Sbjct 264 KSENYYQLIKRSGFVNSSLYSEQAEKRKQVYMLSSGSVLSFKPEGKILDLNLHGKHSIYR 323
Query 289 YARPLFLAL 297
+P+ L +
Sbjct 324 MGKPIVLGV 332
>gi|301299524|ref|ZP_07205793.1| putative CRISPR-associated RAMP protein, Csm4 family [Lactobacillus
salivarius ACS-116-V-Col5a]
gi|300852871|gb|EFK80486.1| putative CRISPR-associated RAMP protein, Csm4 family [Lactobacillus
salivarius ACS-116-V-Col5a]
Length=257
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/263 (32%), Positives = 132/263 (51%), Gaps = 29/263 (11%)
Query 51 ELVACSTLRLTDLLPYVGPDYLVPKPLHS-----VRSDGSSMQK--KLAKKIGFLPAAQL 103
+L L+D P ++ +PKP+ + MQ+ K +KK+ ++ +
Sbjct 8 DLSKSDNFFLSDSFPLKDGEFYLPKPIGYPKMPLISESTKEMQRNAKKSKKLQYIKYTDI 67
Query 104 GSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLATG 163
++ G D+ +L G+ + +K+ + KK DPY VG F+ L++L
Sbjct 68 EDYVKGNCDIGKLE------GISSFFSKSTVVT-KKGIDPYEVGITNFK--TSLYILTIK 118
Query 164 SESELGLLTRLLKGI--SALGGERTSGFGAFNLTESEAPAALTPTV-----DAASLMTLT 216
E LL L+ + S +GG+R+SG+G F + + + P + + + MTL
Sbjct 119 HE----LLDMLMNSLQYSGIGGKRSSGYGRFTIEKLDIPNEFSKNIVINDSEYGVYMTLN 174
Query 217 TSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGIL 276
TS+P +DEL+ L A Y L K SGF A ST + LRK+D+YKF GS ++ + G I
Sbjct 175 TSIPNNDELDVVLPTAEYLLEKSSGF-AYSTASKSLLRKQDLYKFVVGSTLTKTYSGNIF 233
Query 277 DVSLGG-NHPVYSYARPLFLALP 298
DV G HPV++YA+ LF LP
Sbjct 234 DVRPDGFPHPVWNYAKGLFYKLP 256
>gi|258645684|ref|ZP_05733153.1| CRISPR-associated RAMP protein, Csm4 family [Dialister invisus
DSM 15470]
gi|260403052|gb|EEW96599.1| CRISPR-associated RAMP protein, Csm4 family [Dialister invisus
DSM 15470]
Length=328
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/343 (32%), Positives = 163/343 (48%), Gaps = 61/343 (17%)
Query 1 MNSRLFRFDFDR-THFGD----HGLESSTISCPADTLYSALCVEALRMGGQQLLG---EL 52
M + F F HFGD GL C ADT +SALC EA + Q+LL E
Sbjct 1 MRHEIILFHFTSPVHFGDVAEGGGLGEILSYCRADTFFSALCREAADIS-QELLECVIEN 59
Query 53 VACSTLRLTDLLPYVGPDYL----VPKP-LHSVRSD-----------GSSMQKKLAKKIG 96
V LR++DL P+ ++ +P+P ++ SD S ++K KK
Sbjct 60 VRLGNLRVSDLFPWKKANHCYELYLPRPVMYQKNSDLVETLSYEEVRAQSGERKKHKKRS 119
Query 97 FLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAG 156
F+ A+++ +L G + Q G + + +N + + PY +G + F DAG
Sbjct 120 FIRASEMEIYLQGRD-----VSVQPDFGKEEIRTQ---YNAR-ERHPYGIGAYHFMPDAG 170
Query 157 LWLLATGSESELGLLTRLLK--GISALGGERTSGFGAFNLTESEAPAAL----TPTVDAA 210
L+ + +GSE L L+K G++ +GG+R+SGFG + + P AL T D
Sbjct 171 LYFILSGSEELAERLEPLIKLLGMAGIGGKRSSGFGKYIFEDD--PLALSDEDTYGGDDV 228
Query 211 SL------------MTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADM--PLRKR 256
SL M+L++ LP E++ AG T +++KR GF S DM +
Sbjct 229 SLYKMLCADHSDCYMSLSSFLPEKSEVKDVSAG-TGKIIKRGGFAWSR---DMISAAKTN 284
Query 257 DIYKFAAGSVFSRPFQGGILDVSLGGN-HPVYSYARPLFLALP 298
+Y A+G+ FS+ +G I DV+ G HPVY Y R LF+ LP
Sbjct 285 SVYMIASGACFSKRLEGRIADVNNGSAPHPVYKYGRGLFVGLP 327
>gi|121533437|ref|ZP_01665265.1| CRISPR-associated RAMP protein, Csm4 family [Thermosinus carboxydivorans
Nor1]
gi|121307996|gb|EAX48910.1| CRISPR-associated RAMP protein, Csm4 family [Thermosinus carboxydivorans
Nor1]
Length=341
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/341 (30%), Positives = 154/341 (46%), Gaps = 46/341 (13%)
Query 1 MNSRLFRFDF-DRTHFGDHG----LESSTISCPADTLYSALCVEA--LRMGGQQLLGELV 53
M RL++ F FGD G L+ + ++ ADTL+SALC EA L Q L
Sbjct 1 MLFRLYKLRFLTPVRFGDDGAAAGLDQARLAGRADTLFSALCSEAAMLSAAAPQRLAAAA 60
Query 54 ACSTLRLTDLLPYVGPD-------------YLVPKPLHSVRSDGSSMQKKLAKKIGFLPA 100
A L +TDL PY G + P + + + ++KKL K+ +LP
Sbjct 61 ADGNLLVTDLFPYRGETLYLPRPLLPPDTAWQQPAAGRTASATAAHIKKKL-NKLPYLPV 119
Query 101 AQLGSFLDG--TADLKELAARQTK-IGVHAVSAKAAIH-----NGKKDADPYRVGYFRFE 152
L ++L T D + I + ++ +A+ +G++ PY V ++F
Sbjct 120 RLLPAYLYWLRTGDSSQFDLDNANAIALSDIAKFSAVRVNARVDGERQTLPYVVTQYQFG 179
Query 153 LDAGLWLLATGSESEL-GLLTRLLKGIS--ALGGERTSGFGAFNLTES------------ 197
GL+ +A ++ +L + RL+ +S +GG+R++G G F L E
Sbjct 180 AGCGLYFVAAAADQDLLDWIDRLIASLSYSGIGGKRSAGLGKFELAEDPIDMDETGVYAD 239
Query 198 -EAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKR 256
A +L DA M L+ P +E+ A LA Y L R GFVAS YA ++
Sbjct 240 DAALYSLLTATDADWYMALSCLWPLPNEV-ALLADGFYSLTARGGFVASPAYAPSAVKHH 298
Query 257 DIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLAL 297
++ AAGS G + D++ GG HPVY Y +PL+ L
Sbjct 299 SVHMLAAGSCLKAKAAGQVGDLAAGGKHPVYRYGKPLYAGL 339
>gi|323141260|ref|ZP_08076156.1| CRISPR-associated RAMP protein, Csm4 family [Phascolarctobacterium
sp. YIT 12067]
gi|322414217|gb|EFY05040.1| CRISPR-associated RAMP protein, Csm4 family [Phascolarctobacterium
sp. YIT 12067]
Length=337
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/326 (30%), Positives = 143/326 (44%), Gaps = 47/326 (14%)
Query 13 THFGDHG----LESSTISCPADTLYSALCVEALRMGGQ--QLLGELVACSTLRLTDLLPY 66
HFGD L+ ++ C ADTL++ALC EA G + L + A + + L PY
Sbjct 15 VHFGDTANGGSLDKFSLQCSADTLFAALCNEAANKGSDAVETLVKKTAEGKIVFSSLFPY 74
Query 67 ---VGPD--YLVPKPLHSVRSDGSSMQK------------KLAKKIGFLPAAQLGSFLDG 109
V D + +PKPL + D K K KK ++ A+Q+ S L+
Sbjct 75 CRTVDDDLYFYLPKPLLKLEQDEQQSAKSFEEIKQLATKLKKQKKSTYIRASQINSLLES 134
Query 110 TADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLW-LLATGSESEL 168
+ ++ A + V+ + A+ K PY VG + F +GL+ +L E E
Sbjct 135 GSSDRQFAVPE--FAAPLVAGRVALREEK--PLPYYVGSYVFSEHSGLYFILGVEHEEEF 190
Query 169 GLLTRLL--KGISALGGERTSGFGAFNLTESE--------------APAALTPTVDAASL 212
L+ LL G S +GG+R+SG+G F L + E A A + +
Sbjct 191 TLIKELLLSLGYSGIGGKRSSGYGKFELADDELELFDDGGVYDDDTAIALMLYNEKSKYQ 250
Query 213 MTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQ 272
M L P DEL A + Y+L+KR GF+ SS D +++ IY GS F
Sbjct 251 MCLAPVCPKADEL-AVVKQGRYKLIKRGGFITSSAAKD-NIKRNSIYMLQEGSCFPERLC 308
Query 273 GGILDVSLGG-NHPVYSYARPLFLAL 297
G +L ++ G H VY +F+ L
Sbjct 309 GQMLQQTVDGLAHDVYRDGIGMFVGL 334
>gi|341822664|emb|CCC73588.1| CRISPR-associated RAMP protein [Megasphaera elsdenii DSM 20460]
Length=336
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/341 (28%), Positives = 156/341 (46%), Gaps = 51/341 (14%)
Query 1 MNSRLFRFDFDR-THFG--DHG--LESSTISCPADTLYSALCVEALRMGGQQ---LLGEL 52
M L+ FD HFG ++G LE S+++ AD+L+SALC E G ++ L E
Sbjct 1 MKYALYPLQFDTPVHFGCAENGGKLEQSSLNYRADSLFSALCYELSLQGDEKGLTHLQEA 60
Query 53 VACSTLRLTDLLPYVGPD-----YLVPKPLHSVRSDG------------SSMQKKLAKKI 95
+ L +DL PY+ D VPKP+ S+ ++ + Q+K KK+
Sbjct 61 IVKGKLVFSDLFPYIYDDTEELQLYVPKPILSIPAESRQETVDYDTFRRQATQQKRQKKL 120
Query 96 GFLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDA 155
++ +QL F+ K + + G + + ++ + PY V +F +A
Sbjct 121 SYIRISQLADFIQAMKAGKSFCSDEPCFGCGQLLTR--VNCTEAVPRPYYVHQIQFTEEA 178
Query 156 GLW-LLATGSESELGLLTRLLK--GISALGGERTSGFGAFNLTES----------EAPAA 202
GL+ L+ + + L LL+ G S +GG+R+SG+G F+ + E
Sbjct 179 GLYGLVGYEDDEDWDWLQSLLELLGFSGIGGKRSSGYGKFHFRDDPIDMDELGVYEDDRI 238
Query 203 LTPTV---DAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIY 259
L + +A+ M L+ LPT E+ G Y L +RSGF++ +K DIY
Sbjct 239 LYKGLTMSNASMYMCLSVLLPTPAEVIDVQEG-QYALCRRSGFLSPD--GGRMQKKNDIY 295
Query 260 KFAAGSVFSRPFQGGILDVSLGGN---HPVYSYARPLFLAL 297
AGS F + G + +V GG HPV+ Y + L++ +
Sbjct 296 MIQAGSCFPKKLAGCMAEV--GGQDAVHPVWRYGKGLYVGV 334
>gi|334126728|ref|ZP_08500676.1| Csm4 family CRISPR-associated RAMP protein [Centipeda periodontii
DSM 2778]
gi|333391138|gb|EGK62259.1| Csm4 family CRISPR-associated RAMP protein [Centipeda periodontii
DSM 2778]
Length=333
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/343 (28%), Positives = 147/343 (43%), Gaps = 58/343 (16%)
Query 1 MNSRLFRFDFDR-THFGDHG----LESSTISCPADTLYSALCVEALRMGGQQLLGEL--- 52
M+ ++ FD HF G L+ + AD L+SALC E G L L
Sbjct 1 MSYVIYPLQFDTAVHFAQAGRGGRLDEAGTEYGADALFSALCAELAATGEMDALAHLHER 60
Query 53 VACSTLRLTDLLPY----VGP-DYLVPKPLHSVRSDGSS-----------MQKKLAKKIG 96
VA L +DLLP+ VG ++ VP+P+ + G + ++K K +
Sbjct 61 VAARELLFSDLLPWRVDAVGEMEFYVPRPVLRIEGTGEARANFVETCSRATERKKQKSMK 120
Query 97 FLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAG 156
+L A+ + ++D ++ A + S + ++ + + PY VG F F DAG
Sbjct 121 YLRASCMADYVDA---MRTGAPFSSAAEFGTASLRQRVNTREAEPLPYYVGQFDFHRDAG 177
Query 157 LWLLATGSESELGLLTR---LLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASL- 212
L+LLA +E R + G+S +GG+RTSGFG F++ E E +DA +
Sbjct 178 LYLLAYVHHAEDADFLRELLIWLGLSGIGGKRTSGFGKFHIAEDEI------VLDADGIY 231
Query 213 ------------------MTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLR 254
MT+ +P +EL GA YRL + GF+ + A +
Sbjct 232 ADDAALYALLHAADAPWQMTIAPVVPAAEELPRVKDGA-YRLRRTGGFITAP--AHEAEK 288
Query 255 KRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLAL 297
K IY AGS G + ++ HPV+ Y L+L +
Sbjct 289 KNSIYLIDAGSCLRTRIGGSLAELCTYDGHPVWRYGFGLYLGV 331
>gi|313894815|ref|ZP_07828375.1| CRISPR-associated RAMP protein, Csm4 family [Selenomonas sp.
oral taxon 137 str. F0430]
gi|312976496|gb|EFR41951.1| CRISPR-associated RAMP protein, Csm4 family [Selenomonas sp.
oral taxon 137 str. F0430]
Length=331
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 100/330 (31%), Positives = 150/330 (46%), Gaps = 49/330 (14%)
Query 1 MNSRLFRFDFDR-THFG--DHG--LESSTISCPADTLYSALCVEALRMGGQQLLGELV-A 54
M L++ FD HFG + G L ++ +S P+DTL+SALC E G ++ L +
Sbjct 1 MAYYLYQLAFDAPVHFGMAEQGGSLAAAGMSYPSDTLFSALCCELAAAGEEERLRAFIEK 60
Query 55 CST--LRLTDLLPYVGPD-----YLVPKPLHSVRSDGS-----------SMQKKLAKKIG 96
C + L+DLLPY D YL+ L +++ + + S +K KKI
Sbjct 61 CRAGDIILSDLLPYREEDGEVHYYLLKPTLAAMQPNAAPPENLTGARAQSGARKQMKKIA 120
Query 97 FLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAG 156
+L A++L +LD R + + ++ K + +PY VG F G
Sbjct 121 YLRASRLTEYLDAQQSGNPFDER-GEFCTYGLTTKVNC----RAQEPYPVGSISFHAKCG 175
Query 157 LW-LLATGSESELGLLTRLLK--GISALGGERTSGFGAFNLTE---------SEAPAALT 204
L+ +L E + +T L G+S +GG+R++G+G F+L + SE AAL
Sbjct 176 LYAVLYLRDEDDAARMTELFTYLGLSGIGGKRSAGWGKFHLEDDPYDLVDAFSEDDAALH 235
Query 205 ---PTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKF 261
+A M L++ LP D E AL TY L KRSGF+ + A +K +Y
Sbjct 236 AFLENENAPRQMLLSSLLP-DPEDVGALRAGTYELHKRSGFI-TQVGAGGVRKKHSVYML 293
Query 262 AAGSVFSRPFQGGILDVSLGGNHPVYSYAR 291
+AGS R G I V G P Y R
Sbjct 294 SAGSCLPRRISGTIAHV---GKEPGYEVLR 320
>gi|292669139|ref|ZP_06602565.1| csm4 family CRISPR-associated ramp protein [Selenomonas noxia
ATCC 43541]
gi|292649191|gb|EFF67163.1| csm4 family CRISPR-associated ramp protein [Selenomonas noxia
ATCC 43541]
Length=336
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 85/306 (28%), Positives = 135/306 (45%), Gaps = 40/306 (13%)
Query 20 LESSTISCPADTLYSALCVEALRMGGQQLLG---ELVACSTLRLTDLLPYVGPDY----- 71
L+ + + PAD L+ ALC E G + LG E V LRL+DLLP+ ++
Sbjct 25 LDEACMEYPADALFGALCAELAASGETEELGRLAETVERGDLRLSDLLPWQRREHDGALA 84
Query 72 -LVPKPLHSVR-------------SDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELA 117
+P+P+ + + ++M+KK KK+ ++ A ++ ++
Sbjct 85 LFLPRPVLRIEHTQAQEREDYRSTCENATMRKK-QKKLKYIRACRMQDYICAMQSGNPFE 143
Query 118 ARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLLA-TGSESELGLLTRLLK 176
R+ S + ++ ++ PY V F F +AGL+L+A E + L RLL
Sbjct 144 DREFDANFGMESLRQRVNRRGEEPLPYYVAQFDFRTEAGLYLIACVRDEKTISWLHRLLV 203
Query 177 --GISALGGERTSGFGAFNLTE--------SEAPAALTPTVDAAS---LMTLTTSLPTDD 223
G + +GG+RTSG+G F E AAL + A S + L LPT D
Sbjct 204 WLGTAGIGGKRTSGYGKFRAGEIIHMDADDGGDIAALRDMLAADSAPWQLALAPVLPTAD 263
Query 224 ELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGN 283
+L GA YRL + GF++ ++ +K +Y AGS F G +
Sbjct 264 DLATVKRGA-YRLRRAGGFISYPVHS--AEKKNSVYLLDAGSCFPARISGTCGMLGTHDG 320
Query 284 HPVYSY 289
HPV+ Y
Sbjct 321 HPVWRY 326
>gi|312899097|ref|ZP_07758475.1| CRISPR-associated RAMP protein, Csm4 family [Megasphaera micronuciformis
F0359]
gi|310619764|gb|EFQ03346.1| CRISPR-associated RAMP protein, Csm4 family [Megasphaera micronuciformis
F0359]
Length=334
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/341 (27%), Positives = 143/341 (42%), Gaps = 54/341 (15%)
Query 1 MNSRLFRFDFDR-THFGD----HGLESSTISCPADTLYSALCVEALRMGGQQLLGELVAC 55
M S L++ FD HFG GLES + +D L+S+LC E G + +LV
Sbjct 2 MVSYLYQLKFDTPVHFGTIEAGDGLESVVYTYESDRLFSSLCCELAENGDSNKILDLVGA 61
Query 56 S---TLRLTDLLPYVGPD----YLVPKPLHSVRSDGS------------SMQKKLAKKIG 96
+ L L+DL P+ D +PKP+ S+ + S S+ KK KK+
Sbjct 62 ADSGKLILSDLFPFAKVDGDCRLYIPKPVLSIERNESDEIRSYTEACKASVAKKTNKKLE 121
Query 97 FLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAG 156
F+ +++ ++ + G ++ + + K PY VG F F + G
Sbjct 122 FIRVSKVDEYIQSLKSNRAFT-EDDDFGASWLTEQVSCRGDK--PLPYYVGTFVFAKNTG 178
Query 157 LWLLATGSES----ELGLLTRLLKGISALGGERTSGFGAFNLTES-----EAPAALTPTV 207
L+ + G E L LL +L G+S +GG+R+SG+G F + E P P
Sbjct 179 LYGIIQGEERIVQRSLDLLEQL--GLSGIGGKRSSGYGKFRFHDDPFILDEEP----PYD 232
Query 208 DAASLMTLTTSLPTDDELEAALAGAT-----------YRLVKRSGFVASSTYADMPLRKR 256
DA L + +L Y L KRSGFV+S + + ++
Sbjct 233 DAIELKRRMDDKTAQWHMNMSLLIPDIGDIDDIKKGFYSLKKRSGFVSSLGFGYVH-KRH 291
Query 257 DIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLAL 297
DIY A+GS F G + + H VY + L+L +
Sbjct 292 DIYGIASGSCFRNQPAGKVAAMRTDTGHDVYRNGKALYLGV 332
>gi|344997572|ref|YP_004799915.1| CRISPR-associated RAMP protein, Csm4 family [Caldicellulosiruptor
lactoaceticus 6A]
gi|343965791|gb|AEM74938.1| CRISPR-associated RAMP protein, Csm4 family [Caldicellulosiruptor
lactoaceticus 6A]
Length=336
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/304 (29%), Positives = 136/304 (45%), Gaps = 42/304 (13%)
Query 29 ADTLYSALCVEALRMGGQQLLGELV-----ACSTLRLTDLLPYVGPDYLVPKP----LHS 79
+DTL S + + G EL+ ++ +PYV ++ VPKP LH
Sbjct 35 SDTLMSGIINAYSLLYGNSSTNELLDGFLRKSPPFEVSSTMPYVQGEFFVPKPAGLNLHH 94
Query 80 VRSDG--SSMQKKLAKKIGFLPAAQL-----------GSFLDGTADLKELAARQTKIGVH 126
+ +G K KKI F+ L GSFL L + + I +
Sbjct 95 YKDEGKIEVENDKELKKIKFIRENDLLYNFPDKYKAAGSFLLPRDMLYKFVESKKHISLG 154
Query 127 AVS--AKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGS----ESELGLLTRLLKGISA 180
V A+ +I ++ Y +F FE AGLW + E ++ RLL G
Sbjct 155 KVKERARVSIDRLSSSSNIYYFSHFEFEESAGLWFYLRINDQSLEEKIKAAIRLL-GDEG 213
Query 181 LGGERTSGFGAF--NLTESEAPAALTPTVDAASLMTLTTSLP-TDDELEAALAGATYRLV 237
LGG+RT G G+F N ES P A M+L+ P ++DE+++A+ +Y ++
Sbjct 214 LGGDRTCGLGSFEANFEESSMPEE---NDSAKYYMSLSLVNPQSEDEIKSAI---SYEIL 267
Query 238 KRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLG--GNHPVYSYARPLFL 295
RSG++ S A + ++++ + F+ G+VFS G ++DV+ H VY +A L
Sbjct 268 TRSGYIYSK--AGLGIKRKALRVFSEGTVFSGKVCGRVVDVTPQKFSQHRVYCFALAFLL 325
Query 296 ALPE 299
LPE
Sbjct 326 PLPE 329
>gi|312794662|ref|YP_004027585.1| crispr-associated ramp protein, csm4 family [Caldicellulosiruptor
kristjanssonii 177R1B]
gi|312181802|gb|ADQ41972.1| CRISPR-associated RAMP protein, Csm4 family [Caldicellulosiruptor
kristjanssonii 177R1B]
Length=336
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/304 (29%), Positives = 135/304 (45%), Gaps = 42/304 (13%)
Query 29 ADTLYSALCVEALRMGGQQLLGELV-----ACSTLRLTDLLPYVGPDYLVPKP----LHS 79
+DTL S + + G EL+ ++ +PYV ++ VPKP LH
Sbjct 35 SDTLMSGIINAYSLLYGNSSTNELLDGFLRKSPPFEVSSTMPYVQGEFFVPKPVGLNLHH 94
Query 80 VRSDG--SSMQKKLAKKIGFLPAAQL-----------GSFLDGTADLKELAARQTKIGVH 126
+ +G K KKI F+ L GSFL L + + I +
Sbjct 95 YKDEGKIEVENDKELKKIKFIRENDLLYNFPDKYKVAGSFLLPKDMLYKFVESKKAISLG 154
Query 127 AVS--AKAAIHNGKKDADPYRVGYFRFELDAGLWLLATGS----ESELGLLTRLLKGISA 180
V A+ +I ++ Y +F FE AGLW + E ++ RLL G
Sbjct 155 KVKERARVSIDRLSSSSNIYYFSHFEFESSAGLWFYLRINDQSLEEKIKAAIRLL-GDEG 213
Query 181 LGGERTSGFGAF--NLTESEAPAALTPTVDAASLMTLTTSLP-TDDELEAALAGATYRLV 237
LGG+RT G G+F N ES P A M+L+ P ++DE++ A+ +Y ++
Sbjct 214 LGGDRTCGLGSFEANFEESSMPEE---NDSAKYYMSLSLVNPQSEDEIKNAI---SYEIL 267
Query 238 KRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLG--GNHPVYSYARPLFL 295
RSG++ S A + ++++ + F+ G+VFS G ++DV+ H VY +A L
Sbjct 268 TRSGYIYSK--AGLGIKRKAVRVFSEGTVFSGKVCGRVVDVTPQKFSQHRVYCFALAFLL 325
Query 296 ALPE 299
LPE
Sbjct 326 PLPE 329
>gi|345303032|ref|YP_004824934.1| CRISPR-associated RAMP protein, Csm4 family [Rhodothermus marinus
SG0.5JP17-172]
gi|345112265|gb|AEN73097.1| CRISPR-associated RAMP protein, Csm4 family [Rhodothermus marinus
SG0.5JP17-172]
Length=325
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/309 (28%), Positives = 137/309 (45%), Gaps = 36/309 (11%)
Query 19 GLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLH 78
G ES + + P+DTL+SA+CV A + G + + L+ +RL+ P+VG +Y P+PL
Sbjct 22 GEESVSPTVPSDTLFSAVCVSAFWLYGAEGVERLLKPGAVRLSSTFPFVGTEYFFPRPLS 81
Query 79 ---SVRSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTK------------- 122
+ ++ + K++ KK+ +L L+GT E+ + Q +
Sbjct 82 FFPKIPNEQYELLKRI-KKVRYLSRTLFEQVLEGTQ--PEIRSDQIRGLFWFAGPPPDEP 138
Query 123 IGVHAVSAKAAIHNGKKDADPYRVG--YFRFELDAGLWLLATGSESELGLLTR---LLKG 177
+ V + A+ + ++ Y +F LDAGL+ LA + ++ L L
Sbjct 139 VMQTEVRPRVALDRVTQASEIYHFAEVHFNPRLDAGLFFLAQFEDPKVQQLFESALALLA 198
Query 178 ISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLV 237
+G +RT G G F E + D A L++L P ++ + A + Y LV
Sbjct 199 DEGIGADRTMGKGWFRWEREELTIRVPEATDRAVLLSLYNPTP-EEAVAIAPYDSCYALV 257
Query 238 KRSGFVASSTYADMPLRKRDIYKFAAGSV---FSRPFQGGILDVSLGGN------HPVYS 288
R G+V + M LR+R + FA GSV S+ G L V L HPVY
Sbjct 258 TRRGWV--TVPGAMTLRRRPVRFFAEGSVLRFISQHMPQGRLVVVLSEKDAPELTHPVYR 315
Query 289 YARPLFLAL 297
+ L L +
Sbjct 316 NGQALALPI 324
Lambda K H
0.319 0.135 0.391
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 505781532318
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40