BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2824c
Length=314
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609961|ref|NP_217340.1| hypothetical protein Rv2824c [Mycob... 635 2e-180
gi|31794000|ref|NP_856493.1| hypothetical protein Mb2848c [Mycob... 631 4e-179
gi|308405981|ref|ZP_07494637.2| hypothetical protein TMLG_01306 ... 534 8e-150
gi|308232254|ref|ZP_07415441.2| CRISPR-associated protein Cas6 [... 489 3e-136
gi|308374707|ref|ZP_07667852.1| hypothetical protein TMFG_00014 ... 486 1e-135
gi|308371144|ref|ZP_07423970.2| hypothetical protein TMCG_02068 ... 465 4e-129
gi|315925062|ref|ZP_07921279.1| tm1814 family CRISPR-associated ... 154 1e-35
gi|253578041|ref|ZP_04855313.1| conserved hypothetical protein [... 150 2e-34
gi|345284423|gb|AEN78276.1| CRISPR-associated RAMP superfamily p... 149 5e-34
gi|240143675|ref|ZP_04742276.1| CRISPR-associated protein Cas6 [... 144 2e-32
gi|331004044|ref|ZP_08327526.1| CRISPR-associated protein cas6 [... 143 4e-32
gi|291539919|emb|CBL13030.1| CRISPR-associated protein Cas6 [Ros... 142 6e-32
gi|125718072|ref|YP_001035205.1| hypothetical protein SSA_1252 [... 137 3e-30
gi|55820994|ref|YP_139436.1| hypothetical protein stu0959 [Strep... 137 3e-30
gi|116627766|ref|YP_820385.1| CRISPR-associated RAMP superfamily... 137 3e-30
gi|270292485|ref|ZP_06198696.1| putative CRISPR-associated prote... 137 3e-30
gi|224543487|ref|ZP_03684026.1| hypothetical protein CATMIT_0269... 136 3e-30
gi|114567266|ref|YP_754420.1| hypothetical protein Swol_1751 [Sy... 136 4e-30
gi|322387542|ref|ZP_08061151.1| hypothetical protein HMPREF9423_... 136 4e-30
gi|322375487|ref|ZP_08050000.1| CRISPR-associated protein Cas6 [... 136 5e-30
gi|339278112|emb|CCC19860.1| hypothetical protein STH8232_1161 [... 136 5e-30
gi|327474436|gb|EGF19842.1| hypothetical protein HMPREF9391_0562... 136 6e-30
gi|325696577|gb|EGD38467.1| hypothetical protein HMPREF9384_1724... 135 9e-30
gi|327469963|gb|EGF15427.1| hypothetical protein HMPREF9386_0574... 135 9e-30
gi|325687532|gb|EGD29553.1| hypothetical protein HMPREF9381_1066... 134 1e-29
gi|229826457|ref|ZP_04452526.1| hypothetical protein GCWU000182_... 133 4e-29
gi|291460042|ref|ZP_06599432.1| CRISPR-associated protein Cas6 [... 126 3e-27
gi|323141258|ref|ZP_08076154.1| putative CRISPR-associated endor... 119 4e-25
gi|121533435|ref|ZP_01665263.1| conserved hypothetical protein [... 114 2e-23
gi|334126726|ref|ZP_08500674.1| hypothetical protein HMPREF9081_... 113 4e-23
gi|296133516|ref|YP_003640763.1| Protein of unknown function DUF... 112 9e-23
gi|342213934|ref|ZP_08706647.1| putative CRISPR-associated endor... 112 1e-22
gi|164688462|ref|ZP_02212490.1| hypothetical protein CLOBAR_0210... 111 2e-22
gi|292669137|ref|ZP_06602563.1| conserved hypothetical protein [... 110 2e-22
gi|227890790|ref|ZP_04008595.1| conserved hypothetical protein [... 108 1e-21
gi|334308468|gb|EGL99454.1| CRISPR-associated protein Cas6 [Lact... 106 4e-21
gi|329736388|gb|EGG72657.1| CRISPR-associated endoribonuclease C... 104 2e-20
gi|258645682|ref|ZP_05733151.1| CRISPR-associated protein Cas6 [... 103 4e-20
gi|57865878|ref|YP_189998.1| hypothetical protein SERP2455 [Stap... 102 6e-20
gi|339893268|emb|CCB52456.1| CRISPR associated protein [Staphylo... 100 2e-19
gi|341822666|emb|CCC73590.1| putative uncharacterized protein [M... 97.8 2e-18
gi|340752434|ref|ZP_08689233.1| hypothetical protein FSAG_00290 ... 88.6 1e-15
gi|237741579|ref|ZP_04572060.1| conserved hypothetical protein [... 88.2 2e-15
gi|294792435|ref|ZP_06757582.1| putative CRISPR-associated prote... 88.2 2e-15
gi|339890608|gb|EGQ79709.1| hypothetical protein HMPREF9094_1266... 87.8 2e-15
gi|295105100|emb|CBL02644.1| Uncharacterized conserved protein (... 87.4 3e-15
gi|294782688|ref|ZP_06748014.1| CRISPR-associated protein Cas6 [... 86.7 4e-15
gi|289549406|ref|YP_003470310.1| CRISPR-associated protein Cas6 ... 86.7 5e-15
gi|269798856|ref|YP_003312756.1| hypothetical protein Vpar_1801 ... 86.3 6e-15
gi|315641547|ref|ZP_07896616.1| CRISPR-associated protein cas6 [... 85.9 9e-15
>gi|15609961|ref|NP_217340.1| hypothetical protein Rv2824c [Mycobacterium tuberculosis H37Rv]
gi|15842365|ref|NP_337402.1| hypothetical protein MT2891 [Mycobacterium tuberculosis CDC1551]
gi|121638703|ref|YP_978927.1| hypothetical protein BCG_2843c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
49 more sequence titles
Length=314
Score = 635 bits (1639), Expect = 2e-180, Method: Compositional matrix adjust.
Identities = 313/314 (99%), Positives = 314/314 (100%), Gaps = 0/314 (0%)
Query 1 LAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEHLSRL 60
+AARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEHLSRL
Sbjct 1 MAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEHLSRL 60
Query 61 TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW 120
TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW
Sbjct 61 TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW 120
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR 180
KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR
Sbjct 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR 180
Query 181 KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA 240
KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA
Sbjct 181 KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA 240
Query 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR
Sbjct 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
Query 301 VQPLAPREKCVPKP 314
VQPLAPREKCVPKP
Sbjct 301 VQPLAPREKCVPKP 314
>gi|31794000|ref|NP_856493.1| hypothetical protein Mb2848c [Mycobacterium bovis AF2122/97]
gi|31619594|emb|CAD95033.1| HYPOTHETICAL PROTEIN Mb2848c [Mycobacterium bovis AF2122/97]
Length=314
Score = 631 bits (1628), Expect = 4e-179, Method: Compositional matrix adjust.
Identities = 312/314 (99%), Positives = 313/314 (99%), Gaps = 0/314 (0%)
Query 1 LAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEHLSRL 60
+AARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTE LSRL
Sbjct 1 MAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEPLSRL 60
Query 61 TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW 120
TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW
Sbjct 61 TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW 120
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR 180
KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR
Sbjct 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR 180
Query 181 KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA 240
KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA
Sbjct 181 KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA 240
Query 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR
Sbjct 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
Query 301 VQPLAPREKCVPKP 314
VQPLAPREKCVPKP
Sbjct 301 VQPLAPREKCVPKP 314
>gi|308405981|ref|ZP_07494637.2| hypothetical protein TMLG_01306 [Mycobacterium tuberculosis SUMu012]
gi|308364946|gb|EFP53797.1| hypothetical protein TMLG_01306 [Mycobacterium tuberculosis SUMu012]
gi|323718581|gb|EGB27748.1| hypothetical protein TMMG_03691 [Mycobacterium tuberculosis CDC1551A]
Length=262
Score = 534 bits (1375), Expect = 8e-150, Method: Compositional matrix adjust.
Identities = 262/262 (100%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 53 MTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALA 112
MTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALA
Sbjct 1 MTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALA 60
Query 113 RSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARI 172
RSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARI
Sbjct 61 RSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARI 120
Query 173 FYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEF 232
FYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEF
Sbjct 121 FYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEF 180
Query 233 GQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKA 292
GQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKA
Sbjct 181 GQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKA 240
Query 293 SMGMGAIRVQPLAPREKCVPKP 314
SMGMGAIRVQPLAPREKCVPKP
Sbjct 241 SMGMGAIRVQPLAPREKCVPKP 262
>gi|308232254|ref|ZP_07415441.2| CRISPR-associated protein Cas6 [Mycobacterium tuberculosis SUMu001]
gi|308369870|ref|ZP_07419348.2| hypothetical protein TMBG_02961 [Mycobacterium tuberculosis SUMu002]
gi|308372261|ref|ZP_07428010.2| hypothetical protein TMDG_00008 [Mycobacterium tuberculosis SUMu004]
11 more sequence titles
Length=240
Score = 489 bits (1258), Expect = 3e-136, Method: Compositional matrix adjust.
Identities = 239/240 (99%), Positives = 240/240 (100%), Gaps = 0/240 (0%)
Query 75 VATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV 134
+ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV
Sbjct 1 MATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV 60
Query 135 GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 194
GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ
Sbjct 61 GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 120
Query 195 SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR 254
SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR
Sbjct 121 SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR 180
Query 255 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 314
VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP
Sbjct 181 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 240
>gi|308374707|ref|ZP_07667852.1| hypothetical protein TMFG_00014 [Mycobacterium tuberculosis SUMu006]
gi|308341011|gb|EFP29862.1| hypothetical protein TMFG_00014 [Mycobacterium tuberculosis SUMu006]
Length=240
Score = 486 bits (1252), Expect = 1e-135, Method: Compositional matrix adjust.
Identities = 238/240 (99%), Positives = 239/240 (99%), Gaps = 0/240 (0%)
Query 75 VATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV 134
+A LGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV
Sbjct 1 MAILGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIV 60
Query 135 GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 194
GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ
Sbjct 61 GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 120
Query 195 SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR 254
SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR
Sbjct 121 SGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR 180
Query 255 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 314
VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP
Sbjct 181 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 240
>gi|308371144|ref|ZP_07423970.2| hypothetical protein TMCG_02068 [Mycobacterium tuberculosis SUMu003]
gi|308375474|ref|ZP_07444043.2| hypothetical protein TMGG_02048 [Mycobacterium tuberculosis SUMu007]
gi|308379328|ref|ZP_07668948.1| hypothetical protein TMJG_01808 [Mycobacterium tuberculosis SUMu010]
gi|308329705|gb|EFP18556.1| hypothetical protein TMCG_02068 [Mycobacterium tuberculosis SUMu003]
gi|308346200|gb|EFP35051.1| hypothetical protein TMGG_02048 [Mycobacterium tuberculosis SUMu007]
gi|308357397|gb|EFP46248.1| hypothetical protein TMJG_01808 [Mycobacterium tuberculosis SUMu010]
gi|339295665|gb|AEJ47776.1| hypothetical protein CCDC5079_2586 [Mycobacterium tuberculosis
CCDC5079]
gi|339299281|gb|AEJ51391.1| hypothetical protein CCDC5180_2554 [Mycobacterium tuberculosis
CCDC5180]
Length=228
Score = 465 bits (1196), Expect = 4e-129, Method: Compositional matrix adjust.
Identities = 228/228 (100%), Positives = 228/228 (100%), Gaps = 0/228 (0%)
Query 87 MESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFR 146
MESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFR
Sbjct 1 MESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFR 60
Query 147 LRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRL 206
LRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRL
Sbjct 61 LRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRL 120
Query 207 VFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTV 266
VFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTV
Sbjct 121 VFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTV 180
Query 267 RGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 314
RGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP
Sbjct 181 RGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP 228
>gi|315925062|ref|ZP_07921279.1| tm1814 family CRISPR-associated protein [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621961|gb|EFV01925.1| tm1814 family CRISPR-associated protein [Pseudoramibacter alactolyticus
ATCC 23263]
Length=254
Score = 154 bits (390), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/250 (37%), Positives = 124/250 (50%), Gaps = 13/250 (5%)
Query 57 LSRLTLTLEVDAPLERARVATLG----PHLHGVLMESIPADYVQTLHTVPVNPYSQYALA 112
LS L L L+ D+P G +L GVLME I +D Q LH +PYSQ L
Sbjct 3 LSELILDLKADSP-------NFGYYQSSNLQGVLMEWIASDDAQALHRQRRHPYSQ-CLL 54
Query 113 RSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARI 172
R W I T A +QI+ PI + F L + V SR+ + P +
Sbjct 55 REDGQWRWHIRTTNQRANEQIIQPILSRNVSEFELTGKPMKISVLSRAYREIPQEKLLAH 114
Query 173 FYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG-EEPDPGLIAE 231
FY R +R + F + TAFKQSG Y +PD RL+FQSL QKY A D E D + +
Sbjct 115 FYQRDYSRYLHMAFQSATAFKQSGRYQIFPDVRLIFQSLMQKYSASNDMIEMADEKTLEQ 174
Query 232 FGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIK 291
+ + +R+ S F + +P F G+ T V+G A Y L FGEFSG G+K
Sbjct 175 LCRESEIVQYRLRSVKFPMEGMAIPAFMGTVTIKVKGASAMAKYARMLAEFGEFSGVGVK 234
Query 292 ASMGMGAIRV 301
++MGMGA+ +
Sbjct 235 SAMGMGAMHL 244
>gi|253578041|ref|ZP_04855313.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850359|gb|EES78317.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length=246
Score = 150 bits (379), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/224 (36%), Positives = 123/224 (55%), Gaps = 10/224 (4%)
Query 81 HLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDA 140
+L GV+ME+I +Y LH +NPYSQ + R S W I TL EA + I+ P+++
Sbjct 21 NLQGVIMENISPEYAARLHGNQLNPYSQ-CITRENNSTIWTIKTLNEEAYENIIMPLSEC 79
Query 141 AFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVF 200
LR G++ V ++ + ++ FY + + ++F TPTAFK G+YV
Sbjct 80 T--DIFLRKKGLSISVCNKRMHLKNDNELITEFYEKKCPKYLEIKFQTPTAFKSDGKYVI 137
Query 201 WPDPRLVFQSLAQKYGAIVDG----EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVP 256
+PD L++ SL +KY A+ + +E + E + VR +R+ + PF + ++
Sbjct 138 YPDLGLIYASLMRKYSAVSEAFDMFDEETLEALVEQSEIVR---YRLQTVPFPLEKVQIT 194
Query 257 GFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
GFTGS +RG +T A Y+ L FGEF+G GIK MGMGA++
Sbjct 195 GFTGSICIHIRGPETMARYLRMLFKFGEFAGVGIKTGMGMGAMK 238
>gi|345284423|gb|AEN78276.1| CRISPR-associated RAMP superfamily protein [Lactobacillus ruminis
ATCC 27782]
Length=255
Score = 149 bits (376), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 83/247 (34%), Positives = 132/247 (54%), Gaps = 6/247 (2%)
Query 57 LSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTT 116
+ +L L + ++ R + +LHG LM + D+ LH +NP S +
Sbjct 1 MKKLLLKCRRECDIDDCRESV---YLHGWLMNHLDDDFASELHQAGMNPLS-IQVVHDEE 56
Query 117 SLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIAT-QVTSRSLEQNPLSQFARIFYA 175
S+ + I+ LT +A ++ I + R+ + ++ +++ + S ++IFY
Sbjct 57 SVSFIINLLTGKACNEVEPLIMSDSRNMIRINSGNQHEFEIIEKAVFERSESDLSKIFYG 116
Query 176 RPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYG-AIVDGEEPDPGLIAEFGQ 234
++ +++ +TPTAFK +G+YVF PD RLVFQ+L +KYG A GE+ D L+ E
Sbjct 117 NDCSKVLKLKIMTPTAFKTNGKYVFLPDVRLVFQNLMKKYGCAFEKGEDIDFELLDEICS 176
Query 235 SVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASM 294
V ++AF + S F + A V GF G T G T +YIA LL F E+SG G+K SM
Sbjct 177 KVEVAAFSLKSRRFYLHKAYVNGFQGYLTLVCHGSQTLTNYIAMLLKFAEYSGIGVKTSM 236
Query 295 GMGAIRV 301
GMGA+R+
Sbjct 237 GMGAVRI 243
>gi|240143675|ref|ZP_04742276.1| CRISPR-associated protein Cas6 [Roseburia intestinalis L1-82]
gi|257204352|gb|EEV02637.1| CRISPR-associated protein Cas6 [Roseburia intestinalis L1-82]
Length=248
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/241 (31%), Positives = 121/241 (51%), Gaps = 7/241 (2%)
Query 64 LEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKIS 123
LE+ E+ + HG LME +P +Y LH ++PY+Q+ R + W I+
Sbjct 5 LELKLKCEKELTYQMSSLFHGALMELLPEEYADYLHISSLHPYAQHLECREG-NWYWVIT 63
Query 124 TLTNEARQQIVGPINDAAFA--GFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRK 181
L EA + I I D + ++ + + ++ + + FY R
Sbjct 64 GLNKEAVKII---IQDTLWKIEYILIKKHDLKVLIVKKNYMETTYKELMDHFYEDDGKRY 120
Query 182 FRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKY-GAIVDGEEPDPGLIAEFGQSVRLSA 240
++ FL+PTAFKQ+G Y+F+PD R VFQSL KY A + D + + + ++
Sbjct 121 IQIHFLSPTAFKQNGRYLFYPDLRCVFQSLMNKYDSATAENTMHDEDTLEQICEHAQVIR 180
Query 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
+ + S F++ R+P F G T + G DT A+++ L FGE+SG GIK S+GMG ++
Sbjct 181 YDLKSVSFSLEGVRIPSFIGKITIKLHGTDTMANFVNMLFEFGEYSGVGIKTSLGMGYMK 240
Query 301 V 301
+
Sbjct 241 I 241
>gi|331004044|ref|ZP_08327526.1| CRISPR-associated protein cas6 [Lachnospiraceae oral taxon 107
str. F0167]
gi|330411630|gb|EGG91038.1| CRISPR-associated protein cas6 [Lachnospiraceae oral taxon 107
str. F0167]
Length=243
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/241 (32%), Positives = 126/241 (53%), Gaps = 6/241 (2%)
Query 61 TLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEW 120
+L +E++ +++R LG G +ME+I +YV+ LH ++PYSQY + + L W
Sbjct 4 SLRIELEGEFDKSRNDLLGSLFQGFIMENIDVEYVEELHVSTLHPYSQY-ITLNNNKLIW 62
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGI-ATQVTSRSLEQNPLSQFARIFYARPET 179
++TL EA+++I + + + + + VT +++ L + Y +
Sbjct 63 TLNTLNAEAKEKIADILKNKKIIDIKHKDREYKVSSVTEKNISYKDL---VKECYLKDGQ 119
Query 180 RKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYG-AIVDGEEPDPGLIAEFGQSVRL 238
R+ ++ FLTPT+FKQ G+Y +P RL+FQSL K+ A E ++ F + V +
Sbjct 120 RRLKITFLTPTSFKQDGKYAIFPSVRLIFQSLMMKFDKASTQMEVFGKDILETFEKHVEI 179
Query 239 SAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGA 298
S +++ S F + +VP F G T V+G + LL FG +SG GIK +GMG
Sbjct 180 SMYKLRSTSFHLDGTKVPAFIGDITIVVKGPVQLVNLANMLLTFGTYSGVGIKTGIGMGG 239
Query 299 I 299
I
Sbjct 240 I 240
>gi|291539919|emb|CBL13030.1| CRISPR-associated protein Cas6 [Roseburia intestinalis XB6B4]
Length=248
Score = 142 bits (358), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 73/241 (31%), Positives = 119/241 (50%), Gaps = 7/241 (2%)
Query 64 LEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKIS 123
LE+ E+ + HG LME +P Y LH ++PY+Q+ R + W I+
Sbjct 5 LELKLKCEKELTYQMSSLFHGALMELLPEKYADYLHISSLHPYAQHLECREG-NWYWVIT 63
Query 124 TLTNEARQQIVGPINDA--AFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRK 181
L EA + I I D ++ + + ++ + + FY R
Sbjct 64 GLNKEAVKII---IQDTLWKLEYILIKKHDLKVLIVKKNYMETTYKELMDHFYEDDGKRY 120
Query 182 FRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKY-GAIVDGEEPDPGLIAEFGQSVRLSA 240
++ FL+PTAFKQ+G Y+F+PD R VFQSL KY A + D + + + ++
Sbjct 121 IQIHFLSPTAFKQNGRYLFYPDLRCVFQSLMNKYDSATAENTMHDEDTLEQICEHAQVIR 180
Query 241 FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
+ + S F++ ++P F G T + G DT A+++ L FGE+SG GIK S+GMG ++
Sbjct 181 YDLKSVSFSLEGVKIPSFIGKITIKLHGTDTMANFVNMLFEFGEYSGVGIKTSLGMGYMK 240
Query 301 V 301
+
Sbjct 241 I 241
>gi|125718072|ref|YP_001035205.1| hypothetical protein SSA_1252 [Streptococcus sanguinis SK36]
gi|125497989|gb|ABN44655.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
Length=244
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/259 (31%), Positives = 129/259 (50%), Gaps = 25/259 (9%)
Query 51 RRMTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA 110
+++ HLS+++L + L L G LME + D+ LH NPYS
Sbjct 2 KKIRLHLSKVSL-----------KDDDLVCKLQGFLMEKLSDDFASFLHQQETNPYSMNL 50
Query 111 LARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQ-- 168
+ S+ W ++ L+ EA QQI+ + ++ + ++ +++E LS
Sbjct 51 RSEREESI-WTVNLLSEEAEQQILPQLLSLEM----IKLETYSEEILVKNIEIQSLSSQS 105
Query 169 FARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG----EEP 224
+F + + F TPT FK+ G++V +PD RL+FQSL QKY +V+G EE
Sbjct 106 LLEVFQGDEASHLISLNFYTPTTFKRQGQFVLFPDTRLIFQSLMQKYSRLVEGKAEIEEE 165
Query 225 DPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGE 284
+AE Q +S++R+ S F + + P F G T ++G T +Y LL FGE
Sbjct 166 TLEFLAEHSQ---ISSYRLKSHYFPIHGRKYPAFEGRVTIRIQGASTLKAYAQMLLRFGE 222
Query 285 FSGCGIKASMGMGAIRVQP 303
+SG G K S+GMG +R++
Sbjct 223 YSGVGAKCSLGMGGMRIEE 241
>gi|55820994|ref|YP_139436.1| hypothetical protein stu0959 [Streptococcus thermophilus LMG
18311]
gi|55736979|gb|AAV60621.1| hypothetical protein stu0959 [Streptococcus thermophilus LMG
18311]
gi|312278320|gb|ADQ62977.1| CRISPR-associated protein, Cas6 family [Streptococcus thermophilus
ND03]
Length=243
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/253 (31%), Positives = 123/253 (49%), Gaps = 21/253 (8%)
Query 57 LSRLTLTLE-VDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARST 115
+ +L T + +D P L HG LME + +DYV LH NPY+ + +
Sbjct 1 MKKLVFTFKRIDHP-----AQDLAVKFHGFLMEQLDSDYVDYLHQQQTNPYAT-KVIQGK 54
Query 116 TSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFA----- 170
+ +W + LT++ I D F + S+E+ + +
Sbjct 55 ENTQWVVHLLTDD--------IEDKVFMTLLQIKEVSLNDLPKLSVEKVEIQELGADKLL 106
Query 171 RIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGE-EPDPGLI 229
IF + F + F TPT FK G YV +P RL+FQSL QKYG +V+ + E + +
Sbjct 107 EIFNSEENQTYFSIIFETPTGFKSQGSYVIFPSMRLIFQSLMQKYGRLVENQPEIEEDTL 166
Query 230 AEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCG 289
+ ++ +R+ ++ F V R+P F G TF V+G T +Y+ LL FGE+SG G
Sbjct 167 DYLSEHSTITNYRLETSYFRVHRQRIPAFRGKLTFKVQGAQTLKAYVKMLLTFGEYSGLG 226
Query 290 IKASMGMGAIRVQ 302
+K S+GMG I+++
Sbjct 227 MKTSLGMGGIKLE 239
>gi|116627766|ref|YP_820385.1| CRISPR-associated RAMP superfamily protein [Streptococcus thermophilus
LMD-9]
gi|116101043|gb|ABJ66189.1| CRISPR-associated protein, Cas6 family [Streptococcus thermophilus
LMD-9]
Length=243
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/253 (31%), Positives = 123/253 (49%), Gaps = 21/253 (8%)
Query 57 LSRLTLTLE-VDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARST 115
+ +L T + +D P L HG LME + +DYV LH NPY+ + +
Sbjct 1 MKKLVFTFKRIDHP-----AQDLAVKFHGFLMEQLDSDYVDYLHQQQTNPYAT-KVIQGK 54
Query 116 TSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFA----- 170
+ +W + LT++ I D F + S+E+ + +
Sbjct 55 ENTQWVVHLLTDD--------IEDKVFMTLLQIKEVSLNDLPKLSVEKVEIQELGTDKLL 106
Query 171 RIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGE-EPDPGLI 229
IF + F + F TPT FK G YV +P RL+FQSL QKYG +V+ + E + +
Sbjct 107 EIFNSEENQTYFSIIFETPTGFKSQGSYVIFPSMRLIFQSLMQKYGRLVENQPEIEEDTL 166
Query 230 AEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCG 289
+ ++ +R+ ++ F V R+P F G TF V+G T +Y+ LL FGE+SG G
Sbjct 167 DYLSEHSTITNYRLETSYFRVHRQRIPAFRGKLTFKVQGAKTLKAYVKMLLTFGEYSGLG 226
Query 290 IKASMGMGAIRVQ 302
+K S+GMG I+++
Sbjct 227 MKTSLGMGGIKLE 239
>gi|270292485|ref|ZP_06198696.1| putative CRISPR-associated protein Cas6 [Streptococcus sp. M143]
gi|270278464|gb|EFA24310.1| putative CRISPR-associated protein Cas6 [Streptococcus sp. M143]
Length=243
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/229 (33%), Positives = 121/229 (53%), Gaps = 7/229 (3%)
Query 76 ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVG 135
+ L G LME++ DYV LH NPYS + + +L W + LT+EA +QI+
Sbjct 16 SDLSTKFQGFLMENLEPDYVTWLHEQETNPYSLKIIHQKDKTL-WSLHLLTDEAVKQILP 74
Query 136 PINDAAFAGFRLRASGIAT-QVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 194
+ + ++ + T V S S++ Q F + + + F TPT F+
Sbjct 75 VLLELK----KVELHDLPTLMVESLSMQDLSSEQLFEFFNENQDRSLYTIHFQTPTGFRS 130
Query 195 SGEYVFWPDPRLVFQSLAQKYGAIVDG-EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAA 253
GEYV +P RL+FQSL KY +V+ ++ + + + R++++R+ S+ F V
Sbjct 131 QGEYVLFPTMRLIFQSLMMKYARLVENRQDIEEETLDYLVKHSRVTSYRLESSYFKVHGK 190
Query 254 RVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQ 302
++PGF G TF + G +T +Y LL FGE+SG G+K S+GMG + ++
Sbjct 191 KIPGFRGKLTFKITGPNTLKAYANMLLKFGEYSGLGMKTSLGMGGLELE 239
>gi|224543487|ref|ZP_03684026.1| hypothetical protein CATMIT_02696 [Catenibacterium mitsuokai
DSM 15897]
gi|224523614|gb|EEF92719.1| hypothetical protein CATMIT_02696 [Catenibacterium mitsuokai
DSM 15897]
Length=247
Score = 136 bits (343), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/224 (33%), Positives = 111/224 (50%), Gaps = 8/224 (3%)
Query 82 LHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAA 141
G L E + DYV LH +PYSQY + + + W I T +++ ++ PI D +
Sbjct 22 FQGALFELMDTDYVSILHQQNRHPYSQY-VYKDKDKVYWTICTCDDDSSHYMMNPILDDS 80
Query 142 FAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFW 201
L S+ L+ Q FY +P R + LTP +FK G Y+ +
Sbjct 81 IQQISLNKEKEPISFVSKQLKMVSQEQLMDHFYNKPAERYLEIRILTPMSFKSYGRYINY 140
Query 202 PDPRLVFQSLAQKYGAIVDG----EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPG 257
PD RL++QSL KY +++ +E ++ E + V+ + + S F + ++P
Sbjct 141 PDLRLIYQSLMNKYDSVLKEASMFDEDTLDMLVEGSEIVK---YNLRSYLFPLQGVKIPS 197
Query 258 FTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRV 301
F G+ T V DT A +I LL FGE+SG GIK +GMGAI++
Sbjct 198 FFGTMTIKVTSTDTAAKFIRLLLEFGEYSGVGIKTGLGMGAIQI 241
>gi|114567266|ref|YP_754420.1| hypothetical protein Swol_1751 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338201|gb|ABI69049.1| CRISPR-associated protein, Cas6 family [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
Length=249
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/228 (31%), Positives = 113/228 (50%), Gaps = 2/228 (0%)
Query 79 GPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPIN 138
G HG+L++S+P+D + LH + P+SQY L+ S L W I E I+ +
Sbjct 23 GSLFHGILVKSLPSDIAEMLHENHLRPFSQYVLSSSNQELTWNIGLWDAEIANHIIQAVL 82
Query 139 DAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEY 198
+ +A+ + RS QN F F R++ +EFLTP KQ G Y
Sbjct 83 PLVQIELQHKATTLEVTGVKRS-SQNEYEYFNHYFATENPCRRYEIEFLTPCTHKQDGSY 141
Query 199 VFWPDPRLVFQSLAQKYGAIVDGEEPD-PGLIAEFGQSVRLSAFRVASAPFAVGAARVPG 257
V +P P L+ +SL +Y A + D P + + + + + + + SA F + ++ G
Sbjct 142 VLFPTPELIVKSLNNRYCAFMQDVSLDAPEAMEQIAKHIHIVRYSLHSAVFYLERTKITG 201
Query 258 FTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLA 305
+ G T + G + A ALL F E+SG GIK ++GMG ++++ LA
Sbjct 202 YMGRITVVISGTEQLARLAGALLSFAEYSGLGIKTALGMGGVKIRALA 249
>gi|322387542|ref|ZP_08061151.1| hypothetical protein HMPREF9423_0549 [Streptococcus infantis
ATCC 700779]
gi|321141409|gb|EFX36905.1| hypothetical protein HMPREF9423_0549 [Streptococcus infantis
ATCC 700779]
Length=243
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/226 (33%), Positives = 119/226 (53%), Gaps = 5/226 (2%)
Query 78 LGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPI 137
L G LME++ DYV LH NPYS + + +L W + LT+EA +QI+ +
Sbjct 18 LSTKFQGFLMENLEPDYVTWLHEQETNPYSLKIIHQKDKTL-WSLHLLTDEAVKQILPVL 76
Query 138 NDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGE 197
+ + + V+ + L L +F F + + + F TPT F+ GE
Sbjct 77 LELKKVELHDLPTLMVESVSMQDLSSEQLFEF---FNENQDRSLYTIHFQTPTGFRSQGE 133
Query 198 YVFWPDPRLVFQSLAQKYGAIVDG-EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVP 256
YV +P RL+FQSL KY +V+ ++ + + + R++++R+ S+ F V ++P
Sbjct 134 YVLFPTMRLIFQSLMMKYARLVENRQDIEEETLDYLVKHSRVTSYRLESSYFKVHGKKIP 193
Query 257 GFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQ 302
GF G TF + G +T +Y LL FGE+SG G+K S+GMG + ++
Sbjct 194 GFRGKLTFKITGPNTLKAYANMLLKFGEYSGLGMKTSLGMGGLELE 239
>gi|322375487|ref|ZP_08050000.1| CRISPR-associated protein Cas6 [Streptococcus sp. C300]
gi|321279750|gb|EFX56790.1| CRISPR-associated protein Cas6 [Streptococcus sp. C300]
Length=243
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/229 (33%), Positives = 120/229 (53%), Gaps = 7/229 (3%)
Query 76 ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVG 135
+ L G LME + DYV LH NPYS + + +L W + LT+EA +QI+
Sbjct 16 SDLSTKFQGFLMEKLEPDYVTWLHEQETNPYSLKIIHQKDKTL-WSLHLLTDEAVKQILP 74
Query 136 PINDAAFAGFRLRASGIAT-QVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 194
+ + R+ + T V S S++ Q F + + + F TPT F+
Sbjct 75 VLLELK----RVELHDLPTLMVESLSMQDLSSEQLFEFFNENQDRSLYTICFQTPTGFRS 130
Query 195 SGEYVFWPDPRLVFQSLAQKYGAIVDG-EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAA 253
GEYV +P RL+FQSL KY +V+ ++ + + + R++++R+ S+ F V
Sbjct 131 QGEYVLFPTMRLIFQSLMMKYARLVENRQDIEEETLDYLVKHSRITSYRLESSYFKVHGK 190
Query 254 RVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQ 302
++PGF G TF + G +T +Y LL FGE+SG G+K S+GMG + ++
Sbjct 191 KIPGFRGRLTFKITGPNTLKAYANMLLKFGEYSGIGMKTSLGMGGLELE 239
>gi|339278112|emb|CCC19860.1| hypothetical protein STH8232_1161 [Streptococcus thermophilus
JIM 8232]
Length=243
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 76/248 (31%), Positives = 122/248 (50%), Gaps = 11/248 (4%)
Query 57 LSRLTLTLE-VDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARST 115
+ +L T + +D P L HG LME + +DYV LH NPY+ + +
Sbjct 1 MKKLVFTFKRIDHP-----AQDLAVKFHGFLMEQLDSDYVDYLHQQQTNPYAT-KVIQGK 54
Query 116 TSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYA 175
+ +W + LT++ ++ + +V + L + L IF +
Sbjct 55 ENTQWVVHLLTDDHEDKVFMTLLQIKEVSLNDLPKLSVEKVEIQELGADKL---LEIFNS 111
Query 176 RPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGE-EPDPGLIAEFGQ 234
F + F TPT FK G YV +P RL+FQSL QKYG +V+ + E + + +
Sbjct 112 EENQTYFSIIFETPTGFKSQGSYVIFPSMRLIFQSLMQKYGRLVENQPEIEEDTLDYLSE 171
Query 235 SVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASM 294
++ +R+ ++ F V R+P F G TF V+G T +Y+ LL FGE+SG G+K S+
Sbjct 172 HSTITNYRLETSYFRVHRQRIPAFRGKLTFKVQGAKTLKAYVKMLLTFGEYSGLGMKTSL 231
Query 295 GMGAIRVQ 302
GMG I+++
Sbjct 232 GMGGIKLE 239
>gi|327474436|gb|EGF19842.1| hypothetical protein HMPREF9391_0562 [Streptococcus sanguinis
SK408]
Length=244
Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/228 (33%), Positives = 118/228 (52%), Gaps = 14/228 (6%)
Query 82 LHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAA 141
L G LME + D+ LH NPYS + S+ W ++ L+ EA QQI+ +
Sbjct 22 LQGFLMEKLSDDFASFLHQQETNPYSMNLRSEREESI-WTVNLLSEEAEQQILPQL---- 76
Query 142 FAGFRLRASGIATQVTSRSLEQNPLSQ--FARIFYARPETRKFRVEFLTPTAFKQSGEYV 199
+ ++ + ++ +++E LS IF + + F TPT FK+ G++V
Sbjct 77 LSLETIKLETYSEEILVKNIEIQSLSSQSLLEIFQGDEASHLISLNFYTPTTFKRQGQFV 136
Query 200 FWPDPRLVFQSLAQKYGAIVDG----EEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARV 255
+PD RL+FQSL QKY +V+G EE +AE Q ++++R+ S F + +
Sbjct 137 LFPDTRLIFQSLMQKYSRLVEGKAEIEEETLEFLAEHSQ---ITSYRLKSHYFPIHGRKY 193
Query 256 PGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQP 303
P F G T ++G T +Y LL FGE+SG G K S+GMG +R++
Sbjct 194 PAFEGRVTIRIQGASTLKAYAQMLLRFGEYSGVGAKCSLGMGGMRIEE 241
>gi|325696577|gb|EGD38467.1| hypothetical protein HMPREF9384_1724 [Streptococcus sanguinis
SK160]
Length=244
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 79/259 (31%), Positives = 130/259 (51%), Gaps = 25/259 (9%)
Query 51 RRMTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA 110
+++ HLS+++L + L L G LME + D+ LH NPYS
Sbjct 2 KKIRLHLSKVSL-----------KDDDLVCKLQGFLMEKLSDDFASFLHQQETNPYSMNL 50
Query 111 LARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQ-- 168
+ S+ W ++ L+ EA QQI+ + + ++ + ++ +++E LS
Sbjct 51 RSEREESI-WTVNLLSEEAEQQILPQL----LSLETIKLETYSEEILVKNIEIQSLSSQS 105
Query 169 FARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG----EEP 224
+F + + F TPT FK+ G++V +PD RL+FQSL QKY +V+G EE
Sbjct 106 LLEVFQGDEASHLISLNFYTPTTFKRQGQFVLFPDTRLIFQSLMQKYSRLVEGKAEIEEE 165
Query 225 DPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGE 284
+AE Q ++++R+ S F + + P F G T ++G T +Y LL FGE
Sbjct 166 TLEFLAEHSQ---ITSYRLKSHYFPIHGRKYPAFEGRVTIRIQGASTLKAYAQMLLRFGE 222
Query 285 FSGCGIKASMGMGAIRVQP 303
+SG G K S+GMG +R++
Sbjct 223 YSGVGAKCSLGMGGMRIEE 241
>gi|327469963|gb|EGF15427.1| hypothetical protein HMPREF9386_0574 [Streptococcus sanguinis
SK330]
Length=244
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/232 (33%), Positives = 119/232 (52%), Gaps = 14/232 (6%)
Query 78 LGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPI 137
L L G LME + D+ LH NPYS + S+ W ++ L+ EA QQI+ +
Sbjct 18 LVSKLQGFLMEKLSDDFASFLHQQETNPYSMNLRSEREESI-WTVNLLSEEAEQQILPQL 76
Query 138 NDAAFAGFRLRASGIATQVTSRSLEQNPLSQ--FARIFYARPETRKFRVEFLTPTAFKQS 195
+ ++ + ++ +++E LS +F + + F TPT FK+
Sbjct 77 ----LSLETIKLETYSEEILVKNIEIQSLSSQSLLEVFQGDEASHLISLNFYTPTTFKRQ 132
Query 196 GEYVFWPDPRLVFQSLAQKYGAIVDG----EEPDPGLIAEFGQSVRLSAFRVASAPFAVG 251
G++V +PD RL+FQSL QKY +V+G EE +AE Q ++++R+ S F +
Sbjct 133 GQFVLFPDTRLIFQSLMQKYSRLVEGKAEIEEETLEFLAEHSQ---ITSYRLKSHYFPIH 189
Query 252 AARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQP 303
+ P F G T ++G T +Y LL FGE+SG G K S+GMG +R++
Sbjct 190 GRKYPAFEGRVTIRIQGASTLKAYAQMLLRFGEYSGVGAKCSLGMGGMRIEE 241
>gi|325687532|gb|EGD29553.1| hypothetical protein HMPREF9381_1066 [Streptococcus sanguinis
SK72]
Length=244
Score = 134 bits (338), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 79/259 (31%), Positives = 130/259 (51%), Gaps = 25/259 (9%)
Query 51 RRMTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA 110
+++ HLS+++L + L L G LME + D+ LH NPYS
Sbjct 2 KKIRLHLSKVSL-----------KDDDLVCKLQGFLMEKLSDDFASFLHQQETNPYSMNL 50
Query 111 LARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQ-- 168
+ S+ W ++ L+ EA QQI+ + + ++ + ++ +++E LS
Sbjct 51 RSEREESI-WTVNLLSEEAEQQILPQL----LSLETIKLETYSEEILVKNIEIQSLSSQS 105
Query 169 FARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG----EEP 224
+F + + F TPT FK+ G++V +PD RL+FQSL QKY +V+G EE
Sbjct 106 LLEVFQGDEVSHLISLNFYTPTTFKRQGQFVLFPDTRLIFQSLMQKYSRLVEGKAEIEEE 165
Query 225 DPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGE 284
+AE Q ++++R+ S F + + P F G T ++G T +Y LL FGE
Sbjct 166 TLEFLAEHSQ---ITSYRLKSHYFPIHGRKYPAFEGRVTIRIQGASTLKAYAQMLLRFGE 222
Query 285 FSGCGIKASMGMGAIRVQP 303
+SG G K S+GMG +R++
Sbjct 223 YSGVGAKCSLGMGGMRIEE 241
>gi|229826457|ref|ZP_04452526.1| hypothetical protein GCWU000182_01830 [Abiotrophia defectiva
ATCC 49176]
gi|229789327|gb|EEP25441.1| hypothetical protein GCWU000182_01830 [Abiotrophia defectiva
ATCC 49176]
Length=260
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 79/257 (31%), Positives = 123/257 (48%), Gaps = 14/257 (5%)
Query 59 RLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSL 118
RL + LE + + +L G LM+ I + +H+ +PYSQ+ + + L
Sbjct 4 RLVIELENNKGIPYNY--SLSTAFQGYLMDLIDEGFADKMHSSGYHPYSQFVMI-ADGRL 60
Query 119 EWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPE 178
W ++ L EA + IV + + + + ++ + + Y + +
Sbjct 61 RWIVNVLDEEAEKFIVKKLLEDDVKTVHINKLEDDLNIINKEYSATTYDELFKECYFKND 120
Query 179 TRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGA------IVDGEEPDPGLIAEF 232
+R V+FLTPTAFKQ+ Y F+PD +L+FQSL KY A I D E L+ F
Sbjct 121 SRYIEVKFLTPTAFKQNNRYQFFPDIKLIFQSLMMKYDAASSQNVIFDAE-----LLIHF 175
Query 233 GQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKA 292
++ + ++ + S F V + ++P FTG F V G A+ LL FG FSG GIK+
Sbjct 176 EENAEIVSYNLRSTNFFVNSNKIPAFTGRVVFKVNGPMQMANLAYLLLKFGAFSGVGIKS 235
Query 293 SMGMGAIRVQPLAPREK 309
MGMG I V +EK
Sbjct 236 GMGMGGIEVNINKGKEK 252
>gi|291460042|ref|ZP_06599432.1| CRISPR-associated protein Cas6 [Oribacterium sp. oral taxon 078
str. F0262]
gi|291417383|gb|EFE91102.1| CRISPR-associated protein Cas6 [Oribacterium sp. oral taxon 078
str. F0262]
Length=262
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 79/249 (32%), Positives = 123/249 (50%), Gaps = 14/249 (5%)
Query 57 LSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTT 116
L+RL L L PL ++ HG LME +P +Y LH ++PY+Q+ L R +
Sbjct 2 LARLELKLGGTEPLSYQMTSSF----HGALMELLP-EYAAELHESRLHPYTQH-LERRES 55
Query 117 SLEWKISTLTNEARQQIVGPINDAAFAGF---RLRASGIATQVTSRSLEQNPLSQFARIF 173
W ++ L + A V I + A R+R+ + + R + +F+ F
Sbjct 56 GWYWVVTALNDLA----VSEIMEKALRSLDEIRIRSHQLRIPILGREYRELSDREFSASF 111
Query 174 YARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKY-GAIVDGEEPDPGLIAEF 232
Y R ++F+TPTAFKQ+G Y+ +PD R +F +L KY AI D D ++ +
Sbjct 112 YQGEGGRYIGLQFVTPTAFKQNGRYLNFPDLRFMFLNLMNKYDAAISDSSMRDDEVLEQL 171
Query 233 GQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKA 292
L + + S F++ R+P F G + G T A++ L FG +SG GIK
Sbjct 172 LNGASLHRYELRSTVFSLEGVRIPAFLGKMVLKISGTQTMANFARMLFLFGSYSGIGIKT 231
Query 293 SMGMGAIRV 301
++GMGAIR+
Sbjct 232 ALGMGAIRI 240
>gi|323141258|ref|ZP_08076154.1| putative CRISPR-associated endoribonuclease Cas6 [Phascolarctobacterium
sp. YIT 12067]
gi|322414215|gb|EFY05038.1| putative CRISPR-associated endoribonuclease Cas6 [Phascolarctobacterium
sp. YIT 12067]
Length=253
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/226 (31%), Positives = 118/226 (53%), Gaps = 11/226 (4%)
Query 82 LHGVLMESIPADYVQTLHTVPVNPYSQYA-LARSTTSLEWKISTLTNEARQQIVGPINDA 140
LHGVLME I + Y + LH + PYSQY + + W+++ L +A +++G A
Sbjct 25 LHGVLMEHIDSTYAELLHQQSLRPYSQYLYFDKERAGVYWRLTALNKQADDELLG----A 80
Query 141 AF---AGFRLRASGIATQVTSRS-LEQNPLSQFARIFYARPETRKF-RVEFLTPTAFKQS 195
AF A L+ + Q+ S+ L++ ++ A +A+P K+ FLT +FK
Sbjct 81 AFSLPATVYLKKKQMEVQLVSKEYLKETSYAEIAEKCFAQPLAGKYLSCSFLTSCSFKSE 140
Query 196 GEYVFWPDPRLVFQSLAQKYGAIVDGEEPDP-GLIAEFGQSVRLSAFRVASAPFAVGAAR 254
G+YV +P P+ + SL +++ + D E D GL + Q ++ ++++ F+V AR
Sbjct 141 GQYVIFPQPQFLLGSLIKRWNSFADKERLDALGLAQDLAQETYVADYKLSLHSFSVDGAR 200
Query 255 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
+P F G ++ IA L + +SG GIK ++GMGA++
Sbjct 201 IPAFRGLYVLGMKNNVMCNRIIAMLGEYANYSGIGIKTALGMGAVK 246
>gi|121533435|ref|ZP_01665263.1| conserved hypothetical protein [Thermosinus carboxydivorans Nor1]
gi|121307994|gb|EAX48908.1| conserved hypothetical protein [Thermosinus carboxydivorans Nor1]
Length=287
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/229 (34%), Positives = 115/229 (51%), Gaps = 4/229 (1%)
Query 76 ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVG 135
A G LHG L+E + + TLH + PYSQ+ L + + W+I TLT A QQ+V
Sbjct 19 ANAGSVLHGALIERLDSAAATTLHEPGLRPYSQH-LRVTKEAAVWRIGTLTPPAAQQLVA 77
Query 136 PINDAAFAGFRLRASGIATQVT-SRSLEQNPLSQFARIFYARPE-TRKFRVEFLTPTAFK 193
P+ A A F LR +T R L +F F+ P R+F ++F TP +FK
Sbjct 78 PLLAAPNAAFYLRDKHAHIAITVKRQLVACTYREFVNHFFLSPAPARRFVMKFATPASFK 137
Query 194 QSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPD-PGLIAEFGQSVRLSAFRVASAPFAVGA 252
Y +P ++QSL ++ A G D P LI + R+ +R+ F+V +
Sbjct 138 IDNAYQIFPSVFHIYQSLVNRWNACASGFVLDRPRLIDDLTAYTRIIDYRLRLNTFSVES 197
Query 253 ARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRV 301
R+P F+G T ++ G + A LL +GE SG G+K ++GMG + V
Sbjct 198 IRIPAFSGEITLSIAGPEQLVRLAAMLLAYGEISGIGVKTALGMGGVTV 246
>gi|334126726|ref|ZP_08500674.1| hypothetical protein HMPREF9081_0261 [Centipeda periodontii DSM
2778]
gi|333391136|gb|EGK62257.1| hypothetical protein HMPREF9081_0261 [Centipeda periodontii DSM
2778]
Length=252
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/228 (32%), Positives = 114/228 (50%), Gaps = 6/228 (2%)
Query 76 ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA-LARSTTSLEWKISTLTNEARQQIV 134
+ +G LHG LME +PAD + LHT + PYSQ + + W+I+TLT+E +
Sbjct 19 SAMGSVLHGALMERLPADVAEFLHTQNLRPYSQSVHYEKESERTLWRINTLTDEMGAIVE 78
Query 135 GPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ 194
G + +A R + I+ Q E + ++ R F R + F T TAFK+
Sbjct 79 GLLGEAEAIYLRQKGYAISIQNFCCVAEMDDVALADRYFLPDDAPRGAELTFRTMTAFKR 138
Query 195 SGEYVFWPDPRLVFQSLAQK---YGAIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVG 251
G+YV P+ L+ QSL + Y V E D L + G + R+S + + +A F+V
Sbjct 139 DGQYVLLPEIYLIVQSLLARWALYCPQVRIEAED--LAQQLGAACRISQYALRTAGFSVD 196
Query 252 AARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAI 299
+ GF G+ + G D+ + L+ F ++G GIK ++GMGA+
Sbjct 197 GHTLRGFRGTLSMGFTGTDSVRRILGMLMEFAPYAGVGIKTALGMGAV 244
>gi|296133516|ref|YP_003640763.1| Protein of unknown function DUF2276 [Thermincola sp. JR]
gi|296032094|gb|ADG82862.1| Protein of unknown function DUF2276 [Thermincola potens JR]
Length=249
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 73/251 (30%), Positives = 118/251 (48%), Gaps = 8/251 (3%)
Query 57 LSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTT 116
L RL + LE D E + HG+LME + Y LH P+SQ+
Sbjct 2 LRRLKILLEPDR--EEKCHYNMASLFHGMLMERVNPSYAGYLHESGYKPFSQFVSGAVGN 59
Query 117 SL-EWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLS--QFARIF 173
L W +S LT +A Q++ P+ D F L+ I V + LE P+S + R +
Sbjct 60 GLWMWTVSFLTEQAWQEVGRPLLDDKAGEFILKDKDIRLTVREKRLEP-PVSYGELTRKY 118
Query 174 YARPETRK-FRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG-EEPDPGLIAE 231
Y + R+ ++ FLTP +FK +G Y PD L++QSL ++ A D +
Sbjct 119 YLEEQPRRAIKITFLTPCSFKSAGRYAILPDLALIYQSLMNRFDAFADEFSLRSTDALEH 178
Query 232 FGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIK 291
+ + + + S + + ++P F G V G +T A L FG+++G GIK
Sbjct 179 LARFTYIRRYDLRSTRYHLEGVKIPSFIGKLELAVNGPETMAGLANLLFAFGQWAGIGIK 238
Query 292 ASMGMGAIRVQ 302
++GMGA++++
Sbjct 239 TALGMGAVQIE 249
>gi|342213934|ref|ZP_08706647.1| putative CRISPR-associated endoribonuclease Cas6 [Veillonella
sp. oral taxon 780 str. F0422]
gi|341596432|gb|EGS39034.1| putative CRISPR-associated endoribonuclease Cas6 [Veillonella
sp. oral taxon 780 str. F0422]
Length=258
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/247 (28%), Positives = 119/247 (49%), Gaps = 9/247 (3%)
Query 63 TLEVDAPLERAR--VATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYAL---ARSTTS 117
T+E D L+ + V +LG LHG++M I +Y LHT PY QY R+T+
Sbjct 8 TIEFDIHLDNGQKIVQSLGSVLHGIIMSCISTEYATFLHTTATPPYHQYVYYDKERNTSV 67
Query 118 LEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYAR- 176
W+I+ LT ++ +IV + A + SG R + + AR +
Sbjct 68 --WRITALTMDSVHEIVDCLYTIAPIVKLEQKSGNLIIDERRVVLETTYGDIARSYLGEA 125
Query 177 PETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGE-EPDPGLIAEFGQS 235
+ +K + F+TPT+FK + EY +PD + +S +K+ + + D L +
Sbjct 126 KQYKKIEIHFVTPTSFKVNQEYAIFPDIEKMMRSFLKKWNSFSTSDVYDDEELFQSTCTN 185
Query 236 VRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMG 295
+ ++ +R+ F + ++PGF G T + ++ A L ++G GCGIK ++G
Sbjct 186 LYVADYRMRLQRFYLERTKIPGFLGDYTLLCKQNMILSNLAAMLCYYGTLCGCGIKVAIG 245
Query 296 MGAIRVQ 302
MGA++V
Sbjct 246 MGAMKVN 252
>gi|164688462|ref|ZP_02212490.1| hypothetical protein CLOBAR_02107 [Clostridium bartlettii DSM
16795]
gi|164602875|gb|EDQ96340.1| hypothetical protein CLOBAR_02107 [Clostridium bartlettii DSM
16795]
Length=241
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/236 (29%), Positives = 116/236 (50%), Gaps = 20/236 (8%)
Query 77 TLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGP 136
+G L G++M + DY + LH + PYSQ+ + W I+ LT EA++ I+
Sbjct 14 NIGSVLQGIMMTFLDRDYGEVLHRQSLMPYSQHFETKDGKYY-WIINALTEEAKENIICK 72
Query 137 INDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQ-S 195
I D+ R+ + + + ++E+ + +F R E + + F TPT+FK+ S
Sbjct 73 ILDSD-----KRSLDLTYRKSKLNIERLIFEE-VNLFKDRGE-KDIVLNFKTPTSFKRTS 125
Query 196 GEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQ----------SVRLSAFRVAS 245
G Y +P+ R +F SL KY + + D L ++ + +V + + + +
Sbjct 126 GGYEIFPNVRHIFNSLINKY-EMFEMNNLDDSLFSKINKKEDFLEDIIKNVDIVGYNLKT 184
Query 246 APFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRV 301
F + +PGF G V+G F + I+ LL FGE+SG G+K +MGMG + +
Sbjct 185 EKFGIKGNYIPGFMGKVNIKVKGSAEFKNNISKLLQFGEYSGVGLKCTMGMGVMEI 240
>gi|292669137|ref|ZP_06602563.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
gi|292649189|gb|EFF67161.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
Length=251
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/229 (32%), Positives = 109/229 (48%), Gaps = 8/229 (3%)
Query 76 ATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQ-YALARSTTSLEWKISTLTNEARQQIV 134
+++G LHG LME +P DY LHT + PYSQ + + W+I TL A +I+
Sbjct 19 SSMGSVLHGALMELLPEDYADALHTQNLRPYSQSIRWDKERERVIWRIGTLDQTA-GEII 77
Query 135 GPINDAAFAGFRLRASGIATQVTS-RSLEQNPLSQFARIFYARPET--RKFRVEFLTPTA 191
G + + LR G V + + +E+ A ++ R ET R + FLTPT+
Sbjct 78 GTVLQS-LEHIHLRQKGYTVDVQNIQCVEERSYQDIADEYF-RAETAPRGAELHFLTPTS 135
Query 192 FKQSGEYVFWPDPRLVFQSLAQKYGAIV-DGEEPDPGLIAEFGQSVRLSAFRVASAPFAV 250
FKQ G Y+ P+ L+ QSL ++ D + L RL+ + + S F+V
Sbjct 136 FKQGGAYIILPESTLILQSLLARWNRFCPDIRIEEDDLAQTLAAHTRLTRYTLRSVGFSV 195
Query 251 GAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAI 299
+ GF G G D + LL F ++G GIK ++GMGA+
Sbjct 196 DGYNIRGFRGQIVLQFAGSDMVRRILGTLLAFAPYAGIGIKTALGMGAV 244
>gi|227890790|ref|ZP_04008595.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
gi|227867199|gb|EEJ74620.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
Length=218
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/218 (29%), Positives = 109/218 (50%), Gaps = 5/218 (2%)
Query 87 MESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFR 146
ME+I + LH +N YS +++ ++ + I+ L A + + D
Sbjct 1 MENISEEAADYLHESKINCYS-ISVSNDDKNVYFTINLLNEVAEKIFSYLVLDKEIDKIV 59
Query 147 LRASGIATQ--VTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDP 204
L S I + V ++ +E+ Q R FY +R+ V+ ++P +FK G+Y F+PD
Sbjct 60 LNNS-IQKEFLVLNKQIEELTAKQLTRNFYEGISSREVVVDIMSPMSFKVQGDYYFFPDL 118
Query 205 RLVFQSLAQKYGAIVDGEE-PDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSAT 263
L+F++L QKY A + D L+ E ++ ++ ++++ S+ + + A +PG G
Sbjct 119 ELMFRNLMQKYNATFENTNIVDNDLLQEILENSKIVSYKIQSSYYPIHKAFIPGTIGRIK 178
Query 264 FTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRV 301
+G T +Y LL FG FSG G+K MGMG I +
Sbjct 179 IRFKGNQTLTNYTQMLLNFGVFSGIGVKTGMGMGHISI 216
>gi|334308468|gb|EGL99454.1| CRISPR-associated protein Cas6 [Lactobacillus salivarius NIAS840]
Length=218
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 61/216 (29%), Positives = 109/216 (51%), Gaps = 5/216 (2%)
Query 87 MESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFR 146
ME+I + LH +N YS +++ ++ + ++ L A + I +
Sbjct 1 MENISEEAADYLHKSKINCYS-ISVSNDDKNIYFIVNLLNKVAEKIFNHLILNKEIDKIV 59
Query 147 LRASGIATQ--VTSRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDP 204
L S I + V ++ +E+ + Q R FY +++ V+ ++P +FK G+Y F+PD
Sbjct 60 LNNS-IQKEFLVLNKQIEELTVKQLTRNFYEGISSKEVVVDIMSPMSFKVQGDYYFFPDL 118
Query 205 RLVFQSLAQKYGAIVDGEE-PDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSAT 263
L+F++L QKY A + D L+ E ++ ++ ++++ S+ + + A +PG G
Sbjct 119 ELMFRNLMQKYNATFENTNIVDNDLLQEILENSKIVSYKIQSSYYPIHKAFIPGTIGRIK 178
Query 264 FTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAI 299
+G T +Y LL FG FSG G+K MGMG I
Sbjct 179 IRFKGNQTLTNYTQMLLNFGVFSGIGVKTGMGMGHI 214
>gi|329736388|gb|EGG72657.1| CRISPR-associated endoribonuclease Cas6 [Staphylococcus epidermidis
VCU045]
gi|341656688|gb|EGS80397.1| CRISPR-associated endoribonuclease Cas6 [Staphylococcus epidermidis
VCU037]
Length=244
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/246 (28%), Positives = 116/246 (48%), Gaps = 13/246 (5%)
Query 62 LTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLH-TVPVNPYSQYALARSTTSLEW 120
+T+E+D P + R LG LHGVLM+ +P D LH +P Q +S + W
Sbjct 5 ITVELDLP-DNIRFQYLGSILHGVLMDYLPNDIADQLHHEFAYSPLKQRIYYKSKKVI-W 62
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLE----QNPLSQFARIFYAR 176
+I +++ ++IV + + + I Q S +E QN ++Q +
Sbjct 63 EIVCMSDNLFKEIVKLFSSKNSLLLKYYQTNIDIQ--SFQIEKINVQNIMNQLLQ---TE 117
Query 177 PETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEP-DPGLIAEFGQS 235
R R+ TP +FK Y+ +PD + F+S+ ++ A + + D + ++
Sbjct 118 DLNRYVRLNIQTPMSFKYQSSYMIFPDVKRFFRSIMIQFDAFFEEYKMYDKETLDFLMKN 177
Query 236 VRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMG 295
V + +++ S F + ++P FTG F ++G F LL FGEFSG G+K S+G
Sbjct 178 VNIVDYKLKSTRFNLEKVKIPSFTGEMVFKIKGPLPFLQLTHFLLKFGEFSGSGMKTSLG 237
Query 296 MGAIRV 301
MG +
Sbjct 238 MGKYSI 243
>gi|258645682|ref|ZP_05733151.1| CRISPR-associated protein Cas6 [Dialister invisus DSM 15470]
gi|260403050|gb|EEW96597.1| CRISPR-associated protein Cas6 [Dialister invisus DSM 15470]
Length=255
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 66/230 (29%), Positives = 111/230 (49%), Gaps = 12/230 (5%)
Query 77 TLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGP 136
+ G HG L+ + ++ + +H + PYSQY L + W+I+ LT EA I+ P
Sbjct 20 SFGSVFHGALISELDREWAEKMHEQQIRPYSQYLLVKEGNPY-WRIAVLTEEAFDHILRP 78
Query 137 INDAAFAGFRLRASGIATQVTSRS-LEQNPLSQFARIFYARPE-TRKFRVEFLTPTAFKQ 194
+ L G +V S L+++ F+ E ++FLT +FK+
Sbjct 79 MMQKT--SLFLEQKGYEVEVGKFSILKKDSFQGLEERFWTGTEKIHHIELDFLTSASFKK 136
Query 195 SGEYVFWPDPRLVFQSLAQKYGAIVD----GEEPDPGLIAEFGQSVRLSAFRVASAPFAV 250
+GEY +P+ LVF +L +K+ D GEE +AEF + ++ +R+ + PF+V
Sbjct 137 NGEYKIFPELLLVFNNLIRKWNVYSDSMVLGEERLGDKLAEF---MCITDYRLHTHPFSV 193
Query 251 GAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIR 300
R+ F G+ + D + L F +++G GIK +MGMGA+
Sbjct 194 EGRRIRAFRGNIRLGLFKDDITRRMASMLAAFADYAGIGIKTAMGMGAVH 243
>gi|57865878|ref|YP_189998.1| hypothetical protein SERP2455 [Staphylococcus epidermidis RP62A]
gi|57636536|gb|AAW53324.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
Length=244
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/246 (28%), Positives = 115/246 (47%), Gaps = 13/246 (5%)
Query 62 LTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLH-TVPVNPYSQYALARSTTSLEW 120
+T+E+D P E R LG LHGVLM+ + D LH +P Q + + W
Sbjct 5 ITVELDLP-ESIRFQYLGSVLHGVLMDYLSDDIADQLHHEFAYSPLKQ-RIYHKNKKIIW 62
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLE----QNPLSQFARIFYAR 176
+I +++ +++V + + + I Q S +E QN ++Q ++
Sbjct 63 EIVCMSDNLFKEVVKLFSSKNSLLLKYYQTNIDIQ--SFQIEKINVQNMMNQLLQV---E 117
Query 177 PETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEP-DPGLIAEFGQS 235
+R R+ TP +FK Y+ +PD + F+S+ ++ A + D + ++
Sbjct 118 DLSRYVRLNIQTPMSFKYQNSYMIFPDVKRFFRSIMIQFDAFFEEYRMYDKETLNFLEKN 177
Query 236 VRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMG 295
V + +++ S F + ++P FTG F ++G F LL FGEFSG GIK S+G
Sbjct 178 VNIVDYKLKSTRFNLEKVKIPSFTGEIVFKIKGPLPFLQLTHFLLKFGEFSGSGIKTSLG 237
Query 296 MGAIRV 301
MG +
Sbjct 238 MGKYSI 243
>gi|339893268|emb|CCB52456.1| CRISPR associated protein [Staphylococcus lugdunensis N920143]
Length=250
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/243 (26%), Positives = 114/243 (47%), Gaps = 7/243 (2%)
Query 62 LTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLH-TVPVNPYSQYALARSTTSLEW 120
+T++++ P + +G LHGVLM+ + D +LH +P Q + W
Sbjct 5 ITVQLNLP-NNINLPYMGSILHGVLMDYLSNDIASSLHHNFAYSPLKQRVFYFEDKKI-W 62
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIF-YARPET 179
+I +++ E ++V N L+ + S+E+ + + F + R +
Sbjct 63 EIVSMSEELFNELVNLFNKEN--KIYLKHYKSTVSIEKYSVEKISIQKLIDTFLHKRDLS 120
Query 180 RKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEF-GQSVRL 238
R ++ TP +FK + +Y+ +P+ + F+S+ ++ A + + EF Q+V +
Sbjct 121 RYIKINVSTPMSFKLNNQYMIFPNVKRFFRSIMIQFDAFFESHKLYDKETLEFLEQNVNI 180
Query 239 SAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGA 298
+++ S F + ++P F G F + G F + LL FGEFSG GIK S+GMG
Sbjct 181 VNYKLKSVRFHMEKVKIPSFKGEIVFKINGPLPFLQLVYFLLAFGEFSGTGIKTSLGMGK 240
Query 299 IRV 301
+
Sbjct 241 YNI 243
>gi|341822666|emb|CCC73590.1| putative uncharacterized protein [Megasphaera elsdenii DSM 20460]
Length=249
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/245 (29%), Positives = 119/245 (49%), Gaps = 16/245 (6%)
Query 66 VDAPL---ERARVA-TLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLE-W 120
++ PL E R+ +G HG LM+ I + H + + PYSQ T W
Sbjct 5 IEIPLKMPEHTRIHPAMGSIFHGALMDVIAPTSAELYHHMTLRPYSQVVYWDETKHCPLW 64
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIA---TQVTSRSLEQNPLSQFARIFYARP 177
+I TLT+EA +++V P+ + + ++ Q+ ++ ++ +QF + A P
Sbjct 65 RIGTLTDEAYERLVIPLEKVPALWLKQKQYEVSLGPMQLLRQTSFEDLAAQFVKADSA-P 123
Query 178 ETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDG---EEPDPGLIAEFGQ 234
+++ L+ +FKQ G YV PD RL++QSL Q++ D E+ D L+ +
Sbjct 124 AGAEWQC--LSIMSFKQEGRYVILPDIRLIYQSLLQRWNTFSDTVKLEQDD--LLEQLTS 179
Query 235 SVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASM 294
RL+ +++ S F+V +++ G G F+ G D L FSG G+K ++
Sbjct 180 HCRLTKYQLRSQVFSVNGSQIYGCEGWQRFSFFGYDMLKRLQGLLASLAPFSGVGVKTAL 239
Query 295 GMGAI 299
GMGA+
Sbjct 240 GMGAV 244
>gi|340752434|ref|ZP_08689233.1| hypothetical protein FSAG_00290 [Fusobacterium sp. 2_1_31]
gi|229422233|gb|EEO37280.1| hypothetical protein FSAG_00290 [Fusobacterium sp. 2_1_31]
Length=240
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/244 (22%), Positives = 114/244 (47%), Gaps = 11/244 (4%)
Query 62 LTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTS-LEW 120
+ +E+++ +A+L HG LME+I Y + H NP++ + W
Sbjct 5 INMELESKELNMNMASL---FHGYLMENIDPAYAEYFHYNTTNPFTSCIFKDTKEDKFFW 61
Query 121 KISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETR 180
+++T + +A ++ + L+ + V S S+++ + +F E +
Sbjct 62 RVTTFSQKAYDMLMSYFSKGIPEKIYLKNKDLEINVKSFSIQK---KSYEDLFLEATERK 118
Query 181 KFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVD-GEEPDPGLIAEFGQSVRLS 239
R++ ++PT+FK G +P+ + + K + E D ++ E + V +
Sbjct 119 --RIKLISPTSFKSDGITHIFPNISTLISGVIAKINQHSETAELEDKKIVNELLEKVYIK 176
Query 240 AFRVASAPFAVGAARVPGFTGSATFTVRGVD-TFASYIAALLWFGEFSGCGIKASMGMGA 298
+ + + F + + ++ GF G+ ++G D T A+ + L+ E++G GIK S+GMG
Sbjct 177 DYNLRTKIFHLESIKIKGFIGTMDLAIKGEDRTLANILNFLILMSEYTGLGIKTSLGMGG 236
Query 299 IRVQ 302
++V+
Sbjct 237 VKVE 240
>gi|237741579|ref|ZP_04572060.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13]
gi|229429227|gb|EEO39439.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13]
Length=242
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/242 (24%), Positives = 116/242 (48%), Gaps = 10/242 (4%)
Query 65 EVDAPLERARVAT-LGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALAR-STTSLEWKI 122
+++ LE + T +G HG LME+I + Y H NP++ T W+I
Sbjct 7 QINIELEANGLNTNMGSLFHGYLMENIDSAYADYFHYNTTNPFTSCIYKDIKTDKFFWRI 66
Query 123 STLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKF 182
+T +A ++ ++ + + L+ + V S S+++ + +F E +
Sbjct 67 TTYNQKAYDMLMTYFSNIPESVY-LKNRDLEINVKSFSIQK---KSYEDLFLECTERK-- 120
Query 183 RVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVD-GEEPDPGLIAEFGQSVRLSAF 241
R++ +TPT+FK +G +P+ + + K + E D +I E + V + +
Sbjct 121 RIKLITPTSFKSNGITHIFPNISTLISGVITKINQHSETAELGDKKIIDELLEKVYIKDY 180
Query 242 RVASAPFAVGAARVPGFTGSATFTVRGVDT-FASYIAALLWFGEFSGCGIKASMGMGAIR 300
+ + F + + ++ GF G+ ++G +T A+ + L+ E++G GIK S+GMG ++
Sbjct 181 NLRTKVFYLESIKIKGFLGTMDLAIKGEETTLANILNFLILMSEYTGLGIKTSLGMGGVK 240
Query 301 VQ 302
++
Sbjct 241 IE 242
>gi|294792435|ref|ZP_06757582.1| putative CRISPR-associated protein Cas6 [Veillonella sp. 6_1_27]
gi|294456334|gb|EFG24697.1| putative CRISPR-associated protein Cas6 [Veillonella sp. 6_1_27]
Length=256
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/255 (23%), Positives = 118/255 (47%), Gaps = 3/255 (1%)
Query 53 MTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA-L 111
M++++ +++ L + A L V ++G LHGVLME + +Y LH + PYSQY
Sbjct 1 MSDNVEIMSIELVIVADLSIKIVQSIGSVLHGVLMELVGTEYAGQLHETGLRPYSQYIYF 60
Query 112 ARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFAR 171
+ W++S +T +A +IV P + F + G LE+
Sbjct 61 NKHKKQYIWRLSAVTADAVNRIVRPTLEMPEKIFLKQKRGYLYIKDRTILEETSYEALIH 120
Query 172 IFYARPE-TRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVD-GEEPDPGLI 229
F++ + +++ ++ T+FK +Y +P+ +++ L +++ G LI
Sbjct 121 KFWSSDAFYSQTKLQCMSTTSFKVDQQYTIFPEAFRIYRYLLRQWNHFSTFGTMDADSLI 180
Query 230 AEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCG 289
F + V + + + +++ ++ GF G + +A L ++ +F+G G
Sbjct 181 DTFEKGVFIRDYNLRMGIYSLEGIKIRGFRGQIVMQFKRNIELQKILALLSYYSQFTGLG 240
Query 290 IKASMGMGAIRVQPL 304
IK ++GMG ++ + +
Sbjct 241 IKTALGMGGVKCEII 255
>gi|339890608|gb|EGQ79709.1| hypothetical protein HMPREF9094_1266 [Fusobacterium nucleatum
subsp. animalis ATCC 51191]
Length=239
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/245 (23%), Positives = 115/245 (47%), Gaps = 10/245 (4%)
Query 62 LTLEVDAPLERARVAT-LGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALAR-STTSLE 119
+ ++++ LE ++T +G HG LME+I + Y H NP++
Sbjct 1 MLVQINMELEANGLSTNMGSLFHGYLMENIDSAYADYFHYNTTNPFTSCIFKDIKNDKFF 60
Query 120 WKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPET 179
W+++T +A ++ ++ L+ + V S S+++ + +F E
Sbjct 61 WRVTTFNQKAYDMLMTYFSNIP-ESIYLKNRDLEINVKSFSIQK---KSYEDLFLECTER 116
Query 180 RKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVD-GEEPDPGLIAEFGQSVRL 238
+ R+ +TPT+FK +G +P+ + + K + E D +I E + V +
Sbjct 117 K--RIRLITPTSFKSNGVTHIFPNISTLISGVIAKINQHSETAELGDKKIIDELLEKVYI 174
Query 239 SAFRVASAPFAVGAARVPGFTGSATFTVRGVDT-FASYIAALLWFGEFSGCGIKASMGMG 297
+ + + F + + ++ GF G+ ++G +T A+ + L+ E++G GIK S+GMG
Sbjct 175 KDYNLRTKVFYLESIKIKGFIGTMDLAIKGEETTLANILNFLILMSEYTGLGIKTSLGMG 234
Query 298 AIRVQ 302
++++
Sbjct 235 GVKIE 239
>gi|295105100|emb|CBL02644.1| Uncharacterized conserved protein (DUF2276). [Faecalibacterium
prausnitzii SL3/3]
Length=252
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/235 (32%), Positives = 109/235 (47%), Gaps = 16/235 (6%)
Query 82 LHGVLMESIPADYVQTLHTVPVNPYSQYAL--ARSTTSLEWKISTLTNEARQQIVGPIND 139
++G LM +PAD LH +P SQ A + TS+ W ++ L +EA V P+
Sbjct 25 IYGWLMAQLPADTAARLHEQGEHPLSQSLCFDAAAQTSV-WTLNLL-DEALAAQVRPL-- 80
Query 140 AAFAGFR-LRASGIATQVT---SRSLEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQS 195
AG L G+ Q+ S S+E Q P +R R+ F TP AFKQ+
Sbjct 81 --LAGCTTLELHGVPLQMELLGSHSVENG--LQLLLAARENPASRT-RLWFRTPCAFKQA 135
Query 196 GEYVFWPDPRLVFQSLAQKYG-AIVDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAAR 254
G Y +P L+ QSL + A D + DP + + +R+ + + + + +
Sbjct 136 GRYAIYPQEFLLLQSLVLHWNTAFPDCQLSDPDALDAILRGLRILDYSLHTVSYPIKNTC 195
Query 255 VPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREK 309
+PGF GSA R ALL F + G GIK ++GMG + V+PL +K
Sbjct 196 IPGFVGSAVVEARLALPLLELWNALLSFAPYGGIGIKTTLGMGGVSVEPLVLPQK 250
>gi|294782688|ref|ZP_06748014.1| CRISPR-associated protein Cas6 [Fusobacterium sp. 1_1_41FAA]
gi|294481329|gb|EFG29104.1| CRISPR-associated protein Cas6 [Fusobacterium sp. 1_1_41FAA]
Length=240
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/245 (23%), Positives = 116/245 (48%), Gaps = 13/245 (5%)
Query 62 LTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLE-- 119
+ +E++A +A+L HG LME+I Y + H NP++ + + T +
Sbjct 5 INMELEAVGLNVNMASL---FHGYLMENIDPAYAEYFHYNMTNPFTS-CIFKDTKEDKYF 60
Query 120 WKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPET 179
W+I+T + +A I+ + L+ + V S S+++ + +F E
Sbjct 61 WRITTFSQKAYDMIMSYFSKEIPEKIYLKNKDLEINVKSFSIQK---KSYEDLFLEATER 117
Query 180 RKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEE-PDPGLIAEFGQSVRL 238
+ R++ ++PT+FK G +P+ + + K + E D ++ E + V +
Sbjct 118 K--RIKLISPTSFKSEGVTHIFPNISTLISGVITKINQHSETTELEDKKIVDELLEKVYI 175
Query 239 SAFRVASAPFAVGAARVPGFTGSATFTVRGVD-TFASYIAALLWFGEFSGCGIKASMGMG 297
+ + + F + + ++ GF G+ ++G D + + + L+ E++G GIK S+GMG
Sbjct 176 KDYNLRTKIFHLESIKIKGFIGTMDLAIKGEDRSLINILNFLILMSEYTGLGIKTSLGMG 235
Query 298 AIRVQ 302
++V+
Sbjct 236 GVKVE 240
>gi|289549406|ref|YP_003470310.1| CRISPR-associated protein Cas6 [Staphylococcus lugdunensis HKU09-01]
gi|289178938|gb|ADC86183.1| CRISPR-associated protein Cas6 [Staphylococcus lugdunensis HKU09-01]
Length=222
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/218 (25%), Positives = 99/218 (46%), Gaps = 6/218 (2%)
Query 87 MESIPADYVQTLH-TVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGF 145
M+ + D +LH +P Q + W+I +++ E ++V N
Sbjct 1 MDYLSNDIASSLHHNFAYSPLKQRVFYFEDKKI-WEIVSMSEELFNELVNLFNKEN--KI 57
Query 146 RLRASGIATQVTSRSLEQNPLSQFARIF-YARPETRKFRVEFLTPTAFKQSGEYVFWPDP 204
L+ + S+E+ + + F + R +R ++ TP +FK + +Y+ +P+
Sbjct 58 YLKHYKSTVSIEKYSVEKISIQKLIDTFLHKRDLSRYIKINVSTPMSFKLNNQYMIFPNV 117
Query 205 RLVFQSLAQKYGAIVDGEEPDPGLIAEF-GQSVRLSAFRVASAPFAVGAARVPGFTGSAT 263
+ F+S+ ++ A + + EF Q+V + +++ S F + ++P F G
Sbjct 118 KRFFRSIMIQFDAFFESHKLYDKETLEFLEQNVNIVNYKLKSVRFHMEKVKIPSFKGEIV 177
Query 264 FTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRV 301
F + G F + LL FGEFSG GIK S+GMG +
Sbjct 178 FKINGPLPFLQLVYFLLAFGEFSGTGIKTSLGMGKYNI 215
>gi|269798856|ref|YP_003312756.1| hypothetical protein Vpar_1801 [Veillonella parvula DSM 2008]
gi|269095485|gb|ACZ25476.1| hypothetical protein Vpar_1801 [Veillonella parvula DSM 2008]
Length=256
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/251 (24%), Positives = 117/251 (47%), Gaps = 3/251 (1%)
Query 53 MTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA-L 111
M +++ + + L + A V ++G LHGVLME + +Y LH + PYSQY
Sbjct 1 MADNVEIMAIELGITADPSIKIVQSIGSVLHGVLMELVGIEYAGQLHETGLRPYSQYIYF 60
Query 112 ARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASG-IATQVTSRSLEQNPLSQFA 170
+ W++S +T EA ++I+ P+ D F + G I Q + E + + A
Sbjct 61 DKEKGQYIWRLSAVTAEAVERILRPVLDMPEKIFLKQKRGHIYIQDRTILEETSYEALMA 120
Query 171 RIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPD-PGLI 229
+ + E + ++ +T T+FK +Y +P+ +++ L +++ E D L+
Sbjct 121 KFWSGEAEYAQAKLRCVTTTSFKVDQQYTIFPEAFRIYRYLLRQWNQFTTFEMMDSEDLL 180
Query 230 AEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCG 289
A + + + + + + ++ GF G + ++ L ++ +F+G G
Sbjct 181 AALESAAFIRDYNLRMGIYGLEGVKIRGFRGEIVMQFKRNLVMQRILSLLTYYSQFTGLG 240
Query 290 IKASMGMGAIR 300
IK ++GMG ++
Sbjct 241 IKTALGMGGVQ 251
>gi|315641547|ref|ZP_07896616.1| CRISPR-associated protein cas6 [Enterococcus italicus DSM 15952]
gi|315482684|gb|EFU73211.1| CRISPR-associated protein cas6 [Enterococcus italicus DSM 15952]
Length=244
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/235 (25%), Positives = 108/235 (46%), Gaps = 14/235 (5%)
Query 71 ERARVATLGPHLHGVLMESIPADYVQTLHTVPVNPYSQYA-----LARSTTSLEWKISTL 125
+ + A +G LHG LME +P + V LH YS Y+ L + ++W+I
Sbjct 13 DEIKTANIGSLLHGCLMEWLPEETVSFLHQ-----YSTYSPLKQRLLLNDKKVQWEIVVF 67
Query 126 TNEARQQIVGPINDAAFAGFRLRASGIATQVTSRSLEQNPLSQFARIFYARPETRKF-RV 184
+ QI + FRL + + ++Q + + + +++ E ++ R+
Sbjct 68 NDILFNQIEQTL--TLRKSFRLHYNQKEITIEKIEIQQLAIEELVKKYFSMQEVPRYARL 125
Query 185 EFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQS-VRLSAFRV 243
+PT+FK +G+Y +PD + +F+S+ + G E+ S ++ +++
Sbjct 126 NIQSPTSFKSNGQYDIFPDLKKIFRSIMRNTDTFFPEYRLFDGDTLEYLVSKTKIVNYQL 185
Query 244 ASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGA 298
S F + ++P F G+ T + G LL FG+++G G+K S+GMG
Sbjct 186 RSTKFHLEGIKIPSFQGNFTVQLNGPLPVKQLSYFLLTFGQWTGIGVKTSLGMGK 240
Lambda K H
0.322 0.135 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 543016982550
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40