BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2816c
Length=113
Sequences producing significant alignments: Score (Bits) E Value
gi|15609953|ref|NP_217332.1| hypothetical protein Rv2816c [Mycob... 228 3e-58
gi|340627812|ref|YP_004746264.1| hypothetical protein MCAN_28401... 226 1e-57
gi|308369868|ref|ZP_07419336.2| hypothetical protein TMBG_02950 ... 175 2e-42
gi|224543480|ref|ZP_03684019.1| hypothetical protein CATMIT_0268... 85.5 3e-15
gi|229826472|ref|ZP_04452541.1| hypothetical protein GCWU000182_... 84.3 5e-15
gi|257413194|ref|ZP_04742268.2| CRISPR-associated protein Cas2 [... 82.4 2e-14
gi|315925050|ref|ZP_07921267.1| CRISPR-associated protein cas2 [... 79.7 1e-13
gi|331004037|ref|ZP_08327519.1| CRISPR-associated protein cas2 [... 77.8 5e-13
gi|339890604|gb|EGQ79705.1| CRISPR-associated protein cas2 [Fuso... 76.3 1e-12
gi|291460044|ref|ZP_06599434.1| CRISPR-associated protein Cas2 [... 76.3 2e-12
gi|237741582|ref|ZP_04572063.1| predicted protein [Fusobacterium... 75.5 2e-12
gi|340752437|ref|ZP_08689236.1| CRISPR-associated protein cas2 [... 73.6 8e-12
gi|294782685|ref|ZP_06748011.1| CRISPR-associated protein Cas2 [... 73.2 1e-11
gi|312899093|ref|ZP_07758471.1| CRISPR-associated protein Cas2 [... 70.5 7e-11
gi|114567263|ref|YP_754417.1| hypothetical protein Swol_1748 [Sy... 70.5 9e-11
gi|296133513|ref|YP_003640760.1| CRISPR-associated protein Cas2 ... 69.3 2e-10
gi|121533568|ref|ZP_01665396.1| CRISPR-associated protein Cas2 [... 68.6 3e-10
gi|341822660|emb|CCC73584.1| CRISPR-associated protein cas2 [Meg... 63.9 7e-09
gi|313894759|ref|ZP_07828319.1| CRISPR-associated protein Cas2 [... 63.9 7e-09
gi|292669133|ref|ZP_06602559.1| CRISPR-associated protein cas2 [... 63.9 8e-09
gi|253578034|ref|ZP_04855306.1| CRISPR-associated protein [Rumin... 63.5 9e-09
gi|121533441|ref|ZP_01665269.1| CRISPR-associated protein Cas2 [... 63.5 1e-08
gi|91201520|emb|CAJ74580.1| conserved hypothetical protein [Cand... 62.4 2e-08
gi|159898908|ref|YP_001545155.1| CRISPR-associated Cas2 family p... 61.6 3e-08
gi|334126732|ref|ZP_08500680.1| CRISPR-associated protein cas2 [... 61.6 4e-08
gi|328953423|ref|YP_004370757.1| CRISPR-associated protein Cas2 ... 61.6 4e-08
gi|323141544|ref|ZP_08076430.1| CRISPR-associated protein Cas2 [... 60.5 8e-08
gi|209526392|ref|ZP_03274920.1| CRISPR-associated protein Cas2 [... 60.1 1e-07
gi|172035263|ref|YP_001801764.1| hypothetical protein cce_0347 [... 60.1 1e-07
gi|284055102|ref|ZP_06385312.1| hypothetical protein AplaP_26965... 59.7 1e-07
gi|218442810|ref|YP_002381130.1| hypothetical protein PCC7424_58... 59.3 2e-07
gi|159899003|ref|YP_001545250.1| CRISPR-associated Cas2 family p... 58.5 3e-07
gi|163848916|ref|YP_001636960.1| CRISPR-associated Cas2 family p... 58.5 3e-07
gi|55820993|ref|YP_139435.1| hypothetical protein stu0958 [Strep... 58.2 4e-07
gi|159898756|ref|YP_001545003.1| CRISPR-associated Cas2 family p... 58.2 4e-07
gi|328953001|ref|YP_004370335.1| CRISPR-associated protein Cas2 ... 58.2 4e-07
gi|38505761|ref|NP_942381.1| hypothetical protein ssr7093 [Synec... 57.8 5e-07
gi|328949725|ref|YP_004367060.1| CRISPR-associated protein Cas2 ... 57.8 5e-07
gi|320161860|ref|YP_004175085.1| hypothetical protein ANT_24590 ... 57.8 5e-07
gi|312278319|gb|ADQ62976.1| CRISPR-associated protein, Cas2 fami... 57.8 5e-07
gi|308272613|emb|CBX29217.1| hypothetical protein N47_J01980 [un... 57.8 5e-07
gi|156741962|ref|YP_001432091.1| CRISPR-associated Cas2 family p... 57.8 6e-07
gi|258645679|ref|ZP_05733148.1| CRISPR-associated protein Cas2 [... 57.8 6e-07
gi|342214556|ref|ZP_08707243.1| CRISPR-associated endoribonuclea... 57.0 8e-07
gi|303231933|ref|ZP_07318641.1| CRISPR-associated protein Cas2 [... 57.0 1e-06
gi|333976332|gb|EGL77201.1| CRISPR-associated protein Cas2 [Veil... 57.0 1e-06
gi|327470947|gb|EGF16403.1| hypothetical protein HMPREF9386_0573... 56.6 1e-06
gi|307591968|ref|YP_003899559.1| CRISPR-associated protein Cas2 ... 56.2 2e-06
gi|55822915|ref|YP_141356.1| hypothetical protein str0958 [Strep... 56.2 2e-06
gi|219883137|ref|YP_002478299.1| CRISPR-associated protein Cas2 ... 56.2 2e-06
>gi|15609953|ref|NP_217332.1| hypothetical protein Rv2816c [Mycobacterium tuberculosis H37Rv]
gi|15842357|ref|NP_337394.1| hypothetical protein MT2883 [Mycobacterium tuberculosis CDC1551]
gi|31793992|ref|NP_856485.1| hypothetical protein Mb2840c [Mycobacterium bovis AF2122/97]
64 more sequence titles
Length=113
Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 113/113 (100%), Positives = 113/113 (100%), Gaps = 0/113 (0%)
Query 1 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE 60
MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE
Sbjct 1 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE 60
Query 61 AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF
Sbjct 61 AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
>gi|340627812|ref|YP_004746264.1| hypothetical protein MCAN_28401 [Mycobacterium canettii CIPT
140010059]
gi|340006002|emb|CCC45171.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=113
Score = 226 bits (575), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 112/113 (99%), Positives = 112/113 (99%), Gaps = 0/113 (0%)
Query 1 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE 60
MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE
Sbjct 1 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE 60
Query 61 AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVF
Sbjct 61 AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFL 113
>gi|308369868|ref|ZP_07419336.2| hypothetical protein TMBG_02950 [Mycobacterium tuberculosis SUMu002]
gi|308326178|gb|EFP15029.1| hypothetical protein TMBG_02950 [Mycobacterium tuberculosis SUMu002]
Length=88
Score = 175 bits (443), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/88 (100%), Positives = 88/88 (100%), Gaps = 0/88 (0%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNI 85
MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNI
Sbjct 1 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNI 60
Query 86 RIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
RIYKIRGVAAVTFYGRGRLVSAEEFVFF
Sbjct 61 RIYKIRGVAAVTFYGRGRLVSAEEFVFF 88
>gi|224543480|ref|ZP_03684019.1| hypothetical protein CATMIT_02689 [Catenibacterium mitsuokai
DSM 15897]
gi|224523607|gb|EEF92712.1| hypothetical protein CATMIT_02689 [Catenibacterium mitsuokai
DSM 15897]
Length=106
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/107 (44%), Positives = 68/107 (64%), Gaps = 4/107 (3%)
Query 5 SREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLT 64
RE+YF +VDE+ +I K+FVL+IYDI DNR+R A+ L+G+G RVQ+SAFEA L
Sbjct 2 EREDYF---FEVDENK-SIRKVFVLIIYDIVDNRKRQRFARWLSGYGVRVQKSAFEAHLR 57
Query 65 KGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFV 111
K + KLV I + D++RIYKI G + +G+ +E+ +
Sbjct 58 KNKFDKLVKGIPKRIGTQDSVRIYKINGKGQIISWGKDESEESEDII 104
>gi|229826472|ref|ZP_04452541.1| hypothetical protein GCWU000182_01845 [Abiotrophia defectiva
ATCC 49176]
gi|229789342|gb|EEP25456.1| hypothetical protein GCWU000182_01845 [Abiotrophia defectiva
ATCC 49176]
Length=95
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/87 (50%), Positives = 60/87 (69%), Gaps = 1/87 (1%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDN 84
K +L+IYDI+D++ R +L KIL+ FG RVQ+SAFEA L K Q KL+++I++F D DN
Sbjct 8 KYIILIIYDITDDKHRRNLVKILSSFGLRVQKSAFEARLNKRQYNKLLSKIEKFYRDSDN 67
Query 85 IRIYKIRGVAAVTFYGRGRLVSAEEFV 111
IRIY+++ V YG SAEE +
Sbjct 68 IRIYRLQEYEEVRVYG-TEDYSAEEVI 93
>gi|257413194|ref|ZP_04742268.2| CRISPR-associated protein Cas2 [Roseburia intestinalis L1-82]
gi|257204344|gb|EEV02629.1| CRISPR-associated protein Cas2 [Roseburia intestinalis L1-82]
Length=112
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/96 (44%), Positives = 62/96 (65%), Gaps = 3/96 (3%)
Query 18 ESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR 77
E +I K+++LVIYDI DN+RR AK + G+G+RVQ+SAFEAM+T+ +L+ I
Sbjct 15 EEENSIKKLYILVIYDIVDNKRRVRFAKKMNGYGFRVQKSAFEAMVTENLYRRLLHDIPE 74
Query 78 FAID--CDNIRIYKIRGVAAVTFYGRGRLVSAEEFV 111
ID D++R+YKIRG V+ +G + EE +
Sbjct 75 L-IDRRSDSVRVYKIRGYGEVSLFGASPEIKNEEVI 109
>gi|315925050|ref|ZP_07921267.1| CRISPR-associated protein cas2 [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621949|gb|EFV01913.1| CRISPR-associated protein cas2 [Pseudoramibacter alactolyticus
ATCC 23263]
Length=107
Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/89 (44%), Positives = 61/89 (69%), Gaps = 0/89 (0%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDN 84
++F L+IYDI DN++R L+K+LAG+G RVQ SAFEA L++ + A+L+A++ RF + D+
Sbjct 19 QIFALIIYDIIDNKKRYRLSKLLAGYGDRVQRSAFEARLSQKKYAELLAKLPRFCGEEDS 78
Query 85 IRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
IR+YKI G + +G V E+ +
Sbjct 79 IRVYKIVGEGQIQTWGVNAGVMQEDVILI 107
>gi|331004037|ref|ZP_08327519.1| CRISPR-associated protein cas2 [Lachnospiraceae oral taxon 107
str. F0167]
gi|330411623|gb|EGG91031.1| CRISPR-associated protein cas2 [Lachnospiraceae oral taxon 107
str. F0167]
Length=103
Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/96 (42%), Positives = 61/96 (64%), Gaps = 0/96 (0%)
Query 18 ESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR 77
E T K VL+IYDI++++ R L+K+L+ +G RVQ+SAFEA L K Q KLV+ +DR
Sbjct 8 EEISTQMKYRVLIIYDITEDKPRVKLSKLLSSYGIRVQKSAFEACLNKKQYDKLVSELDR 67
Query 78 FAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
+ D+IR+YK+ + V YG+ + +EF+
Sbjct 68 YVGREDSIRVYKLYEDSEVITYGKEDEILFDEFIII 103
>gi|339890604|gb|EGQ79705.1| CRISPR-associated protein cas2 [Fusobacterium nucleatum subsp.
animalis ATCC 51191]
Length=107
Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/86 (42%), Positives = 54/86 (63%), Gaps = 0/86 (0%)
Query 28 VLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRI 87
V++IYDI N+RR L+K+L+ FG+R+Q+SAFE +LT+ + L+ +IDR+A D IRI
Sbjct 22 VIIIYDIISNKRRTQLSKLLSAFGFRIQKSAFECLLTREKYKLLIEKIDRYAKPEDLIRI 81
Query 88 YKIRGVAAVTFYGRGRLVSAEEFVFF 113
Y++ YG E + FF
Sbjct 82 YRLNQNVVTQIYGEKLENENEMYYFF 107
>gi|291460044|ref|ZP_06599434.1| CRISPR-associated protein Cas2 [Oribacterium sp. oral taxon 078
str. F0262]
gi|291417385|gb|EFE91104.1| CRISPR-associated protein Cas2 [Oribacterium sp. oral taxon 078
str. F0262]
Length=105
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/90 (46%), Positives = 57/90 (64%), Gaps = 3/90 (3%)
Query 24 GKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID-- 81
GK+FVL+IYDI NRRR AK L G+G+RVQ+SAFEA++ K KL I + ID
Sbjct 15 GKLFVLIIYDIVSNRRRNKFAKCLNGYGFRVQKSAFEALIEKRLFLKLQKEIPQL-IDPS 73
Query 82 CDNIRIYKIRGVAAVTFYGRGRLVSAEEFV 111
D++RIY++ G V YG + A++ +
Sbjct 74 ADSVRIYRMTGYGEVDLYGVNTEIKADDIM 103
>gi|237741582|ref|ZP_04572063.1| predicted protein [Fusobacterium sp. 4_1_13]
gi|229429230|gb|EEO39442.1| predicted protein [Fusobacterium sp. 4_1_13]
Length=107
Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/86 (42%), Positives = 53/86 (62%), Gaps = 0/86 (0%)
Query 28 VLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRI 87
V++IYDI N+RR L+K+L+ FG+R+Q+SAFE +LT+ + L+ IDR+A D IRI
Sbjct 22 VIIIYDIISNKRRTQLSKLLSAFGFRIQKSAFECLLTREKYKLLIEEIDRYAKPEDLIRI 81
Query 88 YKIRGVAAVTFYGRGRLVSAEEFVFF 113
Y++ YG E + FF
Sbjct 82 YRLNQNVVTQIYGEKLENENEMYYFF 107
>gi|340752437|ref|ZP_08689236.1| CRISPR-associated protein cas2 [Fusobacterium sp. 2_1_31]
gi|229422236|gb|EEO37283.1| CRISPR-associated protein cas2 [Fusobacterium sp. 2_1_31]
Length=109
Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 35/75 (47%), Positives = 49/75 (66%), Gaps = 0/75 (0%)
Query 28 VLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRI 87
V+VIYDI N+RR L+K+L+ FG+R+Q SAFE +LT+ + LV RI+R+A D IRI
Sbjct 22 VIVIYDIISNKRRTQLSKLLSAFGFRIQRSAFECLLTREKYKLLVERINRYAKPEDLIRI 81
Query 88 YKIRGVAAVTFYGRG 102
Y++ YG
Sbjct 82 YRLNQNVITEIYGEN 96
>gi|294782685|ref|ZP_06748011.1| CRISPR-associated protein Cas2 [Fusobacterium sp. 1_1_41FAA]
gi|294481326|gb|EFG29101.1| CRISPR-associated protein Cas2 [Fusobacterium sp. 1_1_41FAA]
Length=109
Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/86 (41%), Positives = 52/86 (61%), Gaps = 0/86 (0%)
Query 28 VLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRI 87
V+VIYDI N+RR L+K+L+ FG+R+Q+SAFE +LT+ + L+ RI R+ D IRI
Sbjct 22 VIVIYDIISNKRRMQLSKLLSAFGFRIQKSAFECLLTREKYKLLIERISRYVKSEDLIRI 81
Query 88 YKIRGVAAVTFYGRGRLVSAEEFVFF 113
Y++ YG V E ++
Sbjct 82 YRLNQNVVTEIYGEKSEVENENKTYY 107
>gi|312899093|ref|ZP_07758471.1| CRISPR-associated protein Cas2 [Megasphaera micronuciformis F0359]
gi|310619760|gb|EFQ03342.1| CRISPR-associated protein Cas2 [Megasphaera micronuciformis F0359]
Length=100
Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 37/90 (42%), Positives = 54/90 (60%), Gaps = 2/90 (2%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA-IDCD 83
K +LVIYDI DN+RR+ + K L +G RVQ+SAFE ++K +L KL A I CD
Sbjct 12 KYVILVIYDIVDNKRRSQMVKCLEKYGIRVQKSAFEVYISKKKLVKLEAEAGSIIDITCD 71
Query 84 NIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
++RIY ++ A+ +G G AE+ +
Sbjct 72 SLRIYSLKHNTAIKTWGIG-CCKAEDVIIL 100
>gi|114567263|ref|YP_754417.1| hypothetical protein Swol_1748 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338198|gb|ABI69046.1| CRISPR-associated protein, Cas2 family [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
Length=109
Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 36/76 (48%), Positives = 48/76 (64%), Gaps = 0/76 (0%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDN 84
K V+VIYDI DNRRRA+ AK L GFG RVQ+SAFE +L + KL+ I + D
Sbjct 21 KYLVVVIYDIVDNRRRAAFAKYLKGFGVRVQKSAFECILPDAKYQKLLKGIPKLIDKEDQ 80
Query 85 IRIYKIRGVAAVTFYG 100
+R+YK+ A + +G
Sbjct 81 VRVYKLTSNADIRAWG 96
>gi|296133513|ref|YP_003640760.1| CRISPR-associated protein Cas2 [Thermincola sp. JR]
gi|296032091|gb|ADG82859.1| CRISPR-associated protein Cas2 [Thermincola potens JR]
Length=111
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/111 (37%), Positives = 59/111 (54%), Gaps = 3/111 (2%)
Query 1 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFE 60
+ TR ++YF D + + V+VIYD+ DN+RR LAK L FG+RVQ+SAFE
Sbjct 2 VETRLLDDYFRFD---DTEPEEMRRYLVVVIYDVIDNKRRNRLAKYLKRFGFRVQKSAFE 58
Query 61 AMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFV 111
+L KL I ++ D +R+YK+ G A V +G +E +
Sbjct 59 CVLDSKNYKKLTGGIAKYITADDLLRVYKLAGNADVQVWGSVEKTEVDEVI 109
>gi|121533568|ref|ZP_01665396.1| CRISPR-associated protein Cas2 [Thermosinus carboxydivorans Nor1]
gi|121308127|gb|EAX49041.1| CRISPR-associated protein Cas2 [Thermosinus carboxydivorans Nor1]
Length=111
Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/87 (48%), Positives = 55/87 (64%), Gaps = 3/87 (3%)
Query 22 TIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID 81
T K FVLVIYDI D++RR +AK+L FG+RVQ+SAFE ML + + +LV R ID
Sbjct 19 TSHKYFVLVIYDIIDDKRRRKMAKLLEAFGFRVQKSAFECMLDRRRYDRLVKIAPRL-ID 77
Query 82 C--DNIRIYKIRGVAAVTFYGRGRLVS 106
D++RIY + G AV +G +V
Sbjct 78 HAEDSLRIYLLSGKMAVLSWGSETIVD 104
>gi|341822660|emb|CCC73584.1| CRISPR-associated protein cas2 [Megasphaera elsdenii DSM 20460]
Length=101
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (44%), Positives = 49/82 (60%), Gaps = 7/82 (8%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVAR----IDRFAI 80
K VL+IYDI+DN+ R + L +G RVQ+SAFEA +TK + KL+ ID
Sbjct 13 KYIVLIIYDITDNKTRNKMVACLEKYGVRVQKSAFEAYITKRKYHKLMQEAPFLID---T 69
Query 81 DCDNIRIYKIRGVAAVTFYGRG 102
D D++RIY + AV +GRG
Sbjct 70 DTDSLRIYLLDSYMAVHSWGRG 91
>gi|313894759|ref|ZP_07828319.1| CRISPR-associated protein Cas2 [Selenomonas sp. oral taxon 137
str. F0430]
gi|312976440|gb|EFR41895.1| CRISPR-associated protein Cas2 [Selenomonas sp. oral taxon 137
str. F0430]
Length=109
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 37/80 (47%), Positives = 51/80 (64%), Gaps = 3/80 (3%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID--C 82
+ VLVIYDI D+R+R + + L G+G RVQ+SAFEA LTK Q ++ RI + ID
Sbjct 21 RYIVLVIYDIVDDRKRYRMVRFLEGYGIRVQKSAFEARLTKKQYDRMTTRIHKL-IDKGT 79
Query 83 DNIRIYKIRGVAAVTFYGRG 102
D++RIY + AV +G G
Sbjct 80 DSLRIYFLDNHFAVRSWGIG 99
>gi|292669133|ref|ZP_06602559.1| CRISPR-associated protein cas2 [Selenomonas noxia ATCC 43541]
gi|292649185|gb|EFF67157.1| CRISPR-associated protein cas2 [Selenomonas noxia ATCC 43541]
Length=105
Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 51/80 (64%), Gaps = 3/80 (3%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID--C 82
+ VLVIYDI++N+RRA + K L +G RVQ+SAFE LT+ + AKL + R ID
Sbjct 17 RYIVLVIYDITENKRRAKMVKCLERYGVRVQKSAFEGFLTEKKYAKLADQAHRL-IDPRT 75
Query 83 DNIRIYKIRGVAAVTFYGRG 102
D++RIY + +V +G G
Sbjct 76 DSLRIYLLANHTSVRSWGLG 95
>gi|253578034|ref|ZP_04855306.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850352|gb|EES78310.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39BFAA]
Length=64
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 29/40 (73%), Positives = 35/40 (88%), Gaps = 0/40 (0%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLT 64
K FVL+IYDI DNR+R LAK+L+G+G RVQ+SAFEAMLT
Sbjct 23 KEFVLIIYDIVDNRKRVKLAKLLSGYGKRVQKSAFEAMLT 62
>gi|121533441|ref|ZP_01665269.1| CRISPR-associated protein Cas2 [Thermosinus carboxydivorans Nor1]
gi|121308000|gb|EAX48914.1| CRISPR-associated protein Cas2 [Thermosinus carboxydivorans Nor1]
Length=111
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/89 (47%), Positives = 54/89 (61%), Gaps = 9/89 (10%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDC-- 82
K FVLVIYDI N+RR + K+L FG+RVQ+SAFE L + + +LV R ID
Sbjct 22 KYFVLVIYDIVCNKRRRRMVKLLEAFGFRVQKSAFECQLERRRYDRLVKIAPRL-IDKTE 80
Query 83 DNIRIYKIRGVAAVTFYGRGRLVSAEEFV 111
D++RIY + G +V +GR EEFV
Sbjct 81 DSLRIYLLSGKMSVLSWGR------EEFV 103
>gi|91201520|emb|CAJ74580.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis]
Length=91
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/86 (45%), Positives = 49/86 (57%), Gaps = 3/86 (3%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAI-DCDN 84
MF LV YDI + RRR LAKIL FG RVQ S FE +L + L K++ RI I + D+
Sbjct 1 MFYLVSYDIPETRRRTKLAKILEDFGDRVQYSVFECILDEKLLGKMIKRIQEIIIAEDDS 60
Query 85 IRIYKIRGVAA--VTFYGRGRLVSAE 108
IRIY I + G+G++ E
Sbjct 61 IRIYSICAGCEKRIEVMGKGKVSKIE 86
>gi|159898908|ref|YP_001545155.1| CRISPR-associated Cas2 family protein [Herpetosiphon aurantiacus
DSM 785]
gi|159891947|gb|ABX05027.1| CRISPR-associated protein Cas2 [Herpetosiphon aurantiacus DSM
785]
Length=92
Score = 61.6 bits (148), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/90 (38%), Positives = 49/90 (55%), Gaps = 3/90 (3%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAI-DCDN 84
MF+L+ YDI ++RR+ +AK L FG RVQ S FE LT QLA + R+ + + D+
Sbjct 1 MFILISYDIPHDKRRSKIAKTLENFGKRVQYSVFECQLTDSQLADVRGRLTALVVPNEDS 60
Query 85 IRIYKIR--GVAAVTFYGRGRLVSAEEFVF 112
IR Y + V A+ G G + F +
Sbjct 61 IRFYSLPKDAVTAMLILGHGVVTHDPSFYW 90
>gi|334126732|ref|ZP_08500680.1| CRISPR-associated protein cas2 [Centipeda periodontii DSM 2778]
gi|333391142|gb|EGK62263.1| CRISPR-associated protein cas2 [Centipeda periodontii DSM 2778]
Length=104
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 51/80 (64%), Gaps = 3/80 (3%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID--C 82
+ VLVIYDI+DNRRRA + K L +G RVQ+SAFEA LT+ + ++V + ID
Sbjct 16 RYIVLVIYDITDNRRRARMVKCLERYGIRVQKSAFEAFLTEKKYDRMVE-LTSGLIDPAT 74
Query 83 DNIRIYKIRGVAAVTFYGRG 102
D++RIY + +V +G G
Sbjct 75 DSLRIYLLANHTSVRSWGIG 94
>gi|328953423|ref|YP_004370757.1| CRISPR-associated protein Cas2 [Desulfobacca acetoxidans DSM
11109]
gi|328453747|gb|AEB09576.1| CRISPR-associated protein Cas2 [Desulfobacca acetoxidans DSM
11109]
Length=125
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/75 (43%), Positives = 47/75 (63%), Gaps = 1/75 (1%)
Query 15 KVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVAR 74
K+ + ++G MF+ + YDI+DNRRR LAK+L+ +G+RVQ+S FE L Q KL
Sbjct 24 KLPHAVTSLGLMFITISYDITDNRRRQRLAKMLSNYGHRVQKSVFECRLDDRQYLKLKKG 83
Query 75 IDR-FAIDCDNIRIY 88
I+ D D++R Y
Sbjct 84 IEEIIDWDDDSVRYY 98
>gi|323141544|ref|ZP_08076430.1| CRISPR-associated protein Cas2 [Phascolarctobacterium sp. YIT
12067]
gi|322414003|gb|EFY04836.1| CRISPR-associated protein Cas2 [Phascolarctobacterium sp. YIT
12067]
Length=92
Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/80 (43%), Positives = 49/80 (62%), Gaps = 3/80 (3%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAID--C 82
K VL+IYDI DN+RR + K L +G RVQ+SAFEA+L + Q K++ R ID
Sbjct 4 KFIVLMIYDIVDNKRRNKMVKCLEAYGVRVQKSAFEALLNRRQYEKML-RESSILIDEAV 62
Query 83 DNIRIYKIRGVAAVTFYGRG 102
D++R+Y + + V +G G
Sbjct 63 DSLRVYVLDDIIDVYTWGIG 82
>gi|209526392|ref|ZP_03274920.1| CRISPR-associated protein Cas2 [Arthrospira maxima CS-328]
gi|209493165|gb|EDZ93492.1| CRISPR-associated protein Cas2 [Arthrospira maxima CS-328]
Length=104
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/65 (44%), Positives = 41/65 (64%), Gaps = 1/65 (1%)
Query 27 FVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA-IDCDNI 85
F L+ YDI ++RRR ++ +L +G RVQ+S FEA+LT Q KL R+ + DCD +
Sbjct 3 FYLICYDIVEDRRRTKVSSLLEAYGIRVQKSVFEAVLTPPQFKKLEQRLKKLINSDCDQL 62
Query 86 RIYKI 90
R Y +
Sbjct 63 RFYPL 67
>gi|172035263|ref|YP_001801764.1| hypothetical protein cce_0347 [Cyanothece sp. ATCC 51142]
gi|171696717|gb|ACB49698.1| DUF196-containing protein [Cyanothece sp. ATCC 51142]
Length=119
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/78 (42%), Positives = 47/78 (61%), Gaps = 3/78 (3%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARI-DRFAIDCDN 84
+F +V YDI +RRR ++ +L G+G RVQ S FE +LTK Q +L R+ +R +D D+
Sbjct 29 VFYVVTYDIVCDRRRKKVSDLLEGYGQRVQYSVFECVLTKAQYKQLCTRMKERVNLDEDS 88
Query 85 IRIYKI--RGVAAVTFYG 100
IR Y I + V +G
Sbjct 89 IRFYPISDHTLGQVELWG 106
>gi|284055102|ref|ZP_06385312.1| hypothetical protein AplaP_26965 [Arthrospira platensis str.
Paraca]
gi|291568438|dbj|BAI90710.1| CRISPR-associated protein Cas2 [Arthrospira platensis NIES-39]
Length=93
Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/67 (42%), Positives = 41/67 (62%), Gaps = 1/67 (1%)
Query 27 FVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA-IDCDNI 85
F L+ YDI ++RRR ++ +L +G RVQ+S FEA+LT Q KL R+ + DCD +
Sbjct 3 FYLICYDIVEDRRRTKVSALLEAYGIRVQKSVFEAVLTPPQFKKLEQRLKKLINSDCDQL 62
Query 86 RIYKIRG 92
R Y +
Sbjct 63 RFYPLSA 69
>gi|218442810|ref|YP_002381130.1| hypothetical protein PCC7424_5842 [Cyanothece sp. PCC 7424]
gi|218175168|gb|ACK73900.1| CRISPR-associated protein Cas2 [Cyanothece sp. PCC 7424]
Length=91
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/71 (44%), Positives = 41/71 (58%), Gaps = 1/71 (1%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARI-DRFAIDCDN 84
M VLV+YDI DN+RR L+K L G+G RVQ S FE L+ ++ L ++ R DN
Sbjct 1 MLVLVVYDIPDNKRRTKLSKFLEGYGERVQWSVFECFLSLEEMRVLYQKVKKRVEPLEDN 60
Query 85 IRIYKIRGVAA 95
+R Y I A
Sbjct 61 VRFYWISNEAV 71
>gi|159899003|ref|YP_001545250.1| CRISPR-associated Cas2 family protein [Herpetosiphon aurantiacus
DSM 785]
gi|159892042|gb|ABX05122.1| CRISPR-associated protein Cas2 [Herpetosiphon aurantiacus DSM
785]
Length=93
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/72 (45%), Positives = 42/72 (59%), Gaps = 2/72 (2%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRF--AIDCD 83
M L+ YDI+ ++RR +AKIL GFG RVQ S FE LT Q KL ++ + D D
Sbjct 1 MLYLISYDIAVDKRRTKIAKILEGFGQRVQYSVFECDLTAKQYTKLRGKLHKVLRPEDGD 60
Query 84 NIRIYKIRGVAA 95
N+R Y+I A
Sbjct 61 NLRTYRICAACA 72
>gi|163848916|ref|YP_001636960.1| CRISPR-associated Cas2 family protein [Chloroflexus aurantiacus
J-10-fl]
gi|222526873|ref|YP_002571344.1| CRISPR-associated protein Cas2 [Chloroflexus sp. Y-400-fl]
gi|163670205|gb|ABY36571.1| CRISPR-associated protein Cas2 [Chloroflexus aurantiacus J-10-fl]
gi|222450752|gb|ACM55018.1| CRISPR-associated protein Cas2 [Chloroflexus sp. Y-400-fl]
Length=93
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/82 (44%), Positives = 50/82 (61%), Gaps = 5/82 (6%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDC-- 82
KMF ++ YDI D++RR S+ K+L G+G RVQ S FEA+L + L ++ R ID
Sbjct 2 KMFTVISYDIVDDQRRTSVMKVLKGYGVRVQYSVFEAILDAREFHDLSNQL-RKIIDPGQ 60
Query 83 DNIRIYKIRGVAA--VTFYGRG 102
D+IR Y++ VAA YG G
Sbjct 61 DSIRCYRLDQVAAQRTVIYGIG 82
>gi|55820993|ref|YP_139435.1| hypothetical protein stu0958 [Streptococcus thermophilus LMG
18311]
gi|116627765|ref|YP_820384.1| hypothetical protein STER_0971 [Streptococcus thermophilus LMD-9]
gi|55736978|gb|AAV60620.1| unknown protein [Streptococcus thermophilus LMG 18311]
gi|116101042|gb|ABJ66188.1| CRISPR-associated protein, Cas2 family [Streptococcus thermophilus
LMD-9]
Length=109
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/107 (35%), Positives = 56/107 (53%), Gaps = 3/107 (2%)
Query 9 YFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQL 68
YFNL + E + MF L+IYDI N+RR L+K+L G+G RVQ+S FE L++
Sbjct 4 YFNLSEEEREFAKQ-KTMFCLIIYDIRSNKRRLKLSKLLEGYGVRVQKSCFEVNLSRNDY 62
Query 69 AKLVARIDRF--AIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
L+ I+ F A + D+I +Y +F ++ +FF
Sbjct 63 QSLLKDIEGFYKADEEDSIIVYVTTKEEVTSFSPYHSAEKLDDILFF 109
>gi|159898756|ref|YP_001545003.1| CRISPR-associated Cas2 family protein [Herpetosiphon aurantiacus
DSM 785]
gi|159891795|gb|ABX04875.1| CRISPR-associated protein Cas2 [Herpetosiphon aurantiacus DSM
785]
Length=94
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/67 (41%), Positives = 41/67 (62%), Gaps = 1/67 (1%)
Query 29 LVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA-IDCDNIRI 87
++ YDI +++RR + K+L G+GY Q S FE LTK L +L A+I+R D IR+
Sbjct 8 IIAYDIPNDKRRTKVHKLLCGYGYWTQYSLFECWLTKRHLVELRAKINRLVDASLDTIRL 67
Query 88 YKIRGVA 94
Y++ G
Sbjct 68 YRVCGAC 74
>gi|328953001|ref|YP_004370335.1| CRISPR-associated protein Cas2 [Desulfobacca acetoxidans DSM
11109]
gi|328453325|gb|AEB09154.1| CRISPR-associated protein Cas2 [Desulfobacca acetoxidans DSM
11109]
Length=92
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/91 (37%), Positives = 51/91 (57%), Gaps = 3/91 (3%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR-FAIDCDN 84
MF + YDI DNRRR +AKIL +G RVQ S FEA L + LA+L R+++ + D
Sbjct 2 MFYAISYDIRDNRRRLRVAKILKDYGERVQLSVFEADLDEKSLARLKKRLEKCLDLTADG 61
Query 85 IRIYKIRGVA--AVTFYGRGRLVSAEEFVFF 113
+R+Y + G + G+G + +++
Sbjct 62 LRLYPLCGACRPRIEIMGQGVVSQDPDYIIL 92
>gi|38505761|ref|NP_942381.1| hypothetical protein ssr7093 [Synechocystis sp. PCC 6803]
gi|38423785|dbj|BAD01995.1| ssr7093 [Synechocystis sp. PCC 6803]
Length=92
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/86 (38%), Positives = 49/86 (57%), Gaps = 4/86 (4%)
Query 26 MFVLVI-YDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR-FAIDCD 83
MF+ VI YDI D+RRR +A +L G+G RVQ S FE L+K + +L R+ + + + D
Sbjct 1 MFLYVIAYDIPDDRRRKKMADLLEGYGQRVQYSVFECTLSKSKFNELQKRLRKIYQSEED 60
Query 84 NIRIYKIRG--VAAVTFYGRGRLVSA 107
++R Y + G + V +G L
Sbjct 61 SLRFYPLSGHTLTQVDIWGEPPLTKP 86
>gi|328949725|ref|YP_004367060.1| CRISPR-associated protein Cas2 [Marinithermus hydrothermalis
DSM 14884]
gi|328450049|gb|AEB10950.1| CRISPR-associated protein Cas2 [Marinithermus hydrothermalis
DSM 14884]
Length=91
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/80 (39%), Positives = 45/80 (57%), Gaps = 3/80 (3%)
Query 30 VIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR-FAIDCDNIRIY 88
V YD+ D+RRR +A +L +G RVQ S FE L ++ L R++R D++RIY
Sbjct 9 VTYDVPDDRRRVKIANLLKSYGERVQLSVFECWLNASEVEALKQRLERVMEPSEDSVRIY 68
Query 89 KIRGVAAVTFYGRGRLVSAE 108
+RG AV G G++ E
Sbjct 69 SVRG--AVQVLGVGKITEEE 86
>gi|320161860|ref|YP_004175085.1| hypothetical protein ANT_24590 [Anaerolinea thermophila UNI-1]
gi|319995714|dbj|BAJ64485.1| hypothetical protein ANT_24590 [Anaerolinea thermophila UNI-1]
Length=96
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/68 (43%), Positives = 45/68 (67%), Gaps = 1/68 (1%)
Query 24 GKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDR-FAIDC 82
G+ F ++ YDISD+RRR LA+++ G RVQ S FEA LT +L +L+ R + +
Sbjct 4 GRSFYVLAYDISDDRRRLKLARLMESLGVRVQGSVFEAYLTATELERLLRRCSKILKKEE 63
Query 83 DNIRIYKI 90
D++RIY++
Sbjct 64 DSLRIYRL 71
>gi|312278319|gb|ADQ62976.1| CRISPR-associated protein, Cas2 family [Streptococcus thermophilus
ND03]
gi|339278111|emb|CCC19859.1| CRISPR-associated protein cas2 [Streptococcus thermophilus JIM
8232]
Length=109
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/107 (35%), Positives = 56/107 (53%), Gaps = 3/107 (2%)
Query 9 YFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQL 68
YFNL + E + MF L+IYDI N+RR L+K+L G+G RVQ+S FE L++
Sbjct 4 YFNLSEEEREFAKQ-KTMFCLIIYDIRSNKRRLKLSKLLEGYGVRVQKSCFEVDLSRNDY 62
Query 69 AKLVARIDRF--AIDCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
L+ I+ F A + D+I +Y +F ++ +FF
Sbjct 63 QSLLKDIEGFYKADEEDSIIVYVTTKEEVTSFSPYHSAEKLDDILFF 109
>gi|308272613|emb|CBX29217.1| hypothetical protein N47_J01980 [uncultured Desulfobacterium
sp.]
gi|308274657|emb|CBX31256.1| hypothetical protein N47_E47680 [uncultured Desulfobacterium
sp.]
Length=91
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/64 (47%), Positives = 41/64 (65%), Gaps = 1/64 (1%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDC-DN 84
MF LV YDI D+RRR LAK L +G RVQ S FE +L + K++ RI+ ++ D+
Sbjct 1 MFYLVSYDIPDDRRRTRLAKTLKDYGGRVQYSVFECLLNQELFDKMIGRIETIIMEAEDS 60
Query 85 IRIY 88
+RIY
Sbjct 61 VRIY 64
>gi|156741962|ref|YP_001432091.1| CRISPR-associated Cas2 family protein [Roseiflexus castenholzii
DSM 13941]
gi|156233290|gb|ABU58073.1| CRISPR-associated protein Cas2 [Roseiflexus castenholzii DSM
13941]
Length=93
Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/81 (41%), Positives = 48/81 (60%), Gaps = 4/81 (4%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA--IDCD 83
M ++ YDI D+ RR LA +L GFG RVQ S FE LT+ + L+ +++R + D
Sbjct 1 MLYVIAYDIPDDARRLKLANVLEGFGQRVQRSVFECDLTEREYRALIKKVERVVNLNEGD 60
Query 84 NIRIYKIRG--VAAVTFYGRG 102
++RIY++ G VA V G G
Sbjct 61 SVRIYRLCGACVANVDVRGEG 81
>gi|258645679|ref|ZP_05733148.1| CRISPR-associated protein Cas2 [Dialister invisus DSM 15470]
gi|260403047|gb|EEW96594.1| CRISPR-associated protein Cas2 [Dialister invisus DSM 15470]
Length=103
Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/92 (36%), Positives = 53/92 (58%), Gaps = 5/92 (5%)
Query 24 GKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQ---LAKLVARIDRFAI 80
+ VL+IYDI+DN+RR S+ + L F RVQ+SAFE LT Q +++L +RI
Sbjct 14 NRYIVLIIYDITDNKRRLSMVRCLEQFAVRVQKSAFEGFLTPKQYECISELASRI--INA 71
Query 81 DCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVF 112
+ D++RIY + V +G G + + ++
Sbjct 72 EQDSLRIYILYDHTRVRSWGIGDIKEDDVIIY 103
>gi|342214556|ref|ZP_08707243.1| CRISPR-associated endoribonuclease Cas2 [Veillonella sp. oral
taxon 780 str. F0422]
gi|341592069|gb|EGS34964.1| CRISPR-associated endoribonuclease Cas2 [Veillonella sp. oral
taxon 780 str. F0422]
Length=91
Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/79 (38%), Positives = 47/79 (60%), Gaps = 1/79 (1%)
Query 25 KMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFA-IDCD 83
K VL+IYDI DN+ R + K L +G RVQ+SAFEA+L + Q ++ + R + D
Sbjct 3 KFIVLIIYDIVDNKIRLKMVKCLERYGVRVQKSAFEALLNRKQYDAMIRQCSRLINPNID 62
Query 84 NIRIYKIRGVAAVTFYGRG 102
++RIY + + + +G G
Sbjct 63 SLRIYILDDLVKIYTWGIG 81
>gi|303231933|ref|ZP_07318641.1| CRISPR-associated protein Cas2 [Veillonella atypica ACS-049-V-Sch6]
gi|302513362|gb|EFL55396.1| CRISPR-associated protein Cas2 [Veillonella atypica ACS-049-V-Sch6]
Length=91
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/81 (39%), Positives = 47/81 (59%), Gaps = 1/81 (1%)
Query 23 IGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDC 82
+ K VLVIYD+ DN+ R L K L +G RVQ+SAFEA+L + Q ++ R R
Sbjct 1 MKKFIVLVIYDVVDNKTRNHLVKCLERYGVRVQKSAFEALLNRKQYDVMMRRASRIINPV 60
Query 83 -DNIRIYKIRGVAAVTFYGRG 102
D++R+Y + + + +G G
Sbjct 61 EDSLRVYILDDIINIYTWGIG 81
>gi|333976332|gb|EGL77201.1| CRISPR-associated protein Cas2 [Veillonella parvula ACS-068-V-Sch12]
Length=91
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/82 (40%), Positives = 49/82 (60%), Gaps = 3/82 (3%)
Query 23 IGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRF--AI 80
+ K VLVIYD+ DN+ R L K L +G RVQ+SAFEA+L K Q ++ R + I
Sbjct 1 MKKFIVLVIYDVVDNKTRNRLVKCLERYGVRVQKSAFEALLNKKQYDAMMRRASKMINPI 60
Query 81 DCDNIRIYKIRGVAAVTFYGRG 102
+ D++R+Y + + + +G G
Sbjct 61 E-DSLRVYVLDDIINIYTWGIG 81
>gi|327470947|gb|EGF16403.1| hypothetical protein HMPREF9386_0573 [Streptococcus sanguinis
SK330]
Length=110
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/89 (42%), Positives = 45/89 (51%), Gaps = 2/89 (2%)
Query 27 FVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRF--AIDCDN 84
F LVIYDI N+RR LAK+L G+G RVQ S FE + K L+ I F A + DN
Sbjct 22 FCLVIYDIVSNKRRLKLAKLLEGYGTRVQSSCFEVNIEKLNFELLIKDIRDFYQADEGDN 81
Query 85 IRIYKIRGVAAVTFYGRGRLVSAEEFVFF 113
I +Y V F EE +FF
Sbjct 82 IIVYVGHKEETVVFNPYAGAELLEEILFF 110
>gi|307591968|ref|YP_003899559.1| CRISPR-associated protein Cas2 [Cyanothece sp. PCC 7822]
gi|306985613|gb|ADN17493.1| CRISPR-associated protein Cas2 [Cyanothece sp. PCC 7822]
Length=97
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/71 (44%), Positives = 42/71 (60%), Gaps = 5/71 (7%)
Query 23 IGKMFVLVIYDISD----NRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARI-DR 77
+ ++F L+IYD++D N+RR L +L GFG Q S FE LTK Q KL +I D
Sbjct 1 MSQLFYLIIYDLADSKAANKRRKRLHSLLCGFGKWTQYSVFECFLTKMQFVKLQHQIEDL 60
Query 78 FAIDCDNIRIY 88
D D++RIY
Sbjct 61 IKPDEDSVRIY 71
>gi|55822915|ref|YP_141356.1| hypothetical protein str0958 [Streptococcus thermophilus CNRZ1066]
gi|55738900|gb|AAV62541.1| unknown protein [Streptococcus thermophilus CNRZ1066]
Length=90
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/65 (45%), Positives = 42/65 (65%), Gaps = 2/65 (3%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRF--AIDCD 83
MF L+IYDI N+RR L+K+L G+G RVQ+S FE L++ L+ I+ F A + D
Sbjct 1 MFCLIIYDIRSNKRRLKLSKLLEGYGVRVQKSCFEVDLSRNDYQSLLKDIEGFSKADEED 60
Query 84 NIRIY 88
+I +Y
Sbjct 61 SIIVY 65
>gi|219883137|ref|YP_002478299.1| CRISPR-associated protein Cas2 [Cyanothece sp. PCC 7425]
gi|219867262|gb|ACL47600.1| CRISPR-associated protein Cas2 [Cyanothece sp. PCC 7425]
Length=92
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/68 (42%), Positives = 42/68 (62%), Gaps = 1/68 (1%)
Query 26 MFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARI-DRFAIDCDN 84
+F ++ YDIS N+RR +A +L G+G RVQ S FE +LT + +L R+ R+ D+
Sbjct 2 LFYVIAYDISCNKRRKKVADLLCGYGQRVQYSVFECVLTPDKYNELQKRLKKRYKETEDS 61
Query 85 IRIYKIRG 92
IR Y + G
Sbjct 62 IRFYPLSG 69
Lambda K H
0.328 0.142 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127560148160
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
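
Note on the statistics above: the bit scores and Expect values in this report can be approximately reproduced from the gapped Lambda and K and the effective search space listed in the footer. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E ~ search_space * 2^(-bits), and the function names are hypothetical. Because bit scores are printed rounded, and entries with the same 57.8 bits report both 5e-07 and 6e-07 in this report, the computed E-value should be read as an estimate only.

```python
import math

# Gapped parameters and effective search space, copied from the
# statistics footer of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 127560148160


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to a bit score (assumed standard formula)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Estimate the E-value for a bit score over the effective search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores taken from the "Score = ... bits (raw)" lines above.
    for raw in (138, 136, 135, 134):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect_value(bits):.0e}")
```

For the raw score 138, this gives about 57.8 bits and an E-value near 5e-07, in line with the "Score = 57.8 bits (138), Expect = 5e-07" entries above.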