BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2822c
Length=124
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609959|ref|NP_217338.1| hypothetical protein Rv2822c [Mycob... 251 2e-65
gi|340627818|ref|YP_004746270.1| hypothetical protein MCAN_28461... 181 3e-44
gi|322375485|ref|ZP_08049998.1| CRISPR-associated protein, Csm2 ... 103 8e-21
gi|125718070|ref|YP_001035203.1| hypothetical protein SSA_1250 [... 103 8e-21
gi|327469965|gb|EGF15429.1| csm2 family CRISPR-associated protei... 103 1e-20
gi|325687529|gb|EGD29550.1| csm2 family CRISPR-associated protei... 100 7e-20
gi|322387544|ref|ZP_08061153.1| csm2 family CRISPR-associated pr... 100 8e-20
gi|270292487|ref|ZP_06198698.1| CRISPR-associated protein, Csm2 ... 96.7 1e-18
gi|55820996|ref|YP_139438.1| hypothetical protein stu0961 [Strep... 96.7 1e-18
gi|116627767|ref|YP_820386.1| CRISPR-system related protein [Str... 96.3 1e-18
gi|331004042|ref|ZP_08327524.1| hypothetical protein HMPREF0491_... 94.0 6e-18
gi|229826459|ref|ZP_04452528.1| hypothetical protein GCWU000182_... 92.8 2e-17
gi|227890792|ref|ZP_04008597.1| conserved hypothetical protein [... 91.7 4e-17
gi|240143673|ref|ZP_04742274.1| CRISPR-associated protein, Csm2 ... 87.0 9e-16
gi|334308470|gb|EGL99456.1| CRISPR-associated protein, Csm2 fami... 85.5 3e-15
gi|339278114|emb|CCC19862.1| hypothetical protein STH8232_1163 [... 85.1 3e-15
gi|114567270|ref|YP_754424.1| hypothetical protein Swol_1755 [Sy... 84.7 4e-15
gi|253578039|ref|ZP_04855311.1| CRISPR-associated protein [Rumin... 83.2 1e-14
gi|296133520|ref|YP_003640767.1| CRISPR-associated protein, Csm2... 82.4 2e-14
gi|291460040|ref|ZP_06599430.1| CRISPR-associated protein, Csm2 ... 81.6 4e-14
gi|345284421|gb|AEN78274.1| CRISPR-associated protein, Csm2 fami... 77.4 6e-13
gi|313894781|ref|ZP_07828341.1| CRISPR-associated protein, Csm2 ... 76.3 2e-12
gi|315925060|ref|ZP_07921277.1| csm2 family CRISPR-associated pr... 73.6 1e-11
gi|224543485|ref|ZP_03684024.1| hypothetical protein CATMIT_0269... 70.9 7e-11
gi|121533439|ref|ZP_01665267.1| CRISPR-associated protein, Csm2 ... 70.5 7e-11
gi|334126730|ref|ZP_08500678.1| csm2 family CRISPR-associated pr... 70.5 8e-11
gi|315641552|ref|ZP_07896621.1| csm2 family CRISPR-associated pr... 70.1 1e-10
gi|323141262|ref|ZP_08076158.1| CRISPR-associated protein, Csm2 ... 69.7 1e-10
gi|341822662|emb|CCC73586.1| CRISPR-associated protein [Megaspha... 67.8 5e-10
gi|342215298|ref|ZP_08707947.1| CRISPR type III-A/MTUBE-associat... 67.4 6e-10
gi|225018976|ref|ZP_03708168.1| hypothetical protein CLOSTMETH_0... 65.5 3e-09
gi|339893264|emb|CCB52450.1| CRISPR associated protein [Staphylo... 65.1 3e-09
gi|289549400|ref|YP_003470304.1| CRISPR-associated protein [Stap... 65.1 3e-09
gi|57865883|ref|YP_190003.1| CRISPR-associated Csm2 family prote... 63.5 1e-08
gi|341656706|gb|EGS80415.1| CRISPR-associated protein, Csm2 fami... 63.2 1e-08
gi|312899095|ref|ZP_07758473.1| CRISPR-associated protein, Csm2 ... 63.2 1e-08
gi|340752430|ref|ZP_08689229.1| csm2 family CRISPR-associated pr... 62.8 1e-08
gi|237741575|ref|ZP_04572056.1| predicted protein [Fusobacterium... 59.7 1e-07
gi|294782692|ref|ZP_06748018.1| CRISPR-associated protein, Csm2 ... 58.9 2e-07
gi|339890600|gb|EGQ79702.1| Csm2 family CRISPR-associated protei... 58.9 3e-07
gi|295105104|emb|CBL02648.1| CRISPR-associated protein, Csm2 fam... 58.5 3e-07
gi|269798860|ref|YP_003312760.1| CRISPR-associated protein, Csm2... 58.2 4e-07
gi|303231971|ref|ZP_07318679.1| CRISPR-associated protein, Csm2 ... 57.4 8e-07
gi|333976300|gb|EGL77169.1| CRISPR-associated protein, Csm2 fami... 57.0 9e-07
gi|238018267|ref|ZP_04598693.1| hypothetical protein VEIDISOL_00... 56.6 1e-06
gi|260424751|ref|ZP_05733155.2| CRISPR-associated protein, Csm2 ... 55.5 3e-06
gi|301299526|ref|ZP_07205795.1| conserved domain protein [Lactob... 48.1 4e-04
gi|292669141|ref|ZP_06602567.1| Csm2 family CRISPR-associated pr... 45.4 0.003
gi|329736392|gb|EGG72661.1| CRISPR-associated protein, Csm2 fami... 42.4 0.022
gi|335429797|ref|ZP_08556695.1| Xenobiotic-transporting ATPase [... 34.7 5.0
>gi|15609959|ref|NP_217338.1| hypothetical protein Rv2822c [Mycobacterium tuberculosis H37Rv]
gi|15842363|ref|NP_337400.1| hypothetical protein MT2889 [Mycobacterium tuberculosis CDC1551]
gi|31793998|ref|NP_856491.1| hypothetical protein Mb2846c [Mycobacterium bovis AF2122/97]
77 more sequence titles
Length=124
Score = 251 bits (642), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 124/124 (100%), Positives = 124/124 (100%), Gaps = 0/124 (0%)
Query 1 MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKE 60
MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKE
Sbjct 1 MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKE 60
Query 61 KVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLD 120
KVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLD
Sbjct 61 KVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLD 120
Query 121 PKDK 124
PKDK
Sbjct 121 PKDK 124
>gi|340627818|ref|YP_004746270.1| hypothetical protein MCAN_28461 [Mycobacterium canettii CIPT
140010059]
gi|340006008|emb|CCC45177.1| putative uncharacterized protein [Mycobacterium canettii CIPT
140010059]
Length=130
Score = 181 bits (459), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 93/124 (75%), Positives = 103/124 (84%), Gaps = 1/124 (0%)
Query 1 MSVIQDDYVKQAE-VIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLK 59
MSVIQDDYVKQAE VIRGLPKK FELTTTQLRVLLSLTAQLFDEAQ S++ L L+
Sbjct 1 MSVIQDDYVKQAEQVIRGLPKKNGDFELTTTQLRVLLSLTAQLFDEAQLSSDQNLSPALR 60
Query 60 EKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYL 119
+KVQYLRVRFVYQ+GRE AV+ FV A LL+ L IGDSRD LL+FC YMEAL AYKK+L
Sbjct 61 DKVQYLRVRFVYQAGREKAVRVFVERAGLLDELAQIGDSRDRLLKFCHYMEALVAYKKFL 120
Query 120 DPKD 123
DPK+
Sbjct 121 DPKE 124
>gi|322375485|ref|ZP_08049998.1| CRISPR-associated protein, Csm2 family [Streptococcus sp. C300]
gi|321279748|gb|EFX56788.1| CRISPR-associated protein, Csm2 family [Streptococcus sp. C300]
Length=126
Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 61/131 (47%), Positives = 90/131 (69%), Gaps = 13/131 (9%)
Query 1 MSVIQDD-YVKQAE-VIRGLPKKKNG------FELTTTQLRVLLSLTAQLFDEAQQSANP 52
M+++ DD YV +AE VI+ L K+ F LTTT++R LL+LT+ LFDE++ +
Sbjct 1 MAILTDDNYVDKAEKVIKSLNHTKDHRNNKIKFFLTTTKIRNLLNLTSNLFDESKVRS-- 58
Query 53 TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEAL 112
++L +K+ YLRV+FVYQSGRE AVK V+ A++L+ L+ I ++++ L RFCRYMEAL
Sbjct 59 --YKELADKIAYLRVQFVYQSGRETAVKDLVKKAEILDILKEI-NNKESLQRFCRYMEAL 115
Query 113 AAYKKYLDPKD 123
AY ++ KD
Sbjct 116 VAYFRFYGGKD 126
>gi|125718070|ref|YP_001035203.1| hypothetical protein SSA_1250 [Streptococcus sanguinis SK36]
gi|125497987|gb|ABN44653.1| Conserved uncharacterized protein, putative [Streptococcus sanguinis
SK36]
gi|327474438|gb|EGF19844.1| csm2 family CRISPR-associated protein [Streptococcus sanguinis
SK408]
Length=176
Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 64/130 (50%), Positives = 86/130 (67%), Gaps = 12/130 (9%)
Query 3 VIQDDYVKQAE-VIRGLPKKK----NG---FELTTTQLRVLLSLTAQLFDEAQQSANPTL 54
++ DDYV +A+ VI L K NG F+LT+TQ+R L +LT+ LFDE++ +
Sbjct 51 ILTDDYVDKADLVIHTLDSDKKQLNNGKIKFKLTSTQIRNLAALTSNLFDESKTKNDI-- 108
Query 55 PRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAA 114
+L+EK+ YLR++FVYQSGRE AV+ FV+ A LLEAL I D L RFCRYMEAL A
Sbjct 109 -EELREKISYLRIQFVYQSGREPAVEDFVKKAGLLEALTEIQDI-PSLQRFCRYMEALIA 166
Query 115 YKKYLDPKDK 124
Y K+ +D+
Sbjct 167 YFKFNGGRDQ 176
>gi|327469965|gb|EGF15429.1| csm2 family CRISPR-associated protein [Streptococcus sanguinis
SK330]
Length=179
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/133 (46%), Positives = 85/133 (64%), Gaps = 15/133 (11%)
Query 3 VIQDDYVKQAEVI-----------RGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSAN 51
++ DDYV +A+++ G K K F+LT+TQ+R L +LT+ LFDE++ +
Sbjct 51 ILTDDYVDKADLVIHSLKNSGTYTEGPNKGKIKFKLTSTQIRNLAALTSNLFDESKTKND 110
Query 52 PTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEA 111
+L+EK+ YLR++FVYQSGRE AV+ FV+ A LLEAL I D L RFCRYMEA
Sbjct 111 I---EELREKISYLRIQFVYQSGREPAVEDFVKKAGLLEALTEIQDI-PSLQRFCRYMEA 166
Query 112 LAAYKKYLDPKDK 124
L AY K+ +D+
Sbjct 167 LIAYFKFNGGRDQ 179
>gi|325687529|gb|EGD29550.1| csm2 family CRISPR-associated protein [Streptococcus sanguinis
SK72]
Length=179
Score = 100 bits (249), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/127 (47%), Positives = 82/127 (65%), Gaps = 15/127 (11%)
Query 3 VIQDDYVKQAEVI-----------RGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSAN 51
++ D YV +A+++ G K K F+LT+TQ+R L +LT+ LFDE++ +
Sbjct 51 ILTDAYVDKADLVIHNLKNSGTYTEGPNKDKIKFKLTSTQIRNLAALTSNLFDESKTKND 110
Query 52 PTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEA 111
+L+EK+ YLR++FVYQSGRE AV+ FV+ A LLEAL+ I D L RFCRYMEA
Sbjct 111 I---EELREKISYLRIQFVYQSGREPAVEDFVKKAGLLEALKEIQDI-PSLQRFCRYMEA 166
Query 112 LAAYKKY 118
L AY K+
Sbjct 167 LIAYFKF 173
>gi|322387544|ref|ZP_08061153.1| csm2 family CRISPR-associated protein [Streptococcus infantis
ATCC 700779]
gi|321141411|gb|EFX36907.1| csm2 family CRISPR-associated protein [Streptococcus infantis
ATCC 700779]
Length=126
Score = 100 bits (249), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/131 (47%), Positives = 87/131 (67%), Gaps = 13/131 (9%)
Query 1 MSVIQDD-YVKQAE-VIRGLPKKK------NGFELTTTQLRVLLSLTAQLFDEAQQSANP 52
M+++ DD YV +AE VI+ L + F LTT+++R LLSLT+ LFDE++
Sbjct 1 MAILTDDNYVDKAENVIKSLNRNTRDSRNPEAFLLTTSKIRNLLSLTSTLFDESKVRE-- 58
Query 53 TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEAL 112
+ L +K+ YLRV+FVYQSGRE AVK V+ A++L+ L+ I ++++ L RFCRYMEAL
Sbjct 59 --YKDLADKIAYLRVQFVYQSGRETAVKDLVKKAEILDILKEI-NNKESLQRFCRYMEAL 115
Query 113 AAYKKYLDPKD 123
AY K+ KD
Sbjct 116 VAYFKFYGGKD 126
>gi|270292487|ref|ZP_06198698.1| CRISPR-associated protein, Csm2 family [Streptococcus sp. M143]
gi|270278466|gb|EFA24312.1| CRISPR-associated protein, Csm2 family [Streptococcus sp. M143]
Length=126
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/131 (47%), Positives = 85/131 (65%), Gaps = 13/131 (9%)
Query 1 MSVIQDD-YVKQAE-VIRGLPKKKNGFE------LTTTQLRVLLSLTAQLFDEAQQSANP 52
M+++ DD YV +AE I+ L K F+ L+ ++LR LLSLT+ LFDE++
Sbjct 1 MAILTDDNYVDKAEKTIKNLVTDKRNFKNKNSDVLSMSKLRNLLSLTSTLFDESKVRE-- 58
Query 53 TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEAL 112
+LK+K+ YLRV+FVYQSGRE+AV V+ ++L L+ I +SR+ L RFCRYMEAL
Sbjct 59 --YEELKDKIAYLRVQFVYQSGREEAVLDLVQKGEILPILKEI-NSRESLQRFCRYMEAL 115
Query 113 AAYKKYLDPKD 123
AY K+ KD
Sbjct 116 VAYFKFYGGKD 126
>gi|55820996|ref|YP_139438.1| hypothetical protein stu0961 [Streptococcus thermophilus LMG
18311]
gi|55736981|gb|AAV60623.1| conserved hypothetical protein [Streptococcus thermophilus LMG
18311]
gi|312278322|gb|ADQ62979.1| CRISPR-associated protein, Csm2 family [Streptococcus thermophilus
ND03]
Length=126
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/131 (48%), Positives = 84/131 (65%), Gaps = 13/131 (9%)
Query 1 MSVIQD-DYVKQAE-----VIRGLPKKKN--GFELTTTQLRVLLSLTAQLFDEAQQSANP 52
M+++ D +YV AE + R +KN F LTT++LR LLSLT+ LFDE++
Sbjct 1 MTILTDENYVDIAEKAILKLERNTRNRKNPDAFFLTTSKLRNLLSLTSTLFDESKVKEYD 60
Query 53 TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEAL 112
L +++ YLRV+FVYQ+GRE AVK + A++LEAL+ I D R+ L RFCRYMEAL
Sbjct 61 ALL----DRIAYLRVQFVYQAGREIAVKDLIEKAQILEALKEIKD-RETLQRFCRYMEAL 115
Query 113 AAYKKYLDPKD 123
AY K+ KD
Sbjct 116 VAYFKFYGGKD 126
>gi|116627767|ref|YP_820386.1| CRISPR-system related protein [Streptococcus thermophilus LMD-9]
gi|116101044|gb|ABJ66190.1| CRISPR-associated protein, Csm2 family [Streptococcus thermophilus
LMD-9]
Length=126
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/131 (48%), Positives = 84/131 (65%), Gaps = 13/131 (9%)
Query 1 MSVIQD-DYVKQAE-----VIRGLPKKKN--GFELTTTQLRVLLSLTAQLFDEAQQSANP 52
M+++ D +YV AE + R +KN F LTT++LR LLSLT+ LFDE++
Sbjct 1 MTILTDENYVDIAEKAILKLERNTRNRKNPDAFFLTTSKLRNLLSLTSTLFDESKVKEYD 60
Query 53 TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEAL 112
L +++ YLRV+FVYQ+GRE AVK + A++LEAL+ I D R+ L RFCRYMEAL
Sbjct 61 DLL----DRIAYLRVQFVYQAGREIAVKDLIEKAQILEALKEIKD-RETLQRFCRYMEAL 115
Query 113 AAYKKYLDPKD 123
AY K+ KD
Sbjct 116 VAYFKFYGGKD 126
>gi|331004042|ref|ZP_08327524.1| hypothetical protein HMPREF0491_02386 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330411628|gb|EGG91036.1| hypothetical protein HMPREF0491_02386 [Lachnospiraceae oral taxon
107 str. F0167]
Length=146
Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 48/124 (39%), Positives = 81/124 (66%), Gaps = 2/124 (1%)
Query 1 MSVIQDDYVKQAE-VIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLK 59
+ + ++YVK+AE VI L K+ LTT+++R LL++ + ++ +A++ + TL
Sbjct 23 IELTNENYVKKAEDVINNLIAGKSKI-LTTSKIRKLLAMVSDMYTKAKRLKSNTLSSDWV 81
Query 60 EKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYL 119
K+QY ++ +Y++GRE +VK FV A+++E ++ I +D L+ FC YMEAL AY+KYL
Sbjct 82 SKIQYFKMHTIYEAGREPSVKKFVEEAQIIEQIDKIKADKDKLILFCLYMEALVAYRKYL 141
Query 120 DPKD 123
KD
Sbjct 142 GGKD 145
>gi|229826459|ref|ZP_04452528.1| hypothetical protein GCWU000182_01832 [Abiotrophia defectiva
ATCC 49176]
gi|229789329|gb|EEP25443.1| hypothetical protein GCWU000182_01832 [Abiotrophia defectiva
ATCC 49176]
Length=130
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/125 (40%), Positives = 81/125 (65%), Gaps = 7/125 (5%)
Query 1 MSVIQDDYVKQAE-VIRGLPK-KKNGFEL-----TTTQLRVLLSLTAQLFDEAQQSANPT 53
M + ++Y+K+AE VI L K K+ G +L TTT+LR +LS+ ++++ +A +
Sbjct 1 MILTNENYLKEAEKVIDNLCKDKRTGKQLYAPKITTTKLRKILSMVSEIYSDASRLREEK 60
Query 54 LPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALA 113
L ++K ++QYL++ +Y+ GRE VK FV +KL AL+ + DS+ L+ FC Y+EAL
Sbjct 61 LDTEMKSRLQYLKLHIIYEEGREAVVKEFVEESKLTAALDEVKDSKSQLINFCHYVEALV 120
Query 114 AYKKY 118
AY+K+
Sbjct 121 AYRKF 125
>gi|227890792|ref|ZP_04008597.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
gi|227867201|gb|EEJ74622.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
Length=160
Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 67/100 (67%), Gaps = 6/100 (6%)
Query 27 LTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNA 86
LT TQLR LL++T+ ++DEA+ + + EK+ YL+V+F+YQSGR AVK FV A
Sbjct 65 LTNTQLRNLLAMTSAVYDEARNNGFD----HVNEKIAYLKVQFIYQSGRNLAVKAFVEVA 120
Query 87 KLLEALEGIGDSR--DGLLRFCRYMEALAAYKKYLDPKDK 124
+L+E ++ I D + D LLRFC YMEAL AY KY DK
Sbjct 121 QLVELVDKIRDLKKMDDLLRFCHYMEALIAYFKYYGGADK 160
>gi|240143673|ref|ZP_04742274.1| CRISPR-associated protein, Csm2 family [Roseburia intestinalis
L1-82]
gi|257204350|gb|EEV02635.1| CRISPR-associated protein, Csm2 family [Roseburia intestinalis
L1-82]
gi|291539921|emb|CBL13032.1| CRISPR-associated protein, Csm2 family [Roseburia intestinalis
XB6B4]
Length=130
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/130 (37%), Positives = 82/130 (64%), Gaps = 6/130 (4%)
Query 1 MSVIQDDYVKQAE-VIRGLPKKKNG-----FELTTTQLRVLLSLTAQLFDEAQQSANPTL 54
M + +++YV AE I+ L +K+ +TT+++R LL++T+ ++++ S + L
Sbjct 1 MKLTEENYVGIAEQAIKELCSEKDQKGRLVGPVTTSKIRNLLAMTSDIYNDVVNSQSDKL 60
Query 55 PRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAA 114
++ ++ Y+++RF+Y++GRE VK V AK+LE LE I SRD + F RYMEAL A
Sbjct 61 NAEIIGRINYMKIRFIYEAGREPKVKKLVDKAKILEHLEEIKGSRDQYILFSRYMEALVA 120
Query 115 YKKYLDPKDK 124
Y+K+ +D+
Sbjct 121 YRKFYGGRDE 130
>gi|334308470|gb|EGL99456.1| CRISPR-associated protein, Csm2 family [Lactobacillus salivarius
NIAS840]
Length=152
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/117 (45%), Positives = 73/117 (63%), Gaps = 9/117 (7%)
Query 6 DDYVKQAEVIRGLPK----KKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEK 61
+ YV +A I G+ K K N LT TQLR LL++T ++ EAQ+ ++ K
Sbjct 35 ESYVDEARKIIGIFKEEKFKINKNILTNTQLRNLLAMTNSVYAEAQKKGFDSV----KGD 90
Query 62 VQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
+ YL++ F+YQSGR AVK FV A+L++ +E + + +D L RFCRYMEAL AY KY
Sbjct 91 IAYLKIHFIYQSGRNIAVKAFVELAQLIKVIEELKNLKD-LQRFCRYMEALVAYFKY 146
>gi|339278114|emb|CCC19862.1| hypothetical protein STH8232_1163 [Streptococcus thermophilus
JIM 8232]
Length=130
Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/135 (43%), Positives = 79/135 (59%), Gaps = 17/135 (12%)
Query 1 MSVIQD-DYVKQAEVIRGLPKKKN--GFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQ 57
M+++ D +YV +AE L +K N + LTT+Q+R LLSL + L+D +++ +
Sbjct 1 MAILTDENYVDKAERAISLLEKDNKGNYLLTTSQIRKLLSLCSSLYDRSKERKFD----E 56
Query 58 LKEKVQYLRVRFVYQSGREDA---------VKTFVRNAKLLEALEGIGDSRDGLLRFCRY 108
L V YLRV+FVYQSGR VK V ++LEAL+ I D R+ L RFCRY
Sbjct 57 LINDVSYLRVQFVYQSGRNSVRVNRQTFFPVKDLVEKGQILEALKEIKD-RETLQRFCRY 115
Query 109 MEALAAYKKYLDPKD 123
MEAL AY K+ KD
Sbjct 116 MEALVAYFKFYGGKD 130
>gi|114567270|ref|YP_754424.1| hypothetical protein Swol_1755 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338205|gb|ABI69053.1| hypothetical protein Swol_1755 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
Length=158
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/121 (39%), Positives = 76/121 (63%), Gaps = 4/121 (3%)
Query 7 DYVKQAE-VIRGLPKK--KNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQ 63
+Y QAE VI+ L K +N TT+++R +L+ ++++++ + + L L+ +++
Sbjct 38 NYTAQAEQVIQELKKSMGRNYQNFTTSKIRNILAQVSEIYNDVRAENDVFLSPDLQNRIE 97
Query 64 YLRVRFVYQSGREDAV-KTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPK 122
YL+VR VY+ GRE + K FV AKLL+ L IGD+R ++F RYMEAL AY ++ +
Sbjct 98 YLKVRLVYECGREPWIIKPFVDKAKLLDLLNNIGDNRQNFIKFARYMEALVAYHRFYGGR 157
Query 123 D 123
D
Sbjct 158 D 158
>gi|253578039|ref|ZP_04855311.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850357|gb|EES78315.1| CRISPR-associated protein [Ruminococcus sp. 5_1_39BFAA]
Length=131
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/131 (36%), Positives = 83/131 (64%), Gaps = 7/131 (5%)
Query 1 MSVIQDDYVKQAE-VIRGL---PKKKNGFEL---TTTQLRVLLSLTAQLFDEAQQSANPT 53
M + +++YV +AE I+ L K+K ++ TT+++R LL++TA ++++ +
Sbjct 1 MRINENNYVDKAEEAIKSLVEESKQKCRGKVNIVTTSKIRNLLAMTADIYNQVLTYTSEK 60
Query 54 LPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALA 113
L ++ +++YLR+RF+Y+ GRE VK FV+ A++LE L+ I S+ L F +YMEAL
Sbjct 61 LDDEICGRIEYLRIRFIYECGREPKVKAFVKQAEILEILKEIRQSKKNYLLFSKYMEALI 120
Query 114 AYKKYLDPKDK 124
A+ KY K++
Sbjct 121 AFHKYYGGKEQ 131
>gi|296133520|ref|YP_003640767.1| CRISPR-associated protein, Csm2 family [Thermincola sp. JR]
gi|296032098|gb|ADG82866.1| CRISPR-associated protein, Csm2 family [Thermincola potens JR]
Length=125
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/119 (43%), Positives = 72/119 (61%), Gaps = 8/119 (6%)
Query 7 DYVKQA-EVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQ------QSANPTLPRQLK 59
DYVK+A EVI+ L KK+NG +TT+Q+R L+ + ++ Q + LP ++
Sbjct 4 DYVKRAAEVIKDL-KKENGKMVTTSQIRKFLAGVNAIKNKVQIRTFQGEITEGRLPEDIQ 62
Query 60 EKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
++Q L+V+ VYQ GRE VKTFV AKLL+ ++ I S L F Y+EAL AY KY
Sbjct 63 REIQALKVKLVYQCGREPKVKTFVEKAKLLDGIDAIEGSTKKFLDFAGYVEALVAYHKY 121
>gi|291460040|ref|ZP_06599430.1| CRISPR-associated protein, Csm2 family [Oribacterium sp. oral
taxon 078 str. F0262]
gi|291417381|gb|EFE91100.1| CRISPR-associated protein, Csm2 family [Oribacterium sp. oral
taxon 078 str. F0262]
Length=157
Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/119 (35%), Positives = 76/119 (64%), Gaps = 2/119 (1%)
Query 7 DYVKQAE-VIRGLPKKKNGFEL-TTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQY 64
+YV +AE VI+ L ++K+ + +T++LR LLS+++ +++E +L + ++ K+ Y
Sbjct 38 NYVDEAEKVIKALIERKSKKNMISTSKLRNLLSMSSDIYNEILMEKGSSLSKTIEAKICY 97
Query 65 LRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKD 123
+RVRF Y++GRE++VK F+ A E ++ I SR+ F Y+E+L A+ +Y K+
Sbjct 98 MRVRFYYEAGREESVKAFLNEADAFEQIKKIEGSREKFFFFHHYLESLVAFHRYYVEKN 156
>gi|345284421|gb|AEN78274.1| CRISPR-associated protein, Csm2 family [Lactobacillus ruminis
ATCC 27782]
Length=145
Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/133 (41%), Positives = 74/133 (56%), Gaps = 21/133 (15%)
Query 8 YVKQAE-VIRGL--------PKKKNGFE--LTTTQLRVLLSLTAQLFDEAQQSANPTLPR 56
YVK AE VIR L +KNG LT + +R +LS T+ ++D + T
Sbjct 17 YVKSAENVIRFLKDENFHVVTNRKNGKGDYLTMSAIRNILSETSAIYDTVRSQGVETA-- 74
Query 57 QLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIG------DSRDGLLRFCRYME 110
+ K+ YL+V+ VYQSGR AVK FV+ + LL AL+ + + +D ++ FCRYME
Sbjct 75 --RIKLSYLKVKLVYQSGRNAAVKRFVKVSNLLGALDEVNEYYEKPEEKDWIILFCRYME 132
Query 111 ALAAYKKYLDPKD 123
AL AY KY KD
Sbjct 133 ALVAYFKYYGGKD 145
>gi|313894781|ref|ZP_07828341.1| CRISPR-associated protein, Csm2 family [Selenomonas sp. oral
taxon 137 str. F0430]
gi|312976462|gb|EFR41917.1| CRISPR-associated protein, Csm2 family [Selenomonas sp. oral
taxon 137 str. F0430]
Length=127
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 63/120 (53%), Gaps = 6/120 (5%)
Query 10 KQAEVIRGLPKKKNG-FELTTTQLRVLLSLTAQLFDEA-----QQSANPTLPRQLKEKVQ 63
K VI L + G +L Q+R LS L ++ + TLP L +VQ
Sbjct 8 KAQSVIPSLMQDNRGDIKLKANQIRKFLSAVTTLTNKVNRYKMKHPHEKTLPDDLAAQVQ 67
Query 64 YLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKD 123
YLRV+ YQ+GR+ AVK FV A+L + GI +S + +F RYMEAL AY KY KD
Sbjct 68 YLRVKMAYQAGRDKAVKDFVEKAQLDAVICGIKNSIETYEKFARYMEALVAYHKYYGGKD 127
>gi|315925060|ref|ZP_07921277.1| csm2 family CRISPR-associated protein [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621959|gb|EFV01923.1| csm2 family CRISPR-associated protein [Pseudoramibacter alactolyticus
ATCC 23263]
Length=132
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/110 (36%), Positives = 67/110 (61%), Gaps = 2/110 (1%)
Query 10 KQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRF 69
+ A VI+ L + K +TT++R LL++T+ +++E L ++ E+++YL++RF
Sbjct 12 RAAHVIQDLYQNKQL--PSTTKIRDLLAMTSSIYNEILIQRQDELSAEMVERIEYLKIRF 69
Query 70 VYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYL 119
+Y++G++ F++ A LL L+ I SR L F RYMEAL A+ KY
Sbjct 70 LYEAGKDKDTWFFIKKAGLLSILDEIEASRKNYLLFSRYMEALVAFYKYF 119
>gi|224543485|ref|ZP_03684024.1| hypothetical protein CATMIT_02694 [Catenibacterium mitsuokai
DSM 15897]
gi|224523612|gb|EEF92717.1| hypothetical protein CATMIT_02694 [Catenibacterium mitsuokai
DSM 15897]
Length=125
Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 35/98 (36%), Positives = 58/98 (60%), Gaps = 1/98 (1%)
Query 27 LTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNA 86
LT +Q+R +L+++A +++ +S L L +++ YL VR Y++GR VK FV A
Sbjct 29 LTVSQIRNILAMSADIYNSVLESPTENLSEDLLDRISYLTVRLYYEAGRNQLVKKFVEKA 88
Query 87 KLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKDK 124
KL+E L+ +D + + YMEAL A+ +Y KD+
Sbjct 89 KLIEKLKNAKTKKD-YVDYYHYMEALVAFHRYYGGKDQ 125
>gi|121533439|ref|ZP_01665267.1| CRISPR-associated protein, Csm2 family [Thermosinus carboxydivorans
Nor1]
gi|121307998|gb|EAX48912.1| CRISPR-associated protein, Csm2 family [Thermosinus carboxydivorans
Nor1]
Length=134
Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/122 (37%), Positives = 68/122 (56%), Gaps = 9/122 (7%)
Query 6 DDYVKQAEVIRGLPKKKNG-FELTTTQLRVLLSLTAQLFD--EAQQSANPT------LPR 56
D+ +KQA+ I K ++G L TT+LR L+ + + EA QS LP+
Sbjct 2 DEIIKQAQKIVADLKDRDGKIRLNTTKLRKFLTAVNAINNKLEAYQSQTGAGNELKELPK 61
Query 57 QLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYK 116
L ++++YL V+ Y+SGRE VK FV AKL++ + IG S D F + +EA+ A+
Sbjct 62 PLADEIRYLEVKLAYESGREKDVKDFVTKAKLIDRIRAIGTSADKYRDFAKLIEAIVAFH 121
Query 117 KY 118
KY
Sbjct 122 KY 123
>gi|334126730|ref|ZP_08500678.1| csm2 family CRISPR-associated protein [Centipeda periodontii
DSM 2778]
gi|333391140|gb|EGK62261.1| csm2 family CRISPR-associated protein [Centipeda periodontii
DSM 2778]
Length=133
Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/125 (39%), Positives = 67/125 (54%), Gaps = 8/125 (6%)
Query 7 DYVKQAE-VIRGLPKKKNG-FELTTTQLRVLLSLTAQLFDE-----AQQSANPTLPRQLK 59
D ++AE VI L K+ NG LTT+Q+R L+ L ++ AQ L L
Sbjct 9 DIAREAENVIVRLAKEGNGRLFLTTSQIRKFLAAVNALTNKITVYRAQNDGATALTEALA 68
Query 60 EKVQYLRVRFVYQSGRED-AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
+V+YL+V+ YQ GR AV+ FV A+L E ++GIG + F Y+EAL AY KY
Sbjct 69 SEVKYLKVKLAYQVGRNPRAVRPFVETARLTEWIDGIGTNIRAYEDFAHYVEALVAYHKY 128
Query 119 LDPKD 123
+D
Sbjct 129 HGGRD 133
>gi|315641552|ref|ZP_07896621.1| csm2 family CRISPR-associated protein [Enterococcus italicus
DSM 15952]
gi|315482689|gb|EFU73216.1| csm2 family CRISPR-associated protein [Enterococcus italicus
DSM 15952]
Length=140
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/98 (36%), Positives = 63/98 (65%), Gaps = 4/98 (4%)
Query 23 NGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTF 82
NG LTT++LR LL L ++ + S + TL ++++++YL+V+F Y+SGRE AV+TF
Sbjct 40 NG--LTTSKLRNLLELINHVYTKVYNSDDTTLSEDVRDELEYLKVKFAYESGREPAVRTF 97
Query 83 VRNAKLLEALEGI--GDSRDGLLRFCRYMEALAAYKKY 118
+ + + ++ + +++ L +C+Y EAL AY K+
Sbjct 98 IEKTYVDKLVDVVLKKNTKKIFLDYCKYFEALVAYAKF 135
>gi|323141262|ref|ZP_08076158.1| CRISPR-associated protein, Csm2 family [Phascolarctobacterium
sp. YIT 12067]
gi|322414219|gb|EFY05042.1| CRISPR-associated protein, Csm2 family [Phascolarctobacterium
sp. YIT 12067]
Length=162
Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/127 (34%), Positives = 73/127 (58%), Gaps = 10/127 (7%)
Query 7 DYVKQAE-VIRGLPKKKN----GFELTTTQLRVLLSLTAQLFDE-----AQQSANPTLPR 56
D V +AE I+GL K ++TT+Q+R L+ + ++ A+ L +
Sbjct 36 DVVTEAEKAIKGLQYKDRYDNIKIDVTTSQIRKFLTAVNVVRNKVDLYKAKNKGAEALSK 95
Query 57 QLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYK 116
+L ++++L+V +YQ+GR AVK F+ +KL ++GIGDS ++F +Y+EAL AY
Sbjct 96 ELTAEIKFLKVNLLYQAGRTAAVKQFMTVSKLNIIIDGIGDSLARFVKFTKYVEALVAYH 155
Query 117 KYLDPKD 123
K+L +D
Sbjct 156 KFLGGRD 162
>gi|341822662|emb|CCC73586.1| CRISPR-associated protein [Megasphaera elsdenii DSM 20460]
Length=126
Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/124 (34%), Positives = 67/124 (55%), Gaps = 7/124 (5%)
Query 7 DYVKQAE-VIRGLPKKKNG-FELTTTQLRVLLSLTAQLFDE-----AQQSANPTLPRQLK 59
D K+AE I L K+ NG L T Q+R L+ + ++ A+ LP +L
Sbjct 3 DIAKEAEQAILALKKQNNGKIYLKTNQIRKFLTAVNAITNKVNVYKAKHLDATELPDELA 62
Query 60 EKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYL 119
++Q+L+V+ YQ+GRE +VK F++ + + + +E +G S F Y+EAL AY K+
Sbjct 63 GEIQFLKVKAAYQAGRERSVKDFMKQSNMKQHIEAVGTSIAKYEAFAHYVEALVAYHKFY 122
Query 120 DPKD 123
KD
Sbjct 123 GGKD 126
>gi|342215298|ref|ZP_08707947.1| CRISPR type III-A/MTUBE-associated protein Csm2 [Veillonella
sp. oral taxon 780 str. F0422]
gi|341588588|gb|EGS31982.1| CRISPR type III-A/MTUBE-associated protein Csm2 [Veillonella
sp. oral taxon 780 str. F0422]
Length=148
Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/131 (34%), Positives = 74/131 (57%), Gaps = 19/131 (14%)
Query 7 DYVKQAE-VIRGLPKKKNG-FELTTTQLRVLLS----LTAQLFDEAQQSANPT--LPRQL 58
DYV +AE VI+GL K +N L T+QLR +LS + ++ EA ++ + + +L
Sbjct 3 DYVSEAESVIKGLSKNRNNEILLNTSQLRKILSAITDVKNKVIVEAAKNKDKIKRISPEL 62
Query 59 KEKVQYLRVRFVYQSGRE-----------DAVKTFVRNAKLLEALEGIGDSRDGLLRFCR 107
+ ++++L+ YQ+GRE +AV F+ AKL+ L+ IG+ D +C+
Sbjct 63 QMEIRFLKTILRYQAGRELEENNKKRITTNAVDEFIEKAKLIPRLDAIGEDIDKFYEYCK 122
Query 108 YMEALAAYKKY 118
Y+E+L A+ KY
Sbjct 123 YIESLVAFHKY 133
>gi|225018976|ref|ZP_03708168.1| hypothetical protein CLOSTMETH_02927 [Clostridium methylpentosum
DSM 5476]
gi|224948256|gb|EEG29465.1| hypothetical protein CLOSTMETH_02927 [Clostridium methylpentosum
DSM 5476]
Length=166
Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/113 (30%), Positives = 64/113 (57%), Gaps = 1/113 (0%)
Query 7 DYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQ-QSANPTLPRQLKEKVQYL 65
DY+ + + + PK + +LT TQ+R + ++++ + Q + L +++++++
Sbjct 47 DYMPKRKDEKKRPKSCDYGDLTVTQMRNMWGRVTAIYNQVRLQPSADNLSGEIQQQLRAF 106
Query 66 RVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
++R VY+S R V F + + +L AL+ IG+ +D R+ RY EAL AY Y
Sbjct 107 KIRLVYESARTPDVGEFCQTSSVLSALDQIGEDKDKFFRYVRYFEALVAYHYY 159
>gi|339893264|emb|CCB52450.1| CRISPR associated protein [Staphylococcus lugdunensis N920143]
Length=128
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/99 (32%), Positives = 57/99 (58%), Gaps = 2/99 (2%)
Query 27 LTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNA 86
LTT++LR L+ +L+ S L R ++++YL+++F Y++GRE +V F++
Sbjct 30 LTTSKLRNLMEQVNRLYTMIFNSTEEKLSRNFIDELEYLKIKFYYEAGREKSVDEFLKKT 89
Query 87 KLLEALEGI--GDSRDGLLRFCRYMEALAAYKKYLDPKD 123
+ ++ + +S+ L +C+Y EAL AY KY +D
Sbjct 90 LMFPIIDKVIQKESKKFFLDYCKYFEALVAYSKYYQKED 128
>gi|289549400|ref|YP_003470304.1| CRISPR-associated protein [Staphylococcus lugdunensis HKU09-01]
gi|289178932|gb|ADC86177.1| CRISPR-associated protein [Staphylococcus lugdunensis HKU09-01]
Length=141
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/99 (32%), Positives = 57/99 (58%), Gaps = 2/99 (2%)
Query 27 LTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNA 86
LTT++LR L+ +L+ S L R ++++YL+++F Y++GRE +V F++
Sbjct 43 LTTSKLRNLMEQVNRLYTMIFNSTEEKLSRNFIDELEYLKIKFYYEAGREKSVDEFLKKT 102
Query 87 KLLEALEGI--GDSRDGLLRFCRYMEALAAYKKYLDPKD 123
+ ++ + +S+ L +C+Y EAL AY KY +D
Sbjct 103 LMFPIIDKVIQKESKKFFLDYCKYFEALVAYSKYYQKED 141
>gi|57865883|ref|YP_190003.1| CRISPR-associated Csm2 family protein [Staphylococcus epidermidis
RP62A]
gi|57636541|gb|AAW53329.1| CRISPR-associated protein, TM1810 family [Staphylococcus epidermidis
RP62A]
Length=128
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/103 (33%), Positives = 60/103 (59%), Gaps = 4/103 (3%)
Query 23 NGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTF 82
NG LTT++LR L+ +L+ A S L + ++++YL+++F Y++GRE +V F
Sbjct 28 NG--LTTSKLRNLMEQVNRLYTIAFNSNEDQLNEEFIDELEYLKIKFYYEAGREKSVDEF 85
Query 83 VRNAKLLEALEGI--GDSRDGLLRFCRYMEALAAYKKYLDPKD 123
++ + ++ + +S+ L +C+Y EAL AY KY +D
Sbjct 86 LKKTLMFPIIDRVIKKESKKFFLDYCKYFEALVAYAKYYQKED 128
>gi|341656706|gb|EGS80415.1| CRISPR-associated protein, Csm2 family [Staphylococcus epidermidis
VCU037]
Length=141
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/103 (33%), Positives = 60/103 (59%), Gaps = 4/103 (3%)
Query 23 NGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTF 82
NG LTT++LR L+ +L+ A S L + ++++YL+++F Y++GRE +V F
Sbjct 41 NG--LTTSKLRNLMEQVNRLYTIAFNSNEDQLNEEFIDELEYLKIKFYYEAGREKSVDEF 98
Query 83 VRNAKLLEALEGI--GDSRDGLLRFCRYMEALAAYKKYLDPKD 123
++ + ++ + +S+ L +C+Y EAL AY KY +D
Sbjct 99 LKKTLMFPIIDRVIKKESKKFFLDYCKYFEALVAYAKYYQKED 141
>gi|312899095|ref|ZP_07758473.1| CRISPR-associated protein, Csm2 family [Megasphaera micronuciformis
F0359]
gi|310619762|gb|EFQ03344.1| CRISPR-associated protein, Csm2 family [Megasphaera micronuciformis
F0359]
Length=150
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/106 (34%), Positives = 54/106 (51%), Gaps = 9/106 (8%)
Query 27 LTTTQLRVLLSLTAQLFDEA---------QQSANPTLPRQLKEKVQYLRVRFVYQSGRED 77
+T +Q+R L+ L D+ Q L L +V+YL+++ YQSGR+
Sbjct 43 ITVSQIRKFLTAVNSLTDKIERYKVEHLRQGEQVLELSTDLAAEVKYLKIKLAYQSGRKS 102
Query 78 AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKD 123
+VK F + A LL + IG + + F RY+EAL AY KY +D
Sbjct 103 SVKDFEKKAGLLAEISSIGKDLNKYMNFARYVEALVAYHKYYGGRD 148
>gi|340752430|ref|ZP_08689229.1| csm2 family CRISPR-associated protein [Fusobacterium sp. 2_1_31]
gi|229422229|gb|EEO37276.1| csm2 family CRISPR-associated protein [Fusobacterium sp. 2_1_31]
Length=122
Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/102 (35%), Positives = 56/102 (55%), Gaps = 4/102 (3%)
Query 27 LTTTQLRVLLS----LTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTF 82
+TTTQLR+LLS + ++ E + + +L+ +++YL V+ +YQ GRE VK F
Sbjct 21 VTTTQLRLLLSNAVIIKNKIQVETRTKKGDEISEKLENEIKYLLVKHIYQCGREPKVKRF 80
Query 83 VRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKDK 124
+ E ++ IG S F RY+E + AY KY + +K
Sbjct 81 DNEFHISEKIKSIGKSAKKFNEFYRYLEEIVAYMKYYESDNK 122
>gi|237741575|ref|ZP_04572056.1| predicted protein [Fusobacterium sp. 4_1_13]
gi|229429223|gb|EEO39435.1| predicted protein [Fusobacterium sp. 4_1_13]
Length=119
Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/101 (36%), Positives = 56/101 (56%), Gaps = 2/101 (1%)
Query 20 KKKNGFELTTTQLRVLLSLTAQLFDEAQQSA--NPTLPRQLKEKVQYLRVRFVYQSGRED 77
+K N +TTTQLR+LLS + ++ Q + +L+ +++YL V+ +YQ GRE
Sbjct 14 QKDNKNPVTTTQLRLLLSNAVIIKNKIQVETRKGDEISEKLENEIKYLLVKHIYQCGREP 73
Query 78 AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
VKTF + + ++ IG S F RY+E + AY KY
Sbjct 74 KVKTFDNEFVISKKIKEIGKSAKKFNEFYRYLEEIVAYMKY 114
>gi|294782692|ref|ZP_06748018.1| CRISPR-associated protein, Csm2 family [Fusobacterium sp. 1_1_41FAA]
gi|294481333|gb|EFG29108.1| CRISPR-associated protein, Csm2 family [Fusobacterium sp. 1_1_41FAA]
Length=119
Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 57/105 (55%), Gaps = 6/105 (5%)
Query 21 KKNGFELTTTQLRVLLS----LTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGRE 76
KKN +TTTQLR+LLS + ++ E + + +L+ +++YL V+ +YQ GRE
Sbjct 17 KKNT--VTTTQLRLLLSNAVIIKNKIQVETRTKKGDEISEKLENEIKYLLVKHIYQCGRE 74
Query 77 DAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDP 121
VK F + E ++ IG S F RY+E + AY KY +
Sbjct 75 PKVKRFDNEFYISEKIKEIGRSAKKFNEFYRYLEEIVAYMKYYES 119
>gi|339890600|gb|EGQ79702.1| Csm2 family CRISPR-associated protein [Fusobacterium nucleatum
subsp. animalis ATCC 51191]
Length=120
Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/94 (37%), Positives = 53/94 (57%), Gaps = 2/94 (2%)
Query 27 LTTTQLRVLLSLTAQLFDEAQQSANP--TLPRQLKEKVQYLRVRFVYQSGREDAVKTFVR 84
+TT+QLR+LLS + ++ Q + +L+ +V+YL ++ +YQ GRE VK F
Sbjct 22 VTTSQLRLLLSNAVVVKNKIQVEVGKGDEISEKLQNEVKYLLIKHIYQCGREPKVKKFDD 81
Query 85 NAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
K+ E ++ IG S F RY+E + AY KY
Sbjct 82 YFKISEKIKEIGKSAKKFNEFYRYLEEIVAYMKY 115
>gi|295105104|emb|CBL02648.1| CRISPR-associated protein, Csm2 family [Faecalibacterium prausnitzii
SL3/3]
Length=150
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/120 (30%), Positives = 64/120 (54%), Gaps = 8/120 (6%)
Query 3 VIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKV 62
+I ++YV AE + K+N +T T+++ LL L +++ + L ++ ++
Sbjct 23 IIPENYVDFAEQL----MKENCALITKTKIQNLLRLACDVYNNENRRTEERLLKESVNQI 78
Query 63 QYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGD----SRDGLLRFCRYMEALAAYKKY 118
+ LR+R Y+ GR+ V+ FV +A L E L + +R L+ + YMEAL A+ +Y
Sbjct 79 KLLRIRLAYECGRDPQVRQFVESANLFEYLAKLSSVGTCTRQDLIDYYHYMEALVAFHRY 138
>gi|269798860|ref|YP_003312760.1| CRISPR-associated protein, Csm2 family [Veillonella parvula DSM
2008]
gi|269095489|gb|ACZ25480.1| CRISPR-associated protein, Csm2 family [Veillonella parvula DSM
2008]
Length=170
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/103 (31%), Positives = 58/103 (57%), Gaps = 9/103 (8%)
Query 25 FELTTTQLRVLLSLTAQL-----FDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGRED-- 77
F++ Q+R +LS + ++ + + LP + +V++L+ F+YQ+GR+
Sbjct 36 FDVKYAQVRKILSSVVAIKNKLGVEQRKSKSFDKLPENIAMEVRFLKTTFLYQAGRDKDN 95
Query 78 --AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
VK F+ +++L+E +E IG + FC+Y+EAL A+ KY
Sbjct 96 KYPVKNFIEDSQLVEMVECIGTDVNKFEMFCKYVEALVAFYKY 138
>gi|303231971|ref|ZP_07318679.1| CRISPR-associated protein, Csm2 family [Veillonella atypica ACS-049-V-Sch6]
gi|302513400|gb|EFL55434.1| CRISPR-associated protein, Csm2 family [Veillonella atypica ACS-049-V-Sch6]
Length=170
Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/103 (33%), Positives = 60/103 (59%), Gaps = 9/103 (8%)
Query 25 FELTTTQLRVLLSLTAQLFDE--AQQSANPT---LPRQLKEKVQYLRVRFVYQSGRED-- 77
F++ Q+R +LS + ++ +Q N + LP + +V++L+ F+YQ+GR+
Sbjct 36 FDVKYAQVRKILSSVVAIKNKLGVEQRKNKSFDKLPDSIAMEVRFLKATFLYQAGRDKDY 95
Query 78 --AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
VK+F+ +++L+E +E IG FC+Y+EAL A+ KY
Sbjct 96 KYPVKSFIEDSQLVEMVECIGTDVKKFDIFCKYVEALVAFYKY 138
>gi|333976300|gb|EGL77169.1| CRISPR-associated protein, Csm2 family [Veillonella parvula ACS-068-V-Sch12]
Length=170
Score = 57.0 bits (136), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/103 (31%), Positives = 57/103 (56%), Gaps = 9/103 (8%)
Query 25 FELTTTQLRVLLSLTAQL-----FDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGRED-- 77
F++ Q+R +LS + ++ + + LP + +V++L+ F+YQ+GR+
Sbjct 36 FDVKYAQVRKILSSVVAIKNKLGVEQRKSKSFDKLPENIAMEVRFLKTTFLYQAGRDKDN 95
Query 78 --AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
VK F+ +++L+E +E IG FC+Y+EAL A+ KY
Sbjct 96 KYPVKNFIEDSQLVEMVECIGTDVKKFDMFCKYVEALVAFYKY 138
>gi|238018267|ref|ZP_04598693.1| hypothetical protein VEIDISOL_00091 [Veillonella dispar ATCC
17748]
gi|237864738|gb|EEP66028.1| hypothetical protein VEIDISOL_00091 [Veillonella dispar ATCC
17748]
Length=170
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/103 (30%), Positives = 58/103 (57%), Gaps = 9/103 (8%)
Query 25 FELTTTQLRVLLSLTAQL-----FDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGRED-- 77
F++ Q+R +LS + ++ + + LP + +V++L+ F+YQ+GR+
Sbjct 36 FDVKYAQVRKILSSVVAIKNKLGVEQRKSKSFDKLPDSIAMEVRFLKTTFLYQAGRDKDY 95
Query 78 --AVKTFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKY 118
+K+F+ +++L+E +E IG FC+Y+EAL A+ KY
Sbjct 96 KYPIKSFIEDSQLVEMVECIGTDVKKFDMFCKYVEALVAFYKY 138
>gi|260424751|ref|ZP_05733155.2| CRISPR-associated protein, Csm2 family [Dialister invisus DSM
15470]
gi|260403054|gb|EEW96601.1| CRISPR-associated protein, Csm2 family [Dialister invisus DSM
15470]
Length=133
Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/134 (29%), Positives = 70/134 (53%), Gaps = 12/134 (8%)
Query 1 MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEA------QQSANPTL 54
M + ++ V +A+ + G +K G +TT+Q+R L+ + ++ + TL
Sbjct 1 MMLEENKIVDRAQQVMGNLSRK-GQMVTTSQIRKFLTAVNTVTEKVNAYKLEKTDEYDTL 59
Query 55 PRQLKEKVQYLRVRFVYQSGRE-----DAVKTFVRNAKLLEALEGIGDSRDGLLRFCRYM 109
P +L+ +++YL+V+ YQ GR + V+ F + A L+ ++GI S +F Y+
Sbjct 60 PVELQAQIKYLKVKLAYQIGRNRSKWGNPVEDFEKEAGLISLIDGIKSSTKEYEKFAHYI 119
Query 110 EALAAYKKYLDPKD 123
EAL A+ K+ KD
Sbjct 120 EALVAFHKFYGGKD 133
>gi|301299526|ref|ZP_07205795.1| conserved domain protein [Lactobacillus salivarius ACS-116-V-Col5a]
gi|300852873|gb|EFK80488.1| conserved domain protein [Lactobacillus salivarius ACS-116-V-Col5a]
Length=49
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/49 (54%), Positives = 32/49 (66%), Gaps = 2/49 (4%)
Query 78 AVKTFVRNAKLLEALEGIGDSR--DGLLRFCRYMEALAAYKKYLDPKDK 124
AVK F+ A+L+E ++ I D + D LLRFC YMEAL AY KY DK
Sbjct 1 AVKAFIEVAQLVELVDMIRDFKELDDLLRFCHYMEALIAYFKYYGGSDK 49
>gi|292669141|ref|ZP_06602567.1| Csm2 family CRISPR-associated protein [Selenomonas noxia ATCC
43541]
gi|292649193|gb|EFF67165.1| Csm2 family CRISPR-associated protein [Selenomonas noxia ATCC
43541]
Length=149
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/138 (28%), Positives = 65/138 (48%), Gaps = 22/138 (15%)
Query 6 DDYVKQAE-VIRGLPKKK-----NGFELTTTQLRVLLSLTAQLFDEA------------- 46
DD +AE +I GL + NG LTT Q+R L+ L ++
Sbjct 12 DDIAGKAEKIILGLKNDRLLGGTNG--LTTNQIRKFLTAVNTLTNKIILYRYQQMKARGR 69
Query 47 QQSANPTLPRQLKEKVQYLRVRFVYQSGRED-AVKTFVRNAKLLEALEGIGDSRDGLLRF 105
+Q + +L + V++L+V+ YQ R + VK F + +L E ++ +G + F
Sbjct 70 EQEKAFEMSDELAKAVRFLKVKLAYQVARGNKGVKRFAEDTRLKEYIDTVGTDLREYMAF 129
Query 106 CRYMEALAAYKKYLDPKD 123
+++EAL AY K+ K+
Sbjct 130 AQFIEALVAYHKFYGEKE 147
>gi|329736392|gb|EGG72661.1| CRISPR-associated protein, Csm2 family [Staphylococcus epidermidis
VCU045]
Length=102
Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 21/62 (34%), Positives = 39/62 (63%), Gaps = 2/62 (3%)
Query 23 NGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTF 82
NG LTT++LR L+ +L+ A S L + ++++YL+++F Y++GRE +V F
Sbjct 41 NG--LTTSKLRNLMEQVNRLYTIAFNSNEDQLNEEFIDELEYLKIKFYYEAGREKSVDEF 98
Query 83 VR 84
++
Sbjct 99 LK 100
>gi|335429797|ref|ZP_08556695.1| Xenobiotic-transporting ATPase [Haloplasma contractile SSD-17B]
gi|334889807|gb|EGM28092.1| Xenobiotic-transporting ATPase [Haloplasma contractile SSD-17B]
Length=601
Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats.
Identities = 23/90 (26%), Positives = 40/90 (45%), Gaps = 6/90 (6%)
Query 12 AEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVY 71
A +I P+ GFE + V+L+ D+ + + + R ++ ++ V F Y
Sbjct 314 AAIINIYPQLAKGFESMNSISEVVLA------DDVEDNQGKQIIRSVEGSFEFCNVNFSY 367
Query 72 QSGREDAVKTFVRNAKLLEALEGIGDSRDG 101
E A+K F N K E + +G+S G
Sbjct 368 NESEEHAIKDFDLNVKKGEVIALVGESGAG 397
Lambda K H
0.319 0.136 0.373
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130872486112
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40