BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2818c
Length=382
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15842359|ref|NP_337396.1| hypothetical protein MT2885 [Mycoba... 775 0.0
gi|323718575|gb|EGB27742.1| csm6 family CRISPR-associated protei... 775 0.0
gi|15609955|ref|NP_217334.1| hypothetical protein Rv2818c [Mycob... 773 0.0
gi|148824007|ref|YP_001288761.1| hypothetical protein TBFG_12832... 773 0.0
gi|289448479|ref|ZP_06438223.1| csm6 family CRISPR-associated pr... 771 0.0
gi|340627814|ref|YP_004746266.1| hypothetical protein MCAN_28421... 733 0.0
gi|298526287|ref|ZP_07013696.1| conserved hypothetical protein [... 688 0.0
gi|31793994|ref|NP_856487.1| hypothetical protein Mb2842c [Mycob... 615 3e-174
gi|121638697|ref|YP_978921.1| hypothetical protein BCG_2837c [My... 615 3e-174
gi|306781001|ref|ZP_07419338.1| CRISPR-associated protein, Csm6 ... 478 7e-133
gi|308371143|ref|ZP_07423964.2| hypothetical protein TMCG_02062 ... 298 8e-79
gi|306781005|ref|ZP_07419342.1| hypothetical protein TMBG_02955 ... 298 1e-78
gi|298525991|ref|ZP_07013400.1| predicted protein [Mycobacterium... 167 2e-39
gi|315925049|ref|ZP_07921266.1| conserved hypothetical protein [... 145 1e-32
gi|257413192|ref|ZP_04742265.2| CRISPR-associated protein, Csm6 ... 138 2e-30
gi|224543479|ref|ZP_03684018.1| hypothetical protein CATMIT_0268... 124 2e-26
gi|296133515|ref|YP_003640762.1| CRISPR-associated protein Csm6 ... 121 3e-25
gi|331004045|ref|ZP_08327527.1| hypothetical protein HMPREF0491_... 118 2e-24
gi|292669136|ref|ZP_06602562.1| conserved hypothetical protein [... 114 2e-23
gi|253578032|ref|ZP_04855304.1| conserved hypothetical protein [... 114 3e-23
gi|291460043|ref|ZP_06599433.1| CRISPR-associated protein, Csm6 ... 112 1e-22
gi|323141543|ref|ZP_08076429.1| putative CRISPR-associated prote... 110 4e-22
gi|341822667|emb|CCC73591.1| putative uncharacterized protein [M... 106 8e-21
gi|238018272|ref|ZP_04598698.1| hypothetical protein VEIDISOL_00... 102 8e-20
gi|303231923|ref|ZP_07318631.1| CRISPR-associated protein, Csm6 ... 101 2e-19
gi|334126725|ref|ZP_08500673.1| hypothetical protein HMPREF9081_... 100 4e-19
gi|342213924|ref|ZP_08706637.1| putative CRISPR type III-A/MTUBE... 98.6 2e-18
gi|258645681|ref|ZP_05733150.1| CRISPR-associated protein, Csm6 ... 97.1 5e-18
gi|312899100|ref|ZP_07758478.1| CRISPR-associated protein, Csm6 ... 94.7 2e-17
gi|114567261|ref|YP_754415.1| hypothetical protein Swol_1746 [Sy... 88.6 1e-15
gi|229826475|ref|ZP_04452544.1| hypothetical protein GCWU000182_... 88.6 2e-15
gi|333976325|gb|EGL77194.1| CRISPR-associated protein, Csm6 fami... 85.9 1e-14
gi|322387549|ref|ZP_08061158.1| hypothetical protein HMPREF9423_... 73.2 7e-11
gi|322375481|ref|ZP_08049994.1| CRISPR-associated protein, Csm6 ... 71.2 3e-10
gi|270292491|ref|ZP_06198702.1| conserved hypothetical protein [... 70.1 5e-10
gi|315641548|ref|ZP_07896617.1| csm6 family CRISPR-associated pr... 67.0 5e-09
gi|322387548|ref|ZP_08061157.1| hypothetical protein HMPREF9423_... 64.3 3e-08
gi|57865879|ref|YP_189999.1| hypothetical protein SERP2456 [Stap... 63.2 7e-08
gi|329736405|gb|EGG72674.1| CRISPR-associated protein, Csm6 fami... 62.8 1e-07
gi|289549404|ref|YP_003470308.1| CRISPR-associated protein Csm6 ... 59.3 1e-06
gi|312278327|gb|ADQ62984.1| Putative uncharacterized protein [St... 57.4 4e-06
gi|339278119|emb|CCC19867.1| hypothetical protein STH8232_1168 [... 57.0 5e-06
gi|334308476|gb|EGL99462.1| CRISPR-associated protein Csm6 [Lact... 56.6 7e-06
gi|339278118|emb|CCC19866.1| hypothetical protein STH8232_1167 [... 56.6 8e-06
gi|227890800|ref|ZP_04008605.1| conserved hypothetical protein [... 55.8 1e-05
gi|325687525|gb|EGD29546.1| hypothetical protein HMPREF9381_1059... 55.5 1e-05
gi|55822919|ref|YP_141360.1| hypothetical protein str0965 [Strep... 50.4 5e-04
gi|55821001|ref|YP_139443.1| hypothetical protein stu0966 [Strep... 46.6 0.007
gi|301299687|ref|ZP_07205941.1| putative CRISPR-associated prote... 44.3 0.034
gi|325696574|gb|EGD38464.1| hypothetical protein HMPREF9384_1721... 42.0 0.18
>gi|15842359|ref|NP_337396.1| hypothetical protein MT2885 [Mycobacterium tuberculosis CDC1551]
gi|13882657|gb|AAK47210.1| hypothetical protein MT2885 [Mycobacterium tuberculosis CDC1551]
Length=430
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/382 (100%), Positives = 382/382 (100%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 49 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 108
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 109 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 168
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 169 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 228
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 229 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 288
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 289 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 348
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA
Sbjct 349 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 408
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 409 DLTLYDRLNDEIIRQIDMAPLG 430
>gi|323718575|gb|EGB27742.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
CDC1551A]
Length=424
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/382 (100%), Positives = 382/382 (100%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 43 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 102
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 103 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 162
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 163 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 222
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 223 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 282
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 283 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 342
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA
Sbjct 343 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 402
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 403 DLTLYDRLNDEIIRQIDMAPLG 424
>gi|15609955|ref|NP_217334.1| hypothetical protein Rv2818c [Mycobacterium tuberculosis H37Rv]
gi|148662660|ref|YP_001284183.1| hypothetical protein MRA_2842 [Mycobacterium tuberculosis H37Ra]
gi|289444367|ref|ZP_06434111.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
T46]
12 more sequence titles
Length=382
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/382 (99%), Positives = 382/382 (100%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+LFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 1 MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA
Sbjct 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 361 DLTLYDRLNDEIIRQIDMAPLG 382
>gi|148824007|ref|YP_001288761.1| hypothetical protein TBFG_12832 [Mycobacterium tuberculosis F11]
gi|167968188|ref|ZP_02550465.1| hypothetical protein MtubH3_09199 [Mycobacterium tuberculosis
H37Ra]
gi|253798097|ref|YP_003031098.1| hypothetical protein TBMG_01155 [Mycobacterium tuberculosis KZN
1435]
21 more sequence titles
Length=415
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/382 (100%), Positives = 382/382 (100%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 34 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 94 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 153
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 154 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 213
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 214 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 273
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 274 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 333
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA
Sbjct 334 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 393
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 394 DLTLYDRLNDEIIRQIDMAPLG 415
>gi|289448479|ref|ZP_06438223.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
CPHL_A]
gi|289421437|gb|EFD18638.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
CPHL_A]
Length=382
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/382 (99%), Positives = 381/382 (99%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+LFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 1 MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLK LARETGA
Sbjct 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKSLARETGA 360
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 361 DLTLYDRLNDEIIRQIDMAPLG 382
>gi|340627814|ref|YP_004746266.1| hypothetical protein MCAN_28421 [Mycobacterium canettii CIPT
140010059]
gi|340006004|emb|CCC45173.1| hypothetical protein MCAN_28421 [Mycobacterium canettii CIPT
140010059]
Length=382
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/382 (94%), Positives = 369/382 (97%), Gaps = 0/382 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+LFLSAEIAAFENADRRYSAAITRLAPETDVR V +T+PSVHRFDLFVP+FR+HL +LS+
Sbjct 1 MLFLSAEIAAFENADRRYSAAITRLAPETDVRAVIHTDPSVHRFDLFVPIFRDHLAQLSS 60
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPD TILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARA SKPGDRESPD YDLE
Sbjct 61 EFPDTTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARASSKPGDRESPDTYDLE 120
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDN+P A NRCFE TSAALGALLERANLKQLI SYDYSAAVTIAADSRLPD VS
Sbjct 121 LMWDANDDNEPAASNRCFETTSAALGALLERANLKQLIASYDYSAAVTIAADSRLPDHVS 180
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAP+FFK T FTYDPANKVAEY+SALALLAKREQWAEFAR+ATPA
Sbjct 181 NLIRGAMHRSRLEHLVAPRFFKGTVFTYDPANKVAEYVSALALLAKREQWAEFARAATPA 240
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA
Sbjct 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGA 360
Query 361 DLTLYDRLNDEIIRQIDMAPLG 382
DLTLYDRLNDEIIRQIDMAPLG
Sbjct 361 DLTLYDRLNDEIIRQIDMAPLG 382
>gi|298526287|ref|ZP_07013696.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298496081|gb|EFI31375.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=384
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/342 (99%), Positives = 341/342 (99%), Gaps = 0/342 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 14 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 73
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 74 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 133
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 134 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 193
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 194 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 253
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 254 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 313
Query 301 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKD 342
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRIT +
Sbjct 314 LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITMN 355
>gi|31793994|ref|NP_856487.1| hypothetical protein Mb2842c [Mycobacterium bovis AF2122/97]
gi|289575518|ref|ZP_06455745.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
K85]
gi|31619588|emb|CAD95027.1| HYPOTHETICAL PROTEIN [FIRST PART] [Mycobacterium bovis AF2122/97]
gi|289539949|gb|EFD44527.1| csm6 family CRISPR-associated protein [Mycobacterium tuberculosis
K85]
Length=303
Score = 615 bits (1587), Expect = 3e-174, Method: Compositional matrix adjust.
Identities = 302/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+LFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 1 MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
Query 301 LLR 303
LLR
Sbjct 301 LLR 303
>gi|121638697|ref|YP_978921.1| hypothetical protein BCG_2837c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224991189|ref|YP_002645878.1| hypothetical protein JTY_2831 [Mycobacterium bovis BCG str. Tokyo
172]
gi|121494345|emb|CAL72825.1| Hypothetical protein BCG_2837c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224774304|dbj|BAH27110.1| hypothetical protein JTY_2831 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341602735|emb|CCC65413.1| hypothetical protein BCGM_2820c [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=336
Score = 615 bits (1587), Expect = 3e-174, Method: Compositional matrix adjust.
Identities = 303/303 (100%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 34 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 94 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 153
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS
Sbjct 154 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 213
Query 181 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 240
NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA
Sbjct 214 NLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPA 273
Query 241 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 300
ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
Sbjct 274 ITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA 333
Query 301 LLR 303
LLR
Sbjct 334 LLR 336
>gi|306781001|ref|ZP_07419338.1| CRISPR-associated protein, Csm6 family [Mycobacterium tuberculosis
SUMu002]
gi|306785636|ref|ZP_07423958.1| cutinase cut1 [Mycobacterium tuberculosis SUMu003]
gi|306789676|ref|ZP_07427998.1| cutinase cut1 [Mycobacterium tuberculosis SUMu004]
9 more sequence titles
Length=242
Score = 478 bits (1230), Expect = 7e-133, Method: Compositional matrix adjust.
Identities = 236/238 (99%), Positives = 236/238 (99%), Gaps = 0/238 (0%)
Query 145 LGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDT 204
ALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDT
Sbjct 5 FSALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDT 64
Query 205 AFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGR 264
AFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGR
Sbjct 65 AFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGR 124
Query 265 VDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVR 324
VDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVR
Sbjct 125 VDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVR 184
Query 325 NTAAHEIVSISEDRITKDGGLLPEQLLKILARETGADLTLYDRLNDEIIRQIDMAPLG 382
NTAAHEIVSISEDRITKDGGLLPEQLLKILARETGADLTLYDRLNDEIIRQIDMAPLG
Sbjct 185 NTAAHEIVSISEDRITKDGGLLPEQLLKILARETGADLTLYDRLNDEIIRQIDMAPLG 242
>gi|308371143|ref|ZP_07423964.2| hypothetical protein TMCG_02062 [Mycobacterium tuberculosis SUMu003]
gi|308329699|gb|EFP18550.1| hypothetical protein TMCG_02062 [Mycobacterium tuberculosis SUMu003]
Length=206
Score = 298 bits (764), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 146/146 (100%), Positives = 146/146 (100%), Gaps = 0/146 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 49 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 108
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 109 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 168
Query 121 LMWDANDDNQPGAPNRCFEATSAALG 146
LMWDANDDNQPGAPNRCFEATSAALG
Sbjct 169 LMWDANDDNQPGAPNRCFEATSAALG 194
>gi|306781005|ref|ZP_07419342.1| hypothetical protein TMBG_02955 [Mycobacterium tuberculosis SUMu002]
gi|306789682|ref|ZP_07428004.1| hypothetical protein TMDG_00002 [Mycobacterium tuberculosis SUMu004]
gi|306794315|ref|ZP_07432617.1| hypothetical protein TMEG_03957 [Mycobacterium tuberculosis SUMu005]
11 more sequence titles
Length=191
Score = 298 bits (763), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/146 (100%), Positives = 146/146 (100%), Gaps = 0/146 (0%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA
Sbjct 34 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE
Sbjct 94 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 153
Query 121 LMWDANDDNQPGAPNRCFEATSAALG 146
LMWDANDDNQPGAPNRCFEATSAALG
Sbjct 154 LMWDANDDNQPGAPNRCFEATSAALG 179
>gi|298525991|ref|ZP_07013400.1| predicted protein [Mycobacterium tuberculosis 94_M4241A]
gi|298495785|gb|EFI31079.1| predicted protein [Mycobacterium tuberculosis 94_M4241A]
Length=110
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 79/81 (98%), Positives = 80/81 (99%), Gaps = 0/81 (0%)
Query 262 MGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFES 321
MGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFES
Sbjct 1 MGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFES 60
Query 322 RVRNTAAHEIVSISEDRITKD 342
RVRNTAAHEIVSISEDRIT +
Sbjct 61 RVRNTAAHEIVSISEDRITMN 81
>gi|315925049|ref|ZP_07921266.1| conserved hypothetical protein [Pseudoramibacter alactolyticus
ATCC 23263]
gi|315621948|gb|EFV01912.1| conserved hypothetical protein [Pseudoramibacter alactolyticus
ATCC 23263]
Length=469
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/433 (29%), Positives = 197/433 (46%), Gaps = 61/433 (14%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLA-------PETDVRIVTYTNPSVHRFDLFVPVFRN 53
+L+LSAE+ F D RY + +LA PE ++ I +V FD+F F
Sbjct 39 ILYLSAEMMEFHEKDDRYMYCLEKLARLQNRAMPEIEI-IERPELRNVQYFDIFFDEFWE 97
Query 54 HLVELSAEF-PDRTILLNTSSGTPAMQAAL-VAINVFGIPRTT-AVQVSTPARALSKPGD 110
+ +++ E D +LLN SSGTPAM++ L V + G R T +QV TP + +++
Sbjct 98 KISQITDEMAEDDELLLNVSSGTPAMKSGLEVLQTIRGFSRKTRLIQVDTPTKKMNE--- 154
Query 111 RESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIA 170
+ + +D+EL W+ + DN P NRC E +L + + +KQL+ YDY AA+ +A
Sbjct 155 -HAHEGFDVELAWEVDVDNAPDFKNRCHEFECRSLKNIQDEEIIKQLVKDYDYRAAMAVA 213
Query 171 ADSRLPDQVS-NLIRGAMHRSRLEHLVAPKFFKDTAFTYDP-----ANKVAEYISALALL 224
D PD + ++ A R L+ K K T P A K EY L +
Sbjct 214 KDMPEPDPAYLDKLKLARARQLLDFSTVTKLEKKTGMDVTPVKSGDARKSFEYALLLWIK 273
Query 225 AKREQWAEFARSATPAIT----IVLRAAVAKHLPEDRYLDDMGRVDRR------------ 268
R ++ +F R+ TP I +LR + + Y D G V R+
Sbjct 274 KDRREYVDFCRALTPLIVDLFEQILRCQCKIDINQYVYGDFPGWVRRKCEEEGWNEEERE 333
Query 269 --------------KLEREPEIRCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRV--GA 312
+L+++ +I + S A + + L L+ +F +R A
Sbjct 334 KRFRGAKICKWHKGRLKQDEKIFSVFQKAYSSGVANRNISSDHLLKLIEEFCKNRAIRDA 393
Query 313 LEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETG--------ADLTL 364
+ + E +RN +AHE+VS++ED I GL +++LK++ R G AD
Sbjct 394 AKQIREVEEAIRNDSAHEMVSVTEDVIEHRTGLTTDEILKLIKRLFGYTGYGIKEADWHS 453
Query 365 YDRLNDEIIRQID 377
Y +N EI++ ID
Sbjct 454 YQAMNQEIVQAID 466
>gi|257413192|ref|ZP_04742265.2| CRISPR-associated protein, Csm6 family [Roseburia intestinalis
L1-82]
gi|257204342|gb|EEV02627.1| CRISPR-associated protein, Csm6 family [Roseburia intestinalis
L1-82]
Length=451
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/413 (30%), Positives = 190/413 (47%), Gaps = 44/413 (10%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTN------PSVHRFDLFVPVFRNH 54
+L++S E+ F+ D RY + RLA D R + Y VH FD F FR
Sbjct 43 ILYMSKEMLDFQEKDDRYRYCLDRLAKMQD-RPMIYEIIERRELTKVHEFDYFYEDFRKV 101
Query 55 LVELSAEFPDR-TILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRES 113
+ + D T+LLN SSGTPAM++ L+ + G +QV+TP L++ +
Sbjct 102 ISHIYETMDDSDTLLLNVSSGTPAMKSGLLVLQTLGEFPAKVIQVATPVGKLNE----QV 157
Query 114 PDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADS 173
+ YD+E +W+ ++DNQ GA NRC E L + + +K+ I+ YDY AA+ + ADS
Sbjct 158 HEGYDVETLWELDEDNQEGAQNRCKEIQCPTLSKIKKEEIIKKHILVYDYQAALDV-ADS 216
Query 174 RLPD----QVSNLIRGAMHRSRLEHLVAPKFFKDTAFT-----YDPANKVAEYISALALL 224
LP Q +LI A R L+ + K + T F Y K EY + +
Sbjct 217 -LPAEQTVQYRDLIYQAARRVLLDFVNVDKTIQKTNFQCLPVRYSSQRKYFEYALTIDIR 275
Query 225 AKREQWAEFARSATPAITIVLRAAVAKH--LPEDRYLDDMGRVDR-------RKLEREPE 275
KR ++ +F RS TP + + + K + D Y D R + +KL
Sbjct 276 LKRGEYVDFIRSITPIVVDLFEMILKKQCGIIVDDYCDQYKRAGQWKRMWSAKKLNGTEV 335
Query 276 IRCALKHPPKSPN--AEWYLYTKDWLALLRQFAPD-RVGAL-EVLGRFESRVRNTAAHEI 331
+ H K +Y++ L F+ D R+ L E L ES +RN AAHEI
Sbjct 336 GKVLNSHYQKMEKRFEAKDVYSEHLKILTDHFSSDTRLKQLMEDLRNVESNIRNLAAHEI 395
Query 332 VSISEDRITKDGGLLPEQLLKILARETG-ADLTL-------YDRLNDEIIRQI 376
VS++++ I G ++ + G ++++ YD +N +I+ Q+
Sbjct 396 VSVTDETIKNLTGFYGRDIMSKIKELFGYTEISIRKGYWDSYDEMNRKILEQM 448
>gi|224543479|ref|ZP_03684018.1| hypothetical protein CATMIT_02688 [Catenibacterium mitsuokai
DSM 15897]
gi|224523606|gb|EEF92711.1| hypothetical protein CATMIT_02688 [Catenibacterium mitsuokai
DSM 15897]
Length=445
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/413 (28%), Positives = 191/413 (47%), Gaps = 42/413 (10%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAP----ETDVRIVTYTN-PSVHRFDLFVPVFRNHL 55
+L+LS E+A + Y I +LA DV + V FD F FRN L
Sbjct 39 ILYLSKEMAEKHHKYNPYGYCIEKLAELQSRHIDVEYIERNELTKVQEFDYFYKDFRNIL 98
Query 56 VELSAEF-PDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESP 114
+++ + D LLN SSGTPAM++ LV + G +QV TP L++ +E+
Sbjct 99 MDIMGDMDEDDEFLLNISSGTPAMKSGLVVLKTLGELPCRTIQVVTPTGKLNEHSHKEN- 157
Query 115 DAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIA---A 171
D E +W+ ++DN P + NRC E L + + +K+ I +YDYSAA+ +A
Sbjct 158 ---DYETLWELDEDNNPDSANRCIEVECPTLAIIKKEEIIKKHIEAYDYSAALQVAKTIK 214
Query 172 DSRLPDQVSNLIRGAMHRSRLEHLVAPKF-FKDTAFTY----DPANKVAEYISALALLAK 226
S + +LI A +R L++ A + K+ + + D K+ EY + + +
Sbjct 215 KSAMDKGYYSLIEMAKYRESLDYKKALEISSKEKVYCFPVTDDKGIKLFEYALNIDVKRR 274
Query 227 REQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRK--------LER--EPEI 276
R ++A+F RS TP + + D ++ R++++K LE+ ++
Sbjct 275 RHEYADFIRSITPLFVDLFELVLKHETGID--INKYCRIEKKKKHSMRVWDLEKLNGSDV 332
Query 277 RCALKHPPKSPNAEWYLYTKDWLALLRQFAP-DRVGALEV---LGRFESRVRNTAAHEIV 332
+L + + E +Y++ + LL F P R A ++ L E +RN AHE+V
Sbjct 333 LKSLNNYYLNGFKEGPIYSEPLVVLLNDFIPSSRKEAADLVSDLRSVEGNIRNITAHEMV 392
Query 333 SISEDRITKDGGLLPEQLLKILARE-TGADLTL-------YDRLNDEIIRQID 377
+++D I ++K + + + DL + YD +N II +ID
Sbjct 393 CVTDDVIKDKTNFSSNAIMKKIEKVFSYTDLDIKDEYWNSYDLMNQLIIERID 445
>gi|296133515|ref|YP_003640762.1| CRISPR-associated protein Csm6 [Thermincola sp. JR]
gi|296032093|gb|ADG82861.1| CRISPR-associated protein Csm6 [Thermincola potens JR]
Length=460
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 123/428 (29%), Positives = 188/428 (44%), Gaps = 70/428 (16%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPE----TDVRIVTYTNPSVHRFDLFVPVFRNHLVE 57
+F S E+ E DRR++ A+ L+ E D ++ + H FD F+ VF HL E
Sbjct 38 IFFSGEMGRREEKDRRFTRAVDLLSRELSWPIDKHLIFSGIQNPHDFDAFIGVFSKHLEE 97
Query 58 LSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTP-ARA-LSKPGDRESPD 115
+S + P+ TILLN SSGTP M + L V R VQV TP ARA LSK G PD
Sbjct 98 ISKDHPEATILLNVSSGTPQMMSMLCLETVVSSKRLVPVQVITPAARANLSKMG---GPD 154
Query 116 AYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRL 175
YD+E + N DN+P APNRC + + L R + L+ +Y+Y A I L
Sbjct 155 -YDVEWEFGNNLDNEPDAPNRCVQPDIQSFKRALARGQVTALLENYNYEGAALILGGYGL 213
Query 176 P--DQVSNLIRGAMHRSRLEHLVAPKFFKDT-AFTYDP-----ANKVAEYISALALLAKR 227
V L+R A+ L+ F++ A T P ++ EY + + LL +
Sbjct 214 GTDSTVMGLLRFAIALKNLDSDAKGSQFQEARALTGYPQMDWECLEICEYCNVVKLLQRT 273
Query 228 EQWAEFARSATPAITIVLRAAVAKH--------LPEDRYL----DDMGRVDRRKLERE-- 273
Q A+F P +T L+ K+ + E+R+ + G+ R + R+
Sbjct 274 GQLADFLLRLNPLVT-ELQTKFLKYCLGFAVEAIIEERHCAGRKNTAGKFTERLVRRDKI 332
Query 274 ----PEIRCALKHPPKSPNAEW----------------YLYTKDWLALLRQFAPDRVGAL 313
PE+ L + + N E+ + K+ + R FA + +
Sbjct 333 RALNPEL---LAYLDECHNGEYRDGSHVNIRMQNCLINFFLRKNPDSQTRSFA-GFLDTM 388
Query 314 EVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKIL---------ARETGADLTL 364
E+L +R RN AAH + + E+ I + GL Q++ L R T+
Sbjct 389 EIL----NRDRNLAAHNLYGVLEEDIKQRSGLTGGQIVDKLENLIKFIFKGRCKPEIFTI 444
Query 365 YDRLNDEI 372
+D +N+ I
Sbjct 445 FDTVNNVI 452
>gi|331004045|ref|ZP_08327527.1| hypothetical protein HMPREF0491_02389 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330411631|gb|EGG91039.1| hypothetical protein HMPREF0491_02389 [Lachnospiraceae oral taxon
107 str. F0167]
Length=462
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/419 (27%), Positives = 183/419 (44%), Gaps = 48/419 (11%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPETD-----VRIVTYTNPSVHRFDLFVPVFRNHLV 56
L+++ EI D RY I +LA + + V I +V +D F+ F +
Sbjct 38 LYMTKEIYEKHEKDDRYRFFINKLAEQKNKEIESVIIADKERDNVQEYDPFLFKFEEEIN 97
Query 57 ELSAEF-PDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPD 115
+ E D +N SSGTPAM+ ALV + +QVSTP + K +
Sbjct 98 NIINELNDDDNFFINISSGTPAMKNALVILQDLNEYNCKFIQVSTP---IKKMNEHTHGK 154
Query 116 AYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRL 175
+LELMW+ N + + NRC E+ +L L + +K+ I YDY AA+++A D
Sbjct 155 VLELELMWEMNMELEKEGNNRCVESKCPSLSRLRKEEIIKKHIDEYDYRAALSVAGDME- 213
Query 176 PDQVSNLIR---GAMHRSRLEHLVAPKFFKDTAFTYDP-----ANKVAEYISALALLAKR 227
+ N I A++R L +K F P A K+ EY L + KR
Sbjct 214 KNSTKNYIDELLSAVNRYNLNMKKVDNEYKKEGFDITPVKAGDARKLFEYALWLNIKVKR 273
Query 228 EQWAEFARSATPAIT----IVLRA----AVAKHLPEDRY---LDDMGRVDRRKLEREPEI 276
E++ +F R TP + +VL+ + K+ + Y + D G++ + ++ I
Sbjct 274 EEYIDFVRGITPIVVELFEVVLKGRGKLDINKYCTLNGYKVRVWDTGKIAKNIPGKDTNI 333
Query 277 R------CALKHPPKSPNAEW---YLYTKDWLALLRQFAPDRV--GALEVLGRFESRVRN 325
+ +KH K E+ +Y++ L L++ D V + E +VRN
Sbjct 334 KDIVNKEYKIKHSDKDKVEEFRFGMIYSEALLYLIKNLIDDEVLFDIASNIRTVEEKVRN 393
Query 326 TAAHEIVSISEDRITKDGGLLPEQLL---KILARETGADLT-----LYDRLNDEIIRQI 376
AAH+IV++ + I G P Q++ K L T + YD +N E+ R+I
Sbjct 394 LAAHDIVALDSNDIKNRTGFTPVQIMDKIKKLFNYTNFGIKPEYWDSYDDMNKELKRRI 452
>gi|292669136|ref|ZP_06602562.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
gi|292649188|gb|EFF67160.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
Length=459
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/414 (27%), Positives = 184/414 (45%), Gaps = 49/414 (11%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTN-PSVHRFDLFVPVFRNHLVELS 59
VLF + E+ E ++RY+ A+ +AP+ + +T+ R++ F + + +L
Sbjct 41 VLFFTKEMGEIERNEKRYTTAVRYVAPDCIIDPPIFTDIVDASRYEEFSQILPQTVQDLL 100
Query 60 AEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDL 119
++P+ ILLN SSGTP ++ L + R +Q TP R + P + +P+ +L
Sbjct 101 QKYPEHEILLNLSSGTPQIKTILAMLAADN-ERCIGIQTVTPERRANNP-QKITPE--EL 156
Query 120 ELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRL-PDQ 178
+ M N+DN+PGA RC E E+ + LI SY+Y+AA+T+A +SRL P
Sbjct 157 QSMLQMNEDNKPGAVRRCDEPPLKIFRYHAEKNRILALIHSYEYNAALTLARNSRLVPTD 216
Query 179 VSNLIRGAMHRSRL----EHLVAPKFFKDTAFTY-DPANKVAEYISALALLAKREQWAEF 233
L++ A R+ L + P++ F + + ++ EY + + + E+ + F
Sbjct 217 AKTLLKHAAARTMLLPDKARKILPEYNGQKLFLFKEDEERIVEYFLVMQIDQENERLSNF 276
Query 234 ARSATPAITIVLRAAVAKHLPEDR-------------YLDDMGRVDRRKLEREP-EIRCA 279
TP + L VAK++ R Y D + R+K+E+
Sbjct 277 MLRITPFLYEFLHDYVAKNVKGGRKNAQHIDNLCIKKYNTDGYILQRKKIEKNARNFLDL 336
Query 280 LKHPPKSPNAEWYLYTK-DWLALLR--------------QFAPDRVGALEVLGRFESRVR 324
L NA Y T +L L+ Q D + L LG +VR
Sbjct 337 LDQEFAGSNAHQYTNTDLSFLLLIHYCTYMQEAGLAKDAQLHSDMMDELGKLGTVR-KVR 395
Query 325 NTAAHEIVSISEDRITKDGGLLPEQLL----KILARETGADL----TLYDRLND 370
N+ AH IV+++ + KD + P L+ K+L G + T Y R+N+
Sbjct 396 NSVAHVIVNVTRESFQKDTQMTPPALMDTFAKMLTLVYGTKVKEARTTYSRINN 449
>gi|253578032|ref|ZP_04855304.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251850350|gb|EES78308.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length=438
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/406 (27%), Positives = 185/406 (46%), Gaps = 37/406 (9%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAP----ETDVRIVTYTNP-SVHRFDLFVPVFRNHLV 56
L+LS E+ D RY + L + ++ I+ ++ V ++D+F F +
Sbjct 38 LYLSKEMMENHKKDNRYVKTLELLGEFLHHKFEIHIIENSDMIDVQQYDIFYNEFHRIIA 97
Query 57 ELSAE-FPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPD 115
E+ + P+ +L+N +SGTPAM++AL+ + R +QVSTP + + E D
Sbjct 98 EIEEQKGPEDILLVNMASGTPAMKSALLVMATLSEYRFLPIQVSTPQK--KSNLEHEERD 155
Query 116 AYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR- 174
YD++ W+ N DN+ A NRC E L LL+ +K+ +++YDY AA+ + + +
Sbjct 156 EYDVDANWELNMDNEEAAENRCQEVKCLNLMRLLKIDMIKKHLLAYDYHAALAVGKEIKE 215
Query 175 -LPDQVSNLIRGAMHRSRLEHLVAPKFFKD-----TAFTYDPANKVA-EYISALALLAKR 227
L + A RS L+ + + TA + KV EY+ AL L KR
Sbjct 216 DLSPVAYQWLETADARSLLDWTRMNRVLPENNGIITAVRGENEKKVLFEYMLALDLKVKR 275
Query 228 EQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSP 287
++A+F R+ TP +L + + D+ R +R +R + +
Sbjct 276 GEYADFIRAITPLGVDLLEIVLEQSCD-----IDITRYYKRNNQRIWDKNRLVGEILDIL 330
Query 288 NAEWY------LYTKDWLALLRQFAPD--RVGALEVLGRFESRVRNTAAHEIVSISEDRI 339
N ++Y +Y+ L ++++ D V ++ L E VRN AAH IVS++ + I
Sbjct 331 NQKFYPFRYGPVYSAHLLEIIQKKCTDTLMVQRIQELVNIEQNVRNVAAHNIVSVTPEWI 390
Query 340 TKDGGLLPEQLLKILA--------RETGADLTLYDRLNDEIIRQID 377
+ G + +L IL + YD +N II ++D
Sbjct 391 KERTGKSVDDILWILKYVCEQVKINTRKENWNSYDSMNKRIINELD 436
>gi|291460043|ref|ZP_06599433.1| CRISPR-associated protein, Csm6 family [Oribacterium sp. oral
taxon 078 str. F0262]
gi|291417384|gb|EFE91103.1| CRISPR-associated protein, Csm6 family [Oribacterium sp. oral
taxon 078 str. F0262]
Length=434
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/422 (27%), Positives = 196/422 (47%), Gaps = 54/422 (12%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYT---NPS---VHRFDLFVPVFRNHL 55
L++S EI ++ AD RY+ + +L E R + Y P V F++F F+ +
Sbjct 20 LYMSREIIQYQEADERYTYCLKKLG-ELQNREIEYELIRRPELVEVQDFEIFYREFKEEI 78
Query 56 VELSAEF-PDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESP 114
++ E D ++LN SSGTPAM++ L+ I AVQV TP RA+++ ++
Sbjct 79 DKIRKEMGEDDELILNLSSGTPAMKSWLLVIRTMNELSCKAVQVVTPDRAMNEHRHKD-- 136
Query 115 DAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR 174
Y ++ +W+ + DN+ GA NRC E +L + + N+++LI YDY AA+ +AA+ +
Sbjct 137 --YQVKELWELDPDNEGGAENRCREVPCPSLSRVRQEVNIRKLIREYDYHAALELAAELK 194
Query 175 LPDQ-VSNLIRGAMHRSRLEHLVAPKFFKDTAFT--------YDPANKVAEYISALALLA 225
++ LIR A R L+ K + + EY + +
Sbjct 195 DHEKPYMKLIRVAEERELLDMDAVEKKLETNHLKGLYRLPIRKGEKRDIFEYALVMQIRL 254
Query 226 KREQWAEFARSATPAITIVLRAAVAK-HLPEDRYL---DDMGRVDRRKLERE---PEIRC 278
+R ++A+F R+ +P + + + + K + + Y+ + DR KL + EI
Sbjct 255 RRGEYADFIRAISPILYRLYKRIMKKLGICLEEYVSGTETKTVWDREKLSGDMAGKEILK 314
Query 279 ALKHPPKSPNAEWY--LYTKDWLALLRQFAPDR-----VGALEVLGRFESRVRNTAAHEI 331
L+ KS + + +Y +++ + D+ VG L E +VRN AAH+I
Sbjct 315 ILEGAYKSGDGFRFGNVYPVHMQKIIQAKSDDKELKKLVGELR---EAEEKVRNQAAHQI 371
Query 332 VSISEDRI----------TKDGGLLPEQLLKILARETGADL------TLYDRLNDEIIRQ 375
VS+++ I +D + + + K + D+ YD++N+ IIR
Sbjct 372 VSVNKKTIREWMEKEGCKDRDAEWIMDGIKKAIGYAEIIDIANKEVWNSYDQMNEVIIRL 431
Query 376 ID 377
+D
Sbjct 432 MD 433
>gi|323141543|ref|ZP_08076429.1| putative CRISPR-associated protein, Csm6 family [Phascolarctobacterium
sp. YIT 12067]
gi|322414002|gb|EFY04835.1| CRISPR-associated protein, Csm6 family [Phascolarctobacterium
sp. YIT 12067]
Length=437
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/403 (27%), Positives = 186/403 (47%), Gaps = 32/403 (7%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAE 61
+FLSAE++A E YS AI PE + +V + VP+ L EL E
Sbjct 36 IFLSAEMSAKEKNRHIYSKAIEYNVPECKFDFIYTDIVNVQLMEELVPLAEGFL-ELRKE 94
Query 62 FPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLEL 121
FP+ ILLN SSGTP M+ + + A+QV +P RA ++ + D D+ +
Sbjct 95 FPEEEILLNLSSGTPQMKTVMSFLAT-DFENVRAIQVDSPQRASNRTA-HATQDNEDINV 152
Query 122 MWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLP--DQV 179
+ + N DN P RC EA + L R L LI +Y+Y AA+T+ +++ ++
Sbjct 153 VIENNFDNVPDYTCRCHEAPLSLLRRYSIRHQLISLINNYEYRAALTMYNKNKIMFVEET 212
Query 180 SNLIRGAMHRSRLEHLVAPKF---FKDTAFTYDPANKVAEYISALALLAKREQWAEFARS 236
NL+R A RS+L L+ F K+ + + K+ E++ + L ++ + AEF
Sbjct 213 GNLLRHADLRSKL--LINEAFKGMGKENIYNNNSVKKLNEFLMVMELHQRKGELAEFIPK 270
Query 237 ATPAITIVLRAAVAKH--LPEDRYL------DDMGRVDRRKLERE-PEIRCALKHPPKSP 287
TP + +L H L +R+ + ++ KL +E P++ L + +
Sbjct 271 LTPFLYELLLYYFENHVALKLERFCYRKRNNNSSWKISAEKLRKEAPDVFTYLNYYFRQG 330
Query 288 NAEWYLYTKDWLAL---LRQFAPDRVGALEVLGRFESRVRNTAAHEIVS-ISEDRITK-D 342
+ L + L + L+ + +G L++L E RN AH I++ ++E+ + +
Sbjct 331 FRDTDLSFSNMLLILESLKSVKTELMGELQILREVEKNQRNKIAHTILTDVTEENLQAIE 390
Query 343 GGLLPEQLLK--------ILARETGADLTLYDRLNDEIIRQID 377
L Q+++ I+ E+ +YD LN I+ ++
Sbjct 391 PKLSSYQIIQHLRKAFLLIMEGESICKRNVYDDLNRRIVDSLN 433
>gi|341822667|emb|CCC73591.1| putative uncharacterized protein [Megasphaera elsdenii DSM 20460]
Length=450
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/388 (26%), Positives = 164/388 (43%), Gaps = 46/388 (11%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VL+ +AE+ E Y+ I + P V + H +D ++ H+++L
Sbjct 35 VLYFTAEMEKRERNTHMYTLGIEHVQPGCPVESLYSGIVDAHLYDAYLHDLPGHVLKLHQ 94
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
+P+ ILLN SSGTP ++ L AI +QV++P S + D D+E
Sbjct 95 IYPEAEILLNLSSGTPQIKVVL-AIMSTEYAWCRGIQVASPEHR-SNTNNIPVQDEEDVE 152
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR-LPDQV 179
M N+D++P APNRC E L E+ + L+ Y+Y A S + Q
Sbjct 153 EMLACNEDDEPDAPNRCEEPHLEILRFYREKYEIMSLVNQYEYMGAWAFCKGSHTISAQT 212
Query 180 SNLIRGAMHRSRLE----HLVAPKFFKDTAFTYD-PANKVAEYISALALLAKREQWAEFA 234
LI+ AM+RS L+ + K+ F ++ + EY+ + + +++Q+A F
Sbjct 213 KKLIQFAMYRSDLQTKAAQQIMRKYHGQALFPFEREGESLTEYLLTMQIHKEKKQYASFM 272
Query 235 RSATPAIT--IVLRAAVAKHLPEDRYLDDMGRVDRRKLERE--------PEIRCALKHPP 284
+P + V A + +P Y + + RR L R+ PE+ L H
Sbjct 273 VQISPFLYELFVTYAKMNLKIPLLNYREKVA--GRRILRRQTLLQKPQGPELIAYLDHVW 330
Query 285 KSPNAEWYLYTKDW-LALLRQ---FAPDRVGALEVLGRFE-----------------SRV 323
P Y + LL Q FA GA + E ++
Sbjct 331 PQP-----FYDSELSFILLYQVFCFAEQFDGAKDAEKHHEFMTDPLMNSANPYMDKLRKL 385
Query 324 RNTAAHEIVSISEDRITKDGGLLPEQLL 351
RN AHEI++++E+ I K GL P+ ++
Sbjct 386 RNNTAHEIINVTEETIQKRTGLTPDDIM 413
>gi|238018272|ref|ZP_04598698.1| hypothetical protein VEIDISOL_00096 [Veillonella dispar ATCC
17748]
gi|237864743|gb|EEP66033.1| hypothetical protein VEIDISOL_00096 [Veillonella dispar ATCC
17748]
Length=439
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 107/417 (26%), Positives = 185/417 (45%), Gaps = 54/417 (12%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+L LS ++ E A+ R++ A+ + + D++I+ VHR D+ P F +H E +
Sbjct 35 ILVLSKDMEQKEAANHRFTKALKHVKADLDIKIIHTGLEDVHRIDVLQP-FVDHFYETLS 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESP---DAY 117
+PD IL+N SSGTP M+ + ++V +QV +P R +R P D
Sbjct 94 TYPDAEILINLSSGTPQMKLIMSYLSVEH-DAVRGIQVDSPQRG----SNRSEPAVNDDE 148
Query 118 DLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQ----LIVSYDYSAAVTI---- 169
D+EL+ + N D+Q + NRC E ++R N+KQ LI SY Y A++
Sbjct 149 DIELVIENNFDDQEDSENRCHEPQM----GYIKRNNIKQSLHTLITSYKYKEAISSYHSY 204
Query 170 --AADSRLPDQVSNLIRGAMHRSRLEH---LVAPKFFKDTA----FTYDPANKVAEYISA 220
+S + + V L+ A R L + L + T+ FT K+ E++
Sbjct 205 KRTFESDVVNDVLPLLEHAQLRLGLNYDDALQKARKVGSTSLSSLFTDKELRKLHEFLML 264
Query 221 LALLAKREQWAEFARSATPAITIVLRAAVAKHLP------EDRYLDDMGRVDRRKLERE- 273
+ + K+ Q +F TP + ++R K L E + M R+D + +
Sbjct 265 MEVRLKQGQIEDFVLKTTPFMYELMRYYFTKELNVNWRQVEKKTSKGM-RLDMVAFKNQY 323
Query 274 PEIRCALKHPPKSPN-AEWYLYTKDWLALLRQFAPDRVGA-----LEVLGRFESRVRNTA 327
P++ + + +P E + L +L + D V + L+ + R E ++RN
Sbjct 324 PKLYESWQENSDTPYLQELQVSFYHMLHMLEDY--DTVDSSLLKHLKEIRRIERKIRNKI 381
Query 328 AHEIVSISEDRITKDGGLLPEQLLK--------ILARETGADLTLYDRLNDEIIRQI 376
AHE+V +E I + Q I+AR+ + +YD +N ++ QI
Sbjct 382 AHEVVVFTEQDICSAAEIQSLQFFLHQIKDVFFIIARQEKQNKLIYDTINKYVLDQI 438
>gi|303231923|ref|ZP_07318631.1| CRISPR-associated protein, Csm6 family [Veillonella atypica ACS-049-V-Sch6]
gi|302513352|gb|EFL55386.1| CRISPR-associated protein, Csm6 family [Veillonella atypica ACS-049-V-Sch6]
Length=439
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/417 (25%), Positives = 184/417 (45%), Gaps = 54/417 (12%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+L LS ++ E A+ R++ A+ + + D+ ++ VHR D P F +H E+ +
Sbjct 35 ILVLSKDMEKKEAANHRFTKALKHVKADLDITLIHTGLEDVHRIDTLQP-FVDHFYEMLS 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
+PD IL+N SSGTP M+ + ++V +QV +P R S + D D++
Sbjct 94 NYPDAEILINLSSGTPQMKLIMSYLSVEH-DAVRGIQVDSPQRG-SNRSEAAVHDDEDID 151
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQ----LIVSYDYSAAVTIAADSR-- 174
++ + N D+Q + NRC E ++R N+KQ LI SY Y A++ +
Sbjct 152 IVIENNFDDQEDSENRCHEPQM----GYIKRNNIKQSLHTLITSYKYKEAISAYHSYKRS 207
Query 175 LPDQVSNLI---------------RGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYIS 219
D V+N + GA+ +SR ++ + F K+ E++
Sbjct 208 FEDGVANDVLPLLEHAQLRLGLDYDGALQKSRKVGSISLS----SLFANKEVRKLHEFLM 263
Query 220 ALALLAKREQWAEFARSATPAITIVLRAAVAKHLPED-RYLDDMG----RVDRRKLERE- 273
+ + K+ Q +F TP + ++R K L + R ++ R+D E++
Sbjct 264 LMEVRLKQGQIEDFILKTTPFMYELIRYYFTKELHVNWRQIEKKTSKGIRLDMVAFEKQY 323
Query 274 PEIRCALKHPPKSPN-AEWYLYTKDWLALLRQFAPDRVGA-----LEVLGRFESRVRNTA 327
P++ + K +P E L L +L + D V + L+ + R E ++RN
Sbjct 324 PKLYKSWKANTHTPFLQELQLSFYHMLHMLEE--QDIVDSLLLKQLKEIRRIEQKIRNKM 381
Query 328 AHEIVSISEDRITKDGGLLPEQ--------LLKILARETGADLTLYDRLNDEIIRQI 376
AHE+V +E I K + Q + I+ + + +YD +N ++ QI
Sbjct 382 AHEVVVFTEQDICKAAEIQSLQSFLHQIKDVFFIITGQAKQNKLIYDVINQYVLEQI 438
>gi|334126725|ref|ZP_08500673.1| hypothetical protein HMPREF9081_0260 [Centipeda periodontii DSM
2778]
gi|333391135|gb|EGK62256.1| hypothetical protein HMPREF9081_0260 [Centipeda periodontii DSM
2778]
Length=452
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/380 (28%), Positives = 172/380 (46%), Gaps = 28/380 (7%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLF + ++A E+ D RY+ AI R A + + + H ++ F + ++ L +
Sbjct 45 VLFYTQDMAEKEHRDHRYTRAIHRTAFDCVIEEIFTDIQEAHLYESFSQILPQEVLRLRS 104
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
E ILLN SSGTP M+ L A+ + +QV+ P+R S + + DA D++
Sbjct 105 ENQGAQILLNLSSGTPQMKTVL-AMLAADMENCVGIQVAAPSRT-SNRANEATQDAEDID 162
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRL-PDQV 179
+ + N D + GA NRC E ER+ ++ LI SY+Y+AA+ IA S L P +
Sbjct 163 ALLENNFDEEEGAENRCDEPPLGIFRYYAERSRIRSLIESYEYAAALKIARRSPLVPPEA 222
Query 180 SNLIRGAMHRSRLEHLVAPKFFKD----TAFTY-DPANKVAEYISALALLAKREQWAEFA 234
S L+ A RS L A ++ F + ++ EY + + + + +
Sbjct 223 SLLLSHAEQRSMLLTEEAKAILREYRGKKLFPFIGKTEELVEYFLMMQIDQETGRLSNLM 282
Query 235 RSATPAITIVLRAAVAKHLP-------EDRYLDDMGRVDRRKL-EREPEIRCALKHP--- 283
P + LR AK+L E R + + + R +L +E E+ AL+
Sbjct 283 LRMIPFLYEFLREYTAKNLTIPIRALCEPR--NGVRCLARERLAAQEKELLAALEREFPY 340
Query 284 --PKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFES-----RVRNTAAHEIVSISE 336
P + + L A Q D EV+ ES ++RN AAHE+V+++E
Sbjct 341 GYRDQPLSFYLLSLCCAYAGKAQRVRDADLHAEVMAELESIADIRKLRNEAAHEMVNVTE 400
Query 337 DRITKDGGLLPEQLLKILAR 356
+R + G+ +++L R
Sbjct 401 ERFRQKIGMGSQEVLSCFCR 420
>gi|342213924|ref|ZP_08706637.1| putative CRISPR type III-A/MTUBE-associated protein Csm6 [Veillonella
sp. oral taxon 780 str. F0422]
gi|341596422|gb|EGS39024.1| putative CRISPR type III-A/MTUBE-associated protein Csm6 [Veillonella
sp. oral taxon 780 str. F0422]
Length=449
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 166/372 (45%), Gaps = 40/372 (10%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VLFLS E+ E +++Y+ AI+ + P+ VR++ VH+ D + + L
Sbjct 35 VLFLSKEMVVEEERNQQYTKAISYVNPQCVVRLIKTELQEVHKIDALYSLV-DEFYRLKD 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
EF + L+N +SGTP M + + + T VQV TP S + +++
Sbjct 94 EFSEAEFLINLTSGTPQMCQLMTYLAIENTD-VTGVQVDTPTER-SNRTEHALQGNEEID 151
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVS----YDYSAAVTIAADSRL- 175
+ + N DN+ G NRC E L +++R LK+ +S Y+Y A+ + ++
Sbjct 152 YVIECNFDNERGTKNRCHE----PLLQIVKRRFLKERCISLVKVYEYKQALQALKEYKML 207
Query 176 --------PDQVSNLIRGAMHRSRLEH----LVAPKFFKDTAFTYDPANKV-----AEYI 218
D + L++ +M+RS E+ PK K T P++K+ EY+
Sbjct 208 AEEEDKEYLDVIGKLLQHSMYRSAFEYDTSLTYIPKELKQTLTHSMPSSKIDIRNLIEYL 267
Query 219 SALALLAKREQWAEFARSATPAITIVLRAAVA-KHLPEDRYLDDMGR---VDRRKLERE- 273
+ ++ + +F TP + ++ + + R ++ GR VDR KL +E
Sbjct 268 YIAQIRIEKGMYQDFIVKLTPYLFQFMKCILKDTYRVNFRNIEVRGRRGMVDRLKLSKEY 327
Query 274 PEIRCALKHPPKSPNAEWYLYTKDWLALLR--QFAPDR----VGALEVLGRFESRVRNTA 327
PE+ + + N + + + +L QF D V +L++ VRN
Sbjct 328 PELYKSWERSMNQLNYDTKDFELSFFHMLNMIQFHSDTRSDLVNSLKIFTPILKNVRNIV 387
Query 328 AHEIVSISEDRI 339
AHEI +I+++ I
Sbjct 388 AHEIATITQEDI 399
>gi|258645681|ref|ZP_05733150.1| CRISPR-associated protein, Csm6 family [Dialister invisus DSM
15470]
gi|260403049|gb|EEW96596.1| CRISPR-associated protein, Csm6 family [Dialister invisus DSM
15470]
Length=454
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/290 (29%), Positives = 131/290 (46%), Gaps = 22/290 (7%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPETDVRIV--TYTNPSVH-RFDLFVPVFRNHLVEL 58
+FL+ E+ E Y+ I ++AP+ + + T P ++ R + VF E
Sbjct 37 VFLTKEMEDKEAESECYTKGIQKVAPQCKIEFIRSGITEPHIYERLTVLQDVFH----EK 92
Query 59 SAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARAL-SKPGDRESPDAY 117
++PD LLN SSGTP ++ + I + P T A+QV TP ++ SK E+P
Sbjct 93 YEQYPDEEWLLNLSSGTPQIKTVMGLIGL-DYPETKAIQVLTPGKSSNSKNHPEETPGLV 151
Query 118 DLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR--L 175
+ M D NDDN P APNRC EA + L + + L+ +Y+Y A+ + +R
Sbjct 152 E---MLDCNDDNDPAAPNRCKEAKLSLLKKHSVKWQIISLVENYEYEGALQLLRQNRHLF 208
Query 176 PDQVSNLIRGAMHRSRLEHLVAPKFFKDTAFTYDP----ANKVAEYISALALLAKREQWA 231
D L+R A+ R L A K ++ P A E+ + L +++Q
Sbjct 209 SDISEKLLRHAVCRRNLMWKDANKII--PSYNGKPLISKAGDFEEFFRVMELRQRKKQLY 266
Query 232 EFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALK 281
EF TP I L A L E R L D+ + + + ++R LK
Sbjct 267 EFIVKTTP-ICTKLATDYAISL-EQRTLFDLNACSEIRRDEDGDVRYVLK 314
>gi|312899100|ref|ZP_07758478.1| CRISPR-associated protein, Csm6 family [Megasphaera micronuciformis
F0359]
gi|310619767|gb|EFQ03349.1| CRISPR-associated protein, Csm6 family [Megasphaera micronuciformis
F0359]
Length=446
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/418 (26%), Positives = 172/418 (42%), Gaps = 55/418 (13%)
Query 2 LFLSAEIAAFENADRRYSAAITRLAPETDVRIVT--YTNPSVHRFDLFVPVFRNHLVELS 59
+FL+A++ E YS + ++AP+ ++ + T P + L++ + EL
Sbjct 36 IFLTADMEEKEEQWHCYSLGVKKVAPQCEIEFIKSGITEPQNYEKLLYL---QEKFDELF 92
Query 60 AEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDL 119
+FPD +LN +SGT MQ + ++V P TAVQVS P K + Y
Sbjct 93 EQFPDVKWILNITSGTSQMQTIMSFLSV-DYPSCTAVQVSNPHVDRDKVAVHCEKEEY-- 149
Query 120 ELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR--LPD 177
M + N+D+ P +PNRC E + + R ++ L+ +Y+Y A+ + +R D
Sbjct 150 VQMLECNEDDDPSSPNRCTEPPLLMIRRHVLRFQIESLVRNYEYGGALQLVEQNRRLFSD 209
Query 178 QVSNLIRGAMHRSRLEHLVAPKFFKD---TAFTYDPANKVAEYISALALLAKREQWAEFA 234
L+R + R+ L A K D P + +EY + L ++ Q +EF
Sbjct 210 TTERLLRHGVCRTMLNWREANKIISDYEGNILMQSPGD-FSEYFQVMELRQRKGQLSEFI 268
Query 235 RSATPAITIVLRAAVAKHLPEDRYLD--DMGRVDRRKLER------------EPEIRCAL 280
+P VL K+L + D GR R ER PE++ L
Sbjct 269 VKLSP----VLMGLGFKYLECIKGFDLLQCGRELDRNGERVFIWDCNKARKYNPELQDYL 324
Query 281 KHPPKSPNAEWYLYTKDWLALLRQFAPDRVG----------ALEVLGRFESRVRNTAAHE 330
+ LY + +AL + + A L E RN AH
Sbjct 325 DKKYSGDMKDGPLYFQTIMALCEYYKATTLKSDALHNEITTAFSKLRTVEETARNPIAHN 384
Query 331 IVSISEDRI---TKDGGLLPEQ---LLKILARETGADLT------LYDRLNDEIIRQI 376
I +++E R+ TK L P +L+IL R+ D+ YD LND I+ +
Sbjct 385 ICNMTETRLEEETKKQLLEPLNSAGILRIL-RKVYKDIYKKNMAWTYDGLNDCIVESL 441
>gi|114567261|ref|YP_754415.1| hypothetical protein Swol_1746 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
gi|114338196|gb|ABI69044.1| hypothetical protein Swol_1746 [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
Length=413
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/352 (27%), Positives = 157/352 (45%), Gaps = 38/352 (10%)
Query 30 DVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFG 89
++R NP +FD+F PVF L+++ P IL+N SSGTP M++A + +
Sbjct 34 ELRYEEIDNP--QQFDIFYPVFEKELIDIHNANPGCEILINLSSGTPQMKSACHLLALTT 91
Query 90 IPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDDNQP--GAPNRCFEATSAALGA 147
+QV+TP + + YDLE W N DN P G NR S L
Sbjct 92 PFPVIPIQVTTPNES-----ENYGSANYDLETSWKNNLDNDPELGTNNRTQLVESDNLRY 146
Query 148 LLERANLKQLIVSYDYSAAVTIAAD--SRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDTA 205
L R I +++YS+A+ I A +P+ V +L+ A HR ++ A K +
Sbjct 147 LFLREAAISNINAFNYSSALAILASVAEFVPEDVIHLLMAAQHRKNMDLREAKKRSRLAN 206
Query 206 FTYDP-----ANKVAEYISALALLAKREQWAEFARSATPAIT----IVLRAAVAKHLPED 256
+ P A ++ EY+ L L Q +F R +PA++ LR + + D
Sbjct 207 YDLFPVKSGDAQELFEYLLLLDLQQNSGQLMDFVRGISPALSRLFECFLREKCQRQVKLD 266
Query 257 -----RYLDDMGRVDRRKL-EREPEIRCA--LKHPPKSPNAEWYLYTKDWLALLR----- 303
R+ D + R KL E++P + C L+ P +++ L L ++
Sbjct 267 YCVNKRHEPDHYWLKRDKLAEKDPSLLCYYDLRFPNGFRDSD--LSCSTLLPMIEFDCRP 324
Query 304 --QFAPDRV-GALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLK 352
+F ++V + + E ++RN AAH IV++ E + + G+ L+K
Sbjct 325 GGRFPNEKVLLKAQYMRSVEEKIRNPAAHNIVAVKEKQFMQLVGISSASLVK 376
>gi|229826475|ref|ZP_04452544.1| hypothetical protein GCWU000182_01848 [Abiotrophia defectiva
ATCC 49176]
gi|229789345|gb|EEP25459.1| hypothetical protein GCWU000182_01848 [Abiotrophia defectiva
ATCC 49176]
Length=450
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 104/419 (25%), Positives = 189/419 (46%), Gaps = 51/419 (12%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAP--ETDVRIVTYTNPS---VHRFDLFVPVFRNHL 55
+L++S E+ + D RY I +L T I P V+ FD F F+ L
Sbjct 37 ILYISNEMLENQEKDDRYRYCIRQLDKFASTSTEIAVIERPDLKDVNDFDYFYKDFKEIL 96
Query 56 VELSAEF-PDRTILLNTSSGTPAMQAALVAIN-VFGIPRTTAVQVSTPARALSKPGDRES 113
+ D +L+N SSGTP M++ L + + +QVSTP + + + +
Sbjct 97 DKYVKTLNEDDELLINISSGTPQMKSGLAVLQTMLEYSNCKLIQVSTPEK---RSNEHYT 153
Query 114 PDAYDLELMWDA--NDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAA 171
++E +W+ + NRC E +L + +K+ I++YDY+AA++IA
Sbjct 154 SSDENIEELWNIYIEYNGVESFENRCKEVIFPSLSTIKMEEIIKKHILAYDYAAALSIAE 213
Query 172 DSRLPDQVS----NLIRGAMHRSRLEHLVAPKFFK-DTAFTYDPANK-----VAEYISAL 221
+ LP + + +L+R A R +L + + + P K + EY AL
Sbjct 214 E--LPKESTESYIHLLRYAKARLQLNEIDVNNIKSANNECDFLPVKKSEQRKIVEYTLAL 271
Query 222 ALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLE-REPEIRCAL 280
+ KRE++A+F R+ TP + + L A + K+ E L+ +++ + +I +
Sbjct 272 DVKRKREEYADFLRAITPLL-VELFANILKNCFEID-LNPYTEIEKGEFRWNGTKISEDI 329
Query 281 KHPPKSPNAEWYL------YTKDWLALLRQFAPDR-------VGALEVLGRFESRVRNTA 327
+ K N + Y T L + + P + +++L E +RN A
Sbjct 330 EQTLKDGNIDLYKNSFKPSVTSFHLYTIMKNLPKKNDNMRKAFEIIDILRAVEQNIRNKA 389
Query 328 AHEIVSISEDRITKDGGLLPEQLLKILARE--TGADLTL--------YDRLNDEIIRQI 376
AH++VS+++ +I K + E ++K + RE T +D+ + Y+ +ND II +I
Sbjct 390 AHQMVSVTDAKIEKITDMNAEGIMKKI-RELFTYSDINIPKQGGWNSYELMNDSIIAKI 447
>gi|333976325|gb|EGL77194.1| CRISPR-associated protein, Csm6 family [Veillonella parvula ACS-068-V-Sch12]
Length=439
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/425 (24%), Positives = 179/425 (43%), Gaps = 70/425 (16%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
+L LS ++ E +D R+S A+ + + D++++ VHR D P F +H E+ +
Sbjct 35 ILVLSKDMEQKEASDSRFSKALKHVKADLDIKLIHTGLEDVHRIDTLQP-FVDHFYEMLS 93
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
++PD IL+N SSGTP M+ + ++V +QV +P +R P D E
Sbjct 94 KYPDAEILINLSSGTPQMKLIMSYLSVEH-DAVRGIQVDSPQGG----SNRSEPAVNDDE 148
Query 121 LMWDAND---DNQPGAPNRCFEATSAALGALLERANLKQ----LIVSYDYSAAVTIA--- 170
+ + D+Q G NRC E ++R NLKQ LI SY Y A+++
Sbjct 149 DIEIIIENNLDDQEGTENRCHEPQM----GYIKRNNLKQSLHTLINSYKYKEAISLYHGY 204
Query 171 ----ADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDTA-------FTYDPANKVAEYIS 219
D + D V L+ A R L++ A + + FT K+ E++
Sbjct 205 KRTFKDGVVID-VLPLLEHAQLRLGLDYDSALQKSRKVGSINLSSIFTDKVLRKLHEFLM 263
Query 220 ALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLERE------ 273
+ + K+ Q +F TP + ++R K V+ R++E++
Sbjct 264 LMEVRLKQGQIEDFILKTTPFMYELMRYYFTKEFS----------VNWRQVEKKTSKGVR 313
Query 274 ----------PEIRCALKHPPKSPN-AEWYLYTKDWLALLRQF-APDR--VGALEVLGRF 319
P++ + + +P E + L +L + DR + L+ + R
Sbjct 314 LDMVAFKNQYPKLYESWQENSDTPYLKELQVSFYHMLHMLENYDTVDRSLLKQLKEIRRI 373
Query 320 ESRVRNTAAHEIVSISEDRITKDGGLLPEQ--------LLKILARETGADLTLYDRLNDE 371
E ++RN AHEIV +E I + Q + I+ + + +YD +N
Sbjct 374 EQKIRNKMAHEIVVFTERDICSAAEIQSLQSFLHQIKDVFFIITGQEKQNKLIYDTINTY 433
Query 372 IIRQI 376
++ QI
Sbjct 434 VLEQI 438
>gi|322387549|ref|ZP_08061158.1| hypothetical protein HMPREF9423_0556 [Streptococcus infantis
ATCC 700779]
gi|321141416|gb|EFX36912.1| hypothetical protein HMPREF9423_0556 [Streptococcus infantis
ATCC 700779]
Length=429
Score = 73.2 bits (178), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 66/268 (25%), Positives = 115/268 (43%), Gaps = 25/268 (9%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VL S E+ ++ + +I P+ + + N V+ FD V + + S
Sbjct 35 VLVYSEEMLVKKDLVEKALCSIEGYHPKVVIESIILKNDEVYLFDKMYEVMGQIIEKYSG 94
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSK---PGDRESPDAY 117
D ++LN SSGTP + +AL A+N T A+QV+TP ++ ++ P E
Sbjct 95 --TDHQLILNLSSGTPQIISALFALNRINDYNTQAIQVATPNKSANRKYVPLSNE----- 147
Query 118 DLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPD 177
D + ++D N+DNQ +R + + L + +L+ LI SYDY +AA+ +
Sbjct 148 DEQKLFDENEDNQKDYEDRTIKDEAEKFNQSLIKRHLRNLISSYDY-----LAAEELVTR 202
Query 178 QVSNLIRGAMHRSRLEHLVAP--KFFKDTAFTYD--------PANKVAEYISALALLAKR 227
+ N + +RL L+ K FK A D K Y + +L +R
Sbjct 203 KEYNKLLSKKKLARLRDLLNDFVKVFKTQAILKDIQGYSLTEVEKKALNYFLMIEVLKER 262
Query 228 EQWAEFARSATPAITIVLRAAVAKHLPE 255
Q A+ + + ++ + K P+
Sbjct 263 GQVADVLIKSKSYVEFIIEEKIKKDYPD 290
>gi|322375481|ref|ZP_08049994.1| CRISPR-associated protein, Csm6 family [Streptococcus sp. C300]
gi|321279744|gb|EFX56784.1| CRISPR-associated protein, Csm6 family [Streptococcus sp. C300]
Length=407
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/356 (24%), Positives = 145/356 (41%), Gaps = 31/356 (8%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VL S E+ + R + P+ + N V+ +D + + E S
Sbjct 13 VLLYSEEMLVKKTLIERALLSFKDYKPDVKIHEQILRNDEVYLYDKMYEIIGKIIKEYSK 72
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
++LN SSGTP +++AL AIN T A+QV+TP+ + + P S + D
Sbjct 73 --LGEELILNLSSGTPQIKSALFAINRIDDYNTQAIQVTTPSNSSNNPQKILSKEEED-- 128
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAA--VTIAADSRLPDQ 178
++ N+DNQ NRC + L + +L+ LI SYDY A + I DS+
Sbjct 129 NLFKNNEDNQDNYENRCIMDIAEKFNHSLVKRHLRSLIESYDYLAVEKIVIRRDSKGLLS 188
Query 179 VSNLIRGAMHRSRLEHLVAPKFFKDTAFTY---DPANKVAEYISALALLAKREQWAEFAR 235
L R + + L ++ + Y + K Y + +L KR Q A+
Sbjct 189 NKQLARLRIILTDLVNVFKKQEVLSEIQKYPLSEVEKKALNYFLMIEILNKRGQVADVLI 248
Query 236 SATPAITIVLRAAVAKHLPE-----------DRYLDDMGRV------DRRKLEREPEIRC 278
+ + +L + ++ P ++ D +V D +K + E E +
Sbjct 249 KSKSLVEFILEDRIKRNHPNLIIYKNKLPKLNKEHQDFEKVIGYLDSDYKKSQNENEGKK 308
Query 279 ALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSI 334
P + N +YTK + +++P+ + +L V+ + RN AH + I
Sbjct 309 EDFSPTTTLN--LIIYTK--ILEYYKYSPELIKSLRVIISLNNE-RNKVAHGLSEI 359
>gi|270292491|ref|ZP_06198702.1| conserved hypothetical protein [Streptococcus sp. M143]
gi|270278470|gb|EFA24316.1| conserved hypothetical protein [Streptococcus sp. M143]
Length=349
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 72/290 (25%), Positives = 125/290 (44%), Gaps = 29/290 (10%)
Query 67 ILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDAN 126
++LN SSGTP +++AL AIN T A+QV+TP+ + + P S + D ++ N
Sbjct 19 LILNLSSGTPQIKSALFAINRIDDYNTQAIQVTTPSNSSNNPQKILSKEEED--NLFKNN 76
Query 127 DDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAA--VTIAADSRLPDQVSNLIR 184
+DNQ NRC + L + +L+ LI SYDY A + I DS+ L R
Sbjct 77 EDNQDNYENRCIMDIAEKFNHSLVKRHLRSLIESYDYLAVEKIVIRRDSKGLLSNKQLAR 136
Query 185 GAMHRSRLEHLVAPKFFKDTAFTY---DPANKVAEYISALALLAKREQWAEFARSATPAI 241
+ + L ++ + Y + K Y + +L KR Q A+ + +
Sbjct 137 LRIILTDLVNVFKKQEVLSEIQKYPLSEVEKKALNYFLMIEILNKRGQVADVLIKSKSLV 196
Query 242 TIVLRAAVAKHLPE-----------DRYLDDMGRV------DRRKLEREPEIRCALKHPP 284
+L + ++ P ++ D +V D +K + E E + P
Sbjct 197 EFILEDRIKRNHPNLIIYKNKLPKLNKEHQDFEKVIGYLDSDYKKSQNENEGKKEDFSPT 256
Query 285 KSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSI 334
+ N +YTK + +++P+ + +L V+ + RN AH + I
Sbjct 257 TTLN--LIIYTK--ILEYYKYSPELIKSLRVIISLNNE-RNKVAHGLSEI 301
>gi|315641548|ref|ZP_07896617.1| csm6 family CRISPR-associated protein [Enterococcus italicus
DSM 15952]
gi|315482685|gb|EFU73212.1| csm6 family CRISPR-associated protein [Enterococcus italicus
DSM 15952]
Length=430
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/218 (24%), Positives = 95/218 (44%), Gaps = 15/218 (6%)
Query 44 FDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPAR 103
FD + +F +LVE ++P+ I LN +SGTP M+ L V + +QVSTP +
Sbjct 83 FDAYKDLFHQYLVEEKRKYPNAEIFLNVTSGTPQMETTLCLEYVTYPDKMRCIQVSTPLK 142
Query 104 ALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDY 163
+ D +++L + ++ + P+RC + + + R +K L+ +YDY
Sbjct 143 TSNAKTKYAQADCQEVDL--EIVNEEESQQPSRCHKIAILSFREAIVRNQIKSLLDNYDY 200
Query 164 SAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLV----APKFFKDTAFTYDPANKVAEYIS 219
AA+ + A + + G R +L+ L+ + F Y K+ + +
Sbjct 201 EAALQLVASQK------SFRNGKEIRKKLKELIDDIKMHRVFSYLIKQYPRNEKLQKALL 254
Query 220 ALALLAKREQWAEFARSATPAITI---VLRAAVAKHLP 254
LL R Q + A + +I ++ + K+ P
Sbjct 255 HTILLEMRHQRGDIAETLIRVKSIAEYIVEQYIQKNYP 292
>gi|322387548|ref|ZP_08061157.1| hypothetical protein HMPREF9423_0555 [Streptococcus infantis
ATCC 700779]
gi|321141415|gb|EFX36911.1| hypothetical protein HMPREF9423_0555 [Streptococcus infantis
ATCC 700779]
Length=386
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/391 (23%), Positives = 155/391 (40%), Gaps = 62/391 (15%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
++F I+ ++ ++ + T PE N V FD F + +
Sbjct 36 IVFSERTISKKDDIEKVIHSIDTEYLPEIVCHEPIILNEDVFVFDTMYEQFDAIIQKYYT 95
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
+ D +LN SS TP +++AL IN AVQVS+P S G D+ D++
Sbjct 96 K--DDGFILNLSSATPQVKSALFVINRLSEINVKAVQVSSPEND-SNAG-VGHDDSEDID 151
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
+ D N DN+ +R E TS L + L+ I YDY A++ +A +Q+S
Sbjct 152 ALIDTNLDNKQDYIDRTIEDTSEKFKQGLMKKTLRDFITKYDYKASLEVA------NQLS 205
Query 181 NLIRGAMHRSRLEHLV-------APKFFKDTAFTYDPANKVAEYISALALLAKREQWAEF 233
+ R +L+ +V P+ + ++ + + Y++ + L +R ++E
Sbjct 206 DFPGLKECRKKLQDIVDSLDRQAVPQVLQKKKWSEEQKKVLNSYLT-IDLQKERGNFSEG 264
Query 234 ARSATPAITIVLRAAVAKHLPE--DRYLDD-----MGRVDRRKLEREPEIRCALKHPPKS 286
+L + P D Y +D +G D K+ +E
Sbjct 265 LIRIKNLTEFILDDYIENRYPGFLDNYANDSEKYYIGIWDYGKILQEKR----------- 313
Query 287 PNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLL 346
EW L+ K +LR ++ RNT AH++ S+ + + + G +L
Sbjct 314 ---EWTLHNK-IKPILRM----------------NKTRNTIAHKLDSLDSEELKQLGPVL 353
Query 347 PEQLLKILARE----TGADLTLYDRLNDEII 373
+ LK L +E T D Y N E++
Sbjct 354 --KALKGLIKEQYQLTEKDFNFYKDFNKELL 382
>gi|57865879|ref|YP_189999.1| hypothetical protein SERP2456 [Staphylococcus epidermidis RP62A]
gi|57636537|gb|AAW53325.1| hypothetical protein SERP2456 [Staphylococcus epidermidis RP62A]
Length=422
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 39/163 (24%), Positives = 74/163 (46%), Gaps = 14/163 (8%)
Query 18 YSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPA 77
+ I ++P T+V I+ + +D+F F +L + + D I+LN +SGTP
Sbjct 57 WEKIIQTVSPNTEVEIIIENVDNAQDYDVFKEKFHKYLKIIEDSYEDCEIILNVTSGTPQ 116
Query 78 MQAALVAINVFGIPRTTAVQVSTPAR------ALSKPGDRESPDAYDLELMWDANDDNQP 131
M++ L + VQVSTP + S P D+ + E++ ++ +
Sbjct 117 MESTLCLEYIVYPENKKCVQVSTPTKDSNAGIEYSNPKDK----VEEFEIV----NEVEK 168
Query 132 GAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR 174
+ RC E + + R+ + LI +YDY A+ + ++ +
Sbjct 169 KSEKRCKEINILSFREAMIRSQILGLIDNYDYEGALNLVSNQK 211
>gi|329736405|gb|EGG72674.1| CRISPR-associated protein, Csm6 family [Staphylococcus epidermidis
VCU045]
gi|341656707|gb|EGS80416.1| CRISPR-associated protein, Csm6 family [Staphylococcus epidermidis
VCU037]
Length=422
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/163 (24%), Positives = 74/163 (46%), Gaps = 14/163 (8%)
Query 18 YSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPA 77
+ I ++P T+V I+ + +D+F F +L + + D I+LN +SGTP
Sbjct 57 WEKIIQTVSPNTEVEIIIENVDNAQDYDVFKEKFHKYLKIIEDSYEDCEIILNVTSGTPQ 116
Query 78 MQAALVAINVFGIPRTTAVQVSTPAR------ALSKPGDRESPDAYDLELMWDANDDNQP 131
M++ L + +QVSTP + S P D+ + E++ ++ +
Sbjct 117 MESTLCLEYIVYPENKKCIQVSTPTKDSNAGIEYSNPKDK----VEEFEIV----NEVEK 168
Query 132 GAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSR 174
+ RC E + + R+ + LI +YDY A+ + ++ +
Sbjct 169 KSEKRCKEINILSFREAMIRSQILGLIDNYDYEGALNLVSNQK 211
>gi|289549404|ref|YP_003470308.1| CRISPR-associated protein Csm6 [Staphylococcus lugdunensis HKU09-01]
gi|289178936|gb|ADC86181.1| CRISPR-associated protein Csm6 [Staphylococcus lugdunensis HKU09-01]
Length=225
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 73/156 (47%), Gaps = 6/156 (3%)
Query 18 YSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPA 77
+ +++++P+T V I H FD + +F + + P+ ILLN +SGTP
Sbjct 57 WEKIVSKVSPQTSVEIKVENIEHEHDFDSYKDLFSYFIKGIRMSNPESEILLNVTSGTPQ 116
Query 78 MQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDA--YDLELMWDANDDNQPGAPN 135
M++ L + +QVS P + + +P+ DLE + + N+ A N
Sbjct 117 MESTLCLEYISNPNNAQCIQVSAPQPSNNTKRLYANPNNAFKDLEKV----NQNEHLADN 172
Query 136 RCFEATSAALGALLERANLKQLIVSYDYSAAVTIAA 171
RC + ++ R+ ++ LI +YDY A+ + +
Sbjct 173 RCKSINILSFREVMVRSQVRGLIDNYDYEGALNLIS 208
>gi|312278327|gb|ADQ62984.1| Putative uncharacterized protein [Streptococcus thermophilus
ND03]
Length=391
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/143 (29%), Positives = 68/143 (48%), Gaps = 4/143 (2%)
Query 27 PETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAIN 86
PE + + ++ VH FD+ F + L E + + +LN SS TP +++AL IN
Sbjct 10 PELIIHDLIISDNEVHIFDVMFQRFSDILQEYYTK--EDEFILNLSSATPQIKSALFVIN 67
Query 87 VFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALG 146
AV+VS+P A +K ++ + D EL+ N+DN+ +R E +
Sbjct 68 RLNGINVKAVKVSSPEHASNKNIGHDNDENID-ELIK-VNEDNKVNFIDRTIEDNAEKFS 125
Query 147 ALLERANLKQLIVSYDYSAAVTI 169
L + + I +DY AA+ I
Sbjct 126 QALLKKTARDFIEKFDYKAALDI 148
>gi|339278119|emb|CCC19867.1| hypothetical protein STH8232_1168 [Streptococcus thermophilus
JIM 8232]
Length=428
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/259 (26%), Positives = 106/259 (41%), Gaps = 15/259 (5%)
Query 1 VLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSA 60
VL S E+ ++ + +I P ++ N V FD V + + +
Sbjct 35 VLVYSQEMMVKQDLINKVLLSIEGYNPIIEIDSTILNNDEVFLFDKMYEVMGQIVQKYTN 94
Query 61 EFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLE 120
+ D I+LN SSGTP + +AL A+N T A+QV+TP ++ + D
Sbjct 95 D--DNEIILNLSSGTPQIISALFALNRINDYNTQAIQVATPKNRANREYTALTESEIDAL 152
Query 121 LMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVS 180
+M N DN+ +R + S L + +L+ LI S+DY AA I +S
Sbjct 153 IM--ENQDNRLDFVDRSIKDKSEKFTQALVKRHLRSLIASFDYQAAEAIINRKEYNKLLS 210
Query 181 NLIRGAMHRSRLEHLVAPKFFKDT-------AFTYDPANKVA-EYISALALLAKREQWAE 232
+ A R +L + FK+ +F D + K A Y + +L +RE A+
Sbjct 211 KK-KIAYIREKLYDF--SRVFKNQSILSDILSFPLDDSQKKALNYYLMIDVLKEREHIAD 267
Query 233 FARSATPAITIVLRAAVAK 251
A V+ + K
Sbjct 268 VLIKAKSLAEFVIEETIKK 286
>gi|334308476|gb|EGL99462.1| CRISPR-associated protein Csm6 [Lactobacillus salivarius NIAS840]
Length=350
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/139 (29%), Positives = 64/139 (47%), Gaps = 4/139 (2%)
Query 37 TNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAV 96
++ V FD V + + S E D ++LN SSGTP M++AL IN A
Sbjct 10 SDSEVFIFDKMYEVLNGIISKYSKE--DEDLILNLSSGTPQMKSALFTINRLKDINVKAY 67
Query 97 QVSTPARALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQ 156
QV TP+ + S G + + D++ + N DN+ R E + L + +K
Sbjct 68 QVVTPSHS-SNEGIKHDNNL-DIDYLISTNLDNRDDFEKRILEDKAEKFQQTLIKRTMKD 125
Query 157 LIVSYDYSAAVTIAADSRL 175
L+ S+DY + ++ R+
Sbjct 126 LLNSFDYESLYNLSKRYRV 144
>gi|339278118|emb|CCC19866.1| hypothetical protein STH8232_1167 [Streptococcus thermophilus
JIM 8232]
Length=386
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 83/369 (23%), Positives = 146/369 (40%), Gaps = 62/369 (16%)
Query 21 AITRLAPETDVRIVTY----TNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTP 76
A+ +AP + ++ + ++ VH FD+ F + L E + + +LN SS TP
Sbjct 52 ALFSIAPNYEPELIIHDPIISDNEVHIFDVMFQRFSDILQEYYTK--EDEFILNLSSATP 109
Query 77 AMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDDNQPGAPNR 136
+++AL IN AVQVS+P A ++ ++ + D EL+ + N DN+ +R
Sbjct 110 QIKSALFVINRLNGINVKAVQVSSPEHASNENIGHDNDENID-ELI-EVNKDNKVNFIDR 167
Query 137 CFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLV 196
E + L + + I +DY AA+ I DQ+S+ R + +V
Sbjct 168 TIEDNAEKFSQALLKKTARDFIEKFDYKAALDIL------DQLSDFPNLKSVREEIRDVV 221
Query 197 -------APKFFKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAV 249
PK + + ++ Y++ + L +R +E +L +
Sbjct 222 NCLSKQDVPKGLRHKKLKEEEQKILSAYLT-IELQRERGNVSESFIRIKNLTEFILEDYI 280
Query 250 AKHLPE--DRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALL---RQ 304
K P D Y +D+ + +YL D+ LL ++
Sbjct 281 EKRYPGLIDEYCEDIQK--------------------------YYLSLFDYSKLLKATKE 314
Query 305 FAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARE----TGA 360
F R A ++ S RN AH + + D + + G + + LK L RE + +
Sbjct 315 FKLKRTIA-PIIDMNSS--RNKVAHSLSPLDSDAVKQLG--IAMKTLKTLVREQYHFSQS 369
Query 361 DLTLYDRLN 369
D Y LN
Sbjct 370 DFNFYHDLN 378
>gi|227890800|ref|ZP_04008605.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
gi|227867209|gb|EEJ74630.1| conserved hypothetical protein [Lactobacillus salivarius ATCC
11741]
Length=412
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/135 (29%), Positives = 62/135 (46%), Gaps = 4/135 (2%)
Query 37 TNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAV 96
++ V FD V + + S E D ++LN SSGTP M++AL IN A
Sbjct 71 SDSEVFIFDKMYEVLNGIISKYSKE--DEDLILNLSSGTPQMKSALFTINRLKDINVKAY 128
Query 97 QVSTPARALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQ 156
QV TP+ + S G + + D++ + N DN+ R E + L + +K
Sbjct 129 QVVTPSHS-SNEGIKHDNNL-DIDYLISTNLDNRDDFKKRILEDKAEKFQQTLIKRTMKD 186
Query 157 LIVSYDYSAAVTIAA 171
L+ S+DY + ++
Sbjct 187 LLNSFDYESLYNLST 201
>gi|325687525|gb|EGD29546.1| hypothetical protein HMPREF9381_1059 [Streptococcus sanguinis
SK72]
Length=438
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/258 (25%), Positives = 104/258 (41%), Gaps = 30/258 (11%)
Query 18 YSAAITRLAPETDVRIVT--YTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGT 75
+ AA+ + V ++ Y VH FD L E D +LN +SGT
Sbjct 90 FEAAVQAVYDGKKVYVIQNKYVKEGVHEFDTMYKFVEEILDEEDMSHGD--YILNVTSGT 147
Query 76 PAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESP------DAYDLELM-WDANDD 128
P QAA+ AIN T +V++P + ++ +P Y LE D D+
Sbjct 148 PQCQAAMYAINFVKDYHTRLARVNSPRSEKTNQSNQGAPWFETATFKYFLEKQASDYEDN 207
Query 129 NQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMH 188
Q G E LL+R K I+ Y+Y AA+ I ++ PD +S+
Sbjct 208 RQLG-----IEKGEKFKNNLLQRT-YKNFILKYEYKAALDILKEN--PDIISDKQDQENS 259
Query 189 RSRLEHLVA--------PKFFKDTAFTY---DPANKVAEYISALALLAKREQWAEFARSA 237
++ LE++++ + D+ Y D KV Y + +L +R Q + A
Sbjct 260 KNILENMISVFQKQRVLEELAADSNLKYNNTDEFQKVLNYYLMIDILNRRGQVTDVLVKA 319
Query 238 TPAITIVLRAAVAKHLPE 255
+L++ + + P+
Sbjct 320 KSFAEFILKSVIERRHPD 337
>gi|55822919|ref|YP_141360.1| hypothetical protein str0965 [Streptococcus thermophilus CNRZ1066]
gi|55738904|gb|AAV62545.1| unknown protein [Streptococcus thermophilus CNRZ1066]
Length=399
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/143 (28%), Positives = 66/143 (47%), Gaps = 4/143 (2%)
Query 27 PETDVRIVTYTNPSVHRFDLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAIN 86
PE + ++ VH FD+ F + L E + + +LN SS TP +++AL IN
Sbjct 10 PELIIHDPIISDNEVHIFDVMFQRFSDILQEYYTK--EDEFILNLSSATPQIKSALFVIN 67
Query 87 VFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALG 146
AV+V +P A ++ ++ + D EL+ N+DN+ +R E +
Sbjct 68 RLNGINIKAVKVWSPEHASNENIGHDNDENID-ELI-KVNEDNKVNFIDRTIEDNAEKFS 125
Query 147 ALLERANLKQLIVSYDYSAAVTI 169
L + + I +DY AA+ I
Sbjct 126 QALLKKTARDFIEKFDYKAALDI 148
>gi|55821001|ref|YP_139443.1| hypothetical protein stu0966 [Streptococcus thermophilus LMG
18311]
gi|55736986|gb|AAV60628.1| unknown protein, truncated [Streptococcus thermophilus LMG 18311]
Length=342
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/101 (30%), Positives = 50/101 (50%), Gaps = 2/101 (1%)
Query 69 LNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDD 128
+N SS TP +++AL IN AV+VS+P A ++ ++ + D EL+ N+D
Sbjct 1 MNLSSATPQIKSALFVINRLNGINVKAVKVSSPEHASNENIGHDNDENID-ELIK-VNED 58
Query 129 NQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTI 169
N+ +R E + L + + I +DY AA+ I
Sbjct 59 NKVNFIDRTIEDNAEKFSQALLKKTARDFIEKFDYKAALDI 99
>gi|301299687|ref|ZP_07205941.1| putative CRISPR-associated protein, Csm6 family [Lactobacillus
salivarius ACS-116-V-Col5a]
gi|300852710|gb|EFK80340.1| putative CRISPR-associated protein, Csm6 family [Lactobacillus
salivarius ACS-116-V-Col5a]
Length=311
Score = 44.3 bits (103), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 29/107 (28%), Positives = 48/107 (45%), Gaps = 2/107 (1%)
Query 69 LNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDANDD 128
+N SSGTP M++AL IN A QV TP+ + S G + + + N D
Sbjct 1 MNLSSGTPQMKSALFTINRLNDINVRAYQVITPSHS-SNEGIGHDNNL-GINYLISTNLD 58
Query 129 NQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRL 175
N+ R E + L + +K L+ ++DY + ++ R+
Sbjct 59 NRKDFKKRILEDKAEKFQKTLIKRTMKDLLNNFDYESLYNLSIRHRV 105
>gi|325696574|gb|EGD38464.1| hypothetical protein HMPREF9384_1721 [Streptococcus sanguinis
SK160]
Length=438
Score = 42.0 bits (97), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 48/204 (24%), Positives = 84/204 (42%), Gaps = 30/204 (14%)
Query 68 LLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWD--- 124
+LN +SGT QAA+ IN T +V +P + ++ +P Y E++ D
Sbjct 140 ILNITSGTAQCQAAMYFINFIKDYHTRLARVDSPNGKKTNRSNQGAP--YFEEVVLDDLL 197
Query 125 ------ANDDNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQ 178
D+ +P E LL+R K I++Y+Y AA+ I + PD
Sbjct 198 KKQTAECRDERKPE-----IETGEKLKNNLLQRT-YKDFILNYEYKAALDILKAN--PDI 249
Query 179 VSNLIRGAMHRSRLEHLVA----PKFFK----DTAFTYDPA---NKVAEYISALALLAKR 227
+SN + LE++++ K K D+ Y+ KV Y + +L +R
Sbjct 250 ISNKDDQEKSKKALENMISVFQKQKVLKELAADSKLKYNDTGEFQKVLNYYLMIDILNRR 309
Query 228 EQWAEFARSATPAITIVLRAAVAK 251
Q + A +L++ + +
Sbjct 310 GQVTDVLVKAKSFAEFILKSVIER 333
Lambda K H
0.320 0.134 0.388
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 740471427550
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
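
Note: the bit scores and Expect values reported above can be roughly cross-checked from the gapped statistics in this footer. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-bits). The function names are hypothetical and not part of BLAST. Because this search used compositional matrix adjustment, which rescales scores per subject sequence, the reported values may differ slightly from what this simple calculation gives.

```python
import math

# Values copied from the "Gapped" statistics and the
# "Effective search space used" line of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 740471427550


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_from_bits(bits, search_space=SEARCH_SPACE):
    """Approximate the Expect value from a bit score (hypothetical helper)."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the numbers in parentheses on the "Score =" lines.
    for raw in (152, 133, 97):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect_from_bits(bits):.1e}")
```

For the hit with raw score 152, this should land close to the reported 63.2 bits and an Expect near 7e-08. Small deviations for other hits are expected given the rounding of the printed bit scores and the per-subject score adjustment.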