BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1514c
Length=262
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608652|ref|NP_216030.1| hypothetical protein Rv1514c [Mycob... 533 8e-150
gi|167969326|ref|ZP_02551603.1| hypothetical protein MtubH3_1539... 518 3e-145
gi|3550472|emb|CAA11570.1| gsa [Mycobacterium avium subsp. silva... 332 5e-89
gi|336457964|gb|EGO36952.1| glycosyl transferase [Mycobacterium ... 331 8e-89
gi|254775707|ref|ZP_05217223.1| hypothetical protein MaviaA2_137... 321 6e-86
gi|5524325|gb|AAD44218.1|AF143772_24 GtfC [Mycobacterium avium] 317 8e-85
gi|8250407|emb|CAB93583.1| Gsa protein [Mycobacterium avium] 186 3e-45
gi|229492287|ref|ZP_04386094.1| glycosyltransferase GtfC [Rhodoc... 173 2e-41
gi|86132903|ref|ZP_01051494.1| glycosyl transferase family 2 [Do... 124 1e-26
gi|226334855|ref|YP_002784527.1| putative glycosyltransferase [R... 124 2e-26
gi|224536713|ref|ZP_03677252.1| hypothetical protein BACCELL_015... 119 5e-25
gi|86160715|ref|YP_467500.1| glycosyl transferase family protein... 119 5e-25
gi|255016223|ref|ZP_05288349.1| glycosyl transferase family prot... 117 2e-24
gi|150009370|ref|YP_001304113.1| glycosyl transferase family pro... 116 3e-24
gi|262384715|ref|ZP_06077848.1| glycosyltransferase family 2 [Ba... 116 4e-24
gi|301312280|ref|ZP_07218197.1| putative glycosyltransferase [Ba... 114 1e-23
gi|224024984|ref|ZP_03643350.1| hypothetical protein BACCOPRO_01... 114 2e-23
gi|298377844|ref|ZP_06987794.1| glycosyltransferase [Bacteroides... 114 2e-23
gi|212691360|ref|ZP_03299488.1| hypothetical protein BACDOR_0085... 113 3e-23
gi|237726342|ref|ZP_04556823.1| glycosyltransferase family 2 pro... 112 4e-23
gi|120434921|ref|YP_860607.1| colanic acid biosynthesis glycosyl... 112 4e-23
gi|197124816|ref|YP_002136767.1| family 2 glycosyl transferase [... 112 7e-23
gi|150006448|ref|YP_001301192.1| glycosyl transferase family pro... 111 1e-22
gi|218259478|ref|ZP_03475201.1| hypothetical protein PRABACTJOHN... 107 2e-21
gi|333378622|ref|ZP_08470352.1| hypothetical protein HMPREF9456_... 107 2e-21
gi|340348308|ref|ZP_08671398.1| glycosyl transferase [Prevotella... 106 3e-21
gi|212691303|ref|ZP_03299431.1| hypothetical protein BACDOR_0079... 106 3e-21
gi|295132440|ref|YP_003583116.1| glycosyl transferase [Zunongwan... 106 3e-21
gi|149277635|ref|ZP_01883776.1| putative glycosyltransferase [Pe... 105 6e-21
gi|154490349|ref|ZP_02030610.1| hypothetical protein PARMER_0058... 105 7e-21
gi|336477564|ref|YP_004616705.1| family 2 glycosyl transferase [... 105 8e-21
gi|146300270|ref|YP_001194861.1| glycosyl transferase family pro... 104 1e-20
gi|329962365|ref|ZP_08300370.1| glycosyltransferase, group 2 fam... 104 1e-20
gi|336412731|ref|ZP_08593084.1| hypothetical protein HMPREF1017_... 104 2e-20
gi|145225343|ref|YP_001136021.1| glycosyl transferase family pro... 104 2e-20
gi|336415876|ref|ZP_08596214.1| hypothetical protein HMPREF1017_... 103 2e-20
gi|332292692|ref|YP_004431301.1| glycosyl transferase family 2 [... 103 2e-20
gi|299149042|ref|ZP_07042104.1| glycosyl transferase, group 2 fa... 103 2e-20
gi|237725561|ref|ZP_04556042.1| glycosyltransferase family 2 pro... 103 3e-20
gi|330996431|ref|ZP_08320314.1| glycosyltransferase, group 2 fam... 102 4e-20
gi|237755814|ref|ZP_04584414.1| glycosyltransferase family 2 [Su... 102 4e-20
gi|336404145|ref|ZP_08584843.1| hypothetical protein HMPREF0127_... 102 5e-20
gi|298479932|ref|ZP_06998131.1| glycosyl transferase, group 2 fa... 102 5e-20
gi|329963019|ref|ZP_08300799.1| glycosyltransferase, group 2 fam... 102 5e-20
gi|337754896|ref|YP_004647407.1| Colanic acid biosynthesis glyco... 102 8e-20
gi|153806959|ref|ZP_01959627.1| hypothetical protein BACCAC_0123... 101 1e-19
gi|220918345|ref|YP_002493649.1| family 2 glycosyl transferase [... 101 1e-19
gi|255536067|ref|YP_003096438.1| hypothetical protein FIC_01934 ... 101 1e-19
gi|343085763|ref|YP_004775058.1| family 2 glycosyl transferase [... 100 2e-19
gi|332881317|ref|ZP_08448967.1| glycosyltransferase, group 2 fam... 100 2e-19
>gi|15608652|ref|NP_216030.1| hypothetical protein Rv1514c [Mycobacterium tuberculosis H37Rv]
gi|15840980|ref|NP_336017.1| hypothetical protein MT1564 [Mycobacterium tuberculosis CDC1551]
gi|148661310|ref|YP_001282833.1| hypothetical protein MRA_1526 [Mycobacterium tuberculosis H37Ra]
73 more sequence titles
Length=262
Score = 533 bits (1374), Expect = 8e-150, Method: Compositional matrix adjust.
Identities = 261/262 (99%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+TSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF
Sbjct 1 MTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG
Sbjct 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
Query 121 FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF
Sbjct 121 FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
Query 181 ILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYL 240
ILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYL
Sbjct 181 ILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYL 240
Query 241 RGREFYAYNSRFWENVFTRMSK 262
RGREFYAYNSRFWENVFTRMSK
Sbjct 241 RGREFYAYNSRFWENVFTRMSK 262
>gi|167969326|ref|ZP_02551603.1| hypothetical protein MtubH3_15390 [Mycobacterium tuberculosis
H37Ra]
Length=254
Score = 518 bits (1334), Expect = 3e-145, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 9 VITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEPD 68
+ITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEPD
Sbjct 1 MITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEPD 60
Query 69 GGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVG 128
GGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVG
Sbjct 61 GGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVG 120
Query 129 LDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAALVC 188
LDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAALVC
Sbjct 121 LDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAALVC 180
Query 189 EPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYLRGREFYAY 248
EPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYLRGREFYAY
Sbjct 181 EPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYLRGREFYAY 240
Query 249 NSRFWENVFTRMSK 262
NSRFWENVFTRMSK
Sbjct 241 NSRFWENVFTRMSK 254
>gi|3550472|emb|CAA11570.1| gsa [Mycobacterium avium subsp. silvaticum]
gi|4416472|gb|AAD20372.1| glycosyltransferase gtfC [Mycobacterium avium]
Length=277
Score = 332 bits (850), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 176/275 (64%), Positives = 207/275 (76%), Gaps = 15/275 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
++SAPTVSVITIS NDL+GL+ TV+SVRAQRY GRIEHIVIDGGSGD VV YLSG +PGF
Sbjct 1 MSSAPTVSVITISLNDLEGLKSTVESVRAQRYGGRIEHIVIDGGSGDAVVEYLSG-DPGF 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
AYWQS+PD GRYDAMNQGIAH+SGDLLWF+HS DRFS PD VA VEALSG GPV +LWG
Sbjct 60 AYWQSQPDNGRYDAMNQGIAHSSGDLLWFMHSTDRFSDPDAVASVVEALSGHGPVRDLWG 119
Query 121 FGMDRLVGLD------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGI 174
+G + LVGLD R G +PF +RKFL G V HQA+FFG+SLVAK+GGYDLDFG+
Sbjct 120 YGKNNLVGLDGKPLFPRPYGYMPFKMRKFLLGATVA-HQATFFGASLVAKLGGYDLDFGL 178
Query 175 AADQEFILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRR 234
ADQ FI RAAL+ PVTI V+C+FD TG GS + + LRR+ DLH YP GGRR
Sbjct 179 EADQLFIYRAALIRPPVTIDRVVCDFDVTGPGSTQPIREHYRTLRRLWDLHGDYPLGGRR 238
Query 235 ISHAYLRGREFY------AYNS-RFWENVFTRMSK 262
+S AYLR +E+ A+N+ +F F R S+
Sbjct 239 VSWAYLRVKEYLIRADLAAFNAVKFLRAKFARASR 273
>gi|336457964|gb|EGO36952.1| glycosyl transferase [Mycobacterium avium subsp. paratuberculosis
S397]
Length=284
Score = 331 bits (848), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 175/275 (64%), Positives = 206/275 (75%), Gaps = 15/275 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
++S PTVSVITIS NDL+GL+ TV+SVRAQRY GRIEHIVIDGGSGD VV YLSG +PGF
Sbjct 1 MSSTPTVSVITISLNDLEGLKSTVESVRAQRYGGRIEHIVIDGGSGDAVVEYLSG-DPGF 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
AYWQS+PD GRYDAMNQGIAH+SGDLLWF+HS DRFS PD VA VEALSG GPV +LWG
Sbjct 60 AYWQSQPDNGRYDAMNQGIAHSSGDLLWFMHSTDRFSDPDAVASVVEALSGHGPVRDLWG 119
Query 121 FGMDRLVGLD------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGI 174
+G + LVGLD R G +PF +RKFL G V HQA+FFG+SLVAK+GGYDLDFG+
Sbjct 120 YGKNNLVGLDGKPLFPRPYGYMPFKMRKFLLGATVA-HQATFFGASLVAKLGGYDLDFGL 178
Query 175 AADQEFILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRR 234
ADQ FI RAAL+ PVTI V+C+FD TG GS + + LRR+ DLH YP GGRR
Sbjct 179 EADQLFIYRAALIRPPVTIDRVVCDFDVTGPGSTQPIREHYRTLRRLWDLHGDYPLGGRR 238
Query 235 ISHAYLRGREFY------AYNS-RFWENVFTRMSK 262
+S AYLR +E+ A+N+ +F F R S+
Sbjct 239 VSWAYLRVKEYLIRADLAAFNAVKFLRAKFARASR 273
>gi|254775707|ref|ZP_05217223.1| hypothetical protein MaviaA2_13715 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=271
Score = 321 bits (823), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 171/269 (64%), Positives = 201/269 (75%), Gaps = 15/269 (5%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+SVITIS NDL+GL+ TV+SVRAQRY GRIEHIVIDGGSGD VV YLSG +PGFAYWQS+
Sbjct 1 MSVITISLNDLEGLKSTVESVRAQRYGGRIEHIVIDGGSGDAVVEYLSG-DPGFAYWQSQ 59
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRL 126
PD GRYDAMNQGIAH+SGDLLWF+HS DRFS PD VA VEALSG GPV +LWG+G + L
Sbjct 60 PDNGRYDAMNQGIAHSSGDLLWFMHSTDRFSDPDAVASVVEALSGHGPVRDLWGYGKNNL 119
Query 127 VGLD------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
VGLD R G +PF +RKFL G V HQA+FFG+SLVAK+GGYDLDFG+ ADQ F
Sbjct 120 VGLDGKPLFPRPYGYMPFKMRKFLLGATVA-HQATFFGASLVAKLGGYDLDFGLEADQLF 178
Query 181 ILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYL 240
I RAAL+ PVTI V+C+FD TG GS + + LRR+ DLH YP GGRR+S AYL
Sbjct 179 IYRAALIRPPVTIDRVVCDFDVTGPGSTQPIREHYRTLRRLWDLHGDYPLGGRRVSWAYL 238
Query 241 RGREFY------AYNS-RFWENVFTRMSK 262
R +E+ A+N+ +F F R S+
Sbjct 239 RVKEYLIRADLAAFNAVKFLRAKFARASR 267
>gi|5524325|gb|AAD44218.1|AF143772_24 GtfC [Mycobacterium avium]
Length=269
Score = 317 bits (813), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 169/267 (64%), Positives = 199/267 (75%), Gaps = 15/267 (5%)
Query 9 VITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEPD 68
+ITIS NDL+GL+ TV+SVRAQRY GRIEHIVIDGGSGD VV YLSG +PGFAYWQS+PD
Sbjct 1 MITISLNDLEGLKSTVESVRAQRYGGRIEHIVIDGGSGDAVVEYLSG-DPGFAYWQSQPD 59
Query 69 GGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVG 128
GRYDAMNQGIAH+SGDLLWF+HS DRFS PD VA VEALSG GPV +LWG+G + LVG
Sbjct 60 NGRYDAMNQGIAHSSGDLLWFMHSTDRFSDPDAVASVVEALSGHGPVRDLWGYGKNNLVG 119
Query 129 LD------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFIL 182
LD R G +PF +RKFL G V HQA+FFG+SLVAK+GGYDLDFG+ ADQ FI
Sbjct 120 LDGKPLFPRPYGYMPFKMRKFLLGATVA-HQATFFGASLVAKLGGYDLDFGLEADQLFIY 178
Query 183 RAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYLRG 242
RAAL+ PVTI V+C+FD TG GS + + LRR+ DLH YP GGRR+S AYLR
Sbjct 179 RAALIRPPVTIDRVVCDFDVTGPGSTQPIREHYRTLRRLWDLHGDYPLGGRRVSWAYLRV 238
Query 243 REFY------AYNS-RFWENVFTRMSK 262
+E+ A+N+ +F F R S+
Sbjct 239 KEYLIRADLAAFNAVKFLRAKFARASR 265
>gi|8250407|emb|CAB93583.1| Gsa protein [Mycobacterium avium]
Length=377
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/191 (55%), Positives = 127/191 (67%), Gaps = 14/191 (7%)
Query 85 DLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVGLD------RVRGPIPF 138
D WF+HS DRFS PD VA VEALSG GPV +LWG+G + LVGLD R G +PF
Sbjct 184 DWPWFMHSTDRFSDPDAVASVVEALSGHGPVRDLWGYGKNNLVGLDGKPLFPRPYGYMPF 243
Query 139 SLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAALVCEPVTIRCVLC 198
+RKFL G V HQA+FFG+SLVAK+GGYDLDFG+ ADQ FI RAAL+ VTI V+C
Sbjct 244 KMRKFLLGATVA-HQATFFGASLVAKLGGYDLDFGLEADQLFIYRAALIRPSVTIDRVVC 302
Query 199 EFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYLRGREFY------AYNS-R 251
+FD TG GS + + LRR+ DLH YP GGRR+S AYLR +E+ A+N+ +
Sbjct 303 DFDVTGPGSTQPIREHYRTLRRLWDLHGDYPLGGRRVSWAYLRVKEYLIRADLAAFNAVK 362
Query 252 FWENVFTRMSK 262
F F R S+
Sbjct 363 FLRAKFARASR 373
>gi|229492287|ref|ZP_04386094.1| glycosyltransferase GtfC [Rhodococcus erythropolis SK121]
gi|229320696|gb|EEN86510.1| glycosyltransferase GtfC [Rhodococcus erythropolis SK121]
Length=263
Score = 173 bits (439), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/260 (41%), Positives = 143/260 (55%), Gaps = 12/260 (4%)
Query 4 APTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYW 63
+V+VITI+ NDL GL RT++SVRAQ +G +EHIVIDGGS D V YL W
Sbjct 2 TSSVTVITITRNDLSGLVRTIESVRAQSAQG-VEHIVIDGGSTDGTVEYLDSLGDSLV-W 59
Query 64 QSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGM 123
+ D GRYDAMN G ASG+LLWF+HS+D F DV+ + G G VS WG+G+
Sbjct 60 NTGLDSGRYDAMNIGAELASGELLWFMHSSDVFHSCDVLERVSRDHQG-GGVSWRWGYGL 118
Query 124 DRL------VGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAAD 177
+ +G+ G IPFSL +F G ++PHQA SL +IGGYD +FG++AD
Sbjct 119 SNIHDSGSSIGIG---GRIPFSLSRFALGGNIIPHQACVIEKSLHVEIGGYDTEFGLSAD 175
Query 178 QEFILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISH 237
Q ++L+ A+ +P C+FD+ G GS R + D+RR L F + + +
Sbjct 176 QLYLLKMAVRSQPRVWTEFFCDFDSAGAGSVRGMMDHYRDMRRSRALTGITVFRAQWLDY 235
Query 238 AYLRGREFYAYNSRFWENVF 257
A G A R V
Sbjct 236 AVSLGVALLAKTDRVQRRVL 255
>gi|86132903|ref|ZP_01051494.1| glycosyl transferase family 2 [Dokdonia donghaensis MED134]
gi|85816609|gb|EAQ37796.1| glycosyl transferase family 2 [Dokdonia donghaensis MED134]
Length=259
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/184 (38%), Positives = 103/184 (56%), Gaps = 7/184 (3%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P +S+IT+++NDL+GL++TV+SV Q + EHI+IDGGS D V Y+ + F+YW
Sbjct 2 PKLSIITVNYNDLNGLKKTVQSVHTQTSQ-EFEHIIIDGGSTDASVEYIKKHQDSFSYWV 60
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SE D G Y+AMN+GI ASG+ L FL++ D F + +A+ LSG+ + F +
Sbjct 61 SELDKGVYNAMNKGIEQASGEYLLFLNAGDHFINEQSLQRALTHLSGEHIIY----FNLH 116
Query 125 RLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRA 184
+ G P L + +PH A+F + AK G Y DF I +D +F + A
Sbjct 117 VVEGSGDFTKTYPSRLSFSYFVEDTLPHPATFIKKTAFAKAGTYREDFKIVSDWKFFIDA 176
Query 185 ALVC 188
+C
Sbjct 177 --IC 178
>gi|226334855|ref|YP_002784527.1| putative glycosyltransferase [Rhodococcus opacus B4]
gi|226246075|dbj|BAH56175.1| putative glycosyltransferase [Rhodococcus opacus B4]
Length=264
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/210 (36%), Positives = 114/210 (55%), Gaps = 10/210 (4%)
Query 2 TSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEP--G 59
T++P +SVIT+++ND DGL T +S+ QR EH+++DGGS D ++
Sbjct 4 TTSPVLSVITVNYNDHDGLVETTESLTNQR-DCTYEHVIVDGGSSDGSAEFIDNYRQRDS 62
Query 60 FAYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELW 119
SEPD G +DAMN+GI ASGD++ F+++ DRF+ + ++ G W
Sbjct 63 RVVAVSEPDDGIFDAMNKGIRMASGDVVVFMNAGDRFTDEETLSFVAADYQAVGWD---W 119
Query 120 GFGMDRLVGLD-RVRGPI---PFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIA 175
+G R + + V G PF R+F G + VPHQA++ + +GG+D+DFG+A
Sbjct 120 AYGCLRYMSEEGDVFGATVQAPFLQRRFEMGIRFVPHQATYLSLEFIRGLGGFDVDFGVA 179
Query 176 ADQEFILRAALVCEPVTIRCVLCEFDTTGV 205
DQE +RAAL P + +F GV
Sbjct 180 CDQELAMRAALKSNPRVWVRFMSDFRVGGV 209
>gi|224536713|ref|ZP_03677252.1| hypothetical protein BACCELL_01589 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521629|gb|EEF90734.1| hypothetical protein BACCELL_01589 [Bacteroides cellulosilyticus
DSM 14838]
Length=271
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 70/212 (34%), Positives = 115/212 (55%), Gaps = 16/212 (7%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGD---DVVAYLSGCEPGFA 61
PT+S+IT++FN+ +GL+RT++S+++Q Y EHI+ID S D +++ S
Sbjct 4 PTLSIITVNFNNRNGLERTIQSIKSQTYTDY-EHIIIDANSTDGSKEIILEYSQNTAHLT 62
Query 62 YWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGF 121
YW SEPD G YD MN+GI HA G+ ++ L+S D S P V++ ++S G +
Sbjct 63 YWVSEPDKGIYDGMNKGIEHAQGEYIYLLNSGDILSNPTVLS----SISFDGTK---YIC 115
Query 122 GMDRLVGLDRVRGPI--PFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQE 179
G R++ + + P + K +PH ASF SL K Y++D+ I +D
Sbjct 116 GNMRIIHSETQYEEVVPPNEIDGLFLFKSYLPHPASFIHHSLFNK-QKYNIDYKIISDWI 174
Query 180 FILRAALV--CEPVTIRCVLCEFDTTGVGSHR 209
++ ++ C I ++ +FDTTG+ S+
Sbjct 175 HMVENIVLKGCSYKHINLLISDFDTTGISSNN 206
>gi|86160715|ref|YP_467500.1| glycosyl transferase family protein [Anaeromyxobacter dehalogenans
2CP-C]
gi|85777226|gb|ABC84063.1| glycosyl transferase, family 2 [Anaeromyxobacter dehalogenans
2CP-C]
Length=283
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/197 (41%), Positives = 111/197 (57%), Gaps = 16/197 (8%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P++SV+TI ND GL+ T+ SV Q YR +E +V+DGGS D + + EP A W
Sbjct 3 PSLSVVTICLNDRAGLEATLDSVIGQTYRD-LEVVVVDGGSTDGSLDAIRAREPRIARWV 61
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVS-----ELW 119
SEPDGG Y+ +G A A G+ L FL++ DR + PD A+E L PV ++W
Sbjct 62 SEPDGGIYEGQGKGAALAKGEWLLFLNAGDRLAAPD----ALERLMTPRPVEDVVYGDVW 117
Query 120 GFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQE 179
M+R L R P S+ + + ++PHQA+ F +L ++GGYDL F IAAD +
Sbjct 118 ---MERGGRLRPWRAPDHPSVLHLM--RSMLPHQATAFRRALFLRLGGYDLSFRIAADYD 172
Query 180 FILRAALVCEPVTIRCV 196
+LR ALV T R V
Sbjct 173 LLLR-ALVEARATTRHV 188
>gi|255016223|ref|ZP_05288349.1| glycosyl transferase family protein [Bacteroides sp. 2_1_7]
gi|256842375|ref|ZP_05547878.1| glycosyltransferase, family 2 [Parabacteroides sp. D13]
gi|256735982|gb|EEU49313.1| glycosyltransferase, family 2 [Parabacteroides sp. D13]
Length=253
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/215 (34%), Positives = 115/215 (54%), Gaps = 14/215 (6%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+ PT S+ITI++N L+RT+ SV +Q Y IE+I+IDG S D V + E G
Sbjct 1 MNQQPTFSIITITYNAERWLERTILSVLSQSYPN-IEYILIDGASKDKTVEIIKQYESGI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
A W SEPD G YDAMN+G+ A+GD +WFL++ D D V + V +L K + ++
Sbjct 60 ASWISEPDKGLYDAMNKGLKRATGDYVWFLNAGDTLYTADTVQRIVASLKKKVSLPDV-I 118
Query 121 FGMDRLV-------GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+G R+V G+ R++ P + + F G +V HQ SF +A + Y+ ++
Sbjct 119 YGETRIVDAEGRSLGMRRLKAPEKLTWKSFRMG-MLVCHQ-SFISKREIAPL--YNTEYR 174
Query 174 IAADQEFILRAALVCEPV-TIRCVLCEFDTTGVGS 207
+ AD ++ ++ + + R +L F G+ S
Sbjct 175 LTADYDWCIQCLRRSKRIHNTRLILSNFLEEGLSS 209
>gi|150009370|ref|YP_001304113.1| glycosyl transferase family protein [Parabacteroides distasonis
ATCC 8503]
gi|149937794|gb|ABR44491.1| glycosyltransferase family 2 [Parabacteroides distasonis ATCC
8503]
Length=253
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/222 (34%), Positives = 118/222 (54%), Gaps = 14/222 (6%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+ PT S+ITI++N L+RT+ SV +Q Y IE+I+IDG S D V + E G
Sbjct 1 MNQQPTFSIITITYNAERWLERTILSVLSQSYPN-IEYILIDGASKDKTVEIIKQYESGI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
A W SEPD G YDAMN+G+ A+GD +WFL++ D D V + V +L K + ++
Sbjct 60 ASWISEPDKGLYDAMNKGLRRATGDYVWFLNAGDTLYTADTVQRIVASLKKKVSLPDV-I 118
Query 121 FGMDRLV-------GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+G R+V G+ R++ P S + F G +V HQ SF +A + Y+ D+
Sbjct 119 YGETRIVDAAGKSLGMRRLKAPERLSWKGFRMG-MLVCHQ-SFISKREIAPL--YNTDYR 174
Query 174 IAADQEFILRAALVCEPV-TIRCVLCEFDTTGVGSHREPSAV 214
+ AD ++ ++ + V VL +F G+ S +++
Sbjct 175 LTADYDWCIQCLKRAKRVHNTHLVLSDFLEEGLSSANRKASL 216
>gi|262384715|ref|ZP_06077848.1| glycosyltransferase family 2 [Bacteroides sp. 2_1_33B]
gi|262293696|gb|EEY81631.1| glycosyltransferase family 2 [Bacteroides sp. 2_1_33B]
Length=253
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 72/215 (34%), Positives = 114/215 (54%), Gaps = 14/215 (6%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+ PT S+ITI++N L+RT+ SV +Q Y IE+I+IDG S D V + E G
Sbjct 1 MNQQPTFSIITITYNAERWLERTILSVLSQSYPN-IEYILIDGASKDKTVEIIKQYESGI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
A W SEPD G YDAMN+G+ A+GD +WFL++ D D V + V +L K + ++
Sbjct 60 ASWISEPDKGLYDAMNKGLKRATGDYVWFLNAGDTLYTADTVQRIVASLKKKVSLPDV-I 118
Query 121 FGMDRLV-------GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+G R+V G+ R++ P + + F G +V HQ SF +A + Y+ ++
Sbjct 119 YGETRIVDAEGRSLGMRRLKAPEKLTWKSFRMG-MLVCHQ-SFISKREIAPL--YNTEYR 174
Query 174 IAADQEFILRAALVCEPV-TIRCVLCEFDTTGVGS 207
+ AD ++ ++ + + R L F G+ S
Sbjct 175 LTADYDWCIQCLRRSKRIHNTRLTLSNFLEEGLSS 209
>gi|301312280|ref|ZP_07218197.1| putative glycosyltransferase [Bacteroides sp. 20_3]
gi|300829702|gb|EFK60355.1| putative glycosyltransferase [Bacteroides sp. 20_3]
Length=253
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/215 (34%), Positives = 114/215 (54%), Gaps = 14/215 (6%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+ PT S+ITI++N L+RT+ SV +Q Y IE+I+IDG S D V + E G
Sbjct 1 MNQQPTFSIITITYNAERWLERTILSVLSQSYPN-IEYILIDGASKDKTVEIIKQYESGI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
A W SEPD G YDAMN+G+ A+GD +WFL++ D D V + V +L K + ++
Sbjct 60 ASWISEPDKGLYDAMNKGLKRATGDYVWFLNAGDTLYTADTVQRIVASLKKKVSLPDV-I 118
Query 121 FGMDRLV-------GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+G ++V G+ R++ P + + F G +V HQ SF +A + Y+ ++
Sbjct 119 YGETQIVDAEGRSLGMRRLKAPEKLTWKSFRMG-MLVCHQ-SFISKREIAPL--YNTEYR 174
Query 174 IAADQEFILRAALVCEPV-TIRCVLCEFDTTGVGS 207
+ AD ++ ++ + + R L F G+ S
Sbjct 175 LTADYDWCIQCLRRSKRIHNTRLTLSNFLEEGLSS 209
>gi|224024984|ref|ZP_03643350.1| hypothetical protein BACCOPRO_01716 [Bacteroides coprophilus
DSM 18228]
gi|224018220|gb|EEF76218.1| hypothetical protein BACCOPRO_01716 [Bacteroides coprophilus
DSM 18228]
Length=262
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/240 (29%), Positives = 119/240 (50%), Gaps = 25/240 (10%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+SVITI++N+ +GL+ T+KSV +Q Y ++E+I+IDGGS D+ V + YW SE
Sbjct 6 ISVITINYNNKNGLEETIKSVISQTY-SKLEYIIIDGGSSDESVDIIKKYANKINYWVSE 64
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRL 126
PD G Y+AMN+G+ A G+ F++S D F +V+ + ++ G +
Sbjct 65 PDKGIYNAMNKGVIQAHGEYCIFMNSGDCFHDQNVLCNVFSGIQNADIIT---GITQLKT 121
Query 127 VGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAAL 186
+ P S + F + HQASF ++L+ + Y+ +F I +D F ++ +
Sbjct 122 KPIQFWYPPKEISFKWFHIAS--LSHQASFIKTNLLIE-NPYNENFKIVSDWAFFMQTLI 178
Query 187 V--CEPVTIRCVLCEFDTTGVGSHRE-------------PSAVFGDLRRM---GDLHRRY 228
+ C I+ ++C FD G+ + + P+ + D ++ DL RY
Sbjct 179 INGCTYKPIQTIICIFDMGGISNTKSFEKERQLAFQQMFPTRILIDYQKFIPANDLVERY 238
>gi|298377844|ref|ZP_06987794.1| glycosyltransferase [Bacteroides sp. 3_1_19]
gi|298265290|gb|EFI06953.1| glycosyltransferase [Bacteroides sp. 3_1_19]
Length=253
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/215 (34%), Positives = 115/215 (54%), Gaps = 14/215 (6%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
+ PT S+ITI++N L+RT+ SV +Q Y IE+I+IDG S D V + E G
Sbjct 1 MNQQPTFSIITITYNAERWLERTILSVLSQSYPD-IEYILIDGASKDKTVEIIKQYESGI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
+ W SEPD G YDAMN+G+ A+GD +WFL++ D D V + V +L K + ++
Sbjct 60 SSWISEPDKGLYDAMNKGLKLATGDYVWFLNAGDTLYTADTVQRIVASLKKKVSLPDV-I 118
Query 121 FGMDRLV-------GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+G R+V G+ R++ P + + F G +V HQ SF +A + Y+ ++
Sbjct 119 YGETRIVDAEGRSLGMRRLKAPEKLNWKSFRMG-MLVCHQ-SFISKREIAPL--YNTEYR 174
Query 174 IAADQEFILRAALVCEPV-TIRCVLCEFDTTGVGS 207
+ AD ++ ++ + + R +L F G+ S
Sbjct 175 LTADYDWCIQCLRRSKRIHNTRLILSNFLEEGLSS 209
>gi|212691360|ref|ZP_03299488.1| hypothetical protein BACDOR_00852 [Bacteroides dorei DSM 17855]
gi|265751886|ref|ZP_06087679.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_33FAA]
gi|212666113|gb|EEB26685.1| hypothetical protein BACDOR_00852 [Bacteroides dorei DSM 17855]
gi|263236678|gb|EEZ22148.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_33FAA]
Length=271
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 77/227 (34%), Positives = 115/227 (51%), Gaps = 29/227 (12%)
Query 3 SAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAY 62
S S+ITI++N+ +GL++T++SV Q +R E+I+IDGGS D + + Y
Sbjct 2 SQRKFSIITINYNNKEGLRKTIESVVGQSFRD-FEYIIIDGGSTDGSIEVIKEYAGKVDY 60
Query 63 WQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV------S 116
W SEPD G Y AMN+G+ A G+ L F++S D F V+ + +L V +
Sbjct 61 WVSEPDKGIYHAMNKGVLQAHGEYLNFMNSGDEFYNNGVLQEVAPSLDSDIVVGKIVHGT 120
Query 117 ELWGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAA 176
E+WGF + + +D +RG V HQASFF L + YD ++ I +
Sbjct 121 EVWGFHKEDITLMDLIRG--------------TVLHQASFFRKELFDE-NRYDENYKIVS 165
Query 177 DQEFILRAALVCEPVT---IRCVLCEFDTTGVGSHREPSAVFGDLRR 220
D +F ++ L+ VT IR ++C F GV E A DL R
Sbjct 166 DWKFYIQ-TLIFNNVTFRNIRSIVCRFVPGGVS---ETDAGTRDLER 208
>gi|237726342|ref|ZP_04556823.1| glycosyltransferase family 2 protein [Bacteroides sp. D4]
gi|229434868|gb|EEO44945.1| glycosyltransferase family 2 protein [Bacteroides dorei 5_1_36/D4]
Length=271
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/227 (34%), Positives = 115/227 (51%), Gaps = 29/227 (12%)
Query 3 SAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAY 62
S S+ITI++N+ +GL++T++SV Q +R E+I+IDGGS D + + Y
Sbjct 2 SQRKFSIITINYNNKEGLRKTIESVVGQSFRD-FEYIIIDGGSTDGSIEVIKEYAGKVDY 60
Query 63 WQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV------S 116
W SEPD G Y AMN+G+ A G+ L F++S D F V+ + +L V +
Sbjct 61 WVSEPDKGIYHAMNKGVLQAHGEYLNFMNSGDEFYNNGVLQEVAPSLDSDIVVGKIVHGT 120
Query 117 ELWGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAA 176
E+WGF + + +D +RG V HQASFF L + YD ++ I +
Sbjct 121 EVWGFHKEDITLMDLIRG--------------TVLHQASFFRKELFDE-NRYDENYKIVS 165
Query 177 DQEFILRAALVCEPVT---IRCVLCEFDTTGVGSHREPSAVFGDLRR 220
D +F ++ L+ VT IR ++C F GV E A DL R
Sbjct 166 DWKFYIQ-TLIFNNVTFRNIRSIVCRFVPGGVS---ETGAGTRDLER 208
>gi|120434921|ref|YP_860607.1| colanic acid biosynthesis glycosyl transferase [Gramella forsetii
KT0803]
gi|117577071|emb|CAL65540.1| colanic acid biosynthesis glycosyl transferase [Gramella forsetii
KT0803]
Length=262
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/196 (34%), Positives = 104/196 (54%), Gaps = 6/196 (3%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
++ P +S+IT++ ND+ G+++T+KSV Q ++ E IVIDGGS D + +
Sbjct 1 MSKTPLISIITVNLNDVKGMEKTLKSVFEQTWKT-FEFIVIDGGSKDGSKELIESHDDKI 59
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
YW SEPD G Y+AMN+GI ++G L FL+S D FS +V+ + E L+G+ + +
Sbjct 60 DYWVSEPDKGIYNAMNKGIEISTGQYLLFLNSGDVFSANNVLENSFEHLTGEAII--YFN 117
Query 121 FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
+D L+ + P L G + HQ+ F L ++G YD I +D +F
Sbjct 118 VFVDENDHLNLIEYPDELRFSDLLYG--TLCHQSVFIRKELFQQVGRYDESLKIVSDWKF 175
Query 181 ILRAALVCEPVTIRCV 196
+ +L E T + V
Sbjct 176 FI-LSLFKENCTYKKV 190
>gi|197124816|ref|YP_002136767.1| family 2 glycosyl transferase [Anaeromyxobacter sp. K]
gi|196174665|gb|ACG75638.1| glycosyl transferase family 2 [Anaeromyxobacter sp. K]
Length=284
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 77/194 (40%), Positives = 110/194 (57%), Gaps = 10/194 (5%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P++SV+TI ND GL+ T+ SV Q YR +E +V+DGGS D V + E A W
Sbjct 3 PSLSVVTICLNDRAGLEATLDSVLGQTYRD-LELVVVDGGSTDGSVDAIRARELRIARWV 61
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSE-LWG-FG 122
SEPDGG Y+ +G + A G+ L FL++ DR PD A+E L PV + ++G
Sbjct 62 SEPDGGIYEGQGKGASLAEGEWLLFLNAGDRLGAPD----ALERLMTPRPVEDVVYGDVW 117
Query 123 MDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFIL 182
++R L R P S+ + + ++PHQA+ F +L ++GGYDL + IAAD + +L
Sbjct 118 VERGGRLRPWRAPDHPSVLHLM--RSMLPHQATAFRRALFHRLGGYDLSYRIAADYDLLL 175
Query 183 RAALVCEPVTIRCV 196
R ALV T R V
Sbjct 176 R-ALVEARATTRHV 188
>gi|150006448|ref|YP_001301192.1| glycosyl transferase family protein [Bacteroides vulgatus ATCC
8482]
gi|237712195|ref|ZP_04542676.1| glycosyltransferase family 2 protein [Bacteroides sp. 9_1_42FAA]
gi|254881635|ref|ZP_05254345.1| glycosyltransferase family 2 [Bacteroides sp. 4_3_47FAA]
7 more sequence titles
Length=271
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/247 (32%), Positives = 118/247 (48%), Gaps = 30/247 (12%)
Query 3 SAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAY 62
S S+ITI++N+ +GL++T++SV Q +R E+I+IDGGS D + + Y
Sbjct 2 SQRKFSIITINYNNKEGLRKTIESVVGQSFRD-FEYIIIDGGSTDGSIEVIKEYAGKVDY 60
Query 63 WQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV------S 116
W SEPD G Y AMN+G+ A G+ L F++S D F V+ + +L V +
Sbjct 61 WVSEPDKGIYHAMNKGVLQAHGEYLNFMNSGDEFYNNGVLQEVAPSLDSDIVVGKIVHGT 120
Query 117 ELWGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAA 176
E+WGF + + +D +RG V HQASFF L + YD + I +
Sbjct 121 EVWGFHKEDITLMDLIRG--------------TVLHQASFFRKELFDE-NRYDESYKIVS 165
Query 177 DQEFILRAALVCEPV--TIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRR 234
D +F ++ + IR ++C F GV E A D+ R +P R
Sbjct 166 DWKFYIQTLIFNNATFRNIRSIVCRFVPGGVS---ETDAGTRDMERKRVYKELFP---DR 219
Query 235 ISHAYLR 241
+ Y+R
Sbjct 220 MMKDYIR 226
>gi|218259478|ref|ZP_03475201.1| hypothetical protein PRABACTJOHN_00859 [Parabacteroides johnsonii
DSM 18315]
gi|218225071|gb|EEC97721.1| hypothetical protein PRABACTJOHN_00859 [Parabacteroides johnsonii
DSM 18315]
Length=252
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/187 (37%), Positives = 104/187 (56%), Gaps = 13/187 (6%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
PT S+ITI++N + +++T+ +V +Q Y IE+IVIDGGS D V + E G AYW
Sbjct 4 PTFSIITITYNAVRLVEQTLLNVLSQSYPN-IEYIVIDGGSTDGTVDIIKRYESGLAYWV 62
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SEPD G YDAMN+G+ A+GD +WF+++ D D V V L + + ++ +G
Sbjct 63 SEPDKGIYDAMNKGLQKATGDYVWFINAGDSLCSSDTVQSVVSRLQKRKTLPDI-IYGET 121
Query 125 RLVGLD-------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAAD 177
+V + R+R P S + F G +V HQ SF VA + YDL + +AD
Sbjct 122 NIVDEERRSLGLRRLRAPRKLSWKSFRMG-MLVCHQ-SFIPKRTVAPL--YDLQYRYSAD 177
Query 178 QEFILRA 184
++ +R
Sbjct 178 FDWCIRC 184
>gi|333378622|ref|ZP_08470352.1| hypothetical protein HMPREF9456_01947 [Dysgonomonas mossii DSM
22836]
gi|332883026|gb|EGK03310.1| hypothetical protein HMPREF9456_01947 [Dysgonomonas mossii DSM
22836]
Length=272
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/185 (33%), Positives = 101/185 (55%), Gaps = 9/185 (4%)
Query 2 TSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFA 61
T+ P ++VIT+S+N L +++T+ SV Q YR IE+I+IDGGS D V + E A
Sbjct 14 TNNPKITVITVSYNVLSTIEKTILSVINQTYRN-IEYIIIDGGSSDGTVDIIKKYEDKLA 72
Query 62 YWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGF 121
YW SEPD G YDAMN+GI A+GD + F++S D F +V+++ ++ + ++G
Sbjct 73 YWISEPDNGIYDAMNKGITKATGDWINFINSGDYFLDNNVISKFIQLHDVNADI--VYGN 130
Query 122 GMDRLVGLDR--VRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQE 179
+ ++ + P P L + K H +SF + L+ ++ +D F + D
Sbjct 131 TEISISNIETSYIHKPEPIEL---IEKKMTFGHPSSFVKAYLLKEM-PFDTSFKSSGDYN 186
Query 180 FILRA 184
F ++
Sbjct 187 FFIQC 191
>gi|340348308|ref|ZP_08671398.1| glycosyl transferase [Prevotella dentalis DSM 3688]
gi|339607579|gb|EGQ12513.1| glycosyl transferase [Prevotella dentalis DSM 3688]
Length=283
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 99/180 (55%), Gaps = 10/180 (5%)
Query 8 SVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEP 67
S+IT++FN +GLQ T++SV Q Y+ E+I++DGGS D V + E YW SEP
Sbjct 4 SIITVNFNHAEGLQNTIRSVIGQSYKNY-EYIIVDGGSTDGSVDIIRSYEADLTYWVSEP 62
Query 68 DGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLV 127
DGG Y AMN+G A+GD +L+S D FS + +E ++G ++ D+ +
Sbjct 63 DGGIYQAMNKGTKMATGDYCIYLNSGDIFSN----IRVLENVAGMSLTEDI--ISGDQQI 116
Query 128 GLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRAALV 187
G +R P +L+ F + + HQA+F ++L+ K YD AAD +F + ++
Sbjct 117 GTKLLRSPDFVTLKIFFI--RSLFHQATFIKTNLI-KTHPYDESMKFAADWKFFMEMIVL 173
>gi|212691303|ref|ZP_03299431.1| hypothetical protein BACDOR_00794 [Bacteroides dorei DSM 17855]
gi|254881580|ref|ZP_05254290.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA]
gi|319642963|ref|ZP_07997599.1| glycosyltransferase [Bacteroides sp. 3_1_40A]
gi|212666535|gb|EEB27107.1| hypothetical protein BACDOR_00794 [Bacteroides dorei DSM 17855]
gi|254834373|gb|EET14682.1| hypothetical protein BSFG_00829 [Bacteroides sp. 4_3_47FAA]
gi|317385511|gb|EFV66454.1| glycosyltransferase [Bacteroides sp. 3_1_40A]
Length=258
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/220 (35%), Positives = 114/220 (52%), Gaps = 20/220 (9%)
Query 6 TVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF---AY 62
T+S+IT+++N + L T++SV +Q + IE+I++DG S D + L EP F
Sbjct 2 TISLITVTYNSGNTLFSTLQSVLSQTHLD-IEYIIVDGVSKDHTIDVLKEYEPKFNGRLR 60
Query 63 WQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG-- 120
W SEPD G YDAMN+GI A+GD++ L+S D F+G DV+AQ V A ++ ++G
Sbjct 61 WISEPDKGLYDAMNKGIRMATGDVVGILNSDDFFTGKDVLAQVVRAFETDDRLAAVYGDV 120
Query 121 --FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQ 178
D+L R F RK + + H + + K G Y D+ IAAD
Sbjct 121 HFVHPDKLDKCVRYYSSKVFK-RKLMRFGFMPAHPSFYLRRKYFGKFGCYKTDYRIAADF 179
Query 179 EFILRAA---------LVCEPVTIRCVLCEFDTTGVGSHR 209
EF+LR L + VT+R + T+GV SH+
Sbjct 180 EFLLRVIYKGNISIKYLPIDMVTMR--MGGVSTSGVVSHK 217
>gi|295132440|ref|YP_003583116.1| glycosyl transferase [Zunongwangia profunda SM-A87]
gi|294980455|gb|ADF50920.1| glycosyl transferase [Zunongwangia profunda SM-A87]
Length=248
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/181 (35%), Positives = 96/181 (54%), Gaps = 7/181 (3%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+S+ITI FN+++ + ++SV+AQ Y IEHIVIDG S D + + G E + SE
Sbjct 3 ISIITIVFNNVNTIASCIESVQAQNYNN-IEHIVIDGESTDGTIDVIKGFENNIDVFISE 61
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGM--- 123
D G YDA+N+GI A+GD++ LHS D F P ++ V+ + G L+G G+
Sbjct 62 EDNGLYDALNKGIKKATGDVIGILHSDDMFYSPHTISNVVKCMQKSGA-DLLYGNGVYVE 120
Query 124 -DRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFIL 182
+ + RV F R +L + H + S++ K G YD + IA+D + L
Sbjct 121 RNNTSNVKRVYSSKSFK-RSYLKYGWIPLHTTIYVRSNIFDKYGDYDERYSIASDYDISL 179
Query 183 R 183
R
Sbjct 180 R 180
>gi|149277635|ref|ZP_01883776.1| putative glycosyltransferase [Pedobacter sp. BAL39]
gi|149231868|gb|EDM37246.1| putative glycosyltransferase [Pedobacter sp. BAL39]
Length=249
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/185 (37%), Positives = 100/185 (55%), Gaps = 11/185 (5%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P +SVITI +N++ ++RT+ SV Q Y IE++VIDG S D + + Y+
Sbjct 3 PKLSVITIVYNNVKDIERTMLSVLNQTYTN-IEYLVIDGASKDGTKELIEKYQSRLGYFI 61
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV----SELWG 120
SEPD G YDAMN+G+A A+GD + F++S D P VA+ E G + +E++
Sbjct 62 SEPDKGIYDAMNKGLAAATGDYVLFMNSGDEIYSPVTVAEIFEC-GGHADIYYGETEMYD 120
Query 121 FGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
L G R P F+ + F G + HQA + SL A YDL + +AD ++
Sbjct 121 ENWKSL-GQRRHTAPAVFNWKSFRYGMN-ISHQAIYVRRSLTAP---YDLQYKYSADIDW 175
Query 181 ILRAA 185
I+RAA
Sbjct 176 IIRAA 180
>gi|154490349|ref|ZP_02030610.1| hypothetical protein PARMER_00582 [Parabacteroides merdae ATCC
43184]
gi|154088960|gb|EDN88004.1| hypothetical protein PARMER_00582 [Parabacteroides merdae ATCC
43184]
Length=253
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/187 (37%), Positives = 103/187 (56%), Gaps = 13/187 (6%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
PT S+ITI++N + +++T+ +V +Q Y IE+IVIDGGS D V + E G AYW
Sbjct 5 PTFSIITITYNAVRLVEQTLLNVLSQSYPN-IEYIVIDGGSTDGTVDIIRRYESGLAYWV 63
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SEPD G YDAMN+G+ A+GD +WF+++ D D V V L + + ++ +G
Sbjct 64 SEPDKGIYDAMNKGLQKATGDYVWFINAGDTLCSSDTVQSIVSKLQKRKVLPDI-IYGET 122
Query 125 RLVGLD-------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAAD 177
+V + R+R P S + F G +V HQ SF +A YDL + +AD
Sbjct 123 NIVDEERRSLGLRRLRAPRKLSWKSFRMG-MLVCHQ-SFIPKRAIAP--SYDLQYRYSAD 178
Query 178 QEFILRA 184
++ +R
Sbjct 179 FDWCIRC 185
>gi|336477564|ref|YP_004616705.1| family 2 glycosyl transferase [Methanosalsum zhilinae DSM 4017]
gi|335930945|gb|AEH61486.1| glycosyl transferase family 2 [Methanosalsum zhilinae DSM 4017]
Length=257
Score = 105 bits (261), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 58/186 (32%), Positives = 99/186 (54%), Gaps = 9/186 (4%)
Query 2 TSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFA 61
P VSVIT N ++ T++SV Q Y IE+++IDGGS D+ + + + A
Sbjct 8 NQKPVVSVITACLNSDQYIEDTIESVIKQTYPN-IEYVIIDGGSTDNTLNIIKKYDNQIA 66
Query 62 YWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGF 121
W SEPD G YDAMN+GI H++G++++FL++ D +++ + V + S L +
Sbjct 67 NWISEPDNGVYDAMNKGIVHSTGEIIYFLNAGDYLYDNEIIEKIVNEFLDE---SVLAVY 123
Query 122 GMDRLVGLDRVRGPI---PFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQ 178
G ++ ++ + I + L + V HQA F SL ++G + F +++D
Sbjct 124 GNVEVINYEKNKKSIRGCKVTYNNLLYRR--VCHQALFVRRSLFDEMGLFSTSFKLSSDH 181
Query 179 EFILRA 184
EFI++
Sbjct 182 EFIIKC 187
>gi|146300270|ref|YP_001194861.1| glycosyl transferase family protein [Flavobacterium johnsoniae
UW101]
gi|146154688|gb|ABQ05542.1| Candidate beta-D-/alpha-L glycosyltransferase; Glycosyltransferase
family 2 [Flavobacterium johnsoniae UW101]
Length=273
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/230 (30%), Positives = 119/230 (52%), Gaps = 17/230 (7%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+SVIT++FND +GLQ+T++SV Q Y+ E IVIDGGS D + + YW SE
Sbjct 4 ISVITVNFNDKNGLQKTLESVTNQSYKD-FEFIVIDGGSTDGSKELIEEYQDKITYWVSE 62
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRL 126
D G ++AMN+GI A+G+ + F++ D F+ V+ V +L+ + + +G +
Sbjct 63 KDSGVFNAMNKGIKAATGEFVIFMNGGDCFNDNLVLQDNVSSLTSEYDI--YYGDNYKQS 120
Query 127 VGLDRVRG---PIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILR 183
G R++ + FS F + HQ++F SL K Y+ ++ IA+D EF +
Sbjct 121 NGSKRLKTYTEKLHFSF--FYTSS--INHQSTFIRKSLFDKYFYYNENYKIASDWEFFVY 176
Query 184 AALVCEPVT----IRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYP 229
+C ++ + +D TG+ S+ + ++ + ++ + +P
Sbjct 177 T--ICHQNVAYKYLKKTIAIYDFTGISSNPKFRDLYNEEKKQT-FQKYFP 223
>gi|329962365|ref|ZP_08300370.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
gi|328530226|gb|EGF57107.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
Length=248
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/246 (30%), Positives = 124/246 (51%), Gaps = 14/246 (5%)
Query 4 APTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYW 63
P SVIT+++N L+ T++SV +Q Y +E+I++DG S D+ ++ ++ + +
Sbjct 6 TPKFSVITVTYNAEAVLEDTIQSVISQTYH-HVEYIIVDGASKDNTLSVINRYKEHISRI 64
Query 64 QSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFG- 122
SEPD G YDAMN+GI A+GD L FL++ D F D + Q V +++GK L+G
Sbjct 65 VSEPDKGLYDAMNKGIKLATGDYLCFLNAGDSFHEDDTLQQMVHSITGKELPDILYGETA 124
Query 123 -MDR---LVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQ 178
+D+ + + R+ P + + F G +V HQA F +SLV YDL + +AD
Sbjct 125 LVDKEGHFLRMRRLSAPQALTWKSFKQG-MLVCHQAFFAKTSLVE---PYDLQYRFSADF 180
Query 179 EFILRAALVCEPV-TIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISH 237
++ +R + + ++ G+ + +++ R M R Y + H
Sbjct 181 DWCIRVMKKARTLHNTHLTIIDYLEEGITTQNRKASLRERFRIMA---RHYGWLSTTAHH 237
Query 238 AYLRGR 243
A+ R
Sbjct 238 AWFVLR 243
>gi|336412731|ref|ZP_08593084.1| hypothetical protein HMPREF1017_00192 [Bacteroides ovatus 3_8_47FAA]
gi|335942777|gb|EGN04619.1| hypothetical protein HMPREF1017_00192 [Bacteroides ovatus 3_8_47FAA]
Length=249
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/188 (36%), Positives = 102/188 (55%), Gaps = 10/188 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
V P S+IT+++N L+ T++SV +Q Y IE+I++DG S D ++ ++ P
Sbjct 4 VHPTPKFSIITVTYNAEKVLEDTIQSVISQTYH-HIEYIIVDGASKDGTLSIINRYRPHI 62
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
SEPD G YDAMN+GIA ASGD L FL++ D F D + Q V ++G L+G
Sbjct 63 HTVVSEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGNELPDILYG 122
Query 121 ----FGMDR-LVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIA 175
DR + + R+ P + + F G +V HQA F +LV G Y+L + +
Sbjct 123 ETAIVDKDRHFLRMRRLSAPETLTWKSFKQG-MLVCHQAFFPRHTLV---GPYNLQYRFS 178
Query 176 ADQEFILR 183
AD ++ +R
Sbjct 179 ADFDWCIR 186
>gi|145225343|ref|YP_001136021.1| glycosyl transferase family protein [Mycobacterium gilvum PYR-GCK]
gi|145217829|gb|ABP47233.1| glycosyl transferase, family 2 [Mycobacterium gilvum PYR-GCK]
Length=269
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/205 (33%), Positives = 104/205 (51%), Gaps = 10/205 (4%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ-S 65
+S++T+ +NDL+ +QRT+ S+ Q R EH++IDG S D + F + S
Sbjct 9 LSIVTVCWNDLENVQRTMSSLNRQTERHGWEHVLIDGASSDGTADWYRSAHFDFPHHALS 68
Query 66 EPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDR 125
EPD G +DAMN+ + GD + F+++ D ++ V + + + P WG+ R
Sbjct 69 EPDDGLFDAMNKSLDVVKGDYVVFMNAGDTYADDGAVRRVLHRIQTNPP----WGYSRAR 124
Query 126 LVGLD--RVR---GPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
+V +VR G IP+S L G+ + HQA + L+ + G+D G AAD
Sbjct 125 VVDNSGRKVRPHVGRIPYSRTMHLLGRATICHQAVVMRTDLLRDLKGFDKRLGTAADYHL 184
Query 181 ILRAALVCEPVTIRCVLCEFDTTGV 205
++RAAL P T V +F GV
Sbjct 185 LIRAALRQVPDTWAEVDVDFLAGGV 209
>gi|336415876|ref|ZP_08596214.1| hypothetical protein HMPREF1017_03322 [Bacteroides ovatus 3_8_47FAA]
gi|335939779|gb|EGN01651.1| hypothetical protein HMPREF1017_03322 [Bacteroides ovatus 3_8_47FAA]
Length=265
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/220 (32%), Positives = 116/220 (53%), Gaps = 7/220 (3%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P +SV+T+S+N + +++T+ SV Q+Y IE+I+IDGGS D V + E +YW
Sbjct 19 PLISVVTVSYNAIATIEQTILSVINQKYSA-IEYIIIDGGSTDGTVDIIKKYEDKISYWV 77
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SE D G YDAMN+GI HA+G ++ ++S D + D A + KG + L+ +
Sbjct 78 SERDNGIYDAMNKGIRHATGKIVGIINSDDWYE-MDAFATVADLYLKKGDLY-LYHGDIR 135
Query 125 RLVGLDRVRGPIP-FSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILR 183
+G R P +L F G V H F ++ K+G +D + IA D +F+LR
Sbjct 136 YCLGKQETRVDSPNLNLSDFYKG-SCVFHPTVFVPLAVYRKVGVFDTAYKIAGDYDFLLR 194
Query 184 AAL-VCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMG 222
L + + I ++ TG+ S+R+ + +++ +G
Sbjct 195 CYLNSIKFMYINRIIANMRITGI-SNRQRIMTYEEVKTIG 233
>gi|332292692|ref|YP_004431301.1| glycosyl transferase family 2 [Krokinobacter diaphorus 4H-3-7-5]
gi|332170778|gb|AEE20033.1| glycosyl transferase family 2 [Krokinobacter sp. 4H-3-7-5]
Length=279
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/184 (36%), Positives = 94/184 (52%), Gaps = 9/184 (4%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
VS+ITI+ ND GL++T SV AQ ++ EHI+IDG S D + Y+ + F YW SE
Sbjct 3 VSIITINLNDKTGLEKTCSSVFAQTFKDY-EHIIIDGASIDGSLDYIETHKDKFTYWISE 61
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV--SELWGFGMD 124
PD G Y+AMN+GI A GD + FL+S D +V ++ +L + LW G
Sbjct 62 PDSGVYNAMNKGIRQAKGDYVLFLNSGDILYNENVFSKVENSLVKDIDLVYGNLWIEGRK 121
Query 125 RLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILRA 184
++ G I F+ K + + H A+F +L G Y D I +D F +
Sbjct 122 GEGFTNKYPGVIDFNFFK----RTSLGHAATFIKRTLFETYGVYRTDLKIVSDWAFFFK- 176
Query 185 ALVC 188
L+C
Sbjct 177 -LIC 179
>gi|299149042|ref|ZP_07042104.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_23]
gi|298513803|gb|EFI37690.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_23]
Length=262
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/186 (35%), Positives = 101/186 (55%), Gaps = 7/186 (3%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P VS+IT+ +N + +QRT+ SV +Q Y IE+I+IDG S D +A + E ++W
Sbjct 6 PRVSIITVCYNAVSAIQRTILSVLSQTYEN-IEYIIIDGKSTDGTLAIIKKYESRISFWL 64
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SEPD G YDAMN+ I +A+G+ + F+++ D F VV + V + + +G +
Sbjct 65 SEPDNGIYDAMNKAINNATGEWVLFMNADDEFVDNKVVERFVFVVKTHKNYAVYYGDTIL 124
Query 125 RLVGLDR-VRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFILR 183
R DR V+ S+R +A HQ+ F + L+ K Y L + +A+D +F+L
Sbjct 125 RYAIGDRPVKALGLKSMRYMMA----FSHQSVFIRTDLM-KGKKYSLKYKLASDYDFLLS 179
Query 184 AALVCE 189
L E
Sbjct 180 IYLAGE 185
>gi|237725561|ref|ZP_04556042.1| glycosyltransferase family 2 protein [Bacteroides sp. D4]
gi|229436248|gb|EEO46325.1| glycosyltransferase family 2 protein [Bacteroides dorei 5_1_36/D4]
Length=271
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 64/194 (33%), Positives = 101/194 (53%), Gaps = 26/194 (13%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQ 64
P +SVIT+S+N + +++T+ SV Q Y IE+I+IDGGS D V + AYW
Sbjct 4 PLISVITVSYNAVLTIEQTILSVINQTYLN-IEYIIIDGGSTDGTVNVIKKYADKIAYWV 62
Query 65 SEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMD 124
SEPD G YDAMN+GIA++ G+ F+++ D+F ++ Q ++ + D
Sbjct 63 SEPDKGIYDAMNKGIAYSHGEYCNFINAGDKFCSSSILKQVMD-----------FNHVAD 111
Query 125 RLVGLD-----------RVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+VG D R P ++L F +PHQ+ F +SL+ K YD
Sbjct 112 IIVGQDLHVNEHNKIVSRSVLPRRYNLLHFYI--TTIPHQSCFIRASLLKKY-YYDTSLK 168
Query 174 IAADQEFILRAALV 187
I +D +F L++ ++
Sbjct 169 IVSDWKFYLQSIVL 182
>gi|330996431|ref|ZP_08320314.1| glycosyltransferase, group 2 family protein [Paraprevotella xylaniphila
YIT 11841]
gi|329573289|gb|EGG54903.1| glycosyltransferase, group 2 family protein [Paraprevotella xylaniphila
YIT 11841]
Length=263
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 73/241 (31%), Positives = 118/241 (49%), Gaps = 24/241 (9%)
Query 2 TSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYL-------- 53
T PTV++ T+++N L T++SV AQ Y IEH++IDG S D +A +
Sbjct 4 TPFPTVTIATVTYNAEKTLPCTLESVAAQTY-PHIEHLIIDGCSKDQTMALIHRYVDANT 62
Query 54 -SGCEPGFAYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGK 112
C P EPD G YDAMN+ + HA+GD + FL++ DR GPD + Q +
Sbjct 63 GKTC-PHDIRVIREPDKGLYDAMNKALEHATGDYIVFLNAGDRLHGPDTLQQVFGHIGRN 121
Query 113 GPVSEL------WGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIG 166
+ W G R + R++ P + +KF G +V HQ SF+ + +A+
Sbjct 122 TSRPAILYGETDWVDGQGRFIRHRRLQAPKQLTWKKFRHG-MLVCHQ-SFYVRTDLARQE 179
Query 167 GYDLDFGIAADQEFIL----RAALVCEPV-TIRCVLCEFDTTGVGSHREPSAVFGDLRRM 221
YDL F +AD ++ + RA + +P+ R +L ++ + G+ + ++ R M
Sbjct 180 LYDLSFRFSADYDWCIRLLKRAERLKQPIYNTRLILTDYLSEGLTTQHHQESLRERFRIM 239
Query 222 G 222
Sbjct 240 A 240
>gi|237755814|ref|ZP_04584414.1| glycosyltransferase family 2 [Sulfurihydrogenibium yellowstonense
SS-5]
gi|237692024|gb|EEP61032.1| glycosyltransferase family 2 [Sulfurihydrogenibium yellowstonense
SS-5]
Length=403
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/215 (31%), Positives = 108/215 (51%), Gaps = 12/215 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
T P ++VIT+ +N L+ T+KSV Q Y +E+I+IDGGS D + + E
Sbjct 141 TTFLPLITVITVVYNGAKYLEDTIKSVINQTYPN-VEYIIIDGGSTDGTIDIIKKYEDYI 199
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDV---VAQAVEALSGKGPVSE 117
YW SEPD G YDAMN+GI +G+ + F+++ D+F + + Q V++ +
Sbjct 200 DYWVSEPDRGIYDAMNKGIDLVTGEWINFMNAGDKFFNDNTIFFIYQNVKSFDSNYDIVY 259
Query 118 LWGFGMD---RLVGLDRVRGPIPF-SLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFG 173
+D LV + + F L+K+++ +PHQ++F+ KIG YD +
Sbjct 260 GKAGVIDEKRELVSIQGLNERESFRKLKKYMS----IPHQSTFYRLEFFKKIGKYDDNLK 315
Query 174 IAADQEFILRAALVCEPVTIRCVLCEFDTTGVGSH 208
+A D E +LR + I +L + G+ +
Sbjct 316 LAGDYEILLRNYKNLQVRFIDSILSLMLSNGIAAK 350
>gi|336404145|ref|ZP_08584843.1| hypothetical protein HMPREF0127_02156 [Bacteroides sp. 1_1_30]
gi|335943473|gb|EGN05312.1| hypothetical protein HMPREF0127_02156 [Bacteroides sp. 1_1_30]
Length=249
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/188 (36%), Positives = 101/188 (54%), Gaps = 10/188 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
V P S+IT+++N L+ T++SV +Q Y IE+I++DG S D ++ ++ P
Sbjct 4 VHPTPKFSIITVTYNAEKVLEDTIQSVISQTYH-HIEYIIVDGASKDGTISIINRYRPHI 62
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
SEPD G YDAMN+GIA ASGD L FL++ D F D + Q V ++G L+G
Sbjct 63 HTVVSEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGSELPDVLYG 122
Query 121 ----FGMDR-LVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIA 175
DR + + R+ P + + F G +V HQA F +LV Y+L + +
Sbjct 123 ETAIVDQDRHFLRMRRLSAPETLTWKSFKQG-MLVCHQAFFPRHTLVEP---YNLKYRFS 178
Query 176 ADQEFILR 183
AD ++ +R
Sbjct 179 ADFDWCIR 186
>gi|298479932|ref|ZP_06998131.1| glycosyl transferase, group 2 family [Bacteroides sp. D22]
gi|295085639|emb|CBK67162.1| Glycosyltransferases involved in cell wall biogenesis [Bacteroides
xylanisolvens XB1A]
gi|298273741|gb|EFI15303.1| glycosyl transferase, group 2 family [Bacteroides sp. D22]
Length=249
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/188 (36%), Positives = 101/188 (54%), Gaps = 10/188 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
V P S+IT+++N L+ T++SV +Q Y IE+I++DG S D ++ ++ P
Sbjct 4 VHPTPKFSIITVTYNAEKVLEDTIQSVISQTYH-HIEYIIVDGASKDGTISIINRYRPHI 62
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
SEPD G YDAMN+GIA ASGD L FL++ D F D + Q V ++G L+G
Sbjct 63 HTVVSEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGSELPDVLYG 122
Query 121 ----FGMDR-LVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIA 175
DR + + R+ P + + F G +V HQA F +LV Y+L + +
Sbjct 123 ETAIVDQDRHFLRMRRLSAPETLTWKSFKQG-MLVCHQAFFPRHTLVEP---YNLKYRFS 178
Query 176 ADQEFILR 183
AD ++ +R
Sbjct 179 ADFDWCIR 186
>gi|329963019|ref|ZP_08300799.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
gi|328529060|gb|EGF55990.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
Length=254
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 64/186 (35%), Positives = 97/186 (53%), Gaps = 12/186 (6%)
Query 6 TVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF---AY 62
+S+IT++FN L+ T++SV +Q Y IE+I+IDG S DD V+ + EP F
Sbjct 2 NISLITVTFNSAASLRYTIQSVLSQSYPN-IEYIIIDGASKDDTVSIIKEYEPKFYGRMR 60
Query 63 WQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFG 122
W SEPD G YDAMN+GI A+GD++ ++S D + DV+A+ V + + +G
Sbjct 61 WISEPDKGLYDAMNKGIRMATGDIVGIINSDDFYHRTDVMAKVVNVFQDEAVQT---VYG 117
Query 123 MDRLVGLDRVRGPIPFSLRKFLAGKQ----VVPHQASFFG-SSLVAKIGGYDLDFGIAAD 177
R V D + + + K K +P +FF + G Y D+ IAAD
Sbjct 118 DVRFVNSDNLDKTVRYYSSKNFFPKLFRYGFMPAHPTFFTYRRYFEEFGYYKTDYKIAAD 177
Query 178 QEFILR 183
E ++R
Sbjct 178 YELLIR 183
>gi|337754896|ref|YP_004647407.1| Colanic acid biosynthesis glycosyl transferase WcaE [Francisella
sp. TX077308]
gi|336446501|gb|AEI35807.1| Colanic acid biosynthesis glycosyl transferase WcaE [Francisella
sp. TX077308]
Length=256
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 66/189 (35%), Positives = 101/189 (54%), Gaps = 17/189 (8%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
VS+IT+ +N + +++T++SV +Q Y+ IE+I++DGGS D + L+ + + SE
Sbjct 3 VSIITVCYNSAETIEKTIQSVISQSYQD-IEYIIVDGGSYDGTLDILAKYQSKLSKVVSE 61
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEAL--SGKGPVSELWGFGMD 124
D G YDAMN+GIA A+GD++ L+S D +S DV+ V +GK + F D
Sbjct 62 SDDGIYDAMNKGIALATGDIIGILNSDDCYSNSDVIDNIVSTFRQNGKDMLFADLKFVND 121
Query 125 RLVGLDRVRGPIP--FSLRKFLAGKQ----VVPHQASFFGSSLVAKIGGYDLDFGIAADQ 178
G I +S ++F + + PH F ++ K G Y LDF IAAD
Sbjct 122 --------NGKILRYYSAKRFTPNRLKFGIMPPHPTLFVKKAIYQKYGVYSLDFKIAADY 173
Query 179 EFILRAALV 187
E +R LV
Sbjct 174 EIFVRFLLV 182
>gi|153806959|ref|ZP_01959627.1| hypothetical protein BACCAC_01235 [Bacteroides caccae ATCC 43185]
gi|149130079|gb|EDM21289.1| hypothetical protein BACCAC_01235 [Bacteroides caccae ATCC 43185]
Length=251
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/188 (37%), Positives = 99/188 (53%), Gaps = 10/188 (5%)
Query 1 VTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGF 60
V P SVIT+++N L+ T++SV AQ Y +E+I++DG S D ++ + P
Sbjct 4 VHPTPKFSVITVTYNAEKVLEDTIQSVIAQTYH-HVEYIIVDGASKDGTLSIIDRYRPRI 62
Query 61 AYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG 120
SEPD G YDAMN+GIA ASGD L FL++ D F D + Q V ++G L+G
Sbjct 63 HTVVSEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGNELPDVLYG 122
Query 121 FG--MDRLVGLDRVR---GPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIA 175
+D R+R P + + F G +V HQA F +LV YDL F +
Sbjct 123 ETAIVDEKRHFQRMRRLSAPEVLTWKSFKQG-MLVCHQAFFARHTLVEP---YDLKFRYS 178
Query 176 ADQEFILR 183
AD ++ +R
Sbjct 179 ADFDWCVR 186
>gi|220918345|ref|YP_002493649.1| family 2 glycosyl transferase [Anaeromyxobacter dehalogenans
2CP-1]
gi|219956199|gb|ACL66583.1| glycosyl transferase family 2 [Anaeromyxobacter dehalogenans
2CP-1]
Length=261
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/204 (39%), Positives = 110/204 (54%), Gaps = 16/204 (7%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
VS+IT SFN ++RTV+SV +QR +E++V+DGGS D ++ L E G W SE
Sbjct 3 VSIITPSFNQGGFIERTVESVLSQRGDFDLEYLVVDGGSTDATLSVLRRYE-GRLRWISE 61
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRL 126
PDGG+ DA+N+G+ SG+++ +L+S D + P V AVEAL G W FG R+
Sbjct 62 PDGGQCDAINKGLRMTSGEVVTWLNSDDTYE-PGAVQAAVEALRGGAR----WCFGECRI 116
Query 127 VGL-DRV--RGPIPFS---LRKF----LAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAA 176
V DR RG + R++ L G+ + A+FF LVA++G D A
Sbjct 117 VDEHDREIRRGVTAYKNAQARRYGLARLLGRNFISQPATFFRRDLVAEVGPLDEALHFAM 176
Query 177 DQEFILRAALVCEPVTIRCVLCEF 200
D + LR A V PV + L F
Sbjct 177 DYDLWLRFARVARPVFVPRPLAAF 200
>gi|255536067|ref|YP_003096438.1| hypothetical protein FIC_01934 [Flavobacteriaceae bacterium 3519-10]
gi|255342263|gb|ACU08376.1| hypothetical protein FIC_01934 [Flavobacteriaceae bacterium 3519-10]
Length=249
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/182 (33%), Positives = 95/182 (53%), Gaps = 11/182 (6%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+S+IT+ +N ++ T+KSV+ Q Y+ IE+IVIDG S D + + + Y+ SE
Sbjct 3 ISIITVCYNSAKTIEETLKSVQNQTYKN-IEYIVIDGASKDATTSIIEKYKDIITYFSSE 61
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWG----FG 122
PD G YDAMN+GI A+GD + ++S D F + + + + + + G
Sbjct 62 PDKGLYDAMNKGIQQATGDYVGIINSDDVFYDNKTIEKVADYFAEHPHLDAITGNIVQHK 121
Query 123 MDRLVGLDRVRGPIPFSLRK-FLAGKQVVPHQASFFGSSLVAKIGGYDLDFGIAADQEFI 181
+R++ P L++ F+ PH A F +SL K G Y LD+ I+AD E I
Sbjct 122 NNRIIRKYSSANWSPEKLKRGFMP-----PHPAIFMKTSLYHKFGLYRLDYKISADYELI 176
Query 182 LR 183
+R
Sbjct 177 IR 178
>gi|343085763|ref|YP_004775058.1| family 2 glycosyl transferase [Cyclobacterium marinum DSM 745]
gi|342354297|gb|AEL26827.1| glycosyl transferase family 2 [Cyclobacterium marinum DSM 745]
Length=252
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/186 (35%), Positives = 95/186 (52%), Gaps = 10/186 (5%)
Query 7 VSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSE 66
+S+IT+++N LQ T+ SV Q Y IE+IV+DG S D+ V + E + W SE
Sbjct 5 ISLITVTYNSAQTLQDTISSVAKQEYPD-IEYIVVDGSSKDNTVDIIRRNESTISNWVSE 63
Query 67 PDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRL 126
PD G YDAMN+GI A+GD++ L+S D F V+A+ A + + G D +
Sbjct 64 PDKGMYDAMNKGINMATGDVIGILNSDDFFFDTKVLAEIAAAFNTSSSPDAVIG---DIV 120
Query 127 VGLDRVRGPIP--FSLRKFLAGK----QVVPHQASFFGSSLVAKIGGYDLDFGIAADQEF 180
D I +S +K+ K + PH + F K+G Y D+ IAAD E
Sbjct 121 FVDDEDENKIIRRYSAKKWKPSKFAWGYMPPHPSFFVRKEFFDKLGPYKTDYKIAADYEL 180
Query 181 ILRAAL 186
++R L
Sbjct 181 LIRFLL 186
>gi|332881317|ref|ZP_08448967.1| glycosyltransferase, group 2 family protein [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332680693|gb|EGJ53640.1| glycosyltransferase, group 2 family protein [Capnocytophaga sp.
oral taxon 329 str. F0087]
Length=263
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/238 (29%), Positives = 117/238 (50%), Gaps = 24/238 (10%)
Query 5 PTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYL---------SG 55
PTV++ T+++N L T++SV +Q + IEH++IDG S D + +
Sbjct 7 PTVTIATVTYNAEKTLPCTLESVASQTF-SHIEHLIIDGCSKDQTMELIHRYVDANTGKT 65
Query 56 CEPGFAYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPV 115
C P EPD G YDAMN+ + HA+GD + FL++ DR PD + Q + +
Sbjct 66 C-PHDIRVVREPDKGLYDAMNKALEHATGDYIVFLNAGDRLHSPDTLQQVFGHIDMNASL 124
Query 116 SEL------WGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSSLVAKIGGYD 169
+ W G R + R++ P + +KF G +V HQ SF+ + +A+ YD
Sbjct 125 PAILYGETDWVDGQGRFIRHRRLQAPKQLTWKKFRHG-MLVCHQ-SFYVRTDLARQELYD 182
Query 170 LDFGIAADQEFIL----RAALVCEPV-TIRCVLCEFDTTGVGSHREPSAVFGDLRRMG 222
L+F +AD ++ + RA + +P+ R +L ++ + G+ + ++ R M
Sbjct 183 LNFRFSADYDWCIRLLKRAERIKQPIYNTRLILTDYLSEGLTTQHHQESLRERFRIMA 240
Lambda K H
0.324 0.141 0.440
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 391676602750
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40