BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2957
Length=275
Score E
Sequences producing significant alignments: (Bits) Value
gi|289746755|ref|ZP_06506133.1| conserved hypothetical protein [... 567 8e-160
gi|15610094|ref|NP_217473.1| glycosyl transferase [Mycobacterium... 563 1e-158
gi|289448625|ref|ZP_06438369.1| glycosyltransferase [Mycobacteri... 561 4e-158
gi|254365594|ref|ZP_04981639.1| hypothetical glycosyl transferas... 559 1e-157
gi|15842506|ref|NP_337543.1| glycosyl transferase [Mycobacterium... 524 5e-147
gi|308232295|ref|ZP_07664049.1| glycosyltransferase [Mycobacteri... 516 2e-144
gi|240171778|ref|ZP_04750437.1| putative glycosyl transferase [M... 387 1e-105
gi|289763135|ref|ZP_06522513.1| glycosyl transferase [Mycobacter... 362 3e-98
gi|41407332|ref|NP_960168.1| hypothetical protein MAP1234 [Mycob... 311 6e-83
gi|296170616|ref|ZP_06852192.1| PGL/P-HBAD biosynthesis glycosyl... 310 1e-82
gi|254775703|ref|ZP_05217219.1| hypothetical protein MaviaA2_136... 306 2e-81
gi|118467159|ref|YP_882435.1| hypothetical protein MAV_3253 [Myc... 199 5e-49
gi|325105037|ref|YP_004274691.1| glycosyl transferase family 2 [... 179 4e-43
gi|313203617|ref|YP_004042274.1| glycosyl transferase family 2 [... 170 2e-40
gi|258516003|ref|YP_003192225.1| glycosyl transferase family 2 [... 155 4e-36
gi|89890626|ref|ZP_01202136.1| glycosyl transferase, family 2 [F... 153 2e-35
gi|148265823|ref|YP_001232529.1| glycosyl transferase family pro... 151 1e-34
gi|218245578|ref|YP_002370949.1| family 2 glycosyl transferase [... 147 1e-33
gi|257058624|ref|YP_003136512.1| family 2 glycosyl transferase [... 144 1e-32
gi|325105035|ref|YP_004274689.1| glycosyl transferase family 2 [... 144 2e-32
gi|213962286|ref|ZP_03390549.1| putative colanic acid biosynthes... 143 3e-32
gi|158520542|ref|YP_001528412.1| glycosyl transferase family pro... 141 1e-31
gi|171910983|ref|ZP_02926453.1| hypothetical glycosyl transferas... 139 4e-31
gi|265766794|ref|ZP_06094623.1| glycosyl transferase [Bacteroide... 136 4e-30
gi|237808793|ref|YP_002893233.1| glycosyl transferase family 2 [... 135 5e-30
gi|302340722|ref|YP_003805928.1| family 2 glycosyl transferase [... 134 2e-29
gi|311746617|ref|ZP_07720402.1| PGL/P-HBAD biosynthesis glycosyl... 131 1e-28
gi|148265247|ref|YP_001231953.1| glycosyl transferase family pro... 130 2e-28
gi|225849016|ref|YP_002729180.1| glycosyl transferase, family 2 ... 129 5e-28
gi|198274036|ref|ZP_03206568.1| hypothetical protein BACPLE_0017... 127 1e-27
gi|254881236|ref|ZP_05253946.1| glycosyltransferase family 2 [Ba... 126 4e-27
gi|150003465|ref|YP_001298209.1| glycosyl transferase family pro... 126 4e-27
gi|294777653|ref|ZP_06743104.1| glycosyltransferase, group 2 fam... 125 5e-27
gi|212695416|ref|ZP_03303544.1| hypothetical protein BACDOR_0496... 124 1e-26
gi|340355316|ref|ZP_08678004.1| glycosyl transferase [Sporosarci... 124 2e-26
gi|218131537|ref|ZP_03460341.1| hypothetical protein BACEGG_0315... 124 2e-26
gi|317474796|ref|ZP_07934070.1| glycosyl transferase family 2 [B... 123 3e-26
gi|336412731|ref|ZP_08593084.1| hypothetical protein HMPREF1017_... 123 3e-26
gi|337754893|ref|YP_004647404.1| glycosyl transferase family pro... 122 4e-26
gi|149277304|ref|ZP_01883446.1| putative LPS biosynthesis relate... 121 1e-25
gi|224025460|ref|ZP_03643826.1| hypothetical protein BACCOPRO_02... 120 2e-25
gi|312892413|ref|ZP_07751908.1| glycosyl transferase family 2 [M... 120 2e-25
gi|312892412|ref|ZP_07751907.1| glycosyl transferase family 2 [M... 120 3e-25
gi|329962365|ref|ZP_08300370.1| glycosyltransferase, group 2 fam... 120 3e-25
gi|189423848|ref|YP_001951025.1| family 2 glycosyl transferase [... 119 4e-25
gi|315921310|ref|ZP_07917550.1| conserved hypothetical protein [... 119 7e-25
gi|313149202|ref|ZP_07811395.1| conserved hypothetical protein [... 118 1e-24
gi|167763889|ref|ZP_02436016.1| hypothetical protein BACSTE_0227... 117 1e-24
gi|53715791|ref|YP_101783.1| putative glycosyltransferase [Bacte... 117 2e-24
gi|329956536|ref|ZP_08297133.1| glycosyltransferase, group 2 fam... 117 2e-24
>gi|289746755|ref|ZP_06506133.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289687283|gb|EFD54771.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=542
Score = 567 bits (1461), Expect = 8e-160, Method: Compositional matrix adjust.
Identities = 275/275 (100%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 60
MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG
Sbjct 268 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 327
Query 61 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 120
GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD
Sbjct 328 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 387
Query 121 TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 180
TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT
Sbjct 388 TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 447
Query 181 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 240
IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS
Sbjct 448 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 507
Query 241 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR
Sbjct 508 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 542
>gi|15610094|ref|NP_217473.1| glycosyl transferase [Mycobacterium tuberculosis H37Rv]
gi|31794133|ref|NP_856626.1| glycosyl transferase [Mycobacterium bovis AF2122/97]
gi|121638838|ref|YP_979062.1| putative glycosyl transferase [Mycobacterium bovis BCG str. Pasteur
1173P2]
52 more sequence titles
Length=275
Score = 563 bits (1451), Expect = 1e-158, Method: Compositional matrix adjust.
Identities = 275/275 (100%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 60
MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG
Sbjct 1 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 60
Query 61 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 120
GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD
Sbjct 61 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 120
Query 121 TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 180
TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT
Sbjct 121 TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 180
Query 181 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 240
IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS
Sbjct 181 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 240
Query 241 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR
Sbjct 241 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
>gi|289448625|ref|ZP_06438369.1| glycosyltransferase [Mycobacterium tuberculosis CPHL_A]
gi|289421583|gb|EFD18784.1| glycosyltransferase [Mycobacterium tuberculosis CPHL_A]
Length=275
Score = 561 bits (1446), Expect = 4e-158, Method: Compositional matrix adjust.
Identities = 274/275 (99%), Positives = 274/275 (99%), Gaps = 0/275 (0%)
Query 1 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 60
MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG
Sbjct 1 MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDG 60
Query 61 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 120
GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD
Sbjct 61 GSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEAD 120
Query 121 TLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 180
TLARVAAFIGEHEPSDLVYGDVIMRSTN RWGGAFDLDRLLFKRNICHQAIFYRRGLFGT
Sbjct 121 TLARVAAFIGEHEPSDLVYGDVIMRSTNLRWGGAFDLDRLLFKRNICHQAIFYRRGLFGT 180
Query 181 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 240
IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS
Sbjct 181 IGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMS 240
Query 241 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR
Sbjct 241 TRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
>gi|254365594|ref|ZP_04981639.1| hypothetical glycosyl transferase [Mycobacterium tuberculosis
str. Haarlem]
gi|134151107|gb|EBA43152.1| hypothetical glycosyl transferase [Mycobacterium tuberculosis
str. Haarlem]
Length=274
Score = 559 bits (1441), Expect = 1e-157, Method: Compositional matrix adjust.
Identities = 273/274 (99%), Positives = 274/274 (100%), Gaps = 0/274 (0%)
Query 2 VQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGG 61
+QTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGG
Sbjct 1 MQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGG 60
Query 62 STDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADT 121
STDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADT
Sbjct 61 STDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADT 120
Query 122 LARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTI 181
LARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTI
Sbjct 121 LARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTI 180
Query 182 GPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMST 241
GPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMST
Sbjct 181 GPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMST 240
Query 242 RLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
RLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR
Sbjct 241 RLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 274
>gi|15842506|ref|NP_337543.1| glycosyl transferase [Mycobacterium tuberculosis CDC1551]
gi|61250837|sp|P0A599.1|GLTR1_MYCTU RecName: Full=PGL/p-HBAD biosynthesis glycosyltransferase Rv2957/MT3031
gi|61250839|sp|P0A5A0.1|GLTR1_MYCBO RecName: Full=PGL/p-HBAD biosynthesis glycosyltransferase Mb2981
gi|166233531|sp|A1KMV1.2|GLTR1_MYCBP RecName: Full=PGL/p-HBAD biosynthesis glycosyltransferase BCG_2978
gi|166233532|sp|A5U6W5.2|GLTR1_MYCTA RecName: Full=PGL/p-HBAD biosynthesis glycosyltransferase MRA_2984
gi|13882815|gb|AAK47357.1| glycosyl transferase [Mycobacterium tuberculosis CDC1551]
gi|339295794|gb|AEJ47905.1| hypothetical protein CCDC5079_2715 [Mycobacterium tuberculosis
CCDC5079]
gi|339299407|gb|AEJ51517.1| hypothetical protein CCDC5180_2680 [Mycobacterium tuberculosis
CCDC5180]
Length=256
Score = 524 bits (1350), Expect = 5e-147, Method: Compositional matrix adjust.
Identities = 256/256 (100%), Positives = 256/256 (100%), Gaps = 0/256 (0%)
Query 20 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 79
MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE
Sbjct 1 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 60
Query 80 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 139
RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY
Sbjct 61 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 120
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR
Sbjct 121 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 180
Query 200 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVI 259
CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVI
Sbjct 181 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVI 240
Query 260 SRAMVMRTVISWRRRR 275
SRAMVMRTVISWRRRR
Sbjct 241 SRAMVMRTVISWRRRR 256
>gi|308232295|ref|ZP_07664049.1| glycosyltransferase [Mycobacterium tuberculosis SUMu001]
gi|308369914|ref|ZP_07666826.1| glycosyltransferase [Mycobacterium tuberculosis SUMu002]
gi|308371186|ref|ZP_07667105.1| glycosyltransferase [Mycobacterium tuberculosis SUMu003]
14 more sequence titles
Length=252
Score = 516 bits (1329), Expect = 2e-144, Method: Compositional matrix adjust.
Identities = 252/252 (100%), Positives = 252/252 (100%), Gaps = 0/252 (0%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII
Sbjct 1 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 60
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 143
HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI
Sbjct 61 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 120
Query 144 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 203
MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN
Sbjct 121 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 180
Query 204 PALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAM 263
PALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAM
Sbjct 181 PALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAM 240
Query 264 VMRTVISWRRRR 275
VMRTVISWRRRR
Sbjct 241 VMRTVISWRRRR 252
>gi|240171778|ref|ZP_04750437.1| putative glycosyl transferase [Mycobacterium kansasii ATCC 12478]
Length=240
Score = 387 bits (994), Expect = 1e-105, Method: Compositional matrix adjust.
Identities = 190/239 (80%), Positives = 208/239 (88%), Gaps = 0/239 (0%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
MFSIIIPTLNVA+ L CLDSIA QT DFELVLVDGGSTD TLDIA+ F PN GERL++
Sbjct 1 MFSIIIPTLNVASALGFCLDSIACQTFRDFELVLVDGGSTDATLDIAHSFVPNFGERLVV 60
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 143
TD+GVYDAMNRGV LA+GTWLLFLGADD+LYEA TLARVAAFIGEHEPSDLVYGDV+
Sbjct 61 RSGTDRGVYDAMNRGVSLASGTWLLFLGADDALYEAATLARVAAFIGEHEPSDLVYGDVL 120
Query 144 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 203
MRST FR+GGAFDLDRLLFKRNICHQ+IFYRR LF +IGPYNLRYR LADWDFNIRCF+N
Sbjct 121 MRSTKFRYGGAFDLDRLLFKRNICHQSIFYRRELFSSIGPYNLRYRSLADWDFNIRCFAN 180
Query 204 PALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRA 262
PALVTR+M VVVA +NE GGLSNT VDK FLKRLP++TRLGIRLVI+ R W K +R
Sbjct 181 PALVTRHMDVVVARFNELGGLSNTFVDKAFLKRLPITTRLGIRLVIIAARWWVKWRARG 239
>gi|289763135|ref|ZP_06522513.1| glycosyl transferase [Mycobacterium tuberculosis GM 1503]
gi|289710641|gb|EFD74657.1| glycosyl transferase [Mycobacterium tuberculosis GM 1503]
Length=177
Score = 362 bits (930), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 176/177 (99%), Positives = 177/177 (100%), Gaps = 0/177 (0%)
Query 99 VDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLD 158
+DLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLD
Sbjct 1 MDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLD 60
Query 159 RLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASY 218
RLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASY
Sbjct 61 RLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASY 120
Query 219 NEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 275
NEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR
Sbjct 121 NEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR 177
>gi|41407332|ref|NP_960168.1| hypothetical protein MAP1234 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|5524329|gb|AAD44222.1|AF143772_28 GtfD [Mycobacterium avium]
gi|3550476|emb|CAA11574.1| gsd [Mycobacterium avium subsp. silvaticum]
gi|3550481|emb|CAA11578.1| gsd [Mycobacterium avium subsp. paratuberculosis]
gi|4416476|gb|AAD20376.1| glycosyltransferase gtfD [Mycobacterium avium]
gi|41395684|gb|AAS03551.1| hypothetical protein MAP_1234 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336457968|gb|EGO36956.1| glycosyl transferase [Mycobacterium avium subsp. paratuberculosis
S397]
Length=266
Score = 311 bits (798), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 152/221 (69%), Positives = 171/221 (78%), Gaps = 1/221 (0%)
Query 20 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 79
M AP+FSIIIPT N A L ACL SI QT + E+VLVDGGSTD TLDIAN F P LG
Sbjct 1 MTAPVFSIIIPTFNAAVTLQACLGSIVGQTYREVEVVLVDGGSTDRTLDIANSFRPELGS 60
Query 80 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 139
RL++H D G YDAMNRGV +ATG W+LFLGADD+LYE TLA+VAAF+G+H S LVY
Sbjct 61 RLVVHSGPDDGPYDAMNRGVGVATGEWVLFLGADDTLYEPTTLAQVAAFLGDHAASHLVY 120
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
GDV+MRST R G FDLDRLLF+ N+CHQ+IFYRR LF IGPYNLRYRV ADWDFNIR
Sbjct 121 GDVVMRSTKSRHAGPFDLDRLLFETNLCHQSIFYRRELFDGIGPYNLRYRVWADWDFNIR 180
Query 200 CFSNPALVTRYMHVVVASYNEFGGLS-NTIVDKEFLKRLPM 239
CFSNPAL+TRYM VV++ YN+ G S DKEF KRLPM
Sbjct 181 CFSNPALITRYMDVVISEYNDMTGFSMRQGTDKEFRKRLPM 221
>gi|296170616|ref|ZP_06852192.1| PGL/P-HBAD biosynthesis glycosyltransferase [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894707|gb|EFG74440.1| PGL/P-HBAD biosynthesis glycosyltransferase [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=252
Score = 310 bits (795), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/247 (64%), Positives = 184/247 (75%), Gaps = 5/247 (2%)
Query 20 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 79
M APMFSI++PT N AA L ACLDS+ RQTC DFE++LVDG STD TL +A A L
Sbjct 2 MTAPMFSIVVPTFNAAATLRACLDSLVRQTCDDFEVILVDGESTDGTLGVAGSHADQLRG 61
Query 80 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 139
R+++H D+G YDAMN G+ +A G WLLFLGADD+LYEA+TL VA FI E SDLVY
Sbjct 62 RMVVHCGPDRGAYDAMNHGIGMAKGQWLLFLGADDTLYEAETLENVAVFINECGYSDLVY 121
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
GDVI+ ST R GGAFDLDRLLF+ NICHQAIFYR+ LF IGPYNLRYR+ ADWDFNIR
Sbjct 122 GDVILCSTGSRLGGAFDLDRLLFETNICHQAIFYRKELFAGIGPYNLRYRLWADWDFNIR 181
Query 200 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVR----RW 255
CFSNPAL+TRYM VVVA YN+ GLS + DKEF KRLP STR +R+ + R RW
Sbjct 182 CFSNPALITRYMSVVVAKYNDVTGLSVS-EDKEFKKRLPNSTRFVLRVPLTSRRTLDLRW 240
Query 256 PKVISRA 262
+ +R+
Sbjct 241 GRPYARS 247
>gi|254775703|ref|ZP_05217219.1| hypothetical protein MaviaA2_13695 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=262
Score = 306 bits (785), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 150/217 (70%), Positives = 168/217 (78%), Gaps = 1/217 (0%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
MFSIIIPT N A L ACL SI QT + E+VLVDGGSTD TLDIAN F P LG RL++
Sbjct 1 MFSIIIPTFNAAVTLQACLGSIVGQTYREVEVVLVDGGSTDRTLDIANSFRPELGSRLVV 60
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 143
H D G YDAMNRGV +ATG W+LFLGADD+LYE TLA+VAAF+G+H S LVYGDV+
Sbjct 61 HSGPDDGPYDAMNRGVGVATGEWVLFLGADDTLYEPTTLAQVAAFLGDHAASHLVYGDVV 120
Query 144 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 203
MRST R G FDLDRLLF+ N+CHQ+IFYRR LF IGPYNLRYRV ADWDFNIRCFSN
Sbjct 121 MRSTKSRHAGPFDLDRLLFETNLCHQSIFYRRELFDGIGPYNLRYRVWADWDFNIRCFSN 180
Query 204 PALVTRYMHVVVASYNEFGGLS-NTIVDKEFLKRLPM 239
PAL+TRYM VV++ YN+ G S DKEF KRLPM
Sbjct 181 PALITRYMDVVISEYNDMTGFSMRQGTDKEFRKRLPM 217
>gi|118467159|ref|YP_882435.1| hypothetical protein MAV_3253 [Mycobacterium avium 104]
gi|118168446|gb|ABK69343.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|213019972|dbj|BAG84180.1| hypothetical protein [Mycobacterium intracellulare]
Length=223
Score = 199 bits (505), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 99/144 (69%), Positives = 110/144 (77%), Gaps = 1/144 (0%)
Query 97 RGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFD 156
RG WLLFLGADD+L + DTLARVAAFI +HE DLVYGDV+MRST R G FD
Sbjct 36 RGDGPRPQEWLLFLGADDALDQPDTLARVAAFIDDHETCDLVYGDVVMRSTGTRHAGPFD 95
Query 157 LDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVA 216
LDRLLF+ N+CHQ+IFYRR LF IGPYNLRY + ADWDFNIRCFSNPALVT YM +VVA
Sbjct 96 LDRLLFETNMCHQSIFYRRKLFEGIGPYNLRYPIWADWDFNIRCFSNPALVTCYMDIVVA 155
Query 217 SYNEFGGLS-NTIVDKEFLKRLPM 239
YN+ GLS D+EF KRLPM
Sbjct 156 RYNDMTGLSMRESTDREFRKRLPM 179
>gi|325105037|ref|YP_004274691.1| glycosyl transferase family 2 [Pedobacter saltans DSM 12145]
gi|324973885|gb|ADY52869.1| glycosyl transferase family 2 [Pedobacter saltans DSM 12145]
Length=281
Score = 179 bits (454), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/217 (43%), Positives = 140/217 (65%), Gaps = 15/217 (6%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P+FSIIIPT N L + +DS+ +QT DFE++++DG S D TL++AN F N R+
Sbjct 2 PLFSIIIPTYNSELFLHSAIDSVLKQTYSDFEILIIDGASLDNTLNVANSFNDN---RIK 58
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
I+ + D+G+YDAMN+G+ + G WL FLG+DD LY+ + LA V + I + +YG+V
Sbjct 59 IYSEKDKGIYDAMNKGIQKSNGEWLYFLGSDDKLYDTEVLADVKSNI--KSTNSFIYGNV 116
Query 143 -IMRSTNFR-----WGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDF 196
I+ TN+ + G FD+ + LFK+NI HQAIFY R F G +N+ Y++ AD+D+
Sbjct 117 KIISDTNWAKKGELYDGKFDMKK-LFKKNISHQAIFYNRNTFVKFGVFNIDYKICADFDY 175
Query 197 NIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEF 233
N++ FSN +Y++ +A ++ GG+S T VD+ F
Sbjct 176 NLKIFSNST--PQYINRTIAYFSG-GGISTTGVDEAF 209
>gi|313203617|ref|YP_004042274.1| glycosyl transferase family 2 [Paludibacter propionicigenes WB4]
gi|312442933|gb|ADQ79289.1| glycosyl transferase family 2 [Paludibacter propionicigenes WB4]
Length=274
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/221 (43%), Positives = 137/221 (62%), Gaps = 16/221 (7%)
Query 20 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 79
M FSIIIPT N + LP C++S+ Q+ DFE++++DG STD+TL+IA F E
Sbjct 1 MKHVKFSIIIPTFNSDSTLPQCINSVILQSFMDFEILILDGVSTDDTLNIALRFND---E 57
Query 80 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 139
R+ I + D G+YDAMN+G+ LA G WL FLG+DD LY+ LA++ I + ++Y
Sbjct 58 RIKIISEPDNGIYDAMNKGIKLAQGEWLFFLGSDDELYDEFVLAKINIEIAQ-TNRKIIY 116
Query 140 GDVIMRSTNFRWG-------GAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLA 192
G+V + N W G F+L++LL + NICHQAIFY +F G YN Y++ A
Sbjct 117 GNVKIVG-NAGWATDGQIYDGVFNLNKLL-EYNICHQAIFYHSSVFKKNGLYNENYKICA 174
Query 193 DWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEF 233
D+DFNIRC++N Y+ ++++ +N GG SN ++D F
Sbjct 175 DYDFNIRCWANFHF--HYIDLIISLFNG-GGASNFLIDDNF 212
>gi|258516003|ref|YP_003192225.1| glycosyl transferase family 2 [Desulfotomaculum acetoxidans DSM
771]
gi|257779708|gb|ACV63602.1| glycosyl transferase family 2 [Desulfotomaculum acetoxidans DSM
771]
Length=246
Score = 155 bits (393), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 87/224 (39%), Positives = 131/224 (59%), Gaps = 14/224 (6%)
Query 18 VAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNL 77
+++ +FSII PT N AA + ++S+ Q FE ++VDG S D T + ++ +
Sbjct 4 ISLERKIFSIITPTYNCAAKIKKTIESVLSQNKSLFEYIIVDGLSDDGTSEEIEKYSDEI 63
Query 78 GERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSD- 136
+LI+ +D +GVYDAMN+G+D+A+G +L FLGA D L E L ++ + P D
Sbjct 64 --KLIVGKD--KGVYDAMNKGIDIASGNYLYFLGAGDQLKEG-ILEKIKPIM----PQDG 114
Query 137 --LVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
VYG+V N + G F +L KRNICHQAIFY R +F +G Y+L++R AD+
Sbjct 115 LSFVYGNVYRLDHNRIYDGQFKKHKLS-KRNICHQAIFYERNIFDLVGKYDLKFRTKADY 173
Query 195 DFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLP 238
N+RCF + Y++ V+A Y F GLS+ D++F+K P
Sbjct 174 ALNLRCFGCREIDKIYVNEVIAEYEGF-GLSSDKEDEDFMKEKP 216
>gi|89890626|ref|ZP_01202136.1| glycosyl transferase, family 2 [Flavobacteria bacterium BBFL7]
gi|89517541|gb|EAS20198.1| glycosyl transferase, family 2 [Flavobacteria bacterium BBFL7]
Length=247
Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/220 (39%), Positives = 126/220 (58%), Gaps = 14/220 (6%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSIIIPT N ++ L ++SI QT +E++++DG S+D T+++A F N +
Sbjct 6 PFFSIIIPTYNASSTLATAIESILSQTYTSYEILIMDGESSDTTVEVAKSFLIN-NSVIS 64
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHE-----PSDL 137
I +D G+YDAMN G+ A G +L FLGADD L + + L + H+ +D+
Sbjct 65 ITSQSDDGIYDAMNHGIKKANGIYLYFLGADDYLIDNNVLEDI------HQQLILTSTDV 118
Query 138 VYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFN 197
+YG+V S + G D D+LLF +NI HQ+IF+ + +F G +NL YR ADW N
Sbjct 119 IYGNVQSPSLGSNYMGECD-DQLLFHKNIAHQSIFFHKRVFELTGNFNLEYRTHADWAHN 177
Query 198 IRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRL 237
I F NP + +Y++ ++A Y + G S+ D F K L
Sbjct 178 INWFFNPEISNQYLNRIIAHYAD-DGYSSRTKDHHFRKDL 216
>gi|148265823|ref|YP_001232529.1| glycosyl transferase family protein [Geobacter uraniireducens
Rf4]
gi|146399323|gb|ABQ27956.1| glycosyl transferase, family 2 [Geobacter uraniireducens Rf4]
Length=302
Score = 151 bits (381), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/208 (38%), Positives = 124/208 (60%), Gaps = 4/208 (1%)
Query 26 SIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHR 85
SI+ PT N + CL S+A QT + E +++DG STD TL+I +A I
Sbjct 8 SIVTPTYNAVNDIETCLLSVAHQTYKNKEHLIIDGASTDATLEIVKRYADKYSHIKFI-S 66
Query 86 DTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMR 145
+ D G+YDAMN+ +DL +G W+ FLG DD Y DT+ IG + D+VYG+V+
Sbjct 67 EKDNGIYDAMNKAIDLTSGEWIYFLGCDDVFYN-DTVLEEIFNIGGIDLFDVVYGNVLWG 125
Query 146 STNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPA 205
T + G F +L+ +NICHQAIFY + +F +G ++ +Y++LAD+ FN++ F+N
Sbjct 126 DTGKIYDGKFSTLKLM-DQNICHQAIFYEKSIFSKLGKFDTKYKILADYVFNMKWFNNDG 184
Query 206 LVTRYMHVVVASYNEFGGLSNTIVDKEF 233
+ ++Y++ ++A Y G S+ + D +F
Sbjct 185 VRSKYVNNIIAKYG-INGRSSNVQDMDF 211
>gi|218245578|ref|YP_002370949.1| family 2 glycosyl transferase [Cyanothece sp. PCC 8801]
gi|218166056|gb|ACK64793.1| glycosyl transferase family 2 [Cyanothece sp. PCC 8801]
Length=249
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/252 (37%), Positives = 140/252 (56%), Gaps = 23/252 (9%)
Query 26 SIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHR 85
S+IIPT N + L+SI Q D+E+ +VDG S D T+ I + + +
Sbjct 5 SVIIPTYNSQETISKSLESIINQDYSDYEICIVDGNSNDNTIKIIQEYCRRF-DCIKFLS 63
Query 86 DTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARV-AAFIGEHEPSDLVYGDVIM 144
+ DQG YDAMN+G+D++ G WL FLG+ D +Y+ +T +V A I E L+YG+VI
Sbjct 64 EKDQGPYDAMNKGIDISGGEWLFFLGSSDLIYDKNTFQKVFTAPISED--IGLLYGNVIF 121
Query 145 ------RSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNI 198
+ G F +++LL K NICHQ+IFYRR +F +G YNL+Y + ADW+ N+
Sbjct 122 LDDIGGAKAGQIYDGEFTIEKLLVK-NICHQSIFYRRSVFKKLGKYNLKYPICADWEMNM 180
Query 199 RCFSNPALVTRYMHVVVASYNEFGGLSNT-------IVDKEFLKRLPMSTRLGIRLVIVL 251
R FS+ + Y+ + +A ++ GGLS+T D +FLK L R + L
Sbjct 181 RFFSSTKV--SYLDLTIACFSG-GGLSSTRNIQDPIYNDLQFLKIKYFMNYLIYRKIYRL 237
Query 252 VRR--WPKVISR 261
V R + K+++R
Sbjct 238 VNRLFFNKIMAR 249
>gi|257058624|ref|YP_003136512.1| family 2 glycosyl transferase [Cyanothece sp. PCC 8802]
gi|256588790|gb|ACU99676.1| glycosyl transferase family 2 [Cyanothece sp. PCC 8802]
Length=249
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 91/255 (36%), Positives = 138/255 (55%), Gaps = 29/255 (11%)
Query 26 SIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHR 85
S+IIPT N + L+SI Q D+E+ +VDG S D T+ I + + +
Sbjct 5 SVIIPTYNSQETISKSLESIINQDYSDYEICIVDGNSNDNTIKIIQEYCRRF-DFIKFLS 63
Query 86 DTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSD----LVYGD 141
+ DQG YDAMN+G+D++ G WL FLG+ D +Y+ +T +V EP L+YG+
Sbjct 64 EKDQGPYDAMNKGIDISGGEWLFFLGSSDLIYDKNTFQKVFT-----EPISEDIGLLYGN 118
Query 142 VIM------RSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWD 195
VI + G F +++LL K NICHQ+IFYRR + +G YNL+Y + ADW+
Sbjct 119 VIFLDDIGCAKAGQIYDGEFTIEKLLVK-NICHQSIFYRRSVLKKLGKYNLKYPICADWE 177
Query 196 FNIRCFSNPALVTRYMHVVVASYNEFGGLSNT-------IVDKEFLKRLPMSTRLGIRLV 248
N+R FS+ + Y+ + +A ++ GGLS+T D +FLK L R +
Sbjct 178 MNMRFFSSTKV--SYLDLTIACFSG-GGLSSTRNIEDPIYNDLQFLKIKYFMNYLIYRKI 234
Query 249 IVLVRR--WPKVISR 261
LV R + K+++R
Sbjct 235 YRLVNRLFFNKIMAR 249
>gi|325105035|ref|YP_004274689.1| glycosyl transferase family 2 [Pedobacter saltans DSM 12145]
gi|324973883|gb|ADY52867.1| glycosyl transferase family 2 [Pedobacter saltans DSM 12145]
Length=240
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/223 (37%), Positives = 125/223 (57%), Gaps = 6/223 (2%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
MFSIIIPT N + + S Q+ DFE++++D STD T+++ F +R+ +
Sbjct 1 MFSIIIPTYNSEKTIHRAIKSCINQSFKDFEVIVIDNLSTDNTINLITSFD---DKRISV 57
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 143
D G+YDAMN+G+ + G WLLFLG+DD L+ ++TLA V I + S L+YGDV+
Sbjct 58 FSQADIGIYDAMNKGILKSAGEWLLFLGSDDQLFNSNTLALVWETILKAPRSKLIYGDVM 117
Query 144 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 203
R ++ +LL + ICHQ+I+Y + LF I Y+ Y++ ADWDFN++ F
Sbjct 118 KSDGTLRSYSNYNYIKLL-RTCICHQSIYYHKSLFINI-KYDTNYKICADWDFNLKVFRK 175
Query 204 PALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIR 246
T Y++ ++ +N G SN E+L+ + IR
Sbjct 176 KNHPT-YINTPLSIFNLSGISSNWSKHPEYLQNFANNKTAAIR 217
>gi|213962286|ref|ZP_03390549.1| putative colanic acid biosynthesis glycosyl transferase WcaE
[Capnocytophaga sputigena Capno]
gi|213954952|gb|EEB66271.1| putative colanic acid biosynthesis glycosyl transferase WcaE
[Capnocytophaga sputigena Capno]
Length=233
Score = 143 bits (361), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/235 (36%), Positives = 139/235 (60%), Gaps = 18/235 (7%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGD---FELVLVDGGSTDETLDIANIFAPNLGER 80
+ SIIIPT N A+ + CL S+ +Q G+ +E+V+VDG STDETL I F + +
Sbjct 2 LLSIIIPTYNSASHIAHCL-SLIQQQLGENQLYEVVIVDGKSTDETLTIVESFTQKIPQL 60
Query 81 LIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSL-YEADTLARVAAFIGEHEPSDLVY 139
I + D+G+YDAMN+G+ LA G +L FLGADD + + + + +V + + Y
Sbjct 61 RWI-SEKDKGIYDAMNKGIALAHGKFLYFLGADDEITFSINQITQVL-----QNNNTIYY 114
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
G+V++++T + G F +L+ ++NICHQAIFY + + PYNL+Y++LAD++ N++
Sbjct 115 GNVLLKNTQTLYDGVFTTAKLI-EQNICHQAIFYPKKIVEQ-HPYNLKYKLLADYELNLK 172
Query 200 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRL--PMSTRLGIRLVIVLV 252
+ Y+ + +A Y+ G+S + DK+F + LGI+ VI+ +
Sbjct 173 LWKKYKF--EYVDLTIAVYSN-DGISTQVKDKQFKSDFLSILYKYLGIKYVIIKI 224
>gi|158520542|ref|YP_001528412.1| glycosyl transferase family protein [Desulfococcus oleovorans
Hxd3]
gi|158509368|gb|ABW66335.1| glycosyl transferase family 2 [Desulfococcus oleovorans Hxd3]
Length=285
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/213 (40%), Positives = 121/213 (57%), Gaps = 9/213 (4%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P+ SII LN A L + S+ QT DFE +++DGGSTD TLDI + +L +
Sbjct 17 PLVSIITVVLNGADTLERAIKSVVSQTFRDFEYIIIDGGSTDGTLDIIRRYESHLTCWI- 75
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ D+G+YDAMN+GV L+ G W+ FLGADD Y D AA + + + + YG+V
Sbjct 76 --SEPDKGLYDAMNKGVALSKGEWVYFLGADD--YLLDGFTSAAARL--KDGTTVYYGNV 129
Query 143 IMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFS 202
+ + G F +L RNICHQ+IFY R ++ YNL+Y V AD+D N+RCF
Sbjct 130 YRPVADRIYDGRFSFYKLAC-RNICHQSIFYPRCVWKKFS-YNLKYPVFADYDLNLRCFV 187
Query 203 NPALVTRYMHVVVASYNEFGGLSNTIVDKEFLK 235
+ + +Y+ +A + + GGLS DK F K
Sbjct 188 DDGIQFKYIPDTIAVFTDEGGLSPRQADKAFEK 220
>gi|171910983|ref|ZP_02926453.1| hypothetical glycosyl transferase [Verrucomicrobium spinosum
DSM 4136]
Length=246
Score = 139 bits (351), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/203 (42%), Positives = 121/203 (60%), Gaps = 6/203 (2%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
+FS+I+P L+ L LDS+A Q D E++++DGG++ T + + N G L
Sbjct 4 LFSVILPVLHADGKLGETLDSVASQAFEDLEVLVMDGGASPSTERLVQRSSLNEGGGLRY 63
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVI 143
DQG+YDAMN +DLA+G ++ F+GA D+L LA V + ++ S LVYG+V
Sbjct 64 VGKPDQGIYDAMNEALDLASGRYIYFIGAGDTL-RPHALAAVRDQLPTND-SSLVYGNV- 120
Query 144 MRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSN 203
MR + G FD RL RNICHQA FY R +F +G ++L Y ADW+FN+RC+ +
Sbjct 121 MRGAQI-YDGPFDAWRLC-HRNICHQAAFYGRDIFQLLGRFSLSYPACADWEFNMRCYGD 178
Query 204 PALVTRYMHVVVASYNEFGGLSN 226
+ RY+ VVA + E GGLS+
Sbjct 179 SRIPKRYIETVVADF-EQGGLSS 200
>gi|265766794|ref|ZP_06094623.1| glycosyl transferase [Bacteroides sp. 2_1_16]
gi|263253171|gb|EEZ24647.1| glycosyl transferase [Bacteroides sp. 2_1_16]
Length=241
Score = 136 bits (342), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/217 (36%), Positives = 122/217 (57%), Gaps = 9/217 (4%)
Query 20 MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGE 79
M P SII + N + + + S+ +Q+ + E +++DGGSTD TLDI + +
Sbjct 1 MIEPKISIITVSYNAVSTIEETILSVIKQSYVNIEYIIIDGGSTDGTLDIIKKYQSKISY 60
Query 80 RLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVY 139
+ H D G+YDAMN+G+ ATG W+ FLGADD L D + +V ++ + +VY
Sbjct 61 WVSEH---DNGIYDAMNKGIAYATGDWIYFLGADDLLVR-DIINKVYPYLNLKQ--GIVY 114
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
G+V M ++ G F +L RNI HQA FY + +F YNL Y LAD+ N++
Sbjct 115 GNVYMIGKQKKYDGKFS-SLMLVTRNIPHQATFYSKKIFCNYS-YNLEYVYLADYYLNLQ 172
Query 200 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKR 236
C+ +P +V +Y+ + +A YN+ +S+ +D+ F R
Sbjct 173 CWKDPNIVFKYIPLTIAIYND-TSVSSVEIDRCFESR 208
>gi|237808793|ref|YP_002893233.1| glycosyl transferase family 2 [Tolumonas auensis DSM 9187]
gi|237501054|gb|ACQ93647.1| glycosyl transferase family 2 [Tolumonas auensis DSM 9187]
Length=242
Score = 135 bits (341), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/214 (40%), Positives = 128/214 (60%), Gaps = 12/214 (5%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P+FS+II N A + + S+ Q EL+++DGGS D T+DI + +
Sbjct 3 PIFSVIIVVFNAANYIEDAILSVISQHYKKIELIIIDGGSIDGTVDIIKKYDEYIS---F 59
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGE-HEPSDLVYGD 141
+ D G+YDAMN+GV +A G +L FLGADD L + L VA IG+ + + + YGD
Sbjct 60 WRSEPDNGIYDAMNKGVRIAKGDYLYFLGADDRL--LNVLDEVARLIGKIDDNNSIFYGD 117
Query 142 VIMR-STNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIG-PYNLRYRVLADWDFNIR 199
VI+ + + GG F+ ++LFK NI HQAIFY RG F +G YN +Y++LAD+ N+
Sbjct 118 VILDGDISKKLGGRFNKVKILFK-NIPHQAIFYPRGFF--LGREYNCKYKLLADYALNLE 174
Query 200 CFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEF 233
FS + + +YM++VVA+Y+ G S+ + D +F
Sbjct 175 AFSLNSNIWKYMNLVVAAYST-NGASSVLKDNDF 207
>gi|302340722|ref|YP_003805928.1| family 2 glycosyl transferase [Spirochaeta smaragdinae DSM 11293]
gi|301637907|gb|ADK83334.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293]
Length=249
Score = 134 bits (336), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 82/248 (34%), Positives = 131/248 (53%), Gaps = 19/248 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P+FSII N + ++S+ Q+ DFE ++VDGGSTD TLDI N +A +L
Sbjct 2 PLFSIITVCYNAKTTISNTINSVLNQSFTDFEYIIVDGGSTDGTLDIVNPYAEQ--RKLG 59
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ ++D G+YDAMN+G+ LA G +L FL ADD+L + L VA I + YG++
Sbjct 60 LISESDDGIYDAMNKGIQLANGKYLNFLNADDTLCDESVLMSVADQIHNMPNKQIYYGNL 119
Query 143 IMRSTNFRWGGAF----DLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNI 198
+ R+ + F D+ L I QA+FY + +F +G +N R R++ D+++ +
Sbjct 120 LYRNRDGNIDQYFRPNQDIRNFLRDNCITQQALFYDKEVFSVVGKFNSRIRIVGDYEWLL 179
Query 199 RCFSNPALVTRYMHVVVASYNEFGGLS---------NTIVDKEFLKRLPMSTRLGIRLVI 249
R + + YM+V ++++ GG+S N +K +L P+ IR I
Sbjct 180 RAVFRKKITSYYMNVNISNF-TIGGVSTGRMYKRIHNKEREKLYLVYFPVKY---IRRKI 235
Query 250 VLVRRWPK 257
+RRW K
Sbjct 236 SNLRRWGK 243
>gi|311746617|ref|ZP_07720402.1| PGL/P-HBAD biosynthesis glycosyltransferase [Algoriphagus sp.
PR1]
gi|126578279|gb|EAZ82443.1| PGL/P-HBAD biosynthesis glycosyltransferase [Algoriphagus sp.
PR1]
Length=249
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/209 (36%), Positives = 117/209 (56%), Gaps = 9/209 (4%)
Query 26 SIIIPTLNVAAVLPACLDSI-ARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIH 84
SIIIP N L L SI ++TC FE++++D S+D F E ++
Sbjct 8 SIIIPNFNSGKFLENTLSSIFLKKTCFSFEVLIIDNNSSDNPFQYVKKFPL---ESILFC 64
Query 85 RDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIM 144
D D GVY+AMN+GV LA G W++FLGA D L + +++ ++ + L+YG+ ++
Sbjct 65 SDKDNGVYEAMNKGVKLAKGNWIIFLGAGDEL-KVESVNQIE--FNPLQNLKLIYGNTLL 121
Query 145 RSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNP 204
+ G F L +L+ KRNI HQAIFY LF G ++ YR+ AD+ +N++ F N
Sbjct 122 LKNKKTYDGEFSLLKLM-KRNISHQAIFYHHSLFLENGSFDTEYRIAADYMYNLKVFLNT 180
Query 205 ALVTRYMHVVVASYNEFGGLSNTIVDKEF 233
++ +Y+ +VV+ + GGLS+ D F
Sbjct 181 SIKVQYVPLVVSHFLG-GGLSDIKRDDLF 208
>gi|148265247|ref|YP_001231953.1| glycosyl transferase family protein [Geobacter uraniireducens
Rf4]
gi|146398747|gb|ABQ27380.1| glycosyl transferase, family 2 [Geobacter uraniireducens Rf4]
Length=268
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/264 (33%), Positives = 144/264 (55%), Gaps = 16/264 (6%)
Query 16 KKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAP 75
KK P+ S+IIP LN L + S+ Q + E+V++DGGS D T++I +
Sbjct 13 KKSWPGKPLISVIIPVLNRVGSLEETVRSVLHQQYDNVEIVVIDGGSIDGTVEIIKKY-- 70
Query 76 NLGERLIIH--RDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHE 133
E I + + D G+YDAMN+G A+G WL FLG+DD L + L +VAA++ +
Sbjct 71 ---EDYIDYWCSNPDNGIYDAMNKGARAASGDWLYFLGSDDIL--LNVLHKVAAYL--QK 123
Query 134 PSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLAD 193
+++ YGDV + + + G F +L+ K NI HQA FY + LF Y+L+Y+ D
Sbjct 124 ENEVYYGDVYLPKWHKIYDGPFTPYKLM-KINIPHQATFYPQALFRK-HSYDLKYKSAGD 181
Query 194 WDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLK-RLPMSTRLGIRLVI--V 250
+ FN+ C+++ Y+ ++VA +++ GG+S T D +F + R + R +
Sbjct 182 YFFNLVCYNDQDFTYVYLPILVAIFDDAGGVSATAGDPDFERDRCHILKEYFGRFIYYQY 241
Query 251 LVRRWPKVISRAMVMRTVISWRRR 274
++R+ K I R ++ + V +R+
Sbjct 242 VLRKALKYIERNLIRKVVHRMKRK 265
>gi|225849016|ref|YP_002729180.1| glycosyl transferase, family 2 [Sulfurihydrogenibium azorense
Az-Fu1]
gi|225644103|gb|ACN99153.1| glycosyl transferase, family 2 [Sulfurihydrogenibium azorense
Az-Fu1]
Length=361
Score = 129 bits (324), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/219 (37%), Positives = 122/219 (56%), Gaps = 19/219 (8%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P+ ++I+ T N L L S+ Q + E +++DGGSTD T+DI + E I
Sbjct 143 PLITVILATFNAEEHLEKALQSVINQIYPNIEYIIIDGGSTDGTIDIIKKY-----EDFI 197
Query 83 IH--RDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEP---SDL 137
+ D+G+YDAMN+G+ + G +L FLG+DD + + + FI E + S L
Sbjct 198 DYWVSKKDEGIYDAMNKGIICSLGDYLYFLGSDDEVILETVFSSI--FIEEIDNIKNSKL 255
Query 138 VYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFN 197
+YG+ I R + + G F +L FK N+CHQA FY+ LF T G YN +Y++ AD + N
Sbjct 256 IYGNAIFRDSAEIYDGKFLSYKLRFK-NLCHQATFYKSELFRTYGLYNTKYKIWADHERN 314
Query 198 IRCF--SNPALVTRYMHVVVASYNEFGGLSNTIVDKEFL 234
IR + SNP +Y++ +A Y G S+ +DK+FL
Sbjct 315 IRFWKVSNP----KYLNKAIAVYRRSGVSSSGEIDKDFL 349
>gi|198274036|ref|ZP_03206568.1| hypothetical protein BACPLE_00173 [Bacteroides plebeius DSM 17135]
gi|198273114|gb|EDY97383.1| hypothetical protein BACPLE_00173 [Bacteroides plebeius DSM 17135]
Length=254
Score = 127 bits (320), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/204 (39%), Positives = 111/204 (55%), Gaps = 15/204 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N AVL + S+ QT + E ++VDGGS D TL I + ++ +
Sbjct 9 PKFSIITVTYNAGAVLEDTIQSVITQTYKNVEYIIVDGGSKDRTLQIVEQYKEHIHTVI- 67
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD- 141
+ D+G+YDAMN+G+ LATG +L FL A D L+E DTL + I E D++YGD
Sbjct 68 --SEPDKGLYDAMNKGIKLATGDYLCFLNAGDELHEDDTLQLMVHSITEPTLPDVLYGDT 125
Query 142 -VIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
++ +F R +L+ FK +CHQA F RR L + PYNL YR AD+
Sbjct 126 AIVDEEGHFLHMRRLAPPENLNWKSFKEGMLVCHQAFFARRDL---VEPYNLHYRFSADF 182
Query 195 DFNIRCFSNPALVTRYMHVVVASY 218
D+ IR + V + H+V+ Y
Sbjct 183 DWCIRIMKK-SQVLHHTHLVLIDY 205
>gi|254881236|ref|ZP_05253946.1| glycosyltransferase family 2 [Bacteroides sp. 4_3_47FAA]
Length=258
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 81/204 (40%), Positives = 112/204 (55%), Gaps = 15/204 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N A VL + SI QT + E ++VDGGSTDETLDI + + ++ +
Sbjct 18 PKFSIITVTYNAAKVLEDTIQSIVTQTYKNLEYIIVDGGSTDETLDIIHKYQEHITTVI- 76
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYG-- 140
+ DQG+YDAMN+G+ LATG +L FL A D L+E DTL ++ I ++YG
Sbjct 77 --SEPDQGLYDAMNKGIKLATGDYLCFLNAGDGLHEDDTLLQMVHSINGTALPGVLYGET 134
Query 141 DVIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+++ +F R L FK+ +CHQA F RR L + PY+LRYR AD+
Sbjct 135 EIVDSQGHFLYMRRLSAPATLTWKSFKQGMLVCHQAFFARRDL---VEPYDLRYRFSADF 191
Query 195 DFNIRCFSNPALVTRYMHVVVASY 218
D+ IR A V H+ + Y
Sbjct 192 DWCIRIMKK-ADVLHNTHLTLIDY 214
>gi|150003465|ref|YP_001298209.1| glycosyl transferase family protein [Bacteroides vulgatus ATCC
8482]
gi|149931889|gb|ABR38587.1| glycosyltransferase family 2 [Bacteroides vulgatus ATCC 8482]
Length=259
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 81/204 (40%), Positives = 112/204 (55%), Gaps = 15/204 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N A VL + SI QT + E ++VDGGSTDETLDI + + ++ +
Sbjct 19 PKFSIITVTYNAAKVLEDTIQSIVTQTYKNLEYIIVDGGSTDETLDIIHKYQEHITTVI- 77
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYG-- 140
+ DQG+YDAMN+G+ LATG +L FL A D L+E DTL ++ I ++YG
Sbjct 78 --SEPDQGLYDAMNKGIKLATGDYLCFLNAGDGLHEDDTLLQMVHSINGTALPGVLYGET 135
Query 141 DVIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+++ +F R L FK+ +CHQA F RR L + PY+LRYR AD+
Sbjct 136 EIVDSQGHFLYMRRLSAPATLTWKSFKQGMLVCHQAFFARRDL---VEPYDLRYRFSADF 192
Query 195 DFNIRCFSNPALVTRYMHVVVASY 218
D+ IR A V H+ + Y
Sbjct 193 DWCIRIMKK-ADVLHNTHLTLIDY 215
>gi|294777653|ref|ZP_06743104.1| glycosyltransferase, group 2 family protein [Bacteroides vulgatus
PC510]
gi|319640238|ref|ZP_07994963.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_40A]
gi|294448721|gb|EFG17270.1| glycosyltransferase, group 2 family protein [Bacteroides vulgatus
PC510]
gi|317388013|gb|EFV68867.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_40A]
gi|345457795|gb|EET14338.2| glycosyltransferase family 2 [Bacteroides sp. 4_3_47FAA]
Length=248
Score = 125 bits (315), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/204 (40%), Positives = 112/204 (55%), Gaps = 15/204 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N A VL + SI QT + E ++VDGGSTDETLDI + + ++ +
Sbjct 8 PKFSIITVTYNAAKVLEDTIQSIVTQTYKNLEYIIVDGGSTDETLDIIHKYQEHITTVI- 66
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYG-- 140
+ DQG+YDAMN+G+ LATG +L FL A D L+E DTL ++ I ++YG
Sbjct 67 --SEPDQGLYDAMNKGIKLATGDYLCFLNAGDGLHEDDTLLQMVHSINGTALPGVLYGET 124
Query 141 DVIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+++ +F R L FK+ +CHQA F RR L + PY+LRYR AD+
Sbjct 125 EIVDSQGHFLYMRRLSAPATLTWKSFKQGMLVCHQAFFARRDL---VEPYDLRYRFSADF 181
Query 195 DFNIRCFSNPALVTRYMHVVVASY 218
D+ IR A V H+ + Y
Sbjct 182 DWCIRIMKK-ADVLHNTHLTLIDY 204
>gi|212695416|ref|ZP_03303544.1| hypothetical protein BACDOR_04965 [Bacteroides dorei DSM 17855]
gi|237711547|ref|ZP_04542028.1| glycosyltransferase family 2 protein [Bacteroides sp. 9_1_42FAA]
gi|237725999|ref|ZP_04556480.1| glycosyltransferase family 2 protein [Bacteroides sp. D4]
gi|265752974|ref|ZP_06088543.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_33FAA]
gi|212662051|gb|EEB22625.1| hypothetical protein BACDOR_04965 [Bacteroides dorei DSM 17855]
gi|229435807|gb|EEO45884.1| glycosyltransferase family 2 protein [Bacteroides dorei 5_1_36/D4]
gi|229454242|gb|EEO59963.1| glycosyltransferase family 2 protein [Bacteroides sp. 9_1_42FAA]
gi|263236160|gb|EEZ21655.1| glycosyltransferase family 2 [Bacteroides sp. 3_1_33FAA]
Length=248
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/204 (40%), Positives = 112/204 (55%), Gaps = 15/204 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N A VL + SI QT + E ++VDGGSTDETLDI + + ++ +
Sbjct 8 PKFSIITVTYNAAKVLEDTIQSIVTQTYKNLEYIIVDGGSTDETLDIIHKYQEHITTVI- 66
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYG-- 140
+ DQG+YDAMN+G+ LATG +L FL A D L+E DTL ++ I ++YG
Sbjct 67 --SEPDQGLYDAMNKGIKLATGDYLCFLNAGDGLHEDDTLLQMVHSINGTALPGVLYGET 124
Query 141 DVIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+++ +F R L FK+ +CHQA F RR L + PY+L+YR AD+
Sbjct 125 EIVDSQGHFLYMRRLSAPATLTWKSFKQGMLVCHQAFFARRDL---VEPYDLQYRFSADF 181
Query 195 DFNIRCFSNPALVTRYMHVVVASY 218
D+ IR A V H+ + Y
Sbjct 182 DWCIRIMKK-ADVLHNTHLTLIDY 204
>gi|340355316|ref|ZP_08678004.1| glycosyl transferase [Sporosarcina newyorkensis 2681]
gi|339622513|gb|EGQ27032.1| glycosyl transferase [Sporosarcina newyorkensis 2681]
Length=362
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/218 (32%), Positives = 120/218 (56%), Gaps = 14/218 (6%)
Query 18 VAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNL 77
+ P ++I N + + S+ QT G+ E ++VDG STD T++I + +
Sbjct 1 MTQGGPKITVITACYNSERTIEQTIQSVLDQTYGNIEYIIVDGDSTDGTMNIVEKYRDRI 60
Query 78 GERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDL 137
++ + D+GVYDA N+G++LATG ++ F+ ADD + +A ++ +H +
Sbjct 61 D---VVVSEKDRGVYDAFNKGIELATGDYINFMNADDYFSSDTIVNEIAGYLKDHRDIMM 117
Query 138 VYGDV-----IMRSTNFRWGGAFDLDRLLFKR-NIC-HQAIFYRRGLFGTIGPYNLRYRV 190
++GDV + +FR G +L + F++ N+C HQ++F + LF G ++L+Y +
Sbjct 118 LHGDVKAFDEVSGHWHFRGG---ELSLVDFEKGNMCPHQSVFTHKQLFKQFGGFDLKYEI 174
Query 191 LADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTI 228
LAD DF I+CF+ Y+ + VA++ GGLSN +
Sbjct 175 LADVDFTIKCFTQCEQKVVYLPIQVANF-RLGGLSNAL 211
>gi|218131537|ref|ZP_03460341.1| hypothetical protein BACEGG_03156 [Bacteroides eggerthii DSM
20697]
gi|217986469|gb|EEC52806.1| hypothetical protein BACEGG_03156 [Bacteroides eggerthii DSM
20697]
Length=248
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/208 (37%), Positives = 114/208 (55%), Gaps = 15/208 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N AVL + S+ QT E +++DG S D+TL IA+ + ++ + +
Sbjct 7 PKFSIITVTYNAEAVLEDTIQSVISQTYHHVEYIIIDGASKDKTLSIADRYKEHIAQVV- 65
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ + ++ D++YG+
Sbjct 66 --SEPDKGLYDAMNKGIRLATGDYLCFLNAGDSFHEDDTLQQMVHTLTGNQLPDVLYGET 123
Query 143 IM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+ + +F R L FK+ +CHQA F +R L + PYNL+YR AD+
Sbjct 124 ALVDKEGHFIRMRRLSAPETLTWKSFKQGMLVCHQAFFAKRSL---VEPYNLKYRFSADF 180
Query 195 DFNIRCFSNPALVTRYMHVVVASYNEFG 222
D+ IR A H+ + Y E G
Sbjct 181 DWCIRIMKK-AQRLHNTHLTIIDYLEEG 207
>gi|317474796|ref|ZP_07934070.1| glycosyl transferase family 2 [Bacteroides eggerthii 1_2_48FAA]
gi|316909477|gb|EFV31157.1| glycosyl transferase family 2 [Bacteroides eggerthii 1_2_48FAA]
Length=248
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/208 (37%), Positives = 114/208 (55%), Gaps = 15/208 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N AVL + S+ QT E +++DG S D+TL IA+ + ++ + +
Sbjct 7 PKFSIITVTYNAEAVLEDTIQSVISQTYHHVEYIIIDGASKDKTLSIADRYKEHIAQVV- 65
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ + ++ D++YG+
Sbjct 66 --SEPDKGLYDAMNKGIRLATGDYLCFLNAGDSFHEDDTLQQMVHTLTGNQLPDVLYGET 123
Query 143 IM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+ + +F R L FK+ +CHQA F +R L + PYNL+YR AD+
Sbjct 124 ALVDKEGHFIRMRRLSAPETLTWKSFKQGMLVCHQAFFAKRSL---MEPYNLKYRFSADF 180
Query 195 DFNIRCFSNPALVTRYMHVVVASYNEFG 222
D+ IR A H+ + Y E G
Sbjct 181 DWCIRIMKK-AQRLHNTHLTIIDYLEEG 207
>gi|336412731|ref|ZP_08593084.1| hypothetical protein HMPREF1017_00192 [Bacteroides ovatus 3_8_47FAA]
gi|335942777|gb|EGN04619.1| hypothetical protein HMPREF1017_00192 [Bacteroides ovatus 3_8_47FAA]
Length=249
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 77/209 (37%), Positives = 111/209 (54%), Gaps = 15/209 (7%)
Query 22 APMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERL 81
P FSII T N VL + S+ QT E ++VDG S D TL I N + P++ +
Sbjct 7 TPKFSIITVTYNAEKVLEDTIQSVISQTYHHIEYIIVDGASKDGTLSIINRYRPHIHTVV 66
Query 82 IIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD 141
+ D+G+YDAMN+G+ LA+G +L FL A D +E DTL ++ I +E D++YG+
Sbjct 67 ---SEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGNELPDILYGE 123
Query 142 --VIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLAD 193
++ + +F R L FK+ +CHQA F R L +GPYNL+YR AD
Sbjct 124 TAIVDKDRHFLRMRRLSAPETLTWKSFKQGMLVCHQAFFPRHTL---VGPYNLQYRFSAD 180
Query 194 WDFNIRCFSNPALVTRYMHVVVASYNEFG 222
+D+ IR A H+ + Y E G
Sbjct 181 FDWCIRIMKK-ARTLHNTHLTIIDYLEEG 208
>gi|337754893|ref|YP_004647404.1| glycosyl transferase family protein [Francisella sp. TX077308]
gi|336446498|gb|AEI35804.1| Glycosyl transferase, group 2 family protein [Francisella sp.
TX077308]
Length=249
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/253 (31%), Positives = 128/253 (51%), Gaps = 26/253 (10%)
Query 13 ANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANI 72
+N++ P+ +II N +L + S+ QT + E ++VDGGSTD TLD+
Sbjct 2 SNSQIQKSNYPLVTIITVVYNAKELLEETILSVLNQTYSNIEYIIVDGGSTDGTLDVIKK 61
Query 73 FAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEH 132
+ + + + + DQG+YDAMN+G+ LA G WL F+ A DS Y D L I EH
Sbjct 62 YQDKITKYI---SEPDQGIYDAMNKGMALAKGQWLNFMNAGDSFYSNDVLQE----IFEH 114
Query 133 EPSD---LVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYR 189
E S ++YG+ + ++ +L ++ +CHQ+ FYR YNL+Y+
Sbjct 115 EFSQDTAVIYGNTDIGHKILKYKSGLNLQDMVLGMMLCHQSTFYR---LDKAIKYNLKYK 171
Query 190 VLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLS----NTIVDKEF-------LKRLP 238
+ AD DF ++ F ++Y+ + V+ Y + G+S N I+ ++F L LP
Sbjct 172 ICADQDFTMQYFK-LGKKSKYLDMTVSKY-DLDGISSQNLNKILKEKFQINKSYNLSYLP 229
Query 239 MSTRLGIRLVIVL 251
+ I L++ L
Sbjct 230 VIKSYIISLLVKL 242
>gi|149277304|ref|ZP_01883446.1| putative LPS biosynthesis related glycosyltransferase [Pedobacter
sp. BAL39]
gi|149232181|gb|EDM37558.1| putative LPS biosynthesis related glycosyltransferase [Pedobacter
sp. BAL39]
Length=246
Score = 121 bits (303), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/235 (34%), Positives = 124/235 (53%), Gaps = 14/235 (5%)
Query 24 MFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLII 83
+ SIII T N + L +++ +E+++VDG S D T+ I N + +
Sbjct 13 LLSIIIATYNAERYIREGLLAVSNNIQAAYEIIIVDGNSQDNTVKIINQLHL---DHCHL 69
Query 84 HRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGE-HEPSDLVYGDV 142
+ D+G+YDAMN+G+ A G W+LFLGADD L L + E + + YG+
Sbjct 70 TSEPDKGIYDAMNKGIMSAKGKWILFLGADDRL-----LPSFNEILPELQDQHSIYYGNC 124
Query 143 IMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFS 202
R+ + GGAF RL K N+CHQAI Y R +F +NL+Y V AD+ N++C+
Sbjct 125 --RNGEHQLGGAFSQARLA-KMNLCHQAILYPRQVFEKY-KFNLKYPVFADYLLNMQCWG 180
Query 203 NPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPK 257
+ A+ Y+ V +A Y + G S+T D +F++ P + + L I + R+ K
Sbjct 181 DKAIHKIYVPVDIAYY-DMEGYSSTARDPDFIRDKPFFVKKHLGLFIYIKYRFRK 234
>gi|224025460|ref|ZP_03643826.1| hypothetical protein BACCOPRO_02200 [Bacteroides coprophilus
DSM 18228]
gi|224018696|gb|EEF76694.1| hypothetical protein BACCOPRO_02200 [Bacteroides coprophilus
DSM 18228]
Length=248
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/185 (42%), Positives = 105/185 (57%), Gaps = 14/185 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FSII T N AVL + S+ QT + E ++VDGGS D TLDI N + ++ +
Sbjct 7 PKFSIITVTYNAGAVLEDTIQSVITQTYRNVEYIIVDGGSKDHTLDIINRYREHIHTLV- 65
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD- 141
+ D+G+YDAMN+G+ LATG +L FL A D L+E DTL + I E D++YG+
Sbjct 66 --SEPDKGLYDAMNKGIRLATGDYLCFLNAGDELHEDDTLQLMVHSITGTELPDVLYGET 123
Query 142 -VIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
++ +F R +L+ FK +CHQA F RR L PY+LRYR AD+
Sbjct 124 AIVDEEGHFLRMRRLSAPENLNWKSFKDGMLVCHQAFFPRREL---AEPYDLRYRFSADF 180
Query 195 DFNIR 199
D+ IR
Sbjct 181 DWCIR 185
>gi|312892413|ref|ZP_07751908.1| glycosyl transferase family 2 [Mucilaginibacter paludis DSM 18603]
gi|311295197|gb|EFQ72371.1| glycosyl transferase family 2 [Mucilaginibacter paludis DSM 18603]
Length=244
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/215 (36%), Positives = 118/215 (55%), Gaps = 15/215 (6%)
Query 21 AAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGER 80
A + SIII T N A L CL+SI +QT E+V++DGGSTD ++D I N +
Sbjct 7 APSLVSIIIVTYNAAQFLQNCLNSIYKQTYPAIEIVVLDGGSTDGSVD---IIKANDNKI 63
Query 81 LIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLV-Y 139
+ D+G+YDAMN+ ++ G W+ FLGADD + E +A + E S+ + Y
Sbjct 64 AFWKSEPDEGIYDAMNKALEYIKGDWVYFLGADDIVLEG-----FSALAYDLEDSNAIYY 118
Query 140 GDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIR 199
G VI+R + +G A L K +CHQA+ Y +F +N RY++ AD++ N+R
Sbjct 119 GSVILRGNKY-YGKASKYQ--LAKSTLCHQAMIYPARVFKKY-RFNTRYQISADFELNMR 174
Query 200 CFSNPALVTRYMHVVVASYNEFGGLS--NTIVDKE 232
C+ + +L + VA++N G S + I +KE
Sbjct 175 CWRDKSLRFEFRDHTVANFNHTGASSKKDPIFEKE 209
>gi|312892412|ref|ZP_07751907.1| glycosyl transferase family 2 [Mucilaginibacter paludis DSM 18603]
gi|311295196|gb|EFQ72370.1| glycosyl transferase family 2 [Mucilaginibacter paludis DSM 18603]
Length=238
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/204 (36%), Positives = 110/204 (54%), Gaps = 11/204 (5%)
Query 22 APMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERL 81
AP SIII T N L A LDSI Q + E+V++DGGSTD TL+ + N +
Sbjct 6 APKISIIIVTYNAVKTLQAGLDSIYAQKYPNIEVVVIDGGSTDGTLE---LLKDNTSKLA 62
Query 82 IIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD 141
+ D+G+YDAMN+G++ +G W+ F+GADD L D + +A + P L Y +
Sbjct 63 YWKSEPDEGIYDAMNKGLNHISGQWVYFMGADDEL--RDEFSEMALEL--KRPGALYYAN 118
Query 142 VIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCF 201
V+ +N + F + K + HQAI Y +F T Y+ RY + AD++ N+RC+
Sbjct 119 VM---SNGKKKSGFVNEYYQAKAGVFHQAIIYSASIFKTY-RYDTRYSISADYELNMRCW 174
Query 202 SNPALVTRYMHVVVASYNEFGGLS 225
+ ++ Y V+A++N G S
Sbjct 175 RDQSIAFIYRDYVIANFNHNGASS 198
>gi|329962365|ref|ZP_08300370.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
gi|328530226|gb|EGF57107.1| glycosyltransferase, group 2 family protein [Bacteroides fluxus
YIT 12057]
Length=248
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/208 (37%), Positives = 111/208 (54%), Gaps = 15/208 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FS+I T N AVL + S+ QT E ++VDG S D TL + N + ++ R++
Sbjct 7 PKFSVITVTYNAEAVLEDTIQSVISQTYHHVEYIIVDGASKDNTLSVINRYKEHIS-RIV 65
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ I E D++YG+
Sbjct 66 --SEPDKGLYDAMNKGIKLATGDYLCFLNAGDSFHEDDTLQQMVHSITGKELPDILYGET 123
Query 143 IM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+ + +F R L FK+ +CHQA F + L + PY+L+YR AD+
Sbjct 124 ALVDKEGHFLRMRRLSAPQALTWKSFKQGMLVCHQAFFAKTSL---VEPYDLQYRFSADF 180
Query 195 DFNIRCFSNPALVTRYMHVVVASYNEFG 222
D+ IR A H+ + Y E G
Sbjct 181 DWCIRVMKK-ARTLHNTHLTIIDYLEEG 207
>gi|189423848|ref|YP_001951025.1| family 2 glycosyl transferase [Geobacter lovleyi SZ]
gi|189420107|gb|ACD94505.1| glycosyl transferase family 2 [Geobacter lovleyi SZ]
Length=296
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/229 (33%), Positives = 125/229 (55%), Gaps = 15/229 (6%)
Query 6 RYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDE 65
R+ G A T P+FS+I+ N +AVL + S+ Q + EL+++DGGSTD
Sbjct 36 RFLGFEKAGT----AGKPLFSVIMAVFNSSAVLEHAIRSVIYQDYDNVELIIIDGGSTDG 91
Query 66 TLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARV 125
++++ + + + + D G+Y A+N+G++ A G WL FLG+DD + + L ++
Sbjct 92 SVELIRKYDECIDYWV---SEPDDGIYYALNKGIEAAKGDWLYFLGSDDVM--LNCLHQL 146
Query 126 AA-FIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFGTIGPY 184
A F+ +P+ + YGDV + + +GG F R++ I HQA FY + LF +
Sbjct 147 ACRFV---DPNGVYYGDVYYPARHKLFGGEFSAYRVM-NCQIPHQATFYPKQLFLRY-KF 201
Query 185 NLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEF 233
+ RY +D+ FNI C+++ RY+ V+VA Y + G S +DK F
Sbjct 202 DTRYISASDYAFNIVCYNDKDFKYRYLPVLVAVYEDTSGFSTQDLDKRF 250
>gi|315921310|ref|ZP_07917550.1| conserved hypothetical protein [Bacteroides sp. D2]
gi|313695185|gb|EFS32020.1| conserved hypothetical protein [Bacteroides sp. D2]
Length=249
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 76/209 (37%), Positives = 109/209 (53%), Gaps = 15/209 (7%)
Query 22 APMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERL 81
P FSII T N VL + S+ QT E ++VDG S D TL I N + + +
Sbjct 7 TPKFSIITVTYNAEKVLEDTIQSVISQTYHHIEYIIVDGASKDGTLSIINRYRSRIHTVV 66
Query 82 IIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD 141
+ D+G+YDAMN+G+ LA+G +L FL A D +E DTL ++ I +E D++YG+
Sbjct 67 ---SEPDKGLYDAMNKGIALASGDYLCFLNAGDCFHEDDTLQQMVHTINGNELPDVLYGE 123
Query 142 --VIMRSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLAD 193
++ + +F R L FK+ +CHQA F R L +GPYNL+YR AD
Sbjct 124 TAIVDKDRHFLRMRRLSAPETLTWKSFKQGMLVCHQAFFPRHTL---VGPYNLQYRFSAD 180
Query 194 WDFNIRCFSNPALVTRYMHVVVASYNEFG 222
+D+ IR A H+ + Y E G
Sbjct 181 FDWCIRIMKK-ARTLHNTHLTIIDYLEEG 208
>gi|313149202|ref|ZP_07811395.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313137969|gb|EFR55329.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length=257
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/229 (35%), Positives = 118/229 (52%), Gaps = 16/229 (6%)
Query 22 APMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERL 81
+P FS+I T N VL + S+ QT E +++DG S D T+ I N + + R+
Sbjct 6 SPRFSVITVTYNAEKVLEDTIQSVISQTYHHVEYIIIDGASKDGTMAIVNRYRDRIS-RV 64
Query 82 IIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGD 141
+ + D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ I +E D++YG
Sbjct 65 V--SEPDKGLYDAMNKGIALATGDYLCFLNAGDSFHEDDTLQKIVHSINANELPDVLYGQ 122
Query 142 VIMRST--NF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLAD 193
+ T +F R L FK+ +CHQA F R L + PY+L+Y AD
Sbjct 123 TALVDTERHFLRMRRLSAPETLSWKSFKQGMLVCHQAFFARHTL---VEPYDLQYHFSAD 179
Query 194 WDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTR 242
+D+ IR A H+ + Y E G++ K L+R + TR
Sbjct 180 FDWCIRIMKK-ARTLHNTHLTLVDYLE-EGMTTQNRKKSLLERFRIMTR 226
>gi|167763889|ref|ZP_02436016.1| hypothetical protein BACSTE_02270 [Bacteroides stercoris ATCC
43183]
gi|167698005|gb|EDS14584.1| hypothetical protein BACSTE_02270 [Bacteroides stercoris ATCC
43183]
Length=248
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/185 (38%), Positives = 105/185 (57%), Gaps = 14/185 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FS+I T N AVL + S+ QT E ++VDG S D+TL I + + ++ +
Sbjct 7 PKFSVITVTYNAEAVLEDTIQSVISQTYRHVEYIIVDGASKDKTLSIIDRYKDHIAK--- 63
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ + D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ + E D++YG+
Sbjct 64 VTSEPDKGLYDAMNKGIRLATGDYLCFLNAGDSFHEDDTLQQMVHTLPGSELPDVMYGET 123
Query 143 IM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+ + +F R L F++ +CHQA F +R L + PYNL+YR AD+
Sbjct 124 ALVDKEGHFVRMRRLSAPETLTWKSFRQGMLVCHQAFFAKRTL---VEPYNLKYRFSADF 180
Query 195 DFNIR 199
D+ IR
Sbjct 181 DWCIR 185
>gi|53715791|ref|YP_101783.1| putative glycosyltransferase [Bacteroides fragilis YCH46]
gi|52218656|dbj|BAD51249.1| putative glycosyltransferase [Bacteroides fragilis YCH46]
Length=257
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/232 (34%), Positives = 120/232 (52%), Gaps = 16/232 (6%)
Query 19 AMAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLG 78
+ +P FS+I T N VL + S+ QT E +++DG S D TL+I N + +
Sbjct 3 SHPSPKFSVITVTYNAEKVLEDTVQSVISQTYHHVEYIIIDGASKDGTLEIVNRYRDRIN 62
Query 79 ERLIIHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLV 138
+ + + D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ I +E D++
Sbjct 63 QLV---SEPDKGLYDAMNKGIALATGDYLCFLNAGDSFHEDDTLQKIVHSINGNELPDIL 119
Query 139 YGDVIM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRV 190
YG+ + +F R L+ FK+ +CHQA F R L I PY+L+YR
Sbjct 120 YGETALVDAERHFLRMRRLSAPETLNWKSFKQGMLVCHQAFFPRHTL---IEPYDLQYRF 176
Query 191 LADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTR 242
AD+D+ IR A H+++ Y G++ L+R + TR
Sbjct 177 SADFDWCIRIMKK-ARTFHNTHLILIDYLA-EGMTTQNHKASLLERFRIMTR 226
>gi|329956536|ref|ZP_08297133.1| glycosyltransferase, group 2 family protein [Bacteroides clarus
YIT 12056]
gi|328524433|gb|EGF51503.1| glycosyltransferase, group 2 family protein [Bacteroides clarus
YIT 12056]
Length=248
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/208 (36%), Positives = 110/208 (53%), Gaps = 15/208 (7%)
Query 23 PMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLI 82
P FS+I N AVL + S+ QT E ++VDG S D+TL I + + ++ +
Sbjct 7 PKFSVITVAYNAEAVLEDTIQSVISQTYRHVEYIIVDGASKDKTLSITDRYKNHIAQ--- 63
Query 83 IHRDTDQGVYDAMNRGVDLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDV 142
+ + D+G+YDAMN+G+ LATG +L FL A DS +E DTL ++ + E D++YG+
Sbjct 64 VVSEPDKGLYDAMNKGIRLATGDYLCFLNAGDSFHEDDTLQQMVHTLTGSELPDVLYGET 123
Query 143 IM--RSTNF----RWGGAFDLDRLLFKRN--ICHQAIFYRRGLFGTIGPYNLRYRVLADW 194
+ + +F R L FK+ +CHQA F +R L PYNL+YR AD+
Sbjct 124 ALVDKEGHFVRMRRLSAPETLTWKSFKQGMLVCHQAFFAKRAL---AEPYNLKYRFSADF 180
Query 195 DFNIRCFSNPALVTRYMHVVVASYNEFG 222
D+ IR A H+ + Y + G
Sbjct 181 DWCIRIMKK-AQTLHNTHLTIIDYLDEG 207
Lambda K H
0.328 0.141 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 432410969436
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40