BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1525
Length=261
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608663|ref|NP_216041.1| rhamnosyl transferase WbbL2 [Mycoba... 536 1e-150
gi|254231754|ref|ZP_04925081.1| rhamnosyl transferase wbbL2 [Myc... 526 1e-147
gi|323719973|gb|EGB29085.1| rhamnosyl transferase wbbL2 [Mycobac... 522 2e-146
gi|339294501|gb|AEJ46612.1| rhamnosyl transferase WbbL2 [Mycobac... 514 3e-144
gi|183982362|ref|YP_001850653.1| rhamnosyl transferase WbbL2 [My... 356 2e-96
gi|118617177|ref|YP_905509.1| rhamnosyl transferase WbbL2 [Mycob... 352 3e-95
gi|240167918|ref|ZP_04746577.1| rhamnosyl transferase WbbL2 [Myc... 349 2e-94
gi|289745281|ref|ZP_06504659.1| rhamnosyl transferase wbbL2 [Myc... 230 1e-58
gi|78357137|ref|YP_388586.1| hypothetical protein Dde_2094 [Desu... 73.6 3e-11
gi|311745299|ref|ZP_07719084.1| rhamnosyl transferase [Algoripha... 70.1 3e-10
gi|154687695|ref|YP_001422856.1| hypothetical protein RBAM_03295... 69.3 5e-10
gi|308175312|ref|YP_003922017.1| polypeptide N-acetylgalactosami... 67.8 2e-09
gi|78223545|ref|YP_385292.1| glycosyl transferase family protein... 67.0 2e-09
gi|189423846|ref|YP_001951023.1| family 2 glycosyl transferase [... 66.6 3e-09
gi|311070087|ref|YP_003975010.1| hypothetical protein BATR1942_1... 65.5 9e-09
gi|197120101|ref|YP_002140528.1| bifunctional SAM-dependent meth... 65.1 1e-08
gi|322421641|ref|YP_004200864.1| family 2 glycosyl transferase [... 64.7 1e-08
gi|218885556|ref|YP_002434877.1| hypothetical protein DvMF_0452 ... 63.9 2e-08
gi|168703373|ref|ZP_02735650.1| glycosyl transferase family 2 [G... 63.9 2e-08
gi|344339949|ref|ZP_08770876.1| glycosyl transferase family 2 [T... 62.8 6e-08
gi|333980867|ref|YP_004518812.1| family 2 glycosyl transferase [... 62.4 7e-08
gi|253702409|ref|YP_003023598.1| glycosyl transferase family 2 [... 62.0 7e-08
gi|343084999|ref|YP_004774294.1| family 2 glycosyl transferase [... 61.2 1e-07
gi|333998081|ref|YP_004530693.1| glycosyltransferase, group 2 fa... 60.8 2e-07
gi|260892179|ref|YP_003238276.1| glycosyl transferase family 2 [... 60.1 3e-07
gi|70729409|ref|YP_259147.1| group 2 family glycosyl transferase... 60.1 4e-07
gi|337285241|ref|YP_004624715.1| glycosyltransferase [Pyrococcus... 59.3 5e-07
gi|15807708|ref|NP_285362.1| rhamnosyltransferase [Deinococcus r... 59.3 6e-07
gi|120602571|ref|YP_966971.1| hypothetical protein Dvul_1527 [De... 59.3 6e-07
gi|260893001|ref|YP_003239098.1| glycosyl transferase family 2 [... 58.9 7e-07
gi|337750761|ref|YP_004644923.1| family 2 glycosyl transferase [... 58.5 9e-07
gi|312137291|ref|YP_004004628.1| glycosyl transferase family 2 [... 58.5 9e-07
gi|333923089|ref|YP_004496669.1| type 12 methyltransferase [Desu... 58.5 9e-07
gi|156740479|ref|YP_001430608.1| glycosyl transferase family pro... 58.5 1e-06
gi|326404006|ref|YP_004284088.1| putative glycosyltransferase [A... 57.8 2e-06
gi|256827929|ref|YP_003156657.1| hypothetical protein Dbac_0112 ... 57.8 2e-06
gi|222053755|ref|YP_002536117.1| glycosyl transferase family 2 [... 57.4 2e-06
gi|329923363|ref|ZP_08278847.1| glycosyltransferase, group 2 fam... 56.6 3e-06
gi|304404178|ref|ZP_07385840.1| Methyltransferase type 11 [Paeni... 56.6 4e-06
gi|323701361|ref|ZP_08113035.1| Methyltransferase type 12 [Desul... 56.2 5e-06
gi|46580018|ref|YP_010826.1| hypothetical protein DVU1607 [Desul... 55.8 6e-06
gi|345126148|gb|EGW56014.1| uncharacterized protein Y4gI [endosy... 55.5 7e-06
gi|291562820|emb|CBL41636.1| Predicted glycosyltransferases [but... 55.5 8e-06
gi|309792813|ref|ZP_07687255.1| glycosyl transferase family prot... 55.5 8e-06
gi|296505695|ref|YP_003667395.1| FkbM family methyltransferase [... 55.1 1e-05
gi|251795463|ref|YP_003010194.1| glycosyl transferase family 2 [... 55.1 1e-05
gi|304408100|ref|ZP_07389750.1| glycosyl transferase family 2 [P... 54.3 2e-05
gi|193211833|ref|YP_001997786.1| family 2 glycosyl transferase [... 54.3 2e-05
gi|323701368|ref|ZP_08113042.1| glycosyl transferase family 2 [D... 53.9 2e-05
gi|3928919|gb|AAC79722.1| O-antigen biosynthesis protein RbfC [e... 53.9 2e-05
>gi|15608663|ref|NP_216041.1| rhamnosyl transferase WbbL2 [Mycobacterium tuberculosis H37Rv]
gi|15840992|ref|NP_336029.1| hypothetical protein MT1576 [Mycobacterium tuberculosis CDC1551]
gi|31792711|ref|NP_855204.1| rhamnosyl transferase WbbL2 [Mycobacterium bovis AF2122/97]
43 more sequence titles
Length=261
Score = 536 bits (1381), Expect = 1e-150, Method: Compositional matrix adjust.
Identities = 260/261 (99%), Positives = 261/261 (100%), Gaps = 0/261 (0%)
Query 1 LYAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLG 60
+YAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLG
Sbjct 1 MYAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLG 60
Query 61 WAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFA 120
WAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFA
Sbjct 61 WAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFA 120
Query 121 VADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALR 180
VADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALR
Sbjct 121 VADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALR 180
Query 181 ARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILRE 240
ARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILRE
Sbjct 181 ARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILRE 240
Query 241 MGMAHHRKWHKSFPLTCPASC 261
MGMAHHRKWHKSFPLTCPASC
Sbjct 241 MGMAHHRKWHKSFPLTCPASC 261
>gi|254231754|ref|ZP_04925081.1| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis C]
gi|308231864|ref|ZP_07414052.2| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis SUMu001]
gi|308369468|ref|ZP_07417877.2| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis SUMu002]
19 more sequence titles
Length=256
Score = 526 bits (1355), Expect = 1e-147, Method: Compositional matrix adjust.
Identities = 255/256 (99%), Positives = 256/256 (100%), Gaps = 0/256 (0%)
Query 6 VSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 65
+SLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS
Sbjct 1 MSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 60
Query 66 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEK 125
ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEK
Sbjct 61 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEK 120
Query 126 PDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSG 185
PDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSG
Sbjct 121 PDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSG 180
Query 186 YGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAH 245
YGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAH
Sbjct 181 YGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAH 240
Query 246 HRKWHKSFPLTCPASC 261
HRKWHKSFPLTCPASC
Sbjct 241 HRKWHKSFPLTCPASC 256
>gi|323719973|gb|EGB29085.1| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis CDC1551A]
Length=253
Score = 522 bits (1345), Expect = 2e-146, Method: Compositional matrix adjust.
Identities = 253/253 (100%), Positives = 253/253 (100%), Gaps = 0/253 (0%)
Query 9 MITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELG 68
MITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELG
Sbjct 1 MITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELG 60
Query 69 FRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDA 128
FRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDA
Sbjct 61 FRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDA 120
Query 129 ESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGL 188
ESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGL
Sbjct 121 ESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGL 180
Query 189 YTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRK 248
YTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRK
Sbjct 181 YTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRK 240
Query 249 WHKSFPLTCPASC 261
WHKSFPLTCPASC
Sbjct 241 WHKSFPLTCPASC 253
>gi|339294501|gb|AEJ46612.1| rhamnosyl transferase WbbL2 [Mycobacterium tuberculosis CCDC5079]
gi|339298137|gb|AEJ50247.1| rhamnosyl transferase WbbL2 [Mycobacterium tuberculosis CCDC5180]
Length=250
Score = 514 bits (1325), Expect = 3e-144, Method: Compositional matrix adjust.
Identities = 249/250 (99%), Positives = 250/250 (100%), Gaps = 0/250 (0%)
Query 12 VPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRL 71
+PVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRL
Sbjct 1 MPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRL 60
Query 72 AFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESY 131
AFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESY
Sbjct 61 AFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESY 120
Query 132 VPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTT 191
VPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTT
Sbjct 121 VPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTT 180
Query 192 EMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRKWHK 251
EMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRKWHK
Sbjct 181 EMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRKWHK 240
Query 252 SFPLTCPASC 261
SFPLTCPASC
Sbjct 241 SFPLTCPASC 250
>gi|183982362|ref|YP_001850653.1| rhamnosyl transferase WbbL2 [Mycobacterium marinum M]
gi|183175688|gb|ACC40798.1| rhamnosyl transferase WbbL2 [Mycobacterium marinum M]
Length=255
Score = 356 bits (914), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 174/249 (70%), Positives = 199/249 (80%), Gaps = 16/249 (6%)
Query 6 VSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 65
+S++I VPVFGQHEYTHALV DLERE A+YLIVDNRGDYP+IG ERV TP ENLGWAGGS
Sbjct 1 MSVLIIVPVFGQHEYTHALVGDLERESAEYLIVDNRGDYPKIGNERVVTPRENLGWAGGS 60
Query 66 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEK 125
+LGFR+AF+EG+S+AMTLNNDTR+SKGFVA LLD RLPAD G+VGPM D GFP A D+K
Sbjct 61 DLGFRIAFSEGHSYAMTLNNDTRISKGFVAGLLDPRLPADTGIVGPMIDHGFPCAEDDQK 120
Query 126 PDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSG 185
P+A +Y+PR RYR V AVEGTAL +SR+CW +VGGMDL TFGRYGWG+DLDLALRAR +
Sbjct 121 PEAANYIPRPRYRTVSAVEGTALALSRECWRSVGGMDLRTFGRYGWGVDLDLALRARDAE 180
Query 186 YGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAH 245
+GLYTTEMAYINHFGR TAN HFG RY GA++AM R +RR HG
Sbjct 181 FGLYTTEMAYINHFGRGTANKHFGKLRYLIGANSAMERAMRRVHG--------------- 225
Query 246 HRKWHKSFP 254
RKW K FP
Sbjct 226 -RKWRKLFP 233
>gi|118617177|ref|YP_905509.1| rhamnosyl transferase WbbL2 [Mycobacterium ulcerans Agy99]
gi|118569287|gb|ABL04038.1| rhamnosyl transferase WbbL2 [Mycobacterium ulcerans Agy99]
Length=255
Score = 352 bits (904), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 172/249 (70%), Positives = 197/249 (80%), Gaps = 16/249 (6%)
Query 6 VSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 65
+S++I VPVFGQHEYTHALV DLERE A+YLIVDNRGDYP+IG ERV TP ENLGW GGS
Sbjct 1 MSVLIIVPVFGQHEYTHALVGDLERESAEYLIVDNRGDYPKIGNERVVTPRENLGWGGGS 60
Query 66 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEK 125
+LG R+AF+EG+S+AMTLNNDTR+SKGFVA LLD RLPAD G+VGPM D GFP A D+K
Sbjct 61 DLGLRIAFSEGHSYAMTLNNDTRISKGFVAGLLDPRLPADTGIVGPMIDHGFPCAEDDQK 120
Query 126 PDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSG 185
P+A +Y+PR RYR V AVEGTAL +SR+CW +VGGMDL TFGRYGWG+DLDLALRAR +
Sbjct 121 PEAANYIPRPRYRAVSAVEGTALALSRECWRSVGGMDLRTFGRYGWGVDLDLALRARDAE 180
Query 186 YGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAH 245
+GLYTTEMAYINHFGR TAN HFG RY GA++AM R +RR HG
Sbjct 181 FGLYTTEMAYINHFGRGTANKHFGKFRYLIGANSAMERAMRRVHG--------------- 225
Query 246 HRKWHKSFP 254
RKW K FP
Sbjct 226 -RKWCKLFP 233
>gi|240167918|ref|ZP_04746577.1| rhamnosyl transferase WbbL2 [Mycobacterium kansasii ATCC 12478]
Length=247
Score = 349 bits (896), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 169/241 (71%), Positives = 195/241 (81%), Gaps = 16/241 (6%)
Query 14 VFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRLAF 73
+FGQHEYTHALV DLEREGA+YLIVDNRGDYPRI ERV TPGENLGWAGGS+LGFR+AF
Sbjct 1 MFGQHEYTHALVGDLEREGAEYLIVDNRGDYPRISNERVVTPGENLGWAGGSDLGFRIAF 60
Query 74 AEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESYVP 133
+EG+SHAMTLNNDTR+SKGFV LLD RLP DAG+VGPMFD GFP A D+KP+A Y+P
Sbjct 61 SEGHSHAMTLNNDTRISKGFVTGLLDPRLPGDAGIVGPMFDHGFPCAEDDQKPNAADYIP 120
Query 134 RARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEM 193
R RYR V AVEGTAL++SR+CW A+GGMD+ TFGRYGWG+DLDLALRAR + +GLYTTEM
Sbjct 121 RPRYRAVSAVEGTALMLSRECWQAIGGMDVRTFGRYGWGVDLDLALRARDAEFGLYTTEM 180
Query 194 AYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILREMGMAHHRKWHKSF 253
+Y+NHFGR+TANTHFG RY GA++AM R +RR +G RKW K F
Sbjct 181 SYVNHFGRRTANTHFGRWRYLLGANSAMERTMRRVYG----------------RKWRKRF 224
Query 254 P 254
P
Sbjct 225 P 225
>gi|289745281|ref|ZP_06504659.1| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis 02_1987]
gi|289685809|gb|EFD53297.1| rhamnosyl transferase wbbL2 [Mycobacterium tuberculosis 02_1987]
Length=115
Score = 230 bits (587), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 112/113 (99%), Positives = 113/113 (100%), Gaps = 0/113 (0%)
Query 6 VSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 65
+SLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS
Sbjct 1 MSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGS 60
Query 66 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFP 118
ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFP
Sbjct 61 ELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFP 113
>gi|78357137|ref|YP_388586.1| hypothetical protein Dde_2094 [Desulfovibrio alaskensis G20]
gi|78219542|gb|ABB38891.1| hypothetical protein Dde_2094 [Desulfovibrio alaskensis G20]
Length=243
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/191 (33%), Positives = 88/191 (47%), Gaps = 14/191 (7%)
Query 24 LVADLEREGADYLIVDNRGDYPRIGT-ERVSTPGENLGWAGGSELGFRLAFAEGYSHAMT 82
L AD +R AD +++DN P G +R S ENL WAG E GY+H
Sbjct 24 LAADPQR-AADIMVLDNCAPEPFCGAWQRTS---ENLYWAGALEYTLASVMGLGYTHLWF 79
Query 83 LNND----TRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESYVPRARYR 138
LNND TR A +R+ G +G V P V + PR ++R
Sbjct 80 LNNDIMFDTRPPLLARAEARMARMQRRTGAIG----VWAPAVVRSPYHEHMVADPRYQFR 135
Query 139 KVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINH 198
+V V+G A V+S DCW GG+D + YG+G+D+ +L A ++G+ + + H
Sbjct 136 RVAYVDGIAPVISIDCWRQTGGVDYAG-NPYGYGVDIWFSLCAHRAGWAVVVDNGIVVQH 194
Query 199 FGRKTANTHFG 209
TA T G
Sbjct 195 AYHSTARTVSG 205
>gi|311745299|ref|ZP_07719084.1| rhamnosyl transferase [Algoriphagus sp. PR1]
gi|126577832|gb|EAZ82052.1| rhamnosyl transferase [Algoriphagus sp. PR1]
Length=312
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/275 (26%), Positives = 120/275 (44%), Gaps = 49/275 (17%)
Query 2 YAPLVSLMITVPVFGQHEYTHALVADLER-EGADYLIV-------DNRGD-----YPRIG 48
+ P S+ I + + +++T A + L + E DY ++ +N G +P I
Sbjct 3 FTPDPSVAIILVNWNGYDFTAACINSLRKLEFPDYQVILVDNASGNNEGQKLLTAFPEI- 61
Query 49 TERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLD--SRLPADA 106
+ ENLG+AGG+ +G + A +GYSH M LNNDT V+ F+ ++ +
Sbjct 62 --ILLENKENLGFAGGNNVGIKKALEDGYSHIMLLNNDTEVAPDFLGQMVRVFNSSNQQL 119
Query 107 GMVGPMFD------------------VGFPFAVADEKPDAESYVPRARYRKVPAVEGTAL 148
G+V P+ +G + D KP E Y +A+ K+ G +
Sbjct 120 GVVQPLITFLNDKGKIWSAGGKWNSLLGRSITLGDRKP-VEDY--QAKDVKLDWATGCCM 176
Query 149 VMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINH-FGRKTANTH 207
++SR+ G ++ S F + D++ +LR R+ GY + A I H G + H
Sbjct 177 LISREAMMKTGLLNESYFAYFE---DVEWSLRFREKGYQIALASKAMIYHEAGASSKKAH 233
Query 208 FGG------HRYHWGASAAMIRGLRRTHGWPAAMG 236
G YH +IRG+++ + ++G
Sbjct 234 AEGVLSPKVFYYHVRNQFYLIRGMQKGMSFLLSLG 268
>gi|154687695|ref|YP_001422856.1| hypothetical protein RBAM_032950 [Bacillus amyloliquefaciens
FZB42]
gi|154353546|gb|ABS75625.1| conserved hypothetical protein [Bacillus amyloliquefaciens FZB42]
Length=470
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 63/219 (29%), Positives = 104/219 (48%), Gaps = 26/219 (11%)
Query 6 VSLMITVPVFGQHEYTHALVADLER----EGADYLIVDN------RGDYPRIGTERVSTP 55
V I + + + E T +A +ER + + +IVDN R + +V
Sbjct 12 VKTSIIILTYNELELTKKCLASIERYTDKDKIELIIVDNGSSDGTREYISNLPDVKVILN 71
Query 56 GENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPAD-AGMVGPMFD 114
+NLG+A G G A G + + LNNDT V+K ++ ++L ++ GMVGP+ +
Sbjct 72 EKNLGFAKGCNQGIE---AAGGENLLFLNNDTMVTKNWLTSMLRVLYQSEKTGMVGPVSN 128
Query 115 -------VGFPFAVADE-KPDAESYVPRARY--RKVPAVEGTALVMSRDCWDAVGGMDLS 164
+ + DE + AE+Y R ++ ++V + G L++ R + +G D
Sbjct 129 YVSGAQMIHDAYQSLDELESFAEAYCAREKHHTKRVLRLVGFCLLVKRSVIEDIGPFD-E 187
Query 165 TFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
FG YG D DL LRA + GY L+ A+++H G T
Sbjct 188 QFG-YGSFEDDDLCLRALQKGYSLHIALDAFVHHHGHAT 225
>gi|308175312|ref|YP_003922017.1| polypeptide N-acetylgalactosaminyltransferase-like protein [Bacillus
amyloliquefaciens DSM 7]
gi|307608176|emb|CBI44547.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 Polypeptide GalNAc transferase-like protein 1; pp-GaNTase-like
protein 1; GalNAc-T-like protein 1; Protein-UDP
acetylgalactosaminyltransferase-like protein 1; UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Bacillus amyloliquefaciens DSM 7]
gi|328555285|gb|AEB25777.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
Polypeptide GalNAc transferase-like protein 1; pp-GaNTase-like
protein 1; GalNAc-T-like protein 1; Protein-UDP acetylgalactosaminyltransferase-like
protein 1; UDP-GalNAc:polypeptide
N-acetylgala [Bacillus amyloliquefaciens TA208]
gi|328913647|gb|AEB65243.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 Polypeptide GalNAc transferase-like protein 1 [Bacillus
amyloliquefaciens LL3]
gi|341829567|gb|AEK90818.1| hypothetical protein BAXH7_03706 [Bacillus amyloliquefaciens
XH7]
Length=459
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/215 (29%), Positives = 106/215 (50%), Gaps = 26/215 (12%)
Query 10 ITVPVFGQHEYTHALVADLE----REGADYLIVDN------RGDYPRIGTERVSTPGENL 59
I + + + E T +A ++ +E + +IVDN R + +V +NL
Sbjct 5 IIILTYNELELTKKCLASIDQYTDKEEIELIIVDNGSSDGTRDYISSLPDVKVILNEKNL 64
Query 60 GWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPAD-AGMVGPMFD---- 114
G+A G G + AEG + + LNNDT V+K +++A+L ++ GMVGP+ +
Sbjct 65 GFAKGCNQG--IEAAEG-DNLLFLNNDTIVTKNWLSAMLRVLYQSEKTGMVGPVSNYVSG 121
Query 115 ---VGFPFAVADE-KPDAESYVPRARY--RKVPAVEGTALVMSRDCWDAVGGMDLSTFGR 168
+ + DE + AE+Y R ++ ++V + G L++ + + +G D FG
Sbjct 122 DQMIHDAYQSLDELESFAEAYCEREKHNTKRVFRLVGFCLLVKKSVMEDIGPFD-EQFG- 179
Query 169 YGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
YG D DL LRA + GY L+ A+++H G T
Sbjct 180 YGSFEDDDLCLRALQKGYSLHIARDAFVHHHGHAT 214
>gi|78223545|ref|YP_385292.1| glycosyl transferase family protein [Geobacter metallireducens
GS-15]
gi|78194800|gb|ABB32567.1| Glycosyl transferase, family 2 [Geobacter metallireducens GS-15]
Length=270
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/251 (28%), Positives = 104/251 (42%), Gaps = 34/251 (13%)
Query 8 LMITVPVFGQHEYTHALVADLER---EGADYLIVDN-----RGDYPRIGTE-RVSTPGEN 58
+ I +P++ Q YT V L R G D L++DN +Y + ++ RV EN
Sbjct 2 ISIVIPLYNQVAYTQLCVQSLRRAGLNGVDVLLIDNGSTDGTAEYLKSCSDLRVIRNREN 61
Query 59 LGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFP 118
LG AG G R +E M LNND VS G++ LL++ +V P G
Sbjct 62 LGCAGAWNQGVRETTSE---WVMILNNDVVVSPGWLDGLLEAAGKEGLDVVSPAIREG-- 116
Query 119 FAVADEKPDAESYVPRARYRKVPAV-----EGTALVMSRDCWDAVGGMDLSTFGRYGWGL 173
+ D E Y R V +G ++ R ++ VG D + R G
Sbjct 117 ----EYNYDIEGYSQEFVGRMGDVVRRGVADGICFMVRRRVFEVVGLFDENF--RIGQFE 170
Query 174 DLDLALRARKSGYGLYTTEMAYINHFGRKTANT---------HFGGHRYHWGASAAMIRG 224
D D RAR +G+ L TT ++I+HFG T + + +R ++ + RG
Sbjct 171 DADFFRRARNAGFRLGTTGRSFIHHFGSVTQDAIRKNRPERPYVAENRAYFREKWQLTRG 230
Query 225 LRRTHGWPAAM 235
R W +
Sbjct 231 RRLIERWRRKL 241
>gi|189423846|ref|YP_001951023.1| family 2 glycosyl transferase [Geobacter lovleyi SZ]
gi|189420105|gb|ACD94503.1| glycosyl transferase family 2 [Geobacter lovleyi SZ]
Length=280
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/210 (30%), Positives = 95/210 (46%), Gaps = 23/210 (10%)
Query 7 SLMITVPVFGQHEYTHALVADLEREGA---DYLIVDN-----RGDYPRIGTERVSTPG-E 57
++ I +P++ Q EYT + L R A + ++VDN DY + + V E
Sbjct 11 AVSIVIPLYNQLEYTRGCLDSLRRTSAADVELILVDNASSDGTADYLQTVPDLVLIANRE 70
Query 58 NLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMF---D 114
N G+AG G A E + +NND +S+G++ LL + MV P D
Sbjct 71 NRGFAGACNQGILAASGE---WIVVMNNDVILSQGWLQGLLSAAGERQLDMVSPAIREGD 127
Query 115 VGFPFAVADEKPDAESYVPRAR-YRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGL 173
+ + D P AE R + + V G + R ++ +G D + R G
Sbjct 128 LNY-----DLDPYAEELTNRMKSVVRSGTVNGICFMAHRRVFETIGVFDENF--RIGQYE 180
Query 174 DLDLALRARKSGYGLYTTEMAYINHFGRKT 203
D DL LRAR++G+ L T A+I+HFG T
Sbjct 181 DKDLFLRARRAGFRLGTVGAAFIHHFGSIT 210
>gi|311070087|ref|YP_003975010.1| hypothetical protein BATR1942_15800 [Bacillus atrophaeus 1942]
gi|310870604|gb|ADP34079.1| hypothetical protein BATR1942_15800 [Bacillus atrophaeus 1942]
Length=467
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 60/219 (28%), Positives = 104/219 (48%), Gaps = 26/219 (11%)
Query 6 VSLMITVPVFGQHEYTHALVADLER----EGADYLIVDN------RGDYPRIGTERVSTP 55
V I + + + T +A +E+ E + +IVDN R + + R
Sbjct 8 VKTSIIILTYNELVLTKKCLASIEKHTQEEDIELIIVDNGSTDGTREYLAGLSSVRTILN 67
Query 56 GENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPAD-AGMVGPMFD 114
ENLG+A G G A + + LNNDT V++ +++A+L +D GMVGP+ +
Sbjct 68 DENLGFAKGCNQGIEAAKGDNL---LFLNNDTMVTQNWLSAMLRVLDQSDQTGMVGPVSN 124
Query 115 -------VGFPFAVADE-KPDAESYVPRARYR--KVPAVEGTALVMSRDCWDAVGGMDLS 164
+ + +E +P A+++ + +++ +V + G L++ R D +GG D
Sbjct 125 YVSGAQMIPNAYQTLEEMEPFAKAHCEKEKHKTKRVLRLVGFCLLIKRAVIDEIGGFD-E 183
Query 165 TFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
F YG D DL LRA ++GY L ++++H G T
Sbjct 184 RFA-YGSFEDDDLCLRAIQAGYRLEIALDSFVHHHGHAT 221
>gi|197120101|ref|YP_002140528.1| bifunctional SAM-dependent methyltransferase, type 11,/glycosyltransferase,
family 2, hypothetical protein [Geobacter bemidjiensis
Bem]
gi|197089461|gb|ACH40732.1| SAM-dependent methyltransferase, type 11, and glycosyltransferase,
family 2, TPR domain-containing protein [Geobacter bemidjiensis
Bem]
Length=1523
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/245 (27%), Positives = 118/245 (49%), Gaps = 40/245 (16%)
Query 3 APLVSLMITVPVFGQHEY--THALVADLER---EGADYLIVDN---RGDYPRI-----GT 49
AP S + ++ + +E T + ++R E + ++VDN G P + G
Sbjct 617 APKASRLTSIVILTWNELSCTRECLESIQRYTPEPHEVILVDNGSSDGTIPFLREFCAGK 676
Query 50 E--RVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS-RLPADA 106
E R+ G+NLG+A G +G R A H + LNNDT V++G+++ ++++ + A
Sbjct 677 ENYRLIENGKNLGFAAGCNIGMRAARG---GHVLLLNNDTVVTRGWLSGMIEALQRDPKA 733
Query 107 GMVGPMF-DVGFPFAVADEK----PDAESYVPRARYR------KVPAVEGTALVMSRDCW 155
G+VGPM ++ P +A + +++ R R +V V G ++ +R+
Sbjct 734 GIVGPMTNEIAGPQKLAQVPYRGMEELQAFAERFRSEHYGRRIEVDRVVGFCMLFTRELL 793
Query 156 DAVGGMDLSTFGRYGWG--LDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRY 213
+ VG +D R+G G D D LRA +GY +I+H+G ++ F G+R
Sbjct 794 ETVGELD----ERFGSGNFEDDDFCLRAALAGYRCLIAGDVFIHHYGSRS----FAGNRV 845
Query 214 HWGAS 218
+ A+
Sbjct 846 DYAAA 850
>gi|322421641|ref|YP_004200864.1| family 2 glycosyl transferase [Geobacter sp. M18]
gi|320128028|gb|ADW15588.1| glycosyl transferase family 2 [Geobacter sp. M18]
Length=1523
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/244 (29%), Positives = 112/244 (46%), Gaps = 40/244 (16%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDN---RGDYPRI-------GTERVSTPG 56
I + + + E T + +ER E + ++VDN G P + R+ G
Sbjct 627 IIILTWNELECTRECLESIERHTPEPHEVILVDNGSTDGTLPYLREFCAERSGYRLIENG 686
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS--RLPADAGMVGPMFD 114
NLG+A G +G A H + LNND V++G+++ +L++ R P AG+ GPM +
Sbjct 687 SNLGFAAGCNIGMAEAVG---GHILLLNNDVVVTRGWLSGMLEALAREPG-AGLAGPMTN 742
Query 115 -VGFPFAVA----DEKPDAESYVPRARY----RKVPA--VEGTALVMSRDCWDAVGGMDL 163
+ P + + ES+ R R++P + G L+ SR+ +AVG +D
Sbjct 743 QIAGPQKTGGVGYRSRAELESFAGSFRSANYGRRIPVDRLVGFCLLFSREVLEAVGELDT 802
Query 164 STFGRYGWG--LDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAM 221
R+G G D D LRA GYG +I+H+G ++ F G+R + A+
Sbjct 803 ----RFGSGNFEDDDFCLRAALKGYGCVIAGDVFIHHYGSRS----FTGNRIDYAAAMRK 854
Query 222 IRGL 225
R L
Sbjct 855 NRRL 858
>gi|218885556|ref|YP_002434877.1| hypothetical protein DvMF_0452 [Desulfovibrio vulgaris str. 'Miyazaki
F']
gi|218756510|gb|ACL07409.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki
F']
Length=280
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/205 (33%), Positives = 90/205 (44%), Gaps = 16/205 (7%)
Query 15 FGQHEYTHALVADLEREGADY----LIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFR 70
+G L L R D+ L++DN P IG R P +NL W G
Sbjct 38 YGDPALAANLADTLRRADPDHAERVLVLDNAAPRPAIGAWR-RLP-DNLFWGGALAYCMD 95
Query 71 LAFAEGYSHAMTLNND----TRVSKGFVAALLDSRLPADA---GMVGPMFDVGFPFAVAD 123
A G +H LNND TR AAL +R+ A G+ GP VG AD
Sbjct 96 AARDMGCTHLWFLNNDISFETRPPLVARAALRMARMGAALAARGLGGPENKVGVYAPAAD 155
Query 124 EKPDAESYVPRAR--YRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRA 181
P V AR YR+V V+G A ++ DC VGG D + R G+G+D+ L+LRA
Sbjct 156 RNPYHPQMVRDARLQYRRVALVDGIAPIIDLDCAREVGGADCADNPR-GYGVDVWLSLRA 214
Query 182 RKSGYGLYTTEMAYINHFGRKTANT 206
++G+ + + H TA +
Sbjct 215 HRAGWPVVVDHQLAVRHRYHTTARS 239
>gi|168703373|ref|ZP_02735650.1| glycosyl transferase family 2 [Gemmata obscuriglobus UQM 2246]
Length=324
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 75/256 (30%), Positives = 111/256 (44%), Gaps = 36/256 (14%)
Query 25 VADLEREGADYLIVDNRGD---YPRIGTE----RVSTPGENLGWAGGSELGFRLAFAEGY 77
+ADL A+ ++VDN + P + E V G NLGW+GG+ G R A G
Sbjct 25 LADLS-PAANVVLVDNASEPDPTPALRAEFRWAHVVRNGANLGWSGGNNTGIRYALERGA 83
Query 78 SHAMTLNNDTRVSKGFVAALLDS-RLPADAGMVGPMF------DV-----------GFP- 118
+ + LNNDT V+ V LLD+ R G++GP+ DV GFP
Sbjct 84 DYVLLLNNDTTVAPDIVGRLLDAFRSHPSFGVIGPVIRYMDEPDVVMTDGVTFNPPGFPG 143
Query 119 FAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLA 178
F E P+ + P + V G +++ RD + VG +D F + + D
Sbjct 144 FFQRKEVPERAAEPPAVDDTDI--VNGCCMMVRRDVLERVGLIDDHFFLIHE---EADFC 198
Query 179 LRARKSGYGLYTTEMAYINHFGRKTANTHFGG--HRYHWGASAA-MIRGLRRTHGWPAAM 235
LR +++GYG A + H G TA G RY+ + A +IR R+ G
Sbjct 199 LRVKEAGYGCGVLAEALVWHKG-STAFKRAGKKWQRYYDTRNLARLIRKHRKLSGRGLLS 257
Query 236 GILREMGMAHHRKWHK 251
+ A+HR H+
Sbjct 258 AYSAYLRYAYHRYCHE 273
>gi|344339949|ref|ZP_08770876.1| glycosyl transferase family 2 [Thiocapsa marina 5811]
gi|343800128|gb|EGV18075.1| glycosyl transferase family 2 [Thiocapsa marina 5811]
Length=705
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 78/244 (32%), Positives = 110/244 (46%), Gaps = 43/244 (17%)
Query 3 APLVSLMITVPVFGQHEYTHALVADLEREGA----DYLIVD---NRGDYPRIGTE---RV 52
APLVS++I PV Q YTH +A L R GA + ++VD N G R+ R+
Sbjct 73 APLVSVVI--PVHNQFGYTHHCLAALARVGARRAFEVIVVDDCSNDGTAERLARYPGLRL 130
Query 53 STPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPA-DAGMVGP 111
ENLG+ G G A A ++ + LNNDT+V ++ AL+D+ A DAG+VG
Sbjct 131 IRNLENLGFVGSCNRG---AEAGQGAYLVFLNNDTQVQPDWLDALIDTFEDAPDAGIVGS 187
Query 112 ---------------MFDVGFPFAVAD-EKPDAESYVPRARYRK-VPAVEGTALVMSRDC 154
+F G + + PD PR YR+ V G AL +
Sbjct 188 RLLYPDGRQQEAGALLFSDGTAWNYGHLDAPDK----PRYSYRRAVDYCSGAALAIETSL 243
Query 155 WDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTA----NTHFGG 210
+ +GG D S F + D DLA R R +G +Y ++ + HF +A T G
Sbjct 244 FRRLGGFD-SAFA-PAYYEDADLAFRVRAAGRTVYYQPLSRVVHFEGISAGRDEQTQTGL 301
Query 211 HRYH 214
RY
Sbjct 302 KRYQ 305
>gi|333980867|ref|YP_004518812.1| family 2 glycosyl transferase [Desulfotomaculum kuznetsovii DSM
6115]
gi|333824348|gb|AEG17011.1| glycosyl transferase family 2 [Desulfotomaculum kuznetsovii DSM
6115]
Length=272
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 62/219 (29%), Positives = 97/219 (45%), Gaps = 35/219 (15%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDN-----RGDYPR-IGTERVSTPGENLG 60
I +P Q T + + + E + ++VDN ++ R + R+ G NLG
Sbjct 9 IIIPTRNQLSLTVLCLDSIFKYTPEPVEIIVVDNGSTDGTTEHLRSVPGIRLLENGRNLG 68
Query 61 WAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS-RLPADAGMVGPMFDVGFPF 119
+A G LA A G + + LNNDT V+ G+++ L+ R AG+VGP+ + G
Sbjct 69 FAAACNRG--LAVARG-DYLLLLNNDTLVTPGWLSGLITHLRKYPGAGLVGPLSNCG--- 122
Query 120 AVADEKPDAESYVPR-------------ARYRKVPAVEGTALVMSRDCWDAVGGMDLSTF 166
V P A + R R P + G L+ +R + +GG+D
Sbjct 123 GVTQTMPVALPSLDRLEEFSRRLALLNCGRCHPAPVLSGFCLLFTRRVLNTIGGLD---- 178
Query 167 GRYGWG--LDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
R+ G D D LRAR +G+ L ++ HFG+KT
Sbjct 179 PRFNPGNFEDDDFCLRARLAGFVLLVAADVFVYHFGQKT 217
>gi|253702409|ref|YP_003023598.1| glycosyl transferase family 2 [Geobacter sp. M21]
gi|251777259|gb|ACT19840.1| glycosyl transferase family 2 [Geobacter sp. M21]
Length=1523
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 63/237 (27%), Positives = 112/237 (48%), Gaps = 40/237 (16%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDN---RGDYPRI-------GTERVSTPG 56
I + + + T + +ER E + ++VDN G P + R+ G
Sbjct 626 IVILTWNELSCTRECLESIERHTPEPHEVILVDNGSSDGTVPFLREFCAQRENYRLIENG 685
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS--RLPADAGMVGPMF- 113
+NLG+A G +G R A H + LNNDT V++G+++ ++++ R P AG+VGPM
Sbjct 686 KNLGFAAGCNIGMREAVG---GHILLLNNDTVVTRGWLSGMIEALERDP-KAGIVGPMTN 741
Query 114 DVGFPFAVA----DEKPDAESYVPR------ARYRKVPAVEGTALVMSRDCWDAVGGMDL 163
++ P +A + + +++ R R +V V G ++ +R+ + VG +D
Sbjct 742 EIAGPQKLARVPYRDMEELQAFAERFRGEHYGRRIEVDRVVGFCMLFTREVLETVGELD- 800
Query 164 STFGRYGWG--LDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGAS 218
R+G G D D LR +GY +I+H+G ++ F G+R + A+
Sbjct 801 ---ERFGSGNFEDDDFCLRGALAGYRCLIAGDVFIHHYGSRS----FTGNRVDYAAA 850
>gi|343084999|ref|YP_004774294.1| family 2 glycosyl transferase [Cyclobacterium marinum DSM 745]
gi|342353533|gb|AEL26063.1| glycosyl transferase family 2 [Cyclobacterium marinum DSM 745]
Length=301
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/213 (24%), Positives = 92/213 (44%), Gaps = 24/213 (11%)
Query 7 SLMITVPVFGQHEYTHALVADLEREG---ADYLIVDN---RGDYPRIGTE----RVSTPG 56
S+ I + + EYT + +A L G D +IVDN G P I + +
Sbjct 4 SVAIVILNWNGWEYTQSCIASLLEAGYSEGDLIIVDNASTDGSIPNIKSHFPKAHLIQNA 63
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS-RLPADAGMVGPM--- 112
+N G+ GG+ G + A +GY++ + LNNDT+V GF+ L++ ++ D + P+
Sbjct 64 KNEGFTGGNNRGIQYALTKGYAYVLLLNNDTQVLPGFLEFLIEEMKISGDLAAIQPLIYF 123
Query 113 -------FDVGFPFAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLST 165
++ G + + + + G A+++ + +G +D +
Sbjct 124 MGDKDEIWNAGGNYHAWLGYSQTNYQIQSDQPYLTEWISGCAILVRSTVLEKIGLLDHNY 183
Query 166 FGRYGWGLDLDLALRARKSGYGLYTTEMAYINH 198
F + D+D +LR R G+ L + I H
Sbjct 184 FAYFE---DVDWSLRMRSQGFELKVHPKSIIYH 213
>gi|333998081|ref|YP_004530693.1| glycosyltransferase, group 2 family [Treponema primitia ZAS-2]
gi|333741062|gb|AEF86552.1| glycosyltransferase, group 2 family [Treponema primitia ZAS-2]
Length=298
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/192 (27%), Positives = 82/192 (43%), Gaps = 36/192 (18%)
Query 34 DYLIVDN----------RGDYPRIGTERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTL 83
+ L+VDN R +YP + V G+NLG+A + +G + A +G + L
Sbjct 31 NILVVDNGSTDGTLNIIRKEYPLV---EVIETGQNLGFAKANNIGIKYALDKGTEYVFLL 87
Query 84 NNDTRVSKGFVAALLD--SRLPADAGMVGPMFDVGFPFAVADEKPDAESYVPR------- 134
N D +V + L+D LP +AG++ P+ G A +P +++
Sbjct 88 NQDAQVKSNTIKELIDLFPSLP-NAGILSPIHLNG---TSAYLEPSFANFIREHCSYEII 143
Query 135 --------ARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGY 186
Y ++ V A ++SR C + VGG D + F YG D++ + R G+
Sbjct 144 SDLYFNNLKNYYEIKFVNAAAWLISRRCIETVGGFDTNIFFHYGE--DINYSQRLIFHGF 201
Query 187 GLYTTEMAYINH 198
Y T I H
Sbjct 202 KSYITTKTTICH 213
>gi|260892179|ref|YP_003238276.1| glycosyl transferase family 2 [Ammonifex degensii KC4]
gi|260864320|gb|ACX51426.1| glycosyl transferase family 2 [Ammonifex degensii KC4]
Length=1340
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/241 (29%), Positives = 105/241 (44%), Gaps = 41/241 (17%)
Query 2 YAPLVSLM-ITVPVFGQHEYTHALVADLER---EGADYLIVDNRG-----------DYPR 46
++P L+ + +PV+G+ E T + L + + ++VDN G
Sbjct 915 FSPQKELVSVVMPVYGEVEPTRVSIDALLKFTHRPLEVILVDNSGRQEMEEFLEGVSAKH 974
Query 47 IGTERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALL-----DSR 101
G +V PGEN+G+ G LG LA AEG + + +NND VS + + LL D R
Sbjct 975 QGKVKVVRPGENVGYTLGCNLG--LAMAEG-DYLVVMNNDVVVSPYWASYLLAAFAADPR 1031
Query 102 LPADAGMVGPMFDVGFPFAVADEKPDAESYVP-------RARYRKVPAVE---GTALVMS 151
L G+VGP + F ++ P P + R K+ V G +M
Sbjct 1032 L----GIVGPCTNYAAGFQGSNLSPPDLISFPDWAGRWHQERAGKLTIVNRLIGFLWMMR 1087
Query 152 RDCWDAVGGMD-LSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGG 210
R+ + +GG D L G Y D D LRA+ +GY L E Y+ H+G + G
Sbjct 1088 RELLEEIGGFDPLFGTGNYE---DDDYCLRAQLAGYRLALAEDVYVYHYGSHSFRKQPGS 1144
Query 211 H 211
+
Sbjct 1145 Y 1145
>gi|70729409|ref|YP_259147.1| group 2 family glycosyl transferase [Pseudomonas fluorescens
Pf-5]
gi|68343708|gb|AAY91314.1| glycosyltransferase, group 1 family domain/glycosyl transferase,
group 2 family domain protein [Pseudomonas fluorescens Pf-5]
Length=729
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/176 (31%), Positives = 85/176 (49%), Gaps = 17/176 (9%)
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFD-V 115
+NLG+A + G RLA +G+ + + LNNDT V KG++A LL + G+VGP + V
Sbjct 529 QNLGFAAANNQGLRLA--QGH-YLVLLNNDTVVPKGWLAPLLRHLDDPEVGLVGPTTNAV 585
Query 116 GFPFAVA---DEKPDAESYVPR--ARYR----KVPAVEGTALVMSRDCWDAVGGMDLSTF 166
G V+ + D E + R ++YR +P + + M R+ D VG +D F
Sbjct 586 GNEARVSIDYQDLDDMEDFADRRASQYRGRCFDIPMLAMFCVAMRREVLDQVGWLD-EAF 644
Query 167 GRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYH--WGASAA 220
G G D D + R + +G E +YI+H+G+ + Y W + A
Sbjct 645 G-IGMFEDDDYSRRVQAAGLRTVCAEDSYIHHYGQASFKALIASGEYQRLWDKNQA 699
>gi|337285241|ref|YP_004624715.1| glycosyltransferase [Pyrococcus yayanosii CH1]
gi|334901175|gb|AEH25443.1| glycosyltransferase [Pyrococcus yayanosii CH1]
Length=323
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/170 (29%), Positives = 78/170 (46%), Gaps = 15/170 (8%)
Query 40 NRGDYPRIGTER---VSTPGENLGWAGGSELGFRLAFAE-GYSHAMTLNNDTRVSKGFVA 95
NR Y + R + +N G+AGG+ +G + A + + M LNNDT V K F+
Sbjct 90 NRPLYEKFDVNRRMILIKNRDNYGFAGGNNVGIKFALSVLNPDYVMLLNNDTVVDKKFLD 149
Query 96 ALLD-SRLPADAGMVGPMF------DVGFPFAVADEKPDAESYVPRARYRKVPAVEGTAL 148
L+ + G+ GP+ + + F + +SY+ +VP V G AL
Sbjct 150 ELVKVAESDKKIGIAGPLILYYDHPTIVWAFGTVALRGKPKSYILYTPI-EVPWVIGCAL 208
Query 149 VMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINH 198
++ D +VG +D + F +G + DL LRAR Y LY + + H
Sbjct 209 LLKADLLRSVGLLDENIFL---YGEEYDLCLRARSQSYYLYAVPTSTVIH 255
>gi|15807708|ref|NP_285362.1| rhamnosyltransferase [Deinococcus radiodurans R1]
gi|6460565|gb|AAF12271.1|AE001862_97 rhamnosyltransferase, putative [Deinococcus radiodurans R1]
Length=271
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/186 (25%), Positives = 84/186 (46%), Gaps = 19/186 (10%)
Query 29 EREGADYLIVDNRGDYPRIGT-------ERVSTPGENLGWAGGSELGFRLAFAEGYSHAM 81
E D +IVDN I RV G+N G+A G +G R A + GY++
Sbjct 28 EIRQCDIIIVDNDSQDESINVIKEHFTDVRVIESGKNGGFAYGCNVGIRAALSSGYNYVW 87
Query 82 TLNNDTRVSKGFVAALLDSRLPADAGMVGPMF-DVGFPFAVADEKPDAESYVPRA----- 135
LNNDT V K + +++ +VG + D+ FP++V +++
Sbjct 88 LLNNDTIVDKTTLPNMIEEIKRNKKAIVGSIIRDIEFPWSVQAVGGGTINFLTGTGRHIK 147
Query 136 ---RYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTE 192
K+ + G ++++SR+ +++G ++ F + D + +RAR+ GY L +
Sbjct 148 TFDDVDKLDYLLGASILVSREALESIGLLNEDFFMYWE---DTEFCIRARRKGYFLKVAK 204
Query 193 MAYINH 198
+ + H
Sbjct 205 QSNVYH 210
>gi|120602571|ref|YP_966971.1| hypothetical protein Dvul_1527 [Desulfovibrio vulgaris DP4]
gi|120562800|gb|ABM28544.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4]
Length=248
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/164 (33%), Positives = 75/164 (46%), Gaps = 12/164 (7%)
Query 52 VSTPG------ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTR-VSKGFVAALLDSRLPA 104
VS PG +NL WAG E LA EG++H LNND VS+ A + L
Sbjct 47 VSYPGAWRRATDNLYWAGALEDCVSLAREEGFTHLWFLNNDIEFVSRPPFMARAAAHLWR 106
Query 105 DAGMVGPMFDVGFPFAVADEKPDAESYVPR--ARYRKVPAVEGTALVMSRDCWDAVGGMD 162
+G DVG P V R + +V +G A V + DC +A+GG+D
Sbjct 107 LGERLG--HDVGVWAPAVLRSPYHPQMVHREGVAFSRVAVADGIAPVYALDCIEAIGGVD 164
Query 163 LSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANT 206
S YG+G+D+ L+LRA +G+ + + H TA T
Sbjct 165 ASD-NPYGYGVDVWLSLRAHHAGWPVIVDHGVVVRHRHHTTART 207
>gi|260893001|ref|YP_003239098.1| glycosyl transferase family 2 [Ammonifex degensii KC4]
gi|260865142|gb|ACX52248.1| glycosyl transferase family 2 [Ammonifex degensii KC4]
Length=785
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 59/220 (27%), Positives = 96/220 (44%), Gaps = 31/220 (14%)
Query 8 LMITVPVFGQHEYTHALVADLEREGA---DYLIVDNRGD------YPRIGTERVSTPGEN 58
+ I +PVF Q +T + + + + +I+DN +V T N
Sbjct 512 ISIIIPVFNQLSFTKRCLESIRNFTSLPYELIIIDNGSTDETSAFLKSQKDLKVITNESN 571
Query 59 LGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS-RLPADAGMVGPM--FDV 115
G+ G LA E + + LNNDT V+ G++ LL + + AG+ GP+ +
Sbjct 572 RGFGPAVNQGLALARGE---YLVVLNNDTLVTPGWLEELLYALKTGPAAGLAGPVTNYAS 628
Query 116 GFPFAVADEKPDAESYVP----------RARYRKVPAVEGTALVMSRDCWDAVGGMDLST 165
G +PD E + + + R+V + G L + R + +GG D
Sbjct 629 GPQCIPVPYQPDNEESIVTFARERAVAYKGKIREVFRLVGFCLAIRRAVVEKIGGFD--- 685
Query 166 FGRYGWG--LDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
R+ G D D LRAR +G+ LY E A+I+H+G +T
Sbjct 686 -PRFATGNFEDDDFCLRARLAGFRLYIAEGAFIHHYGHQT 724
>gi|337750761|ref|YP_004644923.1| family 2 glycosyl transferase [Paenibacillus mucilaginosus KNP414]
gi|336301950|gb|AEI45053.1| glycosyl transferase family 2 [Paenibacillus mucilaginosus KNP414]
Length=457
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 63/227 (28%), Positives = 99/227 (44%), Gaps = 44/227 (19%)
Query 4 PLVSLMITVPVFGQHEYTHALVADLER---EGADYLIVDN-----RGDYPRI--GTERVS 53
P I +P + Q A + + + E + +++DN +Y + G R
Sbjct 59 PWEGCSIIIPTYNQAGLLKACIESIRQYTPEPHEIIVIDNASTDGTAEYLQSMRGQLRFR 118
Query 54 TPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALL-----DSRLPADAGM 108
EN+G+AGG G R+A S + LNNDT V++G++ LL D R+ G+
Sbjct 119 INAENIGFAGGVNQGMRMARG---STLLLLNNDTVVTQGWLGNLLRCLHSDERI----GL 171
Query 109 VGPMFDVGFPFAVADEKPDAESY--------------VPRA-RYRKVPAVEGTALVMSRD 153
VGPM + ++ E+ SY VP A R+R + G L++ R+
Sbjct 172 VGPMTNY-----ISGEQLQPVSYASMKEMHRFAREFNVPDASRWRPTTRITGFCLLLRRE 226
Query 154 CWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFG 200
+D +G D F + G D D LR R G+ L +I+H G
Sbjct 227 LFDRLGYFD-EGFEK-GNCEDDDFCLRVRLLGHELVIAGDTFIHHVG 271
>gi|312137291|ref|YP_004004628.1| glycosyl transferase family 2 [Methanothermus fervidus DSM 2088]
gi|311225010|gb|ADP77866.1| glycosyl transferase family 2 [Methanothermus fervidus DSM 2088]
Length=292
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 55/203 (28%), Positives = 90/203 (45%), Gaps = 32/203 (15%)
Query 38 VDNRGDYPRIGTERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFV--- 94
++N+ + I E+ NLG+AGG+ +G R A + + + LNNDT V + F+
Sbjct 53 IENKPKFKLIANEK------NLGFAGGNNIGIRYALNNEFEYILLLNNDTVVDENFLKPM 106
Query 95 AALLDSRLPADAGMVGP---MFDVGFPFAVAD----EKPDAESYV------PRARYRKVP 141
LL+S D G VGP +D VA + A +Y+ ++Y +V
Sbjct 107 VELLES--SDDIGFVGPKTYFYDKKNVIQVAGGGYVDLTTARAYLLGYMENDSSKYNEVI 164
Query 142 A---VEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINH 198
V G+ ++ RD + +G +D F Y D++ R +K GY + A I H
Sbjct 165 ELDYVSGSCILTKRDVIEEIGLLDERFFMYYE---DVEWCYRGKKHGYKSFYQPEAVIWH 221
Query 199 FGRKTANTHFGGHRYHWGASAAM 221
+ A+T+ YH S +
Sbjct 222 --KHGASTNKCFELYHLNKSRIL 242
>gi|333923089|ref|YP_004496669.1| type 12 methyltransferase [Desulfotomaculum carboxydivorans CO-1-SRB]
gi|333748650|gb|AEF93757.1| Methyltransferase type 12 [Desulfotomaculum carboxydivorans CO-1-SRB]
Length=741
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 63/226 (28%), Positives = 96/226 (43%), Gaps = 31/226 (13%)
Query 1 LYAPLVSLMITVPVFGQHEYTHALVADLER---EGADYLIVDNRGDYPRI----GTERVS 53
L P + I +P Q E T +A +ER E + +IVDN D I E V
Sbjct 469 LSKPKGPVSIVIPSCNQLELTKLCLASIERYTTEDYEIIIVDNGSDQATIEYLAAQENVH 528
Query 54 --TPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAAL---LDSRLPADAGM 108
G+NLG+A G +A + + LNNDT V+ G+++ L L+ A +
Sbjct 529 LICNGKNLGFAAACNQGMTVAKGK---YICLLNNDTIVTPGWLSHLAYHLEQN--PQALV 583
Query 109 VGPMFDVG-----FPFAVADEKPDAESYVPRARYR------KVPAVEGTALVMSRDCWDA 157
VGPM + P A P+ E++ + R + + G L++ +
Sbjct 584 VGPMSNYASRAQTLPVAFRS-IPEIENFAAQNRINNWLKSSQAQTLSGFCLLLKEEVKGI 642
Query 158 VGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
+GG D + G D D LR + SG+ L YI+H G +T
Sbjct 643 IGGFDTRFWP--GNFEDNDYCLRIKLSGHQLLVAHDVYIHHIGSRT 686
>gi|156740479|ref|YP_001430608.1| glycosyl transferase family protein [Roseiflexus castenholzii
DSM 13941]
gi|156231807|gb|ABU56590.1| glycosyl transferase family 2 [Roseiflexus castenholzii DSM 13941]
Length=303
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/223 (27%), Positives = 96/223 (44%), Gaps = 42/223 (18%)
Query 8 LMITVPVFGQHEYTHALVADLER---EGADYLIVDN----------RGDYPRIGTERVST 54
++I + + T A + L+R D L++DN R +P++ +V
Sbjct 8 VLIIILCYNGVADTLACLESLQRVEYPSFDLLVIDNASSDGTPEVVRARFPQV---KVIE 64
Query 55 PGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLD-SRLPADAGMVGP-- 111
G NLG+A G+ +G R A GY +A+ LNNDT V+ F+A L+D + G VGP
Sbjct 65 NGANLGFAAGNNVGLRYALRHGYDYALLLNNDTEVAPDFLARLIDAAESDPKIGAVGPTI 124
Query 112 --------------MFDVGFPFAV--ADEKPDAESYVPRARYRKVPAVEGTALVMSRDCW 155
M D AV A+E + + P V V G AL++ R
Sbjct 125 TYYERPDLIWSAGGMIDWKRGIAVMRANEVDSGQYHEP----ADVDFVTGCALLVKRAVL 180
Query 156 DAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINH 198
+ G +D F + + + +R R++G+ + + + H
Sbjct 181 ERAGLLDERFFMYFE---ETEWCVRIRRAGFRILHVPRSRVLH 220
>gi|326404006|ref|YP_004284088.1| putative glycosyltransferase [Acidiphilium multivorum AIU301]
gi|325050868|dbj|BAJ81206.1| putative glycosyltransferase [Acidiphilium multivorum AIU301]
Length=317
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/241 (25%), Positives = 95/241 (40%), Gaps = 46/241 (19%)
Query 2 YAPLVSLMITVPVFGQHEYTHALVADLER---EGADYLIVDNRGDYPRIGTERVSTP--- 55
Y ++ I V + + T A + L+R I+DN + I R P
Sbjct 4 YEDYPAVAIIVLNWNDSKNTIACIESLQRIIYPKFKIYIIDNNSEDSSIENIRKIFPYIE 63
Query 56 ----GENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGP 111
G NLGWAGG+ G +LA GY++ +N D RV ++AL+++ D VG
Sbjct 64 IYNSGSNLGWAGGNNYGIKLAKLHGYNYFYLINPDIRVEPKTLSALVEAIDAEDVAAVGS 123
Query 112 MFDVGFPFAVADEKPD----------AESYVPRARYRK------------VPAVEGTALV 149
+ ++ PD +ES R K VP ++G + +
Sbjct 124 V-------VISYSNPDWLEFAGSQLTSESGFSRQTSMKKSEFVFDGPDIDVPELKGCSFL 176
Query 150 MSRDCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFG 209
++ +D +G D F Y + D RAR + ++ + H G +T FG
Sbjct 177 ITEKGFDKIGYFDERYFLNYD---ETDWCFRARSNNMRCVFSKKSICFHIGAQT----FG 229
Query 210 G 210
G
Sbjct 230 G 230
>gi|256827929|ref|YP_003156657.1| hypothetical protein Dbac_0112 [Desulfomicrobium baculatum DSM
4028]
gi|256577105|gb|ACU88241.1| conserved hypothetical protein [Desulfomicrobium baculatum DSM
4028]
Length=241
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/148 (32%), Positives = 67/148 (46%), Gaps = 3/148 (2%)
Query 58 NLGWAGGSELGFRLAFAEGYSHAMTLNNDTR-VSKGFVAALLDSRLPADAGMVGPMFDVG 116
NL WAG E + +G SH LNND VSK + + RL + GP+ V
Sbjct 55 NLYWAGAFEHVMGILAGQGASHVWFLNNDALFVSKPPLIDRVRQRLARAERLAGPV-GVY 113
Query 117 FPFAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLD 176
P A+ P +YR+V V+G A ++S D W GG+D + YG+G+D+
Sbjct 114 SPAVTANPYHPQMVERPSGQYRQVRYVDGIAPLVSVDYWRRAGGIDFAG-NPYGYGVDVW 172
Query 177 LALRARKSGYGLYTTEMAYINHFGRKTA 204
+ RA + G+ I H TA
Sbjct 173 FSSRAEELGFACVVDHQVVIRHSYHSTA 200
>gi|222053755|ref|YP_002536117.1| glycosyl transferase family 2 [Geobacter sp. FRC-32]
gi|221563044|gb|ACM19016.1| glycosyl transferase family 2 [Geobacter sp. FRC-32]
Length=3011
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 62/239 (26%), Positives = 107/239 (45%), Gaps = 35/239 (14%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDN----------RGDYPRIGTERVSTPG 56
I + Q EYT ++ +++ E + + VDN R + R+
Sbjct 2101 IIILTLNQLEYTRECLSSIQKHTPEKHEIIFVDNGSKDGTVKWLRQEMKSHKNYRLIENK 2160
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFD-- 114
+NLG+A G G +A A+G + + LNND V++G++ + + + G+V PM +
Sbjct 2161 KNLGFAKGCNQG--MAAAKG-EYLLLLNNDVVVTEGWLTGMRECLEASADGIVAPMTNNI 2217
Query 115 ------VGFPFAVADE-KPDAESYVPRARYRKVPA--VEGTALVMSRDCWDAVGGMDLS- 164
+ P+A DE + A + R+R++P V G ++ R D VG +D +
Sbjct 2218 AGPQQLLSVPYANMDEMQAFAAGFRSAHRHRRIPVKLVVGFCMLFRRSLADQVGMLDEAF 2277
Query 165 TFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIR 223
G + D D LRA +G +I+H G +T F G+R + A+ A+ R
Sbjct 2278 ALGNFE---DDDFCLRASLAGLQSVIAGDVFIHHHGSRT----FEGNRIDFNAAMAVNR 2329
>gi|329923363|ref|ZP_08278847.1| glycosyltransferase, group 2 family protein [Paenibacillus sp.
HGF5]
gi|328941455|gb|EGG37747.1| glycosyltransferase, group 2 family protein [Paenibacillus sp.
HGF5]
Length=866
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 79/283 (28%), Positives = 115/283 (41%), Gaps = 65/283 (22%)
Query 4 PLVSLMITVPVFGQHEYTHALVADLEREG----ADYLIVDNRG-DYPRIGTERVSTP--- 55
P VS++I V + YT + +L ++G + +IVDN D + ++ P
Sbjct 456 PKVSVIIVV--HNKWSYTEQCLRNLLQKGHYPNLEVVIVDNASTDQTSFHLKALNNPLVK 513
Query 56 ----GENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADA--GMV 109
ENLG++ G+ LG + E + + LNNDT V G L D GM
Sbjct 514 VISSPENLGFSAGNTLGCLESTGE---YMILLNNDTLVPDGSWITRLLRPFHEDPQIGMT 570
Query 110 GPMF-----DVGFPFAVADEKPDA---------ESYVPRARYRKVPAVEGTALVMSRDCW 155
GPM D + + D A E Y RY + A M R W
Sbjct 571 GPMSNYVGNDQALDYFIGDAVYGADVRWLHDFYELYHQNIRYTDLLGFFCVA--MKRSVW 628
Query 156 DAVGGMDLSTFGRYGWGL--DLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRY 213
D VG +D + YG G+ D D R R +GY L E A++ H+G
Sbjct 629 DKVGSLDPA----YGIGMFEDDDYCERVRNAGYRLAIVEDAFVYHYG------------- 671
Query 214 HWGASAAMIRGLRRTHGWPAAMGILREMGMAHH-RKWHKSFPL 255
+A I+ L+ P A L E A++ +KW+KS+ L
Sbjct 672 -----SATIKELK-----PQAYDSLWEKNKAYYEQKWNKSWRL 704
>gi|304404178|ref|ZP_07385840.1| Methyltransferase type 11 [Paenibacillus curdlanolyticus YK9]
gi|304347156|gb|EFM12988.1| Methyltransferase type 11 [Paenibacillus curdlanolyticus YK9]
Length=764
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/210 (29%), Positives = 93/210 (45%), Gaps = 22/210 (10%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSE 66
I + + + +YT + + + EG LIV + G + P L A
Sbjct 5 IILVTYNKLDYTKQCIDSIRQRTDEGRYELIVVDNGSMDGTQEWLAAQPDIRL-IANSDN 63
Query 67 LGFRLAFAEGYSHA-----MTLNNDTRVSKGFVAALLDSRLPA-DAGMVGPMFD------ 114
GF A +G + A M LNNDT V+ ++ L + L A D G VGP+ +
Sbjct 64 AGFPRACNQGLAAAAGDLLMLLNNDTVVTPSWLEGLTTALLSATDIGAVGPVTNAASYLT 123
Query 115 -VGFPFAVADEKP---DAESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYG 170
+ P+A DE +A + A++ + + G L+M R+ ++ VG +D FG G
Sbjct 124 SIPVPYATMDEMEAFAEAHNRPDPAKWEERVKLIGYCLLMKREAYEQVGELD-EQFG-IG 181
Query 171 WGLDLDLALRARKSGYGLYTTEMAYINHFG 200
D D LR R +GY L +I+H+G
Sbjct 182 NFEDDDFCLRMRLAGYKLLLCRDTFIHHYG 211
>gi|323701361|ref|ZP_08113035.1| Methyltransferase type 12 [Desulfotomaculum nigrificans DSM 574]
gi|323533620|gb|EGB23485.1| Methyltransferase type 12 [Desulfotomaculum nigrificans DSM 574]
Length=741
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 60/217 (28%), Positives = 93/217 (43%), Gaps = 31/217 (14%)
Query 10 ITVPVFGQHEYTHALVADLER---EGADYLIVDNRGDYPRI----GTERVS--TPGENLG 60
I +P Q E T +A +ER E + +IVDN D I E V G+NLG
Sbjct 478 IVIPSCNQLELTKLCLASIERYTTEDYEIIIVDNGSDQATIEYLAAQENVHLICNGKNLG 537
Query 61 WAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAAL---LDSRLPADAGMVGPMFDVG- 116
+A G +A + + LNNDT V+ G+++ L L+ A +VGPM +
Sbjct 538 FAAACNQGMTVAKGK---YICLLNNDTIVTPGWLSHLAYHLEQN--PQALVVGPMSNYAS 592
Query 117 ----FPFAVADEKPDAESYVPRA------RYRKVPAVEGTALVMSRDCWDAVGGMDLSTF 166
P A P+ E++ + + + + G L++ + +GG D +
Sbjct 593 RAQTLPVAFRS-IPEIENFAAQNCINNWLKSSQAQTLSGFCLLLKEEVKGIIGGFDTRFW 651
Query 167 GRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKT 203
G D D LR + SG+ L YI+H G +T
Sbjct 652 P--GNFEDNDYCLRIKLSGHQLLVAHDVYIHHIGSRT 686
>gi|46580018|ref|YP_010826.1| hypothetical protein DVU1607 [Desulfovibrio vulgaris str. Hildenborough]
gi|46449434|gb|AAS96085.1| hypothetical protein DVU_1607 [Desulfovibrio vulgaris str. Hildenborough]
gi|311233983|gb|ADP86837.1| hypothetical protein Deval_1686 [Desulfovibrio vulgaris RCH1]
Length=248
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/164 (32%), Positives = 73/164 (45%), Gaps = 12/164 (7%)
Query 52 VSTPG------ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTR-VSKGFVAALLDSRLPA 104
VS PG +NL WAG E LA EG++H LNND VS+ A + L
Sbjct 47 VSYPGAWRRATDNLYWAGALEDCVSLAREEGFTHLWFLNNDIEFVSRPPFMARAAAHLWR 106
Query 105 DAGMVGPMFDVGFPFAVADEKPDAESYVPR--ARYRKVPAVEGTALVMSRDCWDAVGGMD 162
+G DVG P V R + +V +G A V + DC + +GG+D
Sbjct 107 LGERLG--HDVGVWAPAVLRSPYHPQMVHREGVAFSRVAVADGIAPVYALDCIETIGGVD 164
Query 163 LSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANT 206
YG+G+D+ L+LRA +G+ + + H TA T
Sbjct 165 AGD-NPYGYGVDVWLSLRAHHAGWPVIVDHGVVVRHRHHTTART 207
>gi|345126148|gb|EGW56014.1| uncharacterized protein Y4gI [endosymbiont of Tevnia jerichonana
(vent Tica)]
Length=544
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/148 (31%), Positives = 71/148 (48%), Gaps = 26/148 (17%)
Query 81 MTLNNDTRVS-KGFVAALLDSRLPADAGMVGPMF------------------DVGFPFAV 121
M L++DT V ++ LL L G+VGP D G PF
Sbjct 1 MFLDSDTMVVLSNWLERLLAQALRPQVGVVGPRLIYTSERIESAGLVLGMGGDAGSPFVG 60
Query 122 ADEKPDAESYVPRARY-RKVPAVEGTALVMSRDCWDAVGGMDLSTFG-RYGWGLDLDLAL 179
+ D Y+ R + ++ AV G+ L++ + +D VGG+D FG RY D+DL L
Sbjct 61 ISMREDG--YMGRLQVAQEYSAVSGSCLLIRHELFDQVGGLDEQHFGVRYS---DVDLCL 115
Query 180 RARKSGYGLYTTEMAYINHFGRKTANTH 207
+ + +GY + T A + HFGR+T +++
Sbjct 116 KVKAAGYKVIWTPFASLVHFGRQTIHSY 143
>gi|291562820|emb|CBL41636.1| Predicted glycosyltransferases [butyrate-producing bacterium
SS3/4]
Length=1037
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 66/238 (28%), Positives = 103/238 (44%), Gaps = 45/238 (18%)
Query 4 PLVSLMITVPVFGQHEYTHALVADLEREGADY----LIVDNRGDYPRIGTERVSTPGENL 59
P VS I +PV+ Q YT+A + + D +I D D T+ +S ENL
Sbjct 399 PTVS--IVIPVYNQIHYTYACLLSILENTKDVTYEIIIAD---DVSTDATKELSLYAENL 453
Query 60 GWA-GGSELGF-----RLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADA--GMVGP 111
+ GF + A A + M LNNDT+V+ G++++L++ + +D GMVG
Sbjct 454 VICRNKTNQGFLRNCNQAAKAARGKYVMFLNNDTQVTPGWLSSLVN-LIESDPTIGMVGS 512
Query 112 MF-----------------DVGFPFAVAD--EKPDAESYVPRARYRKVPAVEGTALVMSR 152
G+ + D EKP+ +YV Y + G A+++S
Sbjct 513 KLVYPDGRLQEAGGIIWSDGSGWNYGRLDDPEKPEY-NYVKDVDY-----ISGAAILLSN 566
Query 153 DCWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGG 210
D W +GG D F + D DLA RK+GY + ++ + HF + T G
Sbjct 567 DLWKQIGGFD-ELFAP-AYCEDSDLAFEVRKAGYRVVYQPLSKVIHFEGVSNGTDVNG 622
>gi|309792813|ref|ZP_07687255.1| glycosyl transferase family protein [Oscillochloris trichoides
DG6]
gi|308225125|gb|EFO78911.1| glycosyl transferase family protein [Oscillochloris trichoides
DG6]
Length=315
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 62/223 (28%), Positives = 87/223 (40%), Gaps = 31/223 (13%)
Query 21 THALVADLERE---GADYLIVDNRGDYPRIGTERVS---------TPGENLGWAGGSELG 68
T A V L+ E A LIVDN + R + TP NLG+AGG G
Sbjct 26 TIACVRSLQAELPTTAQILIVDNHSSDDSVAQFRAAFADTVHVLVTPA-NLGFAGGMNAG 84
Query 69 FRLAFAEGYSHAMTLNNDTRVSKGFVAALLD-SRLPADAGMVGP-MFDVGFPFAVADEKP 126
R A G + LNNDT V +A LL + AG++GP +F P +
Sbjct 85 IRAALDAGADAVLLLNNDTVVDSAMLAELLHAAHARPQAGVLGPAIFYYDHPTRIWQLGS 144
Query 127 DAESYVP-------------RARYRKVPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGL 173
+P R ++ V G A+ + R ++ VG +D S Y
Sbjct 145 REYPLLPIPMNLGMRDLHHAAGRPFRLDYVTGCAMFVRRHVFEQVGLLDTSYVMYYE--- 201
Query 174 DLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWG 216
D D R R++GY ++ A + H +A RY W
Sbjct 202 DADFCRRVRQAGYEIWCVPQARMWHRVSLSAGMVKPASRYAWA 244
>gi|296505695|ref|YP_003667395.1| FkbM family methyltransferase [Bacillus thuringiensis BMB171]
gi|296326747|gb|ADH09675.1| methyltransferase FkbM family [Bacillus thuringiensis BMB171]
Length=299
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/258 (25%), Positives = 101/258 (40%), Gaps = 54/258 (20%)
Query 10 ITVPVFGQ-HEYTHALVADLEREGADYLIVDNRGD----------YPRIGTERVST--PG 56
I +PVF E T + + ++ D ++ D R E + T
Sbjct 6 ILLPVFNSIQETTQCIESIIKNTSKDKYVLHILNDCSTDSLVKDTIDRYAGENIITYHQE 65
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFV-----AALLDSRLPA-----DA 106
ENLG+ G + + + LN+DT V+ G++ AA D R+ A +
Sbjct 66 ENLGFTKNVNFGLK----STKNDVVILNSDTIVTVGWLEKLEQAAYKDDRIAAVNPFSNY 121
Query 107 GMVGPM------FDVGFPF-------AVADEKPDAESYVPRARYRKVPAVEGTALVMSRD 153
G + F++G + A+ D PD Y +VP + G + R+
Sbjct 122 GQFSSIPTKFEGFNLGVDYQKVVELLALMDNIPD---------YIEVPCLVGFCMYFKRE 172
Query 154 CWDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRY 213
+ VG +D +F R G+G + D RARK GY L T +Y+ H G FG +
Sbjct 173 ILNKVGLLDEESFPR-GYGEETDWCGRARKQGYKLIVTPKSYVFHIG----GVSFGAEKE 227
Query 214 HWGASAAMIRGLRRTHGW 231
+A I R H +
Sbjct 228 ILRKNATKIIKERYPHFY 245
>gi|251795463|ref|YP_003010194.1| glycosyl transferase family 2 [Paenibacillus sp. JDR-2]
gi|247543089|gb|ACT00108.1| glycosyl transferase family 2 [Paenibacillus sp. JDR-2]
Length=448
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/214 (28%), Positives = 93/214 (44%), Gaps = 30/214 (14%)
Query 10 ITVPVFGQHEYTHALV---ADLEREGADYLIVDN-----RGDYPRI--GTERVSTPGENL 59
I +P + Q Y + D + + ++VDN +Y G R +N
Sbjct 51 IIIPSYNQVNYLRGCIDSIVDNTEQPHEIIVVDNASTDGTAEYLEQLDGQVRYRVLDQNY 110
Query 60 GWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPAD--AGMVGPMF---- 113
G+AG G L A+G + + LNNDT V+ ++ LL L +D GMVGP+
Sbjct 111 GFAGAVNTG--LMMAKG-TTMLVLNNDTLVTDNWLGNLLIC-LHSDPTIGMVGPVTNYIS 166
Query 114 -DVGFP--FAVADEKPD---AESYVPRARYRKVPAVEGTALVMSRDCWDAVGGMDLS-TF 166
D P + ++ P+ + R+ + + G L+ R+ W+ VG +D F
Sbjct 167 GDQQIPVLYERIEDMPEFARLNNISDPKRWERTDRLPGFCLLFRRELWEQVGFLDEGFEF 226
Query 167 GRYGWGLDLDLALRARKSGYGLYTTEMAYINHFG 200
G Y D D +R R GY L ++I+HFG
Sbjct 227 GNYE---DDDYNIRTRLLGYSLVIARDSFIHHFG 257
>gi|304408100|ref|ZP_07389750.1| glycosyl transferase family 2 [Paenibacillus curdlanolyticus
YK9]
gi|304343119|gb|EFM08963.1| glycosyl transferase family 2 [Paenibacillus curdlanolyticus
YK9]
Length=468
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/220 (30%), Positives = 99/220 (45%), Gaps = 42/220 (19%)
Query 10 ITVPVFGQHEYTHALV------ADLEREGADYLIVDNR-----GDYPR-IGTE-RVSTPG 56
I +P + + Y + + DL E ++VDN+ G Y R IGT+ R
Sbjct 65 IIIPTYNKAAYLKSCIESIGDHTDLPYE---IIVVDNQSTDETGAYLREIGTQVRYRILD 121
Query 57 ENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRLPADA--GMVGPMFD 114
NLG+AG G L A+G + + LNNDT V+ ++ +L + L +DA GMVGP+ +
Sbjct 122 SNLGFAGAVNAG--LMMAKG-TTVLLLNNDTLVTDNWLDNML-ACLHSDARIGMVGPVTN 177
Query 115 -------VGFPFAVADEKPDAESYVPR------ARYRKVPAVEGTALVMSRDCWDAVGGM 161
V P+ +E D Y R A++++ + G L+ R + G
Sbjct 178 YISGDQRVEVPY---EEIEDMYEYAKRNNETNPAKWQRTDRLTGFCLLFRRTLLERTGYF 234
Query 162 DLS-TFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFG 200
D G Y D D +R R GY L ++I+HFG
Sbjct 235 DEGYEIGNYE---DDDFNIRVRLQGYSLVIARDSFIHHFG 271
>gi|193211833|ref|YP_001997786.1| family 2 glycosyl transferase [Chlorobaculum parvum NCIB 8327]
gi|193085310|gb|ACF10586.1| glycosyl transferase family 2 [Chlorobaculum parvum NCIB 8327]
Length=285
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/224 (27%), Positives = 96/224 (43%), Gaps = 30/224 (13%)
Query 17 QHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRLAFAEG 76
Q E+T + + +G+ L+ R +P + + G NLG+AGG+ FR G
Sbjct 31 QPEFTVIMADNGSTDGSVALV---RRAFPEV---EILELGRNLGFAGGNNAAFRSLRGRG 84
Query 77 YSHAMTLNNDTRVSKGFVAALLDSRLPADAGMVGPMF--------------DVGFPFAVA 122
+ + + LNNDT V GF+ L+++ A G+V P V +
Sbjct 85 FDNVVFLNNDTIVDPGFLQPLIETLRDAWVGIVAPKILYIDDPARIWYAGGAVNPATGLI 144
Query 123 DEKPDAESYVPRARYRK-VPAVEGTALVMSRDCWDAVGGMDLSTFGRYGWGLDLDLALRA 181
+ +S PR K V G L M +D VGG D F YG D+DL+++
Sbjct 145 EHTGIRQSDGPRFDALKPVGYATGCCLAMRCPVFDEVGGFD-ERFRMYGE--DVDLSMKV 201
Query 182 RKSGYGLYTTEMAYINHFGRKTANT----HFGGHRYHWGASAAM 221
R+ G + + I H R +A++ +FG GA+ +
Sbjct 202 RERGLAVMYQPASRIWH--RVSASSGGEMNFGKQLRKSGAAMTL 243
>gi|323701368|ref|ZP_08113042.1| glycosyl transferase family 2 [Desulfotomaculum nigrificans DSM
574]
gi|333923096|ref|YP_004496676.1| family 2 glycosyl transferase [Desulfotomaculum carboxydivorans
CO-1-SRB]
gi|323533627|gb|EGB23492.1| glycosyl transferase family 2 [Desulfotomaculum nigrificans DSM
574]
gi|333748657|gb|AEF93764.1| glycosyl transferase family 2 [Desulfotomaculum carboxydivorans
CO-1-SRB]
Length=278
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/238 (28%), Positives = 98/238 (42%), Gaps = 43/238 (18%)
Query 5 LVSLMITVPVFGQHEYTHALVADL------EREGADYLIVDN-----RGDYPRIGTE--R 51
LVS++I PVF Q YT + L R ++ ++VDN GDY + + +
Sbjct 31 LVSIII--PVFNQLAYTKECIGSLLETLTLYRHDSEVILVDNGSTDGTGDYLKTLPDYFK 88
Query 52 VSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDS-RLPADAGMVG 110
V T +NLG+A LG A A + LNNDT G++ +L+ G+VG
Sbjct 89 VITNQQNLGFARACNLG---AAAARGDFLVFLNNDTVALPGWLGQMLNVMEKEEQVGIVG 145
Query 111 P--MFDVG-------------FPFAVADEKPDAESYVPRA-RYRKVPAVEGTALVMSRDC 154
+F G +P + +P A + R AV G L++ RD
Sbjct 146 SKLLFPDGTIQHAGVVVAAAPYPISPYHAYYQQPGDLPAANKMRDYQAVTGACLLIKRDL 205
Query 155 WDAVGGMDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHF-----GRKTANTH 207
+ AVG D Y D+DL + R+ GY + + + H GR N H
Sbjct 206 FVAVGAFDEDFINGYE---DVDLCFKVRQQGYRVVYCPTSVLYHHQSVSEGRADYNYH 260
>gi|3928919|gb|AAC79722.1| O-antigen biosynthesis protein RbfC [endosymbiont of Riftia pachyptila]
Length=746
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/148 (30%), Positives = 71/148 (48%), Gaps = 26/148 (17%)
Query 81 MTLNNDTRVS-KGFVAALLDSRLPADAGMVGPMF------------------DVGFPFAV 121
+ L++DT V ++ LL L G+VGP D G PF
Sbjct 203 LFLDSDTMVVLSNWLERLLAQALRPQVGVVGPRLIYTSERIESAGLVLGMGGDAGSPFVG 262
Query 122 ADEKPDAESYVPRARY-RKVPAVEGTALVMSRDCWDAVGGMDLSTFG-RYGWGLDLDLAL 179
+ D Y+ R + ++ AV G+ L++ + +D VGG+D FG RY D+DL L
Sbjct 263 ISMREDG--YMGRLQVAQEYSAVSGSCLLIRHELFDQVGGLDEQHFGVRYS---DVDLCL 317
Query 180 RARKSGYGLYTTEMAYINHFGRKTANTH 207
+ + +GY + T A + HFGR+T +++
Sbjct 318 KVKAAGYKVIWTPFASLVHFGRQTIHSY 345
Lambda K H
0.322 0.138 0.445
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 388543189928
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40