BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0295c
Length=267
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607436|ref|NP_214809.1| hypothetical protein Rv0295c [Mycob... 541 5e-152
gi|167967146|ref|ZP_02549423.1| hypothetical protein MtubH3_0354... 539 2e-151
gi|289748778|ref|ZP_06508156.1| conserved hypothetical protein [... 531 3e-149
gi|323721312|gb|EGB30367.1| hypothetical protein TMMG_03972 [Myc... 446 2e-123
gi|41408216|ref|NP_961052.1| hypothetical protein MAP2118 [Mycob... 416 2e-114
gi|108797369|ref|YP_637566.1| hypothetical protein Mmcs_0389 [My... 414 1e-113
gi|254819362|ref|ZP_05224363.1| hypothetical protein MintA_05530... 411 6e-113
gi|342857467|ref|ZP_08714123.1| hypothetical protein MCOL_01285 ... 409 2e-112
gi|118463655|ref|YP_881275.1| sulfotransferase [Mycobacterium av... 408 4e-112
gi|296166331|ref|ZP_06848768.1| conserved hypothetical protein [... 407 8e-112
gi|240169052|ref|ZP_04747711.1| hypothetical protein MkanA1_0704... 405 2e-111
gi|169628636|ref|YP_001702285.1| hypothetical protein MAB_1546c ... 404 8e-111
gi|183984227|ref|YP_001852518.1| hypothetical protein MMAR_4255 ... 401 6e-110
gi|118467454|ref|YP_885041.1| hypothetical protein MSMEG_0630 [M... 400 1e-109
gi|51247647|pdb|1TEX|A Chain A, Mycobacterium Smegmatis Stf0 Sul... 399 2e-109
gi|111018275|ref|YP_701247.1| hypothetical protein RHA1_ro01265 ... 301 5e-80
gi|226360399|ref|YP_002778177.1| hypothetical protein ROP_09850 ... 295 4e-78
gi|226303617|ref|YP_002763575.1| hypothetical protein RER_01280 ... 290 2e-76
gi|229492798|ref|ZP_04386596.1| Stf0 sulphotransferase [Rhodococ... 289 3e-76
gi|343926960|ref|ZP_08766450.1| hypothetical protein GOALK_075_0... 268 7e-70
gi|296141012|ref|YP_003648255.1| Stf0 sulfotransferase [Tsukamur... 258 5e-67
gi|262201523|ref|YP_003272731.1| Stf0 sulfotransferase [Gordonia... 254 1e-65
gi|213861381|ref|ZP_03385851.1| hypothetical protein SentesT_278... 178 8e-43
gi|108805153|ref|YP_645090.1| hypothetical protein Rxyl_2350 [Ru... 123 2e-26
gi|56698243|ref|YP_168616.1| hypothetical protein SPO3420 [Ruege... 102 5e-20
gi|334863090|gb|AEH13561.1| Stf0 sulfotransferase [Shewanella ba... 101 9e-20
gi|126173929|ref|YP_001050078.1| hypothetical protein Sbal_1699 ... 101 9e-20
gi|304409802|ref|ZP_07391422.1| hypothetical protein Sbal183DRAF... 100 2e-19
gi|222149388|ref|YP_002550345.1| hypothetical protein Avi_3258 [... 96.7 3e-18
gi|334316995|ref|YP_004549614.1| hypothetical protein Sinme_2280... 85.1 1e-14
gi|297622543|ref|YP_003703977.1| Stf0 sulfotransferase [Truepera... 80.9 2e-13
gi|337265909|ref|YP_004609964.1| Stf0 sulfotransferase [Mesorhiz... 78.6 9e-13
gi|119484874|ref|ZP_01619356.1| hypothetical protein L8106_15415... 76.6 3e-12
gi|118590612|ref|ZP_01548013.1| hypothetical protein SIAM614_055... 76.6 4e-12
gi|15966051|ref|NP_386404.1| hypothetical protein SMc01744 [Sino... 76.3 4e-12
gi|83592960|ref|YP_426712.1| hypothetical protein Rru_A1625 [Rho... 75.5 7e-12
gi|13473251|ref|NP_104818.1| hypothetical protein mll3788 [Mesor... 74.7 1e-11
gi|319781100|ref|YP_004140576.1| hypothetical protein Mesci_1366... 74.3 2e-11
gi|220926780|ref|YP_002502082.1| Stf0 sulfotransferase [Methylob... 72.0 9e-11
gi|89055278|ref|YP_510729.1| hypothetical protein Jann_2787 [Jan... 65.1 1e-08
gi|254504448|ref|ZP_05116599.1| Stf0 sulphotransferase superfami... 59.3 6e-07
gi|126735488|ref|ZP_01751233.1| hypothetical protein RCCS2_16471... 57.0 3e-06
gi|227822469|ref|YP_002826441.1| hypothetical protein NGR_c19240... 56.2 6e-06
gi|319781596|ref|YP_004141072.1| hypothetical protein Mesci_1869... 54.7 2e-05
gi|15965791|ref|NP_386144.1| hypothetical protein SMc04267 [Sino... 53.5 3e-05
gi|85704664|ref|ZP_01035766.1| hypothetical protein ROS217_06279... 51.6 1e-04
gi|114571053|ref|YP_757733.1| hypothetical protein Mmar10_2509 [... 51.6 1e-04
gi|325981293|ref|YP_004293695.1| hypothetical protein NAL212_059... 47.4 0.002
gi|296131408|ref|YP_003638658.1| Stf0 sulfotransferase [Cellulom... 44.3 0.018
gi|46109596|ref|XP_381856.1| hypothetical protein FG01680.1 [Gib... 41.6 0.13
>gi|15607436|ref|NP_214809.1| hypothetical protein Rv0295c [Mycobacterium tuberculosis H37Rv]
gi|15839681|ref|NP_334718.1| hypothetical protein MT0308 [Mycobacterium tuberculosis CDC1551]
gi|31791474|ref|NP_853967.1| hypothetical protein Mb0303c [Mycobacterium bovis AF2122/97]
74 more sequence titles
Length=267
Score = 541 bits (1393), Expect = 5e-152, Method: Compositional matrix adjust.
Identities = 267/267 (100%), Positives = 267/267 (100%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD
Sbjct 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS
Sbjct 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA
Sbjct 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM
Sbjct 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
LERQANQRSDEWVDRYRAEAPRLGLPT
Sbjct 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
>gi|167967146|ref|ZP_02549423.1| hypothetical protein MtubH3_03542 [Mycobacterium tuberculosis
H37Ra]
Length=267
Score = 539 bits (1388), Expect = 2e-151, Method: Compositional matrix adjust.
Identities = 266/267 (99%), Positives = 267/267 (100%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD
Sbjct 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS
Sbjct 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA
Sbjct 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWR+LTAIVASVLDAIGQDPKLAPAPM
Sbjct 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRHLTAIVASVLDAIGQDPKLAPAPM 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
LERQANQRSDEWVDRYRAEAPRLGLPT
Sbjct 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
>gi|289748778|ref|ZP_06508156.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689365|gb|EFD56794.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|339293353|gb|AEJ45464.1| hypothetical protein CCDC5079_0274 [Mycobacterium tuberculosis
CCDC5079]
gi|339297000|gb|AEJ49110.1| hypothetical protein CCDC5180_0273 [Mycobacterium tuberculosis
CCDC5180]
Length=263
Score = 531 bits (1369), Expect = 3e-149, Method: Compositional matrix adjust.
Identities = 262/263 (99%), Positives = 263/263 (100%), Gaps = 0/263 (0%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
+RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ
Sbjct 1 MRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 60
Query 65 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGL 124
LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGL
Sbjct 61 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGL 120
Query 125 RAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHI 184
RAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHI
Sbjct 121 RAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHI 180
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ
Sbjct 181 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 240
Query 245 ANQRSDEWVDRYRAEAPRLGLPT 267
ANQRSDEWVDRYRAEAPRLGLPT
Sbjct 241 ANQRSDEWVDRYRAEAPRLGLPT 263
>gi|323721312|gb|EGB30367.1| hypothetical protein TMMG_03972 [Mycobacterium tuberculosis CDC1551A]
Length=238
Score = 446 bits (1146), Expect = 2e-123, Method: Compositional matrix adjust.
Identities = 221/223 (99%), Positives = 221/223 (99%), Gaps = 0/223 (0%)
Query 45 GMAPQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWN 104
G PQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWN
Sbjct 16 GWPPQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWN 75
Query 105 QTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRG 164
QTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRG
Sbjct 76 QTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRG 135
Query 165 HPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVAS 224
HPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVAS
Sbjct 136 HPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVAS 195
Query 225 VLDAIGQDPKLAPAPMLERQANQRSDEWVDRYRAEAPRLGLPT 267
VLDAIGQDPKLAPAPMLERQANQRSDEWVDRYRAEAPRLGLPT
Sbjct 196 VLDAIGQDPKLAPAPMLERQANQRSDEWVDRYRAEAPRLGLPT 238
>gi|41408216|ref|NP_961052.1| hypothetical protein MAP2118 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254774783|ref|ZP_05216299.1| hypothetical protein MaviaA2_08940 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41396571|gb|AAS04435.1| hypothetical protein MAP_2118 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336461662|gb|EGO40525.1| hypothetical protein MAPs_28240 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=267
Score = 416 bits (1068), Expect = 2e-114, Method: Compositional matrix adjust.
Identities = 199/267 (75%), Positives = 227/267 (86%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+ + YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLPST APQPREWFAGVDD+
Sbjct 1 MTNSPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPSTSQAPQPREWFAGVDDE 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL LLDPLD GTPD A P WR ++RT GRTPNGVWGGKLMWNQT LL RA LPDRS
Sbjct 61 SILSLLDPLDAGTPDLAPPEIWRSYIRTVGRTPNGVWGGKLMWNQTPLLLDRAKNLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGLRAAIRDVIG EP+ +HV+RPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGA
Sbjct 121 GDGLRAAIRDVIGEEPLLIHVYRPDVVSQAVSFWRAVQTRVWRGRPDPARDARATYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH++ LR QE GWR WFAEE + P++I YPVLWRNLT +VA++L+ +G DP+LAP P+
Sbjct 181 IAHVVTMLRAQEEGWRNWFAEEDLKPMEIPYPVLWRNLTQVVAAILEQLGLDPQLAPEPV 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
LERQA+ RSDEWVDRYRA+A + GLPT
Sbjct 241 LERQADHRSDEWVDRYRADAEKYGLPT 267
>gi|108797369|ref|YP_637566.1| hypothetical protein Mmcs_0389 [Mycobacterium sp. MCS]
gi|119866453|ref|YP_936405.1| hypothetical protein Mkms_0398 [Mycobacterium sp. KMS]
gi|126432990|ref|YP_001068681.1| hypothetical protein Mjls_0377 [Mycobacterium sp. JLS]
gi|108767788|gb|ABG06510.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692542|gb|ABL89615.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126232790|gb|ABN96190.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=267
Score = 414 bits (1063), Expect = 1e-113, Method: Compositional matrix adjust.
Identities = 197/260 (76%), Positives = 227/260 (88%), Gaps = 0/260 (0%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP T APQPREWFA ++D++IL+LLD
Sbjct 8 YLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPETSQAPQPREWFADIEDESILRLLD 67
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAA 127
PLD G PD A WR+++RT GRTPNGVWGGKLMWNQT LL RA LPDRSGDGL +A
Sbjct 68 PLDEGKPDLAPATIWRDYIRTVGRTPNGVWGGKLMWNQTPLLLDRAKDLPDRSGDGLLSA 127
Query 128 IRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRN 187
IRDV+G++PV VHV+RPDV+SQAVSFWRAVQT+VWRG PDP RD++A YHAGAIAH++R
Sbjct 128 IRDVVGSDPVLVHVYRPDVISQAVSFWRAVQTRVWRGRPDPNRDARAEYHAGAIAHVVRM 187
Query 188 LRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQANQ 247
LR QE+GWR WFAEE ++PID+ YPVLWRNLT +VA VLD +GQDP+LAPAP+LERQA+Q
Sbjct 188 LRAQEDGWRNWFAEENVEPIDVPYPVLWRNLTQVVADVLDRLGQDPRLAPAPVLERQADQ 247
Query 248 RSDEWVDRYRAEAPRLGLPT 267
RSDEWVDRYRA+A R GLPT
Sbjct 248 RSDEWVDRYRADAERDGLPT 267
>gi|254819362|ref|ZP_05224363.1| hypothetical protein MintA_05530 [Mycobacterium intracellulare
ATCC 13950]
Length=267
Score = 411 bits (1056), Expect = 6e-113, Method: Compositional matrix adjust.
Identities = 196/267 (74%), Positives = 227/267 (86%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+ YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T +PQPREWFAGVDD+
Sbjct 1 MTNRPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPATSQSPQPREWFAGVDDE 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL LLDPLD GTPD A P WR ++RT GRTPNGVWGGKLMWNQT LL RA LPDRS
Sbjct 61 SILNLLDPLDAGTPDLAPPEIWRAYIRTVGRTPNGVWGGKLMWNQTPLLLDRAKNLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGL AAIRDV+G +P+ +HV+RPDV+SQAVSFWRAVQT+VWRG PDP RD++A YHAGA
Sbjct 121 GDGLLAAIRDVVGEDPLLIHVYRPDVISQAVSFWRAVQTRVWRGRPDPARDARATYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH++ LR QE GWR WFA+EGI P++I YPVLWRNLT +VAS+L+A+G DP+LAP P+
Sbjct 181 IAHVVTMLRAQEEGWRNWFAQEGITPMEIPYPVLWRNLTQVVASILEALGLDPQLAPEPV 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
LERQA+ RSDEWVDRYRA+A + GLPT
Sbjct 241 LERQADHRSDEWVDRYRADAEKRGLPT 267
>gi|342857467|ref|ZP_08714123.1| hypothetical protein MCOL_01285 [Mycobacterium colombiense CECT
3035]
gi|342134800|gb|EGT87966.1| hypothetical protein MCOL_01285 [Mycobacterium colombiense CECT
3035]
Length=267
Score = 409 bits (1052), Expect = 2e-112, Method: Compositional matrix adjust.
Identities = 196/267 (74%), Positives = 227/267 (86%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+++ YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLPST +PQPREWFAGV+D+
Sbjct 1 MTKSPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPSTSQSPQPREWFAGVEDE 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL LLDPLD GT D A P WR ++RT GRTPNGVWGGKLMWNQT LL RA LPDRS
Sbjct 61 SILNLLDPLDVGTRDLAPPEIWRAYIRTVGRTPNGVWGGKLMWNQTPLLLDRAKNLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGL AAI DV+G +P+ +HV+RPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGA
Sbjct 121 GDGLLAAITDVVGEQPLLIHVYRPDVVSQAVSFWRAVQTRVWRGRPDPARDARATYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH++ LR QE GWR WFAEE I P+DI+YPVLWRNLT +V S+L+A+G DP+LAP P+
Sbjct 181 IAHVVTMLRAQEEGWRNWFAEEDIKPMDISYPVLWRNLTQVVGSILEALGLDPQLAPDPV 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
LERQA+QRSDEWVDRYRA+A + GLPT
Sbjct 241 LERQADQRSDEWVDRYRADAEKHGLPT 267
>gi|118463655|ref|YP_881275.1| sulfotransferase [Mycobacterium avium 104]
gi|118164942|gb|ABK65839.1| putative sulfotransferase [Mycobacterium avium 104]
Length=258
Score = 408 bits (1049), Expect = 4e-112, Method: Compositional matrix adjust.
Identities = 195/258 (76%), Positives = 222/258 (87%), Gaps = 0/258 (0%)
Query 10 VLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLDPL 69
+LA+QRSGSTLLVESLRATG AGEPQEFFQYLPST APQPREWFAGVDD++IL LLDPL
Sbjct 1 MLASQRSGSTLLVESLRATGVAGEPQEFFQYLPSTSQAPQPREWFAGVDDESILSLLDPL 60
Query 70 DPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAAIR 129
D GTPD A P WR ++RT GRTPNGVWGGKLMWNQT LL RA LPDRSGDGLRAAIR
Sbjct 61 DAGTPDLAPPEIWRSYIRTVGRTPNGVWGGKLMWNQTPLLLDRAKNLPDRSGDGLRAAIR 120
Query 130 DVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRNLR 189
DVIG EP+ +HV+RPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGAIAH++ LR
Sbjct 121 DVIGEEPLLIHVYRPDVVSQAVSFWRAVQTRVWRGRPDPARDARATYHAGAIAHVVTMLR 180
Query 190 DQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQANQRS 249
QE GWR WFAEE + P++I YPVLWRNLT +VA++L+ +G DP+LAP P+LERQA+ RS
Sbjct 181 AQEEGWRNWFAEEDLKPMEIPYPVLWRNLTQVVAAILEQLGLDPQLAPEPVLERQADHRS 240
Query 250 DEWVDRYRAEAPRLGLPT 267
DEWVDRYRA+A + GLPT
Sbjct 241 DEWVDRYRADAEKYGLPT 258
>gi|296166331|ref|ZP_06848768.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898340|gb|EFG77909.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=267
Score = 407 bits (1046), Expect = 8e-112, Method: Compositional matrix adjust.
Identities = 194/260 (75%), Positives = 223/260 (86%), Gaps = 0/260 (0%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T APQPREWFAGV+D++IL LLD
Sbjct 8 YLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPATSQAPQPREWFAGVEDESILSLLD 67
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAA 127
PLD GTPD A WR+++RT GRTPNG+WGGKLMWNQT LL +RA LP+RSGDGL AA
Sbjct 68 PLDAGTPDLAPAEIWRDYIRTVGRTPNGIWGGKLMWNQTPLLLKRAKNLPNRSGDGLLAA 127
Query 128 IRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRN 187
IRDV+G EP+ ++VHRPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGAIAH++
Sbjct 128 IRDVVGEEPLLIYVHRPDVVSQAVSFWRAVQTRVWRGRPDPLRDARATYHAGAIAHVVTM 187
Query 188 LRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQANQ 247
LR Q+ GWR WFAEE I P+DI YPVLWRNLT VA +L A+G DP+LAP P+LERQA+Q
Sbjct 188 LRAQDEGWRNWFAEENITPMDIPYPVLWRNLTEAVAGILSALGLDPRLAPEPVLERQADQ 247
Query 248 RSDEWVDRYRAEAPRLGLPT 267
RSDEWVDRYRA+A + GLPT
Sbjct 248 RSDEWVDRYRADAEKYGLPT 267
>gi|240169052|ref|ZP_04747711.1| hypothetical protein MkanA1_07049 [Mycobacterium kansasii ATCC
12478]
Length=267
Score = 405 bits (1042), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 194/259 (75%), Positives = 225/259 (87%), Gaps = 0/259 (0%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T PQPREWFAGVDD++IL+LLD
Sbjct 8 YLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPTTSQPPQPREWFAGVDDESILRLLD 67
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAA 127
PLD G PD A WR+++RT GRTPNGVWGGKLMWNQT LL RA++LPDRSG+GL+AA
Sbjct 68 PLDDGKPDLAPAEIWRDYIRTVGRTPNGVWGGKLMWNQTPLLVNRASELPDRSGEGLKAA 127
Query 128 IRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRN 187
IRDV+G P V+V+RPDVVSQAVSFWRAVQT+VWRG PDP RD++AVYHAGAIAH++
Sbjct 128 IRDVVGENPFLVYVYRPDVVSQAVSFWRAVQTRVWRGRPDPVRDARAVYHAGAIAHVVTM 187
Query 188 LRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQANQ 247
LR QE GWR+WF EE I P++IAYPVLWRNLT +V ++L+A+G DP+LAPAP LERQA+Q
Sbjct 188 LRAQEAGWRSWFVEENITPMEIAYPVLWRNLTELVGTILEALGLDPRLAPAPALERQADQ 247
Query 248 RSDEWVDRYRAEAPRLGLP 266
RSDEWVDRYRA+A R GLP
Sbjct 248 RSDEWVDRYRADAERDGLP 266
>gi|169628636|ref|YP_001702285.1| hypothetical protein MAB_1546c [Mycobacterium abscessus ATCC
19977]
gi|169240603|emb|CAM61631.1| Conserved hypothetical protein (sulfotransferase?) [Mycobacterium
abscessus]
Length=267
Score = 404 bits (1038), Expect = 8e-111, Method: Compositional matrix adjust.
Identities = 194/259 (75%), Positives = 225/259 (87%), Gaps = 0/259 (0%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T M+PQPREWFA V D++IL+LLD
Sbjct 8 YLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPTTSMSPQPREWFADVQDESILRLLD 67
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAA 127
PLD G PD A WR+++RT GRTPNG+WGGKLMWNQT LL RA LPDRSG+GL AA
Sbjct 68 PLDEGKPDLAPATIWRDYIRTVGRTPNGIWGGKLMWNQTPLLLNRAQGLPDRSGEGLLAA 127
Query 128 IRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRN 187
IRDVIG++PV VHV+RPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGAIAH+I
Sbjct 128 IRDVIGSDPVLVHVYRPDVVSQAVSFWRAVQTRVWRGRPDPVRDARAEYHAGAIAHVITM 187
Query 188 LRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQANQ 247
L+ QE GWR WFAEE I+PIDI+YP LWRNLT +V +VL+A+GQDP+LAP P+LERQA+Q
Sbjct 188 LQAQETGWRRWFAEENIEPIDISYPYLWRNLTEVVGTVLEALGQDPRLAPPPVLERQADQ 247
Query 248 RSDEWVDRYRAEAPRLGLP 266
RSD+WVDRYRA+A + GLP
Sbjct 248 RSDDWVDRYRADAEKEGLP 266
>gi|183984227|ref|YP_001852518.1| hypothetical protein MMAR_4255 [Mycobacterium marinum M]
gi|183177553|gb|ACC42663.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=267
Score = 401 bits (1030), Expect = 6e-110, Method: Compositional matrix adjust.
Identities = 195/267 (74%), Positives = 225/267 (85%), Gaps = 0/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+ + YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T PQPREWFAGV+D
Sbjct 1 MADSPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPTTSQPPQPREWFAGVEDA 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL+LLDPLD G PD A P WR++VRT GRTPNGVWGGKLMWNQT LL RA QLP+RS
Sbjct 61 SILRLLDPLDEGKPDLAPPEIWRDYVRTVGRTPNGVWGGKLMWNQTPLLLNRARQLPNRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGL AAIRDVIG P+ V+V+RPDVVSQAVSFWRAVQT VWRGHPDP RD++A YHAGA
Sbjct 121 GDGLSAAIRDVIGENPLLVYVYRPDVVSQAVSFWRAVQTGVWRGHPDPARDARASYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH++ LR QE GWR+WFAEEGI P++I+YPVLWRNLT +V ++L+A+G D +LAP
Sbjct 181 IAHVVSMLRAQEQGWRSWFAEEGIAPMEISYPVLWRNLTELVGNILEALGLDARLAPTAP 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLPT 267
L RQA++RSDEWVDRYRA+A R GLPT
Sbjct 241 LVRQADERSDEWVDRYRADAERAGLPT 267
>gi|118467454|ref|YP_885041.1| hypothetical protein MSMEG_0630 [Mycobacterium smegmatis str.
MC2 155]
gi|118168741|gb|ABK69637.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=267
Score = 400 bits (1028), Expect = 1e-109, Method: Compositional matrix adjust.
Identities = 192/266 (73%), Positives = 224/266 (85%), Gaps = 0/266 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
MS YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T M+PQPREWFA V+D
Sbjct 1 MSDHPTAYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPNTSMSPQPREWFADVEDQ 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL+LLDPL G PD A WR++++T GRTPNGVWGGKLMWNQT LL QRA LPDRS
Sbjct 61 SILRLLDPLIEGKPDLAPATIWRDYIQTVGRTPNGVWGGKLMWNQTPLLVQRAKDLPDRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
G GL +AIRDV+G++PV +H+HRPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGA
Sbjct 121 GSGLLSAIRDVVGSDPVLIHIHRPDVVSQAVSFWRAVQTRVWRGRPDPVRDARAEYHAGA 180
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH+I LR QE GWRAWF EE ++PID+ YP LWRNLT +V +VL+A+GQDP+LAP P+
Sbjct 181 IAHVITMLRAQEEGWRAWFTEENVEPIDVDYPYLWRNLTEVVGTVLEALGQDPRLAPKPV 240
Query 241 LERQANQRSDEWVDRYRAEAPRLGLP 266
LERQA+QRSDEWV+RYR +A R GLP
Sbjct 241 LERQADQRSDEWVERYRRDAQRDGLP 266
>gi|51247647|pdb|1TEX|A Chain A, Mycobacterium Smegmatis Stf0 Sulfotransferase With Trehalose
gi|51247648|pdb|1TEX|B Chain B, Mycobacterium Smegmatis Stf0 Sulfotransferase With Trehalose
gi|51247649|pdb|1TEX|C Chain C, Mycobacterium Smegmatis Stf0 Sulfotransferase With Trehalose
gi|51247650|pdb|1TEX|D Chain D, Mycobacterium Smegmatis Stf0 Sulfotransferase With Trehalose
Length=287
Score = 399 bits (1026), Expect = 2e-109, Method: Compositional matrix adjust.
Identities = 192/266 (73%), Positives = 224/266 (85%), Gaps = 0/266 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
MS YLVLA+QRSGSTLLVESLRATG AGEPQEFFQYLP+T M+PQPREWFA V+D
Sbjct 21 MSDHPTAYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPNTSMSPQPREWFADVEDQ 80
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
+IL+LLDPL G PD A WR++++T GRTPNGVWGGKLMWNQT LL QRA LPDRS
Sbjct 81 SILRLLDPLIEGKPDLAPATIWRDYIQTVGRTPNGVWGGKLMWNQTPLLVQRAKDLPDRS 140
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
G GL +AIRDV+G++PV +H+HRPDVVSQAVSFWRAVQT+VWRG PDP RD++A YHAGA
Sbjct 141 GSGLLSAIRDVVGSDPVLIHIHRPDVVSQAVSFWRAVQTRVWRGRPDPVRDARAEYHAGA 200
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM 240
IAH+I LR QE GWRAWF EE ++PID+ YP LWRNLT +V +VL+A+GQDP+LAP P+
Sbjct 201 IAHVITMLRAQEEGWRAWFTEENVEPIDVDYPYLWRNLTEVVGTVLEALGQDPRLAPKPV 260
Query 241 LERQANQRSDEWVDRYRAEAPRLGLP 266
LERQA+QRSDEWV+RYR +A R GLP
Sbjct 261 LERQADQRSDEWVERYRRDAQRDGLP 286
>gi|111018275|ref|YP_701247.1| hypothetical protein RHA1_ro01265 [Rhodococcus jostii RHA1]
gi|110817805|gb|ABG93089.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=267
Score = 301 bits (772), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/262 (60%), Positives = 186/262 (71%), Gaps = 1/262 (0%)
Query 6 RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQL 65
R YLV A+QRSGSTLLVESLRAT AG P+EFFQYLPST +PQPR+WF GV D+ +L L
Sbjct 5 RSYLVCASQRSGSTLLVESLRATTVAGNPEEFFQYLPSTSRSPQPRQWFEGVTDEAVLSL 64
Query 66 LDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR 125
L PL+PGT DT T WR+ + + GRTPNGVWGGKLMWNQ L+ RAA LPDRSGD LR
Sbjct 65 LAPLEPGTADTRTAEQWRDQLLSLGRTPNGVWGGKLMWNQVPLVLDRAAGLPDRSGDDLR 124
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKR-DSQAVYHAGAIAHI 184
+A+ D++G + F+HV+R DVV+QAVS WRAVQTQVWR P A YHAG IAH+
Sbjct 125 SALDDILGGDLAFIHVYRRDVVAQAVSMWRAVQTQVWRDDATPPAPHDGAEYHAGGIAHL 184
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
+R LRDQ+ WR WF EG+D IDI + L A A VL +G D LAP P L+RQ
Sbjct 185 VRILRDQDEQWRNWFEVEGLDHIDIGFDDLVAAPQATAAKVLVELGLDADLAPPPPLKRQ 244
Query 245 ANQRSDEWVDRYRAEAPRLGLP 266
++ RS EW +RY ++A GLP
Sbjct 245 SDGRSKEWAERYLSDATANGLP 266
>gi|226360399|ref|YP_002778177.1| hypothetical protein ROP_09850 [Rhodococcus opacus B4]
gi|226238884|dbj|BAH49232.1| hypothetical protein [Rhodococcus opacus B4]
Length=267
Score = 295 bits (756), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 155/262 (60%), Positives = 183/262 (70%), Gaps = 1/262 (0%)
Query 6 RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQL 65
R YLV A+QRSGSTLLVESLRAT AG P+EFFQYLPST +PQPR+WF V D+T+L L
Sbjct 5 RSYLVCASQRSGSTLLVESLRATTVAGNPEEFFQYLPSTSRSPQPRQWFESVTDETVLSL 64
Query 66 LDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR 125
L PL+PGT DT T WR+ + GRTPNGVWGGKLMWNQ L+ RAA LPDRSGD LR
Sbjct 65 LAPLEPGTADTRTAEQWRDQLLNVGRTPNGVWGGKLMWNQVPLVLDRAAGLPDRSGDDLR 124
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKR-DSQAVYHAGAIAHI 184
+A+ D++G + VF+HV R DVV+QAVS WRAVQTQVWR P A YHA IAH+
Sbjct 125 SALGDILGRDLVFIHVFRRDVVAQAVSMWRAVQTQVWRDDATPPTPHDGAEYHADGIAHL 184
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
+ LRDQ+ WR WF EG+D IDI + L A A VL +G D LAP P L++Q
Sbjct 185 VGILRDQDVQWRNWFETEGLDHIDIGFDDLVAAPQATAAKVLVELGLDADLAPPPPLKQQ 244
Query 245 ANQRSDEWVDRYRAEAPRLGLP 266
++ RS EW RYR+EA GLP
Sbjct 245 SDGRSREWALRYRSEAAANGLP 266
>gi|226303617|ref|YP_002763575.1| hypothetical protein RER_01280 [Rhodococcus erythropolis PR4]
gi|226182732|dbj|BAH30836.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=269
Score = 290 bits (741), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 151/267 (57%), Positives = 186/267 (70%), Gaps = 1/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+ A R +LV A+QRSGSTLLVESLRATG AGEP+EFFQYLP T +PQPR+WF V D+
Sbjct 1 MTEAQRSFLVCASQRSGSTLLVESLRATGVAGEPEEFFQYLPETSRSPQPRQWFEDVTDE 60
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
++L LL P PGTPDT T WR + GRTPNGVWGGKLMWNQT LL RAA LP RS
Sbjct 61 SVLGLLAPFHPGTPDTRTSEQWRTQLLELGRTPNGVWGGKLMWNQTPLLLDRAAGLPWRS 120
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQ-AVYHAG 179
G LR+A+ D + ++ F+HV+R DVV+QAVS WRAVQTQVWR P S A Y+A
Sbjct 121 GTDLRSALHDTLDHDLQFIHVYREDVVAQAVSMWRAVQTQVWRDDATPPNLSDGAQYNAL 180
Query 180 AIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAP 239
IAH++ L +QE W+ WF EE I PI+I + L + ++VA L ++G D +LAP P
Sbjct 181 GIAHLVTILGEQERQWKRWFEEEDISPIEIGFRDLTEDPQSVVAKTLISLGLDGQLAPPP 240
Query 240 MLERQANQRSDEWVDRYRAEAPRLGLP 266
L RQ++ RS EWV RYR +A + G P
Sbjct 241 PLRRQSDGRSREWVQRYRIDAEQNGYP 267
>gi|229492798|ref|ZP_04386596.1| Stf0 sulphotransferase [Rhodococcus erythropolis SK121]
gi|229320238|gb|EEN86061.1| Stf0 sulphotransferase [Rhodococcus erythropolis SK121]
Length=281
Score = 289 bits (739), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 150/267 (57%), Positives = 186/267 (70%), Gaps = 1/267 (0%)
Query 1 MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDD 60
M+ A R +LV A+QRSGSTLLVESLRATG AGEP+EFFQYLP T +PQPR+WF V D+
Sbjct 13 MTEAQRSFLVCASQRSGSTLLVESLRATGVAGEPEEFFQYLPETSRSPQPRQWFEDVTDE 72
Query 61 TILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
++L LL P PGTPDT T WR + GRTPNGVWGGKLMWNQT LL RAA LP RS
Sbjct 73 SVLGLLAPFHPGTPDTRTSEQWRTQLLELGRTPNGVWGGKLMWNQTPLLLDRAAGLPWRS 132
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQ-AVYHAG 179
G LR+A+ D + ++ F+HV+R DVV+QAVS WRAVQTQVWR P S A Y+A
Sbjct 133 GTDLRSALHDTLDHDLQFIHVYREDVVAQAVSMWRAVQTQVWRDDATPPNLSDGAQYNAV 192
Query 180 AIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAP 239
IAH++ L +QE W+ WF EE I PI++ + L + ++VA L ++G D +LAP P
Sbjct 193 GIAHLVTILGEQERQWKRWFEEEDISPIEVGFRDLTEDPQSVVAKTLISLGLDGQLAPPP 252
Query 240 MLERQANQRSDEWVDRYRAEAPRLGLP 266
L RQ++ RS EWV RYR +A + G P
Sbjct 253 PLRRQSDGRSREWVQRYRIDAEQNGYP 279
>gi|343926960|ref|ZP_08766450.1| hypothetical protein GOALK_075_00060 [Gordonia alkanivorans NBRC
16433]
gi|343763119|dbj|GAA13376.1| hypothetical protein GOALK_075_00060 [Gordonia alkanivorans NBRC
16433]
Length=269
Score = 268 bits (684), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 155/262 (60%), Positives = 181/262 (70%), Gaps = 8/262 (3%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLV A+QRSGSTLLVESL AT AG P+EFFQY S+ +PQPREWFAGV D TIL+LLD
Sbjct 12 YLVCASQRSGSTLLVESLSATEVAGTPEEFFQYFVSSSQSPQPREWFAGVTDPTILELLD 71
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDG-LRA 126
P+DPGT DT WR + +GR+ NGVWGGKLMWNQT LL R+ R+G G LR
Sbjct 72 PVDPGTVDTRDSEIWRADILAAGRSANGVWGGKLMWNQTPLLIARS-----RAGSGSLRT 126
Query 127 AIRDVI-GNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRG-HPDPKRDSQAVYHAGAIAHI 184
AIR + G +PV+VHV+R DVV QAVS WRAVQT+VWR D D AVYHA IAH+
Sbjct 127 AIRWIFDGADPVYVHVYRDDVVPQAVSMWRAVQTRVWRNDGSDDDGDDGAVYHAAGIAHL 186
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
LR+QE WR WFA EGI+P+DI + L + T A VL+ IGQDP LAP P L+ Q
Sbjct 187 AGLLREQERQWRNWFAAEGIEPLDIEFRDLVNDPTKAAARVLEKIGQDPALAPPPPLKPQ 246
Query 245 ANQRSDEWVDRYRAEAPRLGLP 266
+N RS EW RYR +A R G P
Sbjct 247 SNSRSKEWAQRYREDAERNGYP 268
>gi|296141012|ref|YP_003648255.1| Stf0 sulfotransferase [Tsukamurella paurometabola DSM 20162]
gi|296029146|gb|ADG79916.1| Stf0 sulfotransferase [Tsukamurella paurometabola DSM 20162]
Length=265
Score = 258 bits (660), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 155/263 (59%), Positives = 183/263 (70%), Gaps = 9/263 (3%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLV A+QRSGSTLLVESL ATG AG PQEFFQY PS+ ++PQPREWFAGVDD +L LLD
Sbjct 7 YLVCASQRSGSTLLVESLAATGVAGNPQEFFQYFPSSSLSPQPREWFAGVDDPDLLALLD 66
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDG-LRA 126
P + GT DT T WR V TSGRT NGVWGGKLMWNQT +L R R G LR
Sbjct 67 PTEAGTVDTRTQEQWRADVLTSGRTSNGVWGGKLMWNQTPILISRT-----RVASGSLRT 121
Query 127 AIRDVI-GNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDP--KRDSQAVYHAGAIAH 183
A+R + G +PV+VHV RPDVV QAVS WRAVQT+ WR PD +RD +AVY A IAH
Sbjct 122 AVRSLFDGADPVYVHVFRPDVVPQAVSMWRAVQTRTWRDDPDHDRERDERAVYRAEGIAH 181
Query 184 IIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLER 243
+ L +QE WRAWFA E I+P++I + L + A VL+A+GQDP LAP P L+
Sbjct 182 LAGILLEQERAWRAWFAAEAIEPLEIDFTELIADPRTSTARVLEALGQDPALAPPPPLKP 241
Query 244 QANQRSDEWVDRYRAEAPRLGLP 266
Q+N+RS EW RYRA+A + G P
Sbjct 242 QSNERSKEWAQRYRADAAQNGYP 264
>gi|262201523|ref|YP_003272731.1| Stf0 sulfotransferase [Gordonia bronchialis DSM 43247]
gi|262084870|gb|ACY20838.1| Stf0 sulphotransferase [Gordonia bronchialis DSM 43247]
Length=263
Score = 254 bits (648), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 148/261 (57%), Positives = 172/261 (66%), Gaps = 7/261 (2%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
YLV A+QRSGSTLLVESL TG AG P+EFFQY ++ +PQPREWFAGV D IL LL
Sbjct 7 YLVCASQRSGSTLLVESLAHTGVAGRPEEFFQYFATSSQSPQPREWFAGVTDPEILSLLA 66
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDG-LRA 126
PLD GT D WR V +GRT NGVWGGKLMWNQT LL R R G LR
Sbjct 67 PLDHGTVDIRNTDDWRSDVLAAGRTDNGVWGGKLMWNQTPLLIART-----RVASGSLRT 121
Query 127 AIRDVI-GNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHII 185
AIR + G +PV+VHV+R D+V QAVS WRAVQT+VWR + D AVYHA IAH+
Sbjct 122 AIRSLFDGADPVYVHVYREDIVPQAVSMWRAVQTRVWRDDGGDRSDDGAVYHARGIAHLA 181
Query 186 RNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQA 245
L +QE WR WFA E I+P+DI + L ++ T A VL+AI QDP LAP P L+ Q+
Sbjct 182 GILAEQERQWRKWFAAEEIEPLDIEFVELIKDPTKATARVLEAIRQDPALAPPPPLKPQS 241
Query 246 NQRSDEWVDRYRAEAPRLGLP 266
N RS EW RYR +A R G P
Sbjct 242 NARSKEWAQRYRKDATRNGYP 262
>gi|213861381|ref|ZP_03385851.1| hypothetical protein SentesT_27870 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
Length=86
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 86/86 (100%), Positives = 86/86 (100%), Gaps = 0/86 (0%)
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 180
GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA
Sbjct 1 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGA 60
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDP 206
IAHIIRNLRDQENGWRAWFAEEGIDP
Sbjct 61 IAHIIRNLRDQENGWRAWFAEEGIDP 86
>gi|108805153|ref|YP_645090.1| hypothetical protein Rxyl_2350 [Rubrobacter xylanophilus DSM
9941]
gi|108766396|gb|ABG05278.1| conserved hypothetical protein [Rubrobacter xylanophilus DSM
9941]
Length=284
Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/273 (33%), Positives = 126/273 (47%), Gaps = 27/273 (9%)
Query 9 LVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLDP 68
++ AT RSGSTLL E LR TG AG P+E FQ L TG +P ++F +D + LLD
Sbjct 1 MICATPRSGSTLLCEGLRGTGIAGRPEEHFQMLQETGRPRRPGDYFQRSNDPDVWVLLD- 59
Query 69 LDPGTPDT-------ATPVAWRE------------HVRTSGRTPNGVWGGKLMWNQTALL 109
DPG D A W E V TPNGV+G K+MW
Sbjct 60 -DPGFRDVFGEERRPANEPTWMEVWGVSRFEELLDRVVAEATTPNGVFGTKIMWAYFRDF 118
Query 110 QQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDP- 168
+ A + G V N +V + R D V QAVS WRA+QT WR D
Sbjct 119 VRLARRSRRAHGASPCEVPGAVFPNLRRYVWIRRRDTVRQAVSLWRALQTWRWRQDADDD 178
Query 169 --KRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVL 226
+R + + AI H+ + + W+ +F G DP+++ Y L R+ + V+
Sbjct 179 PGERGERLRFSFAAIDHLRLRIDEHNAAWQRFFRRCGADPVEVVYEDLVRDYEGTIVRVV 238
Query 227 DAIG---QDPKLAPAPMLERQANQRSDEWVDRY 256
D +G + P ++RQ++ S+EWV RY
Sbjct 239 DEVGIPAPEGVRVLRPRMKRQSDGLSEEWVRRY 271
>gi|56698243|ref|YP_168616.1| hypothetical protein SPO3420 [Ruegeria pomeroyi DSS-3]
gi|56679980|gb|AAV96646.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3]
Length=255
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/263 (34%), Positives = 121/263 (47%), Gaps = 31/263 (11%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ T RSGSTLL L ATG AG+P FF+ Q +W+A L +
Sbjct 8 YIICGTPRSGSTLLCGYLAATGAAGDPDSFFR--------TQSIDWWA-----RYWGLPE 54
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLM-WNQTALLQQRAAQLPDRSGDGLRA 126
L PG A+ E GR V+G +LM N +L P GD
Sbjct 55 TLRPGV--VGFDRAYLEAALREGRGETPVFGLRLMRENLGDMLGMLDHLYPGLPGD---T 109
Query 127 AIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPD--------PKRDSQAVYHA 178
A+ + ++H+ R D V+QAVS RA Q+ +W PD P RD VY
Sbjct 110 ALIEAAFGPTRYLHLRRRDKVAQAVSRVRAEQSGLWHIAPDGREIERLAPHRDP--VYDF 167
Query 179 GAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPA 238
AI +R L E GW WF +GI P+ I Y L +V+++L +GQDP+ A
Sbjct 168 DAIDSHVRALETYEAGWTDWFLAQGITPLGIDYEDLANTPIEVVSAILAHLGQDPERAQG 227
Query 239 --PMLERQANQRSDEWVDRYRAE 259
P + + + + S +W RYRA+
Sbjct 228 LTPAVAKLSGEESRDWARRYRAQ 250
>gi|334863090|gb|AEH13561.1| Stf0 sulfotransferase [Shewanella baltica OS117]
Length=257
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 82/265 (31%), Positives = 125/265 (48%), Gaps = 35/265 (13%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ A RSGSTLL + L T AG P FF+ RE F L+
Sbjct 7 YIICAKPRSGSTLLCDLLTDTQVAGCPDSFFR-----------REDF--------LEWAS 47
Query 68 PLDPGTPDTATPVAWREHVRTS----GRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGD 122
D + + + T+ G ++G +LMW L +R A P D
Sbjct 48 YFDVSVTNWGNEQEFDQSYLTAVLQEGTGGTSIFGMRLMWESLGELSKRLASFHPGLPND 107
Query 123 GLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWR-GHPDPKRD----SQA-VY 176
R + V G+ P +VH+ R + V+QAVS +A Q+ +W G +R+ QA +Y
Sbjct 108 NAR--FQAVFGS-PRYVHLTRENKVAQAVSRLKAEQSGLWHLGADGTERERLKFGQAPIY 164
Query 177 HAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLA 236
AG++A I+ L +Q+ W WF ++ ++PI I Y L N A++ VL A+G D +A
Sbjct 165 DAGSLAKIVARLEEQDAAWSNWFVQQEVEPICITYEALSDNPLAVLEVVLAALGLDSAIA 224
Query 237 P--APMLERQANQRSDEWVDRYRAE 259
P + A+ +S EW +R+R E
Sbjct 225 KTVTPRTAKLADSQSREWAERFREE 249
>gi|126173929|ref|YP_001050078.1| hypothetical protein Sbal_1699 [Shewanella baltica OS155]
gi|125997134|gb|ABN61209.1| conserved hypothetical protein [Shewanella baltica OS155]
Length=260
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 82/265 (31%), Positives = 125/265 (48%), Gaps = 35/265 (13%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ A RSGSTLL + L T AG P FF+ RE F L+
Sbjct 10 YIICAKPRSGSTLLCDLLTDTQVAGCPDSFFR-----------REDF--------LEWAS 50
Query 68 PLDPGTPDTATPVAWREHVRTS----GRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGD 122
D + + + T+ G ++G +LMW L +R A P D
Sbjct 51 YFDVSVTNWGNEQEFDQSYLTAVLQEGTGGTSIFGMRLMWESLGELSKRLASFHPGLPND 110
Query 123 GLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWR-GHPDPKRD----SQA-VY 176
R + V G+ P +VH+ R + V+QAVS +A Q+ +W G +R+ QA +Y
Sbjct 111 NAR--FQAVFGS-PRYVHLTRENKVAQAVSRLKAEQSGLWHLGADGTERERLKFGQAPIY 167
Query 177 HAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLA 236
AG++A I+ L +Q+ W WF ++ ++PI I Y L N A++ VL A+G D +A
Sbjct 168 DAGSLAKIVARLEEQDAAWSNWFVQQEVEPICITYEALSDNPLAVLEVVLAALGLDSAIA 227
Query 237 P--APMLERQANQRSDEWVDRYRAE 259
P + A+ +S EW +R+R E
Sbjct 228 KTVTPRTAKLADSQSREWAERFREE 252
>gi|304409802|ref|ZP_07391422.1| hypothetical protein Sbal183DRAFT_1258 [Shewanella baltica OS183]
gi|304352320|gb|EFM16718.1| hypothetical protein Sbal183DRAFT_1258 [Shewanella baltica OS183]
gi|333819222|gb|AEG11888.1| Stf0 sulfotransferase [Shewanella baltica BA175]
Length=260
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/265 (31%), Positives = 120/265 (46%), Gaps = 35/265 (13%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ AT RSGSTLL + L T AG P FF+ RE F L+
Sbjct 10 YIICATPRSGSTLLCDLLTDTQVAGCPDSFFR-----------REDF--------LEWAS 50
Query 68 PLDPGTPDTATPVAWREHVRTS----GRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGD 122
D + + + T+ G ++G +LMW L +R A P D
Sbjct 51 YFDVSVTNWGNEQEFDQSYLTAVLQEGTGGTSIFGMRLMWESLGELSKRLASFHPGLPND 110
Query 123 GLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDP------KRDSQAVY 176
R + V G+ P +VH+ R + V+QAVS +A Q+ +W D K VY
Sbjct 111 NAR--FQAVFGS-PRYVHLIRENKVAQAVSRLKAEQSGLWHLGADGTERERLKFGQAPVY 167
Query 177 HAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLA 236
G++A I+ L +Q+ W WF ++ ++PI I Y L N ++ VL A+G D +A
Sbjct 168 DTGSLAKIVARLEEQDAAWSNWFVQQEVEPICITYEALSDNPLVVLEVVLAALGLDTAIA 227
Query 237 P--APMLERQANQRSDEWVDRYRAE 259
P + A+ +S EW +R+R E
Sbjct 228 KTVTPRTAKLADSQSREWAERFREE 252
>gi|222149388|ref|YP_002550345.1| hypothetical protein Avi_3258 [Agrobacterium vitis S4]
gi|221736371|gb|ACM37334.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length=256
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 74/264 (29%), Positives = 122/264 (47%), Gaps = 27/264 (10%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
+ Y++ + RSGSTLL L ATG AG+P+ +F + T +W + ++ D
Sbjct 4 FQSYVICTSPRSGSTLLCNMLAATGVAGKPKSYFHHGSIT-------DWLSYLNLD---- 52
Query 65 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGDG 123
++P P++ A E +GR G++G +L + L ++ A L P + D
Sbjct 53 ----INPSLPESDLLAAIFEAAVETGRNGTGLFGLRLQRHSFDLFVEKLAVLYPTLASD- 107
Query 124 LRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQA------VYH 177
R I G+ +F+H+ R D V QAVS+ +A QT +W PD + +Y
Sbjct 108 -RQRIEAAFGS-TLFIHLTRLDKVQQAVSYVKAQQTGLWHRAPDGTELERLSAPRDPIYD 165
Query 178 AGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAP 237
A I + W++WF +GI P+ I Y L + A + VL +G + + A
Sbjct 166 AAKIHASYDEFIQYDRAWQSWFDMQGIQPLRITYEALCADPIASLKDVLVQLGVNGEAAS 225
Query 238 A--PMLERQANQRSDEWVDRYRAE 259
+ P + A+ + W R+R E
Sbjct 226 SVVPGTAKLADGINQSWETRFRTE 249
>gi|334316995|ref|YP_004549614.1| hypothetical protein Sinme_2280 [Sinorhizobium meliloti AK83]
gi|333812359|gb|AEG05028.1| hypothetical protein SinmeB_2124 [Sinorhizobium meliloti BL225C]
gi|334095989|gb|AEG54000.1| hypothetical protein Sinme_2280 [Sinorhizobium meliloti AK83]
gi|336032302|gb|AEH78234.1| hypothetical protein SM11_chr0957 [Sinorhizobium meliloti SM11]
Length=266
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/261 (29%), Positives = 108/261 (42%), Gaps = 27/261 (10%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ + RSGSTLL + L ATG +G P +F P EW A +
Sbjct 20 YVICTSPRSGSTLLCKLLAATGISGNPGSYFH-------RPSIAEWLAYFEPAA------ 66
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGDGLRA 126
D P+ G ++G +L + Q+ A L P+RS D R
Sbjct 67 --DASRPEADILATIFRAAIAKGSGDTSMFGLRLQRHSFDFFVQKLAVLHPERSSDLQR- 123
Query 127 AIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKR------DSQAVYHAGA 180
I G + +F+H+ R D V QAVS +A QT +W PD S VY++
Sbjct 124 -IEAAFG-QTLFLHLTRLDKVEQAVSLVKAEQTGLWHAAPDGTELERTAPPSAPVYNSDE 181
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPA-- 238
I + W WF +GI+P IAY L + + VL +G + A
Sbjct 182 IRTWYERFAAYDQAWNDWFEMQGIEPFRIAYEALSADPLGSLRKVLSRLGLKCEGASGIT 241
Query 239 PMLERQANQRSDEWVDRYRAE 259
P + + A+ + EW R+R E
Sbjct 242 PGVGKLADATNHEWAMRFRLE 262
>gi|297622543|ref|YP_003703977.1| Stf0 sulfotransferase [Truepera radiovictrix DSM 17093]
gi|297163723|gb|ADI13434.1| Stf0 sulfotransferase [Truepera radiovictrix DSM 17093]
Length=251
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/269 (31%), Positives = 118/269 (44%), Gaps = 44/269 (16%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFF------QYLPSTGMAPQPREWFAGVDDDT 61
Y + T RSGS+ L ++L ATG AG P E+F + G+A + E
Sbjct 8 YWLCTTPRSGSSALGDALSATGVAGRPTEYFNRRFWPELFARFGLAGRAEE--------- 58
Query 62 ILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWN-QTALLQQRAAQLPDRS 120
A P R V + +PNGV+G K M + A L +
Sbjct 59 -------------AEAVPDYLRALVFQTA-SPNGVFGVKAMLDADMAPFFAGLRTLRGCA 104
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVW-RGHPDPKRD-SQAVYHA 178
IR V FV++ R + V QAVSFWRA Q+ VW R H D R+ ++A +
Sbjct 105 AHSEAELIRTVFPG-VRFVYLTRRNKVRQAVSFWRAQQSGVWERYHGDAVREGARAHFDF 163
Query 179 GAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPA 238
A++ +++ L +E W+ F P + Y R+ V +LD + +LAP
Sbjct 164 AALSGLVQELSLREARWQELFDALEATPYTVVYEDYVRDPEGTVRGILDFL----ELAPP 219
Query 239 P-------MLERQANQRSDEWVDRYRAEA 260
P +ER A++ SD WV RY AEA
Sbjct 220 PGWSLPRLTMERLADETSDAWVARYLAEA 248
>gi|337265909|ref|YP_004609964.1| Stf0 sulfotransferase [Mesorhizobium opportunistum WSM2075]
gi|336026219|gb|AEH85870.1| Stf0 sulfotransferase [Mesorhizobium opportunistum WSM2075]
Length=250
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 68/269 (26%), Positives = 121/269 (45%), Gaps = 41/269 (15%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQ----PREWFAGVDDDTIL 63
Y++ T R+GSTLL + L +TG +G+P F++ T A + RE ++ DT
Sbjct 5 YIICGTPRTGSTLLCKLLASTGASGDPHSFYRRQDVTEWAQEWKLPARETMGELEFDT-- 62
Query 64 QLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQ----TALLQQRAAQLPDR 119
A+ + +G+ ++G +LM +A+L + P
Sbjct 63 -----------------AYLDAAIAAGKGGTDIFGLRLMRENLDELSAILNR---IFPGL 102
Query 120 SGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDS------Q 173
+ D R G+ +++H+ R D ++QAVS +A QT +W PD Q
Sbjct 103 AADTAR--FEKAFGH-VLYIHLSREDKLAQAVSLVKAQQTGLWHVAPDGTEIERVGVPGQ 159
Query 174 AVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDP 233
A Y I + L + W WFA++G+ P+ I Y L + A + ++ +A+G
Sbjct 160 ARYDFQRIKGELTELEAYDAAWNTWFAKQGVTPLRIGYERLSADPAAALLTICEALGVQA 219
Query 234 KLAPA--PMLERQANQRSDEWVDRYRAEA 260
A A P + + +++ S +W+ R+ +A
Sbjct 220 PDAEAVRPGVAKLSDETSLDWMRRFHVDA 248
>gi|119484874|ref|ZP_01619356.1| hypothetical protein L8106_15415 [Lyngbya sp. PCC 8106]
gi|119457692|gb|EAW38816.1| hypothetical protein L8106_15415 [Lyngbya sp. PCC 8106]
Length=283
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/271 (26%), Positives = 115/271 (43%), Gaps = 46/271 (16%)
Query 6 RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQL 65
+ Y++ +T RSGSTLL + L T AG+PQEFF LP +W DT
Sbjct 5 KTYIICSTMRSGSTLLCDLLTNTKLAGQPQEFF--LP---------QWEKKSKFDT---- 49
Query 66 LDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR 125
T P + + + S + NGV G KLMW + +R + + S
Sbjct 50 ----------TNYP-EYLQKMLESFASSNGVSGVKLMWCNCEYVIRRLHKSSESSSKPDL 98
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQA-------VYHA 178
+++V + FV + R V QA+S R+V+T+ W + D + + +Y++
Sbjct 99 ELLKEVFPDLK-FVFISRRSKVRQAISLARSVKTKQWNKYQDSQNPGKTSFNRYGNIYNS 157
Query 179 ----------GAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDA 228
G + + ++ E+ W +F I+P I Y L +N + ++L
Sbjct 158 QKNPYPYISPGTLEVYLSQIKKDESAWFEFFKNNNIEPQIIIYEELAQNKQKNINNILQF 217
Query 229 --IGQDPKLAPAPMLERQANQRSDEWVDRYR 257
I L ++QA+ +D V +Y+
Sbjct 218 LDIQTLEDLNIDSFFKKQADFYTDFLVLQYQ 248
>gi|118590612|ref|ZP_01548013.1| hypothetical protein SIAM614_05578 [Stappia aggregata IAM 12614]
gi|118436588|gb|EAV43228.1| hypothetical protein SIAM614_05578 [Stappia aggregata IAM 12614]
Length=255
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/269 (30%), Positives = 116/269 (44%), Gaps = 35/269 (13%)
Query 4 AVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTIL 63
A Y++ + RSGSTLL + L AT AG P+ +F P W GV
Sbjct 3 AYSSYILCTSPRSGSTLLCKLLSATDVAGHPRSYFH-------EPSLTAWSEGVGVAAA- 54
Query 64 QLLDPLDPGTPDTATPVAWREHVRTSG----RTPNGVWGGKLMWNQTALLQQRAAQL-PD 118
P P+ +R + + G++G +L + ++ A L PD
Sbjct 55 -------PDEPE----AEFRRRIFAAAIELGTGGTGLFGLRLQRHSFDFFMKQLACLHPD 103
Query 119 RSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDP---KRDS--- 172
D R + V GN +F+H+ R D V QAVSF RA Q+ +W PD +R S
Sbjct 104 APSDLAR--LEAVFGN-TLFIHLTRTDKVEQAVSFVRAEQSGLWHRAPDGTELERLSEPR 160
Query 173 QAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQD 232
+A Y A I E+ W+AWF + I P+ I Y L + A + VL +G
Sbjct 161 EAHYDAAEIRACYERFTRFESDWQAWFESQRIAPLRITYDALSADPQATLRLVLQHLGLK 220
Query 233 PKLAP--APMLERQANQRSDEWVDRYRAE 259
A P + + A+ S +WV R+R +
Sbjct 221 ETAADGVVPGVTKLADATSADWVSRFRVD 249
>gi|15966051|ref|NP_386404.1| hypothetical protein SMc01744 [Sinorhizobium meliloti 1021]
gi|15075321|emb|CAC46877.1| Conserved hypothetical protein [Sinorhizobium meliloti 1021]
Length=213
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/214 (30%), Positives = 88/214 (42%), Gaps = 25/214 (11%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ + RSGSTLL + L ATG +G P +F P EW A +
Sbjct 7 YVICTSPRSGSTLLCKLLAATGISGNPGSYFH-------RPSIAEWLAYFEPAA------ 53
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGDGLRA 126
D P+ G ++G +L + Q+ A L P+RS D R
Sbjct 54 --DASRPEADILATIFRAAIAKGSGDTSMFGLRLQRHSFDFFVQKLAVLHPERSSDLQR- 110
Query 127 AIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKR------DSQAVYHAGA 180
I G + +F+H+ R D V QAVS +A QT +W PD S VY++
Sbjct 111 -IEAAFG-QTLFLHLTRLDKVEQAVSLVKAEQTGLWHAAPDGTELERTAPPSAPVYNSDE 168
Query 181 IAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVL 214
I + W WF +GI+P IAY L
Sbjct 169 IRTWYERFAAYDQAWNDWFEMQGIEPFRIAYEAL 202
>gi|83592960|ref|YP_426712.1| hypothetical protein Rru_A1625 [Rhodospirillum rubrum ATCC 11170]
gi|83575874|gb|ABC22425.1| conserved hypothetical protein [Rhodospirillum rubrum ATCC 11170]
Length=255
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 69/265 (27%), Positives = 114/265 (44%), Gaps = 28/265 (10%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGV-DDDTIL 63
+ Y++ T RSGSTLL L ATG G P F Y + M EW G+ D DT+
Sbjct 2 IASYIICTTPRSGSTLLCRILAATGKTGNPDSF--YHKADFMHEWAVEW--GLPDRDTL- 56
Query 64 QLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQL-PDRSGD 122
T A+ +G+ ++G +L LL + L P D
Sbjct 57 ----------SKTEFARAYLAAALKAGKAGTDLFGLRLQAQYLGLLSETLDHLYPGLPSD 106
Query 123 GLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAV------Y 176
R G + +++H+ R D V+QAVS +A Q+ +W H D + Y
Sbjct 107 AHR--FERAFG-KTLYLHLSRADKVAQAVSLQKAQQSGLWHLHADGTELERLTPSQTPRY 163
Query 177 HAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLA 236
++ +R L + W WF I+P+ ++Y V+ + A+G++P A
Sbjct 164 DFQSLDRQVRALERDDEAWTTWFDRHQINPLRVSYETFVDQPVETVSDICRALGKEPPQA 223
Query 237 PAPM--LERQANQRSDEWVDRYRAE 259
A L++ +++ + EW+ RY+ +
Sbjct 224 TAVRIDLKKLSDEVNLEWIGRYKED 248
>gi|13473251|ref|NP_104818.1| hypothetical protein mll3788 [Mesorhizobium loti MAFF303099]
gi|14023999|dbj|BAB50604.1| mll3788 [Mesorhizobium loti MAFF303099]
Length=257
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/263 (26%), Positives = 118/263 (45%), Gaps = 29/263 (11%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ R+GSTLL + L +TG +G+P F++ ++ EW
Sbjct 12 YIICGAPRTGSTLLCKLLASTGTSGDPHSFYR---RQDLSEWAEEWKL------------ 56
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR-- 125
P + VA+ + +G+ G++G +LM L + +A L DR GL
Sbjct 57 PRRNTMGELEFDVAYLKAAIVAGKGDTGIFGLRLMREN---LDELSAIL-DRILPGLASD 112
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAV------YHAG 179
AA + +++H+ R + ++QA+S +A QT +W PD + Y
Sbjct 113 AARFERAFGRILYIHLSRENKLAQAISLIKAQQTGLWHIAPDGTEIERVAPAQEPHYDFE 172
Query 180 AIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG-QDPKLAPA 238
I + L + W WFA +G+ P+ I Y L + A + ++ +A+G Q P
Sbjct 173 RIKGELAKLEAYDAAWNIWFAAQGLTPLRIGYERLSADPVAALLAICEALGVQQPNAKDI 232
Query 239 -PMLERQANQRSDEWVDRYRAEA 260
P + + A++ S +W+ RY +A
Sbjct 233 RPGVAKLADETSLDWMRRYHLDA 255
>gi|319781100|ref|YP_004140576.1| hypothetical protein Mesci_1366 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
gi|317166988|gb|ADV10526.1| hypothetical protein Mesci_1366 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length=271
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/267 (27%), Positives = 119/267 (45%), Gaps = 37/267 (13%)
Query 8 YLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLD 67
Y++ T R+GSTLL + L +T AG+P F++ + EW + D + L+
Sbjct 26 YIICGTPRTGSTLLCKLLASTKTAGDPHSFYR---RQDVVEWAEEW--KLPDRAAMSELE 80
Query 68 PLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQ----TALLQQRAAQLPDRSGDG 123
A+ + +G+ G++G +LM +A+L + P R D
Sbjct 81 ----------FDAAYLDAAIAAGKGGTGLFGLRLMRENLDELSAILDR---IFPKRPSD- 126
Query 124 LRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPD--------PKRDSQAV 175
RA GN +++H+ R D ++QAVS +A QT +W PD P ++ Q
Sbjct 127 -RARFERAFGN-VLYIHLSREDKLAQAVSLIKAEQTGLWHIAPDGTEIERVAPPKEPQ-- 182
Query 176 YHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG-QDPK 234
Y I + L + W WFA +GI P + Y L N A + + + +G Q P
Sbjct 183 YDFERIRREVAELETYDAAWNIWFAAQGISPHRVGYERLSSNPAATLLGICEVLGVQAPN 242
Query 235 LAPA-PMLERQANQRSDEWVDRYRAEA 260
P + + ++ S +W+ RYR +A
Sbjct 243 ADDVRPGVAKLSDDTSLDWMRRYRLDA 269
>gi|220926780|ref|YP_002502082.1| Stf0 sulfotransferase [Methylobacterium nodulans ORS 2060]
gi|219951387|gb|ACL61779.1| Stf0 sulphotransferase [Methylobacterium nodulans ORS 2060]
Length=235
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 67/258 (26%), Positives = 112/258 (44%), Gaps = 49/258 (18%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
++ Y V RSGS + L +TG G P+E+F G A + +
Sbjct 1 MKGYAVCGAPRSGSNYFCDVLTSTGQLGRPREYF-----NGDARRRYD------------ 43
Query 65 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKL---MWNQTALLQQRAAQLPDRSG 121
DP PD P +H+ T+G TPNGV+ KL ++++ + + LP+ +
Sbjct 44 -----DPSYPD--DPALQIKHILTTGATPNGVYALKLFPGLFDRVSPHLKLTQALPNLT- 95
Query 122 DGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAI 181
FV + R DV+ QA+S+ R++QT +R + Q + I
Sbjct 96 ----------------FVRLRRLDVLGQALSWVRSIQTGQFRSTETANAEPQ--FDGPLI 137
Query 182 AHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQ--DPKLAPAP 239
A + + + W +FA G+ P+++ Y L N V V +G P++ P+
Sbjct 138 ATYLGQVCQRNARWDMYFARTGLRPVEVTYEDLAENPQEAVDQVAGRLGVHPSPRIDPSQ 197
Query 240 -MLERQANQRSDEWVDRY 256
+L RQ++ S EW R+
Sbjct 198 VLLRRQSDAVSAEWRARF 215
>gi|89055278|ref|YP_510729.1| hypothetical protein Jann_2787 [Jannaschia sp. CCS1]
gi|88864827|gb|ABD55704.1| hypothetical protein Jann_2787 [Jannaschia sp. CCS1]
Length=252
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/269 (28%), Positives = 109/269 (41%), Gaps = 39/269 (14%)
Query 4 AVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREW--FAGVDD-- 59
A + Y++ + RSGSTLL L+ G AG P F AP W + G+
Sbjct 3 AFKSYVICTSPRSGSTLLCRLLQDAGIAGCPDSHFH-------APSVDAWCGYYGLSAER 55
Query 60 -DTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTAL-LQQRAAQLP 117
D+ LLD + R GR+ V+G ++ LQQ P
Sbjct 56 FDSRHALLDAIVNAA-----------QARGKGRS--DVFGLRMQRQSIGFFLQQLGLLYP 102
Query 118 DRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQA--- 174
+ D R I G +F+++ R D + QA+S+ +A Q+ +W D +
Sbjct 103 SLTNDKSR--IEAAFGR-TLFIYLTREDKLDQAISYVKAKQSGLWHMAADGTELERLSDP 159
Query 175 ---VYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQ 231
Y A AIA + E W WF E I+P+ + Y L +A VL A+
Sbjct 160 RDPTYDARAIASQLALAEQMEREWEDWFKVEQIEPLRVTYDALSAAPSATRDLVLRALWL 219
Query 232 DP---KLAPAPMLERQANQRSDEWVDRYR 257
D K P P + A+ S +W DR+R
Sbjct 220 DMRTLKDGPPPT-AKLADVTSRDWADRFR 247
>gi|254504448|ref|ZP_05116599.1| Stf0 sulphotransferase superfamily [Labrenzia alexandrii DFL-11]
gi|222440519|gb|EEE47198.1| Stf0 sulphotransferase superfamily [Labrenzia alexandrii DFL-11]
Length=232
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 64/244 (27%), Positives = 102/244 (42%), Gaps = 31/244 (12%)
Query 25 LRATGCAGEPQEFFQYLPSTG--MAPQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAW 82
L TG AG P+ +F + P G + AG+ + L+LL DTA
Sbjct 2 LTETGVAGHPESYF-HKPDLGNWASYLGVSRSAGMGELEYLRLL-------IDTAI---- 49
Query 83 REHVRTSGRTPNGVWGGKLMWNQ-TALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHV 141
G G++G +L + +Q P+ D +A V G +FVH+
Sbjct 50 -----EQGTANTGMFGLRLQRHSFDFFFRQLRILCPNEPTD--KARFEAVFGRT-LFVHL 101
Query 142 HRPDVVSQAVSFWRAVQTQVWRGHPDPKR------DSQAVYHAGAIAHIIRNLRDQENGW 195
RPD +SQAVSF +A Q+ +W D + VY A+ + W
Sbjct 102 TRPDKLSQAVSFVKAQQSGLWHRAADGSELERLSPPADPVYDFAALKDCCDQFIQFDRDW 161
Query 196 RAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAP--APMLERQANQRSDEWV 253
WFAE+ I P+ ++Y L + + VL A+ P A P + + A+ + +W+
Sbjct 162 NDWFAEQAIKPLRLSYDDLCGDPQTELKRVLTALDLPPSAADPVQPGVAKLADSINADWI 221
Query 254 DRYR 257
R++
Sbjct 222 KRFQ 225
>gi|126735488|ref|ZP_01751233.1| hypothetical protein RCCS2_16471 [Roseobacter sp. CCS2]
gi|126714675|gb|EBA11541.1| hypothetical protein RCCS2_16471 [Roseobacter sp. CCS2]
Length=285
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/257 (24%), Positives = 100/257 (39%), Gaps = 53/257 (20%)
Query 10 VLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLDPL 69
T R GS L L TG G P E+ P W + +
Sbjct 37 FCTTPRCGSHFLGHRLHGTGAFGYPLEYLN----------PGNW----------HVWEKR 76
Query 70 DPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQ-TALLQQRAAQLPDRSGDGLRAAI 128
TP P+ + + VRT PNGV+ KL A L+Q A L +
Sbjct 77 AGPTP----PLDYIKSVRTG---PNGVFSVKLHHEHLAAFLKQEVAPLDYK--------- 120
Query 129 RDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRNL 188
F+H+ R D++ QA+SF RA QT W D + Y I + +
Sbjct 121 ---------FIHLQRRDLMKQAISFARAQQTGAWIS--DMPEKAAGSYDWSLITDKMDAI 169
Query 189 RDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG--QDPKLAPAPMLERQAN 246
W+++ + GI P+ + Y + + +A +A + D +G D + A Q
Sbjct 170 SRGNADWQSFLSSMGIQPLQLYYEDVVADASAAIAQIADYLGVAMDSVVTTATTFTPQQQ 229
Query 247 QRSDE---WVDRYRAEA 260
+++ + W+ RY++++
Sbjct 230 KKTAQAADWLSRYQSDS 246
>gi|227822469|ref|YP_002826441.1| hypothetical protein NGR_c19240 [Sinorhizobium fredii NGR234]
gi|227341470|gb|ACP25688.1| conserved hypothetical protein [Sinorhizobium fredii NGR234]
Length=256
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 104/252 (42%), Gaps = 38/252 (15%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
+R YL+L RSGS L + +G G+ E+ ++P+ +
Sbjct 1 MRGYLLLTEARSGSNWLGSLINNSGNMGQSSEW--------LSPK-------------IH 39
Query 65 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGL 124
LD T + + E +R S T NG +G K+ N L R D L
Sbjct 40 RLD-----TSSLSWEEFFEEIIRKSS-TENGNFGLKIFPNH--LFITREIYGMDFIQYCL 91
Query 125 RAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHI 184
++ DV V + R D + QA+S+ RA QT+ + H D Q Y+ IA
Sbjct 92 --SVHDV-----ALVFLRRDDTLRQAISYARARQTRSFAAHVQGNADPQ--YNFEEIAKC 142
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
+RD + WR++ G + + Y L + + ++ V + +G A + Q
Sbjct 143 FFYIRDSYSFWRSYLELTGAESTEFVYENLVPDPSPFISCVAEHLGVPAPGALETTMAVQ 202
Query 245 ANQRSDEWVDRY 256
++ +DEWV R+
Sbjct 203 RDEVTDEWVARF 214
>gi|319781596|ref|YP_004141072.1| hypothetical protein Mesci_1869 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
gi|317167484|gb|ADV11022.1| hypothetical protein Mesci_1869 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length=255
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/266 (24%), Positives = 100/266 (38%), Gaps = 48/266 (18%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
+R +L RSGS L ATG G +E+
Sbjct 1 MRGVAILTEGRSGSNWLGSLTNATGLMGRSEEW--------------------------- 33
Query 65 LLDP----LDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRS 120
LDP DP T D A + +GR ++ L W + ++
Sbjct 34 -LDPAYLRFDPRTYDDLEKAAIEKAATDNGRFAIKLFPRHLAWCK------------EKF 80
Query 121 GDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAV-YHAG 179
G IR G E F+ + R D + QA+SF+RA + VW + K + +AV Y
Sbjct 81 GKDFLFEIRRKHGLE--FILLERRDRIQQAISFYRARMSGVWTSRHEGKVNPRAVPYSFA 138
Query 180 AIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG-QDPKLAPA 238
I+ + WR++ G+D Y L ++ + +V D +G Q P+
Sbjct 139 DISQAYFQVDRSYAFWRSYLQLAGLDCRQFVYEDLQQDPRPYLEAVADYMGVQVPEDTAN 198
Query 239 PMLERQANQRSDEWVDRYRAEAPRLG 264
Q + ++EW+ R+R +A G
Sbjct 199 SRFTVQRDSLTEEWIVRFREDAAAKG 224
>gi|15965791|ref|NP_386144.1| hypothetical protein SMc04267 [Sinorhizobium meliloti 1021]
gi|334316732|ref|YP_004549351.1| hypothetical protein Sinme_2014 [Sinorhizobium meliloti AK83]
gi|15075060|emb|CAC46617.1| LPS sulfotransferase [Sinorhizobium meliloti 1021]
gi|333812096|gb|AEG04765.1| hypothetical protein SinmeB_1857 [Sinorhizobium meliloti BL225C]
gi|334095726|gb|AEG53737.1| hypothetical protein Sinme_2014 [Sinorhizobium meliloti AK83]
gi|336032630|gb|AEH78562.1| LpsS [Sinorhizobium meliloti SM11]
Length=256
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/255 (26%), Positives = 106/255 (42%), Gaps = 44/255 (17%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQ 64
+R YL+L RSGS L + G G E+ ++P+
Sbjct 1 MRGYLLLTEARSGSNWLGSLVNGAGNMGRSSEW--------LSPK--------------- 37
Query 65 LLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGL 124
+ LD G + ++E +R TPNGV+G K+ NQ + + + R
Sbjct 38 -IHRLDTGA--LSWDAFFQELLRKCS-TPNGVFGSKIFPNQLFVTHE----VYGRDFIQH 89
Query 125 RAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHI 184
A+ DV V + R D + QA+S+ RA QT+ + H + + + Q Y IA
Sbjct 90 CLAMHDV-----ALVFLRRRDTLRQAISYARARQTRSFAAHVEGRANPQ--YDFEQIARC 142
Query 185 IRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPMLERQ 244
+RD W+++ G++ + Y L + V+ + + + Q P PA +
Sbjct 143 FFYIRDSYAFWQSYLELTGVEFAEFVYEELAADPIPFVSHLAEHL-QVP--LPAQLQTSM 199
Query 245 ANQRSD---EWVDRY 256
A QR D EW+ R+
Sbjct 200 AVQRDDLTEEWIARF 214
>gi|85704664|ref|ZP_01035766.1| hypothetical protein ROS217_06279 [Roseovarius sp. 217]
gi|85671072|gb|EAQ25931.1| hypothetical protein ROS217_06279 [Roseovarius sp. 217]
Length=240
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/131 (30%), Positives = 64/131 (49%), Gaps = 14/131 (10%)
Query 138 FVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRA 197
F+ + R D+V QAVSF RA QT W D ++A Y IA + + D GW +
Sbjct 73 FIQLQRRDLVRQAVSFARAQQTGAW--ISDMPERAEARYDRNLIAAKVDAIADFNAGWTS 130
Query 198 WFAEEGIDPIDIAYPVLW---RNLTAIVASVLD------AIGQDPKLAPAPMLERQANQR 248
+ A G+ P+++ Y + R +A+ L + G+D P +R +N
Sbjct 131 FLASLGVKPLELFYEDVVADRRGAMQRIAAYLSIELPDASTGED---VFQPKAQRASNDP 187
Query 249 SDEWVDRYRAE 259
++ WV+R+++E
Sbjct 188 TEVWVERFKSE 198
>gi|114571053|ref|YP_757733.1| hypothetical protein Mmar10_2509 [Maricaulis maris MCS10]
gi|114341515|gb|ABI66795.1| conserved hypothetical protein [Maricaulis maris MCS10]
Length=260
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/260 (26%), Positives = 103/260 (40%), Gaps = 44/260 (16%)
Query 6 RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQL 65
R Y + RSGST L L+ TG G P E+ + G A +
Sbjct 17 RQYAICLVPRSGSTFLAHLLKNTGRFGFPNEWMAVALAEGEARET--------------- 61
Query 66 LDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR 125
G+PD T V + NGV G +L +Q A PD
Sbjct 62 ------GSPDWDTLF---RRVMARYASDNGVSGIELALAHLTWGRQ-ATGRPD------- 104
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQV---WRGHPDPKRDSQAV-YHAGAI 181
++ + ++ R ++V QA+S A Q+ V ++ D ++ AV Y AI
Sbjct 105 -----ILDPGWTYFYLRRRNIVRQAISMHVAHQSGVLHSFQMTDDARKVRDAVLYDTPAI 159
Query 182 AHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG--QDPKLAPAP 239
I+ L+D+E W F GI+PI + Y + V + +G + P + +
Sbjct 160 RSWIKFLQDEELKWEREFGRMGIEPIRLYYEDITARPERAVRLFSNVLGLPETPTIKTS- 218
Query 240 MLERQANQRSDEWVDRYRAE 259
+ER R+D+W RYR E
Sbjct 219 TIERIGTSRTDDWEARYRDE 238
>gi|325981293|ref|YP_004293695.1| hypothetical protein NAL212_0593 [Nitrosomonas sp. AL212]
gi|325530812|gb|ADZ25533.1| hypothetical protein NAL212_0593 [Nitrosomonas sp. AL212]
Length=301
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/263 (23%), Positives = 101/263 (39%), Gaps = 46/263 (17%)
Query 6 RPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQL 65
R ++ T R GS L E L AT G EFF + + E DD + L
Sbjct 56 RQVILCFTNRCGSNWLAELLYATELMGLADEFFN---TERIQADCAECGLSSLDDFVRHL 112
Query 66 LDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLR 125
PG T N ++ KL W+Q L R
Sbjct 113 -----PGNHSTL----------------NKIFATKLSWDQLYFLS--------------R 137
Query 126 AAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRGHPDPKRDSQ---AVYHAGAIA 182
+ I P F+++ R DV +QA+SF A QT W+ + + + + A I
Sbjct 138 VKVIPWIIPNPQFIYIVRDDVAAQALSFLVAQQTGQWKSNWNSGVNGKIELADISNEQII 197
Query 183 HIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIG----QDPKLAPA 238
I + ++ + +F ++P I Y L I+A +L + + ++ A
Sbjct 198 MAISEILFAQSKFELYFEMLKLNPCRIHYEDLLAKPEFIIARILQYLKIPAPKKMEINSA 257
Query 239 PM-LERQANQRSDEWVDRYRAEA 260
+ LE+Q +++S++ + R+ E
Sbjct 258 KLQLEKQRDEQSEKRLARFHRET 280
>gi|296131408|ref|YP_003638658.1| Stf0 sulfotransferase [Cellulomonas flavigena DSM 20109]
gi|296023223|gb|ADG76459.1| Stf0 sulfotransferase [Cellulomonas flavigena DSM 20109]
Length=257
Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 73/274 (27%), Positives = 107/274 (40%), Gaps = 47/274 (17%)
Query 5 VRPYLVLATQRSGSTLLVESLRATGCAGEPQEFF------QYLPSTGMAPQPREWFAGVD 58
V Y+V +R+GS LL +L A G G P E+ Q L +G A
Sbjct 5 VAAYVVACQERTGSNLLCGALSAQGGLGAPDEWLGRSRLHQRLVDSGTAAPSST------ 58
Query 59 DDTILQLLDPLDPGTPDTATPVAWREHVRTSGRT-PNGVWGGKLMWNQTALLQQRAAQLP 117
PG P A+ + + + RT P V+G K+ W Q AA L
Sbjct 59 ------------PGAPRPGDLDAYVDAM--AARTAPGAVFGAKVHWYQL------AAALD 98
Query 118 DRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQVWRG-----------HP 166
D D +R A+ ++ V V + R D V+QAVS RA T + HP
Sbjct 99 DGWLDDVRGAVPRAARSDAVVVRLRRRDRVAQAVSMLRAQATGTYVAPADGSAVDEVRHP 158
Query 167 DPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVL 226
+P + I ++ + + W A + +++ Y L + A V VL
Sbjct 159 EPYWATGGGDPLEEIERVVGTIDAHDARWSQHLAALDVPVLEVDYEWLTADYDATVRDVL 218
Query 227 DAIGQD-PKLA--PAPMLERQANQRSDEWVDRYR 257
+ P A P P RQA+ RS E ++ YR
Sbjct 219 AFLDHPLPATAAVPEPRTARQADARSAELIEAYR 252
>gi|46109596|ref|XP_381856.1| hypothetical protein FG01680.1 [Gibberella zeae PH-1]
Length=1649
Score = 41.6 bits (96), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/89 (33%), Positives = 40/89 (45%), Gaps = 11/89 (12%)
Query 28 TGCAGEPQEF-------FQYLPSTGMAPQPREW---FAGVDDDTILQLLDPLDPGTPDTA 77
TG P +F FQ + ++P PR+W A D T+LQ P P ++
Sbjct 171 TGKTRVPSQFASVLWKTFQEIYLDLLSPSPRDWRRVLASNTDKTLLQSFLPRTPTKIQSS 230
Query 78 TPVAWREHVRTSGRTPN-GVWGGKLMWNQ 105
WRE VRT+ R P W G L +N+
Sbjct 231 VIDLWRESVRTAPRAPAVDAWDGTLTYNE 259
Lambda K H
0.319 0.135 0.432
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 407343666860
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40