BLASTP 2.2.25+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv3077

Length=603
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|57117053|ref|YP_177923.1|  hydrolase [Mycobacterium tuberculos...  1215    0.0   
gi|31794256|ref|NP_856749.1|  hydrolase [Mycobacterium bovis AF21...  1212    0.0   
gi|15842647|ref|NP_337684.1|  sulfatase family protein [Mycobacte...  1209    0.0   
gi|289751751|ref|ZP_06511129.1|  hydrolase [Mycobacterium tubercu...  1209    0.0   
gi|183981593|ref|YP_001849884.1|  hydrolase [Mycobacterium marinu...  1059    0.0   
gi|118618760|ref|YP_907092.1|  hydrolase [Mycobacterium ulcerans ...  1053    0.0   
gi|240168676|ref|ZP_04747335.1|  hydrolase [Mycobacterium kansasi...  1006    0.0   
gi|118469124|ref|YP_885676.1|  sulfatase [Mycobacterium smegmatis...   969    0.0   
gi|126433508|ref|YP_001069199.1|  sulfatase [Mycobacterium sp. JL...   951    0.0   
gi|342857443|ref|ZP_08714099.1|  hydrolase [Mycobacterium colombi...   939    0.0   
gi|296166344|ref|ZP_06848780.1|  sulfatase [Mycobacterium parascr...   932    0.0   
gi|108797868|ref|YP_638065.1|  sulfatase [Mycobacterium sp. MCS] ...   931    0.0   
gi|145222959|ref|YP_001133637.1|  sulfatase [Mycobacterium gilvum...   902    0.0   
gi|120405228|ref|YP_955057.1|  sulfatase [Mycobacterium vanbaalen...   892    0.0   
gi|295704481|ref|YP_003597556.1|  sulfatase family protein [Bacil...   373    5e-101
gi|345444703|gb|AEN89720.1|  Arylsulfatase A family protein [Baci...   369    8e-100
gi|311032599|ref|ZP_07710689.1|  sulfatase [Bacillus sp. m3-13]        368    1e-99 
gi|338535752|ref|YP_004669086.1|  sulfatase family protein [Myxoc...   368    2e-99 
gi|308067126|ref|YP_003868731.1|  arylsulfatase A [Paenibacillus ...   362    1e-97 
gi|258515285|ref|YP_003191507.1|  sulfatase [Desulfotomaculum ace...   358    1e-96 
gi|288555022|ref|YP_003426957.1|  sulfatase [Bacillus pseudofirmu...   348    1e-93 
gi|226315218|ref|YP_002775114.1|  sulfatase [Brevibacillus brevis...   342    8e-92 
gi|251794620|ref|YP_003009351.1|  sulfatase [Paenibacillus sp. JD...   335    2e-89 
gi|77163732|ref|YP_342257.1|  arylsulfatase A and related enzyme ...   325    1e-86 
gi|254436243|ref|ZP_05049750.1|  sulfatase, putative [Nitrosococc...   325    1e-86 
gi|149924951|ref|ZP_01913279.1|  sulfatase [Plesiocystis pacifica...   292    1e-76 
gi|288960770|ref|YP_003451110.1|  sulfatase [Azospirillum sp. B51...   182    2e-43 
gi|167644209|ref|YP_001681872.1|  sulfatase [Caulobacter sp. K31]...   166    1e-38 
gi|149922160|ref|ZP_01910599.1|  Arylsulfatase A and related enzy...   164    4e-38 
gi|312139132|ref|YP_004006468.1|  sulfatase [Rhodococcus equi 103...   163    7e-38 
gi|325673566|ref|ZP_08153257.1|  arylsulfatase [Rhodococcus equi ...   163    9e-38 
gi|163754242|ref|ZP_02161365.1|  POSSIBLE HYDROLASE [Kordia algic...   161    2e-37 
gi|226305445|ref|YP_002765405.1|  sulfatase [Rhodococcus erythrop...   144    4e-32 
gi|229489534|ref|ZP_04383397.1|  sulfatase [Rhodococcus erythropo...   144    4e-32 
gi|284046572|ref|YP_003396912.1|  sulfatase [Conexibacter woesei ...   139    2e-30 
gi|294673172|ref|YP_003573788.1|  sulfatase family protein [Prevo...   135    2e-29 
gi|108756971|ref|YP_632689.1|  sulfatase family protein [Myxococc...   134    5e-29 
gi|258655369|ref|YP_003204525.1|  sulfatase [Nakamurella multipar...   130    5e-28 
gi|242278822|ref|YP_002990951.1|  sulfatase [Desulfovibrio salexi...   130    6e-28 
gi|254427464|ref|ZP_05041171.1|  sulfatase, putative [Alcanivorax...   130    7e-28 
gi|338972465|ref|ZP_08627838.1|  choline-sulfatase [Bradyrhizobia...   129    1e-27 
gi|111021151|ref|YP_704123.1|  arylsulfatase [Rhodococcus jostii ...   129    2e-27 
gi|226363511|ref|YP_002781293.1|  sulfatase [Rhodococcus opacus B...   127    4e-27 
gi|325523292|gb|EGD01647.1|  arylsulfatase A like protein [Burkho...   125    2e-26 
gi|116694269|ref|YP_728480.1|  arylsulfatase [Ralstonia eutropha ...   124    4e-26 
gi|296395322|ref|YP_003660206.1|  sulfatase [Segniliparus rotundu...   124    4e-26 
gi|169631543|ref|YP_001705192.1|  sulfatase family protein [Mycob...   124    6e-26 
gi|217969899|ref|YP_002355133.1|  sulfatase [Thauera sp. MZ1T] >g...   124    7e-26 
gi|73538537|ref|YP_298904.1|  twin-arginine translocation pathway...   122    3e-25 
gi|78061333|ref|YP_371241.1|  arylsulfatase A like protein [Burkh...   120    6e-25 


>gi|57117053|ref|YP_177923.1| hydrolase [Mycobacterium tuberculosis H37Rv]
 gi|148662931|ref|YP_001284454.1| putative hydrolase [Mycobacterium tuberculosis H37Ra]
 gi|167969682|ref|ZP_02551959.1| putative hydrolase [Mycobacterium tuberculosis H37Ra]
 gi|307085811|ref|ZP_07494924.1| hydrolase [Mycobacterium tuberculosis SUMu012]
 gi|41352782|emb|CAE55546.1| POSSIBLE HYDROLASE [Mycobacterium tuberculosis H37Rv]
 gi|148507083|gb|ABQ74892.1| putative hydrolase [Mycobacterium tuberculosis H37Ra]
 gi|308364728|gb|EFP53579.1| hydrolase [Mycobacterium tuberculosis SUMu012]
Length=603

 Score = 1215 bits (3143),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 603/603 (100%), Positives = 603/603 (100%), Gaps = 0/603 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS
Sbjct  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF
Sbjct  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP
Sbjct  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA
Sbjct  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD
Sbjct  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP
Sbjct  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR
Sbjct  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600

Query  601  FVR  603
            FVR
Sbjct  601  FVR  603


>gi|31794256|ref|NP_856749.1| hydrolase [Mycobacterium bovis AF2122/97]
 gi|121638962|ref|YP_979186.1| putative hydrolase [Mycobacterium bovis BCG str. Pasteur 1173P2]
 gi|148824269|ref|YP_001289023.1| hydrolase [Mycobacterium tuberculosis F11]
 60 more sequence titles
 Length=603

 Score = 1212 bits (3137),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 602/603 (99%), Positives = 602/603 (99%), Gaps = 0/603 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS
Sbjct  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF
Sbjct  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP
Sbjct  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVRRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA
Sbjct  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD
Sbjct  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP
Sbjct  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR
Sbjct  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600

Query  601  FVR  603
            FVR
Sbjct  601  FVR  603


>gi|15842647|ref|NP_337684.1| sulfatase family protein [Mycobacterium tuberculosis CDC1551]
 gi|254233702|ref|ZP_04927027.1| hypothetical protein TBCG_03012 [Mycobacterium tuberculosis C]
 gi|254365705|ref|ZP_04981750.1| hypothetical hydrolase [Mycobacterium tuberculosis str. Haarlem]
 gi|13882964|gb|AAK47498.1| sulfatase family protein [Mycobacterium tuberculosis CDC1551]
 gi|124599231|gb|EAY58335.1| hypothetical protein TBCG_03012 [Mycobacterium tuberculosis C]
 gi|134151218|gb|EBA43263.1| hypothetical hydrolase [Mycobacterium tuberculosis str. Haarlem]
 gi|323718309|gb|EGB27487.1| hydrolase [Mycobacterium tuberculosis CDC1551A]
Length=603

 Score = 1209 bits (3129),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 601/603 (99%), Positives = 601/603 (99%), Gaps = 0/603 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS
Sbjct  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF
Sbjct  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP
Sbjct  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVRRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA
Sbjct  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            SADEGRAIYLMTRDNVLEGDTGASLLSRQLG IVNPPAPLRIKVPAHVAANFEGLVVRVD
Sbjct  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGHIVNPPAPLRIKVPAHVAANFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP
Sbjct  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR
Sbjct  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600

Query  601  FVR  603
            FVR
Sbjct  601  FVR  603


>gi|289751751|ref|ZP_06511129.1| hydrolase [Mycobacterium tuberculosis T92]
 gi|289692338|gb|EFD59767.1| hydrolase [Mycobacterium tuberculosis T92]
Length=603

 Score = 1209 bits (3128),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 601/603 (99%), Positives = 601/603 (99%), Gaps = 0/603 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS
Sbjct  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF
Sbjct  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP
Sbjct  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVRRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA
Sbjct  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAP RIKVPAHVAANFEGLVVRVD
Sbjct  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPRRIKVPAHVAANFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP
Sbjct  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR
Sbjct  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600

Query  601  FVR  603
            FVR
Sbjct  601  FVR  603


>gi|183981593|ref|YP_001849884.1| hydrolase [Mycobacterium marinum M]
 gi|183174919|gb|ACC40029.1| hydrolase [Mycobacterium marinum M]
Length=603

 Score = 1059 bits (2739),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 515/601 (86%), Positives = 555/601 (93%), Gaps = 0/601 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M   PDI+I+MTDEERAVPPYESA+VLAWRQR+LTGRRWFDEHG+SF RHYTGSLACVPS
Sbjct  1    MPEGPDIVIIMTDEERAVPPYESADVLAWRQRTLTGRRWFDEHGVSFARHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDP TG  LATND++G +D AAV+RYLDADPLGPYGFSGWVGPEPHGAG ANSGF
Sbjct  121  SHADLEDPDTGLSLATNDDDGEIDPAAVQRYLDADPLGPYGFSGWVGPEPHGAGNANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+ADRVVAWL +RYARRRAGD AA+RPFLLVASF+NPHD+VLFPAWV  SPLKPS 
Sbjct  181  RRDPLIADRVVAWLEDRYARRRAGDEAALRPFLLVASFINPHDVVLFPAWVRFSPLKPSH  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPA PTADEDL TKPAAQ+A+R+AYY+GYG+   + R Y RNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAPPTADEDLRTKPAAQIAFRQAYYTGYGVAPAIKRTYQRNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            E DGPIDRV RAVTEGGS++A+LVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR+
Sbjct  301  ENDGPIDRVRRAVTEGGSDNAVLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARV  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G++AT  RTVSAPTSHVDLVPTLLSAAG+D D VAA LAESF+EVHPLPGRDLM VVDGA
Sbjct  361  GDEATTARTVSAPTSHVDLVPTLLSAAGIDTDAVAANLAESFTEVHPLPGRDLMAVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
             ADE RAIYLMTRDNVLEGDTGASLLSRQLGR VNPPAPLRIK+PAHVAANFEGLVVRV+
Sbjct  421  PADEDRAIYLMTRDNVLEGDTGASLLSRQLGRTVNPPAPLRIKLPAHVAANFEGLVVRVE  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            D+DA GGAGHLWKLVRTFDDPATWTEPGVRHLA+NG+GG+ YRTDP+DDQWELYDLTADP
Sbjct  481  DSDAPGGAGHLWKLVRTFDDPATWTEPGVRHLASNGVGGETYRTDPVDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IE  NRWTDP+LHELRQHLRM LKQQRA SVPERNQPWPYA+R P +G S G +RR+LGR
Sbjct  541  IETDNRWTDPELHELRQHLRMQLKQQRASSVPERNQPWPYANRQPETGESQGPIRRLLGR  600

Query  601  F  601
             
Sbjct  601  I  601


>gi|118618760|ref|YP_907092.1| hydrolase [Mycobacterium ulcerans Agy99]
 gi|118570870|gb|ABL05621.1| hydrolase [Mycobacterium ulcerans Agy99]
Length=603

 Score = 1053 bits (2724),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 512/601 (86%), Positives = 552/601 (92%), Gaps = 0/601 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M   PDI+I+MTDEERAVPPYESA+VLAWRQR+LTGRRWFDEHG+SF RHYTGSLACVPS
Sbjct  1    MPEGPDIVIIMTDEERAVPPYESADVLAWRQRTLTGRRWFDEHGVSFARHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDP TG  LATND++G +D AAV+RYLDADPLGPYGFSGWVGPEPHGAG ANSGF
Sbjct  121  SHADLEDPDTGLSLATNDDDGEIDPAAVQRYLDADPLGPYGFSGWVGPEPHGAGNANSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+ADRVVAWL +RYARRRAGD AA+RPFLLVASF+NPHD+VLFPAWV   PLKPS 
Sbjct  181  RRDPLIADRVVAWLEDRYARRRAGDEAALRPFLLVASFINPHDVVLFPAWVRFGPLKPSH  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPA PTADEDL TKPAAQ+A+R+AYY+GYG+   + R Y RNAQRYRDLYYRLHA
Sbjct  241  LDPPHVPAPPTADEDLRTKPAAQIAFRQAYYTGYGVAPAIKRTYQRNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            E DGPIDRV RAVTEGGS++A+LVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVI R+
Sbjct  301  ENDGPIDRVRRAVTEGGSDNAVLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVITRV  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G +AT  RTVSAPTSHVDLVPTLLSAAG+D + VAA LAESF+EVHPLPGRDLM VVDGA
Sbjct  361  GNEATTARTVSAPTSHVDLVPTLLSAAGIDTEAVAANLAESFTEVHPLPGRDLMAVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
             ADE RAIYLMTRDNVLEGDTGASLLSRQLGR VNPPAPLRIK+PAHVAANFEGLVVRV+
Sbjct  421  PADEDRAIYLMTRDNVLEGDTGASLLSRQLGRTVNPPAPLRIKLPAHVAANFEGLVVRVE  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            D+DA GGAGHLWKLVRTFDDPATWTEPGVRHLA+NG+GG+ YRTDP+DDQWELYDLTADP
Sbjct  481  DSDAPGGAGHLWKLVRTFDDPATWTEPGVRHLASNGVGGETYRTDPVDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IE  NRWTDP+LHELRQHLRM LKQQRA SVPERNQPWPYA+R P +G S G +RR+LGR
Sbjct  541  IETDNRWTDPELHELRQHLRMQLKQQRASSVPERNQPWPYANRQPETGQSQGPIRRLLGR  600

Query  601  F  601
             
Sbjct  601  I  601


>gi|240168676|ref|ZP_04747335.1| hydrolase [Mycobacterium kansasii ATCC 12478]
Length=602

 Score = 1006 bits (2602),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 514/601 (86%), Positives = 551/601 (92%), Gaps = 0/601 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MA+RPDI+IVMTDEERAVPPYESA++LAWRQR+LTGRRWFDEHG++FTRHYTGSLACVPS
Sbjct  1    MADRPDIVIVMTDEERAVPPYESADILAWRQRTLTGRRWFDEHGVNFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTG YPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTIFTGHYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADL DP TG  LATND++GVVD AAV+RYLDADPLGPYGFSGWVGPEPHGA +++ G 
Sbjct  121  SHADLHDPETGGSLATNDDDGVVDPAAVQRYLDADPLGPYGFSGWVGPEPHGAAMSDCGL  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+ADRVVAWL +RYARRRAGD AA+RPFLLVASFVNPHDIVLFPAWV R+PLKPS 
Sbjct  181  RRDPLIADRVVAWLNDRYARRRAGDAAALRPFLLVASFVNPHDIVLFPAWVRRNPLKPSL  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPP V  APTADEDL  KPAAQ+A+REAYYSGYG  R+V RNY RNAQRYRDLYYRLHA
Sbjct  241  LDPPPVHPAPTADEDLQAKPAAQIAFREAYYSGYGPARVVKRNYGRNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGS++A+LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVRRAVTEGGSDNAVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G  AT  RTVSAPTSHVDLVPTLL AAGVD DV AA LAESF+EVHPLPGR+LMPVVDG 
Sbjct  361  GADATTARTVSAPTSHVDLVPTLLGAAGVDADVAAAQLAESFTEVHPLPGRNLMPVVDGG  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
             AD  RA+Y+MTRDNVLEGDTGAS  +RQLGR VNPPAPLRIKVPAHVA+NFEGLVVRVD
Sbjct  421  PADRSRAVYVMTRDNVLEGDTGASAFARQLGRTVNPPAPLRIKVPAHVASNFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            D+DA GGAGHLWKLVRTFDDPATWTEPGVRHLA NG+GG+AYRTDP+DDQWELYDLT DP
Sbjct  481  DSDAVGGAGHLWKLVRTFDDPATWTEPGVRHLAGNGIGGEAYRTDPVDDQWELYDLTTDP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGR  600
            IEA NRWTDP LHELRQHLRM LKQQRAVSVPERNQPWPYA R PP+  S G++RR LGR
Sbjct  541  IEADNRWTDPALHELRQHLRMQLKQQRAVSVPERNQPWPYAKRQPPADPSGGVLRRALGR  600

Query  601  F  601
            F
Sbjct  601  F  601


>gi|118469124|ref|YP_885676.1| sulfatase [Mycobacterium smegmatis str. MC2 155]
 gi|118170411|gb|ABK71307.1| sulfatase family protein [Mycobacterium smegmatis str. MC2 155]
Length=586

 Score =  969 bits (2506),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 480/583 (83%), Positives = 523/583 (90%), Gaps = 0/583 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M +RPDI+IVMTDEERA+PPYES+ VLAWR+R LTGRRWFDEHG++FTRHYTGSLACVPS
Sbjct  1    MTDRPDIVIVMTDEERAIPPYESSSVLAWRERVLTGRRWFDEHGVNFTRHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPT+FTG YPDLHG+TQTDG+GKR+DDSRLRWLR GEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTMFTGHYPDLHGITQTDGLGKRYDDSRLRWLRRGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHADLEDPATG PLATND++GV+D AAV+ YL+ADPL P+GFSGWVGPEPHGA ++NSGF
Sbjct  121  SHADLEDPATGEPLATNDDDGVIDHAAVQAYLEADPLAPFGFSGWVGPEPHGAAMSNSGF  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDP+VADRVVAWL +RYARRRAGD  A+RPFLLVASFVNPHDIVLFPAW  R P  PS 
Sbjct  181  RRDPIVADRVVAWLKDRYARRRAGDPDALRPFLLVASFVNPHDIVLFPAWSRRMPFGPSE  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTA+EDL  KPAAQ+A+REAYY+GYG    V R Y R AQ+YRDLYYRLHA
Sbjct  241  LDPPHVPAAPTAEEDLRDKPAAQIAFREAYYTGYGPAMAVERTYRRKAQQYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGSE+A+LVRTSDHG+LLGAHGGLHQKWFNLYDEATRVPFVIARI
Sbjct  301  EVDGPIDRVRRAVTEGGSENAVLVRTSDHGELLGAHGGLHQKWFNLYDEATRVPFVIARI  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G +AT+ RTV APTSHVDLVPTLL AAGVDVD  A  L ESFSEVHPLPGRDLMPVV G 
Sbjct  361  GTEATEARTVDAPTSHVDLVPTLLGAAGVDVDAAAEQLRESFSEVHPLPGRDLMPVVSGE  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            SADE R IYLMTRDNVLEGDTGAS ++RQLGR VNPPAPLRIKVPAHVA+NFEGLVVRVD
Sbjct  421  SADEHRPIYLMTRDNVLEGDTGASGVARQLGRDVNPPAPLRIKVPAHVASNFEGLVVRVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            D DA GGAGHLWKLVRTFDDP+TWTEPGVRHLA +G+GG+ YR+DPLDDQWELYDLTADP
Sbjct  481  DADAHGGAGHLWKLVRTFDDPSTWTEPGVRHLAADGLGGETYRSDPLDDQWELYDLTADP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHR  583
            +EA NRW  P LHE+RQ+L   LKQ RA SVPERN PWPYA R
Sbjct  541  VEAVNRWHYPDLHEVRQYLLAQLKQVRASSVPERNVPWPYARR  583


>gi|126433508|ref|YP_001069199.1| sulfatase [Mycobacterium sp. JLS]
 gi|126233308|gb|ABN96708.1| sulfatase [Mycobacterium sp. JLS]
Length=598

 Score =  951 bits (2459),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 479/602 (80%), Positives = 526/602 (88%), Gaps = 11/602 (1%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M+N PDI+I+MTDEERAVPPYES EVLAWR R+L  R+WFD+HG+SF RHYTGSLACVPS
Sbjct  1    MSN-PDIVILMTDEERAVPPYESPEVLAWRDRTLPCRKWFDDHGVSFGRHYTGSLACVPS  59

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGK + DSR+RWLR GEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  60   RPTIFTGQYPDLHGVTQTDGIGKTYGDSRMRWLRPGEVPTLGNWFRAAGYDTHYDGKWHI  119

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHAD+ DPATG PL TND++GVVD+ AVRRYLDAD L PYGFSGWVGPEPHGA L+NSGF
Sbjct  120  SHADVTDPATGLPLDTNDDDGVVDADAVRRYLDADSLAPYGFSGWVGPEPHGAALSNSGF  179

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+A RVVAWL +RYARRRAGD  A+RPFLLVASFVNPHDIVLFP WV RSP+KPSP
Sbjct  180  RRDPLIAARVVAWLEDRYARRRAGDPQALRPFLLVASFVNPHDIVLFPQWVRRSPVKPSP  239

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQ+A+REAYYSGYG   ++ R Y RNAQ+YRDLYYRLHA
Sbjct  240  LDPPHVPAAPTADEDLSTKPAAQIAFREAYYSGYGPAAVMERTYRRNAQQYRDLYYRLHA  299

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            +VDGP++RV RAV E GS+DA+LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR 
Sbjct  300  QVDGPLERVRRAVVE-GSQDAVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIART  358

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G  AT  RTV+APTSHVDLVPTLLSAAGVDV   AA LAESF+EVHPLPGRDLMPVVDGA
Sbjct  359  GVNATAARTVTAPTSHVDLVPTLLSAAGVDVAATAATLAESFTEVHPLPGRDLMPVVDGA  418

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            + DE RA+YLMTRDN+LEGD+GAS L+R+L R VNPP PLRI+VPAHVA+NFEGLV +VD
Sbjct  419  APDEDRAVYLMTRDNMLEGDSGASGLARKLKRTVNPPGPLRIRVPAHVASNFEGLVTQVD  478

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
                    GHLWKLVR+FDDPATWTEPGVRHLA NG+GG+AYR+ PLDDQWELYDLTADP
Sbjct  479  --------GHLWKLVRSFDDPATWTEPGVRHLAANGVGGEAYRSSPLDDQWELYDLTADP  530

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASN-GLVRRVLG  599
             EA NRW DP L ELR HLR  LK  R  S+PERNQPWPYA R PP+G +  GLVRR LG
Sbjct  531  TEAVNRWPDPSLDELRAHLRRQLKHVRTESIPERNQPWPYAVRRPPTGGARVGLVRRALG  590

Query  600  RF  601
            R 
Sbjct  591  RL  592


>gi|342857443|ref|ZP_08714099.1| hydrolase [Mycobacterium colombiense CECT 3035]
 gi|342134776|gb|EGT87942.1| hydrolase [Mycobacterium colombiense CECT 3035]
Length=603

 Score =  939 bits (2427),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 468/578 (81%), Positives = 513/578 (89%), Gaps = 0/578 (0%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RPD+I+++TDEERAVPPYE+ EVLAWR R L+GRRWF+EHG+SF RHYTGSLACVPSRPT
Sbjct  8    RPDVIVIVTDEERAVPPYEAPEVLAWRDRILSGRRWFEEHGVSFGRHYTGSLACVPSRPT  67

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            IFTG YPDLHGVTQTDGIGK   DSR+RWLR GEVPTLGNWFRAAGYDTHYDGKWHISHA
Sbjct  68   IFTGHYPDLHGVTQTDGIGKTAGDSRMRWLRQGEVPTLGNWFRAAGYDTHYDGKWHISHA  127

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
            DL DPATG  LATND++G VD  AVRRYL+ADPL P+GFSGWVGPEPHGA LAN+G RRD
Sbjct  128  DLIDPATGRSLATNDDDGNVDPGAVRRYLEADPLAPFGFSGWVGPEPHGAALANAGIRRD  187

Query  184  PLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSPLDP  243
            PL+ADR+VAWLT+RYARRRAGD AA+RPFLLVASFVNPHDIVLFP W  R P+KPSPLDP
Sbjct  188  PLIADRIVAWLTDRYARRRAGDPAALRPFLLVASFVNPHDIVLFPTWSRRGPVKPSPLDP  247

Query  244  PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHAEVD  303
            P VP APTA+EDLS+KPAAQ+A+REAYYSGYG    +   Y RNAQRYRDLYYRLHAEVD
Sbjct  248  PPVPPAPTAEEDLSSKPAAQIAFREAYYSGYGPAPAIEWTYRRNAQRYRDLYYRLHAEVD  307

Query  304  GPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARIGEK  363
            GPIDRV RAVT+ GS DA+LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR+G +
Sbjct  308  GPIDRVRRAVTDNGSRDAVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIARVGVR  367

Query  364  ATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGASAD  423
             TQ R V APTSHVDLVPTLL AAG+DVD VAA LAESFSEVH LPGRDLMP+VDGA+AD
Sbjct  368  TTQRRVVEAPTSHVDLVPTLLGAAGIDVDAVAATLAESFSEVHRLPGRDLMPIVDGAAAD  427

Query  424  EGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVDDTD  483
            E RA+YLMTRDN+LEGD+GAS L+RQL R VNPPAPLRI++PAH A+NFEGLVVRVD+  
Sbjct  428  ETRAVYLMTRDNMLEGDSGASGLARQLKRTVNPPAPLRIRIPAHTASNFEGLVVRVDEAT  487

Query  484  AAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADPIEA  543
            AAGG GHLWKLVRTFDDP TWTEPGVRHLA NG+GG+AYRT P+DDQWELYDLT DPIEA
Sbjct  488  AAGGGGHLWKLVRTFDDPGTWTEPGVRHLAANGLGGEAYRTSPVDDQWELYDLTTDPIEA  547

Query  544  YNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYA  581
             NRWTDP LH+LRQHLR  LK  RA S+PERN PWPYA
Sbjct  548  ANRWTDPLLHDLRQHLRTQLKHVRASSIPERNNPWPYA  585


>gi|296166344|ref|ZP_06848780.1| sulfatase [Mycobacterium parascrofulaceum ATCC BAA-614]
 gi|295898307|gb|EFG77877.1| sulfatase [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=600

 Score =  932 bits (2409),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 472/581 (82%), Positives = 520/581 (90%), Gaps = 0/581 (0%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M++RPD++++MTDEERA PPYE+ +VLAWR R+LTGRRWF+EHG+SF RHYTGSLACVPS
Sbjct  1    MSDRPDVVVIMTDEERAAPPYEAPDVLAWRARTLTGRRWFEEHGVSFARHYTGSLACVPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPT+FTG YPD+HGVTQTDGIGK  DDSR+RWLR GEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  61   RPTLFTGHYPDVHGVTQTDGIGKTADDSRMRWLRQGEVPTLGNWFRAAGYDTHYDGKWHI  120

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHAD+ DPATG PLATND  G VD+AAVRRYL+ADPLGP+GFSGWVGPEPHGA LA++G 
Sbjct  121  SHADITDPATGRPLATNDKNGAVDAAAVRRYLEADPLGPFGFSGWVGPEPHGAALADAGV  180

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+ADRVVAWL +RYARRR GD AA+RPFLLVASFVNPHDIVLFPAWV RSP++PSP
Sbjct  181  RRDPLIADRVVAWLADRYARRRDGDPAALRPFLLVASFVNPHDIVLFPAWVRRSPVEPSP  240

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPP VPA PTADEDLSTKPAAQ+A+REAYYSGYG    V R Y RNAQRYRDLYYRLHA
Sbjct  241  LDPPAVPAPPTADEDLSTKPAAQIAFREAYYSGYGPAPAVDRTYGRNAQRYRDLYYRLHA  300

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            EVDGPIDRV RAVTEGGS +A+LVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR 
Sbjct  301  EVDGPIDRVRRAVTEGGSANAVLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARS  360

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G++ T+PR V+APTSHVDLVPTLL+AAGVDV  VA  LA SFSEVH LPGRDLM VVDGA
Sbjct  361  GDRVTRPRRVTAPTSHVDLVPTLLAAAGVDVAAVADTLARSFSEVHRLPGRDLMAVVDGA  420

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
             ADE RA+YLMTRDN+LEGD+GAS L+R+L R V+PPAPLRI+VPAH A+NFEGLVV VD
Sbjct  421  PADEARAVYLMTRDNMLEGDSGASGLARRLKRTVDPPAPLRIRVPAHTASNFEGLVVSVD  480

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            D  A GGAGHLWKLVRTFDDP+TWTEPGVRHLA NG+GG+AYRT PLDDQWELYDLT DP
Sbjct  481  DATAGGGAGHLWKLVRTFDDPSTWTEPGVRHLAANGLGGEAYRTSPLDDQWELYDLTVDP  540

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYA  581
            IEA NRW DP+LH LRQHLR  LK  RA +VPERN+PWPYA
Sbjct  541  IEAINRWADPELHALRQHLRTRLKHARADAVPERNRPWPYA  581


>gi|108797868|ref|YP_638065.1| sulfatase [Mycobacterium sp. MCS]
 gi|119866962|ref|YP_936914.1| sulfatase [Mycobacterium sp. KMS]
 gi|108768287|gb|ABG07009.1| sulfatase [Mycobacterium sp. MCS]
 gi|119693051|gb|ABL90124.1| sulfatase [Mycobacterium sp. KMS]
Length=598

 Score =  931 bits (2407),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 479/602 (80%), Positives = 527/602 (88%), Gaps = 11/602 (1%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M+N PDI+I+MTDEERAVPPYE+ EVLAWR R+L  R+WFD+HG+SF RHYTGSLACVPS
Sbjct  1    MSN-PDIVILMTDEERAVPPYETPEVLAWRDRTLPCRKWFDDHGVSFGRHYTGSLACVPS  59

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            RPTIFTGQYPDLHGVTQTDGIGK + DSR+RWLR GEVPTLGNWFRAAGYDTHYDGKWHI
Sbjct  60   RPTIFTGQYPDLHGVTQTDGIGKTYGDSRMRWLRPGEVPTLGNWFRAAGYDTHYDGKWHI  119

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            SHAD+ DPATG PL TND++GVVD+ AVRRYLDADPL PYGFSGWVGPEPHGA L+NSGF
Sbjct  120  SHADVTDPATGLPLDTNDDDGVVDADAVRRYLDADPLAPYGFSGWVGPEPHGAALSNSGF  179

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
            RRDPL+A RVVAWL +RYARRRAGD  A+RPFLLVASFVNPHDIVLFP WV RSP+KPSP
Sbjct  180  RRDPLIAARVVAWLEDRYARRRAGDPQALRPFLLVASFVNPHDIVLFPQWVRRSPVKPSP  239

Query  241  LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHA  300
            LDPPHVPAAPTADEDLSTKPAAQ+A+REAYYSGYG   ++ R Y RNAQ+YRDLYYRLHA
Sbjct  240  LDPPHVPAAPTADEDLSTKPAAQIAFREAYYSGYGPAAVMERTYRRNAQQYRDLYYRLHA  299

Query  301  EVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            +VDGP++RV RAV E GS+DA+LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR 
Sbjct  300  QVDGPLERVRRAVVE-GSQDAVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIART  358

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G  AT  RTV+APTSHVDLVPTLLSAAGVDV   AA LAESF+EVHPLPGRDLMPVVDGA
Sbjct  359  GANATAARTVTAPTSHVDLVPTLLSAAGVDVAAAAATLAESFTEVHPLPGRDLMPVVDGA  418

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
            + DE RA+YLMTRDN+LEGD+GAS L+R+L R VNPP PLRI+VPAHVA+NFEGLV +VD
Sbjct  419  APDEDRAVYLMTRDNMLEGDSGASGLARKLKRTVNPPGPLRIRVPAHVASNFEGLVTQVD  478

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
                    GHLWKLVR+FDDPATWTEPGVRHLA NG+GG+AYR+ PLDDQWELYDLTADP
Sbjct  479  --------GHLWKLVRSFDDPATWTEPGVRHLAANGVGGEAYRSSPLDDQWELYDLTADP  530

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGASN-GLVRRVLG  599
             EA NRW DP L ELR HLR  LK  R  S+PERNQPWPYA R PP+G +  GLVRR LG
Sbjct  531  TEAVNRWPDPSLDELRAHLRRQLKHVRTESIPERNQPWPYAVRRPPTGGARVGLVRRALG  590

Query  600  RF  601
            R 
Sbjct  591  RL  592


>gi|145222959|ref|YP_001133637.1| sulfatase [Mycobacterium gilvum PYR-GCK]
 gi|315443421|ref|YP_004076300.1| arylsulfatase A family protein [Mycobacterium sp. Spyr1]
 gi|145215445|gb|ABP44849.1| sulfatase [Mycobacterium gilvum PYR-GCK]
 gi|315261724|gb|ADT98465.1| arylsulfatase A family protein [Mycobacterium sp. Spyr1]
Length=604

 Score =  902 bits (2332),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 470/602 (79%), Positives = 511/602 (85%), Gaps = 4/602 (0%)

Query  2    ANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSR  61
            A RPD++IVMTDEERA+PPYES  V  WR  +LTGRRWF+EHG+SFTRHYTGSLACVPSR
Sbjct  3    AQRPDVVIVMTDEERAIPPYESDRVRTWRDETLTGRRWFEEHGVSFTRHYTGSLACVPSR  62

Query  62   PTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHIS  121
            PTIFTG YPDLHGVTQTDGIGK  DDSRLRWLR GEVPTLGNWFRAAGYDTHYDGKWHIS
Sbjct  63   PTIFTGHYPDLHGVTQTDGIGKSHDDSRLRWLRRGEVPTLGNWFRAAGYDTHYDGKWHIS  122

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
            HADL DP+TG PLATND++GVVD  AV+RYLDADPL PYGFSGWVGPEPHGAGLAN+G R
Sbjct  123  HADLTDPSTGRPLATNDSDGVVDPGAVKRYLDADPLAPYGFSGWVGPEPHGAGLANAGIR  182

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSPL  241
            RDPL+ADRVVAWLT RYA R AGD+AA+RPFLLVASFVNPHDIVLFPAW  R+PL PSPL
Sbjct  183  RDPLIADRVVAWLTARYAARAAGDSAALRPFLLVASFVNPHDIVLFPAWARRNPLSPSPL  242

Query  242  DPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHAE  301
            DPP VP APTADEDLSTKPAAQ+A+REAYYSGYG    + R Y RNAQRYRDLYYRLHAE
Sbjct  243  DPPSVPPAPTADEDLSTKPAAQIAFREAYYSGYGPAGSIERTYRRNAQRYRDLYYRLHAE  302

Query  302  VDGPIDRVGRAVTEG-GSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARI  360
            VD PIDRV RAVT+G G+   +LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIAR 
Sbjct  303  VDEPIDRVRRAVTDGAGAHPTVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIART  362

Query  361  GEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGA  420
            G  AT  RTV+APTSHVDLVPTLL+AAG+D + VAA L ESF+EVHPLPGRDLMPVVDGA
Sbjct  363  GPDATTARTVTAPTSHVDLVPTLLAAAGIDAESVAATLGESFTEVHPLPGRDLMPVVDGA  422

Query  421  SADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVRVD  480
             ADE R +YLMTRDNVLEGDTGAS L+R L      PAPLRI+VPAH AANFEGLV+RV 
Sbjct  423  PADEDRPVYLMTRDNVLEGDTGASGLARALRLTSRVPAPLRIRVPAHTAANFEGLVLRVP  482

Query  481  DTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTADP  540
            +T AAGG GHLWKLVR+FDDP TWTEPGVR LA +G+GG  YR++PLDDQWELYDLT DP
Sbjct  483  ETSAAGGGGHLWKLVRSFDDPGTWTEPGVRQLAADGVGGPTYRSEPLDDQWELYDLTDDP  542

Query  541  IEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSG---ASNGLVRRV  597
            IE  NRW DP LH LR +LR  LK  RA S+PERNQPWPYA R PP          +RR+
Sbjct  543  IEQTNRWPDPALHALRAYLRTQLKHARAQSIPERNQPWPYARRQPPPARRWTPGRALRRL  602

Query  598  LG  599
            LG
Sbjct  603  LG  604


>gi|120405228|ref|YP_955057.1| sulfatase [Mycobacterium vanbaalenii PYR-1]
 gi|119958046|gb|ABM15051.1| sulfatase [Mycobacterium vanbaalenii PYR-1]
Length=607

 Score =  892 bits (2305),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 466/591 (79%), Positives = 503/591 (86%), Gaps = 4/591 (0%)

Query  3    NRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRP  62
             RPDI++VMTDEERA PPYE   V AWR R+L GRRWFDE+G+SF RHYTGSLACVPSRP
Sbjct  4    ERPDIVVVMTDEERATPPYEPDTVRAWRSRTLGGRRWFDENGVSFLRHYTGSLACVPSRP  63

Query  63   TIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISH  122
            TIFTGQYPDLHGVTQTDGIGK  DDSRLRWLR GEVPTLGNW RAAGYDTHYDGKWHISH
Sbjct  64   TIFTGQYPDLHGVTQTDGIGKAHDDSRLRWLRRGEVPTLGNWLRAAGYDTHYDGKWHISH  123

Query  123  ADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRR  182
            ADL DP TG  L TND++GVVD AAV RYL+ADPL PYGFSGWVGPEPHGA L+N+G RR
Sbjct  124  ADLIDPGTGRSLDTNDDDGVVDPAAVHRYLEADPLSPYGFSGWVGPEPHGAKLSNAGIRR  183

Query  183  DPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSPLD  242
            DPL+ADRVVAWL +RYARRRAGD  AMRPFLLVASFVNPHDIVLFPAW  R+PL  SPLD
Sbjct  184  DPLIADRVVAWLKDRYARRRAGDPDAMRPFLLVASFVNPHDIVLFPAWARRNPLPASPLD  243

Query  243  PPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHAEV  302
            PP VPAAPTADEDLSTKPAAQ+A+REAYYSGYG    + R Y RNAQRYRDLYYRLHAEV
Sbjct  244  PPPVPAAPTADEDLSTKPAAQIAFREAYYSGYGPAWSIERTYRRNAQRYRDLYYRLHAEV  303

Query  303  DGPIDRVGRAVTEGGS----EDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIA  358
            D PIDRV RAVTEGGS    +D +LVRT+DHGDLLGAHGGLHQKWFNLYDEATRVPFVIA
Sbjct  304  DTPIDRVRRAVTEGGSGDGPDDTVLVRTADHGDLLGAHGGLHQKWFNLYDEATRVPFVIA  363

Query  359  RIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVD  418
            R+G + T  RTV+APTSHVDLVPTLLSAAGVDVD  A  LAESFSEVHPLPG DLMPVVD
Sbjct  364  RVGARPTTARTVTAPTSHVDLVPTLLSAAGVDVDAAATVLAESFSEVHPLPGSDLMPVVD  423

Query  419  GASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGLVVR  478
            GA AD+ R +YLMTRDNVLEGDTGAS L+R L      PAPLRI++PAH AANFEGLV+R
Sbjct  424  GAPADDHRCVYLMTRDNVLEGDTGASGLARALKLTSKVPAPLRIRIPAHTAANFEGLVIR  483

Query  479  VDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYDLTA  538
            VD+  A GG GHLWKLVR+FDDP TWTEPGVRHLA +G+GG  YRTDPLDDQWELYDLT 
Sbjct  484  VDEDAAPGGRGHLWKLVRSFDDPGTWTEPGVRHLAADGIGGPMYRTDPLDDQWELYDLTD  543

Query  539  DPIEAYNRWTDPQLHELRQHLRMLLKQQRAVSVPERNQPWPYAHRLPPSGA  589
            DP+E +NRWTDP LHELR +LR  LK  RA S+PERN+PWPY  R PP  A
Sbjct  544  DPVEQHNRWTDPDLHELRAYLRAQLKSVRAESIPERNRPWPYVRRQPPQPA  594


>gi|295704481|ref|YP_003597556.1| sulfatase family protein [Bacillus megaterium DSM 319]
 gi|294802140|gb|ADF39206.1| sulfatase family protein [Bacillus megaterium DSM 319]
Length=580

 Score =  373 bits (958),  Expect = 5e-101, Method: Compositional matrix adjust.
 Identities = 216/590 (37%), Positives = 317/590 (54%), Gaps = 56/590 (9%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M  +P+ ++++ DEER  P YES E+  WR++ L    +  + G+ FTRHY GS AC PS
Sbjct  1    MQKQPNFLLIIVDEERFPPLYESKELKKWRKKHLKAHEFLKQQGMEFTRHYVGSTACSPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T+FTGQY  LHGV QT GI K+  D  + WL+   VPT+G +F+ AGY T Y GKWHI
Sbjct  61   RATLFTGQYSSLHGVMQTQGIAKKSQDPDMFWLQPNTVPTMGEYFKKAGYQTFYKGKWHI  120

Query  121  SHADLEDPATGAPLATNDNE-GVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSG  179
            S  ++  P T    ++   + G+ D      Y  A+ L  +GFSGW+GPEP G    NSG
Sbjct  121  SDENILIPGTHNVFSSYQMQTGIPDPEKECLYQRANKLEKFGFSGWIGPEPEGRNPRNSG  180

Query  180  FR-------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVW  232
                     RD + A+ ++  +      +R    A  +P+L+VASFVNPHDI LF A   
Sbjct  181  SSAAIGVSGRDEIYAEEIIELI------QRLEHQAPAQPWLMVASFVNPHDIALFGAITK  234

Query  233  RSPLKPSPLDP--PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQR  290
              P+    +D   P++   PT  E L TKP+AQ +Y+  Y      T        +N+  
Sbjct  235  HLPMFQFSIDETIPNIDPPPTIREPLQTKPSAQSSYKYIYPKALQPT--------QNSSF  286

Query  291  YRDLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDE  349
            YR LYY+L   VD  I +V R + +    E+ +++ TSDHGDLLGAHGGLHQKW+N+Y+E
Sbjct  287  YRRLYYQLQKNVDKQILKVLRTLEQSSFYENTIIIFTSDHGDLLGAHGGLHQKWYNMYEE  346

Query  350  ATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLP  409
            +  VP +I       +  RT +  TSH+DL+PT+LS A +D   +   L +S +EVHP  
Sbjct  347  SIHVPLLIHHKRLFPSYQRTDTL-TSHLDLIPTMLSLANIDASAIQKQLQKSHTEVHPFV  405

Query  410  GRDLMPVVDGASA--DEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAH  467
            GRDL  ++ G +    E  AI+ MT D+  +G    + L      +V          P H
Sbjct  406  GRDLSGILRGETCAEQEETAIFFMTDDDPTKGLHQTNFLGESYPSVVQ---------PNH  456

Query  468  VAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAY-----  522
            + A      V V+ + AAG    +WK  R  D+   W+    +    +G   + Y     
Sbjct  457  IQA------VIVEFSSAAGKE--IWKYARYHDNLQFWSADDEKDEFIHGEQYEGYSVNFT  508

Query  523  --RTDPLDDQWELYDLTADPIEAYN----RWTDPQLHELRQHLRMLLKQQ  566
              +T+ + DQ E+Y+LT DP+E  N     ++  +  ++++ L ++LK+Q
Sbjct  509  TLKTNRVPDQIEMYNLTKDPLETVNLAHLYFSTNETRKIQRQLDVILKEQ  558


>gi|345444703|gb|AEN89720.1| Arylsulfatase A family protein [Bacillus megaterium WSH-002]
Length=580

 Score =  369 bits (947),  Expect = 8e-100, Method: Compositional matrix adjust.
 Identities = 221/590 (38%), Positives = 316/590 (54%), Gaps = 56/590 (9%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M  +P+ +++M DEER  P YES EV  WR++ L    +  +HG++FTRHY GS AC PS
Sbjct  1    MKEQPNFLLIMVDEERFPPVYESKEVKKWRKKHLKAHEFLKQHGMAFTRHYVGSTACSPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T+FTGQYP LHGVTQT GI K+  D  + WL+   VPT+G++F+ AGY T Y GKWHI
Sbjct  61   RATLFTGQYPSLHGVTQTQGIAKKSQDPDMFWLQPNTVPTMGDYFKQAGYQTFYKGKWHI  120

Query  121  SHADLEDPATGAPLATNDNE-GVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSG  179
            S  ++  P T    ++   + G  D    R Y  A+ L  +GFS W+GPEP G    NSG
Sbjct  121  SDENILIPGTHNVFSSYQMQTGRPDPEKERLYQRANKLEKFGFSEWIGPEPEGRNPHNSG  180

Query  180  FR-------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVW  232
                     RD + A+ ++  L +R   ++       +P+L+VASFVNPHDI LF A   
Sbjct  181  SSAAIGVSGRDKIYAEEIIE-LIQRLEYQKTA-----QPWLMVASFVNPHDIALFGAITK  234

Query  233  RSPLKPSPLDP--PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQR  290
              P+    +D   P +   PT  E L TKP+AQ +Y+  Y      T         N+  
Sbjct  235  HLPMFQFSIDETIPDIEPPPTIRESLQTKPSAQSSYKYIYPKALQPT--------PNSSF  286

Query  291  YRDLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDE  349
            YR LYY+L   VD  I ++   + +    E+ +++ TSDHGDLLGAHGGLHQKW+N+Y+E
Sbjct  287  YRRLYYQLQKNVDKQILKILGTIEQSSFYENTIIIFTSDHGDLLGAHGGLHQKWYNMYEE  346

Query  350  ATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLP  409
            +  VPF+I       +  RT    TSHVDL+PT+LS A +D   V   L +S +EVHPL 
Sbjct  347  SIHVPFLIHNKHLFPSYQRT-DLLTSHVDLIPTMLSLANIDASAVQKQLQKSHTEVHPLV  405

Query  410  GRDLMPVVDGASA--DEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAH  467
            GRDL   + G +    E   I+ MT D+  +G    + L          P+   +  P H
Sbjct  406  GRDLSGTLLGETTLHQEKTPIFFMTDDDPTKGLHQTNFLKESY------PS---VDQPNH  456

Query  468  VAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAY-----  522
            + A      V V+ + AAG    +WK  R  D+   W+         +    + Y     
Sbjct  457  IQA------VIVEFSSAAGKE--IWKYARYHDNLQFWSASDEPDEVVHREQYEGYPVSLT  508

Query  523  --RTDPLDDQWELYDLTADPIEAYN----RWTDPQLHELRQHLRMLLKQQ  566
              +T  + DQ E+Y+LT DP+E  N     ++  +  ++++ L ++LK+Q
Sbjct  509  TPKTTGVPDQIEMYNLTKDPLETVNLAHPYFSTNETRKIQRQLDVILKEQ  558


>gi|311032599|ref|ZP_07710689.1| sulfatase [Bacillus sp. m3-13]
Length=594

 Score =  368 bits (945),  Expect = 1e-99, Method: Compositional matrix adjust.
 Identities = 207/588 (36%), Positives = 312/588 (54%), Gaps = 49/588 (8%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            +P+ +I+M D++R    YE+ E+  W++ +L  +    ++G  FT+HY GS AC PSR T
Sbjct  10   KPNFLILMVDQQRYPSVYENEELRRWQRENLQTQELLKKNGFEFTKHYVGSTACSPSRTT  69

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            ++TGQYP LHGVTQT G  K   D  + WL    VPT+G +FRAAGY T + GKWH S  
Sbjct  70   LYTGQYPSLHGVTQTSGAAKTSFDPDMFWLDPNTVPTMGEYFRAAGYKTFWKGKWHASEE  129

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
            D+  P T   LA+  + G  D   V +YL ++ L  YGF GWVGPEPHG+   NS     
Sbjct  130  DILIPGTKNSLASYTSTGRPDKRNVEKYLASNRLSDYGFDGWVGPEPHGSSPRNSASSAA  189

Query  184  PLVADRVVAWLTERYARRRAGDTAAMR---PFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
              V+ R V +  E     ++ +    R   P+ ++ S VNPHDI ++  +   SP+    
Sbjct  190  IGVSGRDVIYAQETVELLQSLENECHRDSSPWFVMCSLVNPHDIAIYGIYTELSPMFNFE  249

Query  241  LDP--PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRL  298
            +DP  P +P +PT +E LSTKP+AQ +YRE Y               R+   YR LYY L
Sbjct  250  IDPSVPFIPPSPTDNESLSTKPSAQESYREIYPKAL--------QPIRDNVSYRQLYYSL  301

Query  299  HAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVI  357
              + D  + +V R + +    E+ +++  SDHG+LLGAHGGL+QKW N Y+E+  VP +I
Sbjct  302  QKKADQELGKVFRTLQDSTFYENTIVIFLSDHGELLGAHGGLYQKWNNTYEESIHVPLII  361

Query  358  ARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVV  417
                +  +   T    TSHVD++PT+L  A +D + +   L    +EVHPL GRDL P++
Sbjct  362  HS-PKLFSGKETTDMLTSHVDVLPTMLGLADIDAEEIQQQLKRDHTEVHPLVGRDLTPLL  420

Query  418  DGASA--DEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGL  475
             G +        +Y M+ D+V +G    S        ++ P              + E +
Sbjct  421  MGKNKFYRANEPLYFMSDDDVTQGPNQVSATGEPYHAVIQP-------------NHMEAI  467

Query  476  VVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGD-------------AY  522
            + ++  +    G   +WKL R +D P  W+ PGV ++ T  +                + 
Sbjct  468  ITKI--STGENGTKEIWKLTRYYDSPQFWSNPGVENVTTTQVSKTSTGEHIDCALCIMST  525

Query  523  RTDPLDDQWELYDLTADPIE----AYNRWTDPQLHELRQHLRMLLKQQ  566
            +T P+ DQ+ELY+LT DP+E    AY+    P+   +++ L + L++Q
Sbjct  526  KTRPVPDQYELYNLTKDPLEESNLAYSDNRTPETMAIQKLLMLGLEEQ  573


>gi|338535752|ref|YP_004669086.1| sulfatase family protein [Myxococcus fulvus HW-1]
 gi|337261848|gb|AEI68008.1| sulfatase family protein [Myxococcus fulvus HW-1]
Length=563

 Score =  368 bits (945),  Expect = 2e-99, Method: Compositional matrix adjust.
 Identities = 234/576 (41%), Positives = 300/576 (53%), Gaps = 49/576 (8%)

Query  2    ANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSR  61
              RP+ +I+ TDEER  PPYE+ E   +R  +    +   EHGI F RH+T S AC PSR
Sbjct  5    GKRPNFLIITTDEERFPPPYENEEARRFRVENDRVGQELREHGIEFLRHHTASTACAPSR  64

Query  62   PTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHIS  121
             T++TGQYP LHGV+QT GIGK   D  + WL    VPTLG +FR  GY THY GKWH+S
Sbjct  65   TTLYTGQYPSLHGVSQTPGIGKSSFDPDMYWLAPNTVPTLGEYFRKGGYQTHYRGKWHLS  124

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
              D+  P T  PL +ND  G V    V  Y  +  L  +GFSGW+GPEPHG+  AN G  
Sbjct  125  DEDILVPGTQTPLMSNDATGDVYPERVALYEQSGRLEKFGFSGWIGPEPHGSSQANDGTV  184

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLK--PS  239
            RDP  AD+V   L++   +  AG+TA   P+LLV+SFVNPHDIV F    W +      +
Sbjct  185  RDPGFADQVCRLLSDLDRQATAGETA---PWLLVSSFVNPHDIV-FSGLPWFTVFNNLQA  240

Query  240  PLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNY--ARNAQRYRDLYYR  297
                P V  APTA E L +KP  Q  Y   Y           R Y   R+   YR LYY 
Sbjct  241  AGKLPDVEPAPTAGESLESKPRCQKDYVYTY----------PRMYLPQRDTASYRQLYYF  290

Query  298  LHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFV  356
            L AEV   I RV   +      E+ ++V TSDHG++LGAHGG+ QKW+N Y E   VPFV
Sbjct  291  LMAEVSKHIHRVYEHLKRTSFFENTIVVLTSDHGEMLGAHGGMMQKWYNAYQETLHVPFV  350

Query  357  IARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPV  416
            I+  G    +PR     TSHVDLVPTLL  AG+DV+     LA   SE  PL GRDL  +
Sbjct  351  ISNPG-LFPEPRRTELVTSHVDLVPTLLGLAGIDVEAARRELARDHSEAQPLVGRDLSGL  409

Query  417  VDGASADEGRAIYLMTRDNVLEG-DTGASLLSRQLGRIVNPPAPLRIKVPAHVAANFEGL  475
            V G   +    IY MT DNV  G     +L  ++   ++ P              + E +
Sbjct  410  VLGREPERHEPIYFMTDDNVESGLQMTNNLTGQEYAGVIQP-------------KHIETV  456

Query  476  VVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQWELYD  535
            V R+ +         LWK     D+P  +           G   +      +  ++E YD
Sbjct  457  VTRLPELT----GDTLWKFSCYSDNPRFFA-------GAPGNTDEVATARFIPREYECYD  505

Query  536  LTADPIEAYNRWT----DPQLHELRQHLRMLLKQQR  567
            LT DP+E  NR +     P   ++R  L  +LK+QR
Sbjct  506  LTEDPLETRNRCSAVAAQPLPQDVRDALEKVLKEQR  541


>gi|308067126|ref|YP_003868731.1| arylsulfatase A [Paenibacillus polymyxa E681]
 gi|305856405|gb|ADM68193.1| Arylsulfatase A [Paenibacillus polymyxa E681]
Length=583

 Score =  362 bits (928),  Expect = 1e-97, Method: Compositional matrix adjust.
 Identities = 227/593 (39%), Positives = 314/593 (53%), Gaps = 54/593 (9%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            +  +P+ ++++ DEER    YE+ E+  W +++L  +     +G+ F RHY GS AC PS
Sbjct  10   LLEQPNFLVLLVDEERYPAVYENPEIKEWSRQNLITQGLLRSYGLEFHRHYIGSAACSPS  69

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T+FTG YP LHGVTQTDG+ K   DS + WL    VPT+G++FRAAGY T+Y GKWHI
Sbjct  70   RTTLFTGHYPSLHGVTQTDGVAKEAFDSDMFWLDRNTVPTMGDYFRAAGYQTYYKGKWHI  129

Query  121  SHADLEDPATGAPLAT-NDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSG  179
            S  D+  P T   L + +   GV        Y  AD L  +GFS W+GPEPHG    NSG
Sbjct  130  SDEDIIIPGTHKALPSYHPVTGVPYRKREDLYNQADRLDQFGFSRWIGPEPHGRNPRNSG  189

Query  180  FR-------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVW  232
                     RD + A   V  L E   RR+  D  A +P+L+VASFVNPHDIVL+ A   
Sbjct  190  SSAAFGLSGRDEVYAADTVE-LIEALDRRKRNDNHA-KPWLVVASFVNPHDIVLYGAITA  247

Query  233  RSPLKPSPLDP-PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRY  291
            R P+    ++P P V   PT +E L+TKP  Q +YR+ Y     L  ++ + +      Y
Sbjct  248  RLPMFRFEVEPMPAVAHPPTINEWLATKPRCQASYRDIY--PRALQPIIDQPF------Y  299

Query  292  RDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNLYDEA  350
            R LYY+L    D  + +V  A+T     D  +++ TSDHGDLLGAHG LHQK++  Y+E 
Sbjct  300  RKLYYQLQKNADRQMFKVFEALTRSSFYDNTIVIFTSDHGDLLGAHGNLHQKFYCAYEEI  359

Query  351  TRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPG  410
              VP VI        Q ++    T+HVDL+PT+L  A VD+  + + L  SF+E  PL G
Sbjct  360  VHVPLVIHN-QHLFPQYKSEHILTNHVDLLPTMLGLANVDITAIQSRLQNSFTEARPLVG  418

Query  411  RDLMPVVDGASADE--GRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAHV  468
            RDL PV+ G    E   + +Y MT D+V  G    S+L          P P  I+ P H+
Sbjct  419  RDLTPVIRGQDQGEIADQPVYFMTDDDVTRGQRQISVLGE--------PYPSVIQ-PNHI  469

Query  469  AANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMG----------  518
                  L           G   LWK  R FD    W++PGV  +    +G          
Sbjct  470  ETVIAPL--------QRDGVQELWKFSRYFDSAQFWSQPGVMDVTIRPVGDHTCGPYSQW  521

Query  519  GDAYRTDPLDDQWELYDLTADPIEAYN----RWTDPQLHELRQHLRMLLKQQR  567
                + +P  D++ELY+LT+DP+E  N     +   Q   ++Q +  LL++QR
Sbjct  522  ATQVKIEPDHDEYELYNLTSDPLEVCNLAHPAFATQQTRSIQQQMMHLLEEQR  574


>gi|258515285|ref|YP_003191507.1| sulfatase [Desulfotomaculum acetoxidans DSM 771]
 gi|257778990|gb|ACV62884.1| sulfatase [Desulfotomaculum acetoxidans DSM 771]
Length=601

 Score =  358 bits (920),  Expect = 1e-96, Method: Compositional matrix adjust.
 Identities = 213/595 (36%), Positives = 305/595 (52%), Gaps = 55/595 (9%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            + ++P+ ++++ D++R    YE+ E+  WR+  L  + +    G  F  HY GS AC PS
Sbjct  15   LCHKPNFLVILVDQQRYAVSYENEEIKVWRKTRLKAQEFLKSRGFEFKNHYAGSAACCPS  74

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T++TGQYP LHGV+QTDG  K   D  + WL    VPT+G++FR AGY T++ GKWH 
Sbjct  75   RATLYTGQYPSLHGVSQTDGAAKGAYDPDMFWLNPNTVPTMGDYFRTAGYQTYWKGKWHA  134

Query  121  SHADLEDPATGAP-LATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSG  179
            S AD+  P T  P L+ N   GV      + Y++A+ L  +GF+GW+GPEPHG    N+G
Sbjct  135  SAADILVPGTHKPFLSYNQGNGVPIPDNEKLYINANVLASFGFNGWIGPEPHGVNPRNTG  194

Query  180  FR-------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVW  232
                     RD + +   V  +          D    RP+L++ SFVNPHDI LF A   
Sbjct  195  SSAAAGLSGRDVVYSQDTVELIRVLEKEYNESDECRPRPWLIMCSFVNPHDIALFGAISG  254

Query  233  RSP---LKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQ  289
              P    K + L  P++  APTA E L TKP+AQ +YR  Y   Y    ++   +     
Sbjct  255  SLPQFNFKVN-LSVPYISPAPTASESLLTKPSAQSSYRRIY--AYAFQPLLDTLF-----  306

Query  290  RYRDLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYD  348
             YR LYY L  E D  I RV  A+ E     + +++ TSDHG+LLGAH GL QKW+  Y+
Sbjct  307  -YRQLYYSLEMEADTQICRVINALRETSFYNNTIIIFTSDHGELLGAH-GLFQKWYQAYE  364

Query  349  EATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPL  408
            E+  VP +I        +P +    TSHVD++PT+L  +G+D   +   LA S +EVH L
Sbjct  365  ESIHVPLIIHN-PTLFDKPESTDMLTSHVDILPTMLGISGLDTGAIHKVLANSHTEVHSL  423

Query  409  PGRDLMPVVDGAS--ADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPA  466
             GR+L P++   +   + G AIY MT DN+ +G    S                   VP 
Sbjct  424  VGRNLSPLLKSKTDFIEAGEAIYFMTDDNITKGLNQISFAG----------------VPY  467

Query  467  HVAANFEGLVVRVDDTDAA-GGAGHLWKLVRTFDDPATWTEPGVR-HLATNG-----MGG  519
            H  A    +   +       GG   +WK  R FD+P  W   G R     NG        
Sbjct  468  HSVAQPNSIETVIAALPTGRGGTKQIWKYSRYFDNPHFWNISGRRDQFVYNGPVRRKFNP  527

Query  520  DAYRTDPL---DDQWELYDLTADPIE----AYNRWTDPQLHELRQHLRMLLKQQR  567
              Y   P+    DQ+E+Y++T DP+E    +Y  + +    ++R+ L  LL++QR
Sbjct  528  CNYNDTPIRPQADQYEIYNITTDPLEIRNVSYESYNNRYFMQIREILNELLEEQR  582


>gi|288555022|ref|YP_003426957.1| sulfatase [Bacillus pseudofirmus OF4]
 gi|288546182|gb|ADC50065.1| sulfatase [Bacillus pseudofirmus OF4]
Length=558

 Score =  348 bits (894),  Expect = 1e-93, Method: Compositional matrix adjust.
 Identities = 210/558 (38%), Positives = 298/558 (54%), Gaps = 58/558 (10%)

Query  3    NRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRP  62
             +P+I+I+M D++R    YE+ EV  W + +L  ++   ++G+ FT HY  S AC PSR 
Sbjct  7    KKPNILILMVDQQRYPAVYETNEVKKWCEENLCAQQMLKKNGMVFTNHYAASTACSPSRT  66

Query  63   TIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISH  122
            T++TGQYP LHGVTQT G+ K   D  + WL A  VPT+G++FR AGY+T + GKWH S 
Sbjct  67   TLYTGQYPSLHGVTQTTGVAKGAFDPDVFWLDANTVPTMGHYFRTAGYETFWKGKWHASD  126

Query  123  ADLEDPAT-GAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
             D+  P T  A  + N + GV +   V+ Y  A+ L  +GFSGW+GPEPHG    NS   
Sbjct  127  EDIFIPGTHDAYSSYNLDTGVPEKDKVKMYKQANRLDAFGFSGWIGPEPHGTDPRNSASS  186

Query  182  -------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRS  234
                   RD + A+  V  L  +   +++      +P+ ++ S VNPHDI L+       
Sbjct  187  AATGMSGRDQVYAEDTVKLL--QALDKKSQKEEGHKPWFVMCSLVNPHDIALYGVLTAVQ  244

Query  235  PLKPSPLDP--PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYR  292
            P     +D   P +P APT +E LSTKP AQ +YR  Y     L  ++  N+      YR
Sbjct  245  PNYHFEVDQTLPFIPPAPTVEESLSTKPRAQESYRYTY--PRALQPIIDNNF------YR  296

Query  293  DLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEAT  351
             LYY L  + D  +++V +A+ +    ED +++ TSDHG+LLGAHGGLHQKW+N+Y+E+ 
Sbjct  297  QLYYSLQKKADQEMEKVLKALQQSSFYEDTIVLFTSDHGELLGAHGGLHQKWYNMYEESI  356

Query  352  RVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGR  411
             VP +I        +P      TSHVD++PTLL  AGV V+ V A L+++ +EV PL GR
Sbjct  357  HVPLIIHN-PLLFNEPEETGMLTSHVDVLPTLLGLAGVKVEKVQAKLSKNHTEVRPLVGR  415

Query  412  DLMPVVDGAS----ADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAH  467
            DL  ++ G++    ADE   IY MT D+V  G    +        ++          P H
Sbjct  416  DLSKLIKGSNEFHEADE--PIYFMTDDDVTRGLNQTTARGEPYQSVLQ---------PNH  464

Query  468  VAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPL  527
            + A    L          G    +WK  R FD P +               GD  R   L
Sbjct  465  IEAVIATL------PSGKGSKKEVWKYARYFDIPQS--------------DGDQERKHVL  504

Query  528  DDQWELYDLTADPIEAYN  545
            D+ +ELY+LT DP+E  N
Sbjct  505  DE-FELYNLTQDPLEEKN  521


>gi|226315218|ref|YP_002775114.1| sulfatase [Brevibacillus brevis NBRC 100599]
 gi|226098168|dbj|BAH46610.1| putative sulfatase [Brevibacillus brevis NBRC 100599]
Length=553

 Score =  342 bits (878),  Expect = 8e-92, Method: Compositional matrix adjust.
 Identities = 218/587 (38%), Positives = 292/587 (50%), Gaps = 70/587 (11%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            M  RP+I+ ++ D+ER  P YE   +  WR+ +L    +  EHG+ F RHY GS AC PS
Sbjct  1    MRRRPNILFIIVDQERFPPVYEEPAIREWREDTLHAHAFLREHGLEFKRHYVGSTACCPS  60

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T+FTGQYP LHGVTQT G  KR  DS + WL    VPT+GN+FR AGY   Y GKWH 
Sbjct  61   RATLFTGQYPSLHGVTQTSGAAKRSADSDMFWLDCNTVPTMGNYFRQAGYRCFYKGKWHF  120

Query  121  SHADLEDPATGAPLATND-NEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSG  179
            S AD+  P T  P  +     GV D    R YL AD L  YGFS W+GPEPHG    NSG
Sbjct  121  SDADIWVPGTHVPTPSYTLGTGVPDPDKERLYLLADRLDGYGFSSWIGPEPHGIAPHNSG  180

Query  180  FR-------RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVW  232
                     RD + +  V+  L      +  G   +  P+L+VASFVNPHDI ++     
Sbjct  181  SSAAIGVNGRDVVYSSEVIELL--HALDQEKGSAESYHPWLIVASFVNPHDIAIYGDISA  238

Query  233  RSPLKPSPLDP--PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQR  290
             SP     +D   P V   PT  E L+TKP  Q +Y+E Y   +            +   
Sbjct  239  SSPFFRFHVDKSVPTVAPPPTQYESLATKPRCQTSYQEVYPQAF--------QPISDQAH  290

Query  291  YRDLYYRLHAEVDGPIDRVGRA-VTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDE  349
            YR LYY+L    D  + RV  A V      + ++V TSDHG+LLGAHG L+QKW+  Y+E
Sbjct  291  YRRLYYQLQKNADREVMRVLEALVASSFYPETLVVFTSDHGELLGAHGKLYQKWYCAYEE  350

Query  350  ATRVPFVIAR--IGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHP  407
            A  +P +I    +   AT    +   TSHVD++PTLL  AG D D +   L  + SEV P
Sbjct  351  AIHIPLIIHNPLLFPLATSTELL---TSHVDILPTLLGMAGADTDRLRDELTYTHSEVRP  407

Query  408  LPGRDLMPVVDGASADE--GRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVP  465
            L GRDL P++     D      IY MT D+V +G    ++  +    ++ P         
Sbjct  408  LVGRDLTPIILTHETDTIPSVPIYFMTDDDVTKGQHQVNVQHQPYDSVIPP---------  458

Query  466  AHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTD  525
                   E ++  ++    + G   LWK  R F                    GD Y  +
Sbjct  459  ----NRIETVIACMN----SAGHSALWKYSRYF-------------------AGDVY--N  489

Query  526  PLDDQWELYDLTADPIEAYNR----WTDPQLHELRQHLRMLLKQQRA  568
            P    +ELY+LT DP+E  N     + +     +R  + +LL++QRA
Sbjct  490  PTMTDYELYNLTTDPLETRNMVIPLYKNTHSERVRLKMELLLEEQRA  536


>gi|251794620|ref|YP_003009351.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247542246|gb|ACS99264.1| sulfatase [Paenibacillus sp. JDR-2]
Length=572

 Score =  335 bits (858),  Expect = 2e-89, Method: Compositional matrix adjust.
 Identities = 218/604 (37%), Positives = 311/604 (52%), Gaps = 63/604 (10%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            MA +P+I++++ DE R  P YE A++  WR+++L  ++W  ++G+ F RHY GS AC PS
Sbjct  3    MAEKPNILLLLVDEMRYPPLYEKADIRVWREQNLVTQQWLRDNGLEFHRHYIGSAACAPS  62

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T+FTG YP LHGVTQT+GI K+  DS + WL    VPT+G++FR AGY T Y GKWH+
Sbjct  63   RTTLFTGHYPSLHGVTQTNGIAKQAADSDMFWLDRNTVPTMGDYFREAGYRTFYKGKWHL  122

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAG------  174
            S+ D+  P T   L + +            Y +AD L  +GFS W+GPEPHG        
Sbjct  123  SYEDIIVPGTQQGLPSYNPATGYPDHNQDLYENADRLEAFGFSSWIGPEPHGRNPRDSGS  182

Query  175  -LANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWR  233
               +    RD   A   V  +      +  GD  +  P+L+V S VNPHDI L+     R
Sbjct  183  SAGSGASGRDEFYAAETVQLIEALEQNKLGGDDES--PWLIVTSLVNPHDITLYGDLTAR  240

Query  234  SP-----LKPSP-LDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARN  287
             P     + P P +DPP     PT  E+L TKP  Q +YR+ Y         V+     N
Sbjct  241  IPAFRFDVGPVPDIDPP-----PTRHENLHTKPRCQASYRDLY--------PVALQPITN  287

Query  288  AQRYRDLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNL  346
               YR LYY+L    D  + +V  A+      ED +++ TSDHGDLLG+HGGLHQK + +
Sbjct  288  EPFYRKLYYQLQKNADEQLRKVVEALARTSFYEDTIIILTSDHGDLLGSHGGLHQKMYCV  347

Query  347  YDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVH  406
            Y+E  RVP ++    ++  +PR+V + TSH+DL+PTLL  AG++ D +   L   FSE  
Sbjct  348  YEEVLRVPLLVCN-KKRFPEPRSVHSLTSHLDLLPTLLGLAGINSDEIRGRLDSRFSEAR  406

Query  407  PLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPA  466
            PL GR+L   +      E   +Y MT D+++ G    S         V  P P  +  P 
Sbjct  407  PLIGRNLAGAMSMQPEPEA-PVYFMTDDDIMRGQHQIS--------PVGIPYP-SVAQPN  456

Query  467  HVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGG-------  519
            H+      L           G    WKL R +D+P  W+EPG+  +    + G       
Sbjct  457  HIETVIAPLY--------RNGHKEYWKLSRYYDNPQFWSEPGILDVTYAPVKGGQDNKEI  508

Query  520  ---DAYRTDPLDDQWELYDLTADPIEAYN----RWTDPQLHELRQHLRMLLKQQ-RAVSV  571
                  RT P+ +++ELY LT DP+E  N     ++   + E    + +L +Q+ R    
Sbjct  509  AWASRVRTVPVQEEYELYSLTDDPLETRNLANPAYSATYMEEFALMMNLLTEQRSRKRLA  568

Query  572  PERN  575
            P RN
Sbjct  569  PGRN  572


>gi|77163732|ref|YP_342257.1| arylsulfatase A and related enzyme [Nitrosococcus oceani ATCC 19707]
 gi|76882046|gb|ABA56727.1| Arylsulfatase A-like enzyme [Nitrosococcus oceani ATCC 19707]
Length=620

 Score =  325 bits (833),  Expect = 1e-86, Method: Compositional matrix adjust.
 Identities = 211/600 (36%), Positives = 311/600 (52%), Gaps = 75/600 (12%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RP+I++++ DE R  P +E      +RQ  L  +      G+ F RHY  + AC PSR +
Sbjct  47   RPNILLMLVDEMRYPPVFEGLGAQQFRQTYLKTQNALRASGVEFHRHYAAATACAPSRAS  106

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            IFTG YP LHGVTQT G  K  +D  + WL    VPT+G++F+A GY T Y GKWH+S+A
Sbjct  107  IFTGHYPSLHGVTQTTGAAKEENDPDVFWLDPASVPTMGDYFQAGGYRTFYKGKWHVSNA  166

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR--  181
            DL+ P T   L + D++G  D    + YL+AD L  YGF GW+GPEPHG    N+G    
Sbjct  167  DLQIPGTHDQLLSYDDQGNPDPGKQQLYLEADRLADYGFEGWIGPEPHGKAPLNTGSSPA  226

Query  182  ----RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRS---  234
                RD   A +VV  + +    R +       P+L VAS VNPHDI L+  +V R    
Sbjct  227  QGQGRDVGFATQVVNLIQQLGTERHSA------PWLTVASLVNPHDIALW-GYVARHTGL  279

Query  235  ---------PLKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYA  285
                     P      DP  V  A T  +DL+TKP+ Q +Y+E+Y     +  +   +Y 
Sbjct  280  FNFTVEDIVPAFTELFDP--VMFAQTLADDLTTKPSCQQSYQESY--NEWMQGVPPHDYF  335

Query  286  RNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWF  344
            R        YY+LH  VD  + ++ +A+ +    D  +++ TSDHGDLLGAH  +HQKW+
Sbjct  336  R-------FYYQLHKNVDDELYKLYQALQQSPFYDNTIVIFTSDHGDLLGAHRYMHQKWY  388

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE  404
              YDEA RVP +I+       +PR++ + TSHVDL+PTLLS A +    +   +A+  S+
Sbjct  389  QAYDEAVRVPLIISN-PHLFPEPRSIDSVTSHVDLLPTLLSLARLKQARLRRKVAKGHSD  447

Query  405  VHPLPGRDLMPVVDGASADE-GRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIK  463
              PL GR+L  +V G +       +Y MT D++  G    + +    G ++         
Sbjct  448  PVPLVGRNLRRLVLGRNRRPVADPVYFMTDDDMSRGLDQENFIGIAYGSVIQ--------  499

Query  464  VPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPG----VRHLATNGM--  517
             P+HV    E ++V +D        G +WK  R FD+   W++P     V     N +  
Sbjct  500  -PSHV----ETVIVEID--------GEVWKYSRYFDNKQFWSDPSQPKDVVTQVENKLID  546

Query  518  ---------GGDAYRTDPLDDQWELYDLTADPIEAYNRWTDPQLHELRQHLRMLLKQQRA  568
                        +++ +P  D++E+Y++T DP+E  N + +     ++ HL  LL QQRA
Sbjct  547  PPAGTYDVNATQSFKYEPEPDEYEMYNVTQDPMELDNLYGNLVYAAMQTHLATLLDQQRA  606


>gi|254436243|ref|ZP_05049750.1| sulfatase, putative [Nitrosococcus oceani AFC27]
 gi|207089354|gb|EDZ66626.1| sulfatase, putative [Nitrosococcus oceani AFC27]
Length=621

 Score =  325 bits (833),  Expect = 1e-86, Method: Compositional matrix adjust.
 Identities = 211/600 (36%), Positives = 311/600 (52%), Gaps = 75/600 (12%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RP+I++++ DE R  P +E      +RQ  L  +      G+ F RHY  + AC PSR +
Sbjct  48   RPNILLMLVDEMRYPPVFEGLGAQQFRQTYLKTQNALRASGVEFHRHYAAATACAPSRAS  107

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            IFTG YP LHGVTQT G  K  +D  + WL    VPT+G++F+A GY T Y GKWH+S+A
Sbjct  108  IFTGHYPSLHGVTQTTGAAKEENDPDVFWLDPASVPTMGDYFQAGGYRTFYKGKWHVSNA  167

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR--  181
            DL+ P T   L + D++G  D    + YL+AD L  YGF GW+GPEPHG    N+G    
Sbjct  168  DLQIPGTHDQLLSYDDQGNPDPGKQQLYLEADRLADYGFEGWIGPEPHGKAPLNTGSSPA  227

Query  182  ----RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRS---  234
                RD   A +VV  + +    R +       P+L VAS VNPHDI L+  +V R    
Sbjct  228  QGQGRDVGFATQVVNLIQQLGTERHSA------PWLTVASLVNPHDIALW-GYVARHTGL  280

Query  235  ---------PLKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYA  285
                     P      DP  V  A T  +DL+TKP+ Q +Y+E+Y     +  +   +Y 
Sbjct  281  FNFTVEDIVPAFTELFDP--VMFAQTLADDLTTKPSCQQSYQESY--NEWMQGVPPHDYF  336

Query  286  RNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWF  344
            R        YY+LH  VD  + ++ +A+ +    D  +++ TSDHGDLLGAH  +HQKW+
Sbjct  337  R-------FYYQLHKNVDDELYKLYQALQQSPFYDNTIVIFTSDHGDLLGAHRYMHQKWY  389

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE  404
              YDEA RVP +I+       +PR++ + TSHVDL+PTLLS A +    +   +A+  S+
Sbjct  390  QAYDEAVRVPLIISN-PHLFPEPRSIDSVTSHVDLLPTLLSLARLKQARLRRKVAKGHSD  448

Query  405  VHPLPGRDLMPVVDGASADE-GRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIK  463
              PL GR+L  +V G +       +Y MT D++  G    + +    G ++         
Sbjct  449  PVPLVGRNLRRLVLGRNRRPVADPVYFMTDDDMSRGLDQENFIGIAYGSVIQ--------  500

Query  464  VPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPG----VRHLATNGM--  517
             P+HV    E ++V +D        G +WK  R FD+   W++P     V     N +  
Sbjct  501  -PSHV----ETVIVEID--------GEVWKYSRYFDNKQFWSDPSQPKDVVTQVENKLID  547

Query  518  ---------GGDAYRTDPLDDQWELYDLTADPIEAYNRWTDPQLHELRQHLRMLLKQQRA  568
                        +++ +P  D++E+Y++T DP+E  N + +     ++ HL  LL QQRA
Sbjct  548  PPAGTYDVNATQSFKYEPEPDEYEMYNVTQDPMELDNLYGNLVYAAMQTHLATLLDQQRA  607


>gi|149924951|ref|ZP_01913279.1| sulfatase [Plesiocystis pacifica SIR-1]
 gi|149814176|gb|EDM73791.1| sulfatase [Plesiocystis pacifica SIR-1]
Length=553

 Score =  292 bits (747),  Expect = 1e-76, Method: Compositional matrix adjust.
 Identities = 204/554 (37%), Positives = 283/554 (52%), Gaps = 55/554 (9%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            ++++++TD++RA P YE   +          R      G+SF +H   S ACVPSR ++F
Sbjct  4    NVVLIITDQDRARPSYERVAL------RCPARERLRASGLSFEQHRIASAACVPSRASMF  57

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHADL  125
            TG  P +HGVTQTDG+ K  DD  +RWL   ++PTLG+  RA  YD  Y GKWH+S ADL
Sbjct  58   TGHSPWVHGVTQTDGLAKGHDDPAMRWLSPTQLPTLGHCLRALDYDAAYLGKWHLSAADL  117

Query  126  ED-PATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRDP  184
             D   T A +      GV   A   RY +A+PL  +GF GW+GPEPHGA + NSG  RDP
Sbjct  118  RDGQGTVATVRREGRRGVRAPAGEARYREANPLSAFGFDGWIGPEPHGAAMHNSGTIRDP  177

Query  185  LVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSP--LKPSPLD  242
            L A++ V WL ER  R   G+    +PF L  +FVNPHDIV +P W    P  L      
Sbjct  178  LYAEQAVEWLRERGRRFEGGER---KPFFLAVNFVNPHDIVFWPEWSVFRPRWLGSGVPS  234

Query  243  PPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRLHAEV  302
             P  P A    ++L  +P  +  YR+ Y   YG   ++   Y    + YR  Y+ L   V
Sbjct  235  SPPPPTAGLGVKELLREPPVRNQYRDRYLRAYGPPDLIRSAYELRGEAYRRFYHALIERV  294

Query  303  DGPIDRVGRAV-TEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARIG  361
            D  I  V  A+  +  +E   ++ T+DHG+LLGAH  +HQKWFN ++E  RVPFV+    
Sbjct  295  DRHIAAVLDALDAQPFAEQTAVIWTADHGELLGAH-DMHQKWFNAFEETVRVPFVVRAPQ  353

Query  362  EKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAA-----LAESFSEVHPLPGRDLMPV  416
             +A   + V   +SH+DL+PT+L  AG      A A     L E F +  P PG+DL+  
Sbjct  354  LRARAGQRVEERSSHLDLLPTILGLAGAGKGTAARARLETQLDERFPKARPWPGQDLL--  411

Query  417  VDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKVPAH-----VAAN  471
             + A  D     Y +T D ++ G+   + ++R+       PA  R+ +  +      A  
Sbjct  412  AERAELDS----YFVTADAIVNGNQRLAAVTRR------APALRRLSMLHYTPIDGCATG  461

Query  472  FEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRTDPLDDQW  531
             E LV  V+        G  +KL +TFD   T  +     LA N              + 
Sbjct  462  VEALVGSVE--------GRPYKLCQTFDPRGTVLDT----LALNP-------RQRFPGER  502

Query  532  ELYDLTADPIEAYN  545
            EL+DL ADP EA N
Sbjct  503  ELFDLEADPAEARN  516


>gi|288960770|ref|YP_003451110.1| sulfatase [Azospirillum sp. B510]
 gi|288913078|dbj|BAI74566.1| sulfatase [Azospirillum sp. B510]
Length=691

 Score =  182 bits (462),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 195/684 (29%), Positives = 261/684 (39%), Gaps = 200/684 (29%)

Query  2    ANRPDIIIVMTDEERAVPPYESA----------EVLAWRQRSLTGRRWFDEHGISFTR--  49
            A  P+I++++ D+ R  P +             E+L +R  +      + E+    TR  
Sbjct  13   AQHPNILLIVVDQYR-YPRFSYGPEGGFAEPLKEILGFRGPADVSGNPYAEYFPGLTRLR  71

Query  50   --------HYTGSLACVPSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTL  101
                    H   S AC PSR  +FTGQY    GVTQTDG+ K  +     WL+A   PTL
Sbjct  72   RNAVALHNHTIASSACTPSRAVMFTGQYGTRTGVTQTDGMFKDGNTPTFPWLQANGYPTL  131

Query  102  GNWFRAAGYDTHYDGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYG  161
            G+W RA GY +HY GKWH+S+                                  L  +G
Sbjct  132  GHWMRAVGYSSHYFGKWHVSNP-----------------------------PGHSLNRFG  162

Query  162  FSGW--VGPEPHGAGLANSGFRRDPLVADRVVAWLTER-------YARRRA---GDT---  206
            FS W    PEPHGA + N G  RDP  AD    +L  R       YA   A   G+    
Sbjct  163  FSDWELSYPEPHGAAINNLGIYRDPGFADNACLFLRRRGLALPYDYATSAAEARGEQESP  222

Query  207  ---AAMRPFLLVASFVNPHDIVLFPAWV----------------------------WRSP  235
               A  RP+  V SF NPHDI  +P  V                             +S 
Sbjct  223  AIDATQRPWFAVVSFTNPHDIATYPTVVSQALPQTEAQGAQDKPQSAFGPLDVPDQGQSS  282

Query  236  LKPS------PLDPPHVPA-----APTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNY  284
              P+      PL+P   P       PT DEDLS+KP  Q  +  AY  G  L+   S   
Sbjct  283  FPPNEGTMTIPLNPQGFPQDCAGPIPTWDEDLSSKPVCQ--FDAAYKIGLALSAKASHGA  340

Query  285  ARNAQRYRD------------------------------------------LYYRLHAEV  302
             +      D                                           Y  LH++V
Sbjct  341  VQGITNGHDDGKPDTGVGREDWSAAVKLALKFTIPFQLSEHPEQYSIEFLQFYGWLHSQV  400

Query  303  DGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIA---  358
            D  I+RV +++ E G +E+ +++  +DHG+L  AH  + +KW   Y EA  VP V+    
Sbjct  401  DPQINRVLQSLEESGQAENTIVLFVADHGELGAAHNMMLEKWHVAYQEAVHVPMVVQFPP  460

Query  359  --RIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVH---PLPGRDL  413
              R  +  T    V A TSH DLVPT+L   GV  + +  A AE  +E H   PLPG DL
Sbjct  461  SMRSDDGLTH---VDAVTSHADLVPTILGLTGVGPEALEKAEAE-LAERHRMAPLPGVDL  516

Query  414  MPVVD---------GASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKV  464
             P +          G    EG  +  +T D V E   G        GR+  P        
Sbjct  517  TPTLKVPGTPVTYPGGRVREG--VLFITDDEVTEPTKG--------GRLTEP--------  558

Query  465  PAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPG-VRHLATNGMGGDAYR  523
                   F    V     +A    GH  K V     P    +P  +R + T       Y 
Sbjct  559  -----DYFGAFEVYCQTVEAVRTGGHGAKEVPGL-APGPVRQPNHIRCVRTKEAKLSRYF  612

Query  524  --TDPLDDQWELYDLTADPIEAYN  545
              ++P   +WE+YDL  DP E  N
Sbjct  613  DPSNPRLLEWEMYDLVNDPNEIVN  636


>gi|167644209|ref|YP_001681872.1| sulfatase [Caulobacter sp. K31]
 gi|167346639|gb|ABZ69374.1| sulfatase [Caulobacter sp. K31]
Length=674

 Score =  166 bits (419),  Expect = 1e-38, Method: Compositional matrix adjust.
 Identities = 175/635 (28%), Positives = 233/635 (37%), Gaps = 209/635 (32%)

Query  49   RHYTGSLACVPSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAA  108
             H   + AC PSR  I+TGQY    GVTQTDG+ K  D     WL A  +PTLG W R A
Sbjct  69   NHTIAASACTPSRAVIYTGQYGTKTGVTQTDGLFKSGDSYNFPWLAADGIPTLGTWMREA  128

Query  109  GYDTHYDGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVG-  167
            GY THY GKWH+S               N  E  +D               YGF  W   
Sbjct  129  GYSTHYFGKWHVS---------------NPPEHSLDR--------------YGFDDWEES  159

Query  168  -PEPHGAGLANSGFRRDPLVADRVVAWLTER-----YARRRAGDTA-----------AMR  210
             PEPHGA + N G  RD    D+  A++  +     Y R +A + A            + 
Sbjct  160  YPEPHGAAINNLGVYRDAGFTDQACAFIRRKALALNYNRAQAVEQARDPYAAGPDADNIP  219

Query  211  PFLLVASFVNPHDIVLFPAWVWRS------------------PLKPSPLDPP--------  244
            P+  VASF NPHDI  +PA + ++                  PL+     PP        
Sbjct  220  PWFAVASFTNPHDIATYPAVIAQALPTPDNSGTQSIFGPLTVPLQGQKTPPPTAGTIQIA  279

Query  245  ---------HVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARN--------  287
                         +PT +E L+ KP+ Q  Y  AY  G  L      N            
Sbjct  280  LNALGFPQDCAKPSPTQNESLADKPSCQRDY--AYKVGLALNAKTGFNIVNTVGSKLHDQ  337

Query  288  -----------------------------------AQRYRDLYYRLHAEVDGPIDRVGRA  312
                                               A ++  LY  LHA VD  +  V + 
Sbjct  338  FPNLSETPDLARRAAVQQALKGTIPFQLSDDPDGYALQFLQLYGWLHAVVDTHVTAVLKT  397

Query  313  VTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVI-----ARIGEK---  363
            + E G  D  +++  +DHG+   AHG + +KW   Y EA  VP V+      ++ E    
Sbjct  398  LEETGQADNTIVIFLADHGEYAAAHGMMIEKWHTAYQEALHVPVVVRFPPSTKVVENEPG  457

Query  364  ------ATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE---VHPLPGRDLM  414
                     PR + A TSH+D++PT+L  AGV  D     +AE         PLPG DL 
Sbjct  458  TGEGPLGFTPRQIDALTSHIDILPTVLGLAGVTPD-QRTTIAERLGRHRPTPPLPGVDLS  516

Query  415  PVVDGA-------SADEGRAIYLMTRDNVL----EGDTGASL-------LSRQLGRIVNP  456
             ++ G           E + +  +T D +       D  A+L       + RQ+   VN 
Sbjct  517  GLLKGEIHAVIEPDGRERQGVLFITDDEITAPSASNDDPANLKCDKEFEVYRQVVETVND  576

Query  457  P------APLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVR  510
                   AP  ++ P HV        VR              KL R FD      E    
Sbjct  577  QHRLLNLAPGSVRQPNHVR------CVRT----------LRHKLSRYFDPSGEAAE----  616

Query  511  HLATNGMGGDAYRTDPLDDQWELYDLTADPIEAYN  545
                               +WE+YDL  DP EA N
Sbjct  617  -------------------EWEMYDLERDPNEAVN  632


>gi|149922160|ref|ZP_01910599.1| Arylsulfatase A and related enzyme [Plesiocystis pacifica SIR-1]
 gi|149817004|gb|EDM76488.1| Arylsulfatase A and related enzyme [Plesiocystis pacifica SIR-1]
Length=672

 Score =  164 bits (416),  Expect = 4e-38, Method: Compositional matrix adjust.
 Identities = 187/632 (30%), Positives = 246/632 (39%), Gaps = 192/632 (30%)

Query  42   EHGISFTRHYTGSLACVPSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTL  101
            ++ +    H   S ACVPSR  +F+GQY  +   TQTDG+ K   D +  WL   + PTL
Sbjct  50   DNAVVLRNHRIASSACVPSRTVVFSGQYGTITKATQTDGVLKNGADRKFPWLGPDDFPTL  109

Query  102  GNWFRAAGYDTHYDGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYG  161
            G+W RA GY +HY GKWH+S             AT D EG                  YG
Sbjct  110  GDWMRANGYTSHYFGKWHVSGE-----------ATTDLEG------------------YG  140

Query  162  FSGW--VGPEPHGAGLANSGFRRDPLVADRVVAWLTER-----YARRRAGDTAAMR----  210
            FS W    P+PHG    N G  RD   AD V ++L  R     Y  + A    A      
Sbjct  141  FSDWELSYPDPHGTLPNNLGHYRDYQFADIVTSFLRRRGLGIPYCVQHAAHNVAEATKRE  200

Query  211  ------------PFLLVASFVNPHDIVLFPA--WVWRSPLKPSP----------------  240
                        P+  VASF NPHDI  FP    V+ + ++ +P                
Sbjct  201  RDDVEEPQDPPAPWFAVASFTNPHDIGSFPIPRAVYDADVEGAPYTLAVPPKGAKGTLPK  260

Query  241  -------LDPPHVP-----AAPTADEDLSTKPAAQVAYRE----AYYSGYGLT-------  277
                   L+P   P       PT DE L  KP+ Q+ Y      A  S  GL        
Sbjct  261  GGTMAIDLNPLGFPQNNADVPPTWDEKLRNKPSCQLDYVYKWGLALMSKAGLNAATSVDN  320

Query  278  ---------RMVSRNYARNA---------------QRYRDLYYRLHAEVDGPIDRVGRAV  313
                     R V    A NA               + +   Y     +VD  IDRV RA+
Sbjct  321  PGTKREQLARAVKVTLASNASGMPLALTDNPELACRAFIQYYAYAIQQVDQHIDRVLRAL  380

Query  314  TEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIARIGEKATQP---RT  369
             E G  D  +++   DHG+  GAH  + +KW + Y+E T VP V+         P   R 
Sbjct  381  DESGQADNTIVIFAPDHGEYAGAHNKMSEKWHSAYEEFTHVPVVVRFPDSLHVVPGGTRQ  440

Query  370  VSAPTSHVDLVPTLLSAAGVDVDVVAAALAE---SFSEVHPLPGRDLMPVVDGASA----  422
            V   TSH DL+PT+L  AGV    + A LA+   +  + +   G DL  ++ G +A    
Sbjct  441  VDELTSHADLLPTILGLAGVKGPALKATLAQLRRTHDKSYMPVGSDLSELLYGRAARAQD  500

Query  423  -DEGR---AIYLMTRDNV---LEGDTGASLLSRQL-----------------GR-----I  453
             + GR    +  MT D +   L+G+T   L    +                 GR      
Sbjct  501  PETGRPREGVLFMTHDTITAPLDGETETDLDDESVPLSAYDVFLAAVDELRKGREDWPDE  560

Query  454  VNPPAPLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLA  513
            V   AP  +  P  V A     VV  D+          WKLVR +  PA    PGV    
Sbjct  561  VEDIAPGEVCQPCLVNA-----VVSRDN----------WKLVR-YHAPADQA-PGV----  599

Query  514  TNGMGGDAYRTDPLDDQWELYDLTADPIEAYN  545
                           DQ+ELYDL  DP E +N
Sbjct  600  --------------PDQYELYDLDRDPTEEHN  617


>gi|312139132|ref|YP_004006468.1| sulfatase [Rhodococcus equi 103S]
 gi|311888471|emb|CBH47783.1| putative secreted sulfatase [Rhodococcus equi 103S]
Length=558

 Score =  163 bits (413),  Expect = 7e-38, Method: Compositional matrix adjust.
 Identities = 182/603 (31%), Positives = 252/603 (42%), Gaps = 138/603 (22%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            + +RP+I++V+TD+ERA  P    E   W + +L  R+   + G++F      +  C PS
Sbjct  54   LPHRPNIVVVITDQERA--PMFWPE--GWAETNLPNRKRLADTGLTFDGCCCNAAMCSPS  109

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T FTG YP  HGVT T   G     +    L  G +  +     +AGY+ HY GKWH+
Sbjct  110  RSTFFTGLYPAQHGVTATLTEGGTVSPTEPT-LPLG-IQNMAKLLDSAGYNVHYRGKWHM  167

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            S  +               EG   S+A         +  YGF GWV PE         G 
Sbjct  168  SKGE---------------EGGDPSSA--------DVAAYGFRGWVPPE--------GGQ  196

Query  181  RRDPLVADRVVAWLTERYARR-----RAGDTAAMRPFLLVASFVNPHDIVLFP-AW-VWR  233
              DP       A L  RYA       +  D    RPF LV SFVNPHD++ +P  W    
Sbjct  197  DTDPDHFGGGCADLDSRYASEAVEFLQGLDPNDDRPFALVVSFVNPHDVLAYPQTWDAIN  256

Query  234  SPLKPSPLDPPHV-----PAAPTADEDL--STKPAAQVAYREAYYSGYGLTRMVSRNYAR  286
                    D P +        PT DE L  + KP AQ+  +    +G G   +  ++ AR
Sbjct  257  GTCDNYGSDAPGIFEQGIDLPPTYDEALARNFKPTAQIQSQVLLAAGLG--PLPGQDAAR  314

Query  287  NAQRYRDLYYRLHAEVDGPIDRVGRAVTE--GGSEDAMLVRTSDHGDLLGAHGGLHQKWF  344
            N   Y + Y  +H  VD  I  V  A+    G  E  +++R SDHG++  +HGGL QK F
Sbjct  315  N---YVNFYAYMHKVVDEHIGAVLDALESRPGMRESTVVIRISDHGEMGMSHGGLRQKVF  371

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE  404
            N Y+E  RVP VI+       QP   +A +S +D++PTL S A    DV   A  +    
Sbjct  372  NAYEETLRVPLVISN-PLLFPQPVHTAALSSLIDVMPTLASLA----DVPDRAAWD----  422

Query  405  VHPLPGRDLMPVVD------GASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPA  458
                 G DLMP+VD       A + E + +   T D     D   +    Q   IV  P 
Sbjct  423  ---FRGVDLMPIVDDAAANPAAPSAEVQDVLHFTYD-----DENCATPDGQ--NIVTQPN  472

Query  459  PLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMG  518
             +R               +R           H WK    F DPA  T P           
Sbjct  473  HMR--------------TIR----------DHRWKYSAYF-DPAGVTPP-----------  496

Query  519  GDAYRTDPLDDQWELYDLTADPIEAYNR-------WTDPQLHELRQHLRMLLKQQRAVSV  571
                       Q+E+YDL  DP+E +NR       + DP +  +R H ++    +R  + 
Sbjct  497  -----------QFEMYDLQTDPLELHNRANPLNLGYFDP-VQSMRMHAKLFEVMERCGTT  544

Query  572  PER  574
            P R
Sbjct  545  PAR  547


>gi|325673566|ref|ZP_08153257.1| arylsulfatase [Rhodococcus equi ATCC 33707]
 gi|325555587|gb|EGD25258.1| arylsulfatase [Rhodococcus equi ATCC 33707]
Length=558

 Score =  163 bits (412),  Expect = 9e-38, Method: Compositional matrix adjust.
 Identities = 182/603 (31%), Positives = 252/603 (42%), Gaps = 138/603 (22%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            + +RP+I++V+TD+ERA  P    E   W + +L  R+   + G++F      +  C PS
Sbjct  54   LPHRPNIVVVITDQERA--PMFWPE--GWAETNLPNRKRLADTGLTFDGCCCNAAMCSPS  109

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R T FTG YP  HGVT T   G     +    L  G +  +     +AGY+ HY GKWH+
Sbjct  110  RSTFFTGLYPAQHGVTATLTEGGTVSPTEPT-LPLG-IQNMAKLLDSAGYNVHYRGKWHM  167

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
            S  +               EG   S+A         +  YGF GWV PE         G 
Sbjct  168  SKGE---------------EGGDPSSA--------DVAAYGFRGWVPPE--------GGQ  196

Query  181  RRDPLVADRVVAWLTERYARR-----RAGDTAAMRPFLLVASFVNPHDIVLFP-AW-VWR  233
              DP       A L  RYA       +  D    RPF LV SFVNPHD++ +P  W    
Sbjct  197  DTDPDHFGGGCADLDSRYASEAVEFLQGLDPNDDRPFALVVSFVNPHDVLAYPQTWDAIN  256

Query  234  SPLKPSPLDPPHV-----PAAPTADEDL--STKPAAQVAYREAYYSGYGLTRMVSRNYAR  286
                    D P +        PT DE L  + KP AQ+  +    +G G   +  ++ AR
Sbjct  257  GTCDNYGSDAPGIFEQGIDLPPTYDEALARNFKPTAQIQSQVLLAAGLG--PLPGQDAAR  314

Query  287  NAQRYRDLYYRLHAEVDGPIDRVGRAVTE--GGSEDAMLVRTSDHGDLLGAHGGLHQKWF  344
            N   Y + Y  +H  VD  I  V  A+    G  E  +++R SDHG++  +HGGL QK F
Sbjct  315  N---YVNFYAYMHKVVDEHIGAVLDALESRPGMRESTVVIRISDHGEMGMSHGGLRQKVF  371

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE  404
            N Y+E  RVP VI+       QP   +A +S +D++PTL S A      V    A  F  
Sbjct  372  NAYEETLRVPLVISN-PLLFPQPVHTAALSSLIDVMPTLASLAD-----VPDRSAWDFR-  424

Query  405  VHPLPGRDLMPVVD------GASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPA  458
                 G DLMP+VD       A + E + +   T D     D   +  + Q   IV  P 
Sbjct  425  -----GVDLMPIVDDAAANPAAPSAEVQDVLHFTYD-----DQNCATPNGQ--SIVTQPN  472

Query  459  PLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMG  518
             +R               +R           H WK    F DPA  T P           
Sbjct  473  HMR--------------TIR----------DHRWKYSAYF-DPAGVTPP-----------  496

Query  519  GDAYRTDPLDDQWELYDLTADPIEAYNR-------WTDPQLHELRQHLRMLLKQQRAVSV  571
                       Q+E+YDL  DP+E +NR       + DP +  +R H ++    +R  + 
Sbjct  497  -----------QFEMYDLQTDPLELHNRANPLNLGYFDP-VQSMRMHAKLFEVMERCGTT  544

Query  572  PER  574
            P R
Sbjct  545  PAR  547


>gi|163754242|ref|ZP_02161365.1| POSSIBLE HYDROLASE [Kordia algicida OT-1]
 gi|161326456|gb|EDP97782.1| POSSIBLE HYDROLASE [Kordia algicida OT-1]
Length=507

 Score =  161 bits (408),  Expect = 2e-37, Method: Compositional matrix adjust.
 Identities = 121/480 (26%), Positives = 203/480 (43%), Gaps = 87/480 (18%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            +PD+I+++TD+ERA   +       W   +L    +  ++G +F + +  S  C PSR T
Sbjct  9    QPDMILIITDQERATQNFPEG----WESENLKTMTFLKDNGFTFNKAFCNSCMCSPSRTT  64

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            +FTG YP  HGVTQT   G R+ D+  +     E+  +       GYD  Y GKWH+S  
Sbjct  65   LFTGIYPSQHGVTQTLTFGGRYSDAETQL--NPEIYNMARMLSNEGYDVQYRGKWHLSKG  122

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVG--------PEPHGAGL  175
            + E+  T + +A                         GF GW+         P+  G G 
Sbjct  123  ESENGLTASEIALT-----------------------GFKGWIAPDAGEDVKPQNFGGGY  159

Query  176  ANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSP  235
            AN     D +   + + +L +    R +G      PF LV S VNPHD++ +P  V    
Sbjct  160  AN----HDEMYIQQGIEFLRKVRTERESGHKRV--PFCLVLSLVNPHDVLAYPNGV-NYG  212

Query  236  LKPSPLDPPHVPAAPTADEDL--STKPAAQVAYREAYYSGYGLTRMVSRNYARNAQR--Y  291
               S      V    + +EDL  + KP AQ    ++         ++  +   N ++  Y
Sbjct  213  YSESDWSGRSVGLPYSINEDLLKNKKPMAQFQIVQS-------ANILLGDLPTNEEKLNY  265

Query  292  RDLYYRLHAEVDGP----IDRVGRAVTEGG--SEDAMLVRTSDHGDLLGAHGGLHQKWFN  345
             + Y     ++D      ID +      G   +++A+++R SDHG++  AHGG+ QK FN
Sbjct  266  INFYAHTLTKIDHQIGEFIDELYHVSDTGNRMADEALVIRISDHGEMGLAHGGMRQKAFN  325

Query  346  LYDEATRVPFVIAR--------IGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAA  397
            +Y+E   VP V +            KA   R+ +   + +D++PT+   A +        
Sbjct  326  VYEETLNVPMVFSNPILFPSEDENGKAIPQRSSNELATLIDIMPTMAEIANIK-------  378

Query  398  LAESFSEVHP---LPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIV  454
                    HP   L G +L+P++      +   ++          D  +++L+    R +
Sbjct  379  --------HPSKALQGNNLLPIITDGKGVQDEVLFTFDDTKASSADHASAVLAANRIRCI  430


>gi|226305445|ref|YP_002765405.1| sulfatase [Rhodococcus erythropolis PR4]
 gi|226184562|dbj|BAH32666.1| putative sulfatase [Rhodococcus erythropolis PR4]
Length=545

 Score =  144 bits (364),  Expect = 4e-32, Method: Compositional matrix adjust.
 Identities = 158/563 (29%), Positives = 232/563 (42%), Gaps = 128/563 (22%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            +P+I++++TD+ER  P Y       W  ++L  R+   +HG++F +    +  C PSR T
Sbjct  55   KPNIVVIITDQERR-PMYWPQ---GWADQNLPNRKRIADHGLTFDQAVCNTAMCSPSRST  110

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
             FTG +P  HGVT+T   G     +  + L+  E   +     +AGY+  Y GKWH+S  
Sbjct  111  FFTGLFPAQHGVTRTLTEGGTVSPTEPQ-LQVSE-QNMAKLLASAGYNVQYRGKWHLSKG  168

Query  124  -DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWV--------GPEPHGAG  174
             +  DP +                        D +  +GF GW+         P+  G G
Sbjct  169  VEGGDPTS------------------------DDVAGFGFEGWIPPDAGQDTNPDHFGGG  204

Query  175  LANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRS  234
             A+     D  VA+  V +L+           A+ +P+ L+ SFVNPHD++ +P   W +
Sbjct  205  CAD----HDRRVAEEAVDFLS-------GSAVASGQPWALIVSFVNPHDVLAYPQ-TWNA  252

Query  235  PLKPSPLDPPHVPAA--------PTADEDLST--KPAAQVAYREAYYSGYGLTRMVSRNY  284
                        P A        PT DE L+   KP AQV  +       GL  ++  + 
Sbjct  253  MNGTCDNYGSDAPGAFEQGIDLPPTFDEILALNHKPTAQV--QSELLLAAGLGPLLGPDQ  310

Query  285  ARNAQRYRDLYYRLHAEVDGPIDRVGRAV--TEGGSEDAMLVRTSDHGDLLGAHGGLHQK  342
            ARN   Y + Y  LH  VD  I  V  A+  T    +D ++VR SDHG++  +HGGL QK
Sbjct  311  ARN---YINFYAYLHKVVDEHIGSVLDAIEATPQMLDDTVIVRMSDHGEMGMSHGGLRQK  367

Query  343  WFNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESF  402
             FN Y+E  RVP VI+       +P    A  S +D++PT  + A        A   ES+
Sbjct  368  VFNAYEETLRVPLVISN-PLLFPEPVRTDALASLIDVMPTFATLA-------QAPARESW  419

Query  403  SEVHPLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRI  462
            +      G DL PV+  A+A          +D +        L +        P     +
Sbjct  420  N----FSGTDLTPVIINAAA-YPHGPSAQVQDTI--------LFTYDDQNCATPDGQNIV  466

Query  463  KVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAY  522
              P H+    E                + WK    F DPA    P               
Sbjct  467  TQPNHIRCIRE----------------NRWKYTMYF-DPAGVAAP---------------  494

Query  523  RTDPLDDQWELYDLTADPIEAYN  545
                   Q+ELYDL ADP+E  N
Sbjct  495  -------QYELYDLQADPLELNN  510


>gi|229489534|ref|ZP_04383397.1| sulfatase [Rhodococcus erythropolis SK121]
 gi|229323631|gb|EEN89389.1| sulfatase [Rhodococcus erythropolis SK121]
Length=545

 Score =  144 bits (364),  Expect = 4e-32, Method: Compositional matrix adjust.
 Identities = 158/562 (29%), Positives = 233/562 (42%), Gaps = 126/562 (22%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            +P+I++++TD+ER  P Y       W +++L  R+   +HG+SF +    +  C PSR T
Sbjct  55   KPNIVVIITDQERR-PMYWPQ---GWAEQNLPNRKRIADHGLSFDQAVCNTAMCSPSRST  110

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
             FTG YP  HGVT+T   G     +  + L+  E   +     +AGY+  Y GKWH+S  
Sbjct  111  FFTGLYPAQHGVTRTLTEGGTVSPTEPQ-LQVSE-QNMAKLLASAGYNVQYRGKWHLSKG  168

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWV--------GPEPHGAGL  175
                   G    + D  G                  +GF GW+         P+  G G 
Sbjct  169  -----VEGGDPTSEDVAG------------------FGFEGWIPPDAGQDTNPDHFGGGC  205

Query  176  ANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSP  235
            A+   R    VA+  V +L+            + +P+ L+ SFVNPHD++ +P   W + 
Sbjct  206  ADHDRR----VAEEAVEFLS-------GPAVTSGQPWALIVSFVNPHDVLAYPQ-TWNAM  253

Query  236  LKPSPLDPPHVPAA--------PTADEDLST--KPAAQVAYREAYYSGYGLTRMVSRNYA  285
                       P A        PT DE L+   KP AQV  +       GL  ++  + A
Sbjct  254  NGTCDNYGSDAPGAFEQGIDLPPTFDEILALNHKPTAQV--QSELLLAAGLGPLLGPDQA  311

Query  286  RNAQRYRDLYYRLHAEVDGPIDRVGRAV--TEGGSEDAMLVRTSDHGDLLGAHGGLHQKW  343
            RN   Y + Y  +H  VD  I  V  A+  T    +D ++VR SDHG++  +HGGL QK 
Sbjct  312  RN---YINFYAYMHKVVDEHIGSVLDAIEATPQMLDDTVIVRMSDHGEMGMSHGGLRQKV  368

Query  344  FNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFS  403
            FN Y+E  RVP VI+       +P    A  S +D++PTL + A        A   +S++
Sbjct  369  FNAYEETLRVPLVISN-PLLFPEPVRTDALASLIDVMPTLATLA-------QAPARQSWN  420

Query  404  EVHPLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIK  463
             +    G DL PV+  A+A   ++     +D +        L +        P     + 
Sbjct  421  FL----GTDLTPVIVDAAA-YPQSPSAQVQDTI--------LFTYDDQNCATPDGQNIVT  467

Query  464  VPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYR  523
             P H+    E                  WK    F DPA    P                
Sbjct  468  QPNHIRCIRES----------------RWKYTMYF-DPAGVAAP----------------  494

Query  524  TDPLDDQWELYDLTADPIEAYN  545
                  Q+ELYDL ADP+E  N
Sbjct  495  ------QYELYDLQADPLELNN  510


>gi|284046572|ref|YP_003396912.1| sulfatase [Conexibacter woesei DSM 14684]
 gi|283950793|gb|ADB53537.1| sulfatase [Conexibacter woesei DSM 14684]
Length=540

 Score =  139 bits (349),  Expect = 2e-30, Method: Compositional matrix adjust.
 Identities = 118/402 (30%), Positives = 179/402 (45%), Gaps = 65/402 (16%)

Query  6    DIIIVMTDEERAV---PPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRP  62
            ++++ +TD++RA+   PP        W QR++ G      HG++F   +T +  C P+R 
Sbjct  61   NVLLFLTDQQRAIQHFPP-------GWSQRNMPGLTRLQRHGLTFANAFTNACMCSPARS  113

Query  63   TIFTGQYPDLHGVT---QTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWH  119
            T+ TG +P  HGV    +TD    ++    L    A           AAGY   Y GK+H
Sbjct  114  TLMTGYFPAQHGVKYTLETDMPSPQYPQVEL----ATTFKNPATVVAAAGYTPVYKGKFH  169

Query  120  ISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPE--------PH  171
                    PA G+    +D                  +  YGF+ W  P+          
Sbjct  170  CVK-----PANGSTWVPSD------------------VNQYGFTRWDPPDAGANQDIPEE  206

Query  172  GAGLANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWV  231
            G G  ++  R   + +       TE   +  +   A  +PF +V S VNPHD++ +P   
Sbjct  207  GGGTYDNDGRF--MNSQGTPEAGTEGALQYLSSVAAQSQPFFMVVSLVNPHDVLFYPKTY  264

Query  232  WRSPLKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGL-TRMVSRNYARNAQR  290
                   S L     P A TA+EDLSTKPA Q  ++  + +   L T  + RNY      
Sbjct  265  ESGGYDDSWLRGEIEPPA-TANEDLSTKPAVQRQFQRLFSATGPLPTPQMKRNYL-----  318

Query  291  YRDLYYRLHAEVDGPIDRVGRAVTEGGS-EDAMLVRTSDHGDLLGAHGGLHQKWFNLYDE  349
              + Y  L    D  + ++   +   G  +D +++ T+DHG++  AHGGL QK FN Y+E
Sbjct  319  --NFYGNLMKASDAYLVKLLDTLKSTGLLDDTLVIATADHGEMGTAHGGLRQKNFNFYEE  376

Query  350  ATRVPFVIA--RIGEKATQPRTVSAPTSHVDLVPTLLSAAGV  389
            +TRVP V +  R+  +   P    A  SHVD +PTL S  G 
Sbjct  377  STRVPLVYSNPRLFRR---PERSDALVSHVDFLPTLASLVGA  415


>gi|294673172|ref|YP_003573788.1| sulfatase family protein [Prevotella ruminicola 23]
 gi|294474072|gb|ADE83461.1| sulfatase family protein [Prevotella ruminicola 23]
Length=551

 Score =  135 bits (341),  Expect = 2e-29, Method: Compositional matrix adjust.
 Identities = 120/453 (27%), Positives = 193/453 (43%), Gaps = 81/453 (17%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            + +II + TD+E  +  Y +    A R+R         + G +F +HY  +     SR  
Sbjct  42   KYNIIFITTDQEAYMEQYPAGSDYAARER-------LRQMGTTFEKHYACANVSTSSRSV  94

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            I+TG++  +      D     F +   R     ++PT+G+  R AGY T + GKWHIS  
Sbjct  95   IYTGRH--ITETCMLDNTNYAFVNDMSR-----DLPTVGDMLRDAGYYTAFKGKWHISE-  146

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
                                         D + L  YGFS W   + +G+     G++ D
Sbjct  147  -----------------------------DTESLEEYGFSDWTEGDMYGSVW--EGYKED  175

Query  184  PLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLF---PAWVWRSPLKPSP  240
              +AD  + WL  +  ++   D    + F L  +F+NPHDI+ F   P         P+P
Sbjct  176  GTIADHAIDWLKNK-GKQLNNDG---QSFFLAVNFLNPHDIMYFNETPGTYIAGEATPAP  231

Query  241  LDPPH-------VPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRD  293
             DP +       VPA+     D   +PAA   Y + +    G +   S  +      +RD
Sbjct  232  DDPVYKKNYNVPVPASWNESFDKPGRPAAHKEYNKQWQDWVGPSPTDSTGW----HTFRD  287

Query  294  LYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATR  352
             Y+    + D  +  + + + + G   +++++ TSDHG++ G H GL  K  N+Y+    
Sbjct  288  YYFNTIQDEDNHMLVLLKYLEKAGLLNNSIVIYTSDHGEMQGEH-GLKGKGGNIYENNIH  346

Query  353  VPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEV-HPLPGR  411
            VP +I     K    R     TSH+DL PT        VD+  A   + FS +   L G 
Sbjct  347  VPLIIYHPEMKGG--RHCYNLTSHLDLAPTF-------VDIATAGNVQQFSAITQELHGH  397

Query  412  DLMPVVDGASAD----EGRAIYLMTRDNVLEGD  440
             LMP V   + D    EG A++     ++++GD
Sbjct  398  SLMPAVKNPAIDIRNSEG-ALFCFEMISMIDGD  429


>gi|108756971|ref|YP_632689.1| sulfatase family protein [Myxococcus xanthus DK 1622]
 gi|108460851|gb|ABF86036.1| sulfatase family protein [Myxococcus xanthus DK 1622]
Length=290

 Score =  134 bits (337),  Expect = 5e-29, Method: Compositional matrix adjust.
 Identities = 99/286 (35%), Positives = 135/286 (48%), Gaps = 29/286 (10%)

Query  286  RNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGS-EDAMLVRTSDHGDLLGAHGGLHQKWF  344
            R+   YR  YY L AEV     RV   + +    E+ ++V TSDHG++LGAHGG+ QKW+
Sbjct  6    RDTASYRQFYYFLMAEVSKHSQRVYEHLKKTSFLENTIVVLTSDHGEMLGAHGGMMQKWY  65

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSE  404
            N Y E   VP VI+       +PR     TSHVDLVPTLL  AG+D D     LA   SE
Sbjct  66   NAYQETLHVPCVISN-PRLFPEPRKTEVVTSHVDLVPTLLGLAGIDADAARRELARDHSE  124

Query  405  VHPLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPAPLRIKV  464
               L GRDL  +V G  ++    IY MT DNV  G    + L+ Q        A   +  
Sbjct  125  AQLLVGRDLSGLVLGRESERHEPIYFMTDDNVESGLQMTNNLTGQ--------AFAGVIQ  176

Query  465  PAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPGVRHLATNGMGGDAYRT  524
            P H+    E +V R+ +          WK     D+P  +       +   G   +    
Sbjct  177  PKHI----ETVVTRLPELTGDTP----WKYSCYSDNPRFF-------VGAAGNTDEVATA  221

Query  525  DPLDDQWELYDLTADPIEAYNRWT----DPQLHELRQHLRMLLKQQ  566
              +  ++E YDLT DP+E  NR +     P   ++R  L  +LK+Q
Sbjct  222  RFIPREYECYDLTEDPLETRNRCSAVAAQPLSQDIRDALDKVLKEQ  267


>gi|258655369|ref|YP_003204525.1| sulfatase [Nakamurella multipartita DSM 44233]
 gi|258558594|gb|ACV81536.1| sulfatase [Nakamurella multipartita DSM 44233]
Length=537

 Score =  130 bits (328),  Expect = 5e-28, Method: Compositional matrix adjust.
 Identities = 139/443 (32%), Positives = 193/443 (44%), Gaps = 79/443 (17%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RP+++++ T EER   P            +L  R W  + G SF  +Y  S  C  SR  
Sbjct  12   RPNVLLITTGEERYTLPKLDG-------FTLPARDWLHQRGTSFDDYYVASAMCSSSRSV  64

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGE--VPTLGNWFRAAGYDTHYDGKWHIS  121
            ++TGQ+     VT T      FD+  + ++R  +  + TLG   +AAGY T Y GKWH+S
Sbjct  65   MYTGQH-----VTST----MIFDNDNMPYIRPLDPGMATLGTMMQAAGYYTAYQGKWHLS  115

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
            +A            T  N G    A          L PYGF+ +        G A +G +
Sbjct  116  NA----------YRTPQNPGETSKA----------LQPYGFTEFNDWGDIDGG-AWAGLK  154

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSPL  241
             DP++A + V WL     R +A   A  +P+ +  +FVNPHDI+ +     RS + P P 
Sbjct  155  VDPVIAGQAVRWL-----RDKAPVVARDQPWFMTVNFVNPHDIMSYDYGSTRS-ITPPPN  208

Query  242  DPPHVPAAPTADEDLSTK--------------PAAQVAYREAYYSGYGLTRMVSRNYARN  287
                V   P A+  L +K                A  A RE  Y+G            ++
Sbjct  209  LAEAVKVKPPAETPLYSKVWDIDVPDNAGDDLSGAPQAVRE--YAGLADAMFGPVVDPQD  266

Query  288  AQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNL  346
             +   + Y     +VD  +  V  A+   G  D  ++V TSDHG+L G+H GL QK   +
Sbjct  267  WRLGLNFYVNCIRDVDRSVSLVLDALVASGQADRTVVVFTSDHGELAGSH-GLRQKGNLV  325

Query  347  YDEATRVPFVIAR---IGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFS  403
            YDE   VP VI      G   TQ     A  S VDL PT+L  AGVD D       E   
Sbjct  326  YDENFHVPLVIVHPDIPGGGRTQ-----ALGSAVDLAPTILHLAGVDPD-------ELRG  373

Query  404  EVHPLPGRDLMP-VVDGASADEG  425
            E   L G  L+P + DGA   +G
Sbjct  374  EFDGLGGHSLVPALADGAQVRDG  396


>gi|242278822|ref|YP_002990951.1| sulfatase [Desulfovibrio salexigens DSM 2638]
 gi|242121716|gb|ACS79412.1| sulfatase [Desulfovibrio salexigens DSM 2638]
Length=554

 Score =  130 bits (327),  Expect = 6e-28, Method: Compositional matrix adjust.
 Identities = 116/435 (27%), Positives = 200/435 (46%), Gaps = 47/435 (10%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RP+I++++TD++R     E      W   ++       +HG++F+ ++  + AC PSR +
Sbjct  49   RPNILLIITDQQRQ----EQHWPAGWLNENMPSMARLQKHGVTFSNNFIAASACSPSRAS  104

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
              TG YP +HGVTQ         +  LR     ++  +      AGYD  Y GK H+   
Sbjct  105  FLTGLYPSVHGVTQVP------PNPPLR----NDITNIFKLAEKAGYDIAYKGKMHL-FT  153

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWV-GPEPHGAGLANSGFRR  182
               +P+      ++D +   D+ +  R+   D     G + W+ G +P+       G   
Sbjct  154  PQNNPSMDN-FTSSDIKWASDNYSAHRWNPPDCAVDIGGNPWIGGGDPNNDQRFVDGV--  210

Query  183  DPLVADRVVAWLTER---YARRRAGDTAAMRPFLLVASFVNPHDIVLFP---AWVWRSPL  236
             P   +R+   +T+    Y      D+   +PFL+VASF NPHDI  +P    W +    
Sbjct  211  -PETYNRMTPAITKGETIYEYLDNHDSKRDKPFLMVASFGNPHDISAWPDQDKWGYN---  266

Query  237  KPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYY  296
            +    D   +   P  +++L  KP+AQ  Y++         ++ S    ++   +   Y 
Sbjct  267  RADYADLKEINLPPNYNDNLDEKPSAQKEYQKL------CEKVSSCPTEKDRIEFCRFYA  320

Query  297  RLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPF  355
             LH  VD  I  V   + E G +ED ++ R +DHG+   AH  + QK  N Y E   VP 
Sbjct  321  HLHRVVDKQISAVLDKLEEKGLTEDTVIFRFADHGEQSWAHMMI-QKGVNSYQETINVPL  379

Query  356  VIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMP  415
            +I+   +   + +T  + +S +DLVPT+        ++  AA  E  +E   + G+ L+P
Sbjct  380  IISN-PKMFPKGKTTESFSSLIDLVPTV-------AELTGAATPEELNEAG-IHGKSLVP  430

Query  416  VVDGASAD-EGRAIY  429
            +++ A A    RA++
Sbjct  431  IMNDAKAQVRDRAMF  445


>gi|254427464|ref|ZP_05041171.1| sulfatase, putative [Alcanivorax sp. DG881]
 gi|196193633|gb|EDX88592.1| sulfatase, putative [Alcanivorax sp. DG881]
Length=565

 Score =  130 bits (327),  Expect = 7e-28, Method: Compositional matrix adjust.
 Identities = 131/445 (30%), Positives = 190/445 (43%), Gaps = 73/445 (16%)

Query  4    RPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPT  63
            RP+++++++D+ER+      +         L G       G SF  ++  +  C PSR  
Sbjct  37   RPNVLLLVSDQERSGLDLPGS-------LDLPGHERLRRQGTSFNHYHVNTSPCSPSRSV  89

Query  64   IFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            ++TGQ+  +H     +     F           ++ TLG+ FR  GY T Y GKWH+S  
Sbjct  90   MYTGQHT-MHTHMTANLHAPPFP------ALNDKLKTLGHHFRDQGYYTAYKGKWHLS--  140

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
            D+ED   G  L         +  +  R L+      Y  +G V    HG+     G+  D
Sbjct  141  DIED---GPGLLYG------NYPSRNRALEKHGFSDYNLTGDV----HGS--VWQGYIAD  185

Query  184  PLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRS---------  234
             +V      WL  +      G T   +P+ L  +FVNPHDI+ F     +S         
Sbjct  186  RMVTAEACRWLMGK------GQTEE-KPWFLAVNFVNPHDIMFFSTGEKQSRSRTNPQFM  238

Query  235  -PLKPSPLDP------PHVPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYARN  287
             PL+P+P DP       H+    +    L  KP  Q AY +   S YG     +      
Sbjct  239  APLRPAPHDPVYAKDWSHISLPASFRASLDNKPWCQQAYAKLIDSVYGHIDKDNEAAWLA  298

Query  288  AQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNL  346
             Q Y   Y+    +V   +D+V +A+ E G  D  ++V T+DHG++ GAH GL QK    
Sbjct  299  NQSY---YFNCLRDVSRQVDQVLQALEESGQADNTIIVYTADHGEMAGAH-GLRQKGPFA  354

Query  347  YDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVH  406
            Y E +RVP +I+     A Q R V    S VDLVPTLLS            LA       
Sbjct  355  YKENSRVPLIISH--PDARQQRDVDNIGSSVDLVPTLLS------------LATEGKADT  400

Query  407  PLPGRDLMPVVDGASADEGRAIYLM  431
              PG DL   +DG  +D     +L 
Sbjct  401  QTPGTDLSAALDGRPSDRDSKGHLF  425


>gi|338972465|ref|ZP_08627838.1| choline-sulfatase [Bradyrhizobiaceae bacterium SG-6C]
 gi|338234250|gb|EGP09367.1| choline-sulfatase [Bradyrhizobiaceae bacterium SG-6C]
Length=572

 Score =  129 bits (324),  Expect = 1e-27, Method: Compositional matrix adjust.
 Identities = 131/450 (30%), Positives = 192/450 (43%), Gaps = 79/450 (17%)

Query  2    ANRPDIIIVMTDEER---AVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACV  58
            A RP+I+I+M+D+ER   ++P              L G     E GISF  ++  +  C 
Sbjct  21   AKRPNILIIMSDQERHWSSLP----------NDLPLPGHDLLRERGISFANYHIHTTPCS  70

Query  59   PSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVP----TLGNWFRAAGYDTHY  114
            PSR T + GQ+     +    G                EVP    +LG+ FRA GY T Y
Sbjct  71   PSRSTFYFGQHTQHTKMVVNHGAPP-----------FPEVPNTLVSLGDLFRAQGYYTAY  119

Query  115  DGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGW-VGPEPHGA  173
             GKWH+SH       T  P     +E                L P+GFS + +  +PHGA
Sbjct  120  KGKWHLSHIGGNHNLTYGPFPNTSDE----------------LEPFGFSDFNIDGDPHGA  163

Query  174  GLANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAW--V  231
                +GFR D  +A     WL +  + ++  D    +P+LL  +FVNPHDI+ F +    
Sbjct  164  TW--TGFRYDGQIAADASIWLKD--SGKKLNDEG--KPWLLAVNFVNPHDIMYFSSGDDQ  217

Query  232  WRSPLKPSPLDPPHVP----------AAPTADE----DLSTKPAAQVAYREAYYSGYGLT  277
             RS + P+ L P   P            P  D     D+S +  +Q +Y       YG  
Sbjct  218  VRSRIDPNMLAPISRPPVGGVYDTAWPGPLPDSFYKADISKRNWSQRSYAAFCDMIYGRF  277

Query  278  RMVSRNYARNAQRYRDLYYRLHAEVDGPIDRV-GRAVTEGGSEDAMLVRTSDHGDLLGAH  336
                    R+ Q Y   Y+    +VD     V  R    G  ++ +++  SDHG++ GA 
Sbjct  278  PKDDETVWRDNQSY---YFNCLRDVDHHASTVLARLKDLGLDDNTIVIYLSDHGEMAGAQ  334

Query  337  GGLHQKWFNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAA  396
              L QK  +++ E   VP ++     K     T S   S +D++PTLL+ AGVD      
Sbjct  335  -KLRQKGPHMFRENIHVPLIVCHPDVKTGGGSTTSGLASPIDMIPTLLAWAGVDD-----  388

Query  397  ALAESFSEVHPLPGRDLMPVVDGASADEGR  426
              A   ++   L G D+   V GAS+   R
Sbjct  389  --AARRTKYPYLKGIDVSSAVTGASSPSER  416


>gi|111021151|ref|YP_704123.1| arylsulfatase [Rhodococcus jostii RHA1]
 gi|110820681|gb|ABG95965.1| probable arylsulfatase [Rhodococcus jostii RHA1]
Length=627

 Score =  129 bits (324),  Expect = 2e-27, Method: Compositional matrix adjust.
 Identities = 139/488 (29%), Positives = 196/488 (41%), Gaps = 93/488 (19%)

Query  4    RPDIIIVMTDEER--AVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSR  61
            +P+I+ ++ DE R   V P        + QR +         G+ FT+HYT  +AC P R
Sbjct  52   QPNIVFIVVDEMRFPQVFPAGFTTPDQFLQRFMPNLYTLWAPGVKFTQHYTAGVACSPGR  111

Query  62   PTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHIS  121
                TG YP  + + QT   G R        +   + PT G   R AGY T Y GKWH+S
Sbjct  112  ACFVTGLYPLQNWMLQTR-TGNRASPVPSPAM-GRDFPTYGKLLRQAGYVTPYVGKWHLS  169

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
             +  ED                 S     YL+      YGF G   P+    G+   GF 
Sbjct  170  PSPDED-----------------SGLAPGYLEE-----YGFDGLTMPDI--IGMNGEGFE  205

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLF--------------  227
             D  +AD+  AWL+ R    + GD     PF L ASFVNPHD   F              
Sbjct  206  FDGHIADQAAAWLSTR----KPGDG----PFCLTASFVNPHDQQFFWAGTEAERYQSLYA  257

Query  228  -------PAWVWRSPL---KPSPLDPPHVPAAPTADEDLSTKPAAQVAYREA--------  269
                   PA  W        P  L  P VP     ++ L +KP+ QV  RE         
Sbjct  258  NNVPPLSPARAWSVTTGESDPPRLGYPSVPPNWEPEKALQSKPSTQVFAREFQALVWGGV  317

Query  270  ----------YYSGYGLTRMVSRNYARNA----QRYRDLYYRLHAEVDGPIDRVGRAVTE  315
                      Y   YG      R+ A       +R  D Y  + + VD  I  V  ++ E
Sbjct  318  TDDIQNLSNYYLQPYGQGTDPDRHIAFAPYTYWERALDSYTNVLSMVDHHIGTVINSLPE  377

Query  316  GGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIA----RIGEKATQPRTVS  371
              + + + V TSDHG+  GAHG +  K    YDEA  +P ++A    R       PR   
Sbjct  378  DVAANTVFVMTSDHGEYAGAHGFVAGKLSTAYDEAFHIPLIVADPTGRFTGDTDTPR--G  435

Query  372  APTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGASADEGRAIYLM  431
              TS VD+ P L +    + + ++  L  +++E       DLMP++   +A     + L 
Sbjct  436  QLTSSVDVAPLLATLGHGNRNWMSGDLFATYAER-----ADLMPLLRSNTAAGRDHVVLA  490

Query  432  TRDNVLEG  439
            T ++  +G
Sbjct  491  TNEHAPQG  498


>gi|226363511|ref|YP_002781293.1| sulfatase [Rhodococcus opacus B4]
 gi|226242000|dbj|BAH52348.1| putative sulfatase [Rhodococcus opacus B4]
Length=627

 Score =  127 bits (320),  Expect = 4e-27, Method: Compositional matrix adjust.
 Identities = 140/488 (29%), Positives = 195/488 (40%), Gaps = 93/488 (19%)

Query  4    RPDIIIVMTDEER--AVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSR  61
            +P+I+ ++ DE R   V P        + QR +         G+ FT+HYT  +AC P R
Sbjct  52   QPNIVFIVVDEMRFPQVFPAGITTPDQFLQRFMPNLYKLWAPGVKFTQHYTAGVACSPGR  111

Query  62   PTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHIS  121
                TG YP  + + QT   G R        +   + PT G   R AGY T Y GKWH+S
Sbjct  112  ACFVTGLYPLQNWMLQTR-TGNRASPVPSPAM-GRDFPTYGKLLRQAGYVTPYVGKWHLS  169

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
             +  ED                 S     YL+      YGF G   P+    GL   GF 
Sbjct  170  PSPDED-----------------SGLAPGYLEE-----YGFDGLTMPDI--IGLNGEGFE  205

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLF--------------  227
             D  +AD+  AWL+ R    +  D     PF L ASFVNPHD   F              
Sbjct  206  FDGHIADQAAAWLSTR----KPSDG----PFCLTASFVNPHDQQFFWAGTEAERYQSLYA  257

Query  228  -------PAWVWRSPL---KPSPLDPPHVPAAPTADEDLSTKPAAQVAYREA--------  269
                   PA  W        P  L  P VP     ++ L +KP+ QV  RE         
Sbjct  258  NNVPPLSPARTWSVTTGESDPPRLGFPSVPPNWEPEKALQSKPSTQVFAREFQALVWGGV  317

Query  270  ----------YYSGYGLTRMVSRNYARNA----QRYRDLYYRLHAEVDGPIDRVGRAVTE  315
                      Y   YG      R+ A       +R  D Y  +   VD  I  V  ++ E
Sbjct  318  TDDVQNLSNYYLQPYGQGTDPDRHIAFAPYTYWERALDSYTNVLTMVDHHIGTVIDSLPE  377

Query  316  GGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFVIA----RIGEKATQPRTVS  371
              + + + V TSDHG+  GAHG +  K    YDEA  +P ++A    R       PR   
Sbjct  378  DVAANTVFVMTSDHGEYAGAHGFVAGKLSTAYDEAFHIPLIVADPTGRFTGDTDTPR--G  435

Query  372  APTSHVDLVPTLLSAAGVDVDVVAAALAESFSEVHPLPGRDLMPVVDGASADEGRAIYLM  431
              TS VD+VP L +    D + ++  L  +++E       DL+P++   +A     + L 
Sbjct  436  QLTSSVDVVPLLATLGHGDRNWMSGDLFATYAER-----ADLLPLLRSNAAAGRDHVVLA  490

Query  432  TRDNVLEG  439
            T ++  +G
Sbjct  491  TNEHAPQG  498


>gi|325523292|gb|EGD01647.1| arylsulfatase A like protein [Burkholderia sp. TJI49]
Length=607

 Score =  125 bits (315),  Expect = 2e-26, Method: Compositional matrix adjust.
 Identities = 159/595 (27%), Positives = 239/595 (41%), Gaps = 113/595 (18%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            +I+ V+ D+ER           AW   S+ GR    + G+SF  H   +  C PSR TI+
Sbjct  68   NILFVLVDQERYFD--------AW-PVSVPGRERLAKSGVSFINHQIAACVCSPSRATIY  118

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWL--RAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            TGQ+     V         FD++ L W       + T+G+  + AGY   Y GKWH+S  
Sbjct  119  TGQHMQHTAV---------FDNAGLPWQPDMPTSIRTVGHMMKDAGYQAVYVGKWHLS--  167

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
                       AT         A V  Y  A  +  YGF  + G      G A+ G+  D
Sbjct  168  -----------ATLHESNSPYDAPVAEYNKA--MRAYGFDDYFGVG-DLVGSAHGGYNFD  213

Query  184  PLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLF-------PAWVWRSPL  236
             + A   ++W+     R +  + A  +P+ L  + VNPHD +         P      P 
Sbjct  214  GVTAQAAISWM-----REQQRNAAGAKPWFLAVNLVNPHDAMWLNTDPAGRPNGSGLIPT  268

Query  237  KPSP--------LDPPHVPAA---PTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNYA  285
            +P+P         D   +PA+   P A  D   +P A   Y  A+ +  G          
Sbjct  269  RPAPDTRLYDARWDQVPLPASRRQPLASPD---RPKAHAMYAAAHEALIGRIEFDD----  321

Query  286  RNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWF  344
               +RY+D Y     + D  ++R+   + + G  D  ++V TSDHGDL G H  +  K  
Sbjct  322  ATVKRYQDYYLNCIRDCDRHVERLLDELDDLGIADRTIVVLTSDHGDLAGHHQMI-DKGA  380

Query  345  NLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAA-----ALA  399
            N Y +   VP ++     +    +T  A TSH+D+ PTL++  G   D VA      A  
Sbjct  381  NAYRQQNHVPMLVRHPAYRGG--KTCRALTSHLDVAPTLVALTGASADTVARVVGPDAKG  438

Query  400  ESFSEVHPLPGR-DLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQLGRIVNPPA  458
             SF+ +   P R DL  + D    +    +Y  +   + E  T      R+ G    PPA
Sbjct  439  SSFAHLLAQPERADLHAIRDATLFNYAMLLYYDSEWMLAEFKT-----MRERGV---PPA  490

Query  459  PLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKL--VRTFDDPATWTEPGVRHLATNG  516
             +     A  AA    L  R        G     +   +  F++P T  +     LA N 
Sbjct  491  EMH----ARAAALQPDLAQRGAIRSVFDGRYRFSRYFALSAFNEPETLDD----LLANND  542

Query  517  MGGDAYRTDPLDDQWELYDLTADPIEAYNRWTDPQLH-----ELRQHLRMLLKQQ  566
            +              EL+DL  DP E +N  T P+LH     E+   L  L++Q+
Sbjct  543  L--------------ELFDLYVDPDEMHNLATRPELHRALMMEMNAKLNRLIRQE  583


>gi|116694269|ref|YP_728480.1| arylsulfatase [Ralstonia eutropha H16]
 gi|113528768|emb|CAJ95115.1| Arylsulfatase [Ralstonia eutropha H16]
Length=600

 Score =  124 bits (312),  Expect = 4e-26, Method: Compositional matrix adjust.
 Identities = 129/445 (29%), Positives = 192/445 (44%), Gaps = 74/445 (16%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            +I+ ++ D+ER   P E       R   L       + G +F  H   S  C PSR  ++
Sbjct  58   NILFILVDQERYFRPGELP-----RGYGLPAHERLMKRGTTFVNHRINSCVCTPSRSVLY  112

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWLRA--GEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            TGQ+     +  T    + FD++   W+ +   E+ TLG+  R AGY T Y GKWH++  
Sbjct  113  TGQH-----IQHT----RMFDNTNFPWISSMSTEIRTLGDMLRDAGYYTAYKGKWHLTKE  163

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANS--GFR  181
                   G P      E                +  YGFS ++G    G  +A++  G+ 
Sbjct  164  FETVNKLGTPTKIFTQE----------------MEAYGFSDYIGI---GDIIAHTSGGYL  204

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAM-RPFLLVASFVNPHDIVLF---------PAWV  231
             D ++A   V+WL     R +  + AA  +P+ L  + VNPHD++ +          A  
Sbjct  205  HDGVIAAMGVSWL-----RGKGSELAAQGKPWFLAVNLVNPHDVMFYDTDAPGTEVQAMR  259

Query  232  WRSPLKPSPLDPPH-------VPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNY  284
              + +   P DP +       +PA+     D   +PA   A+R+   S   L   +    
Sbjct  260  GLAHVARDPADPLYGKQWQFMLPASRKQALDAPGRPA---AHRDFLRSHDALVGAIPNED  316

Query  285  ARNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKW  343
            AR  +R+ + Y     +VD  I  V  A+   G  D  ++V TSDHGD+ GAH  LH K 
Sbjct  317  ARWHRRH-NYYLNCMRDVDRNIAAVLAALDAAGLSDKTIVVLTSDHGDMDGAH-QLHAKG  374

Query  344  FNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFS  403
               Y E   VP VIA         +   A TSH+D+ PTL++  GV  D  AA       
Sbjct  375  AVSYREQNNVPLVIAHPSYHGG--KQCRAVTSHLDIAPTLVALTGVATDKRAAI------  426

Query  404  EVHPLPGRDLMPVVDGASADEGRAI  428
             V  LPG+D   ++    A E  AI
Sbjct  427  -VKGLPGKDFSRLLAKPGAAEANAI  450


>gi|296395322|ref|YP_003660206.1| sulfatase [Segniliparus rotundus DSM 44985]
 gi|296182469|gb|ADG99375.1| sulfatase [Segniliparus rotundus DSM 44985]
Length=505

 Score =  124 bits (311),  Expect = 4e-26, Method: Compositional matrix adjust.
 Identities = 116/394 (30%), Positives = 165/394 (42%), Gaps = 63/394 (15%)

Query  1    MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPS  60
            +A RP+I++V+ DE R    + + + L     SLT  R   +  ++F RHYT +  C  +
Sbjct  13   VAARPNILVVLVDEMRFPMWFPTQDQLDTLLPSLTRIR---KSAVAFERHYTAANVCTAA  69

Query  61   RPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHI  120
            R  + TG Y    G  Q  G+             + + PT G+  R  GY++ + GKWH+
Sbjct  70   RGALVTGLYSHQTGC-QLVGMSTL----------SPKFPTWGSMLREHGYESWWYGKWHL  118

Query  121  SHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGF  180
             HA   DPA                           L  YGF+G   P P GA     G 
Sbjct  119  GHAPDTDPAA--------------------------LAAYGFAGGTFPSPDGA--PGDGL  150

Query  181  RRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSP  240
              D  +AD+   W           D A   P+    S VNPHDI+ +P W    P   +P
Sbjct  151  AHDGAIADQFAVWFH---------DNAGKGPWCTTVSLVNPHDIMFWPKW---QPPAQAP  198

Query  241  LDPPHVPAAPTADEDL--STKPAAQVAYREAYYSGYGLTRMVSRNYARNAQRYRDLYYRL  298
                 +P      E L    KP AQ+   EA     G       + A    +YRDLY  L
Sbjct  199  RRFSGLPGNFETPEQLRARNKPRAQLNQIEAMQRHSGELPYSGDDVAARWAQYRDLYLWL  258

Query  299  HAEVDGPIDRVGRAVTEGGSEDA--MLVRTSDHGDLLGAHGGLHQKWFNLYDEATRVPFV  356
              +VD  I ++   +      DA  +++ T+DHG+  G+H G+  K   LY+E  R+P  
Sbjct  259  QQQVDAQIGKILDTLASRPDVDANTVVLFTADHGEYAGSH-GMRAKGSGLYEENIRIPLY  317

Query  357  IARIGEKATQPR---TVSAPTSHVDLVPTLLSAA  387
            + R   KA  P    T    TS VD+   LL+ A
Sbjct  318  V-RDPRKALTPDPGGTRGQLTSSVDVAAFLLTVA  350


>gi|169631543|ref|YP_001705192.1| sulfatase family protein [Mycobacterium abscessus ATCC 19977]
 gi|169243510|emb|CAM64538.1| Sulfatase family protein [Mycobacterium abscessus]
Length=558

 Score =  124 bits (310),  Expect = 6e-26, Method: Compositional matrix adjust.
 Identities = 123/401 (31%), Positives = 170/401 (43%), Gaps = 78/401 (19%)

Query  2    ANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSR  61
             N+P+I++++ D+ RA   +   + L      L          +SF  HYT S  C PSR
Sbjct  43   GNKPNILVIVVDQMRAPQWFPDVQKLT---NILPNLSRLHRDSVSFASHYTASNMCTPSR  99

Query  62   PTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLGNWFRAAGYDTHYDGKWHIS  121
              + TG Y    G   T   G+   +S L    A + PT G   R  GY T + GKWH+ 
Sbjct  100  GAMTTGLYSHQTGCLYT---GEGPSESSL----APQFPTWGTMLRQQGYRTWWWGKWHLG  152

Query  122  HADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFR  181
                 +P           EG          LDA     +GFSG   P P+  G+ N G +
Sbjct  153  DWSDTNP-----------EG----------LDA-----HGFSGGTFPSPN--GMPNQGLQ  184

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLFPAWVWRSPLKPSPL  241
            +DP + D+   W             A   P+    S VNPHDI  +P         P P 
Sbjct  185  KDPGIVDQFAGWFDAE---------AGKGPWCTTVSLVNPHDICWWPK-------NPLPE  228

Query  242  DPPH----VPAAPTADEDLST--KPAAQVAYREAYYSGYGLTRMVSR---NYARNAQRYR  292
            D PH    +P      ++L    KP  Q+ Y  A +    +T  V+    + AR   R  
Sbjct  229  DVPHWFDGLPVNFQTPDELRQHGKPRLQIDY--ANFMSPIMTGAVTYSGPDMARQWARCL  286

Query  293  DLYYRLHAEVDGPIDRV-----GRAVTEGGSEDAMLVRTSDHGDLLGAHGGLHQKWFNLY  347
            D+Y  L  +VD  I RV      R   +G   + ++V TSDHG+  G+H GL  K    Y
Sbjct  287  DMYLWLQQQVDAQIGRVLDKLASRPEIDG---NTIVVFTSDHGEYAGSH-GLRGKGATAY  342

Query  348  DEATRVPFVIARIGEKATQPR---TVSAPTSHVDLVPTLLS  385
            +EA RVP  I R  +    P+   T +  TS VDL P LL+
Sbjct  343  EEAIRVPLYI-RDPQGVLTPKPGETRTQLTSSVDLAPLLLT  382


>gi|217969899|ref|YP_002355133.1| sulfatase [Thauera sp. MZ1T]
 gi|217507226|gb|ACK54237.1| sulfatase [Thauera sp. MZ1T]
Length=558

 Score =  124 bits (310),  Expect = 7e-26, Method: Compositional matrix adjust.
 Identities = 125/485 (26%), Positives = 209/485 (44%), Gaps = 82/485 (16%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            +I+ ++TD+ER   P E          +L  R    ++G+ F  H   S  C PSR  I+
Sbjct  15   NIVFILTDQERYFRPDELPA-----GYTLPARERLAKNGVVFENHRINSCVCTPSRSVIY  69

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWLRA--GEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            TG++     + QT    + FD++   W+ +   ++ TLG+  R AGY T Y GKWH++  
Sbjct  70   TGRH-----IQQT----RMFDNTNFPWISSMSTDIKTLGHMMREAGYYTAYKGKWHLTRE  120

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANS--GFR  181
               D    AP      E                +  YGFS ++G    G  +A++  G+ 
Sbjct  121  FETDNTLAAPQKIFTKE----------------MEAYGFSDYLGV---GDIIAHTQGGYL  161

Query  182  RDPLVADRVVAWLTERYARRRAGDTA-AMRPFLLVASFVNPHDIVLFPAWVWRSPLKPS-  239
             D L+A    +WL     R +A + A   +P+ L  + VNPHD++ +       P++   
Sbjct  162  HDGLIAAAAASWL-----RSKAAELAEQQKPWFLAVNLVNPHDVMFYNTDEPGQPVQGKH  216

Query  240  -----PLDPPH----------VPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNY  284
                   DP H          +PA+     D   +PAA + +         +T ++  N 
Sbjct  217  HLTHLAGDPEHAMYKKQWDIDLPASFKQPIDAPGRPAAHIDHT---IGNDVMTGVIPTNE  273

Query  285  ARNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGG-SEDAMLVRTSDHGDLLGAHGGLHQKW  343
                ++  + Y     +VD  I  +   + + G + + +++ TSDHG+L GAH  +  K 
Sbjct  274  EWRWRKRHNFYLNALQDVDRHIMTLLDELEDRGLASNTIVILTSDHGELGGAH-QMTGKG  332

Query  344  FNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAAALAESFS  403
               Y E   VP ++A         +   A T+H+DL PTL++      +   AA+A++  
Sbjct  333  ATSYREQNNVPLIVAHPAFAGG--KRCKAVTTHLDLAPTLIALTNASPE-TKAAIAQT--  387

Query  404  EVHPLPGRDLMPVV---DGASADEGR--AIYLMTRDNVLEGDTGASLLSRQLGRIVNPPA  458
                LPG+D  PV+   + A+ D  R   +Y       L+G    S L +    +  P  
Sbjct  388  ----LPGKDFSPVLAAPEQANVDTVRDGQLYCFNMFASLDG----SFLQKASALLAQPGG  439

Query  459  PLRIK  463
              +IK
Sbjct  440  AAKIK  444


>gi|73538537|ref|YP_298904.1| twin-arginine translocation pathway signal protein [Ralstonia 
eutropha JMP134]
 gi|72121874|gb|AAZ64060.1| Twin-arginine translocation pathway signal [Ralstonia eutropha 
JMP134]
Length=600

 Score =  122 bits (305),  Expect = 3e-25, Method: Compositional matrix adjust.
 Identities = 122/442 (28%), Positives = 197/442 (45%), Gaps = 72/442 (16%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            +I++++ D+ER   P E          SL       + G +F  H   S  C PSR  ++
Sbjct  58   NILLIVVDQERRFRPGELPV-----GYSLPAHERLMKRGTTFLNHQINSCVCTPSRSVLY  112

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWL--RAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            TGQ+     + QT    + FD++   W+   + ++PTLG+  R AGY T Y GKWH+   
Sbjct  113  TGQH-----IQQT----RMFDNTNFPWITSMSTDIPTLGDMLRDAGYYTAYKGKWHL---  160

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANS--GFR  181
                        T + E V       +   A+ +  YGFS ++G    G  +A++  G+ 
Sbjct  161  ------------TKEFETVNKLGTPTKIFTAE-MEAYGFSDYIGI---GDIIAHTSGGYL  204

Query  182  RDPLVADRVVAWLTERYARRRAGDTAAM-RPFLLVASFVNPHDIVLFPAWVWRSPLKPS-  239
             D ++A    +WL     R +  D AA  +P+ L  + VNPHD++ +      + ++ + 
Sbjct  205  HDGVIAAMGTSWL-----RGKGRDLAAQGKPWFLAMNLVNPHDVMFYDTDAPGTKVQATR  259

Query  240  --------PLDPPH-------VPAAPTADEDLSTKPAAQVAYREAYYSGYGLTRMVSRNY  284
                    P DP +       +PA+     D   +P    A+R+   S   +   +    
Sbjct  260  GLAHVARDPADPLYAKQWNFTLPASHAQPLDAPGRPP---AHRDFLRSHDAMVGAIPNEE  316

Query  285  ARNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKW  343
            AR  +R+ + Y     +VD  I  V   +   G  D  +++ TSDHGD+ GAH  LH K 
Sbjct  317  ARWRRRH-NYYLNCMRDVDRNIASVLAELDAAGLTDKTIVILTSDHGDMDGAH-QLHAKG  374

Query  344  FNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVD----VVAAALA  399
               Y E   VP +I+         R   A TSH+D+ PTL++ +GV+ D    +V     
Sbjct  375  AVSYREQNNVPLIISHPAYPGG--RQCRAVTSHLDIAPTLVAMSGVNADKRATLVKGLAG  432

Query  400  ESFSEVHPLPGR-DLMPVVDGA  420
            + FS +   P + D   + DGA
Sbjct  433  KDFSGLLSAPEKADANAIRDGA  454


>gi|78061333|ref|YP_371241.1| arylsulfatase A like protein [Burkholderia sp. 383]
 gi|77969218|gb|ABB10597.1| Arylsulfatase A like protein [Burkholderia sp. 383]
Length=611

 Score =  120 bits (302),  Expect = 6e-25, Method: Compositional matrix adjust.
 Identities = 122/429 (29%), Positives = 185/429 (44%), Gaps = 69/429 (16%)

Query  6    DIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHGISFTRHYTGSLACVPSRPTIF  65
            +I+ V+ D+ER           AW   S+ GR      GISF  H   +  C PSR TI+
Sbjct  72   NILFVLVDQERYFD--------AWPM-SVPGRERLARSGISFINHQIAACVCSPSRSTIY  122

Query  66   TGQYPDLHGVTQTDGIGKRFDDSRLRWL--RAGEVPTLGNWFRAAGYDTHYDGKWHISHA  123
            TGQ+    GV         FD++ L W       + T+G+  + AGY   Y GKWH+S  
Sbjct  123  TGQHMQRTGV---------FDNAGLPWQPDMPTSIRTVGHMMKDAGYQAVYVGKWHLS--  171

Query  124  DLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPYGFSGWVGPEPHGAGLANSGFRRD  183
                       AT        +A V  Y  A  +  YGF  + G      G A+ G+  D
Sbjct  172  -----------ATMHESNSPYNAPVADYNKA--MRSYGFDDYFGVGDL-VGSAHGGYNFD  217

Query  184  PLVADRVVAWLTERYARRRAGDTAAMRPFLLVASFVNPHDIVLF-------PAWVWRSPL  236
             +     ++W+ E+  RR A   A  +P++L  + VNPHD++         P      P 
Sbjct  218  GVTTQAAISWMREQ--RRNA---AGAKPWMLAVNLVNPHDVMWLNTDPSGRPNGSGLIPT  272

Query  237  KPSP---LDPPH---VPAAPTADEDLST--KPAAQVAYREAYYSGYGLTRMVSRNYARNA  288
            +P+P   L   H   VP   +  + L+   +P A   Y  A+ +  G             
Sbjct  273  RPAPDTQLYGAHWDKVPLPVSRRQPLAAPDRPKAHAMYSAAHEALIGKIEFDD----ATV  328

Query  289  QRYRDLYYRLHAEVDGPIDRVGRAVTEGGSED-AMLVRTSDHGDLLGAHGGLHQKWFNLY  347
            +RY+D Y     + D  ++R+   + + G  D  ++V TSDHGDL G H  +  K  N Y
Sbjct  329  KRYQDYYLNCIRDCDRHVERLLDELDDLGIADKTIVVLTSDHGDLAGHHQMI-DKGANAY  387

Query  348  DEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVDVVAA-----ALAESF  402
             +   VP ++     +    ++  A TSH+D+ PTL++  G   D VA+     A   SF
Sbjct  388  RQQNHVPMIVRHPAFRGG--KSCRALTSHLDVAPTLVALTGAPADKVASVVGPDAKGSSF  445

Query  403  SEVHPLPGR  411
            + +   P R
Sbjct  446  AHLLAQPER  454



Lambda     K      H
   0.319    0.137    0.431 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 1369331474720


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40