BLASTP 2.2.25+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv3142c

Length=142
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15610278|ref|NP_217658.1|  hypothetical protein Rv3142c [Mycob...   296    6e-79
gi|15842718|ref|NP_337755.1|  hypothetical protein MT3229 [Mycoba...   296    7e-79
gi|340628118|ref|YP_004746570.1|  hypothetical protein MCAN_31561...   295    2e-78
gi|31794318|ref|NP_856811.1|  hypothetical protein Mb3166c [Mycob...   294    3e-78
gi|148824336|ref|YP_001289090.1|  hypothetical protein TBFG_13163...   293    4e-78
gi|254821190|ref|ZP_05226191.1|  hypothetical protein MintA_14732...   226    7e-58
gi|342858065|ref|ZP_08714721.1|  hypothetical protein MCOL_04285 ...   218    2e-55
gi|169631403|ref|YP_001705052.1|  hypothetical protein MAB_4326c ...   174    5e-42
gi|108801803|ref|YP_642000.1|  hypothetical protein Mmcs_4840 [My...   165    2e-39
gi|145225160|ref|YP_001135838.1|  hypothetical protein Mflv_4582 ...   165    2e-39
gi|120401345|ref|YP_951174.1|  hypothetical protein Mvan_0320 [My...   164    4e-39
gi|315443031|ref|YP_004075910.1|  hypothetical protein Mspyr1_140...   164    4e-39
gi|126437152|ref|YP_001072843.1|  hypothetical protein Mjls_4587 ...   163    7e-39
gi|118465896|ref|YP_882973.1|  hypothetical protein MAV_3802 [Myc...   156    9e-37
gi|183984807|ref|YP_001853098.1|  hypothetical protein MMAR_4838 ...   152    1e-35
gi|296164074|ref|ZP_06846697.1|  conserved hypothetical protein [...   148    3e-34
gi|333990706|ref|YP_004523320.1|  hypothetical protein JDM601_206...   142    2e-32
gi|120402130|ref|YP_951959.1|  hypothetical protein Mvan_1117 [My...  55.1    3e-06
gi|311894351|dbj|BAJ26759.1|  hypothetical protein KSE_09220 [Kit...  40.8    0.062
gi|327304475|ref|XP_003236929.1|  arylsulfatase [Trichophyton rub...  35.4    3.1  
gi|117530185|ref|YP_851028.1|  hypothetical protein MaLMM01_gp014...  35.0    3.8  
gi|329113659|ref|ZP_08242435.1|  Nitrite reductase large subunit ...  34.7    4.9  
gi|226973313|gb|ACO94460.1|  polyketide synthase type I [Streptom...  34.3    6.3  
gi|167841435|ref|ZP_02468119.1|  HpcH/HpaI aldolase [Burkholderia...  33.9    8.6  
gi|302501502|ref|XP_003012743.1|  arylsulfatase, putative [Arthro...  33.9    8.8  


>gi|15610278|ref|NP_217658.1| hypothetical protein Rv3142c [Mycobacterium tuberculosis H37Rv]
 gi|121639025|ref|YP_979249.1| hypothetical protein BCG_3165c [Mycobacterium bovis BCG str. 
Pasteur 1173P2]
 gi|148662997|ref|YP_001284520.1| hypothetical protein MRA_3175 [Mycobacterium tuberculosis H37Ra]
 72 more sequence titles
 Length=142

 Score =  296 bits (758),  Expect = 6e-79, Method: Compositional matrix adjust.
 Identities = 142/142 (100%), Positives = 142/142 (100%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  121  ACLAPGKLRVVRHDVADANGDQ  142


>gi|15842718|ref|NP_337755.1| hypothetical protein MT3229 [Mycobacterium tuberculosis CDC1551]
 gi|13883039|gb|AAK47569.1| hypothetical protein MT3229 [Mycobacterium tuberculosis CDC1551]
Length=175

 Score =  296 bits (758),  Expect = 7e-79, Method: Compositional matrix adjust.
 Identities = 142/142 (100%), Positives = 142/142 (100%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  34   MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  93

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  94   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  153

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  154  ACLAPGKLRVVRHDVADANGDQ  175


>gi|340628118|ref|YP_004746570.1| hypothetical protein MCAN_31561 [Mycobacterium canettii CIPT 
140010059]
 gi|340006308|emb|CCC45487.1| hypothetical protein MCAN_31561 [Mycobacterium canettii CIPT 
140010059]
Length=142

 Score =  295 bits (754),  Expect = 2e-78, Method: Compositional matrix adjust.
 Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAF WYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFAWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  121  ACLAPGKLRVVRHDVADANGDQ  142


>gi|31794318|ref|NP_856811.1| hypothetical protein Mb3166c [Mycobacterium bovis AF2122/97]
 gi|31619914|emb|CAD95258.1| HYPOTHETICAL PROTEIN Mb3166c [Mycobacterium bovis AF2122/97]
Length=142

 Score =  294 bits (752),  Expect = 3e-78, Method: Compositional matrix adjust.
 Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVR DVADANGDQ
Sbjct  121  ACLAPGKLRVVRQDVADANGDQ  142


>gi|148824336|ref|YP_001289090.1| hypothetical protein TBFG_13163 [Mycobacterium tuberculosis F11]
 gi|148722863|gb|ABR07488.1| hypothetical protein TBFG_13163 [Mycobacterium tuberculosis F11]
Length=142

 Score =  293 bits (751),  Expect = 4e-78, Method: Compositional matrix adjust.
 Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEG AVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGSAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  121  ACLAPGKLRVVRHDVADANGDQ  142


>gi|254821190|ref|ZP_05226191.1| hypothetical protein MintA_14732 [Mycobacterium intracellulare 
ATCC 13950]
Length=142

 Score =  226 bits (577),  Expect = 7e-58, Method: Compositional matrix adjust.
 Identities = 107/142 (76%), Positives = 123/142 (87%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQ++ EQWLEGCAVQRIMFRDGLVLNF+DYNELVI+ P++LTLPAIETSPAEV+AIDP
Sbjct  1    MTEQKVIEQWLEGCAVQRIMFRDGLVLNFEDYNELVITAPMRLTLPAIETSPAEVIAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            N PA   RPLFDFAG++CT+ VW DTGDLHLEFSD H+IDV  +D   AWELY K+HGYA
Sbjct  61   NHPAGQLRPLFDFAGSSCTSAVWSDTGDLHLEFSDDHKIDVPCNDNAIAWELYSKHHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLA G+LRVVR D A A+GD+
Sbjct  121  ACLAHGELRVVRLDTARADGDR  142


>gi|342858065|ref|ZP_08714721.1| hypothetical protein MCOL_04285 [Mycobacterium colombiense CECT 
3035]
 gi|342135398|gb|EGT88564.1| hypothetical protein MCOL_04285 [Mycobacterium colombiense CECT 
3035]
Length=142

 Score =  218 bits (556),  Expect = 2e-55, Method: Compositional matrix adjust.
 Identities = 105/142 (74%), Positives = 115/142 (81%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            M E +M   WLEGCA+QRIMFRDGLVLNFDD NELVISVP++LTLPAI  +PAEVV IDP
Sbjct  1    MAESKMIGHWLEGCALQRIMFRDGLVLNFDDDNELVISVPIRLTLPAIANAPAEVVEIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            N PA  ERPLFDF+G  CT F W+D+GDLHLEFSDGH IDV  DD  TAWELYGKYHGYA
Sbjct  61   NGPAVQERPLFDFSGQNCTGFDWFDSGDLHLEFSDGHIIDVPADDHATAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACL  GK+RVVRHDV   N D+
Sbjct  121  ACLPHGKVRVVRHDVDATNIDE  142


>gi|169631403|ref|YP_001705052.1| hypothetical protein MAB_4326c [Mycobacterium abscessus ATCC 
19977]
 gi|169243370|emb|CAM64398.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=136

 Score =  174 bits (440),  Expect = 5e-42, Method: Compositional matrix adjust.
 Identities = 83/135 (62%), Positives = 98/135 (73%), Gaps = 0/135 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M+E W+EGC VQRIMFRDGLV++  DYNE+VI+VP+ LTLP     P E+V +DP    D
Sbjct  1    MSEAWIEGCPVQRIMFRDGLVISLGDYNEVVIAVPMWLTLPPAGKWPREIVCVDPKAILD  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
             ERPLF  +G+TCT   W D GDLH+EFSD H IDV   D  TAWE+YGKYHGY A L  
Sbjct  61   EERPLFSISGSTCTEARWNDAGDLHMEFSDDHVIDVPHHDFDTAWEIYGKYHGYVASLPR  120

Query  126  GKLRVVRHDVADANG  140
            GK+RVVRHDVA+  G
Sbjct  121  GKVRVVRHDVAEEAG  135


>gi|108801803|ref|YP_642000.1| hypothetical protein Mmcs_4840 [Mycobacterium sp. MCS]
 gi|119870955|ref|YP_940907.1| hypothetical protein Mkms_4928 [Mycobacterium sp. KMS]
 gi|108772222|gb|ABG10944.1| conserved hypothetical protein [Mycobacterium sp. MCS]
 gi|119697044|gb|ABL94117.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=142

 Score =  165 bits (418),  Expect = 2e-39, Method: Compositional matrix adjust.
 Identities = 81/132 (62%), Positives = 94/132 (72%), Gaps = 0/132 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M  QW+E C VQR+  RDGLVL+ DDYNE+VIS PL LTLPA    P E V I+P   + 
Sbjct  1    MYTQWIEDCVVQRVSVRDGLVLDLDDYNEVVISRPLLLTLPAAGRFPTEAVLINPLRISV  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
            HERPL + AGA CT     D G LHL FS GH+IDV PD++VTAWELYGK HGY ACL  
Sbjct  61   HERPLLNLAGAVCTQAWSGDDGGLHLAFSRGHRIDVDPDEQVTAWELYGKRHGYMACLPQ  120

Query  126  GKLRVVRHDVAD  137
            G++RVVRHD+ D
Sbjct  121  GRVRVVRHDIPD  132


>gi|145225160|ref|YP_001135838.1| hypothetical protein Mflv_4582 [Mycobacterium gilvum PYR-GCK]
 gi|145217646|gb|ABP47050.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=154

 Score =  165 bits (417),  Expect = 2e-39, Method: Compositional matrix adjust.
 Identities = 81/134 (61%), Positives = 92/134 (69%), Gaps = 0/134 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M  QW+E   VQR+  R GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP   A 
Sbjct  1    MYTQWIENLVVQRLSLRGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLRIAT  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
            HERPL + AGA CT     D G LHL FS GH+IDV PD   TAWELYG  HGY ACL  
Sbjct  61   HERPLLNLAGAVCTQAWSSDDGGLHLSFSGGHRIDVEPDVEQTAWELYGMRHGYMACLPR  120

Query  126  GKLRVVRHDVADAN  139
            G++RVVRHD+ D +
Sbjct  121  GRVRVVRHDLPDTD  134


>gi|120401345|ref|YP_951174.1| hypothetical protein Mvan_0320 [Mycobacterium vanbaalenii PYR-1]
 gi|119954163|gb|ABM11168.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=161

 Score =  164 bits (415),  Expect = 4e-39, Method: Compositional matrix adjust.
 Identities = 81/137 (60%), Positives = 92/137 (68%), Gaps = 0/137 (0%)

Query  3    EQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPND  62
            E  M  QW+E   VQR+    GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP  
Sbjct  5    ETAMYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLR  64

Query  63   PADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAAC  122
             A HERPL + AGA CT     D G LHL FS GH+IDV PD   TAWELYG  HGY AC
Sbjct  65   IATHERPLLNLAGAVCTQAWSSDDGGLHLSFSGGHRIDVEPDVEQTAWELYGMRHGYMAC  124

Query  123  LAPGKLRVVRHDVADAN  139
            L  G++RVVRHD+ D +
Sbjct  125  LPRGRVRVVRHDLPDTD  141


>gi|315443031|ref|YP_004075910.1| hypothetical protein Mspyr1_14000 [Mycobacterium sp. Spyr1]
 gi|315261334|gb|ADT98075.1| hypothetical protein Mspyr1_14000 [Mycobacterium sp. Spyr1]
Length=154

 Score =  164 bits (415),  Expect = 4e-39, Method: Compositional matrix adjust.
 Identities = 80/134 (60%), Positives = 91/134 (68%), Gaps = 0/134 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M  QW+E   VQR+    GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP   A 
Sbjct  1    MYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLRIAT  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
            HERPL + AGA CT     D G LHL FS GH+IDV PD   TAWELYG  HGY ACL  
Sbjct  61   HERPLLNLAGAVCTQACSSDDGGLHLSFSRGHRIDVDPDTEQTAWELYGMRHGYMACLPQ  120

Query  126  GKLRVVRHDVADAN  139
            G++RVVRHD+ D +
Sbjct  121  GRVRVVRHDLPDTD  134


>gi|126437152|ref|YP_001072843.1| hypothetical protein Mjls_4587 [Mycobacterium sp. JLS]
 gi|126236952|gb|ABO00353.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=161

 Score =  163 bits (413),  Expect = 7e-39, Method: Compositional matrix adjust.
 Identities = 81/137 (60%), Positives = 92/137 (68%), Gaps = 0/137 (0%)

Query  3    EQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPND  62
            E  M  QW+E   VQR+    GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP  
Sbjct  5    ETAMYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLR  64

Query  63   PADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAAC  122
             A HERPL + AGA CT     D G LHL FS GH+IDV PD   TAWELYG  HGY AC
Sbjct  65   IATHERPLLNLAGALCTQAWSSDDGGLHLSFSRGHRIDVDPDAEQTAWELYGMRHGYMAC  124

Query  123  LAPGKLRVVRHDVADAN  139
            L  G++RVVRHD+ D +
Sbjct  125  LPQGRVRVVRHDLPDTD  141


>gi|118465896|ref|YP_882973.1| hypothetical protein MAV_3802 [Mycobacterium avium 104]
 gi|118167183|gb|ABK68080.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=141

 Score =  156 bits (395),  Expect = 9e-37, Method: Compositional matrix adjust.
 Identities = 76/133 (58%), Positives = 93/133 (70%), Gaps = 1/133 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETS-PAEVVAIDPNDPA  64
            M   W+E C VQR+  RDGLVL+ DDYNELVI+ P++LTLP I +S P E V IDP + +
Sbjct  1    MHTPWIERCTVQRVSLRDGLVLDLDDYNELVIATPIRLTLPPIGSSYPEEQVLIDPGNVS  60

Query  65   DHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLA  124
              +RPL D AGA CT     + G LHL FS GH+IDV P +  T+WELYGK HGY ACL 
Sbjct  61   VQQRPLLDLAGAVCTGAWCDEGGGLHLGFSRGHRIDVAPQEAATSWELYGKRHGYMACLP  120

Query  125  PGKLRVVRHDVAD  137
             G++RVVRHD+ D
Sbjct  121  RGRVRVVRHDLPD  133


>gi|183984807|ref|YP_001853098.1| hypothetical protein MMAR_4838 [Mycobacterium marinum M]
 gi|183178133|gb|ACC43243.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=140

 Score =  152 bits (385),  Expect = 1e-35, Method: Compositional matrix adjust.
 Identities = 76/136 (56%), Positives = 92/136 (68%), Gaps = 0/136 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M  QW+E C VQR+  + GLVL+ DDY+ELVIS P++LTLPA+   P E V IDP   + 
Sbjct  1    MPAQWIEQCTVQRVSLQGGLVLDLDDYSELVISRPMRLTLPAVGAWPEEEVLIDPAHLSP  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
             ER L D AGA CT   + D G LHL FS GH+IDV PD   TAWELYGK HGY ACL  
Sbjct  61   EERTLLDLAGAVCTRAWFDDDGSLHLGFSRGHRIDVLPDAAATAWELYGKGHGYMACLPR  120

Query  126  GKLRVVRHDVADANGD  141
            G++R VRHD++  + D
Sbjct  121  GRVRAVRHDLSAEDDD  136


>gi|296164074|ref|ZP_06846697.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
 gi|295900622|gb|EFG80005.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
Length=148

 Score =  148 bits (373),  Expect = 3e-34, Method: Compositional matrix adjust.
 Identities = 72/130 (56%), Positives = 89/130 (69%), Gaps = 0/130 (0%)

Query  6    MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD  65
            M  QW+E C VQR+   +GLV++ DD+N+LVIS P++LTLP     P E V IDP + + 
Sbjct  1    MYTQWIEQCTVQRVSLHEGLVVDLDDHNQLVISRPMRLTLPPAAGWPEEEVLIDPINLSA  60

Query  66   HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP  125
             ERPL D AGA CT     D G LHL FS GH+IDV PD   T+WELYGK HGY ACL  
Sbjct  61   EERPLLDLAGAICTRAWCDDDGALHLCFSRGHRIDVDPDAAATSWELYGKCHGYMACLPR  120

Query  126  GKLRVVRHDV  135
            G++RV+RHD+
Sbjct  121  GRVRVIRHDL  130


>gi|333990706|ref|YP_004523320.1| hypothetical protein JDM601_2067 [Mycobacterium sp. JDM601]
 gi|333486675|gb|AEF36067.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=131

 Score =  142 bits (357),  Expect = 2e-32, Method: Compositional matrix adjust.
 Identities = 71/121 (59%), Positives = 86/121 (72%), Gaps = 0/121 (0%)

Query  19   IMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHERPLFDFAGATC  78
            +  RDGLVL+FDD NE+VI  PL+LTLPA+   P E V IDP   A HERPL D AGA C
Sbjct  1    MSLRDGLVLDFDDCNEVVIYRPLRLTLPAVGDFPVEAVFIDPGRVATHERPLLDLAGAVC  60

Query  79   TAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAPGKLRVVRHDVADA  138
            T     D G LHL FS GH+IDV    +VTAWELYG++HGY ACL  G++RVVR+D+ +A
Sbjct  61   TQAWCGDGGGLHLGFSSGHRIDVDAHPQVTAWELYGRHHGYMACLPHGRVRVVRYDIPEA  120

Query  139  N  139
            +
Sbjct  121  D  121


>gi|120402130|ref|YP_951959.1| hypothetical protein Mvan_1117 [Mycobacterium vanbaalenii PYR-1]
 gi|119954948|gb|ABM11953.1| hypothetical protein Mvan_1117 [Mycobacterium vanbaalenii PYR-1]
Length=126

 Score = 55.1 bits (131),  Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 34/121 (29%), Positives = 54/121 (45%), Gaps = 7/121 (5%)

Query  11   LEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHE-RP  69
            L G ++Q ++    L +   D   +VI  P  +       S  E V++ P + AD   +P
Sbjct  5    LNGKSLQSVLIEYTLRMQLSDVYFIVIESPFNVD------SHGESVSLSPEEDADEAFQP  58

Query  70   LFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAPGKLR  129
            +    G T       +TG L + FSDG +++V PD+   AW + G       C   GKL 
Sbjct  59   IRQLVGQTVEEATADETGALRVRFSDGTRLEVPPDEAYEAWSVSGPNGALVVCTPGGKLA  118

Query  130  V  130
            +
Sbjct  119  I  119


>gi|311894351|dbj|BAJ26759.1| hypothetical protein KSE_09220 [Kitasatospora setae KM-6054]
Length=132

 Score = 40.8 bits (94),  Expect = 0.062, Method: Compositional matrix adjust.
 Identities = 37/112 (34%), Positives = 45/112 (41%), Gaps = 24/112 (21%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLT--------LPAIETSP  52
            M E   +E  LEG AV+ +   D LVL+ DD   L +    +L          PA+  SP
Sbjct  1    MAESLGSE--LEGRAVRSLRGGDRLVLDLDDGLRLTVRNDFRLRHGAAVDHFYPALGLSP  58

Query  53   AEVVAIDPNDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPD  104
            A               PL   AGA  TA      G L L F  GH + V PD
Sbjct  59   AG--------------PLERLAGAVVTATTVTPAGGLQLSFDTGHVLAVAPD  96


>gi|327304475|ref|XP_003236929.1| arylsulfatase [Trichophyton rubrum CBS 118892]
 gi|326459927|gb|EGD85380.1| arylsulfatase [Trichophyton rubrum CBS 118892]
Length=541

 Score = 35.4 bits (80),  Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 40/100 (40%), Gaps = 29/100 (29%)

Query  33   NELVISVPLQLTLPAIETSPA-----------------EVVAIDPNDPADHERPLFDFAG  75
            +E  I VPL L  P +    A                 E+  I    P+   RP++   G
Sbjct  373  SEGGIRVPLILNYPPLTAGKAGIDHTFGTVMDIAPTLLELAGIKHPAPSYRSRPVYPMRG  432

Query  76   ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGK  115
            ++     W       L +  G Q  +H +D VT+WEL+G+
Sbjct  433  SS-----W-------LPYLKGEQTQIHDEDHVTSWELFGR  460


>gi|117530185|ref|YP_851028.1| hypothetical protein MaLMM01_gp014 [Microcystis phage Ma-LMM01]
 gi|117165797|dbj|BAF36105.1| hypothetical protein [Microcystis phage Ma-LMM01]
Length=696

 Score = 35.0 bits (79),  Expect = 3.8, Method: Composition-based stats.
 Identities = 27/99 (28%), Positives = 40/99 (41%), Gaps = 10/99 (10%)

Query  19   IMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHERPLFDFAG---  75
            I+F DGL+     +N + ++ P+    P+    P  +     N  AD  R     AG   
Sbjct  388  ILFEDGLI-----HNLVTMTAPVDTDAPSFFQWPNGMGWSYNNQLADSSRQRVKAAGGKV  442

Query  76   --ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWEL  112
              A C   +WY+T DL L       I  + D +    EL
Sbjct  443  DGALCCRLMWYNTDDLDLHLQFNGNIIYYGDKKACGGEL  481


>gi|329113659|ref|ZP_08242435.1| Nitrite reductase large subunit [Acetobacter pomorum DM001]
 gi|326697019|gb|EGE48684.1| Nitrite reductase large subunit [Acetobacter pomorum DM001]
Length=975

 Score = 34.7 bits (78),  Expect = 4.9, Method: Compositional matrix adjust.
 Identities = 28/108 (26%), Positives = 46/108 (43%), Gaps = 15/108 (13%)

Query  9    QWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHER  68
            +W E C    ++ + G+V  +D  N++ +       LP  E +PA+V AID +DP  +  
Sbjct  854  EWTELCQSHDLVVKSGVVAWYDG-NQIAL-----FYLPETEVTPAQVYAIDNHDPFSNAN  907

Query  69   P-----LFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWE  111
                  + D  G    A   Y     H    DG  ++  P  R+  W+
Sbjct  908  VIGRGIMGDLKGQLVVASPLYKQ---HFRLEDGQCLE-DPAIRLRTWD  951


>gi|226973313|gb|ACO94460.1| polyketide synthase type I [Streptomyces sp. DSM 21069]
Length=3527

 Score = 34.3 bits (77),  Expect = 6.3, Method: Compositional matrix adjust.
 Identities = 30/95 (32%), Positives = 42/95 (45%), Gaps = 6/95 (6%)

Query  2     TEQEMTEQWLEGCA---VQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAI  58
             T   + + WLE  A   V  ++ R  L +N +D  +L  +    L   A   +P  VV I
Sbjct  3014  TTLAVLQSWLESGADDAVLAVVTRGALSVNGEDVTDLAGAAVWGLVRSAQTENPGRVVLI  3073

Query  59    D---PNDPADHERPLFDFAGATCTAFVWYDTGDLH  90
             D    +D ADH     D A AT  A +   +G LH
Sbjct  3074  DLDAVDDRADHTDADIDAAVATGEAQIAIRSGTLH  3108


>gi|167841435|ref|ZP_02468119.1| HpcH/HpaI aldolase [Burkholderia thailandensis MSMB43]
Length=275

 Score = 33.9 bits (76),  Expect = 8.6, Method: Compositional matrix adjust.
 Identities = 18/39 (47%), Positives = 24/39 (62%), Gaps = 2/39 (5%)

Query  95   DGHQIDVHPDDRVTAWELYGKYHGYAA--CLAPGKLRVV  131
            DG   DVH  DR+TA  L GK HG+ A  C+ P ++ +V
Sbjct  181  DGVTPDVHDADRLTADALNGKRHGFGAKLCIHPAQVGIV  219


>gi|302501502|ref|XP_003012743.1| arylsulfatase, putative [Arthroderma benhamiae CBS 112371]
 gi|291176303|gb|EFE32103.1| arylsulfatase, putative [Arthroderma benhamiae CBS 112371]
Length=558

 Score = 33.9 bits (76),  Expect = 8.8, Method: Compositional matrix adjust.
 Identities = 25/100 (25%), Positives = 40/100 (40%), Gaps = 29/100 (29%)

Query  33   NELVISVPLQLTLPAIETSPA-----------------EVVAIDPNDPADHERPLFDFAG  75
            +E  I VPL L  P +    A                 E+  I    P+   RP++   G
Sbjct  390  SEGGIRVPLILNYPPLTAGKAGIDHTFGTVMDIAPTLLELAGIKHPAPSYRSRPVYPMRG  449

Query  76   ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGK  115
            ++     W       L +  G +  +H +D VT+WEL+G+
Sbjct  450  SS-----W-------LPYLKGERTQIHDEDHVTSWELFGR  477



Lambda     K      H
   0.321    0.138    0.441 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 129798780480


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
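

Note on the statistics above: the bit scores and E-values in the hit list can be approximately reproduced from the gapped Lambda and K values and the effective search space printed in this report. The sketch below is an illustration only, not BLAST's own code. It assumes the standard Karlin-Altschul relations (bit score = (Lambda*S - ln K) / ln 2, E = search space * 2^-bits); the function names are hypothetical. Because the printed Lambda and K are rounded, and compositional matrix adjustment rescales scores per subject sequence, the results should land near the reported values for the first hit (296 bits, raw score 758) rather than match them exactly.

```python
import math

# Values copied from this report's footer (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129798780480


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# First hit in the report: raw score 758.
bits = bit_score(758, GAPPED_LAMBDA, GAPPED_K)
evalue = expect_value(bits, EFFECTIVE_SEARCH_SPACE)

print(f"bit score ~ {bits:.1f}")
print(f"E-value   ~ {evalue:.1e}")
```

The same two functions can be applied to any other raw score in parentheses on a "Score =" line; small differences from the printed E-values are expected for the reasons given above.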