BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of
PSI-BLAST protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv3142c
Length=142

                                                                      Score   E
Sequences producing significant alignments:                           (Bits)  Value

gi|15610278|ref|NP_217658.1| hypothetical protein Rv3142c [Mycob...   296     6e-79
gi|15842718|ref|NP_337755.1| hypothetical protein MT3229 [Mycoba...   296     7e-79
gi|340628118|ref|YP_004746570.1| hypothetical protein MCAN_31561...   295     2e-78
gi|31794318|ref|NP_856811.1| hypothetical protein Mb3166c [Mycob...   294     3e-78
gi|148824336|ref|YP_001289090.1| hypothetical protein TBFG_13163...   293     4e-78
gi|254821190|ref|ZP_05226191.1| hypothetical protein MintA_14732...   226     7e-58
gi|342858065|ref|ZP_08714721.1| hypothetical protein MCOL_04285 ...   218     2e-55
gi|169631403|ref|YP_001705052.1| hypothetical protein MAB_4326c ...   174     5e-42
gi|108801803|ref|YP_642000.1| hypothetical protein Mmcs_4840 [My...   165     2e-39
gi|145225160|ref|YP_001135838.1| hypothetical protein Mflv_4582 ...   165     2e-39
gi|120401345|ref|YP_951174.1| hypothetical protein Mvan_0320 [My...   164     4e-39
gi|315443031|ref|YP_004075910.1| hypothetical protein Mspyr1_140...   164     4e-39
gi|126437152|ref|YP_001072843.1| hypothetical protein Mjls_4587 ...   163     7e-39
gi|118465896|ref|YP_882973.1| hypothetical protein MAV_3802 [Myc...   156     9e-37
gi|183984807|ref|YP_001853098.1| hypothetical protein MMAR_4838 ...   152     1e-35
gi|296164074|ref|ZP_06846697.1| conserved hypothetical protein [...   148     3e-34
gi|333990706|ref|YP_004523320.1| hypothetical protein JDM601_206...   142     2e-32
gi|120402130|ref|YP_951959.1| hypothetical protein Mvan_1117 [My...   55.1    3e-06
gi|311894351|dbj|BAJ26759.1| hypothetical protein KSE_09220 [Kit...   40.8    0.062
gi|327304475|ref|XP_003236929.1| arylsulfatase [Trichophyton rub...   35.4    3.1
gi|117530185|ref|YP_851028.1| hypothetical protein MaLMM01_gp014...   35.0    3.8
gi|329113659|ref|ZP_08242435.1| Nitrite reductase large subunit ...   34.7    4.9
gi|226973313|gb|ACO94460.1| polyketide synthase type I [Streptom...   34.3    6.3
gi|167841435|ref|ZP_02468119.1| HpcH/HpaI aldolase [Burkholderia...   33.9    8.6
gi|302501502|ref|XP_003012743.1| arylsulfatase, putative [Arthro...   33.9    8.8

>gi|15610278|ref|NP_217658.1| hypothetical protein Rv3142c [Mycobacterium tuberculosis H37Rv]
 gi|121639025|ref|YP_979249.1| hypothetical protein BCG_3165c [Mycobacterium bovis BCG str. Pasteur 1173P2]
 gi|148662997|ref|YP_001284520.1| hypothetical protein MRA_3175 [Mycobacterium tuberculosis H37Ra]
 72 more sequence titles
Length=142

 Score = 296 bits (758), Expect = 6e-79, Method: Compositional matrix adjust.
 Identities = 142/142 (100%), Positives = 142/142 (100%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  121  ACLAPGKLRVVRHDVADANGDQ  142

>gi|15842718|ref|NP_337755.1| hypothetical protein MT3229 [Mycobacterium tuberculosis CDC1551]
 gi|13883039|gb|AAK47569.1| hypothetical protein MT3229 [Mycobacterium tuberculosis CDC1551]
Length=175

 Score = 296 bits (758), Expect = 7e-79, Method: Compositional matrix adjust.
 Identities = 142/142 (100%), Positives = 142/142 (100%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  34   MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  93

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  94   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  153

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  154  ACLAPGKLRVVRHDVADANGDQ  175

>gi|340628118|ref|YP_004746570.1| hypothetical protein MCAN_31561 [Mycobacterium canettii CIPT 140010059]
 gi|340006308|emb|CCC45487.1| hypothetical protein MCAN_31561 [Mycobacterium canettii CIPT 140010059]
Length=142

 Score = 295 bits (754), Expect = 2e-78, Method: Compositional matrix adjust.
 Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAF WYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFAWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVRHDVADANGDQ
Sbjct  121  ACLAPGKLRVVRHDVADANGDQ  142

>gi|31794318|ref|NP_856811.1| hypothetical protein Mb3166c [Mycobacterium bovis AF2122/97]
 gi|31619914|emb|CAD95258.1| HYPOTHETICAL PROTEIN Mb3166c [Mycobacterium bovis AF2122/97]
Length=142

 Score = 294 bits (752), Expect = 3e-78, Method: Compositional matrix adjust.
 Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%)

Query  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60
            MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP
Sbjct  1    MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP  60

Query  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120
            NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA
Sbjct  61   NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA  120

Query  121  ACLAPGKLRVVRHDVADANGDQ  142
            ACLAPGKLRVVR DVADANGDQ
Sbjct  121  ACLAPGKLRVVRQDVADANGDQ  142

>gi|148824336|ref|YP_001289090.1| hypothetical protein TBFG_13163 [Mycobacterium tuberculosis F11]
 gi|148722863|gb|ABR07488.1| hypothetical protein TBFG_13163 [Mycobacterium tuberculosis F11]
Length=142

 Score = 293 bits (751), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 141/142 (99%), Positives = 141/142 (99%), Gaps = 0/142 (0%) Query 1 MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP 60 MTEQEMTEQWLEG AVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP Sbjct 1 MTEQEMTEQWLEGSAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP 60 Query 61 NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA 120 NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA Sbjct 61 NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA 120 Query 121 ACLAPGKLRVVRHDVADANGDQ 142 ACLAPGKLRVVRHDVADANGDQ Sbjct 121 ACLAPGKLRVVRHDVADANGDQ 142 >gi|254821190|ref|ZP_05226191.1| hypothetical protein MintA_14732 [Mycobacterium intracellulare ATCC 13950] Length=142 Score = 226 bits (577), Expect = 7e-58, Method: Compositional matrix adjust. Identities = 107/142 (76%), Positives = 123/142 (87%), Gaps = 0/142 (0%) Query 1 MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP 60 MTEQ++ EQWLEGCAVQRIMFRDGLVLNF+DYNELVI+ P++LTLPAIETSPAEV+AIDP Sbjct 1 MTEQKVIEQWLEGCAVQRIMFRDGLVLNFEDYNELVITAPMRLTLPAIETSPAEVIAIDP 60 Query 61 NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA 120 N PA RPLFDFAG++CT+ VW DTGDLHLEFSD H+IDV +D AWELY K+HGYA Sbjct 61 NHPAGQLRPLFDFAGSSCTSAVWSDTGDLHLEFSDDHKIDVPCNDNAIAWELYSKHHGYA 120 Query 121 ACLAPGKLRVVRHDVADANGDQ 142 ACLA G+LRVVR D A A+GD+ Sbjct 121 ACLAHGELRVVRLDTARADGDR 142 >gi|342858065|ref|ZP_08714721.1| hypothetical protein MCOL_04285 [Mycobacterium colombiense CECT 3035] gi|342135398|gb|EGT88564.1| hypothetical protein MCOL_04285 [Mycobacterium colombiense CECT 3035] Length=142 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust. 
Identities = 105/142 (74%), Positives = 115/142 (81%), Gaps = 0/142 (0%) Query 1 MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDP 60 M E +M WLEGCA+QRIMFRDGLVLNFDD NELVISVP++LTLPAI +PAEVV IDP Sbjct 1 MAESKMIGHWLEGCALQRIMFRDGLVLNFDDDNELVISVPIRLTLPAIANAPAEVVEIDP 60 Query 61 NDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYA 120 N PA ERPLFDF+G CT F W+D+GDLHLEFSDGH IDV DD TAWELYGKYHGYA Sbjct 61 NGPAVQERPLFDFSGQNCTGFDWFDSGDLHLEFSDGHIIDVPADDHATAWELYGKYHGYA 120 Query 121 ACLAPGKLRVVRHDVADANGDQ 142 ACL GK+RVVRHDV N D+ Sbjct 121 ACLPHGKVRVVRHDVDATNIDE 142 >gi|169631403|ref|YP_001705052.1| hypothetical protein MAB_4326c [Mycobacterium abscessus ATCC 19977] gi|169243370|emb|CAM64398.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=136 Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 83/135 (62%), Positives = 98/135 (73%), Gaps = 0/135 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M+E W+EGC VQRIMFRDGLV++ DYNE+VI+VP+ LTLP P E+V +DP D Sbjct 1 MSEAWIEGCPVQRIMFRDGLVISLGDYNEVVIAVPMWLTLPPAGKWPREIVCVDPKAILD 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 ERPLF +G+TCT W D GDLH+EFSD H IDV D TAWE+YGKYHGY A L Sbjct 61 EERPLFSISGSTCTEARWNDAGDLHMEFSDDHVIDVPHHDFDTAWEIYGKYHGYVASLPR 120 Query 126 GKLRVVRHDVADANG 140 GK+RVVRHDVA+ G Sbjct 121 GKVRVVRHDVAEEAG 135 >gi|108801803|ref|YP_642000.1| hypothetical protein Mmcs_4840 [Mycobacterium sp. MCS] gi|119870955|ref|YP_940907.1| hypothetical protein Mkms_4928 [Mycobacterium sp. KMS] gi|108772222|gb|ABG10944.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119697044|gb|ABL94117.1| conserved hypothetical protein [Mycobacterium sp. KMS] Length=142 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. 
Identities = 81/132 (62%), Positives = 94/132 (72%), Gaps = 0/132 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M QW+E C VQR+ RDGLVL+ DDYNE+VIS PL LTLPA P E V I+P + Sbjct 1 MYTQWIEDCVVQRVSVRDGLVLDLDDYNEVVISRPLLLTLPAAGRFPTEAVLINPLRISV 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 HERPL + AGA CT D G LHL FS GH+IDV PD++VTAWELYGK HGY ACL Sbjct 61 HERPLLNLAGAVCTQAWSGDDGGLHLAFSRGHRIDVDPDEQVTAWELYGKRHGYMACLPQ 120 Query 126 GKLRVVRHDVAD 137 G++RVVRHD+ D Sbjct 121 GRVRVVRHDIPD 132 >gi|145225160|ref|YP_001135838.1| hypothetical protein Mflv_4582 [Mycobacterium gilvum PYR-GCK] gi|145217646|gb|ABP47050.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] Length=154 Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 81/134 (61%), Positives = 92/134 (69%), Gaps = 0/134 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M QW+E VQR+ R GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP A Sbjct 1 MYTQWIENLVVQRLSLRGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLRIAT 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 HERPL + AGA CT D G LHL FS GH+IDV PD TAWELYG HGY ACL Sbjct 61 HERPLLNLAGAVCTQAWSSDDGGLHLSFSGGHRIDVEPDVEQTAWELYGMRHGYMACLPR 120 Query 126 GKLRVVRHDVADAN 139 G++RVVRHD+ D + Sbjct 121 GRVRVVRHDLPDTD 134 >gi|120401345|ref|YP_951174.1| hypothetical protein Mvan_0320 [Mycobacterium vanbaalenii PYR-1] gi|119954163|gb|ABM11168.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=161 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. 
Identities = 81/137 (60%), Positives = 92/137 (68%), Gaps = 0/137 (0%) Query 3 EQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPND 62 E M QW+E VQR+ GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP Sbjct 5 ETAMYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLR 64 Query 63 PADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAAC 122 A HERPL + AGA CT D G LHL FS GH+IDV PD TAWELYG HGY AC Sbjct 65 IATHERPLLNLAGAVCTQAWSSDDGGLHLSFSGGHRIDVEPDVEQTAWELYGMRHGYMAC 124 Query 123 LAPGKLRVVRHDVADAN 139 L G++RVVRHD+ D + Sbjct 125 LPRGRVRVVRHDLPDTD 141 >gi|315443031|ref|YP_004075910.1| hypothetical protein Mspyr1_14000 [Mycobacterium sp. Spyr1] gi|315261334|gb|ADT98075.1| hypothetical protein Mspyr1_14000 [Mycobacterium sp. Spyr1] Length=154 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 80/134 (60%), Positives = 91/134 (68%), Gaps = 0/134 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M QW+E VQR+ GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP A Sbjct 1 MYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLRIAT 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 HERPL + AGA CT D G LHL FS GH+IDV PD TAWELYG HGY ACL Sbjct 61 HERPLLNLAGAVCTQACSSDDGGLHLSFSRGHRIDVDPDTEQTAWELYGMRHGYMACLPQ 120 Query 126 GKLRVVRHDVADAN 139 G++RVVRHD+ D + Sbjct 121 GRVRVVRHDLPDTD 134 >gi|126437152|ref|YP_001072843.1| hypothetical protein Mjls_4587 [Mycobacterium sp. JLS] gi|126236952|gb|ABO00353.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=161 Score = 163 bits (413), Expect = 7e-39, Method: Compositional matrix adjust. 
Identities = 81/137 (60%), Positives = 92/137 (68%), Gaps = 0/137 (0%) Query 3 EQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPND 62 E M QW+E VQR+ GLVL+FDDYNE+VIS PL LTLPA+ T P E V IDP Sbjct 5 ETAMYTQWIENLVVQRLSLHGGLVLDFDDYNEIVISCPLLLTLPAVGTYPIEAVRIDPLR 64 Query 63 PADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAAC 122 A HERPL + AGA CT D G LHL FS GH+IDV PD TAWELYG HGY AC Sbjct 65 IATHERPLLNLAGALCTQAWSSDDGGLHLSFSRGHRIDVDPDAEQTAWELYGMRHGYMAC 124 Query 123 LAPGKLRVVRHDVADAN 139 L G++RVVRHD+ D + Sbjct 125 LPQGRVRVVRHDLPDTD 141 >gi|118465896|ref|YP_882973.1| hypothetical protein MAV_3802 [Mycobacterium avium 104] gi|118167183|gb|ABK68080.1| conserved hypothetical protein [Mycobacterium avium 104] Length=141 Score = 156 bits (395), Expect = 9e-37, Method: Compositional matrix adjust. Identities = 76/133 (58%), Positives = 93/133 (70%), Gaps = 1/133 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETS-PAEVVAIDPNDPA 64 M W+E C VQR+ RDGLVL+ DDYNELVI+ P++LTLP I +S P E V IDP + + Sbjct 1 MHTPWIERCTVQRVSLRDGLVLDLDDYNELVIATPIRLTLPPIGSSYPEEQVLIDPGNVS 60 Query 65 DHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLA 124 +RPL D AGA CT + G LHL FS GH+IDV P + T+WELYGK HGY ACL Sbjct 61 VQQRPLLDLAGAVCTGAWCDEGGGLHLGFSRGHRIDVAPQEAATSWELYGKRHGYMACLP 120 Query 125 PGKLRVVRHDVAD 137 G++RVVRHD+ D Sbjct 121 RGRVRVVRHDLPD 133 >gi|183984807|ref|YP_001853098.1| hypothetical protein MMAR_4838 [Mycobacterium marinum M] gi|183178133|gb|ACC43243.1| conserved hypothetical protein [Mycobacterium marinum M] Length=140 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 76/136 (56%), Positives = 92/136 (68%), Gaps = 0/136 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M QW+E C VQR+ + GLVL+ DDY+ELVIS P++LTLPA+ P E V IDP + Sbjct 1 MPAQWIEQCTVQRVSLQGGLVLDLDDYSELVISRPMRLTLPAVGAWPEEEVLIDPAHLSP 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 ER L D AGA CT + D G LHL FS GH+IDV PD TAWELYGK HGY ACL Sbjct 61 EERTLLDLAGAVCTRAWFDDDGSLHLGFSRGHRIDVLPDAAATAWELYGKGHGYMACLPR 120 Query 126 GKLRVVRHDVADANGD 141 G++R VRHD++ + D Sbjct 121 GRVRAVRHDLSAEDDD 136 >gi|296164074|ref|ZP_06846697.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900622|gb|EFG80005.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=148 Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 72/130 (56%), Positives = 89/130 (69%), Gaps = 0/130 (0%) Query 6 MTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPAD 65 M QW+E C VQR+ +GLV++ DD+N+LVIS P++LTLP P E V IDP + + Sbjct 1 MYTQWIEQCTVQRVSLHEGLVVDLDDHNQLVISRPMRLTLPPAAGWPEEEVLIDPINLSA 60 Query 66 HERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAP 125 ERPL D AGA CT D G LHL FS GH+IDV PD T+WELYGK HGY ACL Sbjct 61 EERPLLDLAGAICTRAWCDDDGALHLCFSRGHRIDVDPDAAATSWELYGKCHGYMACLPR 120 Query 126 GKLRVVRHDV 135 G++RV+RHD+ Sbjct 121 GRVRVIRHDL 130 >gi|333990706|ref|YP_004523320.1| hypothetical protein JDM601_2067 [Mycobacterium sp. JDM601] gi|333486675|gb|AEF36067.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=131 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 71/121 (59%), Positives = 86/121 (72%), Gaps = 0/121 (0%) Query 19 IMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHERPLFDFAGATC 78 + RDGLVL+FDD NE+VI PL+LTLPA+ P E V IDP A HERPL D AGA C Sbjct 1 MSLRDGLVLDFDDCNEVVIYRPLRLTLPAVGDFPVEAVFIDPGRVATHERPLLDLAGAVC 60 Query 79 TAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAPGKLRVVRHDVADA 138 T D G LHL FS GH+IDV +VTAWELYG++HGY ACL G++RVVR+D+ +A Sbjct 61 TQAWCGDGGGLHLGFSSGHRIDVDAHPQVTAWELYGRHHGYMACLPHGRVRVVRYDIPEA 120 Query 139 N 139 + Sbjct 121 D 121 >gi|120402130|ref|YP_951959.1| hypothetical protein Mvan_1117 [Mycobacterium vanbaalenii PYR-1] gi|119954948|gb|ABM11953.1| hypothetical protein Mvan_1117 [Mycobacterium vanbaalenii PYR-1] Length=126 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 34/121 (29%), Positives = 54/121 (45%), Gaps = 7/121 (5%) Query 11 LEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHE-RP 69 L G ++Q ++ L + D +VI P + S E V++ P + AD +P Sbjct 5 LNGKSLQSVLIEYTLRMQLSDVYFIVIESPFNVD------SHGESVSLSPEEDADEAFQP 58 Query 70 LFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYHGYAACLAPGKLR 129 + G T +TG L + FSDG +++V PD+ AW + G C GKL Sbjct 59 IRQLVGQTVEEATADETGALRVRFSDGTRLEVPPDEAYEAWSVSGPNGALVVCTPGGKLA 118 Query 130 V 130 + Sbjct 119 I 119 >gi|311894351|dbj|BAJ26759.1| hypothetical protein KSE_09220 [Kitasatospora setae KM-6054] Length=132 Score = 40.8 bits (94), Expect = 0.062, Method: Compositional matrix adjust. 
Identities = 37/112 (34%), Positives = 45/112 (41%), Gaps = 24/112 (21%) Query 1 MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLT--------LPAIETSP 52 M E +E LEG AV+ + D LVL+ DD L + +L PA+ SP Sbjct 1 MAESLGSE--LEGRAVRSLRGGDRLVLDLDDGLRLTVRNDFRLRHGAAVDHFYPALGLSP 58 Query 53 AEVVAIDPNDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPD 104 A PL AGA TA G L L F GH + V PD Sbjct 59 AG--------------PLERLAGAVVTATTVTPAGGLQLSFDTGHVLAVAPD 96 >gi|327304475|ref|XP_003236929.1| arylsulfatase [Trichophyton rubrum CBS 118892] gi|326459927|gb|EGD85380.1| arylsulfatase [Trichophyton rubrum CBS 118892] Length=541 Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust. Identities = 26/100 (26%), Positives = 40/100 (40%), Gaps = 29/100 (29%) Query 33 NELVISVPLQLTLPAIETSPA-----------------EVVAIDPNDPADHERPLFDFAG 75 +E I VPL L P + A E+ I P+ RP++ G Sbjct 373 SEGGIRVPLILNYPPLTAGKAGIDHTFGTVMDIAPTLLELAGIKHPAPSYRSRPVYPMRG 432 Query 76 ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGK 115 ++ W L + G Q +H +D VT+WEL+G+ Sbjct 433 SS-----W-------LPYLKGEQTQIHDEDHVTSWELFGR 460 >gi|117530185|ref|YP_851028.1| hypothetical protein MaLMM01_gp014 [Microcystis phage Ma-LMM01] gi|117165797|dbj|BAF36105.1| hypothetical protein [Microcystis phage Ma-LMM01] Length=696 Score = 35.0 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 27/99 (28%), Positives = 40/99 (41%), Gaps = 10/99 (10%) Query 19 IMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHERPLFDFAG--- 75 I+F DGL+ +N + ++ P+ P+ P + N AD R AG Sbjct 388 ILFEDGLI-----HNLVTMTAPVDTDAPSFFQWPNGMGWSYNNQLADSSRQRVKAAGGKV 442 Query 76 --ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWEL 112 A C +WY+T DL L I + D + EL Sbjct 443 DGALCCRLMWYNTDDLDLHLQFNGNIIYYGDKKACGGEL 481 >gi|329113659|ref|ZP_08242435.1| Nitrite reductase large subunit [Acetobacter pomorum DM001] gi|326697019|gb|EGE48684.1| Nitrite reductase large subunit [Acetobacter pomorum DM001] Length=975 Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust. 
Identities = 28/108 (26%), Positives = 46/108 (43%), Gaps = 15/108 (13%) Query 9 QWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHER 68 +W E C ++ + G+V +D N++ + LP E +PA+V AID +DP + Sbjct 854 EWTELCQSHDLVVKSGVVAWYDG-NQIAL-----FYLPETEVTPAQVYAIDNHDPFSNAN 907 Query 69 P-----LFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWE 111 + D G A Y H DG ++ P R+ W+ Sbjct 908 VIGRGIMGDLKGQLVVASPLYKQ---HFRLEDGQCLE-DPAIRLRTWD 951 >gi|226973313|gb|ACO94460.1| polyketide synthase type I [Streptomyces sp. DSM 21069] Length=3527 Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust. Identities = 30/95 (32%), Positives = 42/95 (45%), Gaps = 6/95 (6%) Query 2 TEQEMTEQWLEGCA---VQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAI 58 T + + WLE A V ++ R L +N +D +L + L A +P VV I Sbjct 3014 TTLAVLQSWLESGADDAVLAVVTRGALSVNGEDVTDLAGAAVWGLVRSAQTENPGRVVLI 3073 Query 59 D---PNDPADHERPLFDFAGATCTAFVWYDTGDLH 90 D +D ADH D A AT A + +G LH Sbjct 3074 DLDAVDDRADHTDADIDAAVATGEAQIAIRSGTLH 3108 >gi|167841435|ref|ZP_02468119.1| HpcH/HpaI aldolase [Burkholderia thailandensis MSMB43] Length=275 Score = 33.9 bits (76), Expect = 8.6, Method: Compositional matrix adjust. Identities = 18/39 (47%), Positives = 24/39 (62%), Gaps = 2/39 (5%) Query 95 DGHQIDVHPDDRVTAWELYGKYHGYAA--CLAPGKLRVV 131 DG DVH DR+TA L GK HG+ A C+ P ++ +V Sbjct 181 DGVTPDVHDADRLTADALNGKRHGFGAKLCIHPAQVGIV 219 >gi|302501502|ref|XP_003012743.1| arylsulfatase, putative [Arthroderma benhamiae CBS 112371] gi|291176303|gb|EFE32103.1| arylsulfatase, putative [Arthroderma benhamiae CBS 112371] Length=558 Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust. 
 Identities = 25/100 (25%), Positives = 40/100 (40%), Gaps = 29/100 (29%)

Query  33   NELVISVPLQLTLPAIETSPA-----------------EVVAIDPNDPADHERPLFDFAG  75
            +E  I VPL L  P +    A                 E+  I    P+   RP++   G
Sbjct  390  SEGGIRVPLILNYPPLTAGKAGIDHTFGTVMDIAPTLLELAGIKHPAPSYRSRPVYPMRG  449

Query  76   ATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGK  115
            ++     W       L +  G +  +H +D VT+WEL+G+
Sbjct  450  SS-----W-------LPYLKGERTQIHDEDHVTSWELFGR  477

Lambda      K        H
   0.321    0.138    0.441

Gapped
Lambda      K        H
   0.267    0.0410   0.140

Effective search space used: 129798780480

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date: Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40