BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1893 Length=72 Score E Sequences producing significant alignments: (Bits) Value gi|15609030|ref|NP_216409.1| hypothetical protein Rv1893 [Mycoba... 139 1e-31 gi|240170540|ref|ZP_04749199.1| hypothetical protein MkanA1_1460... 115 2e-24 gi|41407715|ref|NP_960551.1| hypothetical protein MAP1617 [Mycob... 110 5e-23 gi|254822530|ref|ZP_05227531.1| hypothetical protein MintA_21554... 110 5e-23 gi|342859542|ref|ZP_08716195.1| hypothetical protein MCOL_11703 ... 107 5e-22 gi|118616861|ref|YP_905193.1| hypothetical protein MUL_1142 [Myc... 106 9e-22 gi|296164914|ref|ZP_06847470.1| conserved hypothetical protein [... 103 8e-21 gi|15828094|ref|NP_302357.1| hypothetical protein ML2022 [Mycoba... 98.2 3e-19 gi|4582350|emb|CAB40293.1| hypothetical protein MLCB561.11 [Myco... 96.7 9e-19 gi|333991121|ref|YP_004523735.1| hypothetical protein JDM601_248... 94.0 7e-18 gi|333921982|ref|YP_004495563.1| hypothetical protein AS9A_4330 ... 50.1 1e-04 gi|296140165|ref|YP_003647408.1| hypothetical protein Tpau_2464 ... 42.7 0.018 gi|256423262|ref|YP_003123915.1| amidase [Chitinophaga pinensis ... 39.3 0.17 gi|298706849|emb|CBJ25813.1| Glycosyltransferase, family GT49 [E... 37.7 0.62 gi|299136689|ref|ZP_07029872.1| Amidase [Acidobacterium sp. MP5A... 36.2 1.6 gi|91790145|ref|YP_551097.1| heavy metal translocating P-type AT... 34.7 5.0 gi|28211049|ref|NP_781993.1| ferric enterobactin transport ATP-b... 34.7 5.0 >gi|15609030|ref|NP_216409.1| hypothetical protein Rv1893 [Mycobacterium tuberculosis H37Rv] gi|15841363|ref|NP_336400.1| hypothetical protein MT1943 [Mycobacterium tuberculosis CDC1551] gi|31793085|ref|NP_855578.1| hypothetical protein Mb1926 [Mycobacterium bovis AF2122/97] 78 more sequence titlesLength=72 Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 72/72 (100%), Positives = 72/72 (100%), Gaps = 0/72 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT Sbjct 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 Query 61 KEVFTGKTDDEG 72 KEVFTGKTDDEG Sbjct 61 KEVFTGKTDDEG 72 >gi|240170540|ref|ZP_04749199.1| hypothetical protein MkanA1_14600 [Mycobacterium kansasii ATCC 12478] Length=72 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 60/70 (86%), Positives = 63/70 (90%), Gaps = 0/70 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+ NPKDAVDA +DIA NAVEKASDIVENAG IIRGDIAGGASGIV+ SIDIATHAVDR Sbjct 1 MAINPKDAVDAAKDIATNAVEKASDIVENAGDIIRGDIAGGASGIVQSSIDIATHAVDRV 60 Query 61 KEVFTGKTDD 70 KEVFTGK DD Sbjct 61 KEVFTGKDDD 70 >gi|41407715|ref|NP_960551.1| hypothetical protein MAP1617 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118464696|ref|YP_881995.1| hypothetical protein MAV_2808 [Mycobacterium avium 104] gi|254775288|ref|ZP_05216804.1| hypothetical protein MaviaA2_11541 [Mycobacterium avium subsp. avium ATCC 25291] gi|41396068|gb|AAS03934.1| hypothetical protein MAP_1617 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118165983|gb|ABK66880.1| conserved hypothetical protein [Mycobacterium avium 104] gi|336457554|gb|EGO36560.1| hypothetical protein MAPs_22260 [Mycobacterium avium subsp. paratuberculosis S397] Length=76 Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 57/69 (83%), Positives = 63/69 (92%), Gaps = 0/69 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+FNPKDAVDA RDIA NAVEKASDIVE+A IIRGDIAGGASGIV++SIDI T+AVDRT Sbjct 1 MAFNPKDAVDAARDIATNAVEKASDIVEHASDIIRGDIAGGASGIVQNSIDIGTYAVDRT 60 Query 61 KEVFTGKTD 69 KEVFTG+ D Sbjct 61 KEVFTGRHD 69 >gi|254822530|ref|ZP_05227531.1| hypothetical protein MintA_21554 [Mycobacterium intracellulare ATCC 13950] Length=73 Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 55/69 (80%), Positives = 63/69 (92%), Gaps = 0/69 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+FNPKDAVDA RDIA NAVEKASDIVE+A IIRGD+AGGASGIV++SIDI T+A+DRT Sbjct 1 MAFNPKDAVDAARDIATNAVEKASDIVEHASDIIRGDLAGGASGIVQNSIDIGTYALDRT 60 Query 61 KEVFTGKTD 69 KEVFTG+ D Sbjct 61 KEVFTGRQD 69 >gi|342859542|ref|ZP_08716195.1| hypothetical protein MCOL_11703 [Mycobacterium colombiense CECT 3035] gi|342132674|gb|EGT85894.1| hypothetical protein MCOL_11703 [Mycobacterium colombiense CECT 3035] Length=73 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 54/67 (81%), Positives = 61/67 (92%), Gaps = 0/67 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+FNPKDAVDA RDIA NAVEKASDIVE+A IIRGD+AGGASGIV +SIDI T+A+DRT Sbjct 1 MAFNPKDAVDAARDIATNAVEKASDIVEHASDIIRGDLAGGASGIVANSIDIGTYALDRT 60 Query 61 KEVFTGK 67 KEVFTG+ Sbjct 61 KEVFTGR 67 >gi|118616861|ref|YP_905193.1| hypothetical protein MUL_1142 [Mycobacterium ulcerans Agy99] gi|183982792|ref|YP_001851083.1| hypothetical protein MMAR_2786 [Mycobacterium marinum M] gi|118568971|gb|ABL03722.1| conserved secreted protein [Mycobacterium ulcerans Agy99] gi|183176118|gb|ACC41228.1| conserved secreted protein [Mycobacterium marinum M] Length=72 Score = 106 bits (265), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 54/70 (78%), Positives = 61/70 (88%), Gaps = 0/70 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+ NPKDAV+A RDIA NAVEKASDIVENA IIRGD++GGASGIV++SIDIATHAVD+ Sbjct 1 MAINPKDAVNAARDIATNAVEKASDIVENASDIIRGDVSGGASGIVQNSIDIATHAVDKV 60 Query 61 KEVFTGKTDD 70 KEV GK DD Sbjct 61 KEVLIGKDDD 70 >gi|296164914|ref|ZP_06847470.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295899756|gb|EFG79206.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=74 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 54/73 (74%), Positives = 63/73 (87%), Gaps = 2/73 (2%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M F PKDAVDA RDIA NAVEKASDIVE+A IIRGDI+GGASGIV++SIDI T+AVDRT Sbjct 1 MGFKPKDAVDAARDIATNAVEKASDIVEHASDIIRGDISGGASGIVQNSIDIGTYAVDRT 60 Query 61 KEVFTGK--TDDE 71 +E+F+G+ DDE Sbjct 61 REMFSGRDELDDE 73 >gi|15828094|ref|NP_302357.1| hypothetical protein ML2022 [Mycobacterium leprae TN] gi|221230571|ref|YP_002503987.1| hypothetical protein MLBr_02022 [Mycobacterium leprae Br4923] gi|13093648|emb|CAC30977.1| conserved hypothetical protein [Mycobacterium leprae] gi|219933678|emb|CAR72119.1| conserved hypothetical protein [Mycobacterium leprae Br4923] Length=174 Score = 98.2 bits (243), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 48/67 (72%), Positives = 58/67 (87%), Gaps = 0/67 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+F+PK+AV+A RDIA N VEKASDIVEN II+GDIAGGA+ IV++SIDIATHAVD+ Sbjct 101 MTFDPKNAVNAARDIATNFVEKASDIVENVSDIIKGDIAGGANDIVQNSIDIATHAVDKA 160 Query 61 KEVFTGK 67 KE+F GK Sbjct 161 KEIFIGK 167 >gi|4582350|emb|CAB40293.1| hypothetical protein MLCB561.11 [Mycobacterium leprae] Length=74 Score = 96.7 bits (239), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 48/67 (72%), Positives = 58/67 (87%), Gaps = 0/67 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+F+PK+AV+A RDIA N VEKASDIVEN II+GDIAGGA+ IV++SIDIATHAVD+ Sbjct 1 MTFDPKNAVNAARDIATNFVEKASDIVENVSDIIKGDIAGGANDIVQNSIDIATHAVDKA 60 Query 61 KEVFTGK 67 KE+F GK Sbjct 61 KEIFIGK 67 >gi|333991121|ref|YP_004523735.1| hypothetical protein JDM601_2481 [Mycobacterium sp. JDM601] gi|333487089|gb|AEF36481.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=76 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 46/67 (69%), Positives = 58/67 (87%), Gaps = 0/67 (0%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 M+ PKDA +A RDIA +AVEKASDIVE+AG IIRGD++GG + IV+DS++IATHAV+RT Sbjct 1 MAITPKDAFNAARDIATHAVEKASDIVEDAGDIIRGDVSGGVNAIVQDSMEIATHAVERT 60 Query 61 KEVFTGK 67 KEVFT + Sbjct 61 KEVFTDR 67 >gi|333921982|ref|YP_004495563.1| hypothetical protein AS9A_4330 [Amycolicicoccus subflavus DQS3-9A1] gi|333484203|gb|AEF42763.1| hypothetical protein AS9A_4330 [Amycolicicoccus subflavus DQS3-9A1] Length=92 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 23/61 (38%), Positives = 40/61 (66%), Gaps = 0/61 (0%) Query 5 PKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRTKEVF 64 P + VDA++ IA +++K+ +IVE A ++G+++GG I+K S DIAT+A + + Sbjct 11 PTNTVDAMKHIAQTSLDKSGEIVEGAADTLKGNVSGGIGKIIKSSTDIATNAAGKGTRIL 70 Query 65 T 65 T Sbjct 71 T 71 >gi|296140165|ref|YP_003647408.1| hypothetical protein Tpau_2464 [Tsukamurella paurometabola DSM 20162] gi|296028299|gb|ADG79069.1| hypothetical protein Tpau_2464 [Tsukamurella paurometabola DSM 20162] Length=87 Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust. Identities = 22/51 (44%), Positives = 32/51 (63%), Gaps = 0/51 (0%) Query 6 KDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHA 56 KD +DA ++A N+V ++ +IVE A +RGD+ GG I + DIAT A Sbjct 7 KDNIDAAVNVATNSVARSGEIVEGAAQALRGDVKGGIGKIATSATDIATTA 57 >gi|256423262|ref|YP_003123915.1| amidase [Chitinophaga pinensis DSM 2588] gi|256038170|gb|ACU61714.1| Amidase [Chitinophaga pinensis DSM 2588] Length=475 Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust. Identities = 29/72 (41%), Positives = 40/72 (56%), Gaps = 8/72 (11%) Query 2 SFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGI---VKDSIDIATHAVD 58 +FNP A++A+ IA NA+E A ENA ++RG+ G G+ VKDSID A Sbjct 43 AFNP--AINAIVTIADNAMEVAKK-AENA--VLRGNELGPLHGVPFTVKDSIDTANILTQ 97 Query 59 RTKEVFTGKTDD 70 R +F G+ D Sbjct 98 RGSPIFKGRIPD 109 >gi|298706849|emb|CBJ25813.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus] Length=1541 Score = 37.7 bits (86), Expect = 0.62, Method: Composition-based stats. Identities = 23/72 (32%), Positives = 36/72 (50%), Gaps = 6/72 (8%) Query 7 DAVDAVRDIAANAVEKASDIVENAGHII------RGDIAGGASGIVKDSIDIATHAVDRT 60 + +DA D+AA+ ++ A D+ E+ G + GD+AGGA + D++D VD Sbjct 1118 EGLDAAGDLAADGLDAAGDLAEDIGEGMGDGLEAAGDLAGGAVDAITDAVDGVADDVDMY 1177 Query 61 KEVFTGKTDDEG 72 DDEG Sbjct 1178 AGAGPRDLDDEG 1189 >gi|299136689|ref|ZP_07029872.1| Amidase [Acidobacterium sp. MP5ACTX8] gi|298601204|gb|EFI57359.1| Amidase [Acidobacterium sp. MP5ACTX8] Length=469 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 27/72 (38%), Positives = 37/72 (52%), Gaps = 8/72 (11%) Query 2 SFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGI---VKDSIDIATHAVD 58 + NPK V+A+ IA A+E A E ++RGD G G+ VKDSID A Sbjct 37 AVNPK--VNAIVTIADGALESAK---EAEAAVLRGDELGPLHGVPFTVKDSIDTANVLTQ 91 Query 59 RTKEVFTGKTDD 70 R +F G+T + Sbjct 92 RGSPIFKGRTPE 103 >gi|91790145|ref|YP_551097.1| heavy metal translocating P-type ATPase [Polaromonas sp. JS666] gi|91699370|gb|ABE46199.1| Heavy metal translocating P-type ATPase [Polaromonas sp. JS666] Length=760 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 18/36 (50%), Positives = 23/36 (64%), Gaps = 0/36 (0%) Query 11 AVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIV 46 A+ A AV A +I E AGH ++G IAGGA+G V Sbjct 508 AIVQAVAGAVPGAENITEIAGHGLQGQIAGGAAGRV 543 >gi|28211049|ref|NP_781993.1| ferric enterobactin transport ATP-binding protein fepC [Clostridium tetani E88] gi|28203488|gb|AAO35930.1| ferric enterobactin transport ATP-binding protein fepC [Clostridium tetani E88] Length=397 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 20/72 (28%), Positives = 45/72 (63%), Gaps = 5/72 (6%) Query 1 MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRT 60 +S ++A ++++DI ++KA +++N +I D A I KD++ + ++A+++ Sbjct 304 LSVLKENAFESIKDIN---IKKAKSMIDNVSFVI--DSAYKIGEINKDNVQLISYALEQG 358 Query 61 KEVFTGKTDDEG 72 K+VF+ + +DEG Sbjct 359 KKVFSFRKEDEG 370 Lambda K H 0.312 0.131 0.357 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127392703088 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40