BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1374c Length=152 Score E Sequences producing significant alignments: (Bits) Value gi|57116855|ref|NP_215890.2| hypothetical protein Rv1374c [Mycob... 311 3e-83 gi|15840832|ref|NP_335869.1| hypothetical protein MT1418.1 [Myco... 309 1e-82 gi|340626388|ref|YP_004744840.1| hypothetical protein MCAN_13901... 307 3e-82 gi|296808429|ref|XP_002844553.1| 2-isopropylmalate synthase [Art... 34.7 5.0 >gi|57116855|ref|NP_215890.2| hypothetical protein Rv1374c [Mycobacterium tuberculosis H37Rv] gi|148661165|ref|YP_001282688.1| hypothetical protein MRA_1383A [Mycobacterium tuberculosis H37Ra] gi|167968426|ref|ZP_02550703.1| hypothetical protein MtubH3_10476 [Mycobacterium tuberculosis H37Ra] gi|308399939|ref|ZP_07669396.1| hypothetical protein TMLG_03060 [Mycobacterium tuberculosis SUMu012] gi|38490261|emb|CAB02652.2| HYPOTHETICAL PROTEIN Rv1374c [Mycobacterium tuberculosis H37Rv] gi|148505317|gb|ABQ73126.1| hypothetical protein MRA_1383A [Mycobacterium tuberculosis H37Ra] gi|308366410|gb|EFP55261.1| hypothetical protein TMLG_03060 [Mycobacterium tuberculosis SUMu012] Length=152 Score = 311 bits (796), Expect = 3e-83, Method: Compositional matrix adjust. Identities = 151/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%) Query 1 VVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 +VTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI Sbjct 1 MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 Query 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF Sbjct 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 Query 121 VRHALTPRCSRTDSKTSYTQLNRICKFPPHWV 152 VRHALTPRCSRTDSKTSYTQLNRICKFPPHWV Sbjct 121 VRHALTPRCSRTDSKTSYTQLNRICKFPPHWV 152 >gi|15840832|ref|NP_335869.1| hypothetical protein MT1418.1 [Mycobacterium tuberculosis CDC1551] gi|31792568|ref|NP_855061.1| hypothetical protein Mb1409c [Mycobacterium bovis AF2122/97] gi|121637304|ref|YP_977527.1| hypothetical protein BCG_1435c [Mycobacterium bovis BCG str. Pasteur 1173P2] 41 more sequence titlesLength=152 Score = 309 bits (791), Expect = 1e-82, Method: Compositional matrix adjust. Identities = 150/152 (99%), Positives = 151/152 (99%), Gaps = 0/152 (0%) Query 1 VVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 +VTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI Sbjct 1 MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 Query 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF Sbjct 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 Query 121 VRHALTPRCSRTDSKTSYTQLNRICKFPPHWV 152 VRHALTPRCSRTDSK SYTQLNRICKFPPHWV Sbjct 121 VRHALTPRCSRTDSKASYTQLNRICKFPPHWV 152 >gi|340626388|ref|YP_004744840.1| hypothetical protein MCAN_13901 [Mycobacterium canettii CIPT 140010059] gi|340004578|emb|CCC43722.1| hypothetical protein MCAN_13901 [Mycobacterium canettii CIPT 140010059] Length=152 Score = 307 bits (787), Expect = 3e-82, Method: Compositional matrix adjust. Identities = 149/152 (99%), Positives = 150/152 (99%), Gaps = 0/152 (0%) Query 1 VVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 +VTSV DENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI Sbjct 1 MVTSVTDENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKI 60 Query 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF Sbjct 61 CRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREVGNYAQRRVGRFAFFEQTF 120 Query 121 VRHALTPRCSRTDSKTSYTQLNRICKFPPHWV 152 VRHALTPRCSRTDSK SYTQLNRICKFPPHWV Sbjct 121 VRHALTPRCSRTDSKASYTQLNRICKFPPHWV 152 >gi|296808429|ref|XP_002844553.1| 2-isopropylmalate synthase [Arthroderma otae CBS 113480] gi|238844036|gb|EEQ33698.1| 2-isopropylmalate synthase [Arthroderma otae CBS 113480] Length=656 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 22/89 (25%), Positives = 39/89 (44%), Gaps = 5/89 (5%) Query 24 DPRLDYAHAHLKGRRGRSPARPNAPIGARSFAVGRKICRVERFTLLEHGFVGHALHRVPC 83 +PR + ++ R +SPA P + + R+ + ++HG VG V Sbjct 472 NPRFNLIDYNITADRSQSPAPPTPGKAVNTQNLKRRFTGIIEIDGIQHGIVG-----VGT 526 Query 84 AGLVALVMSACSLAVCREVGNYAQRRVGR 112 + AL + SL + +V NY + +GR Sbjct 527 GAISALAHALHSLGIDLDVVNYTEHAIGR 555 Lambda K H 0.326 0.137 0.444 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128332939266 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40