BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2304c Length=69 Score E Sequences producing significant alignments: (Bits) Value gi|15609441|ref|NP_216820.1| hypothetical protein Rv2304c [Mycob... 140 4e-32 gi|294993109|ref|ZP_06798800.1| hypothetical protein Mtub2_01016... 80.1 9e-14 >gi|15609441|ref|NP_216820.1| hypothetical protein Rv2304c [Mycobacterium tuberculosis H37Rv] gi|15841795|ref|NP_336832.1| hypothetical protein MT2361 [Mycobacterium tuberculosis CDC1551] gi|31793482|ref|NP_855975.1| hypothetical protein Mb2326c [Mycobacterium bovis AF2122/97] 38 more sequence titlesLength=69 Score = 140 bits (354), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 69/69 (100%), Positives = 69/69 (100%), Gaps = 0/69 (0%) Query 1 MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFEQAFAVERDAGFDDFLH 60 MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFEQAFAVERDAGFDDFLH Sbjct 1 MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFEQAFAVERDAGFDDFLH 60 Query 61 GPVGPRSTP 69 GPVGPRSTP Sbjct 61 GPVGPRSTP 69 >gi|294993109|ref|ZP_06798800.1| hypothetical protein Mtub2_01016 [Mycobacterium tuberculosis 210] Length=38 Score = 80.1 bits (196), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 38/38 (100%), Positives = 38/38 (100%), Gaps = 0/38 (0%) Query 32 EATCTGRPATTFEQAFAVERDAGFDDFLHGPVGPRSTP 69 EATCTGRPATTFEQAFAVERDAGFDDFLHGPVGPRSTP Sbjct 1 EATCTGRPATTFEQAFAVERDAGFDDFLHGPVGPRSTP 38 Lambda K H 0.319 0.136 0.421 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128671965800 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40