BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2283 Length=64 Score E Sequences producing significant alignments: (Bits) Value gi|15609420|ref|NP_216799.1| hypothetical protein Rv2283 [Mycoba... 130 5e-29 gi|7648573|gb|AAF65589.1|AF139916_10 hypothetical protein [Brevi... 36.2 1.9 >gi|15609420|ref|NP_216799.1| hypothetical protein Rv2283 [Mycobacterium tuberculosis H37Rv] gi|31793460|ref|NP_855953.1| hypothetical protein Mb2304 [Mycobacterium bovis AF2122/97] gi|121638163|ref|YP_978387.1| hypothetical protein BCG_2298 [Mycobacterium bovis BCG str. Pasteur 1173P2] 30 more sequence titlesLength=64 Score = 130 bits (328), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 64/64 (100%), Positives = 64/64 (100%), Gaps = 0/64 (0%) Query 1 MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHAAGPLGSTSRAGHRSYG 60 MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHAAGPLGSTSRAGHRSYG Sbjct 1 MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHAAGPLGSTSRAGHRSYG 60 Query 61 GVAS 64 GVAS Sbjct 61 GVAS 64 >gi|7648573|gb|AAF65589.1|AF139916_10 hypothetical protein [Brevibacterium linens] Length=409 Score = 36.2 bits (82), Expect = 1.9, Method: Compositional matrix adjust. Identities = 19/51 (38%), Positives = 29/51 (57%), Gaps = 1/51 (1%) Query 3 EKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHAAGPLGSTSR 53 E + SV+ A +G+TD+ P A +RL R+PP++ AA +G T R Sbjct 118 EAALNYSVEVTAEIVGLTDS-PIDAMAKRLVGFFRQPPVDLAAPAMGRTRR 167 Lambda K H 0.313 0.128 0.384 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130804070320 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40