BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3851 Length=94 Score E Sequences producing significant alignments: (Bits) Value gi|15610987|ref|NP_218368.1| hypothetical protein Rv3851 [Mycoba... 178 3e-43 gi|254548857|ref|ZP_05139304.1| hypothetical protein Mtube_00050... 169 2e-40 >gi|15610987|ref|NP_218368.1| hypothetical protein Rv3851 [Mycobacterium tuberculosis H37Rv] gi|15843482|ref|NP_338519.1| hypothetical protein MT3966 [Mycobacterium tuberculosis CDC1551] gi|31795025|ref|NP_857518.1| hypothetical protein Mb3881 [Mycobacterium bovis AF2122/97] 58 more sequence titlesLength=94 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 94/94 (100%), Positives = 94/94 (100%), Gaps = 0/94 (0%) Query 1 MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPA 60 MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPA Sbjct 1 MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPA 60 Query 61 VAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR 94 VAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR Sbjct 61 VAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR 94 >gi|254548857|ref|ZP_05139304.1| hypothetical protein Mtube_00050 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|297733532|ref|ZP_06962650.1| hypothetical protein MtubKR_20675 [Mycobacterium tuberculosis KZN R506] Length=89 Score = 169 bits (427), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 89/89 (100%), Positives = 89/89 (100%), Gaps = 0/89 (0%) Query 6 MSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPAVAATD 65 MSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPAVAATD Sbjct 1 MSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTAQLSTATPAPAVAATD 60 Query 66 ANDVPTWPFVVGTVAAVAVAALWAVRRGR 94 ANDVPTWPFVVGTVAAVAVAALWAVRRGR Sbjct 61 ANDVPTWPFVVGTVAAVAVAALWAVRRGR 89 Lambda K H 0.322 0.130 0.395 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127354591080 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40