BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2142A Length=71 Score E Sequences producing significant alignments: (Bits) Value gi|15841634|ref|NP_336671.1| hypothetical protein MT2201 [Mycoba... 139 1e-31 gi|167967855|ref|ZP_02550132.1| hypothetical protein MtubH3_0740... 137 4e-31 >gi|15841634|ref|NP_336671.1| hypothetical protein MT2201 [Mycobacterium tuberculosis CDC1551] gi|148661958|ref|YP_001283481.1| hypothetical protein MRA_2157 [Mycobacterium tuberculosis H37Ra] gi|253798793|ref|YP_003031794.1| toxin [Mycobacterium tuberculosis KZN 1435] 51 more sequence titlesLength=71 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 70/71 (99%), Positives = 71/71 (100%), Gaps = 0/71 (0%) Query 1 VVVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTID 60 +VVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTID Sbjct 1 MVVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTID 60 Query 61 DFDKRIRARLG 71 DFDKRIRARLG Sbjct 61 DFDKRIRARLG 71 >gi|167967855|ref|ZP_02550132.1| hypothetical protein MtubH3_07406 [Mycobacterium tuberculosis H37Ra] gi|254551179|ref|ZP_05141626.1| toxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] Length=70 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 69/70 (99%), Positives = 70/70 (100%), Gaps = 0/70 (0%) Query 2 VVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTIDD 61 +VNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTIDD Sbjct 1 MVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTIDD 60 Query 62 FDKRIRARLG 71 FDKRIRARLG Sbjct 61 FDKRIRARLG 70 Lambda K H 0.316 0.131 0.363 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127819123992 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40