BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2929 Length=103 Score E Sequences producing significant alignments: (Bits) Value gi|15610066|ref|NP_217445.1| hypothetical protein Rv2929 [Mycoba... 210 7e-53 gi|340627920|ref|YP_004746372.1| hypothetical protein MCAN_29511... 208 2e-52 gi|302828566|ref|XP_002945850.1| hypothetical protein VOLCADRAFT... 34.7 4.3 >gi|15610066|ref|NP_217445.1| hypothetical protein Rv2929 [Mycobacterium tuberculosis H37Rv] gi|15842475|ref|NP_337512.1| hypothetical protein MT2998.1 [Mycobacterium tuberculosis CDC1551] gi|31794106|ref|NP_856599.1| hypothetical protein Mb2954 [Mycobacterium bovis AF2122/97] 33 more sequence titlesLength=103 Score = 210 bits (534), Expect = 7e-53, Method: Compositional matrix adjust. Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%) Query 1 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASDTHGGGGGRPV 60 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASDTHGGGGGRPV Sbjct 1 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASDTHGGGGGRPV 60 Query 61 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT 103 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT Sbjct 61 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT 103 >gi|340627920|ref|YP_004746372.1| hypothetical protein MCAN_29511 [Mycobacterium canettii CIPT 140010059] gi|340006110|emb|CCC45282.1| hypothetical protein MCAN_29511 [Mycobacterium canettii CIPT 140010059] Length=103 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 102/103 (99%), Positives = 103/103 (100%), Gaps = 0/103 (0%) Query 1 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASDTHGGGGGRPV 60 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFA+DSPYVGTGLERMASDTHGGGGGRPV Sbjct 1 MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFADDSPYVGTGLERMASDTHGGGGGRPV 60 Query 61 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT 103 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT Sbjct 61 TPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVLT 103 >gi|302828566|ref|XP_002945850.1| hypothetical protein VOLCADRAFT_55637 [Volvox carteri f. nagariensis] gi|300268665|gb|EFJ52845.1| hypothetical protein VOLCADRAFT_55637 [Volvox carteri f. nagariensis] Length=327 Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust. Identities = 20/49 (41%), Positives = 24/49 (49%), Gaps = 1/49 (2%) Query 34 AEDSPYVGTGLERMASDTHGGGG-GRPVTPPPPGMHHLGCSRGVLLISS 81 A S GT +ER+A HG G G PPP G H + CS L+ S Sbjct 76 AARSLKAGTDIERIADFVHGADGFGDIGLPPPQGQHLMDCSAAEFLVRS 124 Lambda K H 0.315 0.135 0.440 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127822873252 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40