BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1831 Length=85 Score E Sequences producing significant alignments: (Bits) Value gi|15608968|ref|NP_216347.1| hypothetical protein Rv1831 [Mycoba... 174 5e-42 gi|41407642|ref|NP_960478.1| hypothetical protein MAP1544 [Mycob... 65.1 3e-09 gi|229494851|ref|ZP_04388604.1| hypothetical protein RHOER0001_6... 46.6 0.001 >gi|15608968|ref|NP_216347.1| hypothetical protein Rv1831 [Mycobacterium tuberculosis H37Rv] gi|31793021|ref|NP_855514.1| hypothetical protein Mb1862 [Mycobacterium bovis AF2122/97] gi|121637734|ref|YP_977957.1| hypothetical protein BCG_1866 [Mycobacterium bovis BCG str. Pasteur 1173P2] 22 more sequence titlesLength=85 Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 85/85 (100%), Positives = 85/85 (100%), Gaps = 0/85 (0%) Query 1 MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTARDYDASGKRWRPLAVL 60 MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTARDYDASGKRWRPLAVL Sbjct 1 MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTARDYDASGKRWRPLAVL 60 Query 61 TRRWGKAIHLTVDRVAESLRRLACR 85 TRRWGKAIHLTVDRVAESLRRLACR Sbjct 61 TRRWGKAIHLTVDRVAESLRRLACR 85 >gi|41407642|ref|NP_960478.1| hypothetical protein MAP1544 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41395995|gb|AAS03861.1| hypothetical protein MAP_1544 [Mycobacterium avium subsp. paratuberculosis K-10] Length=81 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 44/79 (56%), Positives = 48/79 (61%), Gaps = 11/79 (13%) Query 9 VDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTARDY----DASGKRWRPLAVLTRRW 64 V+W HR AGEF Q RTPKEQ+LSVNLSGTRTAR+ +G R R TRRW Sbjct 8 VNWAAHRIRAGEFRDGQSRTPKEQHLSVNLSGTRTARNMMPLESGAGPRAR-----TRRW 62 Query 65 GKAIHLTVDRVAESLRRLA 83 GKA AESLRR A Sbjct 63 GKAAATAS--AAESLRRPA 79 >gi|229494851|ref|ZP_04388604.1| hypothetical protein RHOER0001_6518 [Rhodococcus erythropolis SK121] gi|229318209|gb|EEN84077.1| hypothetical protein RHOER0001_6518 [Rhodococcus erythropolis SK121] Length=85 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/62 (49%), Positives = 36/62 (59%), Gaps = 6/62 (9%) Query 26 LRTPKEQYLSVNLSGTRTARDYDASGKRW-RPLAVLTRRW--GKAIHLTVDRVAESLRRL 82 +R PKEQYL VNLSGT+ DASGKRW P A R W A H R++++L Sbjct 4 IRAPKEQYLPVNLSGTKDRTGSDASGKRWDNPPADGERLWLPPSADHY---RISQALGNR 60 Query 83 AC 84 AC Sbjct 61 AC 62 Lambda K H 0.325 0.133 0.441 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131466506940 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40