BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2662 Length=90 Score E Sequences producing significant alignments: (Bits) Value gi|15609799|ref|NP_217178.1| hypothetical protein Rv2662 [Mycoba... 180 5e-44 gi|296164681|ref|ZP_06847247.1| conserved hypothetical protein [... 42.0 0.031 >gi|15609799|ref|NP_217178.1| hypothetical protein Rv2662 [Mycobacterium tuberculosis H37Rv] gi|31793833|ref|NP_856326.1| hypothetical protein Mb2680 [Mycobacterium bovis AF2122/97] gi|121638536|ref|YP_978760.1| hypothetical protein BCG_2674 [Mycobacterium bovis BCG str. Pasteur 1173P2] 39 more sequence titlesLength=90 Score = 180 bits (457), Expect = 5e-44, Method: Compositional matrix adjust. Identities = 90/90 (100%), Positives = 90/90 (100%), Gaps = 0/90 (0%) Query 1 MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTASPPQPVSPGGPRLRLVR 60 MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTASPPQPVSPGGPRLRLVR Sbjct 1 MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTASPPQPVSPGGPRLRLVR 60 Query 61 LTTNPSARAAPIGNGGDSSVCAGEKQCRPP 90 LTTNPSARAAPIGNGGDSSVCAGEKQCRPP Sbjct 61 LTTNPSARAAPIGNGGDSSVCAGEKQCRPP 90 >gi|296164681|ref|ZP_06847247.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295899989|gb|EFG79429.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=99 Score = 42.0 bits (97), Expect = 0.031, Method: Compositional matrix adjust. Identities = 18/33 (55%), Positives = 23/33 (70%), Gaps = 0/33 (0%) Query 4 LTRLRRELLDRFDVRDFTDWPPASLRALIATYD 36 + LR E++DR D + F DW PA LRALIA +D Sbjct 49 IEYLRAEVIDRLDAQPFEDWSPALLRALIAVFD 81 Lambda K H 0.319 0.138 0.446 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 129182109240 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40