BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0959A Length=73 Score E Sequences producing significant alignments: (Bits) Value gi|15840384|ref|NP_335421.1| hypothetical protein MT0987 [Mycoba... 141 3e-32 gi|87300650|ref|ZP_01083492.1| hypothetical protein WH5701_04360... 68.2 4e-10 gi|87124165|ref|ZP_01080015.1| hypothetical protein RS9917_11156... 68.2 4e-10 >gi|15840384|ref|NP_335421.1| hypothetical protein MT0987 [Mycobacterium tuberculosis CDC1551] gi|121636885|ref|YP_977108.1| antitoxin [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|148660739|ref|YP_001282262.1| hypothetical protein MRA_0967 [Mycobacterium tuberculosis H37Ra] 58 more sequence titlesLength=73 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 73/73 (100%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 1 MKTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLGDLPDIGIDTT 60 MKTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLGDLPDIGIDTT Sbjct 1 MKTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLGDLPDIGIDTT 60 Query 61 ELIGGIDAERAGR 73 ELIGGIDAERAGR Sbjct 61 ELIGGIDAERAGR 73 >gi|87300650|ref|ZP_01083492.1| hypothetical protein WH5701_04360 [Synechococcus sp. WH 5701] gi|87284521|gb|EAQ76473.1| hypothetical protein WH5701_04360 [Synechococcus sp. WH 5701] Length=74 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 37/72 (52%), Positives = 47/72 (66%), Gaps = 0/72 (0%) Query 2 KTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLGDLPDIGIDTTE 61 +TLY+R+VPDDV ERLERLA A +S A+REL+E +RRADN LL LP ID Sbjct 3 RTLYIRHVPDDVAERLERLANRAGLPLSTFALRELSETARRADNADLLNALPSAPIDHGR 62 Query 62 LIGGIDAERAGR 73 ++ + RA R Sbjct 63 ILEALRQIRAER 74 >gi|87124165|ref|ZP_01080015.1| hypothetical protein RS9917_11156 [Synechococcus sp. RS9917] gi|86168734|gb|EAQ69991.1| hypothetical protein RS9917_11156 [Synechococcus sp. RS9917] Length=74 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 34/70 (49%), Positives = 48/70 (69%), Gaps = 0/70 (0%) Query 2 KTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLGDLPDIGIDTTE 61 +TLY+R+VPDDV ERLERLA A +S A++EL+E +RRADN LL LP +++ E Sbjct 3 RTLYIRHVPDDVAERLERLASRAGLPLSTFALQELSETARRADNAELLQALPSAQLESGE 62 Query 62 LIGGIDAERA 71 ++ + RA Sbjct 63 ILEALRQSRA 72 Lambda K H 0.315 0.134 0.358 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131942442484 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40