BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2231B Length=58 Score E Sequences producing significant alignments: (Bits) Value gi|167969367|ref|ZP_02551644.1| hypothetical protein MtubH3_1560... 115 2e-24 gi|294993621|ref|ZP_06799312.1| antitoxin [Mycobacterium tubercu... 102 2e-20 gi|336178544|ref|YP_004583919.1| prevent-host-death family prote... 39.7 0.16 >gi|167969367|ref|ZP_02551644.1| hypothetical protein MtubH3_15605 [Mycobacterium tuberculosis H37Ra] gi|253798701|ref|YP_003031702.1| antitoxin [Mycobacterium tuberculosis KZN 1435] gi|254551275|ref|ZP_05141722.1| antitoxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] 23 more sequence titlesLength=58 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 57/58 (99%), Positives = 58/58 (100%), Gaps = 0/58 (0%) Query 1 LALWYQAMIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR 58 +ALWYQAMIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR Sbjct 1 MALWYQAMIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR 58 >gi|294993621|ref|ZP_06799312.1| antitoxin [Mycobacterium tuberculosis 210] gi|313659151|ref|ZP_07816031.1| antitoxin [Mycobacterium tuberculosis KZN V2475] Length=51 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 51/51 (100%), Positives = 51/51 (100%), Gaps = 0/51 (0%) Query 8 MIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR 58 MIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR Sbjct 1 MIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLVYGGQRLRLPAAASR 51 >gi|336178544|ref|YP_004583919.1| prevent-host-death family protein [Frankia symbiont of Datisca glomerata] gi|334859524|gb|AEH09998.1| prevent-host-death family protein [Frankia symbiont of Datisca glomerata] Length=80 Score = 39.7 bits (91), Expect = 0.16, Method: Compositional matrix adjust. Identities = 19/28 (68%), Positives = 22/28 (79%), Gaps = 0/28 (0%) Query 25 AKRVGVHEAKTRLSELLRLVYGGQRLRL 52 A +VGVHEAKTRLSELLR V GQ + + Sbjct 5 AIQVGVHEAKTRLSELLRAVAAGQEVDI 32 Lambda K H 0.323 0.134 0.403 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128188454310 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40