BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1545 Length=75 Score E Sequences producing significant alignments: (Bits) Value gi|15608683|ref|NP_216061.1| hypothetical protein Rv1545 [Mycoba... 152 2e-35 gi|167968100|ref|ZP_02550377.1| hypothetical protein MtubH3_0870... 143 8e-33 gi|289442999|ref|ZP_06432743.1| conserved hypothetical protein [... 94.4 5e-18 gi|189218335|ref|YP_001938977.1| adenine-specific DNA methylase ... 35.4 3.1 >gi|15608683|ref|NP_216061.1| hypothetical protein Rv1545 [Mycobacterium tuberculosis H37Rv] gi|15841012|ref|NP_336049.1| hypothetical protein MT1596.1 [Mycobacterium tuberculosis CDC1551] gi|31792731|ref|NP_855224.1| hypothetical protein Mb1572 [Mycobacterium bovis AF2122/97] 48 more sequence titlesLength=75 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 74/75 (99%), Positives = 75/75 (100%), Gaps = 0/75 (0%) Query 1 VPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDE 60 +PNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDE Sbjct 1 MPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDE 60 Query 61 CGVCDKTAGLDGAAP 75 CGVCDKTAGLDGAAP Sbjct 61 CGVCDKTAGLDGAAP 75 >gi|167968100|ref|ZP_02550377.1| hypothetical protein MtubH3_08705 [Mycobacterium tuberculosis H37Ra] Length=71 Score = 143 bits (361), Expect = 8e-33, Method: Compositional matrix adjust. Identities = 70/71 (99%), Positives = 71/71 (100%), Gaps = 0/71 (0%) Query 5 VLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDECGVC 64 +LGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDECGVC Sbjct 1 MLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVGITTILAGRDECGVC 60 Query 65 DKTAGLDGAAP 75 DKTAGLDGAAP Sbjct 61 DKTAGLDGAAP 71 >gi|289442999|ref|ZP_06432743.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289415918|gb|EFD13158.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] Length=54 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 45/46 (98%), Positives = 46/46 (100%), Gaps = 0/46 (0%) Query 1 VPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACR 46 VPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVAC+ Sbjct 2 VPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACQ 47 >gi|189218335|ref|YP_001938977.1| adenine-specific DNA methylase containing a Zn-ribbon [Methylacidiphilum infernorum V4] gi|189185193|gb|ACD82378.1| Adenine-specific DNA methylase containing a Zn-ribbon [Methylacidiphilum infernorum V4] Length=731 Score = 35.4 bits (80), Expect = 3.1, Method: Composition-based stats. Identities = 18/54 (34%), Positives = 27/54 (50%), Gaps = 2/54 (3%) Query 19 GLQLAHESQCCQMHNLPSAARQVTVAC--REEVGITTILAGRDECGVCDKTAGL 70 GLQ A +C ++H LP +++ C R + T+ G+ C C TAGL Sbjct 215 GLQWAFCKECHEIHELPIERKEIRCRCGTRTRIAQGTLNRGKVRCPACGDTAGL 268 Lambda K H 0.319 0.135 0.418 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131059142040 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40