BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0378 Length=73 Score E Sequences producing significant alignments: (Bits) Value gi|254230738|ref|ZP_04924065.1| conserved hypothetical glycine r... 120 5e-26 gi|308231535|ref|ZP_07412809.2| hypothetical protein TMAG_01637 ... 120 6e-26 gi|15607519|ref|NP_214892.1| glycine rich protein [Mycobacterium... 119 1e-25 gi|340625408|ref|YP_004743860.1| hypothetical protein MCAN_03791... 119 1e-25 >gi|254230738|ref|ZP_04924065.1| conserved hypothetical glycine rich protein [Mycobacterium tuberculosis C] gi|254549321|ref|ZP_05139768.1| hypothetical protein Mtube_02478 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|289748863|ref|ZP_06508241.1| conserved glycine rich protein [Mycobacterium tuberculosis T92] gi|294995135|ref|ZP_06800826.1| hypothetical protein Mtub2_11622 [Mycobacterium tuberculosis 210] gi|124599797|gb|EAY58807.1| conserved hypothetical glycine rich protein [Mycobacterium tuberculosis C] gi|289689450|gb|EFD56879.1| conserved glycine rich protein [Mycobacterium tuberculosis T92] Length=75 Score = 120 bits (302), Expect = 5e-26, Method: Compositional matrix adjust. Identities = 73/73 (100%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 1 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS Sbjct 3 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 62 Query 61 GERGAVGKPGAPG 73 GERGAVGKPGAPG Sbjct 63 GERGAVGKPGAPG 75 >gi|308231535|ref|ZP_07412809.2| hypothetical protein TMAG_01637 [Mycobacterium tuberculosis SUMu001] gi|308369378|ref|ZP_07417556.2| hypothetical protein TMBG_03609 [Mycobacterium tuberculosis SUMu002] gi|308370388|ref|ZP_07421328.2| hypothetical protein TMCG_03063 [Mycobacterium tuberculosis SUMu003] 20 more sequence titlesLength=74 Score = 120 bits (302), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 73/73 (100%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 1 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS Sbjct 2 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 61 Query 61 GERGAVGKPGAPG 73 GERGAVGKPGAPG Sbjct 62 GERGAVGKPGAPG 74 >gi|15607519|ref|NP_214892.1| glycine rich protein [Mycobacterium tuberculosis H37Rv] gi|31791555|ref|NP_854048.1| glycine rich protein [Mycobacterium bovis AF2122/97] gi|121636291|ref|YP_976514.1| hypothetical protein BCG_0416 [Mycobacterium bovis BCG str. Pasteur 1173P2] 31 more sequence titles Length=73 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 72/73 (99%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 1 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 +SGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS Sbjct 1 MSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 Query 61 GERGAVGKPGAPG 73 GERGAVGKPGAPG Sbjct 61 GERGAVGKPGAPG 73 >gi|340625408|ref|YP_004743860.1| hypothetical protein MCAN_03791 [Mycobacterium canettii CIPT 140010059] gi|340003598|emb|CCC42719.1| conserved hypothetical glycine rich protein [Mycobacterium canettii CIPT 140010059] Length=79 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 72/73 (99%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 1 VSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 +SGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS Sbjct 1 MSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSINGNAGDPGNS 60 Query 61 GERGAVGKPGAPG 73 GERGAVGKPGAPG Sbjct 61 GERGAVGKPGAPG 73 Lambda K H 0.301 0.136 0.414 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131942442484 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40