BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2767c Length=117 Score E Sequences producing significant alignments: (Bits) Value gi|15609904|ref|NP_217283.1| hypothetical protein Rv2767c [Mycob... 233 9e-60 gi|121638646|ref|YP_978870.1| hypothetical protein BCG_2784c [My... 230 5e-59 gi|340627769|ref|YP_004746221.1| hypothetical protein MCAN_27951... 203 7e-51 gi|322703130|gb|EFY94744.1| nonribosomal peptide synthase [Metar... 35.0 3.6 gi|333382852|ref|ZP_08474517.1| hypothetical protein HMPREF9455_... 34.7 4.5 >gi|15609904|ref|NP_217283.1| hypothetical protein Rv2767c [Mycobacterium tuberculosis H37Rv] gi|31793942|ref|NP_856435.1| hypothetical protein Mb2789c [Mycobacterium bovis AF2122/97] gi|148662609|ref|YP_001284132.1| hypothetical protein MRA_2792 [Mycobacterium tuberculosis H37Ra] 33 more sequence titlesLength=117 Score = 233 bits (593), Expect = 9e-60, Method: Compositional matrix adjust. Identities = 117/117 (100%), Positives = 117/117 (100%), Gaps = 0/117 (0%) Query 1 MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA 60 MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA Sbjct 1 MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA 60 Query 61 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 117 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK Sbjct 61 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 117 >gi|121638646|ref|YP_978870.1| hypothetical protein BCG_2784c [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224991138|ref|YP_002645827.1| hypothetical protein JTY_2778 [Mycobacterium bovis BCG str. Tokyo 172] gi|121494294|emb|CAL72772.1| Possible membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224774253|dbj|BAH27059.1| hypothetical protein JTY_2778 [Mycobacterium bovis BCG str. Tokyo 172] gi|341602684|emb|CCC65360.1| possible membrane protein [Mycobacterium bovis BCG str. Moreau RDJ] Length=117 Score = 230 bits (587), Expect = 5e-59, Method: Compositional matrix adjust. Identities = 116/117 (99%), Positives = 116/117 (99%), Gaps = 0/117 (0%) Query 1 MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA 60 MVGYEGARGRAGREMSESATAGAR SRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA Sbjct 1 MVGYEGARGRAGREMSESATAGARLSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVA 60 Query 61 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 117 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK Sbjct 61 QVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 117 >gi|340627769|ref|YP_004746221.1| hypothetical protein MCAN_27951 [Mycobacterium canettii CIPT 140010059] gi|340005959|emb|CCC45126.1| putative membrane protein [Mycobacterium canettii CIPT 140010059] Length=103 Score = 203 bits (516), Expect = 7e-51, Method: Compositional matrix adjust. Identities = 102/103 (99%), Positives = 102/103 (99%), Gaps = 0/103 (0%) Query 15 MSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVAQVWREVVQATAIAI 74 MSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVAQVWREVVQA AIAI Sbjct 1 MSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVAQVWREVVQAKAIAI 60 Query 75 APPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 117 APPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK Sbjct 61 APPLPVVSWGLISLAFLSHTVRGRYRRSPPAESGHHSNRRQAK 103 >gi|322703130|gb|EFY94744.1| nonribosomal peptide synthase [Metarhizium anisopliae ARSEF 23] Length=10277 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. Identities = 26/93 (28%), Positives = 38/93 (41%), Gaps = 10/93 (10%) Query 4 YEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHARDTPQMVAVAQVW 63 Y+G E A + S +PF P +NH D Q + VA W Sbjct 1956 YDGWSISLMLNKLEDAYSLNEGSALPFS---------PPLQTFVNHIMDIDQSI-VATYW 2005 Query 64 REVVQATAIAIAPPLPVVSWGLISLAFLSHTVR 96 E Q + I P LP V++ +S AF+ H ++ Sbjct 2006 GEQFQGSEAQIFPSLPSVTYQPMSNAFIKHRIQ 2038 >gi|333382852|ref|ZP_08474517.1| hypothetical protein HMPREF9455_02683 [Dysgonomonas gadei ATCC BAA-286] gi|332828182|gb|EGK00894.1| hypothetical protein HMPREF9455_02683 [Dysgonomonas gadei ATCC BAA-286] Length=551 Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 19/65 (30%), Positives = 30/65 (47%), Gaps = 0/65 (0%) Query 33 IRNHEAVRPRRSRHLNHARDTPQMVAVAQVWREVVQATAIAIAPPLPVVSWGLISLAFLS 92 + N+E + R LN R+T + VA + +AI + +P +WG+ L FL Sbjct 470 VNNNELEITHKLRGLNDIRNTISHLVVAMIISASTIGSAILVLADMPPTAWGVSILGFLG 529 Query 93 HTVRG 97 V G Sbjct 530 FVVSG 534 Lambda K H 0.320 0.131 0.393 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130038700308 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40