BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3612c Length=109 Score E Sequences producing significant alignments: (Bits) Value gi|15610748|ref|NP_218129.1| hypothetical protein Rv3612c [Mycob... 223 6e-57 gi|339633611|ref|YP_004725253.1| hypothetical protein MAF_36250 ... 192 2e-47 gi|119173335|ref|XP_001239137.1| predicted protein [Coccidioides... 35.4 2.5 >gi|15610748|ref|NP_218129.1| hypothetical protein Rv3612c [Mycobacterium tuberculosis H37Rv] gi|15843223|ref|NP_338260.1| hypothetical protein MT3715 [Mycobacterium tuberculosis CDC1551] gi|31794788|ref|NP_857281.1| hypothetical protein Mb3642c [Mycobacterium bovis AF2122/97] 38 more sequence titlesLength=109 Score = 223 bits (569), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 109/109 (100%), Positives = 109/109 (100%), Gaps = 0/109 (0%) Query 1 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP 60 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP Sbjct 1 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP 60 Query 61 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLASGDPLHPALG 109 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLASGDPLHPALG Sbjct 61 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLASGDPLHPALG 109 >gi|339633611|ref|YP_004725253.1| hypothetical protein MAF_36250 [Mycobacterium africanum GM041182] gi|339332967|emb|CCC28694.1| conserved hypothetical protein [Mycobacterium africanum GM041182] Length=93 Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 93/93 (100%), Positives = 93/93 (100%), Gaps = 0/93 (0%) Query 1 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP 60 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP Sbjct 1 MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGAVTHATGAMCP 60 Query 61 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRREL 93 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRREL Sbjct 61 TLGAHQFEPNQVRCTACLTRTLSCRIFRRRREL 93 >gi|119173335|ref|XP_001239137.1| predicted protein [Coccidioides immitis RS] Length=265 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 16/56 (29%), Positives = 25/56 (45%), Gaps = 0/56 (0%) Query 38 VAESWADRVSPGAVTHATGAMCPTLGAHQFEPNQVRCTACLTRTLSCRIFRRRREL 93 V SW D S + + P++ H FE N V C ++R + C + RR+ Sbjct 142 VVPSWVDSKSVLLRVRGSAGVLPSMMRHPFEKNLVNCKGLVSRVIRCLLLSRRKRF 197 Lambda K H 0.323 0.133 0.424 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 129509500864 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40