BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1958c Length=204 Score E Sequences producing significant alignments: (Bits) Value gi|15609095|ref|NP_216474.1| hypothetical protein Rv1958c [Mycob... 405 1e-111 gi|340626967|ref|YP_004745419.1| hypothetical protein MCAN_19741... 404 6e-111 gi|289574641|ref|ZP_06454868.1| conserved hypothetical protein [... 402 2e-110 gi|254364775|ref|ZP_04980821.1| hypothetical protein TBHG_01912 ... 348 2e-94 gi|85375308|ref|YP_459370.1| alkanal monooxygenase alpha chain [... 37.4 1.2 >gi|15609095|ref|NP_216474.1| hypothetical protein Rv1958c [Mycobacterium tuberculosis H37Rv] gi|31793150|ref|NP_855643.1| hypothetical protein Mb1993c [Mycobacterium bovis AF2122/97] gi|121637863|ref|YP_978086.1| hypothetical protein BCG_1997c [Mycobacterium bovis BCG str. Pasteur 1173P2] 41 more sequence titlesLength=204 Score = 405 bits (1042), Expect = 1e-111, Method: Compositional matrix adjust. Identities = 204/204 (100%), Positives = 204/204 (100%), Gaps = 0/204 (0%) Query 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR 60 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR Sbjct 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR 60 Query 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR Sbjct 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 Query 121 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP Sbjct 121 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 Query 181 GETGIGRDNISRVNGGSARRPVRS 204 GETGIGRDNISRVNGGSARRPVRS Sbjct 181 GETGIGRDNISRVNGGSARRPVRS 204 >gi|340626967|ref|YP_004745419.1| hypothetical protein MCAN_19741 [Mycobacterium canettii CIPT 140010059] gi|340005157|emb|CCC44306.1| hypothetical protein MCAN_19741 [Mycobacterium canettii CIPT 140010059] Length=204 Score = 404 bits (1037), Expect = 6e-111, Method: Compositional matrix adjust. Identities = 203/204 (99%), Positives = 203/204 (99%), Gaps = 0/204 (0%) Query 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR 60 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTN RRLSMNPGGMRIRCRR Sbjct 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNSRRLSMNPGGMRIRCRR 60 Query 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR Sbjct 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 Query 121 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP Sbjct 121 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 Query 181 GETGIGRDNISRVNGGSARRPVRS 204 GETGIGRDNISRVNGGSARRPVRS Sbjct 181 GETGIGRDNISRVNGGSARRPVRS 204 >gi|289574641|ref|ZP_06454868.1| conserved hypothetical protein [Mycobacterium tuberculosis K85] gi|289539072|gb|EFD43650.1| conserved hypothetical protein [Mycobacterium tuberculosis K85] Length=204 Score = 402 bits (1033), Expect = 2e-110, Method: Compositional matrix adjust. Identities = 203/204 (99%), Positives = 203/204 (99%), Gaps = 0/204 (0%) Query 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR 60 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR Sbjct 1 MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRR 60 Query 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR Sbjct 61 GDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVFENLELRAAAGLAFGFRLR 120 Query 121 PFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 PFGGTAADSPPVAAQDLDP RWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP Sbjct 121 PFGGTAADSPPVAAQDLDPFRWADSPALHLAVGVETMVVGQLDSPSFGQGVPLVAGHWAP 180 Query 181 GETGIGRDNISRVNGGSARRPVRS 204 GETGIGRDNISRVNGGSARRPVRS Sbjct 181 GETGIGRDNISRVNGGSARRPVRS 204 >gi|254364775|ref|ZP_04980821.1| hypothetical protein TBHG_01912 [Mycobacterium tuberculosis str. Haarlem] gi|134150289|gb|EBA42334.1| hypothetical protein TBHG_01912 [Mycobacterium tuberculosis str. Haarlem] Length=176 Score = 348 bits (893), Expect = 2e-94, Method: Compositional matrix adjust. Identities = 175/176 (99%), Positives = 176/176 (100%), Gaps = 0/176 (0%) Query 29 IHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAA 88 +HPRRYLPRKHGGTNPRRLSMNPGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAA Sbjct 1 MHPRRYLPRKHGGTNPRRLSMNPGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAA 60 Query 89 NAPPSRARTASPVFENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPAL 148 NAPPSRARTASPVFENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPAL Sbjct 61 NAPPSRARTASPVFENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPAL 120 Query 149 HLAVGVETMVVGQLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARRPVRS 204 HLAVGVETMVVGQLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARRPVRS Sbjct 121 HLAVGVETMVVGQLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARRPVRS 176 >gi|85375308|ref|YP_459370.1| alkanal monooxygenase alpha chain [Erythrobacter litoralis HTCC2594] gi|84788391|gb|ABC64573.1| alkanal monooxygenase alpha chain [Erythrobacter litoralis HTCC2594] Length=331 Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust. Identities = 36/123 (30%), Positives = 53/123 (44%), Gaps = 16/123 (13%) Query 1 MIPTPSIGAVINAKI-------SHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMNPGG 53 M+ + GA + AK+ SH A +I+ R + P + T R M G Sbjct 164 MLGSSLFGAQLAAKLGLPYAFASHFAPDHLDEALEIYRRDFQPSQ---TLDRPHVM--AG 218 Query 54 MRIRCRRGDKSRKLLSRSQVQPLV----GRPAKIPSPAANAPPSRARTASPVFENLELRA 109 M++ C D+ +LLS SQ Q V G P K+P P + + A + +LE A Sbjct 219 MQVICADTDEDARLLSSSQAQAFVRLRSGNPGKLPPPIEDYRETLPAPARAMLVHLEQAA 278 Query 110 AAG 112 A G Sbjct 279 AVG 281 Lambda K H 0.320 0.137 0.427 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 226797436674 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40