BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv0997
Length=143

                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value

gi|15608137|ref|NP_215512.1| hypothetical protein Rv0997 [Mycoba...  287     4e-76
gi|340626009|ref|YP_004744461.1| hypothetical protein MCAN_10021...  283     4e-75
gi|15840425|ref|NP_335462.1| hypothetical protein MT1026 [Mycoba...  173     8e-42
gi|313755428|gb|ADR74205.1| (E)-beta-ocimene synthase [Vitis vin...  42.4    0.020
gi|225447406|ref|XP_002281392.1| PREDICTED: hypothetical protein...  42.4    0.020
gi|225447404|ref|XP_002281379.1| PREDICTED: hypothetical protein...  42.4    0.020
gi|296081223|emb|CBI17967.3| unnamed protein product [Vitis vini...  41.2    0.056
gi|4102963|gb|AAD01632.1| ladder protein [Caenorhabditis elegans]    37.4    0.76

>gi|15608137|ref|NP_215512.1| hypothetical protein Rv0997 [Mycobacterium tuberculosis H37Rv]
 gi|31792188|ref|NP_854681.1| hypothetical protein Mb1024 [Mycobacterium bovis AF2122/97]
 gi|121636926|ref|YP_977149.1| hypothetical protein BCG_1054 [Mycobacterium bovis BCG str. Pasteur 1173P2]
 42 more sequence titles
Length=143

 Score = 287 bits (734), Expect = 4e-76, Method: Compositional matrix adjust.
 Identities = 142/143 (99%), Positives = 143/143 (100%), Gaps = 0/143 (0%)

Query  1    LAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRP  60
            +AGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRP
Sbjct  1    MAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRP  60

Query  61   WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP  120
            WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP
Sbjct  61   WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP  120

Query  121  LGSGGDAAGKVEGGRTPQPFVQP  143
            LGSGGDAAGKVEGGRTPQPFVQP
Sbjct  121  LGSGGDAAGKVEGGRTPQPFVQP  143

>gi|340626009|ref|YP_004744461.1| hypothetical protein MCAN_10021 [Mycobacterium canettii CIPT 140010059]
 gi|340004199|emb|CCC43339.1| hypothetical protein MCAN_10021 [Mycobacterium canettii CIPT 140010059]
Length=143

 Score = 283 bits (725), Expect = 4e-75, Method: Compositional matrix adjust.
 Identities = 140/143 (98%), Positives = 142/143 (99%), Gaps = 0/143 (0%)

Query  1    LAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRP  60
            +AGIA VDRDPPGWPQHSHLLAGDPERFRHQL+RAETTNSIECFVAEWHHAGVAADMTRP
Sbjct  1    MAGIASVDRDPPGWPQHSHLLAGDPERFRHQLRRAETTNSIECFVAEWHHAGVAADMTRP  60

Query  61   WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP  120
            WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP
Sbjct  61   WPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP  120

Query  121  LGSGGDAAGKVEGGRTPQPFVQP  143
            LGSGGDAAGKVEGGRTPQPFVQP
Sbjct  121  LGSGGDAAGKVEGGRTPQPFVQP  143

>gi|15840425|ref|NP_335462.1| hypothetical protein MT1026 [Mycobacterium tuberculosis CDC1551]
 gi|13880595|gb|AAK45276.1| hypothetical protein MT1026 [Mycobacterium tuberculosis CDC1551]
Length=87

 Score = 173 bits (438), Expect = 8e-42, Method: Compositional matrix adjust.
 Identities = 87/87 (100%), Positives = 87/87 (100%), Gaps = 0/87 (0%)

Query  57   MTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHR  116
            MTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHR
Sbjct  1    MTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHR  60

Query  117  GAVPLGSGGDAAGKVEGGRTPQPFVQP  143
            GAVPLGSGGDAAGKVEGGRTPQPFVQP
Sbjct  61   GAVPLGSGGDAAGKVEGGRTPQPFVQP  87

>gi|313755428|gb|ADR74205.1| (E)-beta-ocimene synthase [Vitis vinifera]
Length=579

 Score = 42.4 bits (98), Expect = 0.020, Method: Composition-based stats.
 Identities = 28/107 (27%), Positives = 47/107 (44%), Gaps = 9/107 (8%)

Query  14   WPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRPWPTVVQGGAGQRR  73
            WP     L  D    + +L+R E+ NSI C++   H  GV+ +  R    ++ G + ++
Sbjct  461  WPSTILRLCNDLATSKAELERGESANSISCYM---HQTGVSEESAREHMKILTGESWKKM  517

Query  74   RRDVEPDRKTPVR------WMSGQRLSEITWPTTDIEHSVGAAEVQR  114
             +  EPD  +P          +  R+SE T+   D   +  A   QR
Sbjct  518  NKVREPDYDSPFSKPFMEIAFNLARISECTYQYGDAHGAPDARSRQR  564

>gi|225447406|ref|XP_002281392.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
Length=547

 Score = 42.4 bits (98), Expect = 0.020, Method: Composition-based stats.
 Identities = 28/107 (27%), Positives = 47/107 (44%), Gaps = 9/107 (8%)

Query  14   WPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRPWPTVVQGGAGQRR  73
            WP     L  D    + +L+R E+ NSI C++   H  GV+ +  R    ++ G + ++
Sbjct  429  WPSTILRLCNDLATSKAELERGESANSISCYM---HQTGVSEESAREHMKILTGESWKKM  485

Query  74   RRDVEPDRKTPVR------WMSGQRLSEITWPTTDIEHSVGAAEVQR  114
             +  EPD  +P          +  R+SE T+   D   +  A   QR
Sbjct  486  NKVREPDYDSPFSKPFMEIAFNLARISECTYQYGDAHGAPDARSRQR  532

>gi|225447404|ref|XP_002281379.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
Length=579

 Score = 42.4 bits (98), Expect = 0.020, Method: Composition-based stats.
 Identities = 28/107 (27%), Positives = 47/107 (44%), Gaps = 9/107 (8%)

Query  14   WPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRPWPTVVQGGAGQRR  73
            WP     L  D    + +L+R E+ NSI C++   H  GV+ +  R    ++ G + ++
Sbjct  461  WPSTILRLCNDLATSKAELERGESANSISCYM---HQTGVSEESAREHMKILTGESWKKM  517

Query  74   RRDVEPDRKTPVR------WMSGQRLSEITWPTTDIEHSVGAAEVQR  114
             +  EPD  +P          +  R+SE T+   D   +  A   QR
Sbjct  518  NKVREPDYDSPFSKPFMEIAFNLARISECTYQYGDAHGAPDARSRQR  564

>gi|296081223|emb|CBI17967.3| unnamed protein product [Vitis vinifera]
Length=206

 Score = 41.2 bits (95), Expect = 0.056, Method: Compositional matrix adjust.
 Identities = 26/101 (26%), Positives = 45/101 (45%), Gaps = 9/101 (8%)

Query  8    DRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHHAGVAADMTRPWPTVVQG  67
            D +   WP     L  D    + +L+R E+ NSI C++   H  GV+ +  R    ++ G
Sbjct  82   DHELLRWPSTILRLCNDLATSKAELERGESANSISCYM---HQTGVSEESAREHMKILTG  138

Query  68   GAGQRRRRDVEPDRKTPVR------WMSGQRLSEITWPTTD  102
             + ++   +  EPD  +P          +  R+SE T+   D
Sbjct  139  ESWKKMNKVREPDYDSPFSKPFMEIAFNLARISECTYQYGD  179

>gi|4102963|gb|AAD01632.1| ladder protein [Caenorhabditis elegans]
Length=1198

 Score = 37.4 bits (85), Expect = 0.76, Method: Composition-based stats.
 Identities = 18/49 (37%), Positives = 31/49 (64%), Gaps = 0/49 (0%)

Query  72   RRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHSVGAAEVQRHRGAVP  120
            RRRR++EP  +  V+WM+ ++L +IT   T  + S   A+V+ + G +P
Sbjct  647  RRRREIEPAFQDFVKWMTPEQLGDITALKTAGKESEVQAKVKEYFGQLP  695

Lambda      K        H
   0.317    0.135    0.440

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 129250525032

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
  excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
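
The statistics footer gives the gapped Lambda and K values and the effective search space, which are the quantities the standard Karlin-Altschul formulas use to turn a raw score into a bit score and an E-value. The following is a minimal sketch of that arithmetic, not BLAST's own code path. The function and constant names are hypothetical, and the formulas assumed are bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits).

```python
import math

# Values copied from the statistics footer of the report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129250525032


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value (assumed formula): search_space * 2 ** (-bits)."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesised numbers on the "Score =" lines.
    for raw in (734, 438, 98, 95, 85):
        bits = bit_score(raw)
        print(f"raw={raw:4d}  bits={bits:6.1f}  E~{expect_value(bits):.2g}")
```

For the raw scores 734, 438, 98, 95 and 85 the computed bit scores round to the values printed in the report. The E-values land close to the reported ones but are not identical, and the hit with raw score 725 comes out slightly higher than the reported 283 bits. One plausible reason is that the report applies composition-based adjustments per alignment, which this sketch does not model.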