BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0609A Length=75 Score E Sequences producing significant alignments: (Bits) Value gi|31791792|ref|NP_854285.1| hypothetical protein Mb0626 [Mycoba... 152 1e-35 gi|289441999|ref|ZP_06431743.1| conserved hypothetical protein [... 149 1e-34 gi|15840010|ref|NP_335047.1| hypothetical protein MT0638.1 [Myco... 117 7e-25 gi|15607752|ref|NP_215126.1| hypothetical protein Rv0612 [Mycoba... 49.7 1e-04 gi|308231600|ref|ZP_07413056.2| hypothetical protein TMAG_02492 ... 49.7 2e-04 gi|307083110|ref|ZP_07492223.1| hypothetical protein TMLG_03359 ... 42.7 0.020 gi|260907600|ref|ZP_05915922.1| hypothetical protein BlinB_19847... 38.9 0.29 gi|333989262|ref|YP_004521876.1| hypothetical protein JDM601_062... 34.7 4.6 >gi|31791792|ref|NP_854285.1| hypothetical protein Mb0626 [Mycobacterium bovis AF2122/97] gi|57116759|ref|YP_177628.1| hypothetical protein Rv0609A [Mycobacterium tuberculosis H37Rv] gi|121636528|ref|YP_976751.1| hypothetical protein BCG_0656 [Mycobacterium bovis BCG str. Pasteur 1173P2] 27 more sequence titlesLength=75 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 74/75 (99%), Positives = 75/75 (100%), Gaps = 0/75 (0%) Query 1 VEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPG 60 +EGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPG Sbjct 1 MEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPG 60 Query 61 NATGSGRLPKPLRHS 75 NATGSGRLPKPLRHS Sbjct 61 NATGSGRLPKPLRHS 75 >gi|289441999|ref|ZP_06431743.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289568544|ref|ZP_06448771.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289749108|ref|ZP_06508486.1| conserved hypothetical protein [Mycobacterium tuberculosis T92] gi|289414918|gb|EFD12158.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289542298|gb|EFD45946.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289689695|gb|EFD57124.1| conserved hypothetical protein [Mycobacterium tuberculosis T92] Length=75 Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 73/75 (98%), Positives = 74/75 (99%), Gaps = 0/75 (0%) Query 1 VEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPG 60 +EGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRS DPVAAAGFVVSAVWSCGRGPG Sbjct 1 MEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSVDPVAAAGFVVSAVWSCGRGPG 60 Query 61 NATGSGRLPKPLRHS 75 NATGSGRLPKPLRHS Sbjct 61 NATGSGRLPKPLRHS 75 >gi|15840010|ref|NP_335047.1| hypothetical protein MT0638.1 [Mycobacterium tuberculosis CDC1551] gi|13880153|gb|AAK44861.1| hypothetical protein MT0638.1 [Mycobacterium tuberculosis CDC1551] Length=57 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 56/57 (99%), Positives = 57/57 (100%), Gaps = 0/57 (0%) Query 19 VIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPGNATGSGRLPKPLRHS 75 +IDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPGNATGSGRLPKPLRHS Sbjct 1 MIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPGNATGSGRLPKPLRHS 57 >gi|15607752|ref|NP_215126.1| hypothetical protein Rv0612 [Mycobacterium tuberculosis H37Rv] gi|15840014|ref|NP_335051.1| hypothetical protein MT0642 [Mycobacterium tuberculosis CDC1551] gi|31791795|ref|NP_854288.1| hypothetical protein Mb0629 [Mycobacterium bovis AF2122/97] 60 more sequence titles Length=201 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 26/35 (75%), Positives = 28/35 (80%), Gaps = 2/35 (5%) Query 22 VSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCG 56 + LAR EA GYDYFRSDDPVAAAGFVVS V + G Sbjct 152 IDLAR--EADGYDYFRSDDPVAAAGFVVSDVAAAG 184 >gi|308231600|ref|ZP_07413056.2| hypothetical protein TMAG_02492 [Mycobacterium tuberculosis SUMu001] gi|308369992|ref|ZP_07419888.2| hypothetical protein TMBG_03473 [Mycobacterium tuberculosis SUMu002] gi|308370461|ref|ZP_07421579.2| hypothetical protein TMCG_03818 [Mycobacterium tuberculosis SUMu003] 15 more sequence titles Length=182 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/35 (75%), Positives = 28/35 (80%), Gaps = 2/35 (5%) Query 22 VSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCG 56 + LAR EA GYDYFRSDDPVAAAGFVVS V + G Sbjct 133 IDLAR--EADGYDYFRSDDPVAAAGFVVSDVAAAG 165 >gi|307083110|ref|ZP_07492223.1| hypothetical protein TMLG_03359 [Mycobacterium tuberculosis SUMu012] gi|308367186|gb|EFP56037.1| hypothetical protein TMLG_03359 [Mycobacterium tuberculosis SUMu012] Length=189 Score = 42.7 bits (99), Expect = 0.020, Method: Compositional matrix adjust. Identities = 21/26 (81%), Positives = 22/26 (85%), Gaps = 2/26 (7%) Query 22 VSLARRCEAHGYDYFRSDDPVAAAGF 47 + LAR EA GYDYFRSDDPVAAAGF Sbjct 152 IDLAR--EADGYDYFRSDDPVAAAGF 175 >gi|260907600|ref|ZP_05915922.1| hypothetical protein BlinB_19847 [Brevibacterium linens BL2] Length=299 Score = 38.9 bits (89), Expect = 0.29, Method: Compositional matrix adjust. Identities = 21/48 (44%), Positives = 28/48 (59%), Gaps = 3/48 (6%) Query 22 VSLARRCEAHGYDYFRSDDPVAAAGFVVS-AVWSCGRGPGNATGSGRL 68 ++LA CE G+DYFRS+DP+ A+ V+S V G G A G L Sbjct 129 ITLA--CETEGFDYFRSEDPIEASKVVISEMVDQLWTGIGRALGPEEL 174 >gi|333989262|ref|YP_004521876.1| hypothetical protein JDM601_0622 [Mycobacterium sp. JDM601] gi|333485230|gb|AEF34622.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=256 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 17/25 (68%), Positives = 20/25 (80%), Gaps = 4/25 (16%) Query 35 YFRSDDPVAAAGFVVS----AVWSC 55 YFRS+DPV+AAGFVVS AVW+ Sbjct 137 YFRSEDPVSAAGFVVSDAAAAVWNA 161 Lambda K H 0.320 0.136 0.447 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131059142040 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40