BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv2401
Length=109

                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15841917|ref|NP_336954.1| hypothetical protein MT2472 [Mycoba...   224    4e-57
gi|31793579|ref|NP_856072.1| hypothetical protein Mb2423 [Mycoba...   221    3e-56
gi|340627410|ref|YP_004745862.1| hypothetical protein MCAN_24331...   203    6e-51
gi|170044326|ref|XP_001849803.1| hypothetical protein CpipJ_CPIJ...  33.5    9.4


>gi|15841917|ref|NP_336954.1| hypothetical protein MT2472 [Mycobacterium tuberculosis CDC1551]
 gi|167966970|ref|ZP_02549247.1| hypothetical protein MtubH3_02503 [Mycobacterium tuberculosis H37Ra]
 gi|289758527|ref|ZP_06517905.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
 gi|13882187|gb|AAK46768.1| hypothetical protein MT2472 [Mycobacterium tuberculosis CDC1551]
 gi|289714091|gb|EFD78103.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
Length=134

 Score =  224 bits (570),  Expect = 4e-57, Method: Compositional matrix adjust.
 Identities = 109/109 (100%), Positives = 109/109 (100%), Gaps = 0/109 (0%)

Query  1   VRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV  60
           VRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV
Sbjct  26  VRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV  85

Query  61  HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA  109
           HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA
Sbjct  86  HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA  134


>gi|31793579|ref|NP_856072.1| hypothetical protein Mb2423 [Mycobacterium bovis AF2122/97]
 gi|57116984|ref|NP_216917.2| hypothetical protein Rv2401 [Mycobacterium tuberculosis H37Rv]
 gi|121638281|ref|YP_978505.1| hypothetical protein BCG_2416 [Mycobacterium bovis BCG str. Pasteur 1173P2]
 43 more sequence titles
Length=109

 Score =  221 bits (563),  Expect = 3e-56, Method: Compositional matrix adjust.
 Identities = 108/109 (99%), Positives = 109/109 (100%), Gaps = 0/109 (0%)

Query  1   VRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV  60
           +RDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV
Sbjct  1   MRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV  60

Query  61  HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA  109
           HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA
Sbjct  61  HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA  109


>gi|340627410|ref|YP_004745862.1| hypothetical protein MCAN_24331 [Mycobacterium canettii CIPT 140010059]
 gi|340005600|emb|CCC44764.1| hypothetical protein MCAN_24331 [Mycobacterium canettii CIPT 140010059]
Length=110

 Score =  203 bits (517),  Expect = 6e-51, Method: Compositional matrix adjust.
 Identities = 100/103 (98%), Positives = 103/103 (100%), Gaps = 0/103 (0%)

Query  1   VRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERADIAPRKTRCCV  60
           +RDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAA+ERADIAPRKTRCCV
Sbjct  1   MRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAADERADIAPRKTRCCV  60

Query  61  HVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPRH  103
           HVAKPNRIRLADQLARSSMGEKPGHDHQR+QRDQNQRDVRPRH
Sbjct  61  HVAKPNRIRLADQLARSSMGEKPGHDHQRHQRDQNQRDVRPRH  103


>gi|170044326|ref|XP_001849803.1| hypothetical protein CpipJ_CPIJ008168 [Culex quinquefasciatus]
 gi|167867520|gb|EDS30903.1| hypothetical protein CpipJ_CPIJ008168 [Culex quinquefasciatus]
Length=409

 Score = 33.5 bits (75),  Expect = 9.4, Method: Compositional matrix adjust.
 Identities = 25/70 (36%), Positives = 32/70 (46%), Gaps = 3/70 (4%)

Query  24   LHIRPRTGGE--SATTVQVGRSAANERADIA-PRKTRCCVHVAKPNRIRLADQLARSSMG  80
            L  RP    E   A T ++GR      A +    +TR C+HV+ P   RLA  L     G
Sbjct  222  LQFRPVPPYERTPAETTRLGRYIEERHARLCGAVRTRRCMHVSTPESARLALWLWTRQQG  281

Query  81   EKPGHDHQRN  90
              PGH  +RN
Sbjct  282  TAPGHGIERN  291



Lambda     K        H
   0.319    0.132    0.402

Gapped
Lambda     K        H
   0.267   0.0410    0.140

Effective search space used: 129509500864


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
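
Note on the statistics: the bit scores and E-values listed above can be approximately recomputed from the raw scores (the numbers in parentheses after "bits") using the gapped Lambda and K and the effective search space printed in the footer. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). The function and constant names are illustrative and are not part of BLAST. Because Lambda and K are printed to only three significant figures, and because composition-based adjustment is in effect, the recomputed E-values should be expected to agree with the report only roughly.

```python
import math

# Values copied from the report footer (the "Gapped" Lambda/K block).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129509500864


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value: effective search space times 2^(-bits)."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores of the four reported alignments.
    for raw in (570, 563, 517, 75):
        print(f"raw={raw:4d}  bits={bit_score(raw):7.1f}  E={expect_value(raw):.1e}")
```

For the three Mycobacterium hits, the integer part of the recomputed bit score matches the reported value (224, 221, 203), and the Culex hit comes out near 33.5 bits; the recomputed E-values land close to, but not exactly on, the reported ones.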