BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv2269c
Length=110

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|15609406|ref|NP_216785.1| hypothetical protein Rv2269c [Mycob...   218   2e-55
gi|340627275|ref|YP_004745727.1| hypothetical protein MCAN_22931...   213   7e-54
gi|289570389|ref|ZP_06450616.1| hypothetical protein TBJG_00756 ...   207   4e-52

>gi|15609406|ref|NP_216785.1| hypothetical protein Rv2269c [Mycobacterium tuberculosis H37Rv]
 gi|31793448|ref|NP_855941.1| hypothetical protein Mb2292c [Mycobacterium bovis AF2122/97]
 gi|121638151|ref|YP_978375.1| hypothetical protein BCG_2286c [Mycobacterium bovis BCG str. Pasteur 1173P2]
 39 more sequence titles
Length=110

 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust.
 Identities = 109/110 (99%), Positives = 110/110 (100%), Gaps = 0/110 (0%)

Query  1   VANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHG  60
           +ANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHG
Sbjct  1   MANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHG  60

Query  61  AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  110
           AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL
Sbjct  61  AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  110

>gi|340627275|ref|YP_004745727.1| hypothetical protein MCAN_22931 [Mycobacterium canettii CIPT 140010059]
 gi|340005465|emb|CCC44625.1| hypothetical protein MCAN_22931 [Mycobacterium canettii CIPT 140010059]
Length=110

 Score = 213 bits (542), Expect = 7e-54, Method: Compositional matrix adjust.
 Identities = 107/110 (98%), Positives = 108/110 (99%), Gaps = 0/110 (0%)

Query  1   VANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHG  60
           +ANDARPLARLANCRVGDQSSATHA TVGPVLGVPPTGGVDLRYGGRAGIGRSETV DHG
Sbjct  1   MANDARPLARLANCRVGDQSSATHADTVGPVLGVPPTGGVDLRYGGRAGIGRSETVADHG  60

Query  61  AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  110
           AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL
Sbjct  61  AVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  110

>gi|289570389|ref|ZP_06450616.1| hypothetical protein TBJG_00756 [Mycobacterium tuberculosis T17]
 gi|289544143|gb|EFD47791.1| hypothetical protein TBJG_00756 [Mycobacterium tuberculosis T17]
Length=107

 Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust.
 Identities = 104/104 (100%), Positives = 104/104 (100%), Gaps = 0/104 (0%)

Query  7   PLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHGAVGRRY  66
           PLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHGAVGRRY
Sbjct  4   PLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHGAVGRRY  63

Query  67  HQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  110
           HQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL
Sbjct  64  HQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPLPCDCSTPL  107

Lambda      K        H
   0.322    0.139    0.445

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 129022162688

  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
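
Note on the reported statistics: the gapped Lambda and K values and the
effective search space listed above can be related to each hit's raw score
(the number in parentheses after "bits") through the standard
Karlin-Altschul formulas, bit score = (Lambda*S - ln K) / ln 2 and
E = K * (search space) * exp(-Lambda*S). The following is a minimal sketch
under the assumption that the footer's gapped parameters apply unchanged to
every hit; because this search used "Compositional matrix adjust", the
program may rescale scores per alignment, so agreement with the printed
values should be read as approximate. The function and constant names are
illustrative and are not part of BLAST.

```python
import math

# Values copied from the "Gapped" statistics and search-space lines of the report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129022162688


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
            lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits: K * search_space * exp(-lambda*S)."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores of the three hits in this report.
    for raw in (556, 542, 527):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E={e_value(raw):.1e}")
```

Under these assumptions the computed bit scores and E-values should land
close to the Score and Expect columns of the report; small differences are
expected from rounding in the printed Lambda and K.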