BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2432c Length=136 Score E Sequences producing significant alignments: (Bits) Value gi|15609569|ref|NP_216948.1| hypothetical protein Rv2432c [Mycob... 272 1e-71 gi|340627445|ref|YP_004745897.1| hypothetical protein MCAN_24701... 269 1e-70 gi|328858467|gb|EGG07579.1| hypothetical protein MELLADRAFT_6232... 38.1 0.46 >gi|15609569|ref|NP_216948.1| hypothetical protein Rv2432c [Mycobacterium tuberculosis H37Rv] gi|31793612|ref|NP_856105.1| hypothetical protein Mb2458c [Mycobacterium bovis AF2122/97] gi|121638314|ref|YP_978538.1| hypothetical protein BCG_2451c [Mycobacterium bovis BCG str. Pasteur 1173P2] 52 more sequence titlesLength=136 Score = 272 bits (696), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%) Query 1 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL 60 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL Sbjct 1 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL 60 Query 61 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWLASDGHWGMVSYIPT 120 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWLASDGHWGMVSYIPT Sbjct 61 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWLASDGHWGMVSYIPT 120 Query 121 ALNVSMGGIVGWRCVP 136 ALNVSMGGIVGWRCVP Sbjct 121 ALNVSMGGIVGWRCVP 136 >gi|340627445|ref|YP_004745897.1| hypothetical protein MCAN_24701 [Mycobacterium canettii CIPT 140010059] gi|340005635|emb|CCC44801.1| hypothetical protein MCAN_24701 [Mycobacterium canettii CIPT 140010059] Length=136 Score = 269 bits (687), Expect = 1e-70, Method: Compositional matrix adjust. Identities = 135/136 (99%), Positives = 135/136 (99%), Gaps = 0/136 (0%) Query 1 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL 60 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL Sbjct 1 MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHL 60 Query 61 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWLASDGHWGMVSYIPT 120 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWW ASDGHWGMVSYIPT Sbjct 61 SRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWPASDGHWGMVSYIPT 120 Query 121 ALNVSMGGIVGWRCVP 136 ALNVSMGGIVGWRCVP Sbjct 121 ALNVSMGGIVGWRCVP 136 >gi|328858467|gb|EGG07579.1| hypothetical protein MELLADRAFT_62323 [Melampsora larici-populina 98AG31] Length=309 Score = 38.1 bits (87), Expect = 0.46, Method: Compositional matrix adjust. Identities = 30/103 (30%), Positives = 41/103 (40%), Gaps = 6/103 (5%) Query 10 GAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGFPCRPALLPHLSRAVMRCVR 69 G G + P+V P + T + AA Q P A+ RPA+ S V RCVR Sbjct 57 GQGSVE--PAVKPSNET----ERAATKAGQSNQNPRAIYKLRARPAVRHAPSEDVYRCVR 110 Query 70 TRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHVRWWLASDGHW 112 T SVI +P S R+ P + + DG + Sbjct 111 TFEVENGDAGSVICVTIPNRVSSLRIDLPSKRAAFLYRLDGFY 153 Lambda K H 0.325 0.136 0.470 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128858389450 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40