BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv0500B

Length=33
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|38233010|ref|NP_938777.1|  hypothetical protein DIP0396 [Coryn...  60.1    1e-07
gi|237786467|ref|YP_002907172.1|  hypothetical protein ckrop_1916...  58.5    3e-07
gi|302329977|gb|ADL20171.1|  Hypothetical protein Cp1002_0268 [Co...  58.5    3e-07
gi|148821702|ref|YP_001286456.1|  hypothetical protein TBFG_10511...  58.5    3e-07
gi|15828310|ref|NP_302573.1|  hypothetical protein ML2428A [Mycob...  57.8    5e-07
gi|330804191|ref|XP_003290081.1|  hypothetical protein DICPUDRAFT...  38.1    0.45
gi|213965046|ref|ZP_03393245.1|  conserved hypothetical protein [...  33.5    9.3


>gi|38233010|ref|NP_938777.1| hypothetical protein DIP0396 [Corynebacterium diphtheriae NCTC 13129]
 gi|38199269|emb|CAE48900.1| Conserved hypothetical protein [Corynebacterium diphtheriae]
Length=75

 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct  43  MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  75


>gi|237786467|ref|YP_002907172.1| hypothetical protein ckrop_1916 [Corynebacterium kroppenstedtii DSM 44385]
 gi|237759379|gb|ACR18629.1| hypothetical protein ckrop_1916 [Corynebacterium kroppenstedtii DSM 44385]
Length=51

 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct  19  MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  51


>gi|302329977|gb|ADL20171.1| Hypothetical protein Cp1002_0268 [Corynebacterium pseudotuberculosis 1002]
 gi|308275661|gb|ADO25560.1| hypothetical protein CpI19_0270 [Corynebacterium pseudotuberculosis I19]
 gi|341824087|gb|AEK91608.1| Hypothetical protein CpPAT10_0273 [Corynebacterium pseudotuberculosis PAT10]
Length=58

 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct  26  MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  58


>gi|148821702|ref|YP_001286456.1| hypothetical protein TBFG_10511 [Mycobacterium tuberculosis F11]
 gi|253797432|ref|YP_003030433.1| hypothetical protein TBMG_00507 [Mycobacterium tuberculosis KZN 1435]
 gi|289552754|ref|ZP_06441964.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN 605]
 10 more sequence titles
Length=61

 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct  29  MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  61


>gi|15828310|ref|NP_302573.1| hypothetical protein ML2428A [Mycobacterium leprae TN]
 gi|57116744|ref|YP_177625.1| hypothetical protein Rv0500B [Mycobacterium tuberculosis H37Rv]
 gi|108797650|ref|YP_637847.1| hypothetical protein Mmcs_0670 [Mycobacterium sp. MCS]
 64 more sequence titles
Length=33

 Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33


>gi|330804191|ref|XP_003290081.1| hypothetical protein DICPUDRAFT_154564 [Dictyostelium purpureum]
 gi|325079790|gb|EGC33373.1| hypothetical protein DICPUDRAFT_154564 [Dictyostelium purpureum]
Length=130

 Score = 38.1 bits (87), Expect = 0.45, Method: Compositional matrix adjust.
 Identities = 20/33 (61%), Positives = 27/33 (82%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           M S++ KRRK+MSK KHRKL +RTR  +++LGK
Sbjct  97  MSSILIKRRKKMSKHKHRKLRKRTRALKKRLGK  129


>gi|213965046|ref|ZP_03393245.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
 gi|213952582|gb|EEB63965.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
Length=48

 Score = 33.5 bits (75), Expect = 9.3, Method: Compositional matrix adjust.
 Identities = 32/33 (97%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  1   MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK  33
           MGSVIKKRRKRMSKKKHRK+LRRTRVQRRKLGK
Sbjct  16  MGSVIKKRRKRMSKKKHRKMLRRTRVQRRKLGK  48



Lambda     K        H
   0.327    0.135    0.357

Gapped
Lambda     K        H
   0.267   0.0410    0.140

Effective search space used: 127449871100


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40