BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0500B
Length=33
Score E
Sequences producing significant alignments: (Bits) Value
gi|38233010|ref|NP_938777.1| hypothetical protein DIP0396 [Coryn... 60.1 1e-07
gi|237786467|ref|YP_002907172.1| hypothetical protein ckrop_1916... 58.5 3e-07
gi|302329977|gb|ADL20171.1| Hypothetical protein Cp1002_0268 [Co... 58.5 3e-07
gi|148821702|ref|YP_001286456.1| hypothetical protein TBFG_10511... 58.5 3e-07
gi|15828310|ref|NP_302573.1| hypothetical protein ML2428A [Mycob... 57.8 5e-07
gi|330804191|ref|XP_003290081.1| hypothetical protein DICPUDRAFT... 38.1 0.45
gi|213965046|ref|ZP_03393245.1| conserved hypothetical protein [... 33.5 9.3
>gi|38233010|ref|NP_938777.1| hypothetical protein DIP0396 [Corynebacterium diphtheriae NCTC
13129]
gi|38199269|emb|CAE48900.1| Conserved hypothetical protein [Corynebacterium diphtheriae]
Length=75
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct 43 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 75
>gi|237786467|ref|YP_002907172.1| hypothetical protein ckrop_1916 [Corynebacterium kroppenstedtii
DSM 44385]
gi|237759379|gb|ACR18629.1| hypothetical protein ckrop_1916 [Corynebacterium kroppenstedtii
DSM 44385]
Length=51
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct 19 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 51
>gi|302329977|gb|ADL20171.1| Hypothetical protein Cp1002_0268 [Corynebacterium pseudotuberculosis
1002]
gi|308275661|gb|ADO25560.1| hypothetical protein CpI19_0270 [Corynebacterium pseudotuberculosis
I19]
gi|341824087|gb|AEK91608.1| Hypothetical protein CpPAT10_0273 [Corynebacterium pseudotuberculosis
PAT10]
Length=58
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct 26 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 58
>gi|148821702|ref|YP_001286456.1| hypothetical protein TBFG_10511 [Mycobacterium tuberculosis F11]
gi|253797432|ref|YP_003030433.1| hypothetical protein TBMG_00507 [Mycobacterium tuberculosis KZN
1435]
gi|289552754|ref|ZP_06441964.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
10 more sequence titles
Length=61
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct 29 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 61
>gi|15828310|ref|NP_302573.1| hypothetical protein ML2428A [Mycobacterium leprae TN]
gi|57116744|ref|YP_177625.1| hypothetical protein Rv0500B [Mycobacterium tuberculosis H37Rv]
gi|108797650|ref|YP_637847.1| hypothetical protein Mmcs_0670 [Mycobacterium sp. MCS]
64 more sequence titles
Length=33
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
Sbjct 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
>gi|330804191|ref|XP_003290081.1| hypothetical protein DICPUDRAFT_154564 [Dictyostelium purpureum]
gi|325079790|gb|EGC33373.1| hypothetical protein DICPUDRAFT_154564 [Dictyostelium purpureum]
Length=130
Score = 38.1 bits (87), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 20/33 (61%), Positives = 27/33 (82%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
M S++ KRRK+MSK KHRKL +RTR +++LGK
Sbjct 97 MSSILIKRRKKMSKHKHRKLRKRTRALKKRLGK 129
>gi|213965046|ref|ZP_03393245.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
gi|213952582|gb|EEB63965.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
Length=48
Score = 33.5 bits (75), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 32/33 (97%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 1 MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK 33
MGSVIKKRRKRMSKKKHRK+LRRTRVQRRKLGK
Sbjct 16 MGSVIKKRRKRMSKKKHRKMLRRTRVQRRKLGK 48
Lambda K H
0.327 0.135 0.357
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127449871100
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40