BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1012
Length=97
                                                                     Score     E
Sequences producing significant alignments:                         (Bits)  Value
gi|15608152|ref|NP_215528.1| hypothetical protein Rv1012 [Mycoba...   198    2e-49
gi|289442433|ref|ZP_06432177.1| conserved hypothetical protein [...   131    4e-29
gi|31792203|ref|NP_854696.1| hypothetical protein Mb1040 [Mycoba...   130    5e-29
gi|289749541|ref|ZP_06508919.1| LOW QUALITY PROTEIN: hypothetica...   130    7e-29
gi|319794967|ref|YP_004156607.1| amino acid adenylation domain-c...  35.0    3.7
>gi|15608152|ref|NP_215528.1| hypothetical protein Rv1012 [Mycobacterium tuberculosis H37Rv]
gi|148660794|ref|YP_001282317.1| hypothetical protein MRA_1020A [Mycobacterium tuberculosis H37Ra]
gi|167968130|ref|ZP_02550407.1| hypothetical protein MtubH3_08885 [Mycobacterium tuberculosis
H37Ra]
17 more sequence titles
Length=97
Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/97 (100%), Positives = 97/97 (100%), Gaps = 0/97 (0%)
Query  1   MPRAARGIRACRGRWVDRLAHQHASGRAAGIRPREVGGAHQSQAQKPYHDATEPLGESLR  60
           MPRAARGIRACRGRWVDRLAHQHASGRAAGIRPREVGGAHQSQAQKPYHDATEPLGESLR
Sbjct  1   MPRAARGIRACRGRWVDRLAHQHASGRAAGIRPREVGGAHQSQAQKPYHDATEPLGESLR  60

Query  61  YRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL  97
           YRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL
Sbjct  61  YRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL  97
>gi|289442433|ref|ZP_06432177.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289573652|ref|ZP_06453879.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289415352|gb|EFD12592.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289538083|gb|EFD42661.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=75
Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 62/63 (99%), Positives = 63/63 (100%), Gaps = 0/63 (0%)
Query  35  EVGGAHQSQAQKPYHDATEPLGESLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  94
           EVGGAHQSQAQKPYHDATEPLGE+LRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV
Sbjct  13  EVGGAHQSQAQKPYHDATEPLGENLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  72

Query  95  TKL  97
           TKL
Sbjct  73  TKL  75
>gi|31792203|ref|NP_854696.1| hypothetical protein Mb1040 [Mycobacterium bovis AF2122/97]
gi|121636941|ref|YP_977164.1| hypothetical protein BCG_1069 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224989413|ref|YP_002644100.1| hypothetical protein JTY_1041 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=104
Score = 130 bits (328), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 62/63 (99%), Positives = 63/63 (100%), Gaps = 0/63 (0%)
Query  35   EVGGAHQSQAQKPYHDATEPLGESLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  94
            EVGGAHQSQAQKPYHDATEPLGE+LRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV
Sbjct  42   EVGGAHQSQAQKPYHDATEPLGENLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  101

Query  95   TKL  97
            TKL
Sbjct  102  TKL  104
>gi|289749541|ref|ZP_06508919.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_01766 [Mycobacterium
tuberculosis T92]
gi|289690128|gb|EFD57557.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_01766 [Mycobacterium
tuberculosis T92]
Length=97
Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 62/63 (99%), Positives = 63/63 (100%), Gaps = 0/63 (0%)
Query  35  EVGGAHQSQAQKPYHDATEPLGESLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  94
           EVGGAHQSQAQKPYHDATEPLGE+LRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV
Sbjct  35  EVGGAHQSQAQKPYHDATEPLGENLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAV  94

Query  95  TKL  97
           TKL
Sbjct  95  TKL  97
>gi|319794967|ref|YP_004156607.1| amino acid adenylation domain-containing protein [Variovorax
paradoxus EPS]
gi|315597430|gb|ADU38496.1| amino acid adenylation domain protein [Variovorax paradoxus EPS]
Length=1766
Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/104 (31%), Positives = 49/104 (48%), Gaps = 20/104 (19%)
Query  1    MPRAARGI---RACRGRWVDRLAHQHA-SGRAAGIRPREVGGAHQSQAQKPYHDATEPLG  56
            MP+   G     ACR  W+DR A  +A   R A ++    GG+ ++QA+ P  D TE
Sbjct  547  MPKTTSGKLQRNACRAGWLDRSADAYAIYERGAFVK----GGSVEAQAEAPVLDETEHAV  602

Query  57   ESL---RYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL  97
            +++     RPA            P AR+S  FT G ++   T++
Sbjct  603  DAIWREVLRPADA---------KPFARDSHFFTRGGSSLTATQV  637
Lambda      K        H
   0.316    0.129    0.398
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 130655526400
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
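
The bit scores and E-values listed for each hit can be cross-checked against the raw scores (the numbers in parentheses on the Score lines) and the gapped statistics printed above. The following is a minimal Python sketch, assuming the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), with N taken as the "Effective search space used" value. The function names bit_score and e_value are illustrative and not part of BLAST. Agreement is expected to be approximate only, since the printed Lambda and K are rounded and compositional matrix adjustment is applied per alignment.

```python
import math

# Gapped Karlin-Altschul parameters and search space, copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 130655526400


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
            lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance alignments scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw scores of the five alignments in this report.
    for raw in (504, 329, 328, 327, 79):
        print(f"raw={raw:4d}  bits={bit_score(raw):6.1f}  E={e_value(raw):.1e}")
```

For the top hit (raw score 504) this gives roughly 198.7 bits and an E-value near 2e-49, in line with the values reported above; the last hit (raw score 79) comes out near 35.0 bits with an E-value of about 3.7.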