BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2808
Length=85
Score E
Sequences producing significant alignments: (Bits) Value
gi|15842345|ref|NP_337382.1| hypothetical protein MT2875 [Mycoba... 176 2e-42
gi|15609945|ref|NP_217324.1| hypothetical protein Rv2808 [Mycoba... 174 5e-42
gi|240173505|ref|ZP_04752163.1| hypothetical protein MkanA1_2961... 167 3e-40
gi|118616853|ref|YP_905185.1| hypothetical protein MUL_1132 [Myc... 116 1e-24
>gi|15842345|ref|NP_337382.1| hypothetical protein MT2875 [Mycobacterium tuberculosis CDC1551]
gi|253798107|ref|YP_003031108.1| hypothetical protein TBMG_01165 [Mycobacterium tuberculosis KZN
1435]
gi|254551870|ref|ZP_05142317.1| hypothetical protein Mtube_15662 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
29 more sequence titles
Length=89
Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 85/85 (100%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
Query 1 VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 60
VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD
Sbjct 5 VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 64
Query 61 WVPNDYGLKIERAIDAYLETWPIYR 85
WVPNDYGLKIERAIDAYLETWPIYR
Sbjct 65 WVPNDYGLKIERAIDAYLETWPIYR 89
>gi|15609945|ref|NP_217324.1| hypothetical protein Rv2808 [Mycobacterium tuberculosis H37Rv]
gi|31793984|ref|NP_856477.1| hypothetical protein Mb2831 [Mycobacterium bovis AF2122/97]
gi|121638688|ref|YP_978912.1| hypothetical protein BCG_2826 [Mycobacterium bovis BCG str. Pasteur
1173P2]
44 more sequence titles
Length=85
Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 84/85 (99%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
Query 1 VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 60
+SNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD
Sbjct 1 MSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 60
Query 61 WVPNDYGLKIERAIDAYLETWPIYR 85
WVPNDYGLKIERAIDAYLETWPIYR
Sbjct 61 WVPNDYGLKIERAIDAYLETWPIYR 85
>gi|240173505|ref|ZP_04752163.1| hypothetical protein MkanA1_29611 [Mycobacterium kansasii ATCC
12478]
Length=85
Score = 167 bits (424), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/85 (96%), Positives = 82/85 (97%), Gaps = 0/85 (0%)
Query 1 VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 60
+SNVLDAIS EHRPVI QELENRNPALFDELRRTEKPTNEQSDAVID LSDALMKTFGPD
Sbjct 1 MSNVLDAISPEHRPVIAQELENRNPALFDELRRTEKPTNEQSDAVIDALSDALMKTFGPD 60
Query 61 WVPNDYGLKIERAIDAYLETWPIYR 85
WVPNDYGLKIERAIDAYLETWPIYR
Sbjct 61 WVPNDYGLKIERAIDAYLETWPIYR 85
>gi|118616853|ref|YP_905185.1| hypothetical protein MUL_1132 [Mycobacterium ulcerans Agy99]
gi|183980510|ref|YP_001848801.1| hypothetical protein MMAR_0481 [Mycobacterium marinum M]
gi|118568963|gb|ABL03714.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183173836|gb|ACC38946.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=89
Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/85 (64%), Positives = 64/85 (76%), Gaps = 0/85 (0%)
Query 1 VSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLSDALMKTFGPD 60
V+N+LD + +EHR +I EL RNP L ELR +KPTN+QSDAV+D LS AL +GP
Sbjct 5 VNNILDVVPSEHRVLIMDELTRRNPDLLAELRTVQKPTNDQSDAVVDALSHALSANYGPG 64
Query 61 WVPNDYGLKIERAIDAYLETWPIYR 85
VPNDYGL +ERAIDAYLE WPIYR
Sbjct 65 HVPNDYGLAVERAIDAYLEAWPIYR 89
Lambda K H
0.315 0.134 0.397
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131466506940
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40