BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0615
Length=80
                                                                     Score  E
Sequences producing significant alignments:                          (Bits) Value
gi|326905163|gb|EGE52096.1| hypothetical protein TBPG_03095 [Myc...  153    7e-36
gi|15840017|ref|NP_335054.1| hypothetical protein MT0645 [Mycoba...  151    4e-35
gi|15607755|ref|NP_215129.1| integral membrane protein [Mycobact...  150    8e-35
gi|148821817|ref|YP_001286571.1| hypothetical protein TBFG_10626...  150    8e-35
gi|240170350|ref|ZP_04749009.1| hypothetical protein MkanA1_1364...  98.6   3e-19
gi|183980149|ref|YP_001848440.1| hypothetical protein MMAR_0115 ...  41.2   0.049
gi|345330792|gb|EGW63256.1| colicin-Ia [Escherichia coli STEC_B2F1]  35.4   3.2
>gi|326905163|gb|EGE52096.1| hypothetical protein TBPG_03095 [Mycobacterium tuberculosis W-148]
Length=346
Score = 153 bits (387), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 80/80 (100%), Positives = 80/80 (100%), Gaps = 0/80 (0%)
Query  1    VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  60
            VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL
Sbjct  267  VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  326
Query  61   VALGGPLVVVNHRRAERSRG  80
            VALGGPLVVVNHRRAERSRG
Sbjct  327  VALGGPLVVVNHRRAERSRG  346
>gi|15840017|ref|NP_335054.1| hypothetical protein MT0645 [Mycobacterium tuberculosis CDC1551]
gi|289552868|ref|ZP_06442078.1| membrane protein [Mycobacterium tuberculosis KZN 605]
gi|289749114|ref|ZP_06508492.1| membrane protein [Mycobacterium tuberculosis T92]
26 more sequence titles
Length=86
Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/80 (100%), Positives = 80/80 (100%), Gaps = 0/80 (0%)
Query  1   VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  60
           VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL
Sbjct  7   VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  66
Query  61  VALGGPLVVVNHRRAERSRG  80
           VALGGPLVVVNHRRAERSRG
Sbjct  67  VALGGPLVVVNHRRAERSRG  86
>gi|15607755|ref|NP_215129.1| integral membrane protein [Mycobacterium tuberculosis H37Rv]
gi|31791798|ref|NP_854291.1| integral membrane protein [Mycobacterium bovis AF2122/97]
gi|121636534|ref|YP_976757.1| putative integral membrane protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
36 more sequence titles
Length=80
Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 79/80 (99%), Positives = 80/80 (100%), Gaps = 0/80 (0%)
Query  1   VMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  60
           +MDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL
Sbjct  1   MMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFL  60
Query  61  VALGGPLVVVNHRRAERSRG  80
           VALGGPLVVVNHRRAERSRG
Sbjct  61  VALGGPLVVVNHRRAERSRG  80
>gi|148821817|ref|YP_001286571.1| hypothetical protein TBFG_10626 [Mycobacterium tuberculosis F11]
gi|167969040|ref|ZP_02551317.1| hypothetical protein MtubH3_13842 [Mycobacterium tuberculosis
H37Ra]
gi|254230951|ref|ZP_04924278.1| hypothetical protein TBCG_00610 [Mycobacterium tuberculosis C]
7 more sequence titles
Length=79
Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 79/79 (100%), Positives = 79/79 (100%), Gaps = 0/79 (0%)
Query  2   MDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFLV  61
           MDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFLV
Sbjct  1   MDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAFLV  60
Query  62  ALGGPLVVVNHRRAERSRG  80
           ALGGPLVVVNHRRAERSRG
Sbjct  61  ALGGPLVVVNHRRAERSRG  79
>gi|240170350|ref|ZP_04749009.1| hypothetical protein MkanA1_13640 [Mycobacterium kansasii ATCC
12478]
Length=88
Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/78 (70%), Positives = 57/78 (74%), Gaps = 1/78 (1%)
Query  3   DVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQTLMAISVAF-LV  61
           DVL A I +GA  LA WGAWRPHYR ASYLVAGAV L LI  LVVTGQT M I  AF LV
Sbjct  3   DVLEAAILSGAFVLALWGAWRPHYRVASYLVAGAVTLVLIAGLVVTGQTNMTIVSAFLLV  62
Query  62  ALGGPLVVVNHRRAERSR  79
           A+  P+ V NHRRAER R
Sbjct  63  AMFVPMTVFNHRRAERGR  80
>gi|183980149|ref|YP_001848440.1| hypothetical protein MMAR_0115 [Mycobacterium marinum M]
gi|183173475|gb|ACC38585.1| conserved hypothetical membrane protein [Mycobacterium marinum
M]
Length=86
Score = 41.2 bits (95), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/76 (49%), Positives = 46/76 (61%), Gaps = 6/76 (7%)
Query  1   VMDVLAAGIAAGALTLAAWGAW------RPHYRAASYLVAGAVELALIGLLVVTGQTLMA  54
           + D+L A I AGA  LA WGA       RPHY  ASYL AG V +ALI +L+ TG+T
Sbjct  1   MTDLLVAAILAGAFGLALWGAGAFGLARRPHYPLASYLAAGGVVVALIVVLLTTGRTDWV  60
Query  55  ISVAFLVALGGPLVVV  70
           I  A  +AL  PL+V+
Sbjct  61  IVSAGALALFIPLIVI  76
>gi|345330792|gb|EGW63256.1| colicin-Ia [Escherichia coli STEC_B2F1]
Length=424
Score = 35.4 bits (80), Expect = 3.2, Method: Composition-based stats.
Identities = 22/73 (31%), Positives = 33/73 (46%), Gaps = 12/73 (16%)
Query  8    GIAAGALTLAAW----------GAWRPHYRAASYLVAGAVELALIGLL--VVTGQTLMAI  55
            G A    +LA W          G WRP +     ++AG    AL+ L+  ++TG  L  I
Sbjct  338  GYAGKFTSLADWITEFGKAARTGNWRPFFVKTETIIAGNAATALVALVFSILTGSALGII  397
Query  56   SVAFLVALGGPLV  68
                L+A+ G L+
Sbjct  398  GYGLLMAVTGALI  410
Lambda      K        H
   0.326    0.137    0.406
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 128850890930
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40