BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1573
Length=136
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608711|ref|NP_216089.1| phiRV1 phage protein [Mycobacterium... 273 5e-72
gi|306809485|ref|ZP_07446153.1| hypothetical protein TMGG_03958 ... 271 2e-71
gi|289569611|ref|ZP_06449838.1| conserved hypothetical protein [... 244 2e-63
gi|289750142|ref|ZP_06509520.1| phiRv1 phage protein [Mycobacter... 103 7e-21
gi|308173804|ref|YP_003920509.1| iturin A synthetase A [Bacillus... 35.4 2.8
gi|341827505|gb|AEK88756.1| bacillomycin D synthetase A [Bacillu... 35.4 2.9
gi|328553269|gb|AEB23761.1| iturin A synthetase A [Bacillus amyl... 35.4 2.9
gi|55376536|ref|YP_134388.1| hypothetical protein pNG6142 [Haloa... 35.4 3.2
gi|325473673|gb|EGC76862.1| translocase subunit secA [Treponema ... 34.3 7.1
gi|42527403|ref|NP_972501.1| preprotein translocase subunit SecA... 34.3 7.1
>gi|15608711|ref|NP_216089.1| phiRV1 phage protein [Mycobacterium tuberculosis H37Rv]
gi|15843078|ref|NP_338115.1| hypothetical protein MT3573.15 [Mycobacterium tuberculosis CDC1551]
gi|31792759|ref|NP_855252.1| phiRV1 phage protein [Mycobacterium bovis AF2122/97]
37 more sequence titles
Length=136
Score = 273 bits (699), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE 60
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE
Sbjct 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE 60
Query 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW
Sbjct 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
Query 121 SRMIGLGGGSPAEDER 136
SRMIGLGGGSPAEDER
Sbjct 121 SRMIGLGGGSPAEDER 136
>gi|306809485|ref|ZP_07446153.1| hypothetical protein TMGG_03958 [Mycobacterium tuberculosis SUMu007]
gi|308344216|gb|EFP33067.1| hypothetical protein TMGG_03958 [Mycobacterium tuberculosis SUMu007]
Length=136
Score = 271 bits (694), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/136 (99%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE 60
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRL+AALNRHE
Sbjct 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLRAALNRHE 60
Query 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW
Sbjct 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
Query 121 SRMIGLGGGSPAEDER 136
SRMIGLGGGSPAEDER
Sbjct 121 SRMIGLGGGSPAEDER 136
>gi|289569611|ref|ZP_06449838.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289543365|gb|EFD47013.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=121
Score = 244 bits (624), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 121/121 (100%), Positives = 121/121 (100%), Gaps = 0/121 (0%)
Query 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE 60
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE
Sbjct 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALVRLQAALNRHE 60
Query 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW
Sbjct 61 HTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINRQLGLAGDDEPDGDDTPPW 120
Query 121 S 121
S
Sbjct 121 S 121
>gi|289750142|ref|ZP_06509520.1| phiRv1 phage protein [Mycobacterium tuberculosis T92]
gi|289690729|gb|EFD58158.1| phiRv1 phage protein [Mycobacterium tuberculosis T92]
Length=50
Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 50/50 (100%), Positives = 50/50 (100%), Gaps = 0/50 (0%)
Query 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALV 50
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALV
Sbjct 1 MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALV 50
>gi|308173804|ref|YP_003920509.1| iturin A synthetase A [Bacillus amyloliquefaciens DSM 7]
gi|307606668|emb|CBI43039.1| iturin A synthetase A (B. subtilis) [Bacillus amyloliquefaciens
DSM 7]
gi|328911945|gb|AEB63541.1| iturin A synthetase A [Bacillus amyloliquefaciens LL3]
Length=3982
Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats.
Identities = 18/52 (35%), Positives = 26/52 (50%), Gaps = 0/52 (0%)
Query 32 ETIRAWFPDAPLEVREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDE 83
E R W APL V E R +A LN ++ +E+FL+ + +A DE
Sbjct 1258 ERNRCWAEAAPLSVNEGEERGEAVLNINQSKAHIESFLKTVISNASGIRADE 1309
>gi|341827505|gb|AEK88756.1| bacillomycin D synthetase A [Bacillus amyloliquefaciens XH7]
Length=3982
Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats.
Identities = 18/52 (35%), Positives = 26/52 (50%), Gaps = 0/52 (0%)
Query 32 ETIRAWFPDAPLEVREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDE 83
E R W APL V E R +A LN ++ +E+FL+ + +A DE
Sbjct 1258 ERNRCWAEAAPLSVNEGEERGEAVLNINQSKAHIESFLKTVISNASGIRADE 1309
>gi|328553269|gb|AEB23761.1| iturin A synthetase A [Bacillus amyloliquefaciens TA208]
Length=3982
Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats.
Identities = 18/52 (35%), Positives = 26/52 (50%), Gaps = 0/52 (0%)
Query 32 ETIRAWFPDAPLEVREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDE 83
E R W APL V E R +A LN ++ +E+FL+ + +A DE
Sbjct 1258 ERNRCWAEAAPLSVNEGEERGEAVLNINQSKAHIESFLKTVISNASGIRADE 1309
>gi|55376536|ref|YP_134388.1| hypothetical protein pNG6142 [Haloarcula marismortui ATCC 43049]
gi|55229261|gb|AAV44682.1| unknown [Haloarcula marismortui ATCC 43049]
Length=316
Score = 35.4 bits (80), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 26/87 (30%), Positives = 37/87 (43%), Gaps = 4/87 (4%)
Query 41 APLEVREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAI 100
+PL+ R L R + H+ L AF R +VEH A E P+ P++A +
Sbjct 123 SPLKHRALLSRSDSKYALHDEVSPLLAFARATVEHEHRARVREIAPSATIAWCDPKRALV 182
Query 101 NRQLGLAGDDEPDGDDTPPWSRMIGLG 127
Q D++ D P M GLG
Sbjct 183 RLQ----TDEDTDALQAAPDWEMTGLG 205
>gi|325473673|gb|EGC76862.1| translocase subunit secA [Treponema denticola F0402]
Length=922
Score = 34.3 bits (77), Expect = 7.1, Method: Composition-based stats.
Identities = 20/77 (26%), Positives = 34/77 (45%), Gaps = 4/77 (5%)
Query 40 DAPLEVREALVRLQAALNRHEHTGELEAFLRISVEHAD----AAGGDECGPAILAGRSGP 95
D +E+ LVR+Q + H ++ + +H + G GP L+ RS P
Sbjct 829 DIRIEIASRLVRVQISTEEEAHASRQMRSIQGNAQHNSMGSFSGSGHGMGPTALSARSRP 888
Query 96 EQAAINRQLGLAGDDEP 112
E A + R + G ++P
Sbjct 889 ENAQVVRTVPKVGRNDP 905
>gi|42527403|ref|NP_972501.1| preprotein translocase subunit SecA [Treponema denticola ATCC
35405]
gi|81831381|sp|Q73LG6.1|SECA_TREDE RecName: Full=Protein translocase subunit secA
gi|41817988|gb|AAS12412.1| preprotein translocase, SecA subunit [Treponema denticola ATCC
35405]
Length=922
Score = 34.3 bits (77), Expect = 7.1, Method: Composition-based stats.
Identities = 20/77 (26%), Positives = 34/77 (45%), Gaps = 4/77 (5%)
Query 40 DAPLEVREALVRLQAALNRHEHTGELEAFLRISVEHAD----AAGGDECGPAILAGRSGP 95
D +E+ LVR+Q + H ++ + +H + G GP L+ RS P
Sbjct 829 DIRIEIASRLVRVQISTEEEAHASRQMRSIQGNAQHNSMGSFSGSGHGMGPTALSARSRP 888
Query 96 EQAAINRQLGLAGDDEP 112
E A + R + G ++P
Sbjct 889 ENAQVVRTVPKVGRNDP 905
Lambda K H
0.315 0.135 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128858389450
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40