BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
21,062,489 sequences; 7,218,481,314 total letters
Query= Rv3190A Rv3190A Conserved protein 3556855:3557064 forward MW:7632
Length=69
                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value
gi|15842769|ref|NP_337806.1| hypothetical protein MT3279 [Mycoba...  141    4e-32
gi|323718049|gb|EGB27231.1| hypothetical protein TMMG_02328 [Myc...  129    1e-28
gi|392416071|ref|YP_006452676.1| hypothetical protein Mycch_2214...  62.0   4e-08
gi|226349334|ref|YP_002776448.1| hypothetical protein ROP_pROB01...  43.1   0.017
gi|384105189|ref|ZP_10006114.1| hypothetical protein W59_27636 [...  43.1   0.018
>gi|15842769|ref|NP_337806.1| hypothetical protein MT3279 [Mycobacterium tuberculosis CDC1551]
gi|148663045|ref|YP_001284568.1| hypothetical protein MRA_3223 [Mycobacterium tuberculosis H37Ra]
gi|167966986|ref|ZP_02549263.1| hypothetical protein MtubH3_02583 [Mycobacterium tuberculosis
H37Ra]
14 more sequence titles
Length=69
Score = 141 bits (356), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 69/69 (100%), Positives = 69/69 (100%), Gaps = 0/69 (0%)
Query  1   MITVLDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSE  60
           MITVLDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSE
Sbjct  1   MITVLDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSE  60
Query  61  GKGLLSRLS  69
           GKGLLSRLS
Sbjct  61  GKGLLSRLS  69
>gi|323718049|gb|EGB27231.1| hypothetical protein TMMG_02328 [Mycobacterium tuberculosis CDC1551A]
gi|379029535|dbj|BAL67268.1| hypothetical protein ERDMAN_3493 [Mycobacterium tuberculosis
str. Erdman = ATCC 35801]
Length=63
Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/63 (100%), Positives = 63/63 (100%), Gaps = 0/63 (0%)
Query  7   MNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSEGKGLLS  66
           MNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSEGKGLLS
Sbjct  1   MNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSEGKGLLS  60
Query  67  RLS  69
           RLS
Sbjct  61  RLS  63
>gi|392416071|ref|YP_006452676.1| hypothetical protein Mycch_2214 [Mycobacterium chubuense NBB4]
gi|390615847|gb|AFM16997.1| hypothetical protein Mycch_2214 [Mycobacterium chubuense NBB4]
Length=61
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 38/50 (76%), Gaps = 0/50 (0%)
Query  8   NGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQ  57
           NG+++ RPD LPLSA+VWD +  + +   TV + LY+A+K+LEA VIA++
Sbjct  10  NGYQEPRPDHLPLSAAVWDASHNFGRSSQTVVQKLYDAVKQLEADVIAMR  59
>gi|226349334|ref|YP_002776448.1| hypothetical protein ROP_pROB01-00970 [Rhodococcus opacus B4]
gi|226245249|dbj|BAH55596.1| hypothetical protein [Rhodococcus opacus B4]
Length=82
Score = 43.1 bits (100), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/57 (39%), Positives = 33/57 (58%), Gaps = 1/57 (1%)
Query  5   LDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSEG  61
              +GFK  RP     SA +W  A+ + K    V+EA + A+  LEA+V+AL+R+ G
Sbjct  24  FSSHGFKIPRPSG-EHSARIWQAAETFGKDSNAVSEATFHAVVALEAEVVALRRATG  79
>gi|384105189|ref|ZP_10006114.1| hypothetical protein W59_27636 [Rhodococcus imtechensis RKJ300]
gi|383836003|gb|EID75417.1| hypothetical protein W59_27636 [Rhodococcus imtechensis RKJ300]
Length=72
Score = 43.1 bits (100), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/57 (39%), Positives = 33/57 (58%), Gaps = 1/57 (1%)
Query  5   LDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELEAQVIALQRSEG  61
              +GFK  RP     SA +W  A+ + K    V+EA + A+  LEA+V+AL+R+ G
Sbjct  14  FSSHGFKIPRPSG-EHSARIWQAAETFGKDSNAVSEATFHAVVALEAEVVALRRATG  69
Lambda      K        H
   0.316    0.133    0.370
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 177937739420
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Oct 14, 2012 4:13 PM
Number of letters in database: 7,218,481,314
Number of sequences in database: 21,062,489
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
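Note: the bit scores and E-values in this report can be roughly reproduced from the gapped Lambda and K values and the effective search space listed in the footer. The following is a minimal Python sketch, assuming the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'). The function names are hypothetical and are not part of BLAST. Because this search used compositional matrix adjustment and the report rounds or truncates its printed values, the results only approximate the figures shown.

```python
import math

# Gapped statistical parameters and search space copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 177937739420


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Estimate the E-value for a bit score over the given search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (356, 325, 149, 100):
        bits = bit_score(raw)
        print(f"raw={raw:4d}  bits={bits:6.1f}  E={expect_value(bits):.1e}")
```

The report appears to print bit scores of 100 or more as truncated integers, so a computed value slightly above 141 is consistent with the printed 141.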