BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1954c
Length=173
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609091|ref|NP_216470.1| hypothetical protein Rv1954c [Mycob... 342 1e-92
gi|289750537|ref|ZP_06509915.1| hypothetical protein TBDG_00746 ... 340 4e-92
gi|340626963|ref|YP_004745415.1| hypothetical protein MCAN_19701... 340 6e-92
gi|31793146|ref|NP_855639.1| hypothetical protein Mb1989c [Mycob... 339 8e-92
gi|302798641|ref|XP_002981080.1| hypothetical protein SELMODRAFT... 37.4 0.86
gi|302801592|ref|XP_002982552.1| hypothetical protein SELMODRAFT... 36.6 1.5
gi|134103271|ref|YP_001108932.1| hypothetical protein SACE_6843 ... 35.0 4.1
gi|164660216|ref|XP_001731231.1| hypothetical protein MGL_1414 [... 34.3 7.1
>gi|15609091|ref|NP_216470.1| hypothetical protein Rv1954c [Mycobacterium tuberculosis H37Rv]
gi|167970530|ref|ZP_02552807.1| hypothetical protein MtubH3_21853 [Mycobacterium tuberculosis
H37Ra]
gi|289443436|ref|ZP_06433180.1| hypothetical protein TBLG_00537 [Mycobacterium tuberculosis T46]
17 more sequence titles
Length=173
Score = 342 bits (877), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 173/173 (100%), Positives = 173/173 (100%), Gaps = 0/173 (0%)
Query 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA
Sbjct 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
Query 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP
Sbjct 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
Query 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI
Sbjct 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
>gi|289750537|ref|ZP_06509915.1| hypothetical protein TBDG_00746 [Mycobacterium tuberculosis T92]
gi|289691124|gb|EFD58553.1| hypothetical protein TBDG_00746 [Mycobacterium tuberculosis T92]
Length=173
Score = 340 bits (873), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 172/173 (99%), Positives = 172/173 (99%), Gaps = 0/173 (0%)
Query 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA
Sbjct 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
Query 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP
Sbjct 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
Query 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKP YTRI
Sbjct 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPVYTRI 173
>gi|340626963|ref|YP_004745415.1| hypothetical protein MCAN_19701 [Mycobacterium canettii CIPT
140010059]
gi|340005153|emb|CCC44302.1| hypothetical protein MCAN_19701 [Mycobacterium canettii CIPT
140010059]
Length=173
Score = 340 bits (871), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 172/173 (99%), Positives = 172/173 (99%), Gaps = 0/173 (0%)
Query 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
MA GSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA
Sbjct 1 MARGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
Query 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP
Sbjct 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
Query 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI
Sbjct 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
>gi|31793146|ref|NP_855639.1| hypothetical protein Mb1989c [Mycobacterium bovis AF2122/97]
gi|121637859|ref|YP_978082.1| hypothetical protein BCG_1993c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990343|ref|YP_002645030.1| hypothetical protein JTY_1977 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31618737|emb|CAD94691.1| HYPOTHETICAL PROTEIN Mb1989c [Mycobacterium bovis AF2122/97]
gi|121493506|emb|CAL71980.1| Hypothetical protein BCG_1993c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773456|dbj|BAH26262.1| hypothetical protein JTY_1977 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341601886|emb|CCC64560.1| hypothetical protein BCGM_1967c [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=173
Score = 339 bits (870), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 172/173 (99%), Positives = 172/173 (99%), Gaps = 0/173 (0%)
Query 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA
Sbjct 1 MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAA 60
Query 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP
Sbjct 61 LVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCP 120
Query 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPRWGPFRLKPAYTRI 173
RLDDHQHRHPTRCRAEHAGCTVATCIPNA DPAPGHQTPRWGPFRLKPAYTRI
Sbjct 121 RLDDHQHRHPTRCRAEHAGCTVATCIPNARDPAPGHQTPRWGPFRLKPAYTRI 173
>gi|302798641|ref|XP_002981080.1| hypothetical protein SELMODRAFT_444776 [Selaginella moellendorffii]
gi|300151134|gb|EFJ17781.1| hypothetical protein SELMODRAFT_444776 [Selaginella moellendorffii]
Length=969
Score = 37.4 bits (85), Expect = 0.86, Method: Composition-based stats.
Identities = 41/148 (28%), Positives = 64/148 (44%), Gaps = 27/148 (18%)
Query 5 SGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAALVLR 64
+ GG G+++ + + G+P + E + L L D ++ + A L +
Sbjct 716 AAGGAYGIIISCMQT-----GSPRMKEDAAAVLTRLTDSELDANSEQE-----LARLGVM 765
Query 65 RIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCPRLDD 124
R+ L TG R R+ A AN A LS+R LT+ +SF L+ RL
Sbjct 766 RLLRDTLETGSERAREH---------ACANLANLSKRTPSLTQEQSFFKRLLA---RLGL 813
Query 125 HQHR----HPTRCRAEHAGCTV-ATCIP 147
Q+R HP +C A + C V A +P
Sbjct 814 KQYRLCVVHPGKCNARASFCMVEAGVVP 841
>gi|302801592|ref|XP_002982552.1| hypothetical protein SELMODRAFT_445238 [Selaginella moellendorffii]
gi|300149651|gb|EFJ16305.1| hypothetical protein SELMODRAFT_445238 [Selaginella moellendorffii]
Length=969
Score = 36.6 bits (83), Expect = 1.5, Method: Composition-based stats.
Identities = 41/148 (28%), Positives = 64/148 (44%), Gaps = 27/148 (18%)
Query 5 SGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAALVLR 64
+ GG G+++ + + G+P + E + L L D ++ + A L +
Sbjct 716 AAGGAYGIIISCMQT-----GSPRMKEEAAAVLTRLTDSVLDANSEQE-----LARLGVM 765
Query 65 RIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCPRLDD 124
R+ L TG R R+ A AN A LS+R LT+ +SF L+ RL
Sbjct 766 RLLRDTLETGSERAREH---------ACANLANLSKRTPSLTQEQSFFKRLLA---RLGL 813
Query 125 HQHR----HPTRCRAEHAGCTV-ATCIP 147
Q+R HP +C A + C V A +P
Sbjct 814 KQYRLCVVHPGKCNARASFCMVEAGVVP 841
>gi|134103271|ref|YP_001108932.1| hypothetical protein SACE_6843 [Saccharopolyspora erythraea NRRL
2338]
gi|291007936|ref|ZP_06565909.1| hypothetical protein SeryN2_25736 [Saccharopolyspora erythraea
NRRL 2338]
gi|133915894|emb|CAM06007.1| hypothetical protein SACE_6843 [Saccharopolyspora erythraea NRRL
2338]
Length=547
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 39/149 (27%), Positives = 55/149 (37%), Gaps = 20/149 (13%)
Query 24 DGAPTVPEGSDKALM-HLGDPPRRCDTHPDGTSSAAAALVLRRIDVHPLLTGLGR----- 77
DG P PE ++A + HL P RC T P A + R+DV L G R
Sbjct 32 DGEPVEPERVERAFVDHLPVPASRCPTFPQPAGYGPATEPVLRVDVEAELVGALRRLAPE 91
Query 78 GRQTVSLR---NGHLVATANRAILSRRRSRLTRGRSFTSHLITSCPRLDDHQHRHPTRCR 134
G Q +SL G +A + A++ L +HR T
Sbjct 92 GWQRLSLSCAALGERIAVSATAVVGGAELSWIAPFEVVEWL---------RRHRALTYTP 142
Query 135 AEHAGCTVATCIPNAHDPA--PGHQTPRW 161
A + + + +PA P H+ PRW
Sbjct 143 GAGAWSNLGIEVADGGEPAFTPDHEPPRW 171
>gi|164660216|ref|XP_001731231.1| hypothetical protein MGL_1414 [Malassezia globosa CBS 7966]
gi|159105131|gb|EDP44017.1| hypothetical protein MGL_1414 [Malassezia globosa CBS 7966]
Length=1123
Score = 34.3 bits (77), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 22/57 (39%), Positives = 28/57 (50%), Gaps = 1/57 (1%)
Query 14 LPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDT-HPDGTSSAAAALVLRRIDVH 69
L RV LD P +PEG + DP R CD H +S A+ALVL ++ H
Sbjct 520 LERVLRPESLDDEPHMPEGLSRTFAWTIDPARVCDILHRAKRTSVASALVLDAMEEH 576
Lambda K H
0.321 0.136 0.432
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 143230884104
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40