BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1943c
Length=125
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609080|ref|NP_216459.1| hypothetical protein Rv1943c [Mycob... 248 2e-64
gi|108802469|ref|YP_642665.1| hypothetical protein Mmcs_5509 [My... 97.1 7e-19
gi|284991044|ref|YP_003409598.1| hypothetical protein Gobs_2570 ... 95.5 2e-18
gi|15609083|ref|NP_216462.1| lipoprotein [Mycobacterium tubercul... 43.9 0.009
gi|31793138|ref|NP_855631.1| lipoprotein [Mycobacterium bovis AF... 43.5 0.009
gi|289574629|ref|ZP_06454856.1| lipoprotein lppG [Mycobacterium ... 43.5 0.011
gi|332307877|ref|YP_004435728.1| glycoside hydrolase family 2 su... 36.6 1.2
gi|20338775|emb|CAD30260.1| putative insulin receptor [Echinococ... 35.4 2.8
gi|116255140|ref|YP_770974.1| putative transcriptional regulator... 35.0 3.6
gi|37521856|ref|NP_925233.1| hypothetical protein gsr2287 [Gloeo... 34.7 4.9
gi|324508515|gb|ADY43594.1| Carboxypeptidase A2 [Ascaris suum] 33.9 8.0
>gi|15609080|ref|NP_216459.1| hypothetical protein Rv1943c [Mycobacterium tuberculosis H37Rv]
gi|15841414|ref|NP_336451.1| hypothetical protein MT1993 [Mycobacterium tuberculosis CDC1551]
gi|31793135|ref|NP_855628.1| hypothetical protein Mb1978c [Mycobacterium bovis AF2122/97]
81 more sequence titles
Length=125
Score = 248 bits (634), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 124/125 (99%), Positives = 125/125 (100%), Gaps = 0/125 (0%)
Query 1 VKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNRIRLSTTVDAA 60
+KTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNRIRLSTTVDAA
Sbjct 1 MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNRIRLSTTVDAA 60
Query 61 LLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRR 120
LLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRR
Sbjct 61 LLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRR 120
Query 121 AAGDS 125
AAGDS
Sbjct 121 AAGDS 125
>gi|108802469|ref|YP_642665.1| hypothetical protein Mmcs_5509 [Mycobacterium sp. MCS]
gi|119855296|ref|YP_935899.1| hypothetical protein Mkms_5910 [Mycobacterium sp. KMS]
gi|108772888|gb|ABG11609.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119698013|gb|ABL95084.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=78
Score = 97.1 bits (240), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 48/74 (65%), Positives = 57/74 (78%), Gaps = 0/74 (0%)
Query 48 MNRIRLSTTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYDKHPVD 107
M R R+STTVDA LL SAR + D AL DEAL ALLA HR+AEVDA YAAYD+HP D
Sbjct 1 MPRTRVSTTVDADLLASARALLPAAKDHALFDEALGALLAAHRAAEVDAGYAAYDEHPAD 60
Query 108 EPDEWGDLASWRRA 121
EPD+WGD+ +WR++
Sbjct 61 EPDQWGDVVAWRQS 74
>gi|284991044|ref|YP_003409598.1| hypothetical protein Gobs_2570 [Geodermatophilus obscurus DSM
43160]
gi|284064289|gb|ADB75227.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=58
Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/58 (80%), Positives = 53/58 (92%), Gaps = 0/58 (0%)
Query 68 MRAGITDAALIDEALAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRRAAGDS 125
MR+G TDAA+IDEAL AL+ARHRSAEV+ASY AYD+HP+DEPDEWGDLAS+R AAG S
Sbjct 1 MRSGNTDAAMIDEALRALVARHRSAEVEASYTAYDEHPLDEPDEWGDLASFREAAGRS 58
>gi|15609083|ref|NP_216462.1| lipoprotein [Mycobacterium tuberculosis H37Rv]
gi|15841417|ref|NP_336454.1| hypothetical protein MT1997 [Mycobacterium tuberculosis CDC1551]
gi|148661754|ref|YP_001283277.1| putative lipoprotein LppG [Mycobacterium tuberculosis H37Ra]
40 more sequence titles
Length=150
Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/28 (72%), Positives = 21/28 (75%), Gaps = 0/28 (0%)
Query 11 RCAVDLINSSSDQCFARIEHVASDQADP 38
+ AVDLIN S QCFARIEHVA QA P
Sbjct 33 KVAVDLINLSGIQCFARIEHVAHAQAHP 60
>gi|31793138|ref|NP_855631.1| lipoprotein [Mycobacterium bovis AF2122/97]
gi|121637851|ref|YP_978074.1| putative lipoprotein [Mycobacterium bovis BCG str. Pasteur 1173P2]
gi|224990335|ref|YP_002645022.1| putative lipoprotein [Mycobacterium bovis BCG str. Tokyo 172]
gi|31618729|emb|CAD94683.1| POSSIBLE LIPOPROTEIN [Mycobacterium bovis AF2122/97]
gi|121493498|emb|CAL71972.1| Possible lipoprotein [Mycobacterium bovis BCG str. Pasteur 1173P2]
gi|224773448|dbj|BAH26254.1| putative lipoprotein [Mycobacterium bovis BCG str. Tokyo 172]
gi|341601878|emb|CCC64552.1| possible lipoprotein [Mycobacterium bovis BCG str. Moreau RDJ]
Length=150
Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/28 (72%), Positives = 21/28 (75%), Gaps = 0/28 (0%)
Query 11 RCAVDLINSSSDQCFARIEHVASDQADP 38
+ AVDLIN S QCFARIEHVA QA P
Sbjct 33 KVAVDLINLSGIQCFARIEHVAHAQAHP 60
>gi|289574629|ref|ZP_06454856.1| lipoprotein lppG [Mycobacterium tuberculosis K85]
gi|289539060|gb|EFD43638.1| lipoprotein lppG [Mycobacterium tuberculosis K85]
Length=150
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/28 (72%), Positives = 21/28 (75%), Gaps = 0/28 (0%)
Query 11 RCAVDLINSSSDQCFARIEHVASDQADP 38
+ AVDLIN S QCFARIEHVA QA P
Sbjct 33 KVAVDLINLSGIQCFARIEHVAHAQAHP 60
>gi|332307877|ref|YP_004435728.1| glycoside hydrolase family 2 sugar binding protein [Glaciecola
agarilytica 4H-3-7+YE-5]
gi|332175206|gb|AEE24460.1| glycoside hydrolase family 2 sugar binding protein [Glaciecola
sp. 4H-3-7+YE-5]
Length=850
Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats.
Identities = 25/96 (27%), Positives = 44/96 (46%), Gaps = 12/96 (12%)
Query 26 ARIEHVASDQADPRPGVWHSSGMNRIRLSTTVDAALLTSARDMRAGITDAALIDEALAAL 85
+++H+A R + S+ + ++ LS V A ++T + + G TDA L
Sbjct 479 GQVDHLA------RTAKYLSNIVKKLDLSRPVTAGMVTPSVNFVTGYTDA------LDVA 526
Query 86 LARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRRA 121
+RS E DA++ Y P+ + WG W+ A
Sbjct 527 GFNYRSQEYDAAHEMYPDKPIYGSENWGTWPEWKAA 562
>gi|20338775|emb|CAD30260.1| putative insulin receptor [Echinococcus multilocularis]
Length=1749
Score = 35.4 bits (80), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/91 (33%), Positives = 42/91 (47%), Gaps = 8/91 (8%)
Query 36 ADPRPGVWHSSGMNRIRLS-TTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEV 94
A P VW SG RL + L+ + +D R+ +T DE A + R S++
Sbjct 1362 ASPTKRVWTGSG----RLGRGSFSRLLMKNQKDFRSTLTTQ---DETGVATVQRCSSSDS 1414
Query 95 DASYAAYDKHPVDEPDEWGDLASWRRAAGDS 125
++ Y V E E GDLAS+ R GDS
Sbjct 1415 IRPFSQYGLFVVMELMESGDLASYLRKLGDS 1445
>gi|116255140|ref|YP_770974.1| putative transcriptional regulator [Rhizobium leguminosarum bv.
viciae 3841]
gi|115259788|emb|CAK02880.1| putative transcriptional regulator [Rhizobium leguminosarum bv.
viciae 3841]
Length=327
Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/114 (29%), Positives = 53/114 (47%), Gaps = 9/114 (7%)
Query 16 LINSSSDQCFARIEHVASDQADPRPGVWHSSGMNRI-RLSTTVDAALLTSARDM---RAG 71
L N QCF + + D D + + G+NRI +LS +D AL+ S D+
Sbjct 167 LANQLGAQCFLLMAPLLVDSPDTKERLIEKCGLNRIMKLSADLDIALV-SVGDIGTHSTS 225
Query 72 ITDAALIDEALAALLARHRSAEVDASYAAYDK----HPVDEPDEWGDLASWRRA 121
++ A+L E L L+ + +V ++ D HPV++ DL + RRA
Sbjct 226 LSVASLAPEELETLIGKGAMCDVLCNFLDRDGRTVDHPVNDRVMSVDLDTVRRA 279
>gi|37521856|ref|NP_925233.1| hypothetical protein gsr2287 [Gloeobacter violaceus PCC 7421]
gi|35212855|dbj|BAC90228.1| gsr2287 [Gloeobacter violaceus PCC 7421]
Length=77
Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/44 (46%), Positives = 26/44 (60%), Gaps = 0/44 (0%)
Query 82 LAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRRAAGDS 125
L ALL R RSA+ A A KH ++P+ +LA+W AGDS
Sbjct 29 LNALLHRQRSAQRRAELIASLKHDAEDPEYLAELAAWDGVAGDS 72
>gi|324508515|gb|ADY43594.1| Carboxypeptidase A2 [Ascaris suum]
Length=564
Score = 33.9 bits (76), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 33/108 (31%), Positives = 46/108 (43%), Gaps = 11/108 (10%)
Query 16 LINSSSD------QCFARIEHVASDQADP---RPGVWHSSGMNRIRLSTTVDAALLTSAR 66
L N SSD + + H A A+P RP + S N + T D LT R
Sbjct 49 LYNKSSDLQLDFWKAPTALGHFADVMANPNMARPLITFLSEHNIPHIVTIDDVQGLTIER 108
Query 67 DMRAGITDAALIDEALAALLARHR--SAEVDASYAAYDKHPVDEPDEW 112
+ R TDA+ +D + A L R + S+ A Y D H +E +W
Sbjct 109 EGRRSETDASDLDPIMQAFLKRAKDASSRNKAKYGFGDYHSYNEMVQW 156
Lambda K H
0.317 0.128 0.379
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130354689300
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40