BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3639c
Length=188
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610775|ref|NP_218156.1| hypothetical protein Rv3639c [Mycob... 374 3e-102
gi|31794809|ref|NP_857302.1| hypothetical protein Mb3663c [Mycob... 372 1e-101
gi|340628603|ref|YP_004747055.1| hypothetical protein MCAN_36501... 371 3e-101
gi|31791882|ref|NP_854375.1| hypothetical protein Mb0717 [Mycoba... 60.8 1e-07
gi|298524190|ref|ZP_07011599.1| conserved hypothetical protein [... 60.5 1e-07
gi|15607838|ref|NP_215212.1| hypothetical protein Rv0698 [Mycoba... 60.5 1e-07
gi|289446257|ref|ZP_06436001.1| conserved hypothetical protein [... 58.9 4e-07
gi|289568046|ref|ZP_06448273.1| conserved hypothetical protein [... 57.4 9e-07
gi|340628813|ref|YP_004747265.1| hypothetical protein MCAN_38631... 57.4 9e-07
gi|298707651|emb|CBJ25968.1| similar to Kelch-like 2, Mayven (Dr... 34.7 6.5
>gi|15610775|ref|NP_218156.1| hypothetical protein Rv3639c [Mycobacterium tuberculosis H37Rv]
gi|15843249|ref|NP_338286.1| hypothetical protein MT3741 [Mycobacterium tuberculosis CDC1551]
gi|148663502|ref|YP_001285025.1| hypothetical protein MRA_3675 [Mycobacterium tuberculosis H37Ra]
41 more sequence titles
Length=188
Score = 374 bits (960), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 188/188 (100%), Positives = 188/188 (100%), Gaps = 0/188 (0%)
Query 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF 60
MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF
Sbjct 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF 60
Query 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH
Sbjct 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
Query 121 VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP
Sbjct 121 VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
Query 181 SAGQQGPS 188
SAGQQGPS
Sbjct 181 SAGQQGPS 188
>gi|31794809|ref|NP_857302.1| hypothetical protein Mb3663c [Mycobacterium bovis AF2122/97]
gi|121639552|ref|YP_979776.1| hypothetical protein BCG_3697c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224992049|ref|YP_002646738.1| hypothetical protein JTY_3698 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=188
Score = 372 bits (955), Expect = 1e-101, Method: Compositional matrix adjust.
Identities = 187/188 (99%), Positives = 187/188 (99%), Gaps = 0/188 (0%)
Query 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF 60
MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAA DSAQDGF
Sbjct 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAANDSAQDGF 60
Query 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH
Sbjct 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
Query 121 VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP
Sbjct 121 VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
Query 181 SAGQQGPS 188
SAGQQGPS
Sbjct 181 SAGQQGPS 188
>gi|340628603|ref|YP_004747055.1| hypothetical protein MCAN_36501 [Mycobacterium canettii CIPT
140010059]
gi|340006793|emb|CCC45981.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=188
Score = 371 bits (953), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 186/188 (99%), Positives = 186/188 (99%), Gaps = 0/188 (0%)
Query 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF 60
MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF
Sbjct 1 MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPPAAKDSAQDGF 60
Query 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH
Sbjct 61 RHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGPTPAPRGLATRQCPPRTVH 120
Query 121 VDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
VDRVRPNGAERALRARFRPILRPQFTLGDG NGLPLAACTKTGAYVPHLPYSPIAVDPQP
Sbjct 121 VDRVRPNGAERALRARFRPILRPQFTLGDGTNGLPLAACTKTGAYVPHLPYSPIAVDPQP 180
Query 181 SAGQQGPS 188
SA QQGPS
Sbjct 181 SASQQGPS 188
>gi|31791882|ref|NP_854375.1| hypothetical protein Mb0717 [Mycobacterium bovis AF2122/97]
gi|121636619|ref|YP_976842.1| hypothetical protein BCG_0747 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224989091|ref|YP_002643778.1| hypothetical protein JTY_0717 [Mycobacterium bovis BCG str. Tokyo
172]
6 more sequence titles
Length=109
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/45 (72%), Positives = 33/45 (74%), Gaps = 2/45 (4%)
Query 117 RTVHVDRVRPNGAERALRARFR--PILRPQFTLGDGANGLPLAAC 159
R VHVDRVR G ER LRA + PI RPQ TLGDGANGLPLA C
Sbjct 7 RRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGANGLPLAVC 51
>gi|298524190|ref|ZP_07011599.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298493984|gb|EFI29278.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=203
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/45 (72%), Positives = 33/45 (74%), Gaps = 2/45 (4%)
Query 117 RTVHVDRVRPNGAERALRARFR--PILRPQFTLGDGANGLPLAAC 159
R VHVDRVR G ER LRA + PI RPQ TLGDGANGLPLA C
Sbjct 7 RRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGANGLPLAVC 51
>gi|15607838|ref|NP_215212.1| hypothetical protein Rv0698 [Mycobacterium tuberculosis H37Rv]
gi|148660473|ref|YP_001281996.1| hypothetical protein MRA_0706 [Mycobacterium tuberculosis H37Ra]
gi|148821903|ref|YP_001286657.1| hypothetical protein TBFG_10712 [Mycobacterium tuberculosis F11]
21 more sequence titles
Length=203
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/45 (72%), Positives = 33/45 (74%), Gaps = 2/45 (4%)
Query 117 RTVHVDRVRPNGAERALRARFR--PILRPQFTLGDGANGLPLAAC 159
R VHVDRVR G ER LRA + PI RPQ TLGDGANGLPLA C
Sbjct 7 RRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGANGLPLAVC 51
>gi|289446257|ref|ZP_06436001.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289568641|ref|ZP_06448868.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289573306|ref|ZP_06453533.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289419215|gb|EFD16416.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289537737|gb|EFD42315.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289542395|gb|EFD46043.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=101
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/43 (70%), Positives = 32/43 (75%), Gaps = 2/43 (4%)
Query 119 VHVDRVRPNGAERALRARFR--PILRPQFTLGDGANGLPLAAC 159
+HVDRVR G ER LRA + PI RPQ TLGDGANGLPLA C
Sbjct 1 MHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGANGLPLAVC 43
>gi|289568046|ref|ZP_06448273.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289541799|gb|EFD45448.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=43
Score = 57.4 bits (137), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/30 (90%), Positives = 28/30 (94%), Gaps = 0/30 (0%)
Query 133 LRARFRPILRPQFTLGDGANGLPLAACTKT 162
+RARFRPILRPQFTLGDGANGLPLA TKT
Sbjct 1 MRARFRPILRPQFTLGDGANGLPLAGRTKT 30
>gi|340628813|ref|YP_004747265.1| hypothetical protein MCAN_38631 [Mycobacterium canettii CIPT
140010059]
gi|340007003|emb|CCC46194.1| putative uncharacterized protein [Mycobacterium canettii CIPT
140010059]
Length=119
Score = 57.4 bits (137), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 32/55 (59%), Positives = 36/55 (66%), Gaps = 1/55 (1%)
Query 124 VRPNGAERALRARFRPILRPQFTLGDGANGLPL-AACTKTGAYVPHLPYSPIAVD 177
+RPNGAER LRARFRPILRPQFTLG+GA+ AA + L Y I VD
Sbjct 1 MRPNGAERELRARFRPILRPQFTLGEGADERRFTAADRRQRGGCRQLAYLAIRVD 55
>gi|298707651|emb|CBJ25968.1| similar to Kelch-like 2, Mayven (Drosophila), partial [Ectocarpus
siliculosus]
Length=567
Score = 34.7 bits (78), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/72 (34%), Positives = 38/72 (53%), Gaps = 8/72 (11%)
Query 29 LLAVSDRNGI-----VSTSATTCNYPPAAKDSAQDGFRHALAAAIAADI--DEALRHGYG 81
LL+V+DR V + T +P +DS+ DGFRH L+AA+ ++ D+ L G
Sbjct 262 LLSVADRYDCHRLRRVILAYTLERFPTTCRDSSSDGFRH-LSAALVVEVLKDDRLAAGQE 320
Query 82 DLLELAYPLMSW 93
L + + +SW
Sbjct 321 GELAVFWAAISW 332
Lambda K H
0.319 0.135 0.424
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 183812957610
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40