BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1948c
Length=116
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609085|ref|NP_216464.1| hypothetical protein Rv1948c [Mycob... 238 3e-61
gi|31793140|ref|NP_855633.1| hypothetical protein Mb1983c [Mycob... 234 2e-60
gi|289574631|ref|ZP_06454858.1| conserved hypothetical protein [... 232 1e-59
gi|254818369|ref|ZP_05223370.1| hypothetical protein MintA_00500... 68.2 4e-10
gi|342859307|ref|ZP_08715961.1| hypothetical protein MCOL_10523 ... 63.2 1e-08
gi|293401739|ref|ZP_06645880.1| glucose-1-phosphate thymidylyltr... 35.8 2.1
>gi|15609085|ref|NP_216464.1| hypothetical protein Rv1948c [Mycobacterium tuberculosis H37Rv]
gi|15841419|ref|NP_336456.1| hypothetical protein MT1998.1 [Mycobacterium tuberculosis CDC1551]
gi|148661756|ref|YP_001283279.1| hypothetical protein MRA_1958 [Mycobacterium tuberculosis H37Ra]
24 more sequence titles
Length=116
Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 116/116 (100%), Positives = 116/116 (100%), Gaps = 0/116 (0%)
Query 1 MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD
Sbjct 1 MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
Query 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA
Sbjct 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
>gi|31793140|ref|NP_855633.1| hypothetical protein Mb1983c [Mycobacterium bovis AF2122/97]
gi|121637853|ref|YP_978076.1| hypothetical protein BCG_1987c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990337|ref|YP_002645024.1| hypothetical protein JTY_1971 [Mycobacterium bovis BCG str. Tokyo
172]
26 more sequence titles
Length=116
Score = 234 bits (598), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 115/116 (99%), Positives = 115/116 (99%), Gaps = 0/116 (0%)
Query 1 MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
MTVF IKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD
Sbjct 1 MTVFRIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
Query 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA
Sbjct 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
>gi|289574631|ref|ZP_06454858.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339632000|ref|YP_004723642.1| hypothetical protein MAF_19710 [Mycobacterium africanum GM041182]
gi|289539062|gb|EFD43640.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339331356|emb|CCC27041.1| hypothetical protein MAF_19710 [Mycobacterium africanum GM041182]
Length=116
Score = 232 bits (592), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 114/116 (99%), Positives = 114/116 (99%), Gaps = 0/116 (0%)
Query 1 MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
M VF IKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD
Sbjct 1 MPVFRIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTAD 60
Query 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA
Sbjct 61 MELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEYA 116
>gi|254818369|ref|ZP_05223370.1| hypothetical protein MintA_00500 [Mycobacterium intracellulare
ATCC 13950]
Length=346
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 63/115 (55%), Gaps = 9/115 (7%)
Query 8 PDNYFGD-VVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTADMELGSQ 66
P+ YFGD VL D G+ + A+ A + G + + DGV ES AD+ELG
Sbjct 223 PEFYFGDDAVLLTLDGGGVDKLKAALSDAKQHGASRLEHDGVTHEFHIESDAADIELGPI 282
Query 67 TVVWRFDDTKLVEILDKLSPLIDGEG------PGHQYIDDLNSPAPTLMISVDEY 115
VVWR D+ K EI+ L+ L D EG GH Y+ D+++P TL++S DEY
Sbjct 283 HVVWRLDEAKAAEIIADLAVLSD-EGTVGRPTSGHFYV-DMSTPTKTLVVSRDEY 335
>gi|342859307|ref|ZP_08715961.1| hypothetical protein MCOL_10523 [Mycobacterium colombiense CECT
3035]
gi|342133548|gb|EGT86751.1| hypothetical protein MCOL_10523 [Mycobacterium colombiense CECT
3035]
Length=135
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/106 (36%), Positives = 55/106 (52%), Gaps = 2/106 (1%)
Query 11 YFGD-VVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTADMELGSQTVV 69
Y GD VL A D G+ + A + G A D + V E G A++E TVV
Sbjct 13 YMGDDAVLLAMDAAGVDTVLAVLTDATQKGSARLDHGATIHQFVIEPGAAEIEFRDGTVV 72
Query 70 WRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLNSPAPTLMISVDEY 115
WR D EI++ L+ + D G GH Y+ D++ PA L++S++EY
Sbjct 73 WRLDAAVAAEIIELLTEMHDHPGSGHHYV-DISEPADLLVLSLNEY 117
>gi|293401739|ref|ZP_06645880.1| glucose-1-phosphate thymidylyltransferase [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304691|gb|EFE45939.1| glucose-1-phosphate thymidylyltransferase [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length=294
Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/82 (31%), Positives = 37/82 (46%), Gaps = 2/82 (2%)
Query 10 NYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFDIDGVQQRIVRESGTADMELGSQTVV 69
+F +V+ A +++G IF Y VR E G TFD +G + + E +
Sbjct 114 RHFSNVLKKAVEQEGATIFGYYVRDPREYGVVTFDKEG--KVLTLEEKPEHPKSNYAVPG 171
Query 70 WRFDDTKLVEILDKLSPLIDGE 91
F D +VEI K+ P GE
Sbjct 172 LYFYDNDVVEIAKKVKPSARGE 193
Lambda K H
0.319 0.138 0.402
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130541267802
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40