BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3196A
Length=66
                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value
gi|15842779|ref|NP_337816.1| hypothetical protein MT3289 [Mycoba...  134     4e-30
gi|118617991|ref|YP_906323.1| hypothetical protein MUL_2509 [Myc...  109     1e-22
gi|289571416|ref|ZP_06451643.1| LOW QUALITY PROTEIN: hypothetica...  107     7e-22
gi|183981390|ref|YP_001849681.1| hypothetical protein MMAR_1367 ...  99.4    2e-19
gi|240169965|ref|ZP_04748624.1| hypothetical protein MkanA1_1167...  98.2    3e-19
gi|296169038|ref|ZP_06850700.1| conserved hypothetical protein [...  90.5    8e-17
gi|62859731|ref|NP_001016708.1| Golgi to ER traffic protein 4 ho...  33.9    9.3
>gi|15842779|ref|NP_337816.1| hypothetical protein MT3289 [Mycobacterium tuberculosis CDC1551]
gi|31794371|ref|NP_856864.1| hypothetical protein Mb3219c [Mycobacterium bovis AF2122/97]
gi|57117069|ref|YP_177939.1| hypothetical protein Rv3196A [Mycobacterium tuberculosis H37Rv]
76 more sequence titles
Length=66
Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 65/66 (99%), Positives = 66/66 (100%), Gaps = 0/66 (0%)
Query  1   VQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVP  60
           +QEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVP
Sbjct  1   MQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVP  60
Query  61  EQGRLG  66
           EQGRLG
Sbjct  61  EQGRLG  66
>gi|118617991|ref|YP_906323.1| hypothetical protein MUL_2509 [Mycobacterium ulcerans Agy99]
gi|118570101|gb|ABL04852.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=72
Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/64 (83%), Positives = 58/64 (91%), Gaps = 0/64 (0%)
Query  3   EGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQ  62
           EGGP+E MSA+ TQH+AADALFRAIIETLDKHR + TLTE VLD LARAYA+ISTNVPEQ
Sbjct  9   EGGPREAMSAKQTQHEAADALFRAIIETLDKHRKDCTLTEGVLDDLARAYAAISTNVPEQ  68
Query  63  GRLG  66
           GRLG
Sbjct  69  GRLG  72
>gi|289571416|ref|ZP_06451643.1| LOW QUALITY PROTEIN: hypothetical protein TBJG_01947 [Mycobacterium
tuberculosis T17]
gi|289545170|gb|EFD48818.1| LOW QUALITY PROTEIN: hypothetical protein TBJG_01947 [Mycobacterium
tuberculosis T17]
Length=52
Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 52/52 (100%), Positives = 52/52 (100%), Gaps = 0/52 (0%)
Query  15  TQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG  66
           TQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG
Sbjct  1   TQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG  52
>gi|183981390|ref|YP_001849681.1| hypothetical protein MMAR_1367 [Mycobacterium marinum M]
gi|183174716|gb|ACC39826.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=57
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/57 (85%), Positives = 52/57 (92%), Gaps = 0/57 (0%)
Query  10  MSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG  66
           MSA+ TQH+AADALFRAIIETLDKHR + TLTE VLD LARAYA+ISTNVPEQGRLG
Sbjct  1   MSAKQTQHEAADALFRAIIETLDKHRKDCTLTEGVLDDLARAYAAISTNVPEQGRLG  57
>gi|240169965|ref|ZP_04748624.1| hypothetical protein MkanA1_11679 [Mycobacterium kansasii ATCC
12478]
Length=57
Score = 98.2 bits (243), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/57 (83%), Positives = 51/57 (90%), Gaps = 0/57 (0%)
Query  10  MSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG  66
           MSA+ TQHD ADALFRAIIETLDKHR + TLTE VLD LARAYA++STNVPEQGRLG
Sbjct  1   MSAKQTQHDTADALFRAIIETLDKHRKDGTLTEGVLDDLARAYAAVSTNVPEQGRLG  57
>gi|296169038|ref|ZP_06850700.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896297|gb|EFG75956.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=57
Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 42/57 (74%), Positives = 49/57 (86%), Gaps = 0/57 (0%)
Query  10  MSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQGRLG  66
           M+A+ +QHDAAD+LFRAIIETLDKHRN+ TLT +VL  LA AY S+S NVPEQGRLG
Sbjct  1   MAAKQSQHDAADSLFRAIIETLDKHRNDGTLTAEVLHELATAYGSVSANVPEQGRLG  57
>gi|62859731|ref|NP_001016708.1| Golgi to ER traffic protein 4 homolog [Xenopus (Silurana) tropicalis]
gi|317376178|sp|A4QNE0.1|GET4_XENTR RecName: Full=Golgi to ER traffic protein 4 homolog
gi|140833033|gb|AAI35490.1| hypothetical protein LOC549462 [Xenopus (Silurana) tropicalis]
Length=325
Score = 33.9 bits (76), Expect = 9.3, Method: Composition-based stats.
Identities = 19/51 (38%), Positives = 34/51 (67%), Gaps = 3/51 (5%)
Query  12  ARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLARAYASISTNVPEQ  62
           + S Q+ AAD L   ++E+L+KH  E  +TE++L+ LA+ ++ +  N PE+
Sbjct  73  SHSQQNSAAD-LSMLVLESLEKH--EVKVTEELLENLAKLFSLMDPNSPER  120
Lambda      K        H
   0.311    0.126    0.339
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 129951228512
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40