BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1499
Length=132
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840963|ref|NP_336000.1| hypothetical protein MT1548 [Mycoba... 269 9e-71
gi|31792696|ref|NP_855189.1| hypothetical protein Mb1537 [Mycoba... 268 1e-70
gi|167969309|ref|ZP_02551586.1| hypothetical protein MtubH3_1529... 240 6e-62
gi|289569526|ref|ZP_06449753.1| LOW QUALITY PROTEIN: conserved h... 128 4e-28
gi|289569528|ref|ZP_06449755.1| LOW QUALITY PROTEIN: hypothetica... 67.0 9e-10
gi|145225911|ref|YP_001136565.1| integrase catalytic subunit [My... 42.4 0.026
gi|333380552|ref|ZP_08472243.1| hypothetical protein HMPREF9455_... 37.7 0.52
gi|260579650|ref|ZP_05847518.1| IS3 family transposase OrfB [Cor... 35.4 2.7
gi|306834928|ref|ZP_07467984.1| ISMca2 transposase [Corynebacter... 35.0 3.3
gi|309811398|ref|ZP_07705185.1| transposase OrfB, IS3 family [De... 35.0 4.0
gi|311740827|ref|ZP_07714654.1| IS3 family transposase OrfB [Cor... 34.3 6.3
gi|311740801|ref|ZP_07714628.1| IS3 family transposase OrfB [Cor... 34.3 6.3
gi|311741423|ref|ZP_07715247.1| IS3 family transposase OrfB [Cor... 34.3 6.4
gi|261328687|emb|CBH11665.1| hypothetical protein, conserved [Tr... 34.3 6.7
gi|72390001|ref|XP_845295.1| hypothetical protein [Trypanosoma b... 34.3 7.2
gi|311740961|ref|ZP_07714787.1| transposase [Corynebacterium pse... 33.9 7.5
>gi|15840963|ref|NP_336000.1| hypothetical protein MT1548 [Mycobacterium tuberculosis CDC1551]
gi|254231728|ref|ZP_04925055.1| hypothetical protein TBCG_01476 [Mycobacterium tuberculosis C]
gi|13881170|gb|AAK45814.1| hypothetical protein MT1548 [Mycobacterium tuberculosis CDC1551]
gi|124600787|gb|EAY59797.1| hypothetical protein TBCG_01476 [Mycobacterium tuberculosis C]
Length=156
Score = 269 bits (688), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 132/132 (100%), Positives = 132/132 (100%), Gaps = 0/132 (0%)
Query 1 VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 60
VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP
Sbjct 25 VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 84
Query 61 GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER 120
GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER
Sbjct 85 GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER 144
Query 121 VQNRRGRHSALV 132
VQNRRGRHSALV
Sbjct 145 VQNRRGRHSALV 156
>gi|31792696|ref|NP_855189.1| hypothetical protein Mb1537 [Mycobacterium bovis AF2122/97]
gi|57116878|ref|NP_216015.2| hypothetical protein Rv1499 [Mycobacterium tuberculosis H37Rv]
gi|121637432|ref|YP_977655.1| hypothetical protein BCG_1563 [Mycobacterium bovis BCG str. Pasteur
1173P2]
45 more sequence titles
Length=132
Score = 268 bits (686), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 131/132 (99%), Positives = 132/132 (100%), Gaps = 0/132 (0%)
Query 1 VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 60
+PSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP
Sbjct 1 MPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 60
Query 61 GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER 120
GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER
Sbjct 61 GGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIER 120
Query 121 VQNRRGRHSALV 132
VQNRRGRHSALV
Sbjct 121 VQNRRGRHSALV 132
>gi|167969309|ref|ZP_02551586.1| hypothetical protein MtubH3_15295 [Mycobacterium tuberculosis
H37Ra]
gi|254550519|ref|ZP_05140966.1| hypothetical protein Mtube_08662 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|308231858|ref|ZP_07414028.2| hypothetical protein TMAG_02829 [Mycobacterium tuberculosis SUMu001]
23 more sequence titles
Length=118
Score = 240 bits (612), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 117/118 (99%), Positives = 118/118 (100%), Gaps = 0/118 (0%)
Query 15 LPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYES 74
+PRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYES
Sbjct 1 MPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYES 60
Query 75 RSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV 132
RSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV
Sbjct 61 RSSAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV 118
>gi|289569526|ref|ZP_06449753.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
gi|289543280|gb|EFD46928.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
Length=100
Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%)
Query 1 VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 60
VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP
Sbjct 25 VPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKRPAVGVDEHDP 84
Query 61 G 61
G
Sbjct 85 G 85
>gi|289569528|ref|ZP_06449755.1| LOW QUALITY PROTEIN: hypothetical protein TBJG_03717 [Mycobacterium
tuberculosis T17]
gi|289543282|gb|EFD46930.1| LOW QUALITY PROTEIN: hypothetical protein TBJG_03717 [Mycobacterium
tuberculosis T17]
Length=41
Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/41 (100%), Positives = 41/41 (100%), Gaps = 0/41 (0%)
Query 92 CCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV 132
CCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV
Sbjct 1 CCLYQQPKRPALRPTKAAATTAATTWIERVQNRRGRHSALV 41
>gi|145225911|ref|YP_001136565.1| integrase catalytic subunit [Mycobacterium gilvum PYR-GCK]
gi|145218374|gb|ABP47777.1| Integrase, catalytic region [Mycobacterium gilvum PYR-GCK]
Length=279
Score = 42.4 bits (98), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 21/29 (73%), Positives = 21/29 (73%), Gaps = 0/29 (0%)
Query 103 LRPTKAAATTAATTWIERVQNRRGRHSAL 131
L PTKAAA A WIERV NRR RHSAL
Sbjct 232 LWPTKAAAKLAVGDWIERVYNRRRRHSAL 260
>gi|333380552|ref|ZP_08472243.1| hypothetical protein HMPREF9455_00409 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826547|gb|EGJ99376.1| hypothetical protein HMPREF9455_00409 [Dysgonomonas gadei ATCC
BAA-286]
Length=388
Score = 37.7 bits (86), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 24/85 (29%), Positives = 40/85 (48%), Gaps = 7/85 (8%)
Query 39 LVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQP 98
L + + F Q + E++P A P++ I ++RS G +G +TT A Y P
Sbjct 14 LFISVTGFSQNK------EYNPIETAVPSLTIAPDARSGGMGDVGAATTPD-AYSQYWNP 66
Query 99 KRPALRPTKAAATTAATTWIERVQN 123
+ A +KA+ + T W+ V N
Sbjct 67 AKYAFATSKASLALSYTPWMRSVVN 91
>gi|260579650|ref|ZP_05847518.1| IS3 family transposase OrfB [Corynebacterium jeikeium ATCC 43734]
gi|258602220|gb|EEW15529.1| IS3 family transposase OrfB [Corynebacterium jeikeium ATCC 43734]
Length=276
Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/27 (63%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A WIE V NRR RHSAL
Sbjct 250 PTRDAARKAVAYWIEVVYNRRRRHSAL 276
>gi|306834928|ref|ZP_07467984.1| ISMca2 transposase [Corynebacterium accolens ATCC 49726]
gi|304569190|gb|EFM44699.1| ISMca2 transposase [Corynebacterium accolens ATCC 49726]
Length=112
Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/27 (63%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A WIE V NRR RHSAL
Sbjct 63 PTRDAARQAVAYWIEAVYNRRRRHSAL 89
>gi|309811398|ref|ZP_07705185.1| transposase OrfB, IS3 family [Dermacoccus sp. Ellin185]
gi|308434705|gb|EFP58550.1| transposase OrfB, IS3 family [Dermacoccus sp. Ellin185]
Length=105
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/26 (62%), Positives = 19/26 (74%), Gaps = 0/26 (0%)
Query 106 TKAAATTAATTWIERVQNRRGRHSAL 131
T+A A T +TWI+ V NRR RHSAL
Sbjct 60 TRAEAITGVSTWIDTVYNRRRRHSAL 85
>gi|311740827|ref|ZP_07714654.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
gi|311304347|gb|EFQ80423.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
Length=299
Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/27 (60%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A W+E V NRR RHSAL
Sbjct 250 PTRDAARKAVAYWMEVVYNRRRRHSAL 276
>gi|311740801|ref|ZP_07714628.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
gi|311740957|ref|ZP_07714783.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
gi|311303993|gb|EFQ80070.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
gi|311304321|gb|EFQ80397.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
Length=299
Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/27 (60%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A W+E V NRR RHSAL
Sbjct 250 PTRDAARKAVAYWMEVVYNRRRRHSAL 276
>gi|311741423|ref|ZP_07715247.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
gi|311303593|gb|EFQ79672.1| IS3 family transposase OrfB [Corynebacterium pseudogenitalium
ATCC 33035]
Length=299
Score = 34.3 bits (77), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 16/27 (60%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A W+E V NRR RHSAL
Sbjct 250 PTRDAARKAVAYWMEVVYNRRRRHSAL 276
>gi|261328687|emb|CBH11665.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length=492
Score = 34.3 bits (77), Expect = 6.7, Method: Composition-based stats.
Identities = 25/94 (27%), Positives = 37/94 (40%), Gaps = 6/94 (6%)
Query 37 ELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYE------SRSSAGGTIGHSTTSQV 90
E L++ D K A+G+D G T AV+ E SRS GG + HS
Sbjct 218 EPLIIVSPPQDDKHTAIGIDYGTSGAGVTAAVISTVEDHHETNSRSPGGGLLKHSNYEGN 277
Query 91 ACCLYQQPKRPALRPTKAAATTAATTWIERVQNR 124
C K + K TT + +++ + R
Sbjct 278 LCVDLNGCKDDGSKTKKCNGTTFNSMYLDTIAQR 311
>gi|72390001|ref|XP_845295.1| hypothetical protein [Trypanosoma brucei TREU927]
gi|62359248|gb|AAX79690.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801830|gb|AAZ11736.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length=491
Score = 34.3 bits (77), Expect = 7.2, Method: Composition-based stats.
Identities = 25/94 (27%), Positives = 37/94 (40%), Gaps = 6/94 (6%)
Query 37 ELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYE------SRSSAGGTIGHSTTSQV 90
E L++ D K A+G+D G T AV+ E SRS GG + HS
Sbjct 217 EPLIIVSPPQDDKHTAIGIDYGTSGAGVTAAVISTVEDHHETNSRSPGGGLLKHSNYEGN 276
Query 91 ACCLYQQPKRPALRPTKAAATTAATTWIERVQNR 124
C K + K TT + +++ + R
Sbjct 277 LCVDLNGCKDDGSKTKKCNGTTFNSMYLDTIAQR 310
>gi|311740961|ref|ZP_07714787.1| transposase [Corynebacterium pseudogenitalium ATCC 33035]
gi|311303997|gb|EFQ80074.1| transposase [Corynebacterium pseudogenitalium ATCC 33035]
Length=62
Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 16/27 (60%), Positives = 18/27 (67%), Gaps = 0/27 (0%)
Query 105 PTKAAATTAATTWIERVQNRRGRHSAL 131
PT+ AA A W+E V NRR RHSAL
Sbjct 13 PTRDAARKAVAYWMEVVYNRRRRHSAL 39
Lambda K H
0.316 0.130 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130990493970
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40