BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2023A
Length=152
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|308370893|ref|ZP_07667038.1| hypothetical protein TMCG_00112 ... 321 2e-86
gi|15841507|ref|NP_336544.1| hypothetical protein MT2080 [Mycoba... 321 3e-86
gi|339632061|ref|YP_004723703.1| hypothetical protein MAF_20360 ... 320 4e-86
gi|289746025|ref|ZP_06505403.1| conserved hypothetical protein [... 319 7e-86
gi|308379101|ref|ZP_07668896.1| hypothetical protein TMJG_00277 ... 319 9e-86
gi|289754130|ref|ZP_06513508.1| conserved hypothetical protein [... 319 9e-86
gi|289758141|ref|ZP_06517519.1| conserved hypothetical protein [... 318 2e-85
gi|5042235|emb|CAB44653.1| hypothetical protein [Mycobacterium b... 129 2e-28
gi|289423479|ref|ZP_06425281.1| type I restriction-modification ... 37.0 0.95
gi|193214099|ref|YP_001995298.1| tetratricopeptide domain-contai... 36.6 1.3
gi|224005168|ref|XP_002296235.1| predicted protein [Thalassiosir... 36.2 1.8
gi|170079294|ref|YP_001735932.1| putative protein serine/threoni... 35.0 3.8
gi|149913585|ref|ZP_01902118.1| putative insertion sequence tran... 35.0 4.1
gi|17227840|ref|NP_484388.1| serine/threonine kinase [Nostoc sp.... 34.7 5.3
gi|75909003|ref|YP_323299.1| serine/threonine protein kinase [An... 34.3 6.6
gi|167824374|ref|ZP_02455845.1| hypothetical protein Bpseu9_1193... 33.5 10.0
>gi|308370893|ref|ZP_07667038.1| hypothetical protein TMCG_00112 [Mycobacterium tuberculosis SUMu003]
gi|308330523|gb|EFP19374.1| hypothetical protein TMCG_00112 [Mycobacterium tuberculosis SUMu003]
gi|323719427|gb|EGB28555.1| hypothetical protein TMMG_01292 [Mycobacterium tuberculosis CDC1551A]
Length=210
Score = 321 bits (822), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 151/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 59 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 118
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 119 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 178
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 179 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 210
>gi|15841507|ref|NP_336544.1| hypothetical protein MT2080 [Mycobacterium tuberculosis CDC1551]
gi|31793204|ref|NP_855697.1| hypothetical protein Mb2047c [Mycobacterium bovis AF2122/97]
gi|121637908|ref|YP_978131.1| hypothetical protein BCG_2041c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
38 more sequence titles
Length=225
Score = 321 bits (822), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 151/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 74 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 133
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 134 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 193
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 194 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 225
>gi|339632061|ref|YP_004723703.1| hypothetical protein MAF_20360 [Mycobacterium africanum GM041182]
gi|339331417|emb|CCC27106.1| hypothetical protein MAF_20360 [Mycobacterium africanum GM041182]
Length=167
Score = 320 bits (820), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 151/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 16 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 75
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 76 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 135
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 136 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 167
>gi|289746025|ref|ZP_06505403.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289686553|gb|EFD54041.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=225
Score = 319 bits (818), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 150/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILY+FHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 74 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYEFHWRRPIHSHMMHAQDRRPVTQLPTY 133
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 134 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 193
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 194 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 225
>gi|308379101|ref|ZP_07668896.1| hypothetical protein TMJG_00277 [Mycobacterium tuberculosis SUMu010]
gi|308358162|gb|EFP47013.1| hypothetical protein TMJG_00277 [Mycobacterium tuberculosis SUMu010]
Length=152
Score = 319 bits (817), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 151/152 (99%), Positives = 152/152 (100%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 1 MFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
>gi|289754130|ref|ZP_06513508.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289694717|gb|EFD62146.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=225
Score = 319 bits (817), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 150/152 (99%), Positives = 151/152 (99%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQN NYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 74 LFEFFTQNTSVGQNTNYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 133
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV
Sbjct 134 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 193
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 194 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 225
>gi|289758141|ref|ZP_06517519.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|289713705|gb|EFD77717.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|326903637|gb|EGE50570.1| hypothetical protein TBPG_01514 [Mycobacterium tuberculosis W-148]
Length=225
Score = 318 bits (814), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 149/152 (99%), Positives = 151/152 (99%), Gaps = 0/152 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTY 60
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILY+FHWRRPIHSHMMHAQDRRPVTQLPTY
Sbjct 74 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYEFHWRRPIHSHMMHAQDRRPVTQLPTY 133
Query 61 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRV 120
DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNL FRPIGATAQTALASEINAAKRV
Sbjct 134 DDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLHFRPIGATAQTALASEINAAKRV 193
Query 121 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 152
RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA
Sbjct 194 RTNDVTQRQIAVGKETSRLEPNFSIPQIEWPA 225
>gi|5042235|emb|CAB44653.1| hypothetical protein [Mycobacterium bovis BCG]
Length=131
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 57/58 (99%), Positives = 58/58 (100%), Gaps = 0/58 (0%)
Query 1 VFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLP 58
+FEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLP
Sbjct 74 LFEFFTQNTSVGQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLP 131
>gi|289423479|ref|ZP_06425281.1| type I restriction-modification system specificity subunit [Peptostreptococcus
anaerobius 653-L]
gi|289156113|gb|EFD04776.1| type I restriction-modification system specificity subunit [Peptostreptococcus
anaerobius 653-L]
Length=401
Score = 37.0 bits (84), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 24/84 (29%), Positives = 38/84 (46%), Gaps = 3/84 (3%)
Query 12 GQNANYYNCTVYKVPLIGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTYDDHAQTKPLNE 71
G A+ + + YKV +G I ++ H + H D P + + +PLNE
Sbjct 257 GNGASESSLSTYKVLRVGDIAFEGHTNKQFHFGRFVVNDIGTGIMSPRF---STLRPLNE 313
Query 72 MPVDFAKEIVRLWRVFVKDLNNHT 95
MPV+F K+ + V + L N T
Sbjct 314 MPVNFWKQYIHSESVMRRILVNST 337
>gi|193214099|ref|YP_001995298.1| tetratricopeptide domain-containing protein [Chloroherpeton thalassium
ATCC 35110]
gi|193087576|gb|ACF12851.1| Tetratricopeptide TPR_2 repeat protein [Chloroherpeton thalassium
ATCC 35110]
Length=1175
Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats.
Identities = 28/110 (26%), Positives = 44/110 (40%), Gaps = 5/110 (4%)
Query 30 SILYQFHWRRPIHSHMMHAQDRRPVTQLPTYDDHAQTKPLNEMPVDFAKEIVRLWRVFVK 89
++L W+ P S H R P+ Y D + ++P +F+KE RLWR++
Sbjct 537 TLLLTSRWKLPEWSEAEHHALRHPL-----YGDFLRMVRNEKLPPEFSKERYRLWRLYDT 591
Query 90 DLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQRQIAVGKETSRL 139
N L F TA+ E K + +Q +A+ S L
Sbjct 592 LHGNGRGLTFFASAVRGMTAVEEEAFLNKLAEVSAESQTDMALDTVISHL 641
>gi|224005168|ref|XP_002296235.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586267|gb|ACI64952.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length=1274
Score = 36.2 bits (82), Expect = 1.8, Method: Composition-based stats.
Identities = 13/24 (55%), Positives = 16/24 (67%), Gaps = 0/24 (0%)
Query 37 WRRPIHSHMMHAQDRRPVTQLPTY 60
WRRP H H M+ + RR V+Q P Y
Sbjct 937 WRRPFHIHGMYTKSRRDVSQTPFY 960
>gi|170079294|ref|YP_001735932.1| putative protein serine/threonine phosphatase [Synechococcus
sp. PCC 7002]
gi|169886963|gb|ACB00677.1| probable protein phosphatase [Synechococcus sp. PCC 7002]
Length=662
Score = 35.0 bits (79), Expect = 3.8, Method: Composition-based stats.
Identities = 21/55 (39%), Positives = 32/55 (59%), Gaps = 2/55 (3%)
Query 34 QFHWRRPIHSHMMHAQDRRPVTQLPTYDDHAQTKPLNEMPVDFAKEIVRLWRVFV 88
Q WR+P + ++ Q +P+T L TY AQT PL E + + ++V+LWR V
Sbjct 154 QEAWRQPEYQVLLFEQRPQPLTPLKTY-CQAQT-PLYEQLLQWCFQMVQLWRELV 206
>gi|149913585|ref|ZP_01902118.1| putative insertion sequence transposase protein, IS66 family
[Roseobacter sp. AzwK-3b]
gi|149812705|gb|EDM72534.1| putative insertion sequence transposase protein, IS66 family
[Roseobacter sp. AzwK-3b]
Length=356
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/80 (28%), Positives = 38/80 (48%), Gaps = 4/80 (5%)
Query 54 VTQLPTYDDHAQTKPLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASE 113
+ + ++ + Q +PLN FA+E V L + DL H + +P+ A + S
Sbjct 6 LIAMLVFEKYGQHQPLNRQAERFAREGVELSLSTLADLVGHATVALQPLHAL----IESH 61
Query 114 INAAKRVRTNDVTQRQIAVG 133
+ AA R+ +D T +A G
Sbjct 62 VRAAHRLHGDDTTVPLLARG 81
>gi|17227840|ref|NP_484388.1| serine/threonine kinase [Nostoc sp. PCC 7120]
gi|17129689|dbj|BAB72302.1| serine/threonine kinase [Nostoc sp. PCC 7120]
Length=796
Score = 34.7 bits (78), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/78 (29%), Positives = 41/78 (53%), Gaps = 10/78 (12%)
Query 68 PLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQ 127
PLNE+ F KE+ LW++ + + ++TN+ +R TA + + A V + +TQ
Sbjct 684 PLNEINSTFVKEVQNLWKIDILSILSNTNITWR-------TATSYD---ATLVLSQAITQ 733
Query 128 RQIAVGKETSRLEPNFSI 145
+G + + +P FS+
Sbjct 734 NPTRLGIKKTLSQPQFSV 751
>gi|75909003|ref|YP_323299.1| serine/threonine protein kinase [Anabaena variabilis ATCC 29413]
gi|75702728|gb|ABA22404.1| amino acid/amide ABC transporter substrate-binding protein, HAAT
family [Anabaena variabilis ATCC 29413]
Length=780
Score = 34.3 bits (77), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 22/78 (29%), Positives = 39/78 (50%), Gaps = 10/78 (12%)
Query 68 PLNEMPVDFAKEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQ 127
PLNEM F KE+ LW++ + L +T++ +R T+ A+ V + +TQ
Sbjct 668 PLNEMNSTFVKEVQNLWKIDLSSLLRNTDITWRT--TTSYDAML--------VLSQAITQ 717
Query 128 RQIAVGKETSRLEPNFSI 145
+G + + +P FS+
Sbjct 718 NPTRLGIQKTLSQPQFSV 735
>gi|167824374|ref|ZP_02455845.1| hypothetical protein Bpseu9_11938 [Burkholderia pseudomallei
9]
Length=189
Score = 33.5 bits (75), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 22/70 (32%), Positives = 33/70 (48%), Gaps = 1/70 (1%)
Query 84 WRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQRQIAVGKE-TSRLEPN 142
WRV +R ++T +E+ AAK +T RQIAV +E SRL+ +
Sbjct 72 WRVIRTASEARAEAVYRDFAKQSETLAVNELQAAKLESQKALTDRQIAVAQERASRLQAD 131
Query 143 FSIPQIEWPA 152
SI + + A
Sbjct 132 LSIAREQRAA 141
Lambda K H
0.321 0.133 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128332939266
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40