BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0662c
Length=84
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840065|ref|NP_335102.1| CopG family DNA-binding protein [My... 163 8e-39
gi|15607802|ref|NP_215176.1| hypothetical protein Rv0662c [Mycob... 163 9e-39
gi|308231610|ref|ZP_07413109.2| antitoxin [Mycobacterium tubercu... 161 3e-38
gi|340625680|ref|YP_004744132.1| hypothetical protein MCAN_06611... 161 3e-38
gi|167967492|ref|ZP_02549769.1| hypothetical protein MtubH3_0543... 157 5e-37
gi|333989223|ref|YP_004521837.1| CopG family transcriptional reg... 117 7e-25
gi|269929124|ref|YP_003321445.1| hypothetical protein Sthe_3223 ... 47.8 5e-04
gi|258593756|emb|CBE70097.1| protein of unknown function [NC10 b... 41.2 0.053
gi|256824110|ref|YP_003148070.1| hypothetical protein Ksed_02200... 38.9 0.27
gi|169830612|ref|YP_001716594.1| CopG/DNA-binding domain-contain... 37.0 1.1
gi|344174770|emb|CCA86581.1| conserved hypothetical protein, WD-... 35.8 2.0
gi|302882309|ref|XP_003040065.1| hypothetical protein NECHADRAFT... 35.8 2.3
gi|284047181|ref|YP_003397521.1| hypothetical protein Cwoe_5745 ... 35.4 3.0
gi|225874393|ref|YP_002755852.1| hypothetical protein ACP_2836 [... 35.0 3.7
gi|37521109|ref|NP_924486.1| hypothetical protein gsl1540 [Gloeo... 34.3 7.0
gi|290954992|ref|YP_003486174.1| MerR family transcriptional reg... 34.3 7.2
>gi|15840065|ref|NP_335102.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551]
gi|254230991|ref|ZP_04924318.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|13880212|gb|AAK44916.1| DNA-binding protein, CopG family [Mycobacterium tuberculosis
CDC1551]
gi|124600050|gb|EAY59060.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=126
Score = 163 bits (413), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
Query 1 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 60
MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct 43 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 102
Query 61 AADMSVPEPRELKQELEALRARRG 84
AADMSVPEPRELKQELEALRARRG
Sbjct 103 AADMSVPEPRELKQELEALRARRG 126
>gi|15607802|ref|NP_215176.1| hypothetical protein Rv0662c [Mycobacterium tuberculosis H37Rv]
gi|31791846|ref|NP_854339.1| hypothetical protein Mb0681c [Mycobacterium bovis AF2122/97]
gi|121636583|ref|YP_976806.1| hypothetical protein BCG_0711c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
45 more sequence titles
Length=122
Score = 163 bits (412), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
Query 1 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 60
MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct 39 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 98
Query 61 AADMSVPEPRELKQELEALRARRG 84
AADMSVPEPRELKQELEALRARRG
Sbjct 99 AADMSVPEPRELKQELEALRARRG 122
>gi|308231610|ref|ZP_07413109.2| antitoxin [Mycobacterium tuberculosis SUMu001]
gi|308369986|ref|ZP_07419832.2| antitoxin [Mycobacterium tuberculosis SUMu002]
gi|308370470|ref|ZP_07421634.2| antitoxin [Mycobacterium tuberculosis SUMu003]
19 more sequence titles
Length=84
Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
Query 1 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 60
MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct 1 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 60
Query 61 AADMSVPEPRELKQELEALRARRG 84
AADMSVPEPRELKQELEALRARRG
Sbjct 61 AADMSVPEPRELKQELEALRARRG 84
>gi|340625680|ref|YP_004744132.1| hypothetical protein MCAN_06611 [Mycobacterium canettii CIPT
140010059]
gi|340003870|emb|CCC43001.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=122
Score = 161 bits (407), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/84 (99%), Positives = 83/84 (99%), Gaps = 0/84 (0%)
Query 1 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 60
MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct 39 MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD 98
Query 61 AADMSVPEPRELKQELEALRARRG 84
AADM VPEPRELKQELEALRARRG
Sbjct 99 AADMPVPEPRELKQELEALRARRG 122
>gi|167967492|ref|ZP_02549769.1| hypothetical protein MtubH3_05432 [Mycobacterium tuberculosis
H37Ra]
gi|254549622|ref|ZP_05140069.1| CopG family DNA-binding protein [Mycobacterium tuberculosis '98-R604
INH-RIF-EM']
gi|308373543|ref|ZP_07432749.2| antitoxin [Mycobacterium tuberculosis SUMu005]
gi|308377485|ref|ZP_07479344.2| antitoxin [Mycobacterium tuberculosis SUMu009]
gi|308337212|gb|EFP26063.1| antitoxin [Mycobacterium tuberculosis SUMu005]
gi|308355535|gb|EFP44386.1| antitoxin [Mycobacterium tuberculosis SUMu009]
Length=82
Score = 157 bits (397), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 82/82 (100%), Positives = 82/82 (100%), Gaps = 0/82 (0%)
Query 3 MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA 62
MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA
Sbjct 1 MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA 60
Query 63 DMSVPEPRELKQELEALRARRG 84
DMSVPEPRELKQELEALRARRG
Sbjct 61 DMSVPEPRELKQELEALRARRG 82
>gi|333989223|ref|YP_004521837.1| CopG family transcriptional regulator [Mycobacterium sp. JDM601]
gi|333485191|gb|AEF34583.1| CopG family DNA-binding protein [Mycobacterium sp. JDM601]
Length=80
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/80 (77%), Positives = 67/80 (84%), Gaps = 0/80 (0%)
Query 5 LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM 64
+ HRLQILLDDE HRR+TA ARERGV VA+VVREAIDRGL P RRKSAG+RLLDA DM
Sbjct 1 MEHRLQILLDDERHRRLTAAARERGVSVASVVREAIDRGLAGPVDRRKSAGQRLLDAPDM 60
Query 65 SVPEPRELKQELEALRARRG 84
VP+P ELKQEL+ LR RRG
Sbjct 61 PVPDPAELKQELDELRGRRG 80
>gi|269929124|ref|YP_003321445.1| hypothetical protein Sthe_3223 [Sphaerobacter thermophilus DSM
20745]
gi|269788481|gb|ACZ40623.1| hypothetical protein Sthe_3223 [Sphaerobacter thermophilus DSM
20745]
Length=106
Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/77 (43%), Positives = 51/77 (67%), Gaps = 2/77 (2%)
Query 5 LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM 64
L HR+Q+LLDDE + ++ AR RG+ + V+REAIDR L + + R++A +L A M
Sbjct 8 LDHRVQVLLDDERYEKVAREARRRGISIGAVIREAIDR-LPTDSEARRAAIEAILAAEPM 66
Query 65 SVP-EPRELKQELEALR 80
VP +P +L++EL+ R
Sbjct 67 PVPDDPADLRRELDTAR 83
>gi|258593756|emb|CBE70097.1| protein of unknown function [NC10 bacterium 'Dutch sediment']
Length=95
Score = 41.2 bits (95), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 28/76 (37%), Positives = 47/76 (62%), Gaps = 5/76 (6%)
Query 5 LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDA-AD 63
L ++QILL DE HR + + + RG P++ ++REA+ L+ A R++A R+ + A
Sbjct 4 LTQKVQILLTDEQHRALLKLVKARGKPLSVLLREAVVDKLLIEA--RQAAKRKAFEEIAA 61
Query 64 MSVP--EPRELKQELE 77
MS+P E E++ E+E
Sbjct 62 MSLPVAEWPEMEDEIE 77
>gi|256824110|ref|YP_003148070.1| hypothetical protein Ksed_02200 [Kytococcus sedentarius DSM 20547]
gi|256687503|gb|ACV05305.1| hypothetical protein Ksed_02200 [Kytococcus sedentarius DSM 20547]
Length=88
Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/73 (44%), Positives = 39/73 (54%), Gaps = 11/73 (15%)
Query 9 LQILLDDECHRRITAV---ARERGVPVATVVREAIDRGLV-SPAGRRKSAGRRLL----D 60
+Q+LLD+ RRIT + A ERGV V+TVVR+AID RR A R L D
Sbjct 1 MQLLLDE---RRITLLRERATERGVSVSTVVRDAIDASFEDDDMARRAQAARDFLALTAD 57
Query 61 AADMSVPEPRELK 73
A EP +LK
Sbjct 58 NARHEAAEPADLK 70
>gi|169830612|ref|YP_001716594.1| CopG/DNA-binding domain-containing protein [Candidatus Desulforudis
audaxviator MP104C]
gi|169637456|gb|ACA58962.1| CopG domain protein DNA-binding domain protein [Candidatus Desulforudis
audaxviator MP104C]
Length=89
Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/79 (36%), Positives = 41/79 (52%), Gaps = 9/79 (11%)
Query 8 RLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAG-RRKSAGRRLLDAA---- 62
R Q+ L +E RRI +A +RGV +A ++REA+D S A R+ RR + AA
Sbjct 10 RTQVQLTEEQVRRIQQLAADRGVSMAQLIREAVDLYTCSNAALSREEQVRRAIAAAGRFR 69
Query 63 ----DMSVPEPRELKQELE 77
D+SV + L + E
Sbjct 70 SGLQDLSVEHDKYLAEAFE 88
>gi|344174770|emb|CCA86581.1| conserved hypothetical protein, WD-40 repeat-containing protein
[Ralstonia syzygii R24]
Length=1214
Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/70 (33%), Positives = 41/70 (59%), Gaps = 7/70 (10%)
Query 5 LAHRLQILLDDECHRRIT-AVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAAD 63
L ++ + L D C + ++ A A+ P A+ +R+ ID+ L++P+G R A ++D AD
Sbjct 281 LNYKPETALLDYCDKALSEATAKP---PRASALRDWIDQRLLTPSGLRAPA---MIDPAD 334
Query 64 MSVPEPRELK 73
+ P P+EL
Sbjct 335 GNCPTPQELS 344
>gi|302882309|ref|XP_003040065.1| hypothetical protein NECHADRAFT_105463 [Nectria haematococca
mpVI 77-13-4]
gi|256720932|gb|EEU34352.1| hypothetical protein NECHADRAFT_105463 [Nectria haematococca
mpVI 77-13-4]
Length=543
Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/54 (34%), Positives = 28/54 (52%), Gaps = 4/54 (7%)
Query 18 HRRITAVAR----ERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADMSVP 67
H+R+T + R + G P+ V E + LV PAG + + L D AD ++P
Sbjct 68 HKRVTHLVRFLVNQHGYPLDAFVYECMMDALVDPAGSSRGVEKLLDDMADQNIP 121
>gi|284047181|ref|YP_003397521.1| hypothetical protein Cwoe_5745 [Conexibacter woesei DSM 14684]
gi|283951402|gb|ADB54146.1| hypothetical protein Cwoe_5745 [Conexibacter woesei DSM 14684]
Length=107
Score = 35.4 bits (80), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/37 (55%), Positives = 25/37 (68%), Gaps = 0/37 (0%)
Query 5 LAHRLQILLDDECHRRITAVARERGVPVATVVREAID 41
L RLQIL+ +R+ ARERGV V ++VREAID
Sbjct 19 LNERLQILVTPLQRQRLEREARERGVSVGSLVREAID 55
>gi|225874393|ref|YP_002755852.1| hypothetical protein ACP_2836 [Acidobacterium capsulatum ATCC
51196]
gi|225793202|gb|ACO33292.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length=74
Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/71 (30%), Positives = 35/71 (50%), Gaps = 0/71 (0%)
Query 8 RLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADMSVP 67
R + LDD+ + + A R +P+ V E + R +V PA R+ G + D S P
Sbjct 2 RTTLALDDDVFEEVRSYAEARDLPLGRAVTELLRRAMVQPAPTRRVNGLLVFDLPADSSP 61
Query 68 EPRELKQELEA 78
E+ ++LE+
Sbjct 62 VTDEMVRDLES 72
>gi|37521109|ref|NP_924486.1| hypothetical protein gsl1540 [Gloeobacter violaceus PCC 7421]
gi|35212105|dbj|BAC89481.1| gsl1540 [Gloeobacter violaceus PCC 7421]
Length=99
Score = 34.3 bits (77), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 15/34 (45%), Positives = 25/34 (74%), Gaps = 0/34 (0%)
Query 7 HRLQILLDDECHRRITAVARERGVPVATVVREAI 40
+R QILL+ E HR++ +ARE+G ++ V+RE +
Sbjct 4 YRAQILLEPEQHRQLAEIAREQGRTLSEVMREIV 37
>gi|290954992|ref|YP_003486174.1| MerR family transcriptional regulator [Streptomyces scabiei 87.22]
gi|260644518|emb|CBG67603.1| putative MerR-family transcriptional regulator [Streptomyces
scabiei 87.22]
Length=540
Score = 34.3 bits (77), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (64%), Gaps = 1/38 (2%)
Query 21 ITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRL 58
I AVAR+ G+PV V+R DRG+V+P GR RR
Sbjct 22 IGAVARQVGLPV-KVIRHWSDRGVVAPVGRTAGGYRRY 58
Lambda K H
0.323 0.136 0.377
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131923386480
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40