BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2132
Length=76
Score E
Sequences producing significant alignments: (Bits) Value
gi|15841623|ref|NP_336660.1| CopG family DNA-binding protein [My... 154 5e-36
gi|15609269|ref|NP_216648.1| hypothetical protein Rv2132 [Mycoba... 154 6e-36
gi|289447758|ref|ZP_06437502.1| CopG family DNA-binding protein ... 119 2e-25
gi|229822545|ref|YP_002884071.1| CopG family DNA-binding protein... 81.6 4e-14
gi|297623634|ref|YP_003705068.1| hypothetical protein Trad_1403 ... 47.0 0.001
gi|289745379|ref|ZP_06504757.1| conserved hypothetical protein [... 47.0 0.001
gi|333991904|ref|YP_004524518.1| CopG family transcriptional reg... 46.6 0.001
gi|15609241|ref|NP_216620.1| hypothetical protein Rv2104c [Mycob... 45.8 0.002
gi|116624724|ref|YP_826880.1| hypothetical protein Acid_5648 [Ca... 45.1 0.004
gi|240170676|ref|ZP_04749335.1| CopG family DNA-binding protein ... 44.3 0.007
gi|333990438|ref|YP_004523052.1| hypothetical protein JDM601_179... 42.4 0.023
gi|145221523|ref|YP_001132201.1| CopG/DNA-binding domain-contain... 42.4 0.026
gi|88810688|ref|ZP_01125945.1| hypothetical protein NB231_16448 ... 41.6 0.038
gi|320105316|ref|YP_004180906.1| hypothetical protein AciPR4_007... 41.6 0.041
gi|322435686|ref|YP_004217898.1| hypothetical protein AciX9_2073... 38.5 0.34
gi|300780205|ref|ZP_07090061.1| toxin-antitoxin system [Coryneba... 38.1 0.43
gi|317967918|ref|ZP_07969308.1| hypothetical protein SCB02_00122... 35.8 2.2
gi|336177809|ref|YP_004583184.1| CopG/DNA-binding domain-contain... 35.8 2.3
gi|298529362|ref|ZP_07016765.1| conserved hypothetical protein [... 35.4 3.1
gi|89074857|ref|ZP_01161311.1| AcrB/AcrD/AcrF family protein [Ph... 33.9 8.7
>gi|15841623|ref|NP_336660.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551]
gi|308232042|ref|ZP_07663980.1| hypothetical protein TMAG_00318 [Mycobacterium tuberculosis SUMu001]
gi|308372148|ref|ZP_07667308.1| hypothetical protein TMDG_00609 [Mycobacterium tuberculosis SUMu004]
11 more sequence titles
Length=82
Score = 154 bits (388), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 76/76 (100%), Positives = 76/76 (100%), Gaps = 0/76 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY 60
MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY
Sbjct 7 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY 66
Query 61 SNIGDAIETLDGPASG 76
SNIGDAIETLDGPASG
Sbjct 67 SNIGDAIETLDGPASG 82
>gi|15609269|ref|NP_216648.1| hypothetical protein Rv2132 [Mycobacterium tuberculosis H37Rv]
gi|31793312|ref|NP_855805.1| hypothetical protein Mb2156 [Mycobacterium bovis AF2122/97]
gi|121638014|ref|YP_978238.1| hypothetical protein BCG_2149 [Mycobacterium bovis BCG str. Pasteur
1173P2]
58 more sequence titles
Length=76
Score = 154 bits (388), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 76/76 (100%), Positives = 76/76 (100%), Gaps = 0/76 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY 60
MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY
Sbjct 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY 60
Query 61 SNIGDAIETLDGPASG 76
SNIGDAIETLDGPASG
Sbjct 61 SNIGDAIETLDGPASG 76
>gi|289447758|ref|ZP_06437502.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CPHL_A]
gi|289420716|gb|EFD17917.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CPHL_A]
Length=59
Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/59 (100%), Positives = 59/59 (100%), Gaps = 0/59 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGID 59
MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGID
Sbjct 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGID 59
>gi|229822545|ref|YP_002884071.1| CopG family DNA-binding protein [Beutenbergia cavernae DSM 12333]
gi|229568458|gb|ACQ82309.1| CopG family DNA-binding protein [Beutenbergia cavernae DSM 12333]
Length=80
Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/72 (60%), Positives = 54/72 (75%), Gaps = 1/72 (1%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVAN-RFQQQTYDMGEGID 59
MRTT+ L DDVAAA +RLR+ER IGL EAVNEL RAGL + A R++Q+T D+G ID
Sbjct 1 MRTTIRLDDDVAAAAERLRRERHIGLGEAVNELARAGLHRGAPAERRYRQRTADLGLRID 60
Query 60 YSNIGDAIETLD 71
SN+ +A+E LD
Sbjct 61 VSNVAEALELLD 72
>gi|297623634|ref|YP_003705068.1| hypothetical protein Trad_1403 [Truepera radiovictrix DSM 17093]
gi|297164814|gb|ADI14525.1| conserved hypothetical protein [Truepera radiovictrix DSM 17093]
Length=80
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/80 (34%), Positives = 44/80 (55%), Gaps = 4/80 (5%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLT---KRQVANRFQQQTYDMGEG 57
MRTT++L DDV ++RL+KER VN +R GL + + + F+ QT +G
Sbjct 1 MRTTLTLDDDVEKRLRRLQKERGESFKSLVNRALREGLVALEQPRATDAFRTQTVSLGRA 60
Query 58 -IDYSNIGDAIETLDGPASG 76
++ ++ DA+E +G G
Sbjct 61 LVNLDSVADALEVAEGAHHG 80
>gi|289745379|ref|ZP_06504757.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289685907|gb|EFD53395.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=84
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/55 (44%), Positives = 33/55 (60%), Gaps = 0/55 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMG 55
MRTTVSL DDV V+R ER + +A+N+ IR G + R + F +T D+G
Sbjct 1 MRTTVSLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAPSHFSTRTADLG 55
>gi|333991904|ref|YP_004524518.1| CopG family transcriptional regulator [Mycobacterium sp. JDM601]
gi|333487872|gb|AEF37264.1| CopG family DNA-binding protein [Mycobacterium sp. JDM601]
Length=75
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/65 (44%), Positives = 41/65 (64%), Gaps = 3/65 (4%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVA--NRFQQQTYDMGEGI 58
MRTTV + DVAA V RLR + +G+SEA+N L R G+ + A R++ +T +G +
Sbjct 1 MRTTVVIDSDVAAEVTRLRGQ-GMGVSEALNLLARRGMAAQTSAPGTRYRHRTAHIGLKV 59
Query 59 DYSNI 63
D SN+
Sbjct 60 DVSNV 64
>gi|15609241|ref|NP_216620.1| hypothetical protein Rv2104c [Mycobacterium tuberculosis H37Rv]
gi|15841595|ref|NP_336632.1| hypothetical protein MT2164 [Mycobacterium tuberculosis CDC1551]
gi|31793286|ref|NP_855779.1| hypothetical protein Mb2130c [Mycobacterium bovis AF2122/97]
73 more sequence titles
Length=84
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/55 (42%), Positives = 33/55 (60%), Gaps = 0/55 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMG 55
MRTTV+L DDV V+R ER + +A+N+ IR G + R + F +T D+G
Sbjct 1 MRTTVTLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAPSHFSTRTADLG 55
>gi|116624724|ref|YP_826880.1| hypothetical protein Acid_5648 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227886|gb|ABJ86595.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length=87
Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/75 (38%), Positives = 41/75 (55%), Gaps = 7/75 (9%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTK--RQVANRFQQQTYDMGEGI 58
MRTTV+L DV A++ + R + +A+N+ IR GL K + RF Q+TY +G
Sbjct 1 MRTTVTLDPDVERALKATMRTRGVSFKQALNDAIRTGLLKPGSKTRRRFVQKTYSLGAEQ 60
Query 59 DYS-----NIGDAIE 68
++ I DAIE
Sbjct 61 NFRWEKALAIADAIE 75
>gi|240170676|ref|ZP_04749335.1| CopG family DNA-binding protein [Mycobacterium kansasii ATCC
12478]
Length=91
Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/65 (39%), Positives = 41/65 (64%), Gaps = 3/65 (4%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANR--FQQQTYDMGEGI 58
+RTTV + DVA ++RLR+E +GLSEA+N L R G+T+ ++ +T +G +
Sbjct 17 VRTTVVIDSDVAGEIERLRRE-GMGLSEALNLLARRGMTRGAPPKSVVYKHRTSRIGLKV 75
Query 59 DYSNI 63
D +N+
Sbjct 76 DVTNV 80
>gi|333990438|ref|YP_004523052.1| hypothetical protein JDM601_1798 [Mycobacterium sp. JDM601]
gi|333486406|gb|AEF35798.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=83
Score = 42.4 bits (98), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 24/55 (44%), Positives = 32/55 (59%), Gaps = 1/55 (1%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMG 55
MRTTV+L DDV ++R ER + +A+NE IR G R FQ +T D+G
Sbjct 1 MRTTVTLDDDVEQLIRRRMAERQLSFKQALNEAIRDGSAGRATPQ-FQTRTADLG 54
>gi|145221523|ref|YP_001132201.1| CopG/DNA-binding domain-containing protein [Mycobacterium gilvum
PYR-GCK]
gi|145214009|gb|ABP43413.1| CopG domain protein DNA-binding domain protein [Mycobacterium
gilvum PYR-GCK]
Length=84
Score = 42.4 bits (98), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (55%), Gaps = 0/55 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMG 55
MRTTV+L DD A ++R E+ I EA+N IR G +R F + D+G
Sbjct 1 MRTTVTLDDDTVALIRRRMAEQGISFKEALNNAIRDGAAQRPAPAAFSTRVADLG 55
>gi|88810688|ref|ZP_01125945.1| hypothetical protein NB231_16448 [Nitrococcus mobilis Nb-231]
gi|88792318|gb|EAR23428.1| hypothetical protein NB231_16448 [Nitrococcus mobilis Nb-231]
Length=92
Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/76 (35%), Positives = 40/76 (53%), Gaps = 5/76 (6%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANR---FQQQTYDMG-- 55
MRTTV+L DVAA ++ L R E +N+++R GLT + A R F + + G
Sbjct 5 MRTTVTLEPDVAAKLKELAHRRRASFKETLNDVLRKGLTSQAKAGRSEPFVVKPHSGGFR 64
Query 56 EGIDYSNIGDAIETLD 71
GID + ++ L+
Sbjct 65 PGIDPDKLNQLLDQLE 80
>gi|320105316|ref|YP_004180906.1| hypothetical protein AciPR4_0071 [Terriglobus saanensis SP1PR4]
gi|319923837|gb|ADV80912.1| hypothetical protein AciPR4_0071 [Terriglobus saanensis SP1PR4]
Length=80
Score = 41.6 bits (96), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 23/75 (31%), Positives = 43/75 (58%), Gaps = 4/75 (5%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKR--QVANRFQQQTYDMG--E 56
MRTT+++ DDVAA ++R + + +A+N +R L + + A +F+ Q+ +G
Sbjct 1 MRTTLTIDDDVAALLKREMRRSGEPMKQAINRCLRTALAVKSTEPAPKFKVQSRKLGLLP 60
Query 57 GIDYSNIGDAIETLD 71
G+ Y N+ ++ LD
Sbjct 61 GMSYDNLEAVLDQLD 75
>gi|322435686|ref|YP_004217898.1| hypothetical protein AciX9_2073 [Acidobacterium sp. MP5ACTX9]
gi|321163413|gb|ADW69118.1| hypothetical protein AciX9_2073 [Acidobacterium sp. MP5ACTX9]
Length=81
Score = 38.5 bits (88), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/65 (33%), Positives = 38/65 (59%), Gaps = 3/65 (4%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANR---FQQQTYDMGEG 57
MRTT+++ DDVAA +Q+ + L +AVN L+RAGL + + F+ + +D
Sbjct 1 MRTTLTIDDDVAALLQKELRRSGEPLKQAVNRLLRAGLYQAATPMKSKPFKVRPFDASLP 60
Query 58 IDYSN 62
++++
Sbjct 61 TEWTS 65
>gi|300780205|ref|ZP_07090061.1| toxin-antitoxin system [Corynebacterium genitalium ATCC 33030]
gi|300534315|gb|EFK55374.1| toxin-antitoxin system [Corynebacterium genitalium ATCC 33030]
Length=80
Score = 38.1 bits (87), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 21/71 (30%), Positives = 38/71 (54%), Gaps = 0/71 (0%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDY 60
MR T+ L ++V A + + E+ + EAVN+L RAGL +R A + + MG ++
Sbjct 1 MRLTIRLDEEVYTAARAIAAEKGTSVGEAVNDLARAGLPRRDSAVDYIPMSQPMGMKVEC 60
Query 61 SNIGDAIETLD 71
+ + ++ D
Sbjct 61 IKVSEILDLED 71
>gi|317967918|ref|ZP_07969308.1| hypothetical protein SCB02_00122 [Synechococcus sp. CB0205]
Length=86
Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/46 (46%), Positives = 29/46 (64%), Gaps = 3/46 (6%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANR 46
MRTT+SL DDV AA + L ++R + ++EL R GL + ANR
Sbjct 8 MRTTLSLDDDVLAAAKVLARQRKQPIGSVISELARQGLAQ---ANR 50
>gi|336177809|ref|YP_004583184.1| CopG/DNA-binding domain-containing protein [Frankia symbiont
of Datisca glomerata]
gi|334858789|gb|AEH09263.1| CopG/DNA-binding domain-containing protein [Frankia symbiont
of Datisca glomerata]
Length=82
Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/38 (45%), Positives = 27/38 (72%), Gaps = 0/38 (0%)
Query 5 VSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQ 42
++L DVAAA+Q+ +E+ + EA+N L+RAGL R+
Sbjct 1 MTLDPDVAAALQKEVREQGMSFEEALNTLVRAGLANRR 38
>gi|298529362|ref|ZP_07016765.1| conserved hypothetical protein [Desulfonatronospira thiodismutans
ASO3-1]
gi|298510798|gb|EFI34701.1| conserved hypothetical protein [Desulfonatronospira thiodismutans
ASO3-1]
Length=81
Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 26/79 (33%), Positives = 40/79 (51%), Gaps = 9/79 (11%)
Query 1 MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQTY-----DMG 55
MRTT+++ +DV + L R I +NE +RAGL R+V QQ Y MG
Sbjct 1 MRTTLAIDNDVLEKARALAARRKIPFKTVINEALRAGL--REVEKPALQQDYTTMPQPMG 58
Query 56 --EGIDYSNIGDAIETLDG 72
+G + NI + + ++G
Sbjct 59 LKQGRNLDNIQELLAQVEG 77
>gi|89074857|ref|ZP_01161311.1| AcrB/AcrD/AcrF family protein [Photobacterium sp. SKA34]
gi|89049432|gb|EAR54994.1| AcrB/AcrD/AcrF family protein [Photobacterium sp. SKA34]
Length=1040
Score = 33.9 bits (76), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/52 (43%), Positives = 31/52 (60%), Gaps = 2/52 (3%)
Query 21 ERSIGLSEAVNELIRAGLTKRQVANRFQQQTYDMGEGIDYSNIGDAIETLDG 72
E +IG+SE N L + GLT QVAN QQ++ D+ GI + GD + +G
Sbjct 182 EIAIGVSE--NALRKYGLTFEQVANVVQQRSIDLPGGIIKAKDGDLLVRTNG 231
Lambda K H
0.314 0.130 0.344
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130617491818
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40