BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv0662c
Length=84

                                                                     Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15840065|ref|NP_335102.1| CopG family DNA-binding protein [My...  163     8e-39
gi|15607802|ref|NP_215176.1| hypothetical protein Rv0662c [Mycob...  163     9e-39
gi|308231610|ref|ZP_07413109.2| antitoxin [Mycobacterium tubercu...  161     3e-38
gi|340625680|ref|YP_004744132.1| hypothetical protein MCAN_06611...  161     3e-38
gi|167967492|ref|ZP_02549769.1| hypothetical protein MtubH3_0543...  157     5e-37
gi|333989223|ref|YP_004521837.1| CopG family transcriptional reg...  117     7e-25
gi|269929124|ref|YP_003321445.1| hypothetical protein Sthe_3223 ...  47.8    5e-04
gi|258593756|emb|CBE70097.1| protein of unknown function [NC10 b...  41.2    0.053
gi|256824110|ref|YP_003148070.1| hypothetical protein Ksed_02200...  38.9    0.27
gi|169830612|ref|YP_001716594.1| CopG/DNA-binding domain-contain...  37.0    1.1
gi|344174770|emb|CCA86581.1| conserved hypothetical protein, WD-...  35.8    2.0
gi|302882309|ref|XP_003040065.1| hypothetical protein NECHADRAFT...  35.8    2.3
gi|284047181|ref|YP_003397521.1| hypothetical protein Cwoe_5745 ...  35.4    3.0
gi|225874393|ref|YP_002755852.1| hypothetical protein ACP_2836 [...  35.0    3.7
gi|37521109|ref|NP_924486.1| hypothetical protein gsl1540 [Gloeo...  34.3    7.0
gi|290954992|ref|YP_003486174.1| MerR family transcriptional reg...  34.3    7.2

>gi|15840065|ref|NP_335102.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551]
 gi|254230991|ref|ZP_04924318.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
 gi|13880212|gb|AAK44916.1| DNA-binding protein, CopG family [Mycobacterium tuberculosis CDC1551]
 gi|124600050|gb|EAY59060.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=126

 Score = 163 bits (413), Expect = 8e-39, Method: Compositional matrix adjust.
 Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)

Query  1    MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  60
            MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct  43   MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  102

Query  61   AADMSVPEPRELKQELEALRARRG  84
            AADMSVPEPRELKQELEALRARRG
Sbjct  103  AADMSVPEPRELKQELEALRARRG  126

>gi|15607802|ref|NP_215176.1| hypothetical protein Rv0662c [Mycobacterium tuberculosis H37Rv]
 gi|31791846|ref|NP_854339.1| hypothetical protein Mb0681c [Mycobacterium bovis AF2122/97]
 gi|121636583|ref|YP_976806.1| hypothetical protein BCG_0711c [Mycobacterium bovis BCG str. Pasteur 1173P2]
 45 more sequence titles
Length=122

 Score = 163 bits (412), Expect = 9e-39, Method: Compositional matrix adjust.
 Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)

Query  1    MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  60
            MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct  39   MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  98

Query  61   AADMSVPEPRELKQELEALRARRG  84
            AADMSVPEPRELKQELEALRARRG
Sbjct  99   AADMSVPEPRELKQELEALRARRG  122

>gi|308231610|ref|ZP_07413109.2| antitoxin [Mycobacterium tuberculosis SUMu001]
 gi|308369986|ref|ZP_07419832.2| antitoxin [Mycobacterium tuberculosis SUMu002]
 gi|308370470|ref|ZP_07421634.2| antitoxin [Mycobacterium tuberculosis SUMu003]
 19 more sequence titles
Length=84

 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
 Identities = 84/84 (100%), Positives = 84/84 (100%), Gaps = 0/84 (0%)

Query  1    MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  60
            MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct  1    MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  60

Query  61   AADMSVPEPRELKQELEALRARRG  84
            AADMSVPEPRELKQELEALRARRG
Sbjct  61   AADMSVPEPRELKQELEALRARRG  84

>gi|340625680|ref|YP_004744132.1| hypothetical protein MCAN_06611 [Mycobacterium canettii CIPT 140010059]
 gi|340003870|emb|CCC43001.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=122

 Score = 161 bits (407), Expect = 3e-38, Method: Compositional matrix adjust.
 Identities = 83/84 (99%), Positives = 83/84 (99%), Gaps = 0/84 (0%)

Query  1    MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  60
            MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD
Sbjct  39   MSMRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLD  98

Query  61   AADMSVPEPRELKQELEALRARRG  84
            AADM VPEPRELKQELEALRARRG
Sbjct  99   AADMPVPEPRELKQELEALRARRG  122

>gi|167967492|ref|ZP_02549769.1| hypothetical protein MtubH3_05432 [Mycobacterium tuberculosis H37Ra]
 gi|254549622|ref|ZP_05140069.1| CopG family DNA-binding protein [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
 gi|308373543|ref|ZP_07432749.2| antitoxin [Mycobacterium tuberculosis SUMu005]
 gi|308377485|ref|ZP_07479344.2| antitoxin [Mycobacterium tuberculosis SUMu009]
 gi|308337212|gb|EFP26063.1| antitoxin [Mycobacterium tuberculosis SUMu005]
 gi|308355535|gb|EFP44386.1| antitoxin [Mycobacterium tuberculosis SUMu009]
Length=82

 Score = 157 bits (397), Expect = 5e-37, Method: Compositional matrix adjust.
 Identities = 82/82 (100%), Positives = 82/82 (100%), Gaps = 0/82 (0%)

Query  3    MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA  62
            MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA
Sbjct  1    MRLAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA  60

Query  63   DMSVPEPRELKQELEALRARRG  84
            DMSVPEPRELKQELEALRARRG
Sbjct  61   DMSVPEPRELKQELEALRARRG  82

>gi|333989223|ref|YP_004521837.1| CopG family transcriptional regulator [Mycobacterium sp. JDM601]
 gi|333485191|gb|AEF34583.1| CopG family DNA-binding protein [Mycobacterium sp. JDM601]
Length=80

 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
 Identities = 61/80 (77%), Positives = 67/80 (84%), Gaps = 0/80 (0%)

Query  5    LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM  64
            + HRLQILLDDE HRR+TA ARERGV VA+VVREAIDRGL  P  RRKSAG+RLLDA DM
Sbjct  1    MEHRLQILLDDERHRRLTAAARERGVSVASVVREAIDRGLAGPVDRRKSAGQRLLDAPDM  60

Query  65   SVPEPRELKQELEALRARRG  84
             VP+P ELKQEL+ LR RRG
Sbjct  61   PVPDPAELKQELDELRGRRG  80

>gi|269929124|ref|YP_003321445.1| hypothetical protein Sthe_3223 [Sphaerobacter thermophilus DSM 20745]
 gi|269788481|gb|ACZ40623.1| hypothetical protein Sthe_3223 [Sphaerobacter thermophilus DSM 20745]
Length=106

 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust.
 Identities = 33/77 (43%), Positives = 51/77 (67%), Gaps = 2/77 (2%)

Query  5    LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM  64
            L HR+Q+LLDDE + ++   AR RG+ +  V+REAIDR L + +  R++A   +L A  M
Sbjct  8    LDHRVQVLLDDERYEKVAREARRRGISIGAVIREAIDR-LPTDSEARRAAIEAILAAEPM  66

Query  65   SVP-EPRELKQELEALR  80
             VP +P +L++EL+  R
Sbjct  67   PVPDDPADLRRELDTAR  83

>gi|258593756|emb|CBE70097.1| protein of unknown function [NC10 bacterium 'Dutch sediment']
Length=95

 Score = 41.2 bits (95), Expect = 0.053, Method: Compositional matrix adjust.
 Identities = 28/76 (37%), Positives = 47/76 (62%), Gaps = 5/76 (6%)

Query  5    LAHRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDA-AD  63
            L  ++QILL DE HR +  + + RG P++ ++REA+   L+  A  R++A R+  +  A
Sbjct  4    LTQKVQILLTDEQHRALLKLVKARGKPLSVLLREAVVDKLLIEA--RQAAKRKAFEEIAA  61

Query  64   MSVP--EPRELKQELE  77
            MS+P  E  E++ E+E
Sbjct  62   MSLPVAEWPEMEDEIE  77

>gi|256824110|ref|YP_003148070.1| hypothetical protein Ksed_02200 [Kytococcus sedentarius DSM 20547]
 gi|256687503|gb|ACV05305.1| hypothetical protein Ksed_02200 [Kytococcus sedentarius DSM 20547]
Length=88

 Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
 Identities = 32/73 (44%), Positives = 39/73 (54%), Gaps = 11/73 (15%)

Query  9    LQILLDDECHRRITAV---ARERGVPVATVVREAIDRGLV-SPAGRRKSAGRRLL----D  60
            +Q+LLD+   RRIT +   A ERGV V+TVVR+AID         RR  A R  L    D
Sbjct  1    MQLLLDE---RRITLLRERATERGVSVSTVVRDAIDASFEDDDMARRAQAARDFLALTAD  57

Query  61   AADMSVPEPRELK  73
             A     EP +LK
Sbjct  58   NARHEAAEPADLK  70

>gi|169830612|ref|YP_001716594.1| CopG/DNA-binding domain-containing protein [Candidatus Desulforudis audaxviator MP104C]
 gi|169637456|gb|ACA58962.1| CopG domain protein DNA-binding domain protein [Candidatus Desulforudis audaxviator MP104C]
Length=89

 Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
 Identities = 28/79 (36%), Positives = 41/79 (52%), Gaps = 9/79 (11%)

Query  8    RLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAG-RRKSAGRRLLDAA----  62
            R Q+ L +E  RRI  +A +RGV +A ++REA+D    S A   R+   RR + AA
Sbjct  10   RTQVQLTEEQVRRIQQLAADRGVSMAQLIREAVDLYTCSNAALSREEQVRRAIAAAGRFR  69

Query  63   ----DMSVPEPRELKQELE  77
                D+SV   + L +  E
Sbjct  70   SGLQDLSVEHDKYLAEAFE  88

>gi|344174770|emb|CCA86581.1| conserved hypothetical protein, WD-40 repeat-containing protein [Ralstonia syzygii R24]
Length=1214

 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
 Identities = 23/70 (33%), Positives = 41/70 (59%), Gaps = 7/70 (10%)

Query  5    LAHRLQILLDDECHRRIT-AVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAAD  63
            L ++ +  L D C + ++ A A+    P A+ +R+ ID+ L++P+G R  A   ++D AD
Sbjct  281  LNYKPETALLDYCDKALSEATAKP---PRASALRDWIDQRLLTPSGLRAPA---MIDPAD  334

Query  64   MSVPEPRELK  73
               P P+EL
Sbjct  335  GNCPTPQELS  344

>gi|302882309|ref|XP_003040065.1| hypothetical protein NECHADRAFT_105463 [Nectria haematococca mpVI 77-13-4]
 gi|256720932|gb|EEU34352.1| hypothetical protein NECHADRAFT_105463 [Nectria haematococca mpVI 77-13-4]
Length=543

 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust.
 Identities = 18/54 (34%), Positives = 28/54 (52%), Gaps = 4/54 (7%)

Query  18   HRRITAVAR----ERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADMSVP  67
            H+R+T + R    + G P+   V E +   LV PAG  +   + L D AD ++P
Sbjct  68   HKRVTHLVRFLVNQHGYPLDAFVYECMMDALVDPAGSSRGVEKLLDDMADQNIP  121

>gi|284047181|ref|YP_003397521.1| hypothetical protein Cwoe_5745 [Conexibacter woesei DSM 14684]
 gi|283951402|gb|ADB54146.1| hypothetical protein Cwoe_5745 [Conexibacter woesei DSM 14684]
Length=107

 Score = 35.4 bits (80), Expect = 3.0, Method: Compositional matrix adjust.
 Identities = 20/37 (55%), Positives = 25/37 (68%), Gaps = 0/37 (0%)

Query  5    LAHRLQILLDDECHRRITAVARERGVPVATVVREAID  41
            L  RLQIL+     +R+   ARERGV V ++VREAID
Sbjct  19   LNERLQILVTPLQRQRLEREARERGVSVGSLVREAID  55

>gi|225874393|ref|YP_002755852.1| hypothetical protein ACP_2836 [Acidobacterium capsulatum ATCC 51196]
 gi|225793202|gb|ACO33292.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC 51196]
Length=74

 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust.
 Identities = 21/71 (30%), Positives = 35/71 (50%), Gaps = 0/71 (0%)

Query  8    RLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADMSVP  67
            R  + LDD+    +   A  R +P+   V E + R +V PA  R+  G  + D    S P
Sbjct  2    RTTLALDDDVFEEVRSYAEARDLPLGRAVTELLRRAMVQPAPTRRVNGLLVFDLPADSSP  61

Query  68   EPRELKQELEA  78
               E+ ++LE+
Sbjct  62   VTDEMVRDLES  72

>gi|37521109|ref|NP_924486.1| hypothetical protein gsl1540 [Gloeobacter violaceus PCC 7421]
 gi|35212105|dbj|BAC89481.1| gsl1540 [Gloeobacter violaceus PCC 7421]
Length=99

 Score = 34.3 bits (77), Expect = 7.0, Method: Compositional matrix adjust.
 Identities = 15/34 (45%), Positives = 25/34 (74%), Gaps = 0/34 (0%)

Query  7    HRLQILLDDECHRRITAVARERGVPVATVVREAI  40
            +R QILL+ E HR++  +ARE+G  ++ V+RE +
Sbjct  4    YRAQILLEPEQHRQLAEIAREQGRTLSEVMREIV  37

>gi|290954992|ref|YP_003486174.1| MerR family transcriptional regulator [Streptomyces scabiei 87.22]
 gi|260644518|emb|CBG67603.1| putative MerR-family transcriptional regulator [Streptomyces scabiei 87.22]
Length=540

 Score = 34.3 bits (77), Expect = 7.2, Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (64%), Gaps = 1/38 (2%)

Query  21   ITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRL  58
            I AVAR+ G+PV  V+R   DRG+V+P GR     RR
Sbjct  22   IGAVARQVGLPV-KVIRHWSDRGVVAPVGRTAGGYRRY  58

Lambda      K        H
   0.323    0.136    0.377

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 131923386480

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
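
The statistical parameters in the footer relate each raw score (the number in parentheses after "bits") to the bit score and, roughly, to the Expect value. The sketch below assumes the standard Karlin-Altschul relations, S' = (Lambda * S - ln K) / ln 2 and E = N * 2^(-S'), with the gapped Lambda and K and the effective search space N copied from this report. The function names are illustrative and are not part of BLAST. Because the report prints rounded parameters and uses compositional matrix adjustment, the computed E-values should only be expected to land near the reported ones, not to reproduce them exactly.

```python
import math

# Values copied from the report footer (gapped Karlin-Altschul parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 131_923_386_480


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value: search space times 2^(-bit score).

    This ignores any per-subject adjustments BLAST may apply, so treat the
    result as an order-of-magnitude check against the report.
    """
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from three hits in the report: 413, 112 and 95.
    for raw in (413, 112, 95):
        print(f"raw={raw:4d}  bits={bit_score(raw):7.2f}  E~{expect_value(raw):.1e}")
```

Running it on the raw scores 413, 112 and 95 gives bit scores consistent with the 163, 47.8 and 41.2 bits shown in the hit table, and E-values of the same order as the reported 8e-39, 5e-04 and 0.053.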