BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2063 Length=77 Score E Sequences producing significant alignments: (Bits) Value gi|15841551|ref|NP_336588.1| CopG family DNA-binding protein [My... 148 2e-34 gi|340627074|ref|YP_004745526.1| hypothetical protein MCAN_20861... 146 9e-34 gi|254551089|ref|ZP_05141536.1| antitoxin [Mycobacterium tubercu... 99.4 1e-19 gi|326773556|ref|ZP_08232839.1| hypothetical protein HMPREF0059_... 48.1 4e-04 gi|325066714|ref|ZP_08125387.1| hypothetical protein AoriK_02789... 47.0 0.001 gi|256825974|ref|YP_003149934.1| hypothetical protein Ksed_21840... 43.1 0.015 gi|336179273|ref|YP_004584648.1| hypothetical protein FsymDg_343... 38.5 0.31 gi|30409694|dbj|BAC76305.1| hypothetical protein [Streptomyces v... 38.1 0.40 gi|21234221|ref|NP_639832.1| hypothetical protein SCP1.225 [Stre... 38.1 0.44 gi|21234065|ref|NP_639642.1| hypothetical protein SCP1.67c [Stre... 35.0 3.6 gi|302872669|ref|YP_003841305.1| hypothetical protein COB47_2060... 35.0 3.7 gi|269218053|ref|ZP_06161907.1| toxin-antitoxin system protein [... 34.7 4.7 gi|312794277|ref|YP_004027200.1| hypothetical protein Calkr_2121... 34.7 5.0 gi|237653406|ref|YP_002889720.1| CopG family transcriptional reg... 33.9 7.6 gi|56476699|ref|YP_158288.1| hypothetical protein ebD59 [Aromato... 33.9 7.8 >gi|15841551|ref|NP_336588.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551] gi|31793246|ref|NP_855739.1| hypothetical protein Mb2089 [Mycobacterium bovis AF2122/97] gi|57116939|ref|YP_177657.1| hypothetical protein Rv2063 [Mycobacterium tuberculosis H37Rv] 76 more sequence titlesLength=77 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 77/77 (100%), Positives = 77/77 (100%), Gaps = 0/77 (0%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA Sbjct 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 Query 61 VRDEDREWEGTVGDGLG 77 VRDEDREWEGTVGDGLG Sbjct 61 VRDEDREWEGTVGDGLG 77 >gi|340627074|ref|YP_004745526.1| hypothetical protein MCAN_20861 [Mycobacterium canettii CIPT 140010059] gi|340005264|emb|CCC44418.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059] Length=77 Score = 146 bits (369), Expect = 9e-34, Method: Compositional matrix adjust. Identities = 76/77 (99%), Positives = 76/77 (99%), Gaps = 0/77 (0%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 MSTSTTIRVS QTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA Sbjct 1 MSTSTTIRVSIQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 Query 61 VRDEDREWEGTVGDGLG 77 VRDEDREWEGTVGDGLG Sbjct 61 VRDEDREWEGTVGDGLG 77 >gi|254551089|ref|ZP_05141536.1| antitoxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] Length=50 Score = 99.4 bits (246), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 50/50 (100%), Positives = 50/50 (100%), Gaps = 0/50 (0%) Query 28 MSALLTELAAQAERQAIFRAEREASHAETTTQAVRDEDREWEGTVGDGLG 77 MSALLTELAAQAERQAIFRAEREASHAETTTQAVRDEDREWEGTVGDGLG Sbjct 1 MSALLTELAAQAERQAIFRAEREASHAETTTQAVRDEDREWEGTVGDGLG 50 >gi|326773556|ref|ZP_08232839.1| hypothetical protein HMPREF0059_01960 [Actinomyces viscosus C505] gi|326636786|gb|EGE37689.1| hypothetical protein HMPREF0059_01960 [Actinomyces viscosus C505] Length=74 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 31/70 (45%), Positives = 38/70 (55%), Gaps = 0/70 (0%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 MS+ TTI+V TRD L A A ERG++M A L EL +A R+ F R A A + Sbjct 1 MSSMTTIKVERSTRDGLRALASERGVTMDAALKELLEEAARERRFAEVRRAMEAHPPDET 60 Query 61 VRDEDREWEG 70 E REWE Sbjct 61 YLKELREWES 70 >gi|325066714|ref|ZP_08125387.1| hypothetical protein AoriK_02789 [Actinomyces oris K20] Length=89 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/70 (43%), Positives = 38/70 (55%), Gaps = 0/70 (0%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 M++ TTI+V TRD L A A ERG++M A L EL +A R+ F R A A + Sbjct 16 MTSMTTIKVERSTRDGLRALASERGVTMDAALKELLEEAARERRFAEVRRAMEAHPPDET 75 Query 61 VRDEDREWEG 70 E REWE Sbjct 76 YLKELREWES 85 >gi|256825974|ref|YP_003149934.1| hypothetical protein Ksed_21840 [Kytococcus sedentarius DSM 20547] gi|256689367|gb|ACV07169.1| hypothetical protein Ksed_21840 [Kytococcus sedentarius DSM 20547] Length=73 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 29/70 (42%), Positives = 36/70 (52%), Gaps = 0/70 (0%) Query 5 TTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQAVRDE 64 TTI+V RDRL A RG SM A + L + +R+A F A REA A A R E Sbjct 4 TTIKVDHDLRDRLRTAAAARGQSMGAHIAALLDEEDRRARFAAVREAMAASPPDAAYRVE 63 Query 65 DREWEGTVGD 74 EW+ G+ Sbjct 64 AEEWQADTGE 73 >gi|336179273|ref|YP_004584648.1| hypothetical protein FsymDg_3435 [Frankia symbiont of Datisca glomerata] gi|334860253|gb|AEH10727.1| hypothetical protein FsymDg_3435 [Frankia symbiont of Datisca glomerata] Length=95 Score = 38.5 bits (88), Expect = 0.31, Method: Compositional matrix adjust. Identities = 28/81 (35%), Positives = 43/81 (54%), Gaps = 4/81 (4%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFR-AEREASHAETTTQ 59 M TT+++ RDRL A A G ++ A L L +AE + +R A++ A+ + ++ Sbjct 1 MVAVTTMKLPVDVRDRLMALATSHGRTLGAELAALVEEAEERNWWRDAKQAAARLQADSE 60 Query 60 AVRDEDRE---WEGTVGDGLG 77 D RE W+ TV DGLG Sbjct 61 RWEDYLREADGWDTTVSDGLG 81 >gi|30409694|dbj|BAC76305.1| hypothetical protein [Streptomyces violaceoruber] Length=117 Score = 38.1 bits (87), Expect = 0.40, Method: Compositional matrix adjust. Identities = 24/47 (52%), Positives = 30/47 (64%), Gaps = 2/47 (4%) Query 5 TTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIF--RAER 49 TTI++ +TRD+LAA A R +SM AL+ E AA A A RAER Sbjct 30 TTIKIGAETRDKLAALAEARNMSMRALIEEFAATALTPAQLRERAER 76 >gi|21234221|ref|NP_639832.1| hypothetical protein SCP1.225 [Streptomyces coelicolor A3(2)] gi|13620708|emb|CAC36748.1| hypothetical protein [Streptomyces coelicolor A3(2)] Length=117 Score = 38.1 bits (87), Expect = 0.44, Method: Compositional matrix adjust. Identities = 24/47 (52%), Positives = 30/47 (64%), Gaps = 2/47 (4%) Query 5 TTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIF--RAER 49 TTI++ +TRD+LAA A R +SM AL+ E AA A A RAER Sbjct 30 TTIKIGAETRDKLAALAEARNMSMRALIEEFAATALTPAQLRERAER 76 >gi|21234065|ref|NP_639642.1| hypothetical protein SCP1.67c [Streptomyces coelicolor A3(2)] gi|21234285|ref|NP_639896.1| hypothetical protein SCP1.286 [Streptomyces coelicolor A3(2)] gi|13620551|emb|CAC36589.1| hypothetical protein [Streptomyces coelicolor A3(2)] gi|13620772|emb|CAC36812.1| hypothetical protein [Streptomyces coelicolor A3(2)] Length=84 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. Identities = 18/37 (49%), Positives = 24/37 (65%), Gaps = 0/37 (0%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAA 37 MS T+I+ S RDRL A ERG +++ LL ELA+ Sbjct 1 MSKPTSIKTSEDVRDRLRVLAEERGTTITELLEELAS 37 >gi|302872669|ref|YP_003841305.1| hypothetical protein COB47_2060 [Caldicellulosiruptor obsidiansis OB47] gi|302575528|gb|ADL43319.1| hypothetical protein COB47_2060 [Caldicellulosiruptor obsidiansis OB47] Length=122 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 22/80 (28%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 M+ +TT+RV +T + L +++ M ++ E A+ +R+ + A EA A Sbjct 42 MNKTTTVRVDIETYESLKKLSQQLNQPMQKVIQEALAEYKRKILLSATAEAYAALKENSK 101 Query 61 VRDEDRE----WEGTVGDGL 76 + E+ E WE T+ DG+ Sbjct 102 LWQEEIEERQLWENTLQDGI 121 >gi|269218053|ref|ZP_06161907.1| toxin-antitoxin system protein [Actinomyces sp. oral taxon 848 str. F0332] gi|269212988|gb|EEZ79328.1| toxin-antitoxin system protein [Actinomyces sp. oral taxon 848 str. F0332] Length=71 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 23/65 (36%), Positives = 34/65 (53%), Gaps = 2/65 (3%) Query 5 TTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQAVRDE 64 TTI+V ++ RDR+ ARE+G+S ++ + L ER+ R E S A DE Sbjct 3 TTIKVPSELRDRINKGAREQGLSAASFIERLLDGYERRQ--RMEAFGSAIREADAAYWDE 60 Query 65 DREWE 69 R W+ Sbjct 61 FRAWD 65 >gi|312794277|ref|YP_004027200.1| hypothetical protein Calkr_2121 [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181417|gb|ADQ41587.1| hypothetical protein Calkr_2121 [Caldicellulosiruptor kristjanssonii 177R1B] Length=82 Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust. Identities = 23/80 (29%), Positives = 39/80 (49%), Gaps = 4/80 (5%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQA 60 M+ +TT+RV +T + L + + M ++ E A+ +R+ I A EA A Sbjct 1 MNKTTTVRVDIETYEGLKKLSEQLNQPMQKIIQEALAEYKRKIILSATAEAFAALKENSK 60 Query 61 VRDEDRE----WEGTVGDGL 76 + E+ E WE T+ DG+ Sbjct 61 LWKEEIEERQLWENTLQDGI 80 >gi|237653406|ref|YP_002889720.1| CopG family transcriptional regulator [Thauera sp. MZ1T] gi|237624653|gb|ACR01343.1| CopG domain protein DNA-binding domain protein [Thauera sp. MZ1T] Length=73 Score = 33.9 bits (76), Expect = 7.6, Method: Compositional matrix adjust. Identities = 22/39 (57%), Positives = 25/39 (65%), Gaps = 1/39 (2%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQA 39 MST TIR+ T RL A AR RG SM+ L+ EL AQA Sbjct 1 MST-LTIRLPDDTAARLKALARSRGQSMNKLIEELGAQA 38 >gi|56476699|ref|YP_158288.1| hypothetical protein ebD59 [Aromatoleum aromaticum EbN1] gi|56312742|emb|CAI07387.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1] Length=76 Score = 33.9 bits (76), Expect = 7.8, Method: Compositional matrix adjust. Identities = 19/39 (49%), Positives = 28/39 (72%), Gaps = 1/39 (2%) Query 1 MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQA 39 MST TIR+ T +RL + AR RG+S++ L+ E++AQA Sbjct 1 MST-LTIRLPDDTAERLKSLARSRGLSVNKLVEEMSAQA 38 Lambda K H 0.310 0.120 0.315 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130175841596 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40