BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0298 Length=75 Score E Sequences producing significant alignments: (Bits) Value gi|15607439|ref|NP_214812.1| hypothetical protein Rv0298 [Mycoba... 148 3e-34 gi|296164017|ref|ZP_06846641.1| conserved hypothetical protein [... 99.0 2e-19 gi|108802484|ref|YP_642680.1| CopG-like DNA-binding protein [Myc... 95.9 2e-18 gi|315037978|ref|YP_004031546.1| hypothetical protein LA2_03890 ... 35.8 2.3 gi|218203957|ref|YP_002364810.1| hypothetical protein PCC8801_45... 33.5 9.5 >gi|15607439|ref|NP_214812.1| hypothetical protein Rv0298 [Mycobacterium tuberculosis H37Rv] gi|15839684|ref|NP_334721.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551] gi|31791477|ref|NP_853970.1| hypothetical protein Mb0306 [Mycobacterium bovis AF2122/97] 75 more sequence titlesLength=75 Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 75/75 (100%), Positives = 75/75 (100%), Gaps = 0/75 (0%) Query 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID 60 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID Sbjct 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID 60 Query 61 AYAQRVYQANRAAGS 75 AYAQRVYQANRAAGS Sbjct 61 AYAQRVYQANRAAGS 75 >gi|296164017|ref|ZP_06846641.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900641|gb|EFG80023.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=75 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 54/74 (73%), Positives = 64/74 (87%), Gaps = 0/74 (0%) Query 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID 60 M K K+SV ++ AVLAA DADA+AAGLNRSEMIE+AL NEHLR++L +YT TVP LDID Sbjct 1 MAKAKVSVMIEEAVLAAADADAQAAGLNRSEMIERALHNEHLRISLENYTTHTVPTLDID 60 Query 61 AYAQRVYQANRAAG 74 AYA++VYQANRAAG Sbjct 61 AYAEKVYQANRAAG 74 >gi|108802484|ref|YP_642680.1| CopG-like DNA-binding protein [Mycobacterium sp. MCS] gi|119855312|ref|YP_935915.1| CopG/DNA-binding domain-containing protein [Mycobacterium sp. KMS] gi|108772903|gb|ABG11624.1| CopG-like DNA-binding protein [Mycobacterium sp. MCS] gi|119698029|gb|ABL95100.1| CopG domain protein DNA-binding domain protein [Mycobacterium sp. KMS] Length=75 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 53/74 (72%), Positives = 65/74 (88%), Gaps = 0/74 (0%) Query 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID 60 M K+K+SVTVD VLAA DADA+A G+NRSE+IEQALR+EHLR+AL++YTA TVPAL+ID Sbjct 1 MAKDKVSVTVDTDVLAAADADAKAIGMNRSELIEQALRHEHLRLALQNYTAHTVPALNID 60 Query 61 AYAQRVYQANRAAG 74 YA ++YQANRAA Sbjct 61 DYAAKIYQANRAAN 74 >gi|315037978|ref|YP_004031546.1| hypothetical protein LA2_03890 [Lactobacillus amylovorus GRL 1112] gi|312276111|gb|ADQ58751.1| hypothetical protein LA2_03890 [Lactobacillus amylovorus GRL 1112] Length=109 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 19/65 (30%), Positives = 33/65 (51%), Gaps = 11/65 (16%) Query 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALR-----------NEHLRVALRDY 49 MT KI++ +DA + I A+A+ LN++EM+E ++ N L+ +R+Y Sbjct 1 MTTSKIAIKIDAEIYERISAEAKKNNLNKTEMLEFIVKDYFHNQTIEEGNTELKSIIRNY 60 Query 50 TAKTV 54 V Sbjct 61 NENLV 65 >gi|218203957|ref|YP_002364810.1| hypothetical protein PCC8801_4534 [Cyanothece sp. PCC 8801] gi|218169708|gb|ACK68443.1| conserved hypothetical protein [Cyanothece sp. PCC 8801] Length=78 Score = 33.5 bits (75), Expect = 9.5, Method: Compositional matrix adjust. Identities = 22/74 (30%), Positives = 36/74 (49%), Gaps = 3/74 (4%) Query 1 MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDID 60 M K+KI+VT+D ++ +D + AG NRSE + + L +V A L+ Sbjct 1 MGKQKIAVTLDKTLVGFLD---QVAGGNRSEYLNKLLIEHREKVLKAQLIAALAEELEDP 57 Query 61 AYAQRVYQANRAAG 74 Y Q + + + AG Sbjct 58 GYPQELLEWDLVAG 71 Lambda K H 0.316 0.125 0.325 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131059142040 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40