BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2550c Length=81 Score E Sequences producing significant alignments: (Bits) Value gi|15609687|ref|NP_217066.1| hypothetical protein Rv2550c [Mycob... 163 9e-39 gi|340627567|ref|YP_004746019.1| hypothetical protein MCAN_25921... 162 1e-38 gi|308369749|ref|ZP_07418927.2| antitoxin [Mycobacterium tubercu... 144 4e-33 gi|289762720|ref|ZP_06522098.1| conserved hypothetical protein [... 137 5e-31 gi|333980727|ref|YP_004518672.1| CopG-like domain-containing pro... 43.1 0.012 gi|297623628|ref|YP_003705062.1| CopG domain-containing protein ... 40.0 0.10 gi|147679034|ref|YP_001213249.1| hypothetical protein PTH_2699 [... 38.1 0.47 >gi|15609687|ref|NP_217066.1| hypothetical protein Rv2550c [Mycobacterium tuberculosis H37Rv] gi|15842087|ref|NP_337124.1| CopG family DNA-binding protein [Mycobacterium tuberculosis CDC1551] gi|31793733|ref|NP_856226.1| hypothetical protein Mb2580c [Mycobacterium bovis AF2122/97] 71 more sequence titlesLength=81 Score = 163 bits (412), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 81/81 (100%), Positives = 81/81 (100%), Gaps = 0/81 (0%) Query 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG Sbjct 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 Query 61 SFVGEADLSASVDDVVYGKHE 81 SFVGEADLSASVDDVVYGKHE Sbjct 61 SFVGEADLSASVDDVVYGKHE 81 >gi|340627567|ref|YP_004746019.1| hypothetical protein MCAN_25921 [Mycobacterium canettii CIPT 140010059] gi|340005757|emb|CCC44923.1| hypothetical protein MCAN_25921 [Mycobacterium canettii CIPT 140010059] Length=81 Score = 162 bits (411), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 80/81 (99%), Positives = 81/81 (100%), Gaps = 0/81 (0%) Query 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG Sbjct 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 Query 61 SFVGEADLSASVDDVVYGKHE 81 SFVGEADLSAS+DDVVYGKHE Sbjct 61 SFVGEADLSASIDDVVYGKHE 81 >gi|308369749|ref|ZP_07418927.2| antitoxin [Mycobacterium tuberculosis SUMu002] gi|308372333|ref|ZP_07428274.2| antitoxin [Mycobacterium tuberculosis SUMu004] gi|308326524|gb|EFP15375.1| antitoxin [Mycobacterium tuberculosis SUMu002] gi|308333561|gb|EFP22412.1| antitoxin [Mycobacterium tuberculosis SUMu004] Length=73 Score = 144 bits (363), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 72/73 (99%), Positives = 73/73 (100%), Gaps = 0/73 (0%) Query 9 VKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVGSFVGEADL 68 +KRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVGSFVGEADL Sbjct 1 MKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVGSFVGEADL 60 Query 69 SASVDDVVYGKHE 81 SASVDDVVYGKHE Sbjct 61 SASVDDVVYGKHE 73 >gi|289762720|ref|ZP_06522098.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289710226|gb|EFD74242.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] Length=79 Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 68/71 (96%), Positives = 68/71 (96%), Gaps = 0/71 (0%) Query 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG Sbjct 1 MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVG 60 Query 61 SFVGEADLSAS 71 SFVGEAD S Sbjct 61 SFVGEADCPRS 71 >gi|333980727|ref|YP_004518672.1| CopG-like domain-containing protein DNA-binding protein [Desulfotomaculum kuznetsovii DSM 6115] gi|333824208|gb|AEG16871.1| CopG-like domain-containing protein DNA-binding protein [Desulfotomaculum kuznetsovii DSM 6115] Length=84 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 25/52 (49%), Positives = 33/52 (64%), Gaps = 2/52 (3%) Query 9 VKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPD--PVDAF 58 ++R QIY+ D RAL EARR+ S A LIR+ V+EHL + G P +AF Sbjct 1 MQRTQIYLHPDQHRALLNEARRKGISLAELIRQIVSEHLERQGERKAPKEAF 52 >gi|297623628|ref|YP_003705062.1| CopG domain-containing protein DNA-binding domain-containing protein [Truepera radiovictrix DSM 17093] gi|297164808|gb|ADI14519.1| CopG domain protein DNA-binding domain protein [Truepera radiovictrix DSM 17093] Length=96 Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 43/75 (58%), Gaps = 2/75 (2%) Query 9 VKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHL-RQPGPDPVDAFVG-SFVGEA 66 +KR +Y+DE D LA AR++ KA L+RE ++ ++ RQ P P+ VG G Sbjct 1 MKRTTVYVDEKTDLELAHLARQQGRPKAELVREALSRYVERQRTPRPLPRSVGMGRSGLP 60 Query 67 DLSASVDDVVYGKHE 81 DL+ D+++ G +E Sbjct 61 DLAERTDELLGGLYE 75 >gi|147679034|ref|YP_001213249.1| hypothetical protein PTH_2699 [Pelotomaculum thermopropionicum SI] gi|146275131|dbj|BAF60880.1| hypothetical protein [Pelotomaculum thermopropionicum SI] Length=83 Score = 38.1 bits (87), Expect = 0.47, Method: Compositional matrix adjust. Identities = 24/66 (37%), Positives = 40/66 (61%), Gaps = 5/66 (7%) Query 12 LQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGP---DPVDAFVGSFVGE-AD 67 LQ+Y+D+ + + L + A+R++ ++A LIR+Y+ + L Q P DP +G G+ AD Sbjct 6 LQVYVDQALHQQLRLMAKRKKVTQAELIRKYLYQGL-QKEPDLHDPALDIIGLGTGKTAD 64 Query 68 LSASVD 73 LS D Sbjct 65 LSERHD 70 Lambda K H 0.322 0.138 0.395 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128409240708 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40