BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0550c Length=88 Score E Sequences producing significant alignments: (Bits) Value gi|15607690|ref|NP_215064.1| hypothetical protein Rv0550c [Mycob... 175 2e-42 gi|167968377|ref|ZP_02550654.1| hypothetical protein MtubH3_1021... 143 7e-33 gi|254549505|ref|ZP_05139952.1| antitoxin [Mycobacterium tubercu... 132 2e-29 gi|312195503|ref|YP_004015564.1| hypothetical protein FraEuI1c_1... 43.9 0.008 gi|229494705|ref|ZP_04388463.1| Clp amino domain protein [Rhodoc... 42.4 0.021 gi|336176491|ref|YP_004581866.1| hypothetical protein FsymDg_038... 41.6 0.045 gi|238061789|ref|ZP_04606498.1| endopeptidase Clp [Micromonospor... 38.5 0.30 gi|262203354|ref|YP_003274562.1| Clp domain-containing protein [... 38.5 0.33 gi|271968012|ref|YP_003342208.1| ATPase with chaperone activity ... 36.6 1.2 gi|256376119|ref|YP_003099779.1| Clp domain-containing protein [... 35.4 2.9 gi|290474511|ref|YP_003467391.1| hypothetical protein XBJ1_1477 ... 35.4 3.0 gi|300721163|ref|YP_003710431.1| CcdA protein [Xenorhabdus nemat... 34.7 4.3 gi|302555461|ref|ZP_07307803.1| anhydro-N-acetylmuramic acid kin... 34.7 4.4 gi|284988789|ref|YP_003407343.1| hypothetical protein Gobs_0166 ... 34.3 6.3 >gi|15607690|ref|NP_215064.1| hypothetical protein Rv0550c [Mycobacterium tuberculosis H37Rv] gi|15839947|ref|NP_334984.1| hypothetical protein MT0575 [Mycobacterium tuberculosis CDC1551] gi|31791732|ref|NP_854225.1| hypothetical protein Mb0565c [Mycobacterium bovis AF2122/97] 59 more sequence titlesLength=88 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 87/88 (99%), Positives = 88/88 (100%), Gaps = 0/88 (0%) Query 1 LLSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWL 60 +LSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWL Sbjct 1 MLSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWL 60 Query 61 EGLEPRSTGARHDDVLGAIDAARDEFEA 88 EGLEPRSTGARHDDVLGAIDAARDEFEA Sbjct 61 EGLEPRSTGARHDDVLGAIDAARDEFEA 88 >gi|167968377|ref|ZP_02550654.1| hypothetical protein MtubH3_10219 [Mycobacterium tuberculosis H37Ra] gi|254230893|ref|ZP_04924220.1| hypothetical protein TBCG_00545 [Mycobacterium tuberculosis C] gi|308371704|ref|ZP_07425883.2| antitoxin [Mycobacterium tuberculosis SUMu004] gi|308395797|ref|ZP_07669349.1| antitoxin [Mycobacterium tuberculosis SUMu012] gi|124599952|gb|EAY58962.1| hypothetical protein TBCG_00545 [Mycobacterium tuberculosis C] gi|308335761|gb|EFP24612.1| antitoxin [Mycobacterium tuberculosis SUMu004] gi|308367246|gb|EFP56097.1| antitoxin [Mycobacterium tuberculosis SUMu012] gi|323721024|gb|EGB30088.1| antitoxin [Mycobacterium tuberculosis CDC1551A] Length=72 Score = 143 bits (361), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 72/72 (100%), Positives = 72/72 (100%), Gaps = 0/72 (0%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVL 76 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVL Sbjct 1 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVL 60 Query 77 GAIDAARDEFEA 88 GAIDAARDEFEA Sbjct 61 GAIDAARDEFEA 72 >gi|254549505|ref|ZP_05139952.1| antitoxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] Length=67 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 66/67 (99%), Positives = 67/67 (100%), Gaps = 0/67 (0%) Query 22 VYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVLGAIDA 81 +YVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVLGAIDA Sbjct 1 MYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVLGAIDA 60 Query 82 ARDEFEA 88 ARDEFEA Sbjct 61 ARDEFEA 67 >gi|312195503|ref|YP_004015564.1| hypothetical protein FraEuI1c_1636 [Frankia sp. EuI1c] gi|311226839|gb|ADP79694.1| hypothetical protein FraEuI1c_1636 [Frankia sp. EuI1c] Length=88 Score = 43.9 bits (102), Expect = 0.008, Method: Compositional matrix adjust. Identities = 27/54 (50%), Positives = 33/54 (62%), Gaps = 4/54 (7%) Query 15 VCMARLNVYVPDELAERARARGLNVSALTQAAISAELENSAT----DAWLEGLE 64 + MAR+ + VPDEL RA A GLNVS +T A+ ELE A DA+L LE Sbjct 9 LTMARVTITVPDELLARATAAGLNVSHVTAVALVEELERQAKRAELDAYLAELE 62 >gi|229494705|ref|ZP_04388463.1| Clp amino domain protein [Rhodococcus erythropolis SK121] gi|229318372|gb|EEN84235.1| Clp amino domain protein [Rhodococcus erythropolis SK121] Length=233 Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust. Identities = 19/42 (46%), Positives = 28/42 (67%), Gaps = 0/42 (0%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDA 58 M ++N+Y+PD+LA RARGL VSA+ Q A+ L+ D+ Sbjct 1 MPKVNIYLPDDLAAAVRARGLPVSAICQMALRGTLDREIPDS 42 >gi|336176491|ref|YP_004581866.1| hypothetical protein FsymDg_0383 [Frankia symbiont of Datisca glomerata] gi|334857471|gb|AEH07945.1| hypothetical protein FsymDg_0383 [Frankia symbiont of Datisca glomerata] Length=85 Score = 41.6 bits (96), Expect = 0.045, Method: Compositional matrix adjust. Identities = 25/52 (49%), Positives = 34/52 (66%), Gaps = 4/52 (7%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSAT----DAWLEGLE 64 MAR+N+ V DEL + ARA GLN+S L AA++ EL+ A DA+L L+ Sbjct 3 MARVNITVSDELMDSARAAGLNISRLATAALAEELDRRAKIAELDAYLSELD 54 >gi|238061789|ref|ZP_04606498.1| endopeptidase Clp [Micromonospora sp. ATCC 39149] gi|237883600|gb|EEP72428.1| endopeptidase Clp [Micromonospora sp. ATCC 39149] Length=265 Score = 38.5 bits (88), Expect = 0.30, Method: Compositional matrix adjust. Identities = 17/33 (52%), Positives = 24/33 (73%), Gaps = 0/33 (0%) Query 15 VCMARLNVYVPDELAERARARGLNVSALTQAAI 47 V M ++NVY+PDELAE + G+ VSA+ Q A+ Sbjct 13 VLMPKINVYLPDELAEAVKETGVPVSAICQRAL 45 >gi|262203354|ref|YP_003274562.1| Clp domain-containing protein [Gordonia bronchialis DSM 43247] gi|262086701|gb|ACY22669.1| Clp domain protein [Gordonia bronchialis DSM 43247] Length=253 Score = 38.5 bits (88), Expect = 0.33, Method: Compositional matrix adjust. Identities = 18/42 (43%), Positives = 26/42 (62%), Gaps = 0/42 (0%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDA 58 M ++N+YVPD+LAE R+ GL +S + Q A+ L A A Sbjct 1 MPKINIYVPDDLAEEVRSAGLPISRICQQALREALRAPAGGA 42 >gi|271968012|ref|YP_003342208.1| ATPase with chaperone activity ATP-binding subunit-like protein [Streptosporangium roseum DSM 43021] gi|270511187|gb|ACZ89465.1| ATPase with chaperone activity ATP-binding subunit-like protein [Streptosporangium roseum DSM 43021] Length=250 Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust. Identities = 16/31 (52%), Positives = 23/31 (75%), Gaps = 0/31 (0%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAI 47 M ++NVY+PDELAE + G+ VSA+ Q A+ Sbjct 1 MPKINVYLPDELAEAVKEAGVPVSAICQRAL 31 >gi|256376119|ref|YP_003099779.1| Clp domain-containing protein [Actinosynnema mirum DSM 43827] gi|255920422|gb|ACU35933.1| Clp domain protein [Actinosynnema mirum DSM 43827] Length=229 Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust. Identities = 20/39 (52%), Positives = 26/39 (67%), Gaps = 4/39 (10%) Query 17 MARLNVYVPDELAERARARGLNVSALTQAAISAELENSA 55 M ++NVY+PD+LAE RA + VSA+ Q A LE SA Sbjct 1 MPKINVYLPDDLAETVRALNVPVSAICQRA----LEQSA 35 >gi|290474511|ref|YP_003467391.1| hypothetical protein XBJ1_1477 [Xenorhabdus bovienii SS-2004] gi|289173824|emb|CBJ80606.1| conserved hypothetical protein [Xenorhabdus bovienii SS-2004] Length=605 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 25/58 (44%), Positives = 31/58 (54%), Gaps = 5/58 (8%) Query 19 RLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGAR--HDD 74 RL+V+ DE+A+ ARAR L AL A +E T L G+ PR AR HDD Sbjct 131 RLDVHYEDEVADIARARRLAKGALVHLASHSECWQRQT---LNGVIPRKVLARFSHDD 185 >gi|300721163|ref|YP_003710431.1| CcdA protein [Xenorhabdus nematophila ATCC 19061] gi|297627648|emb|CBJ88169.1| CcdA protein [Xenorhabdus nematophila ATCC 19061] Length=92 Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust. Identities = 20/49 (41%), Positives = 30/49 (62%), Gaps = 4/49 (8%) Query 20 LNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWL----EGLE 64 ++V V EL E+A+ GLN SA+ A+ AEL+++A + W EG E Sbjct 25 VSVTVSPELYEQAKQIGLNFSAILTQALIAELKSAAAEQWKRDNREGFE 73 >gi|302555461|ref|ZP_07307803.1| anhydro-N-acetylmuramic acid kinase [Streptomyces viridochromogenes DSM 40736] gi|302473079|gb|EFL36172.1| anhydro-N-acetylmuramic acid kinase [Streptomyces viridochromogenes DSM 40736] Length=384 Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust. Identities = 15/27 (56%), Positives = 18/27 (67%), Gaps = 0/27 (0%) Query 60 LEGLEPRSTGARHDDVLGAIDAARDEF 86 L G EP STGARH VLG++ RD + Sbjct 342 LPGTEPASTGARHPSVLGSVTPGRDGW 368 >gi|284988789|ref|YP_003407343.1| hypothetical protein Gobs_0166 [Geodermatophilus obscurus DSM 43160] gi|284062034|gb|ADB72972.1| hypothetical protein Gobs_0166 [Geodermatophilus obscurus DSM 43160] Length=57 Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust. Identities = 17/30 (57%), Positives = 20/30 (67%), Gaps = 0/30 (0%) Query 56 TDAWLEGLEPRSTGARHDDVLGAIDAARDE 85 TDAWL + S H+DVL A+DAARDE Sbjct 16 TDAWLASHDTDSAHVSHEDVLAALDAARDE 45 Lambda K H 0.318 0.130 0.363 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130095868320 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40