BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv0299

Length=100

                                                                      Score    E
Sequences producing significant alignments:                          (Bits)  Value

gi|15607440|ref|NP_214813.1| hypothetical protein Rv0299 [Mycoba...   196    7e-49
gi|31791478|ref|NP_853971.1| hypothetical protein Mb0307 [Mycoba...   195    2e-48
gi|339293356|gb|AEJ45467.1| hypothetical protein CCDC5079_0277 [...   195    2e-48
gi|108802485|ref|YP_642681.1| hypothetical protein Mmcs_5525 [My...   141    4e-32
gi|296164016|ref|ZP_06846640.1| MazF family toxin-antitoxin syst...   115    2e-24
gi|15839685|ref|NP_334722.1| hypothetical protein MT0313 [Mycoba...   94.7   3e-18
gi|158313702|ref|YP_001506210.1| transcriptional modulator of Ma...   54.7   5e-06
gi|78189625|ref|YP_379963.1| glucose-6-phosphate 1-dehydrogenase...   35.0   3.4
gi|110638615|ref|YP_678824.1| GTP-dependent nucleic acid-binding...   34.7   5.0


>gi|15607440|ref|NP_214813.1| hypothetical protein Rv0299 [Mycobacterium tuberculosis H37Rv]
 gi|121636214|ref|YP_976437.1| hypothetical protein BCG_0339 [Mycobacterium bovis BCG str. Pasteur 1173P2]
 gi|148660065|ref|YP_001281588.1| hypothetical protein MRA_0308 [Mycobacterium tuberculosis H37Ra]
 74 more sequence titles
Length=100

 Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust.
 Identities = 99/100 (99%), Positives = 100/100 (100%), Gaps = 0/100 (0%)

Query  1    LIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60
            +IAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN
Sbjct  1    MIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60

Query  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100
            GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC
Sbjct  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100


>gi|31791478|ref|NP_853971.1| hypothetical protein Mb0307 [Mycobacterium bovis AF2122/97]
 gi|31617064|emb|CAD93171.1| HYPOTHETICAL PROTEIN Mb0307 [Mycobacterium bovis AF2122/97]
Length=100

 Score = 195 bits (495), Expect = 2e-48, Method: Compositional matrix adjust.
 Identities = 98/100 (98%), Positives = 100/100 (100%), Gaps = 0/100 (0%)

Query  1    LIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60
            +IAPGDIAPRRD+EHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN
Sbjct  1    MIAPGDIAPRRDNEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60

Query  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100
            GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC
Sbjct  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100


>gi|339293356|gb|AEJ45467.1| hypothetical protein CCDC5079_0277 [Mycobacterium tuberculosis CCDC5079]
Length=190

 Score = 195 bits (495), Expect = 2e-48, Method: Compositional matrix adjust.
 Identities = 98/99 (99%), Positives = 99/99 (100%), Gaps = 0/99 (0%)

Query  1    LIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60
            +IAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN
Sbjct  1    MIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60

Query  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALL  99
            GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALL
Sbjct  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALL  99


>gi|108802485|ref|YP_642681.1| hypothetical protein Mmcs_5525 [Mycobacterium sp. MCS]
 gi|119855313|ref|YP_935916.1| hypothetical protein Mkms_5927 [Mycobacterium sp. KMS]
 gi|108772904|gb|ABG11625.1| conserved hypothetical protein [Mycobacterium sp. MCS]
 gi|119698030|gb|ABL95101.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=100

 Score = 141 bits (355), Expect = 4e-32, Method: Compositional matrix adjust.
 Identities = 63/100 (63%), Positives = 82/100 (82%), Gaps = 0/100 (0%)

Query  1    LIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60
            +I PGDIAPRRD+ HE YV +LSN++H +ADTGR+++CPF+PGR+P+  +AMVVAVEQP
Sbjct  1    MITPGDIAPRRDTGHEAYVVILSNSIHLSADTGRLVSCPFVPGRIPDAAMAMVVAVEQPE  60

Query  61   GTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100
            G +LPELVQWL  AAL  P+GN G  ALR+ AS++TAL+
Sbjct  61   GVVLPELVQWLPTAALDEPIGNIGAQALRQTASLITALIT  100


>gi|296164016|ref|ZP_06846640.1| MazF family toxin-antitoxin system [Mycobacterium parascrofulaceum ATCC BAA-614]
 gi|295900640|gb|EFG80022.1| MazF family toxin-antitoxin system [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=101

 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
 Identities = 52/84 (62%), Positives = 62/84 (74%), Gaps = 0/84 (0%)

Query  1    LIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPN  60
            +I PGDI P RD+  ELYV VLSN +H AA TG+VI CPFIPG +P   +AM+V V QP
Sbjct  1    MITPGDITPGRDTNQELYVVVLSNTIHLAAATGQVIICPFIPGEIPSSTMAMIVTVLQPK  60

Query  61   GTLLPELVQWLHVAALGAPLGNAG  84
            G +LPEL+QWL VAAL  P+GN G
Sbjct  61   GVVLPELIQWLPVAALDQPIGNIG  84


>gi|15839685|ref|NP_334722.1| hypothetical protein MT0313 [Mycobacterium tuberculosis CDC1551]
 gi|13879807|gb|AAK44536.1| hypothetical protein MT0313 [Mycobacterium tuberculosis CDC1551]
Length=49

 Score = 94.7 bits (234), Expect = 3e-18, Method: Compositional matrix adjust.
 Identities = 49/49 (100%), Positives = 49/49 (100%), Gaps = 0/49 (0%)

Query  52   MVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  100
            MVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC
Sbjct  1    MVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC  49


>gi|158313702|ref|YP_001506210.1| transcriptional modulator of MazE/toxin, MazF [Frankia sp. EAN1pec]
 gi|158109107|gb|ABW11304.1| transcriptional modulator of MazE/toxin, MazF [Frankia sp. EAN1pec]
Length=100

 Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
 Identities = 30/63 (48%), Positives = 39/63 (62%), Gaps = 2/63 (3%)

Query  21   VLSNALHRAADTGRVITCPFIPGR-VPEDLLAMVVAVEQP-NGTLLPELVQWLHVAALGA  78
            ++S  L+  A TGRV+TCP IPG  +P D  A    +  P  GT+LPELV W+ V+ L
Sbjct  18   IVSADLYNRAGTGRVVTCPVIPGEPLPHDDYAADAGITTPIRGTILPELVAWMPVSGLSH  77

Query  79   PLG  81
            PLG
Sbjct  78   PLG  80


>gi|78189625|ref|YP_379963.1| glucose-6-phosphate 1-dehydrogenase [Chlorobium chlorochromatii CaD3]
 gi|78171824|gb|ABB28920.1| glucose-6-phosphate 1-dehydrogenase [Chlorobium chlorochromatii CaD3]
Length=478

 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats.
 Identities = 26/92 (29%), Positives = 39/92 (43%), Gaps = 27/92 (29%)

Query  6    DIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQ-PNG---  61
            ++ P R ++HE +   L+N  +   D  +           PED + +   +EQ PNG
Sbjct  66   ELFPERTAQHEAFQRFLANLHYSVVDLAQ-----------PEDYIKLRQHIEQLPNGGGI  114

Query  62   ------------TLLPELVQWLHVAALGAPLG  81
                        TL P++VQ LH A LG   G
Sbjct  115  SNNLLFYLAIPPTLAPQIVQSLHTAGLGEADG  146


>gi|110638615|ref|YP_678824.1| GTP-dependent nucleic acid-binding protein EngD [Cytophaga hutchinsonii ATCC 33406]
 gi|110281296|gb|ABG59482.1| GTP-binding protein [Cytophaga hutchinsonii ATCC 33406]
Length=365

 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats.
 Identities = 26/92 (29%), Positives = 45/92 (49%), Gaps = 12/92 (13%)

Query  20   AVLSNALHRA-ADTGRVITCPFIPG----RVPEDLLAMVVAVEQPNGTLLPELVQWLHVA  74
            + L NAL  A A+      C   P      VP+D L ++  + +P   +LP +++++ +A
Sbjct  16   STLFNALSNAKAEAANYPFCTIEPNVGVVTVPDDRLGILEGIVKPE-KVLPAIIEFVDIA  74

Query  75   AL------GAPLGNAGVAALREAASVVTALLC  100
             L      G  LGN  +A +RE  ++V  + C
Sbjct  75   GLVKGASKGEGLGNQFLANIREVDAIVHVIRC  106


Lambda      K        H
   0.321    0.137    0.408

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 129239199826


  Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40