BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1429
Length=422
Sequences producing significant alignments: Score (Bits) E Value
gi|15608567|ref|NP_215945.1| hypothetical protein Rv1429 [Mycoba... 840 0.0
gi|289447026|ref|ZP_06436770.1| conserved hypothetical protein [... 839 0.0
gi|340626443|ref|YP_004744895.1| hypothetical protein MCAN_14451... 838 0.0
gi|15840886|ref|NP_335923.1| hypothetical protein MT1472 [Mycoba... 837 0.0
gi|339294407|gb|AEJ46518.1| hypothetical protein CCDC5079_1328 [... 837 0.0
gi|289442874|ref|ZP_06432618.1| conserved hypothetical protein [... 835 0.0
gi|294994990|ref|ZP_06800681.1| hypothetical protein Mtub2_10885... 831 0.0
gi|289749987|ref|ZP_06509365.1| conserved hypothetical protein [... 788 0.0
gi|240171799|ref|ZP_04750458.1| hypothetical protein MkanA1_2096... 567 1e-159
gi|311744357|ref|ZP_07718159.1| conserved hypothetical protein [... 290 4e-76
gi|29827884|ref|NP_822518.1| hypothetical protein SAV_1343 [Stre... 275 9e-72
gi|326332574|ref|ZP_08198842.1| hypothetical protein NBCG_04018 ... 196 4e-48
gi|300787003|ref|YP_003767294.1| hypothetical protein AMED_5127 ... 195 1e-47
gi|183981598|ref|YP_001849889.1| hypothetical protein MMAR_1582 ... 183 4e-44
gi|118618757|ref|YP_907089.1| hypothetical protein MUL_3452 [Myc... 177 3e-42
gi|240170379|ref|ZP_04749038.1| hypothetical protein MkanA1_1379... 171 2e-40
gi|226362670|ref|YP_002780448.1| CdaR family transcriptional reg... 171 2e-40
gi|302530215|ref|ZP_07282557.1| predicted protein [Streptomyces ... 162 7e-38
gi|333922108|ref|YP_004495689.1| hypothetical protein AS9A_4456 ... 158 2e-36
gi|226309064|ref|YP_002769024.1| CdaR family transcriptional reg... 157 2e-36
gi|333920056|ref|YP_004493637.1| hypothetical protein AS9A_2390 ... 157 3e-36
gi|260905674|ref|ZP_05913996.1| hypothetical protein BlinB_10102... 156 7e-36
gi|257056612|ref|YP_003134444.1| regulator of polyketide synthas... 155 2e-35
gi|229491813|ref|ZP_04385634.1| conserved hypothetical protein [... 154 3e-35
gi|326382684|ref|ZP_08204375.1| hypothetical protein SCNU_07090 ... 154 4e-35
gi|302529880|ref|ZP_07282222.1| predicted protein [Streptomyces ... 151 2e-34
gi|111020453|ref|YP_703425.1| hypothetical protein RHA1_ro03464 ... 151 3e-34
gi|343926342|ref|ZP_08765847.1| putative CdaR family transcripti... 148 2e-33
gi|333992637|ref|YP_004525251.1| hypothetical protein JDM601_399... 146 6e-33
gi|145225199|ref|YP_001135877.1| hypothetical protein Mflv_4621 ... 145 1e-32
gi|183981618|ref|YP_001849909.1| hypothetical protein MMAR_1603 ... 145 1e-32
gi|118618745|ref|YP_907077.1| hypothetical protein MUL_3434 [Myc... 142 8e-32
gi|111024979|ref|YP_707399.1| hypothetical protein RHA1_ro08196 ... 142 1e-31
gi|40787287|gb|AAR90204.1| hypothetical protein PDK3.063 [Rhodoc... 142 1e-31
gi|324997138|ref|ZP_08118250.1| hypothetical protein PseP1_00170... 138 1e-30
gi|183981345|ref|YP_001849636.1| hypothetical protein MMAR_1323 ... 137 3e-30
gi|118618038|ref|YP_906370.1| hypothetical protein MUL_2557 [Myc... 132 8e-29
gi|296140551|ref|YP_003647794.1| PucR family transcriptional reg... 130 5e-28
gi|240170688|ref|ZP_04749347.1| hypothetical protein MkanA1_1534... 126 9e-27
gi|54025185|ref|YP_119427.1| hypothetical protein nfa32160 [Noca... 125 2e-26
gi|290955996|ref|YP_003487178.1| hypothetical protein SCAB_14641... 114 2e-23
gi|345013629|ref|YP_004815983.1| putative PucR family transcript... 113 7e-23
gi|240170335|ref|ZP_04748994.1| hypothetical protein MkanA1_1356... 102 2e-19
gi|111017753|ref|YP_700725.1| hypothetical protein RHA1_ro00732 ... 101 3e-19
gi|54022358|ref|YP_116600.1| hypothetical protein nfa3940 [Nocar... 99.8 8e-19
gi|111025737|ref|YP_708157.1| hypothetical protein RHA1_ro08955 ... 95.5 2e-17
gi|226309082|ref|YP_002769042.1| CdaR family transcriptional reg... 95.1 2e-17
gi|333920275|ref|YP_004493856.1| hypothetical protein AS9A_2609 ... 94.7 3e-17
gi|111020067|ref|YP_703039.1| hypothetical protein RHA1_ro03078 ... 94.4 3e-17
gi|296394810|ref|YP_003659694.1| PucR family transcriptional reg... 94.4 3e-17
>gi|15608567|ref|NP_215945.1| hypothetical protein Rv1429 [Mycobacterium tuberculosis H37Rv]
gi|31792623|ref|NP_855116.1| hypothetical protein Mb1464 [Mycobacterium bovis AF2122/97]
gi|121637359|ref|YP_977582.1| hypothetical protein BCG_1490 [Mycobacterium bovis BCG str. Pasteur
1173P2]
67 more sequence titles
Length=422
Score = 840 bits (2170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/422 (99%), Positives = 422/422 (100%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|289447026|ref|ZP_06436770.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289419984|gb|EFD17185.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=422
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/422 (99%), Positives = 421/422 (99%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGP ANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPAANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|340626443|ref|YP_004744895.1| hypothetical protein MCAN_14451 [Mycobacterium canettii CIPT
140010059]
gi|340004633|emb|CCC43777.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=422
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/422 (99%), Positives = 421/422 (99%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLA TPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLAGTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|15840886|ref|NP_335923.1| hypothetical protein MT1472 [Mycobacterium tuberculosis CDC1551]
gi|13881087|gb|AAK45737.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
gi|323720093|gb|EGB29199.1| hypothetical protein TMMG_02131 [Mycobacterium tuberculosis CDC1551A]
Length=422
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/422 (99%), Positives = 421/422 (99%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDI LSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDILLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|339294407|gb|AEJ46518.1| hypothetical protein CCDC5079_1328 [Mycobacterium tuberculosis
CCDC5079]
Length=422
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/422 (99%), Positives = 421/422 (99%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRV DGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVRDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|289442874|ref|ZP_06432618.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289415793|gb|EFD13033.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=422
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/419 (99%), Positives = 419/419 (100%), Gaps = 0/419 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
>gi|294994990|ref|ZP_06800681.1| hypothetical protein Mtub2_10885 [Mycobacterium tuberculosis
210]
Length=422
Score = 831 bits (2146), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/422 (99%), Positives = 419/422 (99%), Gaps = 0/422 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLA TPVD P
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLAITPVDDPG 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
Sbjct 241 EREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
Query 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL
Sbjct 301 LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA
Sbjct 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
Query 421 KQ 422
KQ
Sbjct 421 KQ 422
>gi|289749987|ref|ZP_06509365.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690574|gb|EFD58003.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=423
Score = 788 bits (2035), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/423 (98%), Positives = 414/423 (98%), Gaps = 1/423 (0%)
Query 1 VAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
+AEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV
Sbjct 1 MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAV 60
Query 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA
Sbjct 61 HYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPA 120
Query 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR
Sbjct 121 DRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPR 180
Query 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD
Sbjct 181 AERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTD 240
Query 241 E-REARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKAL 299
E R F PA TR FAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKAL
Sbjct 241 ETRGTGCGFWPATTRGFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKAL 300
Query 300 ALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 359
ALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL
Sbjct 301 ALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 360
Query 360 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR
Sbjct 361 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 420
Query 420 AKQ 422
AKQ
Sbjct 421 AKQ 423
>gi|240171799|ref|ZP_04750458.1| hypothetical protein MkanA1_20965 [Mycobacterium kansasii ATCC
12478]
Length=422
Score = 567 bits (1461), Expect = 1e-159, Method: Compositional matrix adjust.
Identities = 310/418 (75%), Positives = 355/418 (85%), Gaps = 4/418 (0%)
Query 5 GGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLD 64
GG PISVIAR M IRD FI E+FD MK+EI+GLDYD+RM D+W+ASITEN+V AVHYL+
Sbjct 9 GGSPISVIARQMDTIRDQFIVEVFDTMKSEIQGLDYDSRMMDMWQASITENYVAAVHYLE 68
Query 65 RDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVS 124
RD P SL+EAP AALAYARAAAQRD+PL+ LVRAHRLGHARFLEVAM+YVSLLEPA RV
Sbjct 69 RDAPTSLLEAPPAALAYARAAAQRDVPLAPLVRAHRLGHARFLEVAMRYVSLLEPAQRVP 128
Query 125 TIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERA 184
TI ELVNRS+R+VDLVADQ+IVAYE EH+RWLSR GL+QQ VSELLA TPVDV RAE+
Sbjct 129 TITELVNRSSRIVDLVADQMIVAYEEEHERWLSRHGGLRQQSVSELLAGTPVDVQRAEKL 188
Query 185 LGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREA 244
L YRLDG+H+AAVVWVD+AVP GDV+A F+QVRCL+A ELG V SL+VPTDEREA
Sbjct 189 LRYRLDGMHVAAVVWVDAAVPAGDVMAVFEQVRCLVAA----ELGLVGGSLLVPTDEREA 244
Query 245 RLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGG 304
RLWFS RA PSR+RAAFESAGIRARLA G+ DGLRGFRASLKQA+ VKA+ AGG
Sbjct 245 RLWFSVWDDRAGDPSRLRAAFESAGIRARLAYGQAADGLRGFRASLKQAQLVKAVVRAGG 304
Query 305 ARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRS 364
AR RV+ YDDVAP+AL+A D++ LR +V +VLG+LSVD+ERN WLRETLREFL+RNRS
Sbjct 305 ARRSARVVCYDDVAPIALMAADVDALRCYVAEVLGELSVDNERNEWLRETLREFLVRNRS 364
Query 365 YVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRAKQ 422
YV TA+AM+LHRNTIQYRV QAMELC + DDPDA FRVQ+ALE+CRWMAPAVL A +
Sbjct 365 YVTTAEAMLLHRNTIQYRVAQAMELCAGSFDDPDAVFRVQVALEICRWMAPAVLAAPK 422
>gi|311744357|ref|ZP_07718159.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311312323|gb|EFQ82238.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=417
Score = 290 bits (742), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 179/411 (44%), Positives = 247/411 (61%), Gaps = 7/411 (1%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +AR ++ +R+ FI +LFD EI LD+D R+ L ASITEN VTA++YL+R
Sbjct 10 VAEVARRLRPLREPFIRDLFDLTLVEIAELDHDERLRGLLEASITENIVTALNYLERGPE 69
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIE 128
++AP AALAYAR AQR +PLS L+RA+RLGH RFL+ A+ + + ++ +
Sbjct 70 PGDLDAPTAALAYARILAQRGVPLSALIRAYRLGHTRFLDAALAVLPDAVTGEPMAVVPH 129
Query 129 LVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYR 188
LV +SA +DLV D++ A+E E +RW + G+++QWV ++LAD VD+ RA ALG R
Sbjct 130 LVRQSADYLDLVCDRVGRAWEAERERWTASGFGVRRQWVDQVLADRQVDLDRAAEALGLR 189
Query 189 LDGVHIAAVVWVDSAVPIGDVVAQFDQVRC-LLAGELGPELGPVANSLMVPTDEREARLW 247
D +H+A +W V GD V + Q C ++A LG P L+V TD+RE W
Sbjct 190 FDALHLAVELWPTDDVADGD-VDRVVQASCDVVARHLGVRRDP----LVVRTDDREVAAW 244
Query 248 FSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARP 307
F A P + +AG ++ACGR G+ GFR S +QA RVK + A G R
Sbjct 245 FEVADGVHVDPRALATELVAAGSPVQVACGRPEHGVEGFRRSHRQARRVKLVRAASG-RS 303
Query 308 GGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVA 367
V Y +VA V +LA+DL+ R V LG L+ D +R LR TLREF+LR+ S+ A
Sbjct 304 EPAVTTYAEVAAVTVLAEDLDATRALVLRALGSLAEDSDRAQMLRGTLREFVLRHGSFAA 363
Query 368 TADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
TA A LHRN++QYRV QA +LC + DP AF V +ALE RW+ AVL
Sbjct 364 TAAATNLHRNSVQYRVQQAKDLCALDPTDPATAFDVLVALEAARWLGRAVL 414
>gi|29827884|ref|NP_822518.1| hypothetical protein SAV_1343 [Streptomyces avermitilis MA-4680]
gi|29604985|dbj|BAC69053.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=401
Score = 275 bits (704), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 179/406 (45%), Positives = 238/406 (59%), Gaps = 7/406 (1%)
Query 16 MQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAP 75
MQ RD+FI++L +AEI L++D + L ASITEN VT++H + V+AP
Sbjct 1 MQAHRDEFIAKLVATTEAEISQLEHDEPLRGLLEASITENIVTSLHVVINRIDPGTVDAP 60
Query 76 AAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIELVNRSAR 135
A+A++YAR AQRD+PLS L+RA+RLGHA+ L++ + L D T+I LV+ S+
Sbjct 61 ASAVSYARRLAQRDVPLSALLRAYRLGHAQSLDLVLGEAVRLNLPDIAGTLITLVSLSSA 120
Query 136 LVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIA 195
VD V DQ+ YE E +RW+ R L++ WV++LL + VD +AE ALGYRL G H+
Sbjct 121 YVDRVCDQIARVYEEERERWVGTRGVLRRHWVTQLLDNPRVDQRQAEAALGYRLSGSHLG 180
Query 196 AVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRA 255
W+D D A FD++ LL L P L++ TDE R+W + P
Sbjct 181 VEGWLDGTAATTDPTAVFDRLASLLHTVLRAHGRP----LLIHTDEAGVRIWLAVRPDCP 236
Query 256 FAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYD 315
+ A A + R+A G V GL GFR S + A R KALAL+ G R + +
Sbjct 237 VDADTVAAELADAALPVRVALGSVRPGLDGFRRSTRAAARAKALALSAGP-TAPRAVAFA 295
Query 316 DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILH 375
VAPVALL D+ EL FV+D LGDL+VDD RN LRETLR FL NRSY ATAD + +H
Sbjct 296 RVAPVALLVDEPRELADFVSDTLGDLAVDDPRNEVLRETLRVFLATNRSYAATADHLTVH 355
Query 376 RNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRAK 421
RNT+ YRV +A++ +LD AF + AL VCRW VLR K
Sbjct 356 RNTVHYRVQRAVDHYRLDLD--ANAFDLHFALNVCRWHGGKVLRPK 399
>gi|326332574|ref|ZP_08198842.1| hypothetical protein NBCG_04018 [Nocardioidaceae bacterium Broad-1]
gi|325949575|gb|EGD41647.1| hypothetical protein NBCG_04018 [Nocardioidaceae bacterium Broad-1]
Length=414
Score = 196 bits (499), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 138/388 (36%), Positives = 207/388 (54%), Gaps = 14/388 (3%)
Query 38 LDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVR 97
L D + +L RAS+ N T H L + AP AA+ YAR AQR I + LVR
Sbjct 23 LGGDDVILELLRASVESNVETFFHMLQHGIATEEIGAPPAAIEYARRLAQRGISSNALVR 82
Query 98 AHRLGHARFLEVAMQYVSLLEPADRVS-TIIELVNRSA-RLVDLVADQLIVAYEHEHDRW 155
A+R+G +R L++A+ V+ EP V+ +++ R VD VA+Q++ YE E +RW
Sbjct 83 AYRIGQSRVLDLAIAEVTRHEPDREVALAATQILQRGGFAYVDRVAEQVVAEYESELERW 142
Query 156 LSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQ 215
L+ R+ ++ ++ LLA + +++ AE ALGYRL H+ +VW GD A +
Sbjct 143 LANRNTVRASTLASLLAGSDIELGVAENALGYRLRQHHLGLIVWDAD----GDGAACGLR 198
Query 216 VRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFE---SAGIRA 272
+ L E+G ++G V +P D A W P ++A E + A
Sbjct 199 LLESLVAEVGEQVGAVGQPFFMPQDSSHAWAWIPLGRAPRTDPLDLQAIVELVTATADGA 258
Query 273 RLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD--VAPVALLADDLEEL 330
R+A GR + GFR S ++A R +A R G V YDD V ++LA DL+
Sbjct 259 RIAIGRPRPAVAGFRTSHEEAVRAHTVAAIANERAGA-VTVYDDPGVQVASILAHDLDGT 317
Query 331 RRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELC 390
R+ V LG L+VDDE + LRETL FL SY+ATA+ + +H+NT++YRV +A E+
Sbjct 318 RQLVATSLGRLAVDDEPHQRLRETLLAFLGAKSSYLATAEVLHVHKNTVKYRVDKAAEVR 377
Query 391 GQNLDDPDAAFRVQMALEVCRWMAPAVL 418
G+ +D+ +++AL CRW+ AVL
Sbjct 378 GRAIDEDR--LNLELALTACRWLGAAVL 403
>gi|300787003|ref|YP_003767294.1| hypothetical protein AMED_5127 [Amycolatopsis mediterranei U32]
gi|299796517|gb|ADJ46892.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340528495|gb|AEK43700.1| hypothetical protein RAM_26115 [Amycolatopsis mediterranei S699]
Length=421
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 141/382 (37%), Positives = 197/382 (52%), Gaps = 11/382 (2%)
Query 41 DARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHR 100
D RM L AS+ +N TA+ + VE PAAA+ YAR AQR P+ L+RA+
Sbjct 44 DERMVSLLSASVYQNIETALQIFRHGIDPAGVEPPAAAVEYARRLAQRGTPVFDLIRAYD 103
Query 101 LGHARFLEVAMQ-YVSLLEPADRVSTIIELVNRSA-RLVDLVADQLIVAYEHEHDRWLSR 158
LG A L+ Q + L++ A + ++ + R A + V QL+ Y+ E DRWL
Sbjct 104 LGQAAMLDFGFQECIRLVDDAALLGAMMRRLLRVAYEFITRVVRQLVGIYQDERDRWLLN 163
Query 159 RSGLQQQWVSELLADT--PVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQV 216
RS + V++LL P DV AE +GYRL G H+ +VW S + D ++ + V
Sbjct 164 RSAARAAKVADLLDGNGEPPDVDAAEAVIGYRLRGTHVGMIVWHASEAFVDDALSLLESV 223
Query 217 RCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFESAGIRARLAC 276
AG + + L VP DE A +W AP + AA A R+
Sbjct 224 ----AGAVFERVRGQGRPLFVPRDEASAWVWLPLAPGATIRRDHLDAALAEAEAGVRVTV 279
Query 277 GRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTD 336
G G G+ GFR + +QA RV ALALA G RV+ + +V VAL+ DL R +V
Sbjct 280 GDPGTGVAGFRDTHQQARRVHALALAAGEHCD-RVLTFREVGTVALMTSDLNAARLWVAS 338
Query 337 VLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDD 396
LG L+ DDE LRETLR FL SY A A +++H+N++QYRV +A EL + L +
Sbjct 339 TLGPLAADDENGGRLRETLRVFLTTGGSYTAAAAELMMHKNSVQYRVRKAQELLPRGLGE 398
Query 397 PDAAFRVQMALEVCRWMAPAVL 418
V++AL +CR + AVL
Sbjct 399 DR--LDVELALALCRRLGSAVL 418
>gi|183981598|ref|YP_001849889.1| hypothetical protein MMAR_1582 [Mycobacterium marinum M]
gi|183174924|gb|ACC40034.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=425
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 133/398 (34%), Positives = 202/398 (51%), Gaps = 13/398 (3%)
Query 24 ISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYAR 83
I ++ E+RG DA++ DL AS+ N ++ + D P VE+P AAL YAR
Sbjct 34 IQGTLEREIVELRG---DAQLLDLLHASVEGNVAAVLNAIHYDIPIERVESPTAALEYAR 90
Query 84 AAAQRDIPLSGLVRAHRLGHARFLEVAMQYV--SLLEPADRVSTIIELVNRSARLVDLVA 141
AQR +P++ LVRA+RLGH LE + V + +PA + + + +D ++
Sbjct 91 RLAQRGVPVNALVRAYRLGHKEMLERIIDGVQEAGADPALSLDVFNRISEVTFNYIDWIS 150
Query 142 DQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVD 201
Q++ AYE E DRWL RS ++ ++E+L +D R++ Y L VH+A V+W
Sbjct 151 QQVVAAYEAERDRWLENRSRVRNVRIAEILDGGDIDTDAMTRSIRYPLRKVHLALVLWFP 210
Query 202 SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRI 261
G+ +F+++ L EL LG N+L V D W A R+
Sbjct 211 DDATDGN---EFERLERFL-DELAEHLG-TGNALFVAADRVTGWGWIPLRVNDAGLAERV 265
Query 262 RAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD-VAPV 320
R +A G+ G+ GFR + +QA + +A GA V D+ ++
Sbjct 266 RRFVAGHTDAPHVALGQALPGVEGFRRAHRQARNAHRVGVAVGASAPAVVAVSDEGLSAA 325
Query 321 ALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQ 380
AL+ DL E +V + LG LS D + ++ LRETLR FL SY A A+ + LH N+++
Sbjct 326 ALMGADLPEAGAWVRETLGPLSTDSDNDAVLRETLRVFLREGGSYKAAAERLHLHYNSVK 385
Query 381 YRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
YRV +A+E G+ +DD V+MAL VC+W VL
Sbjct 386 YRVARAVERRGRPIDDE--RLDVEMALLVCQWFGAVVL 421
>gi|118618757|ref|YP_907089.1| hypothetical protein MUL_3452 [Mycobacterium ulcerans Agy99]
gi|118570867|gb|ABL05618.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=425
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 128/397 (33%), Positives = 200/397 (51%), Gaps = 13/397 (3%)
Query 24 ISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYAR 83
I ++ E+RG D ++ DL AS+ N ++ + D P VE+P AAL YAR
Sbjct 34 IQGTLEREIVELRG---DVQLLDLLHASVEGNVAAVLNAIHYDIPIERVESPTAALEYAR 90
Query 84 AAAQRDIPLSGLVRAHRLGHARFLEVAMQYV--SLLEPADRVSTIIELVNRSARLVDLVA 141
AQR +P++ LVRA+RLGH LE + V + +PA + + + +D ++
Sbjct 91 RLAQRGVPVNALVRAYRLGHKEMLERIIDGVQEAGADPALSLDVFNRISEVTFNYIDWIS 150
Query 142 DQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVD 201
Q++ AYE E DRWL RS ++ ++E+L +D +++ Y L VH+A V+W
Sbjct 151 QQVVAAYEAERDRWLENRSRVRNVRIAEILDGGDIDTDAMTKSIRYPLRKVHLALVLWFP 210
Query 202 SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRI 261
G+ +F+++ L EL LG N+L V D W A R+
Sbjct 211 DDATDGN---EFERLERFL-DELAEHLG-TGNALFVAADRITGWGWIPLRANDAGLTGRV 265
Query 262 RAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD-VAPV 320
R +A G+ G+ GFR + +QA + +A GA + D+ ++
Sbjct 266 RRFVAGHTDAPHVALGQALPGVEGFRRAHRQARNAHRVGVAVGASAPAVMAVSDEGLSAA 325
Query 321 ALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQ 380
AL+ DL + +V + LG LS D + ++ LRETLR FL SY A A+ + LH N+++
Sbjct 326 ALMGADLPDAGAWVRETLGPLSTDSDNDAVLRETLRVFLREGGSYKAAAERLHLHYNSVK 385
Query 381 YRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAV 417
YRV +A+E G+ +DD V+MAL VC+W V
Sbjct 386 YRVARAVERRGRPIDDE--RLDVEMALLVCQWFGAVV 420
>gi|240170379|ref|ZP_04749038.1| hypothetical protein MkanA1_13795 [Mycobacterium kansasii ATCC
12478]
Length=423
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 138/417 (34%), Positives = 210/417 (51%), Gaps = 10/417 (2%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +A +Q D S++ ++ +I L +AR+ DL ASI N T +H L D
Sbjct 13 VAEVAGRLQRRLADVTSQIHRALERQIPDLRREARIMDLLGASIEGNVDTMLHALQYDIA 72
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVST-II 127
VEAP AAL YAR AQ +P++ LVRA+RLG R E+ + E D + +I
Sbjct 73 VEHVEAPTAALEYARRLAQHGVPVNALVRAYRLGQRRMNELIFAELRATEIPDSMRVAVI 132
Query 128 ELVNRSA-RLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTP-VDVPRAERAL 185
E +N + +D ++ Q++ YE E +RWL ++ L+ V ELLA T +DV A A+
Sbjct 133 EAMNAAIFEYIDWMSQQVVAVYEDERERWLENQNALRGVRVRELLAATKSIDVDAATTAI 192
Query 186 GYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
Y L H+ ++W G V + +++ L G LG +G A+ L V D
Sbjct 193 RYPLRWHHVGLIMWSGDQ---GFDVDELPRLQRFLRG-LGESVGADASPLFVAADRSSGW 248
Query 246 LWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGA 305
W A A S++R S +A G + G+ GFR + ++A +A+AG
Sbjct 249 AWLPFRAAVADAVSKVRQFALSRPDSPNVAIGNMAGGVEGFRRTHREASEAHGVAIAGDR 308
Query 306 RPGGRVMFYD-DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRS 364
R V D ++ VA L DL R +V VLG+L+ D++ + LRETLR FL S
Sbjct 309 RAATVVAAGDPGLSVVARLGGDLAGTRDWVARVLGNLARDNDNDERLRETLRVFLGCGAS 368
Query 365 YVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRAK 421
Y A + +H NT++YRV +A+ G+++ V++AL C W AVL+ K
Sbjct 369 YKMAAAELNMHFNTVKYRVGRAVARRGRDIGGDR--LDVELALLACHWYGAAVLQPK 423
>gi|226362670|ref|YP_002780448.1| CdaR family transcriptional regulator [Rhodococcus opacus B4]
gi|226241155|dbj|BAH51503.1| putative CdaR family transcriptional regulator [Rhodococcus opacus
B4]
Length=420
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 142/403 (36%), Positives = 211/403 (53%), Gaps = 18/403 (4%)
Query 23 FISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYA 82
S++ + + AEI L D ++ DL RAS+ N T H L D +++APAAA+ YA
Sbjct 26 LTSDITEVLYAEIPDLRADRQLFDLLRASVEGNLDTIFHTLQHDIDPDILDAPAAAMEYA 85
Query 83 RAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADR--VSTIIELVNRSARLVDLV 140
R AQ +P++ LVRA+RLG L + + + +D V + ++ + +D V
Sbjct 86 RRLAQHGVPVNALVRAYRLGQTNLLGLVFEELRGAAVSDEFGVPVLQRIITVMSVYIDRV 145
Query 141 ADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWV 200
+ Q++ YE E +RWL+ +SG++ V E+LA D P A L Y L H+AA++WV
Sbjct 146 SQQVVEVYERERERWLAHQSGVRAVRVQEILAGN--DDPDAAAILNYVLLQRHLAAIIWV 203
Query 201 DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSR 260
GD++A+ + +AG +G V + L V D +W P R AP
Sbjct 204 PD-TDSGDMLARIEAAAHDVAG----FVGGVTDPLFVAADRVTGWVWV-PLGARTRAPRS 257
Query 261 IRAAFESAGIRA---RLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD- 316
R E R +A G V G GFR S A+R +ALALA G R V Y +
Sbjct 258 YRELGEFLTGRYPGLAMAVGEVAHGPNGFRRSHGGAQRARALALAAG-RNAPVVTSYAEP 316
Query 317 -VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILH 375
V+ V+LL DDL R +V VLG L+ D++ + LRET++ +L N S V A + LH
Sbjct 317 GVSTVSLLIDDLTATRAWVRTVLGGLATDNDNAARLRETVQVYLANNLSNVTAAKELGLH 376
Query 376 RNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
N+++YRV +A E G++ A V++AL C+W+ PAVL
Sbjct 377 YNSVKYRVKRAAEERGEDFTRDRLA--VELALLACQWLGPAVL 417
>gi|302530215|ref|ZP_07282557.1| predicted protein [Streptomyces sp. AA4]
gi|302439110|gb|EFL10926.1| predicted protein [Streptomyces sp. AA4]
Length=456
Score = 162 bits (411), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 131/421 (32%), Positives = 209/421 (50%), Gaps = 24/421 (5%)
Query 10 SVIARHMQLIRDDFISEL-------FDKMKAEIRGLDYDARMADLWRASITENFVTAVHY 62
S A + + D ++EL +++ AE+ L DA+ A+L +++ EN V A+
Sbjct 46 STAAATLAKVADSVLAELALLRARIVEEVTAELPALAPDAQAAELLDSTVRENLVAALGV 105
Query 63 LDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADR 122
L T V AP AL +AR AQR +P++ ++RA+RLG A F + + ++ EP +
Sbjct 106 LGGATEPVDVGAPPVALEFARRLAQRRVPITAMLRAYRLGQAAFQQEMIARIAA-EPVEA 164
Query 123 VSTI---IELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVP 179
EL + +D ++++++ AY+ E D WL R+ + V L+ PVD
Sbjct 165 ADVAVAATELSQVAFTYIDKISEEVVEAYQLERDTWLRHRNAARLAKVQAALSGKPVDTA 224
Query 180 RAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPT 239
E+ LGY L H+ AV+W + D + ++ LLA L ++ P L+V
Sbjct 225 DVEKTLGYALSERHVGAVLWCGPELDESDRLTTLERHAALLANAL--DVAP----LVVAP 278
Query 240 DEREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKAL 299
WF P A + AA + R+A G GL GFR + +QA + +A+
Sbjct 279 HASTVWAWF---PVSALDLDAVSAALAGSPDPVRVALGDPASGLAGFRTTHQQARQAEAV 335
Query 300 A-LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREF 358
A + RP V + P+AL+A D + +V VLG L+ DDE + +RET+ +
Sbjct 336 AQMTERTRPRP-VTAAAQLGPLALVAADPSAVAGWVQSVLGALADDDEGHHRMRETVWAY 394
Query 359 LLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
L S + A + LH+NTIQYR+ +A + G+ L + V++AL CR + PAVL
Sbjct 395 LSSGSSLMVAAQELHLHKNTIQYRLRKAEQERGRPLS--EGRIDVEVALLACRLLGPAVL 452
Query 419 R 419
R
Sbjct 453 R 453
>gi|333922108|ref|YP_004495689.1| hypothetical protein AS9A_4456 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484329|gb|AEF42889.1| hypothetical protein AS9A_4456 [Amycolicicoccus subflavus DQS3-9A1]
Length=478
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 140/403 (35%), Positives = 199/403 (50%), Gaps = 28/403 (6%)
Query 31 MKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDI 90
+ A+I L DA++ DL R S+ N T ++ P S VE P AAL YAR AQR +
Sbjct 88 LVAQITELRGDAQLLDLLRDSVEGNIETIFSAIEHAIPISQVEPPTAALEYARRLAQRGV 147
Query 91 PLSGLVRAHRLGHARFLEVAMQYVSLLE-PADRVSTIIELVNRSA-RLVDLVADQLIVAY 148
+ LVRA+RLG L V + + + PA + E V+ + +D +++Q+I Y
Sbjct 148 SANALVRAYRLGQQELLRVLLDDIRSADLPAQNKLDVFEQVSSTTFGYIDWISEQVIAVY 207
Query 149 EHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVW---VDSAVP 205
++E ++WL R+ ++ V E+L VD AL Y L H+A VVW VDSA
Sbjct 208 QNEREQWLEDRNRVRALQVREVLTADAVDEDTMTTALRYPLRRFHLALVVWRPGVDSAAD 267
Query 206 IGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAF 265
+G + +F VR +L LG N L + D W P A + + +
Sbjct 268 LGH-MERF--VR-----DLAEHLGASHNPLFIAEDRLTGWAWI---PLAAKSAAAEAVSA 316
Query 266 ESAGIRAR-----LACGRVGDGLRGFRASLKQAERVKALALA----GGARPGGRVMFYD- 315
A R + LA G G GFR S QA +A+ALA G A P G V D
Sbjct 317 ARAFTRGQPDPPSLALGEALPGFAGFRRSHHQALDARAVALASNTEGAADPHGVVAISDL 376
Query 316 DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILH 375
++ ALL D+ R +V +VLG LS D E + LR TL+ FL SY A A + LH
Sbjct 377 GLSAAALLGGDVNAARVWVYEVLGPLSSDTENDERLRNTLQVFLRSGSSYKAAAGQLNLH 436
Query 376 RNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
N+++YRV +A+E G ++ V++AL +CRW AVL
Sbjct 437 YNSVKYRVARAIERRGLPIEADR--LEVEIALLLCRWYRGAVL 477
>gi|226309064|ref|YP_002769024.1| CdaR family transcriptional regulator [Rhodococcus erythropolis
PR4]
gi|226188181|dbj|BAH36285.1| putative CdaR family transcriptional regulator [Rhodococcus erythropolis
PR4]
Length=429
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/424 (32%), Positives = 203/424 (48%), Gaps = 31/424 (7%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +AR + D E+ M EI LD DA++ ++ AS+ N T +H L D P
Sbjct 19 VAELARRLDERSADIAREMAFMMAHEIDQLDADAKLLEMLEASVQGNITTIIHVLANDIP 78
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIE 128
++ AA+ YA AQRD+P + LVRA+ +G + + +S L+ R+ T+
Sbjct 79 IDHLQPTTAAVEYALRLAQRDVPSNSLVRAYHMGQGDLMRICHDEISTLDIPARL-TLAV 137
Query 129 LVNRS---ARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERAL 185
L + S +D + + AYE E RW++ + L + LL+ T D E
Sbjct 138 LKHTSDVVYSYIDWITLYVFDAYERERSRWMAAQGNLHSAAIHALLSGTNADTAAFEAET 197
Query 186 GYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
GYRL H+A VVW + DV+ R L +LG G ++ D R
Sbjct 198 GYRLGQNHVALVVW---STWDADVMGINTLDR--LVRDLGAAAGADKPPIITAIDRRTVW 252
Query 246 LWFS-----PAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
W PAP P + AA G AR+A G G G+ GF+ S +QA ++A
Sbjct 253 AWLPFGRRVPAPD----PEVLAAAVPLDG-GARVAIGLPGSGVEGFKRSHEQATAAYSVA 307
Query 301 LAGGARPGGRVMFYD-DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 359
V F D VA V+LL+++++ + +V +VLG L+ D + LR TL +
Sbjct 308 AVPDTPQRPVVSFGDRGVAVVSLLSENIDSTKSWVWEVLGPLADDTPSAASLRTTLSVYF 367
Query 360 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFR----VQMALEVCRWMAP 415
S++ TA+ M LHRNT++YR+ +A L DP AA R + +AL+VC +
Sbjct 368 AHGESHLHTANHMNLHRNTVKYRINKA-------LGDPVAAGRDKLDLALALQVCELLGR 420
Query 416 AVLR 419
+VLR
Sbjct 421 SVLR 424
>gi|333920056|ref|YP_004493637.1| hypothetical protein AS9A_2390 [Amycolicicoccus subflavus DQS3-9A1]
gi|333482277|gb|AEF40837.1| hypothetical protein AS9A_2390 [Amycolicicoccus subflavus DQS3-9A1]
Length=431
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/401 (32%), Positives = 197/401 (50%), Gaps = 30/401 (7%)
Query 29 DKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQR 88
D + EI L DA++ L ++ N T + D +E P AL YAR AQR
Sbjct 39 DTLMREISELRDDAQLQQLLHDTVAANIDTVFAAIRNDISLDHIEPPTTALEYARRLAQR 98
Query 89 DIPLSGLVRAHRLGHARFLEVAMQYVS--------LLEPADRVSTIIELVNRSARLVDLV 140
D+ L+RA+R+GH L V M+ + +L+ +R++ + + R +D +
Sbjct 99 DVSADALIRAYRIGHQTVLNVVMEEIRGCDFDSPLILDVIERITAL------TFRYIDWI 152
Query 141 ADQLIVAYEHEHDRWLSRRSGLQQQWVSELLA----DTPVDVPRAERALGYRLDGVHIAA 196
+ QLI Y+ E DRW + R+ L+ V ELLA DT D+ A+ Y L VH+A
Sbjct 153 SQQLIRVYQDERDRWQASRNSLRSSRVRELLAGGDTDTDADLDVLTSAISYPLRRVHLAI 212
Query 197 VVWVDSAVPIGDVVAQFDQVRCLL-AGELGPELGPVANSLMVPTDEREARLWFS-PAPTR 254
VVW G++ A V L +G GPE + L V D +W A R
Sbjct 213 VVWCHDQQGGGELTAIERFVHKLHESGVAGPE-----SPLFVAADRLTGWVWVPVTAAAR 267
Query 255 AFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFY 314
A +R A ++ +A G+ GL GFR S +QA+ + +A A G + RV
Sbjct 268 ATVQQAVRDAVDATPQAPFVAIGQPLHGLDGFRRSHQQAQLARTVAHATGQQEQ-RVTDA 326
Query 315 DDVAPV--ALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAM 372
+ + LLA++ + R +V +VLG L+ E + LRETLR FL + Y + A+ +
Sbjct 327 SECGVLLSGLLAENADATRSWVGEVLGPLASPTESDERLRETLRAFLRADSGYKSAAEEL 386
Query 373 ILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWM 413
+H NT++YRV +A+E G+ + D V++AL +C W+
Sbjct 387 HMHPNTVRYRVRRALERRGREISDD--RLDVEVALLLCHWL 425
>gi|260905674|ref|ZP_05913996.1| hypothetical protein BlinB_10102 [Brevibacterium linens BL2]
Length=435
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 125/411 (31%), Positives = 196/411 (48%), Gaps = 34/411 (8%)
Query 5 GGGPISVIARHM-QLIR---------DDFISELFDKMKAEIRGLDYDARMADLWRASITE 54
G GP V M QL R + +++ + I L + + DL ASI
Sbjct 12 GPGPTHVTPEVMAQLSRISAELSPRVPELTRSVYEYLATRIAELGEEPTLLDLLSASIAG 71
Query 55 NFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYV 114
N T H L +E P AA YAR AQR I ++ LVRA+RLG R L+ A ++
Sbjct 72 NIETIFHALQHGIAPDNLEPPVAAYEYARRLAQRGISVNALVRAYRLGQQRLLQSAYDFI 131
Query 115 SLLE--PADRVSTIIE-LVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELL 171
+ + P D + + LV+ + +D ++ ++ + YE E + WL+ R+ ++ V +++
Sbjct 132 TANDELPVDLAPAVFQRLVDEVSEYIDWMSQKVALLYEQEREAWLANRTTARESQVRDII 191
Query 172 ADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPV 231
VD A LGY L H+A V W + P + V + + G +G
Sbjct 192 KGGDVDAAAAAATLGYSLTARHVAVVAWTHHSAP--ETVDDLGRFTSAI-NAAGAAMGSP 248
Query 232 ANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFESAGIR------ARLACGRVGDGLRG 285
+ L++ D+ A W + + + ES G R LA G G G
Sbjct 249 RSRLIISRDQDTAWGWITVPESWEYE--------ESLGDRLDATDAVHLAFGSAHTGAEG 300
Query 286 FRASLKQAERVKALALAGGARPGGRVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSV 343
FR S ++A RV+ + +AG R + +DD +A V+L++ D+E + +V VLG L+
Sbjct 301 FRLSHQEAMRVQNVCVAG--RSPAALRSHDDPGMALVSLMSTDVEAAQDWVRSVLGPLAE 358
Query 344 DDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNL 394
+ E N+ R TL FL + SY ATA AM +H+N+I+YRV A+ L G +L
Sbjct 359 NSEANARHRSTLLTFLRHDLSYTATATAMTMHKNSIRYRVEIAVSLLGTDL 409
>gi|257056612|ref|YP_003134444.1| regulator of polyketide synthase expression [Saccharomonospora
viridis DSM 43017]
gi|256586484|gb|ACU97617.1| regulator of polyketide synthase expression [Saccharomonospora
viridis DSM 43017]
Length=441
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 129/403 (33%), Positives = 199/403 (50%), Gaps = 17/403 (4%)
Query 25 SELFDKMKAEIRGLDYD---ARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAY 81
+EL D M A I G D + ++ AS+ N T +H L D P ++ AA Y
Sbjct 36 TELNDSMNAAIEGAIADLANPELTEMLHASVEGNITTILHMLRNDIPIEHLQPATAATEY 95
Query 82 ARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE-PAD-RVSTIIELVNRSARLVDL 139
A A+ + + L RA+ +G L + LL+ P++ ++ + L VD
Sbjct 96 AIRLARAGVSAAPLRRAYHIGSDDLLAEIFHEIQLLDCPSELKLRLLHHLAGWLHHYVDW 155
Query 140 VADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVW 199
+ ++ A+E E L + + + V +L PV+ + R GYRL+ H+A V+W
Sbjct 156 ITRVVLDAHEAERQTLLKQHATDIFRLVHRVLDREPVEYDQFARTAGYRLNHPHVATVLW 215
Query 200 VDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFS-PAPTRAFAP 258
+ + D Q + +R L A L LG ++ L +P D A +W P T
Sbjct 216 DERTLQAAD---QIEVLRSL-ATRLARVLGSSSDPLFMPVDRSTAWVWCHIPDATSPIDT 271
Query 259 SRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD-- 316
+R+ A R A G G+ GFR +L+QA V+ +A A GA P +VM Y D
Sbjct 272 ARVHEVLADAPA-VRAAMGTPVFGIEGFRRTLEQANAVRTVASASGA-PHAKVMSYGDDG 329
Query 317 VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHR 376
+A VA+L DL RR+V D LG L++D E + LRETL F LR SYV T+ ++LHR
Sbjct 330 MAVVAMLVRDLPASRRWVADTLGALALDTEPAARLRETLLTF-LRTGSYVETSKKLMLHR 388
Query 377 NTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
NT++YR+ +A G+ L + +++A +C + PAVL+
Sbjct 389 NTVKYRLTKAKRERGRPLT--EGRLDLELAPHLCHVLGPAVLQ 429
>gi|229491813|ref|ZP_04385634.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229321494|gb|EEN87294.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=415
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 131/424 (31%), Positives = 201/424 (48%), Gaps = 31/424 (7%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +AR + D E+ M EI LD DA++ ++ AS+ N T +H L D P
Sbjct 5 VAELARRLDERSADIAREMAFMMAHEIDQLDADAKLLEMLEASVQGNITTIIHVLANDIP 64
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIE 128
++ AA+ YA AQRD+P + LVRA+ +G + + +S L+ R+ T+
Sbjct 65 IDHLQPTTAAVEYALRLAQRDVPSNSLVRAYHMGQGDLMRICHDEISTLDIPARL-TLAV 123
Query 129 LVNRS---ARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERAL 185
L + S +D + + AYE E RW++ + L + LL+ D E
Sbjct 124 LKHTSDVVYSYIDWITLYVFDAYERERSRWMAAQGNLHSAAIHALLSGANADTAAFEAET 183
Query 186 GYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
GYRL H+A VVW + DV+ R L +LG G ++ D R
Sbjct 184 GYRLGQNHVALVVW---STWDADVMGINTLDR--LVRDLGAAAGADKPPIITAIDRRTVW 238
Query 246 LWFS-----PAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA 300
W PAP P + A G AR+A G G G+ GF+ S +QA ++A
Sbjct 239 AWLPFGRRVPAPD----PEVLATAVPLDG-GARVAIGLPGSGVDGFKRSHEQATAAYSVA 293
Query 301 LAGGARPGGRVMFYD-DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 359
V F D VA V+LL+++++ + +V +VLG L+ D + LR TL +
Sbjct 294 AVPDTPQRPVVSFGDRGVAVVSLLSENIDSTKSWVWEVLGPLAEDTPSAASLRTTLSVYF 353
Query 360 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFR----VQMALEVCRWMAP 415
S++ TA+ M LHRNT++YR+ +A L DP AA R + +AL+VC +
Sbjct 354 AHGESHLHTANHMNLHRNTVKYRINKA-------LGDPVAAGRDKLDLALALQVCELLGR 406
Query 416 AVLR 419
+VLR
Sbjct 407 SVLR 410
>gi|326382684|ref|ZP_08204375.1| hypothetical protein SCNU_07090 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198803|gb|EGD55986.1| hypothetical protein SCNU_07090 [Gordonia neofelifaecis NRRL
B-59395]
Length=417
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 134/415 (33%), Positives = 195/415 (47%), Gaps = 25/415 (6%)
Query 19 IRDDFIS---ELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAP 75
+R+D + + D + I LD D + ++ AS+ N V L D P S ++ P
Sbjct 16 MREDLLGLSGGITDTLAGGIDQLDDDPVLVEMLGASVHGNVTNIVDMLAGDIPLSNLQPP 75
Query 76 AAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE-PADRVSTIIELVNRSA 134
AA+ YA AQR+IP + LVRA+ +G L+V + ++ + PA + V+ S
Sbjct 76 TAAVEYALRLAQREIPSNALVRAYHMGQNYSLQVIYRIITEMNLPAQEALDLTAAVSESI 135
Query 135 -RLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVH 193
R +D + + AYE+E RW V LLA ER YRLD VH
Sbjct 136 YRYIDWITGYVFEAYENERRRWAGVNGTALTTTVHNLLASPESSADEFERDTAYRLDRVH 195
Query 194 IAAVVWVDSAVPIGDVVAQFDQV--RCLLAGELGPELGPVANSLMVPTDEREARLWF--- 248
+AV+W+D G +A+ D++ R +A GP L+ P D +W
Sbjct 196 QSAVLWIDDRS--GVELAELDRIARRAAVASR---SDGP---PLVTPVDRSTVWVWVPYP 247
Query 249 SPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPG 308
P P A S F+ R A G+ G GFR S +QA +A G+ G
Sbjct 248 GPRPRARSAESSSGLVFDRLPTGVRAAVGQRCSGAAGFRRSHEQALAALRVASVPGSPTG 307
Query 309 GRVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYV 366
R+ YDD +A ALL D++ +V +VLGDL+ D + LRETL FL S+V
Sbjct 308 ARI-GYDDPGIAVSALLGQDVDTTAAWVREVLGDLASDSPATAPLRETLAVFLATADSHV 366
Query 367 ATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRAK 421
TA + LHRNT++YRV +A+ + G D D A ++L C + VL A+
Sbjct 367 RTAARLNLHRNTVKYRVDKALSMIGAERDRLDLA----VSLTACELLGSLVLAAR 417
>gi|302529880|ref|ZP_07282222.1| predicted protein [Streptomyces sp. AA4]
gi|302438775|gb|EFL10591.1| predicted protein [Streptomyces sp. AA4]
Length=414
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/378 (33%), Positives = 182/378 (49%), Gaps = 19/378 (5%)
Query 20 RDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAAL 79
R +++ F +M E R +D + L AS + N V + L P + P AA
Sbjct 26 RTTELTDWFVEMIPEFR---HDETVRKLMIASTSANLVAILDMLSHSIPLDRITVPPAAA 82
Query 80 AYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADR---VSTIIELVNRSARL 136
YAR AQ ++ L L+RA+RLG RF + A++ + D + + EL R+ R
Sbjct 83 EYARRFAQHELSLEALLRAYRLGEHRFEQWAIEALERQPNIDTRLALGVLAELSRRTNRY 142
Query 137 VDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAA 196
+D V + LI +E E RW SR + + +L + A++ L + + H AA
Sbjct 143 IDQVIEGLIDIFETERRRWSSRTGAARAAQIRLVLDSDTLTEDAAQQLLAFPMRQWHRAA 202
Query 197 VVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAF 256
V W+ A G++ A A L E+ +L + D+R LW
Sbjct 203 VAWL-PAEGAGELQA---------AARLLQEVCGRGPALTMLADDRT--LWSWTVSADRA 250
Query 257 APSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD 316
R+R+ G RLA G G GL GFR SL++A R +A+A V+ +DD
Sbjct 251 DVERLRSGVTDIGGGLRLALGAPGYGLSGFRGSLREAVRARAVA-ETNEDQSQHVVLFDD 309
Query 317 VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHR 376
VA ALL + +++ R++ VLGDL DD LRETLR FL + SY A + LH+
Sbjct 310 VAIAALLTEQSDDVYRWIARVLGDLVADDPGTEQLRETLRVFLDTDGSYTHAAARLHLHK 369
Query 377 NTIQYRVIQAMELCGQNL 394
NT+ YRV +A ELCG+ L
Sbjct 370 NTVHYRVRKAEELCGRPL 387
>gi|111020453|ref|YP_703425.1| hypothetical protein RHA1_ro03464 [Rhodococcus jostii RHA1]
gi|110819983|gb|ABG95267.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=421
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/415 (28%), Positives = 194/415 (47%), Gaps = 13/415 (3%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +AR + + + + + EI LD D ++ +L AS+ N T VH L D P
Sbjct 16 VAGVARKLDARQAEITRTMSALLAHEIDQLDEDPQLVELLEASVNGNVSTIVHVLANDIP 75
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTII- 127
++ AA+ YA AQRD+ + LVRA+ +G +++ VS L+ + ++ +
Sbjct 76 VDHLQPTTAAVEYALRLAQRDVSSNSLVRAYHMGQDDLIKICYDEVSALQLSGPLTLAVL 135
Query 128 -ELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALG 186
+ +D + + AYE E RWL R + + LL T D E
Sbjct 136 KHISEVVYSYIDWITLYVFDAYEQERRRWLGARGNVHSSTIHTLLTGTGNDGSAFEAETH 195
Query 187 YRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARL 246
YRL+ H+A ++W + + A VR LA L + P+ ++ D R
Sbjct 196 YRLEQTHVAMILWSTGSDTEASLNALDHYVRD-LAHHLATDSAPIVTAI----DRRTLWA 250
Query 247 WFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGAR 306
W + + AA A R A G G+ GFR S +QA ++A
Sbjct 251 WLPFGRRKPILDTTELAAAMPANPGIRTAIGLPASGIAGFRRSHEQAHAAYSVATVPHT- 309
Query 307 PGGRVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRS 364
P ++ + D VA V+LLA++L+ R +V +VLG L+ + ++ + LR TL + S
Sbjct 310 PARPIVGFGDRGVAVVSLLAENLDSTRAWVWEVLGPLAENTDQAATLRTTLSTYFATGES 369
Query 365 YVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
++ TA M LHRNT++YR+ +A+ G + + +AL+VC ++ P VL+
Sbjct 370 HLHTAQQMNLHRNTVKYRITKAL---GDPATGTHSKLDLALALQVCEFLGPTVLK 421
>gi|343926342|ref|ZP_08765847.1| putative CdaR family transcriptional regulator [Gordonia alkanivorans
NBRC 16433]
gi|343763580|dbj|GAA12773.1| putative CdaR family transcriptional regulator [Gordonia alkanivorans
NBRC 16433]
Length=428
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/414 (30%), Positives = 194/414 (47%), Gaps = 21/414 (5%)
Query 10 SVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQ 69
+++A+ ++ + ++EL M I LD D ++ +L AS+ N T +H L D P
Sbjct 29 ALVAKMLREGESELVAELSSMMTRGIDQLDTDPKLIELLAASVHGNVSTIIHVLANDIPI 88
Query 70 SLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVST--II 127
++ AA+ YA AQRD+P + LVRA+ +G + + V L+ S I
Sbjct 89 EHLQPATAAVEYALRLAQRDVPSNSLVRAYHMGQNSMMHRCYRLVEELDLDAEASMALIR 148
Query 128 ELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGY 187
+ + +D + + AYE E RWL +Q + L D E GY
Sbjct 149 HISDVVFGYIDWITLYVFEAYEDERRRWLGVEGNVQSAAIHTFLDSLDADDRDFESETGY 208
Query 188 RLDGVHIAAVVWV--DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
RL+ H+A +VW D +G + A +L ++G + ++ D
Sbjct 209 RLERRHLALIVWSADDDPRELGALTRA--------ARDLAVQIGGGGDPIVTAIDRSTVW 260
Query 246 LWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGA 305
W P R S +A R+A G G RGFR + +QA ++A G+
Sbjct 261 AWI-PLAVRGDDDSVAQATVPPG---VRVAWGLPASGARGFRRTHEQARAAYSVATTPGS 316
Query 306 RPGGRVMFYD-DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRS 364
G V F D VA V+LLA +L+ R +V +VLG L+ D + LRETL + S
Sbjct 317 SAGQVVGFGDRGVAVVSLLARELDSTRAWVHEVLGGLAEDTPNAAMLRETLSVYFATKES 376
Query 365 YVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
++ TA+ + LHRNT++YRV +A+ ++ D D A +AL+VC ++ P VL
Sbjct 377 HLHTAERLNLHRNTVKYRVGKALAEVPRDRDRLDLA----LALKVCEFLGPTVL 426
>gi|333992637|ref|YP_004525251.1| hypothetical protein JDM601_3997 [Mycobacterium sp. JDM601]
gi|333488605|gb|AEF37997.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=423
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 119/387 (31%), Positives = 193/387 (50%), Gaps = 17/387 (4%)
Query 38 LDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVR 97
L D+ + L ++ N T + + P + + APA AL +AR AQR +P++ LVR
Sbjct 40 LITDSSLLQLLHETVAANVDTYFSAIRHNIPVAEIAAPAVALEHARRLAQRGVPMNALVR 99
Query 98 AHRLGHARFLEVAMQYV--SLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRW 155
+RLGH+ L V ++ + + L+P R+ + E+ +D ++ Q+ Y+ E +RW
Sbjct 100 GYRLGHSMALRVVLEEIRSAELDPDLRLDVLSEMSTLMFGYIDEMSQQVSAVYQAERERW 159
Query 156 LSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQ 215
L R+ ++ V E+LAD +DV A+ Y L H+A +VW + GD + +
Sbjct 160 LESRNAVRALRVREILADEGLDVDAMTTAIRYPLRRTHVALIVWYPESA--GDKLTAAEG 217
Query 216 VRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAF-APSRIRAAFESAGIRARL 274
LA + + P L +P D W A + A +IR ++A +
Sbjct 218 FIKQLAESVAGQGAP----LFIPADSTTGWAWIPLASSAGHDAVEQIRGCAQTASGEPWV 273
Query 275 ACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD--VAPVALL-ADDLEELR 331
A G G+ GFR S +QA V LA+A G RV D ++ ALL D++ R
Sbjct 274 AIGDPLPGVEGFRRSHQQALAVHTLAVASGVT---RVSAAADPGLSAAALLGGDNVAAAR 330
Query 332 RFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCG 391
+V +VLG L+ + + LR+TLR FL S+ A + + H NT++YRV +A+E G
Sbjct 331 AWVGEVLGPLARATDGDERLRDTLRVFLRAGSSFKAAGEQLHFHVNTMKYRVQRAIERRG 390
Query 392 QNLDDPDAAFRVQMALEVCRWMAPAVL 418
+ + + V++AL +C+W AVL
Sbjct 391 RPIAEDR--LDVEIALLLCQWYGAAVL 415
>gi|145225199|ref|YP_001135877.1| hypothetical protein Mflv_4621 [Mycobacterium gilvum PYR-GCK]
gi|145217685|gb|ABP47089.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=430
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 121/422 (29%), Positives = 206/422 (49%), Gaps = 29/422 (6%)
Query 17 QLIRD--DFISELFDKMKA----EIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQS 70
L RD D + EL + + ++ L D ++ +L ++ N T + P
Sbjct 17 HLFRDLGDLVRELPESIHGLLIEQVAELAADQQLKELLSDTVAANVDTWFSVVRHSIPFD 76
Query 71 LVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLE--VAMQYVSLLEPADRVSTIIE 128
+E+P AAL +AR AQR++P++ L+RA+RLGH L +A + L P ++++ I
Sbjct 77 RMESPTAALEHARRMAQREVPVNALLRAYRLGHQYGLNLIIAGLRSADLPPEEKLALIEH 136
Query 129 LVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYR 188
+ S R +D +++Q++ Y+ E W RR L+ Q + ++L VD+ A A+ Y
Sbjct 137 ITRVSFRYIDWMSEQVLETYQTERAEWDERRRSLRAQAIRDILNGRDVDLTEASSAMRYF 196
Query 189 LDGVHIAAVVWVD--SAVPIGDVVA--QFDQVRCLLAGELGPELGPVANSLMVPTDEREA 244
L H+A +VW+D + + +A +F Q G S+ D
Sbjct 197 LGATHVALMVWLDRDAGADTDEFIAMERFMQHAVSATG--------AKQSVYFSIDRLVG 248
Query 245 RLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGG 304
W + + S++R + RL+ G G+ GFR + +QAE+ + +A+A
Sbjct 249 CAWMTVHRPSDYL-SQLRDFVRAQPDGPRLSVGEPLPGIEGFRRTYRQAEQARVVAVAAD 307
Query 305 ARPGGRVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRN 362
R++ D VA ++L D L+ ++ DVLG L+ + + LR+TLR FL
Sbjct 308 GSSRHRIVAARDPGVALASMLMTDDATLKEWIHDVLGPLAQNTASDQRLRDTLRVFLRAG 367
Query 363 RSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFR--VQMALEVCRWMAPAVLRA 420
S+ A A+ + +H NT++YRV +A+E G+ P AA R V++AL +C W+ L
Sbjct 368 SSFKAAANELQIHANTVKYRVNRALERRGR----PIAAERLDVEVALLMCYWLGEVALNP 423
Query 421 KQ 422
Q
Sbjct 424 TQ 425
>gi|183981618|ref|YP_001849909.1| hypothetical protein MMAR_1603 [Mycobacterium marinum M]
gi|183174944|gb|ACC40054.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=421
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 128/411 (32%), Positives = 202/411 (50%), Gaps = 27/411 (6%)
Query 22 DFISELFDKMKAEIRG----LDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAA 77
D ++ L D ++AE+ L DA + L SI N T + VE P A
Sbjct 23 DLMTSLVDTVEAEVVSEVGELREDALLTRLLHDSIRANIETVFSAIRHGIRIENVEPPTA 82
Query 78 ALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARL- 136
AL +AR AQR++ ++ LVR++RLGH L+VA V L+ +S +++ R ++
Sbjct 83 ALEHARRLAQREVSVNSLVRSYRLGHKAVLDVARGQVRALKLDQGLS--LDVFGRIEQVT 140
Query 137 ---VDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVH 193
VD + +++ Y+ EHDRW R+ L+ V E+L +DV + Y L+ +H
Sbjct 141 FGYVDHITQEVVNTYQSEHDRWTENRNSLRALRVREVLDGAQLDVDAMTTEIRYPLNLIH 200
Query 194 IAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFS-PAP 252
+A V+W D +G+ +A +V + G +G A+ L +P D W
Sbjct 201 LATVMWFDGPA-LGNELAIMQRV----IRQFGQSVGASASPLFIPVDRLTGWAWVPLTTD 255
Query 253 TRAFAPSRIRAAFESAGIRARLACGRVGDGL---RGFRASLKQAERVKALALAGGARPGG 309
T A + IR E A R + VGD L GFR S QA +++A+ G+
Sbjct 256 TARNAVTEIR---EFARERTDIPWIAVGDPLPHVAGFRRSHWQARDARSVAIVLGSN-AH 311
Query 310 RVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVA 367
RV D ++ LL ++E+ +V +LG L+ + + LRETLR FL S+ A
Sbjct 312 RVTAAGDPGLSMAGLLGRNIEDAAAWVGQILGPLASQTDSDERLRETLRAFLRSGSSFKA 371
Query 368 TADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
A+ + LH N+++YRV +A++ G+ L D V++AL +C W AVL
Sbjct 372 AAEELHLHHNSVKYRVQRAIKRRGRPLTDD--RLDVEVALLLCHWYGAAVL 420
>gi|118618745|ref|YP_907077.1| hypothetical protein MUL_3434 [Mycobacterium ulcerans Agy99]
gi|118570855|gb|ABL05606.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=421
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 129/413 (32%), Positives = 202/413 (49%), Gaps = 31/413 (7%)
Query 22 DFISELFDKMKAEIRG----LDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAA 77
D ++ L D ++AE+ L DA + L SI N T + VE P A
Sbjct 23 DLMTSLVDTVEAEVVSEVGELREDALLTRLLHDSIRANIETVFSAIRHGIRIENVEPPTA 82
Query 78 ALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARL- 136
AL +AR AQR++ ++ LVR++RLGH L+VA V L+ I+++ R ++
Sbjct 83 ALEHARRLAQREVSVNSLVRSYRLGHKAVLDVARGQVRALKLDQ--GLILDVFGRIEQVT 140
Query 137 ---VDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVH 193
VD + +++ Y+ EHDRW R+ L+ V E+L +DV + Y L+ +H
Sbjct 141 FGYVDHITQEVVNTYQSEHDRWTENRNSLRALRVREVLDGAQLDVDAMTTEIRYPLNLIH 200
Query 194 IAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFS-PAP 252
+A V+W D +G+ +A +V + G +G A+ L +P D W
Sbjct 201 LATVMWFDGPA-LGNELAIMQRV----IRQFGQSVGASASPLFIPVDRLTGWAWVPLTTD 255
Query 253 TRAFAPSRIRAAFESAGIRARLACGRVGDGL---RGFRASLKQAERVKALALAGGARPGG 309
T A + IR E A R + VGD L GFR S QA +++A+ G+
Sbjct 256 TARNAVTEIR---EFARERTDIPWIAVGDPLPHVAGFRRSHWQARDARSVAIVLGSN-AH 311
Query 310 RVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVA 367
RV D ++ LL ++E+ +V +LG L+ + + LRETLR FL S+ A
Sbjct 312 RVTAAGDPGLSMAGLLGRNIEDAAAWVGQILGPLASQTDSDERLRETLRAFLRSGSSFKA 371
Query 368 TADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFR--VQMALEVCRWMAPAVL 418
A+ + LH N+++YRV +A++ G+ P A R V++AL +C W AVL
Sbjct 372 AAEELHLHHNSVKYRVQRAIKRRGR----PLTADRLDVEVALLLCHWYGAAVL 420
>gi|111024979|ref|YP_707399.1| hypothetical protein RHA1_ro08196 [Rhodococcus jostii RHA1]
gi|110823958|gb|ABG99241.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=444
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 132/418 (32%), Positives = 191/418 (46%), Gaps = 29/418 (6%)
Query 12 IARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSL 71
I +Q D + ++ A + LD D M D +S+T+N + L S
Sbjct 28 IGERLQRETDAISHGMTAEIAAAVGELD-DRAMRDALHSSVTDNVEVMIDQLAHSKEVSD 86
Query 72 VEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE--PADRVSTIIEL 129
+ + A YA A++D+P S L RA+ +G L V ++ P ++ L
Sbjct 87 LPSLPHAHRYAEELARQDVPESSLRRAYHVGSHYLLARIFDQVQEIDCPPHEQPPLYRHL 146
Query 130 VNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRL 189
+ +D + Q+I Y+ E R + WV+ +L V A YRL
Sbjct 147 AGWLYQYIDAITRQVIATYQEEQRSSHERAARTTFTWVNRVLEAEDVSPREFSAATKYRL 206
Query 190 DGVHIAAVVWVD----SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
D VH+A VWVD + DQVR +L G P L+V T REA
Sbjct 207 DQVHVACRVWVDDRADQPAHTPALAPLIDQVRAMLGGRDDP--------LVVVTGRREAD 258
Query 246 LWFSPA---PTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALA 302
+WF TRAF + SAG +RLA G G G GFRAS QA + +A
Sbjct 259 VWFGGVHRVDTRAF-----DSVVASAG-GSRLAFGSPGFGPAGFRASRSQAHQASRIAHV 312
Query 303 GGARPGGRVMFYDD--VAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
+ P RV Y D + ++ L DDL R +V DVLG+L+VD + + R+T+R FL
Sbjct 313 A-SDPTARVTSYADEGIPVISRLIDDLPATRAWVHDVLGELAVDSDAAARQRDTVRVFLE 371
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
SY ATA ++LHRN+I+YR+ +A G+ + D Q+AL +CR + VL
Sbjct 372 SAFSYSATASQLMLHRNSIRYRLEKASLQLGRGV--ADKPLDTQLALALCRVLGSVVL 427
>gi|40787287|gb|AAR90204.1| hypothetical protein PDK3.063 [Rhodococcus sp. DK17]
Length=402
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 129/400 (33%), Positives = 186/400 (47%), Gaps = 29/400 (7%)
Query 30 KMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRD 89
++ A + LD D M D +S+T+N + L S + + A YA A++D
Sbjct 4 EIAAAVGELD-DRAMRDALHSSVTDNVEVMIDQLAHSKAVSDLPSLPHAHRYAEELARQD 62
Query 90 IPLSGLVRAHRLGHARFLEVAMQYVSLLE--PADRVSTIIELVNRSARLVDLVADQLIVA 147
+P S L RA+ +G L V ++ P ++ L + +D + Q+I
Sbjct 63 VPESSLRRAYHVGSHYLLARIFDQVQEIDCPPHEQPPLYRHLAGWLYQYIDAITRQVIAT 122
Query 148 YEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVD----SA 203
Y+ E R + WV+ +L V A YRLD VH+A VWVD
Sbjct 123 YQEEQRSSHERAARTTFTWVNRVLEAEDVSPREFSAATKYRLDQVHVACRVWVDDRADQP 182
Query 204 VPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPA---PTRAFAPSR 260
+ DQVR +L G P L+V T REA +WF TRAF
Sbjct 183 AHTPALAPLIDQVRAMLGGRDDP--------LVVVTGRREADVWFGGVHRVDTRAF---- 230
Query 261 IRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD--VA 318
+ SAG +RLA G G G GFRAS QA + +A + P RV Y D +
Sbjct 231 -DSVVASAG-GSRLAFGSPGFGPDGFRASRSQAHQASRIAHVA-SDPTARVTSYADEGIP 287
Query 319 PVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNT 378
++ L DDL R +V DVLG+L+VD + + R+T+R FL SY ATA ++LHRN+
Sbjct 288 VISRLIDDLPATRAWVHDVLGELAVDSDAAARQRDTVRVFLESAFSYSATASQLMLHRNS 347
Query 379 IQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
I+YR+ +A G+ + D Q+AL +CR + VL
Sbjct 348 IRYRLEKASLQLGRGV--ADKPLDTQLALALCRVLGSVVL 385
>gi|324997138|ref|ZP_08118250.1| hypothetical protein PseP1_00170 [Pseudonocardia sp. P1]
Length=421
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 134/422 (32%), Positives = 192/422 (46%), Gaps = 33/422 (7%)
Query 9 ISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTP 68
++ +AR + + EL D I +D + L AS + N V V L P
Sbjct 23 VATVARDIDARLPELTVELTDWFVEMIPEFRHDEAVRQLMVASTSANLVGIVDLLAHAIP 82
Query 69 QSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYV---SLLEPADRVST 125
+ P AA YAR AQ ++ L L+RA+RLG RF + A+ + +P ++
Sbjct 83 VEQIAVPPAAAEYARRFAQHELSLEALLRAYRLGEHRFGQWALDSLRRGDRRDPDVVLAA 142
Query 126 IIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERAL 185
+ L R+ R +D V + LI YE E RW +R + +L P+ A+ L
Sbjct 143 VASLSERTNRYIDQVVEGLIDIYETERRRWSTRSGAGLAARIRMVLDTDPLSDSAADELL 202
Query 186 GYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREAR 245
G + G H AAV+W+ D LL G L V D R
Sbjct 203 GLPVRGWHRAAVLWISGPTDAPDDDGLLQAGARLLHDAAG------RPPLTVLADSRTLW 256
Query 246 LWFSPAPTRAFAPSRIRAAFESAGIRARL------ACGRVGDGLRGFRASLKQAERVKAL 299
W S PT AP+ + +RAR+ A G GL GFR SL +A R +A+
Sbjct 257 AWHS-GPT---APT-----LDVELLRARMPGDVCVALGAPAAGLSGFRGSLAEAVRARAV 307
Query 300 ALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 359
P V +DDVA ALL + E+LRR+V VL DL D+ +RETLR FL
Sbjct 308 VENSVLTPP--VTEFDDVAIPALLTERTEDLRRWVARVLSDLDSDEPGVDQVRETLRVFL 365
Query 360 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
SY A + LH+NT+ YRV +A +L G+ L D +V++AL +A ++LR
Sbjct 366 ASGGSYTQAAGRLHLHKNTVHYRVRKAEDLRGRPLGDD--RLQVEVAL-----LAASLLR 418
Query 420 AK 421
++
Sbjct 419 SR 420
>gi|183981345|ref|YP_001849636.1| hypothetical protein MMAR_1323 [Mycobacterium marinum M]
gi|183174671|gb|ACC39781.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=426
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 126/405 (32%), Positives = 199/405 (50%), Gaps = 20/405 (4%)
Query 25 SELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARA 84
S + ++ I L D R+ +L SI N T +H L D VEAP AL YAR
Sbjct 32 SHIRRSLERTIPELGGDLRIEELLGTSIEANVDTMLHALRYDIAVERVEAPTTALEYARR 91
Query 85 AAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE-PADRVSTIIELVNRSA-RLVDLVAD 142
AQ +P++ LVRA+RLG E+ + + P +IE ++ + +D ++
Sbjct 92 LAQHGVPVNALVRAYRLGQRLMNELIFDELRATDIPESMRVPVIEAISVAMFEYIDWMSQ 151
Query 143 QLIVAYEHEHDRWLSRRSGLQQQWVSELL-ADTPVDVPRAERALGYRLDGVHIAAVVWVD 201
Q++V YE E +RWL ++ L+ V E+L AD +D A ++ + L H+A V+W
Sbjct 152 QVVVVYEDERERWLENQNSLRGVRVREILAADKEIDADAAITSVRHPLRWHHLALVMWYP 211
Query 202 SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRI 261
G V + +++ L ELG G A+ L V D W P RA AP R
Sbjct 212 DQ---GSEVDELPRLQRFLR-ELGEAAGVDASPLFVAADPSCGWGWL---PYRA-APERA 263
Query 262 RAAFESAGIRAR-----LACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYD- 315
AA +R+R +A G G+ GFR S ++A +++ + RP + +
Sbjct 264 VAAVADF-VRSRPDSPNVAIGTPASGVDGFRRSHREAAEARSVGILCDRRPPLMISAGEP 322
Query 316 DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILH 375
++ VA DL R +V VLG+L+ D + ++ LR+TLR +L SY A + +H
Sbjct 323 GLSVVARFGGDLAGTREWVAQVLGELARDGDSDARLRDTLRVYLACGSSYKLAAQRLNMH 382
Query 376 RNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRA 420
NT++YRV +A+ G+ + V++AL VC W +VL++
Sbjct 383 FNTVKYRVGRAVARRGRAIGSDR--LDVELALLVCHWYGGSVLQS 425
>gi|118618038|ref|YP_906370.1| hypothetical protein MUL_2557 [Mycobacterium ulcerans Agy99]
gi|118570148|gb|ABL04899.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=405
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 127/406 (32%), Positives = 198/406 (49%), Gaps = 21/406 (5%)
Query 25 SELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARA 84
S + ++ I L D R+ +L SI N T +H L D VEAP AL YAR
Sbjct 12 SHIRRSLERTIPELGGDLRIEELLGTSIEANVDTMLHALRYDIAVERVEAPTTALEYARR 71
Query 85 AAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE-PADRVSTIIELVNRSA-RLVDLVAD 142
AQ +P++ LVRA+RLG E+ + + P +IE ++ + +D ++
Sbjct 72 LAQHGVPVNALVRAYRLGQRLMNELIFDELRATDIPESMRVPVIEAISVAMFEYIDWMSQ 131
Query 143 QLIVAYEHEHDRWLSRRSGLQQQWVSELL-ADTPVDVPRAERALGYRLDGVHIAAVVWVD 201
Q++V YE E +RWL ++ L+ V E L AD +D A ++ + L H A V+W
Sbjct 132 QVVV-YEDERERWLENQNSLRGVRVRETLAADKEIDADAAITSVRHPLRWHHPALVMWYP 190
Query 202 SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRI 261
G V + +++ L ELG G A+ L V D W P RA AP R
Sbjct 191 DQ---GSEVDELPRLQRFLR-ELGEAAGVDASPLFVAADPSCGWGWL---PYRA-APERA 242
Query 262 RAAFESAGIRAR-----LACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYD- 315
AA +R+R +A G G+ GFR S ++A +++ + RP + +
Sbjct 243 VAAVADF-VRSRPDSPNVAIGTPASGVDGFRRSHREAAEARSVGILCDRRPPLMISAGEP 301
Query 316 DVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILH 375
++ VA DL R +V VLG+L+ D + ++ LR+TLR +L SY A + +H
Sbjct 302 GLSVVARFGGDLAGTREWVAQVLGELARDGDSDARLRDTLRVYLACGSSYKLAAQRLNMH 361
Query 376 RNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLRAK 421
NT++YRV +A+ G+ + V++AL VC W +VL++K
Sbjct 362 FNTVKYRVGRAVARRGRAIGSD--RLDVELALLVCHWYGGSVLQSK 405
>gi|296140551|ref|YP_003647794.1| PucR family transcriptional regulator [Tsukamurella paurometabola
DSM 20162]
gi|296028685|gb|ADG79455.1| putative transcriptional regulator, PucR family [Tsukamurella
paurometabola DSM 20162]
Length=421
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 129/387 (34%), Positives = 185/387 (48%), Gaps = 22/387 (5%)
Query 31 MKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDI 90
M AEI L D + AS+ N T +H + +EAP AA+ Y R AQR +
Sbjct 39 MLAEIPELRADHALDMALAASVAGNVDTVLHGMMLGVEPGRIEAPLAAMEYPRRLAQRGL 98
Query 91 PLSGLVRAHRLGHARFLEVAMQYV--SLLEPADRVSTIIELVNRSARLVDLVADQLIVAY 148
P++ LVRA+RLG A + V + L +++ + + S D V + +I AY
Sbjct 99 PVTALVRAYRLGQASMVRQMHDAVRATGLTVEQKLAAHEWITDWSFAYSDTVIETVITAY 158
Query 149 EHEHDRWLSRRSGLQQQWVSELLA-DTPVDVPRAERALGYRLDGVHIAAVV-WVDSAVPI 206
+ E DRW+ RSG + V ELLA D VD A A+GY L H+A + + DS
Sbjct 159 QRERDRWMQARSGARVARVRELLAHDGVVDADAASLAIGYPLRRSHLALIASYSDS---- 214
Query 207 GDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPA-PTRAFAPSRIRAAF 265
D D+V + EL +GPV L++ D+R W P A +R R A
Sbjct 215 -DDTDGPDRVEVFVR-ELAAAVGPVEAPLLLAADQRTVWGWLPVGDPVEAVDRAR-RHAA 271
Query 266 ESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDD--VAPVALL 323
S+G R+A G V G+ GFR S +A + A V+F D V AL+
Sbjct 272 ASSGDGPRVAFGAVRPGIEGFRRSHTEAAAARRTAGER------PVVFAGDPGVMVAALV 325
Query 324 ADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRV 383
D +R+ DVLG L+ D E ++ LR TL +L Y A A + LH N+++YRV
Sbjct 326 GTDPGAAQRWARDVLGPLAEDTEADARLRRTLAVYLRHGGGYKAAAAELTLHPNSVKYRV 385
Query 384 IQAMELCGQNLDDPDAAFRVQMALEVC 410
+A+E G+ + V++AL VC
Sbjct 386 QRALERRGRGIGADR--LDVEVALLVC 410
>gi|240170688|ref|ZP_04749347.1| hypothetical protein MkanA1_15340 [Mycobacterium kansasii ATCC
12478]
Length=434
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 102/336 (31%), Positives = 156/336 (47%), Gaps = 25/336 (7%)
Query 72 VEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRVSTIIELVN 131
VE PA LA ARA R IPL+ L+R +RL + ++ DR +
Sbjct 84 VELPAPTLAIARAGVVRQIPLANLMRFYRLAQTLLWQWMWDRITAAA-TDRAQQALAFRL 142
Query 132 RSARL---VDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYR 188
++ + VD ++ AYE E + WL + + + ++L D RA + L Y
Sbjct 143 ATSWMFGYVDAALNRAEQAYEAEREIWLRNTAAARTDAIDDILTQRERDPQRASKRLRYD 202
Query 189 LDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWF 248
++ H+ + WVDSA D + ++ LA E+ + +L+ P A WF
Sbjct 203 VNRHHVGVIAWVDSAPEHRDAQSSLNEALTTLAREMRAD-----TTLIHPGGSLVAFGWF 257
Query 249 SPAPTRAFAPSRIRAAFESAGIRAR---------LACGRVGDGLRGFRASLKQAERVKAL 299
S + + A F++ G+ R + G G GL+GFR S +A + +
Sbjct 258 S------WRSAIGTAGFDTTGVSTRRPTLPDGVRVGIGEPGHGLKGFRCSHIEASNARRV 311
Query 300 ALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFL 359
A GAR G + Y DVA AL + D E FV VLG L+ DDE + TL +L
Sbjct 312 ASLAGAR-AGTLTHYRDVAVAALASCDAEHAASFVHRVLGPLAADDEATYRVATTLSVYL 370
Query 360 LRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLD 395
NRS + TA + +H NT+ YRV QA ++ G+++D
Sbjct 371 QENRSRLRTAQRLTVHPNTVSYRVDQAEKILGRSID 406
>gi|54025185|ref|YP_119427.1| hypothetical protein nfa32160 [Nocardia farcinica IFM 10152]
gi|54016693|dbj|BAD58063.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=414
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 123/398 (31%), Positives = 179/398 (45%), Gaps = 32/398 (8%)
Query 22 DFISELFDKMKAEIRGLDYDARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAY 81
D ++ +F +M E R +D + L AS N + L V P AA Y
Sbjct 29 DEMTGMFVEMIPEFR---HDDEVRRLMVASTGGNLSAIMDLLALSISFDEVSVPPAAAEY 85
Query 82 ARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEPADRV-----STIIELVNRSARL 136
AR AQ + L LVRA+RLG FL+ A+ + L P+ + S I E+VNR
Sbjct 86 ARRFAQHGMSLEALVRAYRLGEHMFLQRAITALGELGPSAELALATTSHIAEMVNR---Y 142
Query 137 VDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAA 196
+D V + +I YE+E RW +R + V +L +D+ AE+ LG L G H+AA
Sbjct 143 IDRVLEGVIDIYENERQRWDARSDATRAAQVRAVLDGEGLDLASAEQMLGTSLRGWHLAA 202
Query 197 VVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANS---LMVPTDEREARLWFSPAPT 253
++W D + + G EL A L + DE W S A
Sbjct 203 IIWTPPGTAASDTLLRA-----------GVELLSAATGKRPLTILVDEHNCWAWISSAGK 251
Query 254 RAFAPSRIRAAFE-SAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGGRVM 312
+ A+ G+R +A G GL GFR + + A A+A A +
Sbjct 252 PVLDVDALEASLRRHPGLR--MAVGERDSGLAGFRRTF--LDASAARAVAVAAPRVRELT 307
Query 313 FYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAM 372
Y V+ +LL D L E++ + VLGDL DDE + LR+T + +L S A M
Sbjct 308 LYSRVSVASLLLDRLPEVKAWAQRVLGDLMRDDESTARLRDTAQVYLDARGSLTDAAARM 367
Query 373 ILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVC 410
+H+NT+ YRV +A EL G +L +++AL VC
Sbjct 368 HVHKNTVHYRVRKAEELLGHSLTVNR--LELELALLVC 403
>gi|290955996|ref|YP_003487178.1| hypothetical protein SCAB_14641 [Streptomyces scabiei 87.22]
gi|260645522|emb|CBG68612.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length=413
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/359 (31%), Positives = 165/359 (46%), Gaps = 20/359 (5%)
Query 63 LDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYV-SLLEPAD 121
L DTP V P AL R + R + L ++ A + H E ++++ L P +
Sbjct 67 LHADTPGEAVATPQLALDGNRDSVHRGVALDRVLHAMWISHVHHYERLLEFLDQTLPPHE 126
Query 122 RVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRA 181
T+ + S V+ + Y E + WL + ++Q + +++A P+ V
Sbjct 127 LAGTVRRVTELSFAYVEAFTARFSAEYTAEREAWLGSLAATRRQVIEDIIAGVPISVRDP 186
Query 182 ERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCL--LAGELGPELGPVANSLMVPT 239
E LG L H AAV+W + A D L LAG L + L+V
Sbjct 187 ELVLGLDLSRHHRAAVLWTEDD-------AGTDSAHTLHRLAGRLA-DAADAGRPLVVRP 238
Query 240 DEREARLWFS-PAPTRAFAPSRIRAAFESA-GIRARLACGRVGDGLRGFRASLKQAERVK 297
LW S AP R+R A ++ GIRA A G + G+ GFR S A V+
Sbjct 239 GGTSLWLWMSWAAPPEQRLAERLRGAVDAPHGIRA--ALGPLDAGIDGFRRSHLGALEVR 296
Query 298 ALALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLRE 357
+A A R + +D++ +ALL + E R FV LG L+ DD+R++ LRETLR
Sbjct 297 RVA-AASTRRSSWLADHDELEVIALLTANPEHARWFVHRQLGALATDDDRSAELRETLRL 355
Query 358 FLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNL-DDPDAAFRVQMALEVCRWMAP 415
+L RS A A + + NT+ YRV QA + G +L DP R+ +ALE+ ++ P
Sbjct 356 YLAFERSRTAAAQVLHVAPNTVGYRVRQAEAILGTDLAKDP---LRIGLALEIYDYLNP 411
>gi|345013629|ref|YP_004815983.1| putative PucR family transcriptional regulator [Streptomyces
violaceusniger Tu 4113]
gi|344039978|gb|AEM85703.1| putative transcriptional regulator, PucR family [Streptomyces
violaceusniger Tu 4113]
Length=457
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 112/341 (33%), Positives = 152/341 (45%), Gaps = 36/341 (10%)
Query 87 QRDIPLSGLVRAHRLGHARFLEVAMQYVS-LLEPADRVSTIIELVNRSARLVDLVADQLI 145
R +PL ++ R+GHA E ++ + L++P V + + VD ++D +I
Sbjct 116 HRAVPLERVLHGVRIGHAATTEAFLRACAELVDPEAAVDEVTAISRELFSYVDDLSDTMI 175
Query 146 VAYEHEHDRWLSRRSGLQQQWVSELLAD---TPVDVPRAERALGYRLDGVHIAAVVWVDS 202
AY EH+ W + + + V LL+D T DV A RALGY L H A VVW D
Sbjct 176 RAYLVEHEVWSTSAAAARADIVRSLLSDATATATDVGEASRALGYDLRRTHEAVVVWSD- 234
Query 203 AVPIGDVVAQFDQVRCLLA-GELGPELGPVANSLM-----VPTD-------EREARLWFS 249
VP G Q L A G + PVA+ + VP+D R W +
Sbjct 235 -VPNGSSTLQAVATEALRARGATTTLVVPVASGRLWAWGTVPSDGTVTSDGTRRTGSWET 293
Query 250 PAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQA---ERVKALALAGGAR 306
A A A R +AAF G G G+ GFR S ++A ERV+ L G
Sbjct 294 IA--EALARQRTQAAF-----------GTPGGGVEGFRRSHREARRGERVERLRREAGRV 340
Query 307 PGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYV 366
P Y DVA +ALLA DL+ FV LG L+ LR TL ++ RS
Sbjct 341 PR-HATAYADVAAIALLATDLDAAGDFVRRELGGLAARSASMEALRTTLYHYIGAERSLA 399
Query 367 ATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMAL 407
A + + R T+ YRV +A E+ G LDD A +AL
Sbjct 400 DVARRLHVARGTVTYRVKRAQEVLGHGLDDRRFALHTALAL 440
>gi|240170335|ref|ZP_04748994.1| hypothetical protein MkanA1_13565 [Mycobacterium kansasii ATCC
12478]
Length=415
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 120/406 (30%), Positives = 178/406 (44%), Gaps = 27/406 (6%)
Query 21 DDFISELFDKMKAEIRG-LDYDARMADLWRASITENFVTAVHYLDR-DTPQSLVEAPAAA 78
D ++E+ D EI + DA +A AS N V + L R D + + P A
Sbjct 26 DHLVAEM-DAAVVEIAPVMGADAAIAAEMSASNRANAVRLLTALARRDGRGAPADVPPEA 84
Query 79 LAYARAAAQRDIPLSGLVRAHRLGH----ARFLEVAMQYVSLLEPADRVSTIIELVNRSA 134
L R +R I L + +A+R G RF+ A Q V+ P + ++E+ ++
Sbjct 85 LDVVRTVVRRGIDLDVIFQAYRRGQNVAWQRFMVYAAQVVA---PGPELVALLEVTSQLM 141
Query 135 -RLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVH 193
VD V ++I +HE + L + + V +L P+D RA LGY LD H
Sbjct 142 FSYVDQVIGRVIADAQHEREELLGGAMARRTETVRLILDGAPIDRRRASERLGYELDRRH 201
Query 194 IAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSPAPT 253
A V+W + P G++ + LLA L +P+ W T
Sbjct 202 TALVLWAE---PRGEIQGVLESAATLLARAAR-----ARPPLTLPSGTATLWAWLGTNAT 253
Query 254 RAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPGG-RVM 312
A +R A SA R+A G G+ GFR S A ++ L G PGG R+
Sbjct 254 PAM--DALRDAIRSADPTVRVAVGPTQPGITGFRRSHAAALVIQRLL---GGHPGGERIA 308
Query 313 FYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAM 372
F+ D+ AL A + ++ FV LG L+ D + LRETLR FL + +
Sbjct 309 FHRDLEVTALAAQNQDQAAEFVATTLGPLAADTPGAARLRETLRVFLDEAENAPRAGIRL 368
Query 373 ILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVL 418
HRNT+ RV +A EL G + + A V++ALE+ + P VL
Sbjct 369 HTHRNTVLQRVARATELLGHHPGERRLA--VELALELTHHIGPRVL 412
>gi|111017753|ref|YP_700725.1| hypothetical protein RHA1_ro00732 [Rhodococcus jostii RHA1]
gi|110817283|gb|ABG92567.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=397
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 111/385 (29%), Positives = 165/385 (43%), Gaps = 18/385 (4%)
Query 41 DARMADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHR 100
DA + L S + N + + + EAP AL +ARA A R + ++R +R
Sbjct 26 DAELRGLTLGSCSSNLEAVLSMVRHGIDVAAAEAPVTALEHARAMASRGHSVDVMLRFYR 85
Query 101 LGHARFLEVAMQYVS--LLEPADRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSR 158
LGH F E ++ + +PA + T I+L R +D ++ + Y E DR ++
Sbjct 86 LGHEYFTEKLSDSLTDWIEDPAVALRTFIDLERFGFRYIDRISSLVAAEYVAELDRRQNQ 145
Query 159 RSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRC 218
+ V LLA VD+ RAER L +R G I V WVD G + F
Sbjct 146 ARAERADLVRALLAGERVDIARAERVLSHRFTGRQIGFVCWVDDR---GVDLEGF----- 197
Query 219 LLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGR 278
A ++G LG +SL+V W S + + + F +A G
Sbjct 198 --ARQVGRFLG-AGHSLVVADGPLAVWGWASITGDVRTSLTGMATEFPGERENVHIAVGS 254
Query 279 VGDGLRGFRASLKQAERV-KALALAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDV 337
G GFR S +A R + + L+G A P + + DVA V ++ DL+ R FV
Sbjct 255 PHPGAAGFRTSHLEALRTRRIIELSGRAAPS--ITQFSDVALVDAISRDLDAARAFVAAQ 312
Query 338 LGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDP 397
LG L+ DD + R L L S A + +HRNT+ RV +A E G+
Sbjct 313 LGALARDDAKERSERAALLAVLDAQGSLATAASTLGIHRNTVLQRVRRAEERRGRPATIN 372
Query 398 DAAFRVQMALEVCRWMAPAVLRAKQ 422
A + AL VC + +VL +
Sbjct 373 IA--ELHAALTVCNVLGASVLHGSE 395
>gi|54022358|ref|YP_116600.1| hypothetical protein nfa3940 [Nocardia farcinica IFM 10152]
gi|54013866|dbj|BAD55236.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=431
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 102/331 (31%), Positives = 149/331 (46%), Gaps = 19/331 (5%)
Query 75 PAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLE-PAD-RVSTIIELVNR 132
PA A A+AR A+R L L+R + G L+ V E PA+ + ++ + +R
Sbjct 86 PAEAHAFARTIARRGFDLRVLLRTYHAGMEAVLDYMNDAVGQREVPAEIERAVMLRMFDR 145
Query 133 SARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGV 192
+ + + L + L Y E +R L + + V LLA +D +A LGYRL
Sbjct 146 TTKWISLSVELLTDTYMEERERVLRAALNRRTETVHALLAGEELDADQASVRLGYRLGLH 205
Query 193 HIAAVVWVDSAVPIGD--VVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSP 250
H+A V+W D P GD V D+V +A ELG L V + W
Sbjct 206 HLAFVLWTDRIEPGGDAEVTGLLDRVAARVAAELGTN-----RLLTVASGASGMWAWAGL 260
Query 251 APTRAFA----PSRIRAAFESAGIRA--RLACGRVGDGLRGFRASLKQAERVKALALAGG 304
A P RI + I A R+A G G + GFR ++A + +A GG
Sbjct 261 DDAAHAADLAAPGRIEQVLDGQ-IEAPVRIAFGVPGARVAGFRDGHREAMAARQVAERGG 319
Query 305 ARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRS 364
R RV+ Y DV L D + + LG L+ D LR+TL +L R RS
Sbjct 320 GR---RVVAYRDVEIAYLAGVDQHAMWGLIRRELGALAGTDPATVRLRDTLHVYLSRQRS 376
Query 365 YVATADAMILHRNTIQYRVIQAMELCGQNLD 395
ATA A+ +H+NT++YR+ + EL G ++
Sbjct 377 PEATAKALGVHKNTVRYRLQRIEELLGHPVE 407
>gi|111025737|ref|YP_708157.1| hypothetical protein RHA1_ro08955 [Rhodococcus jostii RHA1]
gi|110824716|gb|ABG99999.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=425
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/336 (28%), Positives = 156/336 (47%), Gaps = 25/336 (7%)
Query 67 TPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLE----VAMQYVSLLEPADR 122
+P S V+ P AL +A A + +PL ++R ++LG +L V ++ + + AD
Sbjct 87 SPGSEVQPPREALDFADEAVVQQVPLVAVLRGYQLGMQHWLRWCAPVIARHTNPVVQADE 146
Query 123 VSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAE 182
+ + V R +D +++ +I YE E R + + + V +LA V+V
Sbjct 147 LQLAVSAV---VRYIDRLSEIMIAEYERELQRRATSGASRRAALVRAVLAGDVVNVDDTA 203
Query 183 RALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDER 242
L Y L G H+A + + D Q D + A +G L + T
Sbjct 204 HLLHYPLAGRHMALALHSRA-----DSTNQVDVLEAA-ARSFATSVG-ATGLLTIATGLA 256
Query 243 EARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALA 302
W + + +R R + G G+ GF S +QA ++AL +
Sbjct 257 TMDAWVAVKADGGRPTNPMRE-------RVTIGVGTPLTGVAGFVQSHRQA--LRALEIL 307
Query 303 GGARPGGR--VMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLL 360
A PG + +YD V ++L+A D+ +++ FVT LG L+ DER+ LRETL FL
Sbjct 308 HMAAPGKLDPITYYDRVRLISLVAKDIPDVQTFVTATLGGLAGRDERSHELRETLLAFLE 367
Query 361 RNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDD 396
N+SY A + + LH+NT+ RV +A EL G+++ +
Sbjct 368 ANKSYTAVSLSSHLHKNTVVQRVKRASELTGRDITN 403
>gi|226309082|ref|YP_002769042.1| CdaR family transcriptional regulator [Rhodococcus erythropolis
PR4]
gi|226188199|dbj|BAH36303.1| putative CdaR family transcriptional regulator [Rhodococcus erythropolis
PR4]
Length=410
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/327 (28%), Positives = 156/327 (48%), Gaps = 16/327 (4%)
Query 72 VEAPAAALAYARAAAQRDIPLSGLVR---AHRLGHARFLEVAMQYVSLLEPADRVSTIIE 128
V PA ++A A A+R + L L++ A RL F++ +++ + + P + + ++
Sbjct 73 VTPPAESVALALTVARRGMDLRVLLKIYGAGRLAMLGFVDESIEALPI-GPELKRALLVR 131
Query 129 LVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYR 188
+ + R +++ + L+ Y E + + + V ++A + A R L Y
Sbjct 132 VWGSAMRWLEVTTELLVATYAKERESLARGAFARRSETVHAIVAGEALHSDEASRILDYP 191
Query 189 LDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWF 248
+ H A V+W D P DV+A+ D E E L +P+ RE W
Sbjct 192 MRRHHTAFVLWTDDTDPAADVLARLDSYARTFVDESNGE-----RVLTLPSGARELWSWV 246
Query 249 SPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARPG 308
+ + +A S + A R+A G G G+ GF S ++A + +A+ G PG
Sbjct 247 AHFDGQEWA-SHVDARHSDL----RVAVGASGYGMEGFARSHREALAAQRVAVRSGGSPG 301
Query 309 GRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYVAT 368
+ Y+DV LL +DL ELR V L L+ D + +R+T+R + N + AT
Sbjct 302 --ITVYEDVQVPCLLTEDLGELRELVARELKGLAGADGVTTRIRDTVRVYYENNCTAAAT 359
Query 369 ADAMILHRNTIQYRVIQAMELCGQNLD 395
A A+ LH+NT++YR+ QA +L G+++D
Sbjct 360 AAALGLHKNTVRYRLDQAEKLLGRSVD 386
>gi|333920275|ref|YP_004493856.1| hypothetical protein AS9A_2609 [Amycolicicoccus subflavus DQS3-9A1]
gi|333482496|gb|AEF41056.1| hypothetical protein AS9A_2609 [Amycolicicoccus subflavus DQS3-9A1]
Length=414
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/332 (29%), Positives = 151/332 (46%), Gaps = 16/332 (4%)
Query 67 TPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLL--EPADRVS 124
TP + + P A+A AR A+R + + L++ + G L + + V + +P +++
Sbjct 70 TPSAEYQPPPEAVALARTVARRGMDVQILLKIYGTGRTTALTLLNEIVQEIPVDPELKLA 129
Query 125 TIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERA 184
I++L + R ++L D L+ Y E + + + + V LL + V A
Sbjct 130 AIVDLWGLAMRWLELSTDVLLSTYTTEREALMRGALARRAETVHSLLRGEKLPVDDASAQ 189
Query 185 LGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREA 244
L Y L H A V+W D P D++ Q + ++A LG + +L V + R
Sbjct 190 LDYPLRRYHTAVVLWTDQEEPGTDILPQLETAAQVVARALG-----ASRALTVASGARGL 244
Query 245 RLWFSPAPTRAFAPSRIRAAFESAGIRARL--ACGRVGDGLRGFRASLKQAERVKALALA 302
W + P A SA A L A G G+ GF AS ++A +A A
Sbjct 245 WAWIATIEL----PDLDELALISAW-PAHLGGAVGTPIRGVAGFVASHREAIAANRVATA 299
Query 303 GGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRN 362
P R YD+V L+ D + LR FV LG+L+ DD+ + LRETL + +
Sbjct 300 RSGSP--RFTRYDEVQIPYLMGLDRDALRTFVQRELGELASDDDSAARLRETLLAYFVSG 357
Query 363 RSYVATADAMILHRNTIQYRVIQAMELCGQNL 394
S A + +H+NT++YRV QA + G +L
Sbjct 358 SSPARAARQLQVHKNTVRYRVEQAQAVLGDSL 389
>gi|111020067|ref|YP_703039.1| hypothetical protein RHA1_ro03078 [Rhodococcus jostii RHA1]
gi|110819597|gb|ABG94881.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=414
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/353 (29%), Positives = 162/353 (46%), Gaps = 18/353 (5%)
Query 72 VEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVS--LLEPADRVSTIIEL 129
V+ P AA AR A+R + L L++ +R+G L A + S + +P +I L
Sbjct 74 VDLPPAAHGLARTIARRGLHLRVLMQIYRVGQKALLRFAAETASERITDPVLEPKVLIRL 133
Query 130 VNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRL 189
+ R+ +++ + L Y E +R LS Q + V ++ D A L Y L
Sbjct 134 LERANHWLNVSLEVLADTYSEERERGLSGAFARQAETVQAIIRGEIADTVAASNRLNYPL 193
Query 190 DGVHIAAVVWVD--SAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLW 247
+ A V+W++ + D + D +A ++G L VP+ R W
Sbjct 194 LVHNTALVLWLEDTQSNQAEDEIGVLDSAARTVAAKIGAR-----QMLTVPSGSRGLWAW 248
Query 248 FSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALALAGGARP 307
+ I AGIR +A G G G++GFR S +A + ++ +RP
Sbjct 249 LAAEVEPDLTALDIGPGI-PAGIR--IAIGNPGKGIQGFRQSHIEAIAAQRIS---ESRP 302
Query 308 G-GRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLRNRSYV 366
+++ Y DV V LL + R V+ L L D ++ LR TLR +L NRS
Sbjct 303 AETQLICYADVEIVHLLDGHPDAARALVSRELRGLDGTDAASAMLRRTLRGYLTVNRSPD 362
Query 367 ATADAMILHRNTIQYRVIQAMELCGQNLDDPDAAFRVQMALEVCRWMAPAVLR 419
A A A+ +H+NT++YR+ +A EL G+ + P+ ++++ALE P VL+
Sbjct 363 AAARALGVHKNTVRYRIQRAEELLGRPV-GPN-RLKLELALEYADTYGPVVLK 413
>gi|296394810|ref|YP_003659694.1| PucR family transcriptional regulator [Segniliparus rotundus
DSM 44985]
gi|296181957|gb|ADG98863.1| putative transcriptional regulator, PucR family [Segniliparus
rotundus DSM 44985]
Length=431
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/330 (30%), Positives = 146/330 (45%), Gaps = 14/330 (4%)
Query 63 LDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLGHARFLEVAMQYVSLLEP-AD 121
L RDT + AP LA R I + ++R+ L HA EV + V + P A
Sbjct 87 LARDTEVTSATAPEV-LAGPVELVARGIGVEHMLRSIHLAHAAAAEVLIDAVGRIVPQAR 145
Query 122 RVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRSGLQQQWVSELLADTPVDVPRA 181
R + + +VD++ + + + H+ W + + ++ + V ++L V + RA
Sbjct 146 RFDETRRINDFLFHIVDIMNTHMSMEFARAHEAWSTSSNAMRMEVVEDILRGADVPLGRA 205
Query 182 ERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDE 241
R LGY L H+A + W P A+ Q+R A L L +
Sbjct 206 VRVLGYDLSRWHLAVIAWTGGPAP-----AEPKQLREAAAAALAAAGCASTAVLSLGAQ- 259
Query 242 REARLWFSPAPTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALAL 301
R+W + T S G+R LA G G G+ GFR S QA R + +
Sbjct 260 ---RVWAWGSRTAQPPMPNEEPPPISPGVR--LATGLPGFGVDGFRRSHDQASRAARVGV 314
Query 302 AGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSWLRETLREFLLR 361
AR Y DV VA+L+ DL FV LG+L+ E + LR TL+ +L R
Sbjct 315 MSTARDTW-FFPYGDVDIVAMLSADLPVAGEFVVRELGELAGPGESTAVLRHTLKCYLDR 373
Query 362 NRSYVATADAMILHRNTIQYRVIQAMELCG 391
+RS TAD + + RNT+ YRV +A +L G
Sbjct 374 DRSLARTADCLHVARNTVAYRVRRAEQLRG 403
Lambda K H
0.324 0.137 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 856034544960
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40