BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3521
Length=303
Sequences producing significant alignments: Score (Bits) E Value
gi|167969131|ref|ZP_02551408.1| hypothetical protein MtubH3_1430... 617 8e-175
gi|15843131|ref|NP_338168.1| hypothetical protein MT3622 [Mycoba... 615 4e-174
gi|15610657|ref|NP_218038.1| hypothetical protein Rv3521 [Mycoba... 614 5e-174
gi|289555786|ref|ZP_06444996.1| conserved hypothetical protein [... 613 1e-173
gi|31794697|ref|NP_857190.1| hypothetical protein Mb3551 [Mycoba... 612 2e-173
gi|340628485|ref|YP_004746937.1| hypothetical protein MCAN_35321... 611 5e-173
gi|289440950|ref|ZP_06430694.1| conserved hypothetical protein [... 610 1e-172
gi|296166551|ref|ZP_06848982.1| conserved hypothetical protein [... 523 2e-146
gi|41406642|ref|NP_959478.1| hypothetical protein MAP0544c [Myco... 521 4e-146
gi|118463428|ref|YP_879918.1| hypothetical protein MAV_0638 [Myc... 521 7e-146
gi|254773595|ref|ZP_05215111.1| hypothetical protein MaviaA2_028... 519 3e-145
gi|254822605|ref|ZP_05227606.1| hypothetical protein MintA_21929... 516 1e-144
gi|240172353|ref|ZP_04751012.1| hypothetical protein MkanA1_2376... 511 4e-143
gi|342862255|ref|ZP_08718897.1| hypothetical protein MCOL_25316 ... 511 6e-143
gi|118619268|ref|YP_907600.1| hypothetical protein MUL_4080 [Myc... 510 1e-142
gi|183984974|ref|YP_001853265.1| hypothetical protein MMAR_5006 ... 509 2e-142
gi|126437580|ref|YP_001073271.1| hypothetical protein Mjls_5016 ... 477 1e-132
gi|108801597|ref|YP_641794.1| hypothetical protein Mmcs_4634 [My... 474 8e-132
gi|145222130|ref|YP_001132808.1| hypothetical protein Mflv_1538 ... 473 2e-131
gi|315442569|ref|YP_004075448.1| nucleic-acid-binding protein co... 468 4e-130
gi|120406168|ref|YP_955997.1| hypothetical protein Mvan_5220 [My... 465 4e-129
gi|333992300|ref|YP_004524914.1| hypothetical protein JDM601_366... 462 4e-128
gi|118470277|ref|YP_890147.1| hypothetical protein MSMEG_5921 [M... 460 1e-127
gi|169631245|ref|YP_001704894.1| hypothetical protein MAB_4167 [... 457 8e-127
gi|111021657|ref|YP_704629.1| hypothetical protein RHA1_ro04685 ... 408 4e-112
gi|54022490|ref|YP_116732.1| hypothetical protein nfa5230 [Nocar... 407 1e-111
gi|226304421|ref|YP_002764379.1| hypothetical protein RER_09320 ... 406 2e-111
gi|226364194|ref|YP_002781976.1| hypothetical protein ROP_47840 ... 404 6e-111
gi|312138195|ref|YP_004005531.1| hypothetical protein REQ_07290 ... 399 2e-109
gi|325674900|ref|ZP_08154587.1| hypothetical protein HMPREF0724_... 390 1e-106
gi|312191035|gb|ADQ43400.1| hypothetical protein ro04685 [Rhodoc... 382 4e-104
gi|300784463|ref|YP_003764754.1| hypothetical protein AMED_2557 ... 375 5e-102
gi|302525688|ref|ZP_07278030.1| conserved hypothetical protein [... 372 3e-101
gi|333918657|ref|YP_004492238.1| hypothetical protein AS9A_0986 ... 359 4e-97
gi|296141489|ref|YP_003648732.1| hypothetical protein Tpau_3818 ... 355 4e-96
gi|119716957|ref|YP_923922.1| hypothetical protein Noca_2732 [No... 346 2e-93
gi|343928237|ref|ZP_08767691.1| hypothetical protein GOALK_111_0... 346 3e-93
gi|262203823|ref|YP_003275031.1| hypothetical protein Gbro_3964 ... 345 5e-93
gi|326331660|ref|ZP_08197948.1| hypothetical protein NBCG_03099 ... 343 2e-92
gi|311744230|ref|ZP_07718034.1| conserved hypothetical protein [... 336 2e-90
gi|326383119|ref|ZP_08204808.1| hypothetical protein SCNU_09281 ... 333 2e-89
gi|145595167|ref|YP_001159464.1| hypothetical protein Strop_2642... 327 1e-87
gi|159038412|ref|YP_001537665.1| hypothetical protein Sare_2839 ... 325 4e-87
gi|319950792|ref|ZP_08024680.1| hypothetical protein ES5_14353 [... 290 2e-76
gi|182440626|ref|YP_001828345.1| hypothetical protein SGR_6833 [... 273 3e-71
gi|326781301|ref|ZP_08240566.1| protein of unknown function DUF3... 271 6e-71
gi|345013190|ref|YP_004815544.1| hypothetical protein Strvi_5753... 271 8e-71
gi|29827785|ref|NP_822419.1| hypothetical protein SAV_1244 [Stre... 271 8e-71
gi|297197503|ref|ZP_06914900.1| conserved hypothetical protein [... 267 2e-69
gi|328880326|emb|CCA53565.1| hypothetical protein SVEN_0278 [Str... 262 4e-68
>gi|167969131|ref|ZP_02551408.1| hypothetical protein MtubH3_14305 [Mycobacterium tuberculosis
H37Ra]
gi|306777871|ref|ZP_07416208.1| hypothetical protein TMAG_00011 [Mycobacterium tuberculosis SUMu001]
gi|306973989|ref|ZP_07486650.1| hypothetical protein TMJG_00766 [Mycobacterium tuberculosis SUMu010]
6 more sequence titles
Length=334
Score = 617 bits (1591), Expect = 8e-175, Method: Compositional matrix adjust.
Identities = 303/303 (100%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 32 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 92 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 151
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 152 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 211
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 212 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 271
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK
Sbjct 272 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 331
Query 301 HHL 303
HHL
Sbjct 332 HHL 334
>gi|15843131|ref|NP_338168.1| hypothetical protein MT3622 [Mycobacterium tuberculosis CDC1551]
gi|148824727|ref|YP_001289481.1| hypothetical protein TBFG_13554 [Mycobacterium tuberculosis F11]
gi|253800562|ref|YP_003033563.1| hypothetical protein TBMG_03560 [Mycobacterium tuberculosis KZN
1435]
32 more sequence titles
Length=334
Score = 615 bits (1585), Expect = 4e-174, Method: Compositional matrix adjust.
Identities = 302/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 32 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 92 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 151
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 152 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 211
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 212 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 271
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 272 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDADYDTYK 331
Query 301 HHL 303
HHL
Sbjct 332 HHL 334
>gi|15610657|ref|NP_218038.1| hypothetical protein Rv3521 [Mycobacterium tuberculosis H37Rv]
gi|148663384|ref|YP_001284907.1| hypothetical protein MRA_3560 [Mycobacterium tuberculosis H37Ra]
gi|2924458|emb|CAA17758.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium tuberculosis H37Rv]
gi|148507536|gb|ABQ75345.1| hypothetical protein MRA_3560 [Mycobacterium tuberculosis H37Ra]
Length=303
Score = 614 bits (1584), Expect = 5e-174, Method: Compositional matrix adjust.
Identities = 302/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 1 MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK
Sbjct 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
Query 301 HHL 303
HHL
Sbjct 301 HHL 303
>gi|289555786|ref|ZP_06444996.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289440418|gb|EFD22911.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=325
Score = 613 bits (1581), Expect = 1e-173, Method: Compositional matrix adjust.
Identities = 302/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 23 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 82
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 83 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 142
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 143 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 202
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 203 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 262
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 263 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDADYDTYK 322
Query 301 HHL 303
HHL
Sbjct 323 HHL 325
>gi|31794697|ref|NP_857190.1| hypothetical protein Mb3551 [Mycobacterium bovis AF2122/97]
gi|121639440|ref|YP_979664.1| hypothetical protein BCG_3585 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224991937|ref|YP_002646626.1| hypothetical protein JTY_3586 [Mycobacterium bovis BCG str. Tokyo
172]
20 more sequence titles
Length=303
Score = 612 bits (1578), Expect = 2e-173, Method: Compositional matrix adjust.
Identities = 301/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 1 MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDADYDTYK 300
Query 301 HHL 303
HHL
Sbjct 301 HHL 303
>gi|340628485|ref|YP_004746937.1| hypothetical protein MCAN_35321 [Mycobacterium canettii CIPT
140010059]
gi|340006675|emb|CCC45863.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=303
Score = 611 bits (1575), Expect = 5e-173, Method: Compositional matrix adjust.
Identities = 300/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 1 MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA
Sbjct 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPT+EFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 181 RTGKTGKVYFPPHGADPATGKPTTEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDADYDTYK 300
Query 301 HHL 303
HHL
Sbjct 301 HHL 303
>gi|289440950|ref|ZP_06430694.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289571760|ref|ZP_06451987.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289752234|ref|ZP_06511612.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289755650|ref|ZP_06515028.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289413869|gb|EFD11109.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289545514|gb|EFD49162.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289692821|gb|EFD60250.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289696237|gb|EFD63666.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=303
Score = 610 bits (1572), Expect = 1e-172, Method: Compositional matrix adjust.
Identities = 300/303 (99%), Positives = 303/303 (100%), Gaps = 0/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ
Sbjct 1 MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT
Sbjct 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTAS+EESAYLRAIAQGKLVGA
Sbjct 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASYEESAYLRAIAQGKLVGA 180
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL
Sbjct 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDADYDTYK 300
Query 301 HHL 303
HHL
Sbjct 301 HHL 303
>gi|296166551|ref|ZP_06848982.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898163|gb|EFG77738.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=348
Score = 523 bits (1346), Expect = 2e-146, Method: Compositional matrix adjust.
Identities = 258/307 (85%), Positives = 274/307 (90%), Gaps = 7/307 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR RRI+GVRGSDGRVHVPP EYDPVTYEPL EMVPVSSVGTV SWTWQ
Sbjct 45 VGPTLSKFFTALRERRILGVRGSDGRVHVPPAEYDPVTYEPLGEMVPVSSVGTVVSWTWQ 104
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPLDRPFAWALIKLDGADT +MHAVD G P AI +GARVH HWAD+PVGAIT
Sbjct 105 PEPLEGQPLDRPFAWALIKLDGADTPMMHAVDAGE--PKAIKSGARVHVHWADEPVGAIT 162
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA F LGE AEPV+ + +DPVTMIVTPI L IQHTASHEESAYLRAIAQGKL+GA
Sbjct 163 DIAYFELGEDAEPVSEQAAGE-QDPVTMIVTPISLTIQHTASHEESAYLRAIAQGKLLGA 221
Query 181 RT----GKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVA 236
RT G+ GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVA
Sbjct 222 RTRGANGEEGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVA 281
Query 237 AYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANY 296
AYVLLDGADIPFLHLV+D+DAH+VRMGMRVEAVWKPRE WG GIDNIEYFRPTGEPDA Y
Sbjct 282 AYVLLDGADIPFLHLVADIDAHEVRMGMRVEAVWKPREEWGFGIDNIEYFRPTGEPDAEY 341
Query 297 DTYKHHL 303
DTYKHHL
Sbjct 342 DTYKHHL 348
>gi|41406642|ref|NP_959478.1| hypothetical protein MAP0544c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394991|gb|AAS02861.1| hypothetical protein MAP_0544c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=344
Score = 521 bits (1343), Expect = 4e-146, Method: Compositional matrix adjust.
Identities = 255/303 (85%), Positives = 273/303 (91%), Gaps = 3/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR RR++GVRGSDGRVHVPP EYDPVTYEPL EMVPVS VGTV SW WQ
Sbjct 45 VGPTLSKFFTALRERRVLGVRGSDGRVHVPPPEYDPVTYEPLGEMVPVSPVGTVVSWAWQ 104
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+P+ GQPLDRPFAWALIKLDGADT L+HAVD G P AI +G RVH HWAD+PVGAIT
Sbjct 105 PDPIEGQPLDRPFAWALIKLDGADTPLLHAVDAGE--PKAIKSGTRVHVHWADEPVGAIT 162
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALGE EPVA D +DPV+MIVTPI L IQHTASHEESAYLRAIAQGKL+GA
Sbjct 163 DIAYFALGEDPEPVAEQPDAD-KDPVSMIVTPISLTIQHTASHEESAYLRAIAQGKLLGA 221
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGK GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 222 RTGKNGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVAAYVL 281
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+DVDAH+VRMGMRVEAVWKPRE+WG GIDNIEYFRPTGEPDA+YDTYK
Sbjct 282 LDGADIPFLHLVADVDAHEVRMGMRVEAVWKPREQWGFGIDNIEYFRPTGEPDADYDTYK 341
Query 301 HHL 303
HHL
Sbjct 342 HHL 344
>gi|118463428|ref|YP_879918.1| hypothetical protein MAV_0638 [Mycobacterium avium 104]
gi|118164715|gb|ABK65612.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336458431|gb|EGO37405.1| putative nucleic-acid-binding protein containing a Zn-ribbon
[Mycobacterium avium subsp. paratuberculosis S397]
Length=322
Score = 521 bits (1341), Expect = 7e-146, Method: Compositional matrix adjust.
Identities = 255/303 (85%), Positives = 273/303 (91%), Gaps = 3/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR RR++GVRGSDGRVHVPP EYDPVTYEPL EMVPVS VGTV SW WQ
Sbjct 23 VGPTLSKFFTALRERRVLGVRGSDGRVHVPPPEYDPVTYEPLGEMVPVSPVGTVVSWAWQ 82
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+P+ GQPLDRPFAWALIKLDGADT L+HAVD G P AI +G RVH HWAD+PVGAIT
Sbjct 83 PDPIEGQPLDRPFAWALIKLDGADTPLLHAVDAGE--PKAIKSGTRVHVHWADEPVGAIT 140
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALGE EPVA D +DPV+MIVTPI L IQHTASHEESAYLRAIAQGKL+GA
Sbjct 141 DIAYFALGEDPEPVAEQPDAD-KDPVSMIVTPISLTIQHTASHEESAYLRAIAQGKLLGA 199
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGK GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 200 RTGKNGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVAAYVL 259
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+DVDAH+VRMGMRVEAVWKPRE+WG GIDNIEYFRPTGEPDA+YDTYK
Sbjct 260 LDGADIPFLHLVADVDAHEVRMGMRVEAVWKPREQWGFGIDNIEYFRPTGEPDADYDTYK 319
Query 301 HHL 303
HHL
Sbjct 320 HHL 322
>gi|254773595|ref|ZP_05215111.1| hypothetical protein MaviaA2_02810 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=331
Score = 519 bits (1336), Expect = 3e-145, Method: Compositional matrix adjust.
Identities = 254/303 (84%), Positives = 272/303 (90%), Gaps = 3/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR RR++GVRGSDGRVHVPP EYDPVTYEPL EMVPVS VGTV SW WQ
Sbjct 32 VGPTLSKFFTALRERRVLGVRGSDGRVHVPPPEYDPVTYEPLGEMVPVSPVGTVVSWAWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+P+ GQPLDRPFAWALIKLDGADT L+HAVD G P AI +G RVH HWAD+PVGAIT
Sbjct 92 PDPIEGQPLDRPFAWALIKLDGADTPLLHAVDAGE--PKAIKSGTRVHVHWADEPVGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALGE EPVA D +DPV MIVTPI L IQHTASHEESAYLRAIAQGKL+GA
Sbjct 150 DIAYFALGEDPEPVAEQPDAD-KDPVGMIVTPISLTIQHTASHEESAYLRAIAQGKLLGA 208
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTGK GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 209 RTGKNGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVAAYVL 268
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+DVDAH+VRMGMRVEAVWKPRE+WG GIDNIEYFRPTGEPDA+Y+TYK
Sbjct 269 LDGADIPFLHLVADVDAHEVRMGMRVEAVWKPREQWGFGIDNIEYFRPTGEPDADYNTYK 328
Query 301 HHL 303
HHL
Sbjct 329 HHL 331
>gi|254822605|ref|ZP_05227606.1| hypothetical protein MintA_21929 [Mycobacterium intracellulare
ATCC 13950]
Length=331
Score = 516 bits (1330), Expect = 1e-144, Method: Compositional matrix adjust.
Identities = 251/303 (83%), Positives = 272/303 (90%), Gaps = 3/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR R ++GVRGSDGRVHVPP EYDPVTYEPL EMVPVSSVGTV SWTWQ
Sbjct 32 VGPTLSKFFTALRDRHVLGVRGSDGRVHVPPPEYDPVTYEPLGEMVPVSSVGTVVSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+ GQPLDRPFAWALIKLDGADT L+HAVD G P AI TG+RVH HW D+PVGAIT
Sbjct 92 PEPIEGQPLDRPFAWALIKLDGADTPLIHAVDAGE--PKAIKTGSRVHVHWVDEPVGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA F LGE AE VA D +DPVTMIVTP+ L IQH+ASHEESAYLRAIAQGKL+GA
Sbjct 150 DIAYFELGEEAEAVAEQSDGD-KDPVTMIVTPVSLTIQHSASHEESAYLRAIAQGKLLGA 208
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
+TG+ GKVYFPPHGADPATG+PT++FVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 209 KTGENGKVYFPPHGADPATGQPTTDFVELPDKGTVTTFAIINIPFQGQRIKPPYVAAYVL 268
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+D+DAH+VRMGMRVEAVWKPRE WG GIDNIEYFRPTGEPDA+YDTYK
Sbjct 269 LDGADIPFLHLVADIDAHEVRMGMRVEAVWKPREEWGFGIDNIEYFRPTGEPDADYDTYK 328
Query 301 HHL 303
HHL
Sbjct 329 HHL 331
>gi|240172353|ref|ZP_04751012.1| hypothetical protein MkanA1_23763 [Mycobacterium kansasii ATCC
12478]
Length=330
Score = 511 bits (1317), Expect = 4e-143, Method: Compositional matrix adjust.
Identities = 250/303 (83%), Positives = 271/303 (90%), Gaps = 4/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L +FFTALR RI+GVRGSDGRVHVPP EYDPVTYEPL+EMVPVS VGTVASWTWQ
Sbjct 32 VGPILGQFFTALRECRILGVRGSDGRVHVPPAEYDPVTYEPLTEMVPVSDVGTVASWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPLDRPFAWALIKLDGADTLLMHAVD G P I TGARVH HWAD+P GAIT
Sbjct 92 PEPLQGQPLDRPFAWALIKLDGADTLLMHAVDAGE--PDKIRTGARVHVHWADEPQGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FA G+ EPV + ++PVTM++TPI++ IQHTASHEESAYLRAIA+GKL+GA
Sbjct 150 DIAYFAPGDEQEPVPEATGD--QEPVTMVITPIEMTIQHTASHEESAYLRAIAEGKLLGA 207
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG+ GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 208 RTGEKGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVAAYVL 267
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+DVDAH+VRMGMRVEAVWKPRE+WG GIDNIEYFRPTGEPDA YDTYK
Sbjct 268 LDGADIPFLHLVADVDAHEVRMGMRVEAVWKPREQWGFGIDNIEYFRPTGEPDAEYDTYK 327
Query 301 HHL 303
HHL
Sbjct 328 HHL 330
>gi|342862255|ref|ZP_08718897.1| hypothetical protein MCOL_25316 [Mycobacterium colombiense CECT
3035]
gi|342130333|gb|EGT83653.1| hypothetical protein MCOL_25316 [Mycobacterium colombiense CECT
3035]
Length=335
Score = 511 bits (1316), Expect = 6e-143, Method: Compositional matrix adjust.
Identities = 251/307 (82%), Positives = 272/307 (89%), Gaps = 7/307 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR R ++GVRGSDGRVHVPP EYDPVTYEPL EMVPVSSVGTV SWTWQ
Sbjct 32 VGPTLSKFFTALRDRHVLGVRGSDGRVHVPPPEYDPVTYEPLGEMVPVSSVGTVVSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+ GQPLDRPFAWALIKLDGADT L+HAVD G P AI TG RVHAHW D+PVGAIT
Sbjct 92 PEPIEGQPLDRPFAWALIKLDGADTPLIHAVDAGE--PKAIKTGTRVHAHWVDEPVGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALG+ AE V D +DPVTMIVTP+ L IQH+ASHEESAYLRAIAQGKL+GA
Sbjct 150 DIAYFALGDEAETVTEQSDGD-KDPVTMIVTPVSLTIQHSASHEESAYLRAIAQGKLLGA 208
Query 181 RT----GKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVA 236
+T G+ GKVYFPPHGADPATG+PT+EFVELPDKGTVTTFAI+NIPF GQRIKPPYVA
Sbjct 209 KTMSVSGEKGKVYFPPHGADPATGQPTTEFVELPDKGTVTTFAIINIPFQGQRIKPPYVA 268
Query 237 AYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANY 296
AYVLLDG+DIPFLHLV+D+DAH+VRMGMRVEAVWKPRE WG GIDNIEYFRPTGEPDA+Y
Sbjct 269 AYVLLDGSDIPFLHLVADIDAHEVRMGMRVEAVWKPREEWGFGIDNIEYFRPTGEPDADY 328
Query 297 DTYKHHL 303
DTYKHHL
Sbjct 329 DTYKHHL 335
>gi|118619268|ref|YP_907600.1| hypothetical protein MUL_4080 [Mycobacterium ulcerans Agy99]
gi|118571378|gb|ABL06129.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=329
Score = 510 bits (1313), Expect = 1e-142, Method: Compositional matrix adjust.
Identities = 251/303 (83%), Positives = 268/303 (89%), Gaps = 5/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTL FFTALR RRI+GVRGSDGRVHVPP EYDPVTYEPL EMVPVS+VGTVASWTWQ
Sbjct 32 VGPTLGEFFTALRERRILGVRGSDGRVHVPPAEYDPVTYEPLGEMVPVSAVGTVASWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPLDRPFAWALIKLDGADT L+HAVD G GP I TGARVH HWAD+ VGAIT
Sbjct 92 PEPLPGQPLDRPFAWALIKLDGADTPLLHAVDAG--GPDKIKTGARVHVHWADETVGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA F LGE AEP +D PV MIVTP+ L IQHTASHEESAYLRAIA+GKL+GA
Sbjct 150 DIAYFVLGEDAEPPGEPSEQD---PVKMIVTPVSLTIQHTASHEESAYLRAIAEGKLLGA 206
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG GKVYFPPHGADPATG+PT+EFVELPD+GTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 207 RTGAKGKVYFPPHGADPATGQPTTEFVELPDQGTVTTFAIINIPFQGQRIKPPYVAAYVL 266
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+D+DA++VRMGMRVEAVWKPRE WG GIDNIEYFRPTGEPDA+YDTYK
Sbjct 267 LDGADIPFLHLVADIDANEVRMGMRVEAVWKPREEWGFGIDNIEYFRPTGEPDADYDTYK 326
Query 301 HHL 303
HHL
Sbjct 327 HHL 329
>gi|183984974|ref|YP_001853265.1| hypothetical protein MMAR_5006 [Mycobacterium marinum M]
gi|183178300|gb|ACC43410.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=329
Score = 509 bits (1311), Expect = 2e-142, Method: Compositional matrix adjust.
Identities = 251/303 (83%), Positives = 268/303 (89%), Gaps = 5/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTL FFTALR RRI+GVRGSDGRVHVPP EYDPVTYEPL EMVPVS+VGTVASWTWQ
Sbjct 32 VGPTLGEFFTALRERRILGVRGSDGRVHVPPAEYDPVTYEPLGEMVPVSAVGTVASWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPLDRPFAWALIKLDGADT L+HAVD G GP I TGARVH HWAD+ VGAIT
Sbjct 92 PEPLPGQPLDRPFAWALIKLDGADTPLLHAVDAG--GPDKIKTGARVHVHWADETVGAIT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA F LGE AEP +D PV MIVTP+ L IQHTASHEESAYLRAIA+GKL+GA
Sbjct 150 DIAYFKLGEDAEPPGEPSEQD---PVKMIVTPVSLTIQHTASHEESAYLRAIAEGKLLGA 206
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG GKVYFPPHGADPATG+PT+EFVELPD+GTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 207 RTGAKGKVYFPPHGADPATGQPTTEFVELPDQGTVTTFAIINIPFQGQRIKPPYVAAYVL 266
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+D+DA++VRMGMRVEAVWKPRE WG GIDNIEYFRPTGEPDA+YDTYK
Sbjct 267 LDGADIPFLHLVADIDANEVRMGMRVEAVWKPREEWGFGIDNIEYFRPTGEPDADYDTYK 326
Query 301 HHL 303
HHL
Sbjct 327 HHL 329
>gi|126437580|ref|YP_001073271.1| hypothetical protein Mjls_5016 [Mycobacterium sp. JLS]
gi|126237380|gb|ABO00781.1| protein of unknown function DUF35 [Mycobacterium sp. JLS]
Length=338
Score = 477 bits (1227), Expect = 1e-132, Method: Compositional matrix adjust.
Identities = 234/309 (76%), Positives = 260/309 (85%), Gaps = 8/309 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L +FFTALR +RIVGVRGSDGRVHVPP EYDPVTYE L+E+VPV+ VGTV SWTWQ
Sbjct 32 VGPLLGQFFTALRDKRIVGVRGSDGRVHVPPAEYDPVTYERLTEIVPVAGVGTVVSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P PL GQPLDRPFAWALIKLDGADT ++HAVD G A AI G RVHAHW D+PVGAIT
Sbjct 92 PAPLEGQPLDRPFAWALIKLDGADTPMLHAVDAGDA--DAISAGTRVHAHWVDEPVGAIT 149
Query 121 DIACFALGETAEPVA------AHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQ 174
DIA FALGE AEP + +DPVTM+VTP +EIQHTAS ES +LRA+ +
Sbjct 150 DIAFFALGEDAEPEGKPSDPRTRGAQTDKDPVTMLVTPSSIEIQHTASAPESTFLRALEE 209
Query 175 GKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPY 234
GKL+GARTGK GK+YFPP ADPATG+PT+EFVELPD+GTVTTFAI+NIPF GQRIKPPY
Sbjct 210 GKLLGARTGKDGKLYFPPREADPATGRPTTEFVELPDRGTVTTFAIINIPFAGQRIKPPY 269
Query 235 VAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA 294
VAAYVLLDGADIPFLHLV++++A QVRMGMRVEAVWKPRE WGLGIDNI +FRPTGEPDA
Sbjct 270 VAAYVLLDGADIPFLHLVTEIEADQVRMGMRVEAVWKPREEWGLGIDNISHFRPTGEPDA 329
Query 295 NYDTYKHHL 303
YDTYKHHL
Sbjct 330 EYDTYKHHL 338
>gi|108801597|ref|YP_641794.1| hypothetical protein Mmcs_4634 [Mycobacterium sp. MCS]
gi|119870751|ref|YP_940703.1| hypothetical protein Mkms_4722 [Mycobacterium sp. KMS]
gi|108772016|gb|ABG10738.1| protein of unknown function DUF35 [Mycobacterium sp. MCS]
gi|119696840|gb|ABL93913.1| protein of unknown function DUF35 [Mycobacterium sp. KMS]
Length=338
Score = 474 bits (1220), Expect = 8e-132, Method: Compositional matrix adjust.
Identities = 232/309 (76%), Positives = 258/309 (84%), Gaps = 8/309 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L +FFTALR +RIVGVRGSDGRVHVPP EYDPVTYE L+E+VPV+ VGTV SWTWQ
Sbjct 32 VGPLLGQFFTALREKRIVGVRGSDGRVHVPPAEYDPVTYERLTEIVPVAGVGTVVSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P PL GQPLDRPFAWALIKLDGADT ++H VD G A I G RVHAHW D+PVGAIT
Sbjct 92 PAPLEGQPLDRPFAWALIKLDGADTPMLHTVDAGDA--DKISAGTRVHAHWVDEPVGAIT 149
Query 121 DIACFALGETAEPVA------AHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQ 174
DIA FALGE AEP + +DPVTM+VTP +EIQHTAS ES +LRA+ +
Sbjct 150 DIAYFALGEDAEPEGEPSDPRTRGAQTDKDPVTMLVTPSSIEIQHTASAPESTFLRALEE 209
Query 175 GKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPY 234
GKL+GARTGK GK+YFPP ADPATG+PT+EFVELPD+GTVTTFAI+NIPF GQRIKPPY
Sbjct 210 GKLLGARTGKDGKLYFPPREADPATGRPTTEFVELPDRGTVTTFAIINIPFAGQRIKPPY 269
Query 235 VAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA 294
VAAYVLLDGADIPFLHLV++++A QVRMGMRVEAVWKPRE WGLGIDNI +FRPTGEPDA
Sbjct 270 VAAYVLLDGADIPFLHLVTEIEADQVRMGMRVEAVWKPREEWGLGIDNISHFRPTGEPDA 329
Query 295 NYDTYKHHL 303
YDTYKHHL
Sbjct 330 EYDTYKHHL 338
>gi|145222130|ref|YP_001132808.1| hypothetical protein Mflv_1538 [Mycobacterium gilvum PYR-GCK]
gi|145214616|gb|ABP44020.1| protein of unknown function DUF35 [Mycobacterium gilvum PYR-GCK]
Length=332
Score = 473 bits (1216), Expect = 2e-131, Method: Compositional matrix adjust.
Identities = 231/303 (77%), Positives = 259/303 (86%), Gaps = 2/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L FFTALR RRIVGVRGSDG+V VPP EYDPVT+E L+E+VPV+SVGTV SWTWQ
Sbjct 32 VGPLLGEFFTALRERRIVGVRGSDGKVLVPPAEYDPVTWEQLTEIVPVASVGTVLSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+PL GQPLDRPFAWALIKLDGADT L+HAVD G AG + I TGARVHAHW D+PVGAIT
Sbjct 92 PQPLPGQPLDRPFAWALIKLDGADTPLLHAVDTGAAGSAGISTGARVHAHWVDEPVGAIT 151
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALG+ AE V + DPVTM+V+P +EIQHTAS ES +LRA+ QGKL+GA
Sbjct 152 DIAYFALGDEAEDVP--PAPEGLDPVTMVVSPSAIEIQHTASLPESTFLRALEQGKLLGA 209
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
R+G+TGKVYFPP ADPATG + FVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 210 RSGETGKVYFPPKEADPATGLELNNFVELPDKGTVTTFAIINIPFAGQRIKPPYVAAYVL 269
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+++D +VRMGMRV+AVWKP E WGLGIDNI+YFRPTGEPDA+YDTYK
Sbjct 270 LDGADIPFLHLVTEIDPSEVRMGMRVQAVWKPEEEWGLGIDNIDYFRPTGEPDADYDTYK 329
Query 301 HHL 303
HHL
Sbjct 330 HHL 332
>gi|315442569|ref|YP_004075448.1| nucleic-acid-binding protein containing a Zn-ribbon [Mycobacterium
sp. Spyr1]
gi|315260872|gb|ADT97613.1| predicted nucleic-acid-binding protein containing a Zn-ribbon
[Mycobacterium sp. Spyr1]
Length=324
Score = 468 bits (1205), Expect = 4e-130, Method: Compositional matrix adjust.
Identities = 230/303 (76%), Positives = 258/303 (86%), Gaps = 2/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L FFTALR RRIVGVRGSDG+V VPP EYDPVT+E L+E+VPV+SVGTV SWTWQ
Sbjct 24 VGPLLGDFFTALRERRIVGVRGSDGKVLVPPAEYDPVTWEQLTEIVPVASVGTVLSWTWQ 83
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+PL GQPLDRPFAWALIKLDGADT L+HAVD G AG + I TGARVHAHW D+PVGAIT
Sbjct 84 PQPLPGQPLDRPFAWALIKLDGADTPLLHAVDTGAAGSAGISTGARVHAHWVDEPVGAIT 143
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALG+ AE V + DPVTM+V+P +EIQHTAS ES +LRA+ QG L+GA
Sbjct 144 DIAYFALGDEAEDVP--PAPEGLDPVTMVVSPSAIEIQHTASLPESTFLRALEQGTLLGA 201
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
R+G+TGKVYFPP ADPATG + FVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 202 RSGETGKVYFPPKEADPATGLELNNFVELPDKGTVTTFAIINIPFAGQRIKPPYVAAYVL 261
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+++D +VRMGMRV+AVWKP E WGLGIDNI+YFRPTGEPDA+YDTYK
Sbjct 262 LDGADIPFLHLVTEIDPSEVRMGMRVQAVWKPEEEWGLGIDNIDYFRPTGEPDADYDTYK 321
Query 301 HHL 303
HHL
Sbjct 322 HHL 324
>gi|120406168|ref|YP_955997.1| hypothetical protein Mvan_5220 [Mycobacterium vanbaalenii PYR-1]
gi|119958986|gb|ABM15991.1| protein of unknown function DUF35 [Mycobacterium vanbaalenii
PYR-1]
Length=330
Score = 465 bits (1196), Expect = 4e-129, Method: Compositional matrix adjust.
Identities = 231/303 (77%), Positives = 254/303 (84%), Gaps = 4/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGP L FFTALR RRIVGVRGSDG+VHVPP EYDPVT+E LSE+VPV+SVGTV SWTWQ
Sbjct 32 VGPLLGEFFTALRERRIVGVRGSDGKVHVPPAEYDPVTWEQLSEIVPVASVGTVQSWTWQ 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPLDRPFAWALIKLDGADT L+HAVD G++ AI TG RVHAHW D+PVGA+T
Sbjct 92 PEPLEGQPLDRPFAWALIKLDGADTPLLHAVDAGSS--DAISTGTRVHAHWVDEPVGAVT 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALG+ E V + DPVTMIV P +EIQHTAS ESA+LRA+ QGKL+G
Sbjct 150 DIAYFALGDQPEDVP--PAPEGLDPVTMIVVPTSIEIQHTASRPESAFLRALEQGKLLGN 207
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG GKVYFP ADPATG E+VEL DKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 208 RTGADGKVYFPAREADPATGVQLDEYVELSDKGTVTTFAIINIPFAGQRIKPPYVAAYVL 267
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIP LHLVSD+DA +VRMGMRV+AVWKP ++WGLGIDNIEYFRPTGEPDA+YDTYK
Sbjct 268 LDGADIPVLHLVSDIDADKVRMGMRVQAVWKPEDQWGLGIDNIEYFRPTGEPDADYDTYK 327
Query 301 HHL 303
HHL
Sbjct 328 HHL 330
>gi|333992300|ref|YP_004524914.1| hypothetical protein JDM601_3660 [Mycobacterium sp. JDM601]
gi|333488268|gb|AEF37660.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=330
Score = 462 bits (1188), Expect = 4e-128, Method: Compositional matrix adjust.
Identities = 225/302 (75%), Positives = 253/302 (84%), Gaps = 4/302 (1%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GP L +FFTALR RRIVGVRGSDGRV+VPP EYDPVTYE L+E+VPV+SVGTV SW+WQP
Sbjct 33 GPVLGQFFTALRERRIVGVRGSDGRVYVPPAEYDPVTYEQLTEIVPVASVGTVVSWSWQP 92
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
EPL GQPLD PFAWALIKLDGAD L+HAV GP AI G RVH HWA++ VGAITD
Sbjct 93 EPLEGQPLDTPFAWALIKLDGADVPLLHAV--AAEGPKAISAGTRVHVHWAEETVGAITD 150
Query 122 IACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGAR 181
IA FA+GE EPV + D RDPV+M++TPI LEIQH+ASH ESAYLRA +GKL+GAR
Sbjct 151 IAYFAIGEDPEPV--EQRSDDRDPVSMVITPIALEIQHSASHPESAYLRAFKEGKLLGAR 208
Query 182 TGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVLL 241
TG GKVYFP ADPATG+ +++VELPD GT+TTFAI+NIPF GQ+IKPPYVAAYVLL
Sbjct 209 TGTDGKVYFPAREADPATGRQLTDYVELPDTGTITTFAIINIPFQGQKIKPPYVAAYVLL 268
Query 242 DGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYKH 301
DGADIPFL LVSDVDA VRMGMRV+AVWKPRE W G++NIEYFRPTGEPDA+YDTYKH
Sbjct 269 DGADIPFLTLVSDVDAADVRMGMRVQAVWKPREEWTYGMENIEYFRPTGEPDADYDTYKH 328
Query 302 HL 303
HL
Sbjct 329 HL 330
>gi|118470277|ref|YP_890147.1| hypothetical protein MSMEG_5921 [Mycobacterium smegmatis str.
MC2 155]
gi|118171564|gb|ABK72460.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=300
Score = 460 bits (1183), Expect = 1e-127, Method: Compositional matrix adjust.
Identities = 229/303 (76%), Positives = 255/303 (85%), Gaps = 8/303 (2%)
Query 5 LSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQPEPL 64
LS+FFTALR RRIVGVRGSDGRVHVPP EYDPVTYEPL+E+VPV+ VGTV SWTWQPEPL
Sbjct 2 LSQFFTALRDRRIVGVRGSDGRVHVPPAEYDPVTYEPLTEVVPVAGVGTVVSWTWQPEPL 61
Query 65 AGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITDIAC 124
GQPLDRPFAWALIKLDGADT L+HAV A ++ TG RVHAHW D+P GAITDIA
Sbjct 62 EGQPLDRPFAWALIKLDGADTALLHAV---AAEEGSVSTGMRVHAHWVDEPAGAITDIAY 118
Query 125 FALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGART-- 182
F G+T EPVA D RDPVTM+V P +EIQH+AS ES YLR++ +GKLVGART
Sbjct 119 FLPGDTPEPVA-DAPADERDPVTMLVVPSSIEIQHSASLPESTYLRSLREGKLVGARTVG 177
Query 183 --GKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
G+ GKVYFPP ADPATG +EFVELPDKGTVTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 178 PNGEKGKVYFPPKEADPATGLELNEFVELPDKGTVTTFAIINIPFAGQRIKPPYVAAYVL 237
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV+D+DA +VRMGMRVEAVWKP++ WGLGIDNI +FRPTGEPDA+YD+YK
Sbjct 238 LDGADIPFLHLVTDIDASEVRMGMRVEAVWKPKDEWGLGIDNISHFRPTGEPDADYDSYK 297
Query 301 HHL 303
HHL
Sbjct 298 HHL 300
>gi|169631245|ref|YP_001704894.1| hypothetical protein MAB_4167 [Mycobacterium abscessus ATCC 19977]
gi|169243212|emb|CAM64240.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=329
Score = 457 bits (1177), Expect = 8e-127, Method: Compositional matrix adjust.
Identities = 222/303 (74%), Positives = 256/303 (85%), Gaps = 2/303 (0%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPTLS+FFTALR R+IVG RGSDG++HVP EYDPVTY PL+++VPVSSVGTV SW+WQ
Sbjct 29 VGPTLSKFFTALRDRQIVGTRGSDGKIHVPAAEYDPVTYAPLTDVVPVSSVGTVQSWSWQ 88
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEPL GQPL +PFAWALIKLDGADT L+HAVDVGTAG + I TGARVHA WAD+ VGAIT
Sbjct 89 PEPLEGQPLAKPFAWALIKLDGADTSLLHAVDVGTAGSAGITTGARVHAVWADETVGAIT 148
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FALGE A ++ ++PVTM VTPI+LE+QH S EESAYLRA+++GKL+G
Sbjct 149 DIAYFALGEKTAATPAPTSD--QEPVTMQVTPIRLEVQHITSPEESAYLRALSEGKLLGG 206
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG G+VYFP GADP TG+PTS+ V++ DKG VTTFAI+NIPF GQRIKPPYVAAYVL
Sbjct 207 RTGAGGRVYFPARGADPLTGEPTSDLVQVADKGVVTTFAIINIPFPGQRIKPPYVAAYVL 266
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHLV D+D VRMGMRVEAVWKP+E WG GIDNI+YFRPTGEPDA+Y+TYK
Sbjct 267 LDGADIPFLHLVYDIDPADVRMGMRVEAVWKPKEEWGYGIDNIQYFRPTGEPDADYETYK 326
Query 301 HHL 303
+
Sbjct 327 DRV 329
>gi|111021657|ref|YP_704629.1| hypothetical protein RHA1_ro04685 [Rhodococcus jostii RHA1]
gi|110821187|gb|ABG96471.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=321
Score = 408 bits (1049), Expect = 4e-112, Method: Compositional matrix adjust.
Identities = 203/304 (67%), Positives = 232/304 (77%), Gaps = 7/304 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPT+ F TALR R+++G RGSDGRV+VPP E+DP T +PL++ V VS GTV SWTW
Sbjct 24 VGPTIGAFVTALRDRKVIGARGSDGRVYVPPPEFDPNTADPLTDFVGVSDAGTVVSWTWM 83
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+AGQPL PFAWALI LDGADT L+HAVDVG+ P+A+ TG RV A WA + VG I
Sbjct 84 PEPIAGQPLTTPFAWALITLDGADTSLVHAVDVGS--PAAMSTGMRVRARWAQERVGRIQ 141
Query 121 DIACFALGETA-EPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVG 179
DI CF GE+A EP ++E PVTM+ TPI L+ H+AS EES YLR + GKL+G
Sbjct 142 DIVCFEPGESAGEPEPTTESE----PVTMVTTPIDLDYMHSASAEESYYLRGLKAGKLIG 197
Query 180 ARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYV 239
RTG GKVY PP A+P G PT E VELPD+G VTTF IVN+PFLGQRIKPPYVAAYV
Sbjct 198 GRTGPDGKVYIPPRSANPTDGIPTKEQVELPDRGIVTTFCIVNVPFLGQRIKPPYVAAYV 257
Query 240 LLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTY 299
LLDGADI FLHL+ D DA VRMGMRVEA WKPRE WG ++NIEYFRPTGEPDA YDTY
Sbjct 258 LLDGADIAFLHLILDCDATDVRMGMRVEAKWKPREEWGYTLENIEYFRPTGEPDAEYDTY 317
Query 300 KHHL 303
KHHL
Sbjct 318 KHHL 321
>gi|54022490|ref|YP_116732.1| hypothetical protein nfa5230 [Nocardia farcinica IFM 10152]
gi|54013998|dbj|BAD55368.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=322
Score = 407 bits (1046), Expect = 1e-111, Method: Compositional matrix adjust.
Identities = 199/303 (66%), Positives = 237/303 (79%), Gaps = 4/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPT+ RF T LRAR+IVGVRGSDGRV VPP EYDPVT E L+E V V+ GTV++WTW
Sbjct 24 VGPTIGRFLTGLRARKIVGVRGSDGRVLVPPPEYDPVTSEALTEFVDVADTGTVSTWTWV 83
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
+PL GQP DRPFAWALI LDGAD+ L+HAVDV + P + TG RV A WA+Q G I
Sbjct 84 RDPLPGQPFDRPFAWALITLDGADSALLHAVDVDS--PDQMRTGMRVRARWAEQTEGFIK 141
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DI CF GET+ AA D +PVTMI TP+ L +HTAS +E+ YLR +A+GKL+GA
Sbjct 142 DIVCFEPGETSTAPAAPV--DEGEPVTMITTPVDLSYKHTASPQETVYLRGLAEGKLIGA 199
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RT GKVYFPP GA+P G+PT +++EL D GTVTTF IVN+PFLGQRIKPPYVAAYVL
Sbjct 200 RTDAAGKVYFPPRGANPTDGRPTEDYIELSDHGTVTTFCIVNVPFLGQRIKPPYVAAYVL 259
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIP LHLV DA +VRMGMRV+AVWKPRE+WG G++N+++F P+GEPDA+Y+TYK
Sbjct 260 LDGADIPVLHLVLGCDASEVRMGMRVKAVWKPREQWGHGLENVDHFEPSGEPDADYETYK 319
Query 301 HHL 303
HHL
Sbjct 320 HHL 322
>gi|226304421|ref|YP_002764379.1| hypothetical protein RER_09320 [Rhodococcus erythropolis PR4]
gi|229494164|ref|ZP_04387927.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|226183536|dbj|BAH31640.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
gi|229318526|gb|EEN84384.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=318
Score = 406 bits (1044), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 195/302 (65%), Positives = 230/302 (77%), Gaps = 4/302 (1%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GPT+ F TALR R+++G RGS+GRV VPP E+DP T EPL++ V VS GTV SWTW P
Sbjct 21 GPTIGAFVTALRDRKVIGARGSNGRVFVPPPEFDPDTAEPLTDFVGVSDGGTVVSWTWMP 80
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
EP+ GQPL +PFAWALIKLDGADT ++HAVDV + P I TG RV A WA + +G I D
Sbjct 81 EPIEGQPLTKPFAWALIKLDGADTSMLHAVDVDS--PDDISTGLRVRARWASERIGQIKD 138
Query 122 IACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGAR 181
I CF GE+ +A + + DPVTMI TP+ L H+AS EES YLR +A+GKL+G R
Sbjct 139 IECFEPGESENGIAT--VDSSADPVTMITTPVDLHFMHSASAEESFYLRGLAEGKLIGGR 196
Query 182 TGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVLL 241
+G K+Y PP GA+P GKPTSE +ELPDKG VTTF IVN+PFLGQRIKPPYVAAYVLL
Sbjct 197 SGPEDKIYIPPRGANPTNGKPTSEQIELPDKGIVTTFCIVNVPFLGQRIKPPYVAAYVLL 256
Query 242 DGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYKH 301
DGADIPFLHL+ + DA VRMGMRVEA WKPRE WG ++NIEYFRPTGEPDA Y T++H
Sbjct 257 DGADIPFLHLILECDAADVRMGMRVEAKWKPREEWGYTLENIEYFRPTGEPDAEYSTFQH 316
Query 302 HL 303
HL
Sbjct 317 HL 318
>gi|226364194|ref|YP_002781976.1| hypothetical protein ROP_47840 [Rhodococcus opacus B4]
gi|226242683|dbj|BAH53031.1| hypothetical protein [Rhodococcus opacus B4]
Length=328
Score = 404 bits (1039), Expect = 6e-111, Method: Compositional matrix adjust.
Identities = 199/303 (66%), Positives = 227/303 (75%), Gaps = 5/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPT+ F TALR R+++G RGSDGRV+VPP E+DP T EPL++ V VS GTV SWTW
Sbjct 31 VGPTIGAFVTALRDRKVIGARGSDGRVYVPPPEFDPTTAEPLTDFVGVSDAGTVVSWTWM 90
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+AGQPL PFAWALIKLDGADT ++HAVDV + P+ + TG RV A WA + G I
Sbjct 91 PEPIAGQPLTSPFAWALIKLDGADTSMVHAVDVPS--PAGMSTGMRVRARWAQERAGHIQ 148
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DI CF GE+A A + +PVTMI TP+ L+ H+AS EES YLR + GKL+G
Sbjct 149 DIVCFEPGESA---GAPEPSTESEPVTMITTPVDLDYMHSASAEESYYLRGLKAGKLIGG 205
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RTG GKVY PP A+P G PT E VELPD G VTTF IVN+PFLGQRIKPPYVAAYVL
Sbjct 206 RTGPGGKVYIPPRSANPTDGIPTKEQVELPDTGIVTTFCIVNVPFLGQRIKPPYVAAYVL 265
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADI FLHL+ D DA VRMGMRVEA WKPRE WG ++NIEYFRPTGEPDA YDTYK
Sbjct 266 LDGADIAFLHLILDCDAADVRMGMRVEAKWKPREEWGYTLENIEYFRPTGEPDAEYDTYK 325
Query 301 HHL 303
HHL
Sbjct 326 HHL 328
>gi|312138195|ref|YP_004005531.1| hypothetical protein REQ_07290 [Rhodococcus equi 103S]
gi|311887534|emb|CBH46846.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=318
Score = 399 bits (1026), Expect = 2e-109, Method: Compositional matrix adjust.
Identities = 192/303 (64%), Positives = 229/303 (76%), Gaps = 4/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
VGPT+ F TALR R+++G RGSDGRVHVPP E+DP T+EP+++ V VS GTV SW+W
Sbjct 20 VGPTIGAFVTALRDRKVIGARGSDGRVHVPPPEFDPATHEPMTDFVDVSDTGTVVSWSWM 79
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+ GQPL PFAWAL+KLDGADT ++HAVD G+ P A+ TG RV WAD+ G I
Sbjct 80 PEPIEGQPLSHPFAWALVKLDGADTSILHAVDAGS--PEAMSTGMRVRVRWADERTGRIQ 137
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACF GE+ A T DPVT IVTPI L HTAS EE+ YLR + +GK++G
Sbjct 138 DIACFEPGESDTDSTA--TVSTGDPVTDIVTPIDLHYTHTASFEETYYLRGLMEGKIIGG 195
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
RT GKVY PP GA+P G PT E VE+ DKGT+TTF IVN+PFLGQ+IKPPYVAAYVL
Sbjct 196 RTDANGKVYVPPRGANPTDGMPTKEQVEVSDKGTITTFCIVNVPFLGQQIKPPYVAAYVL 255
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIPFLHL+ DVDA +VRMGMRVEAVW+P E W + N+ +FRP+GEPDA+YD+YK
Sbjct 256 LDGADIPFLHLILDVDAAEVRMGMRVEAVWRPEEEWEYSLRNVSHFRPSGEPDADYDSYK 315
Query 301 HHL 303
HHL
Sbjct 316 HHL 318
>gi|325674900|ref|ZP_08154587.1| hypothetical protein HMPREF0724_12369 [Rhodococcus equi ATCC
33707]
gi|325554486|gb|EGD24161.1| hypothetical protein HMPREF0724_12369 [Rhodococcus equi ATCC
33707]
Length=291
Score = 390 bits (1003), Expect = 1e-106, Method: Compositional matrix adjust.
Identities = 187/295 (64%), Positives = 224/295 (76%), Gaps = 4/295 (1%)
Query 9 FTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQPEPLAGQP 68
TALR R+++G RGSDGRVHVPP E+DP T+EP+++ V VS GTV SW+W PEP+ GQP
Sbjct 1 MTALRDRKVIGARGSDGRVHVPPPEFDPATHEPMTDFVDVSDTGTVVSWSWMPEPIEGQP 60
Query 69 LDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITDIACFALG 128
L PFAWAL+KLDGADT ++HAVD G+ P A+ TG RV WAD+ G I DIACF G
Sbjct 61 LSHPFAWALVKLDGADTSILHAVDAGS--PGAMSTGMRVRVRWADERTGRIQDIACFEPG 118
Query 129 ETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGARTGKTGKV 188
E+ A T DPVT IVTPI L HTAS EE+ YLR + +GK++G RT GKV
Sbjct 119 ESDTDSTA--TVSTGDPVTDIVTPIDLHYTHTASFEETYYLRGLMEGKIIGGRTDANGKV 176
Query 189 YFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVLLDGADIPF 248
Y PP GA+P G PT E VE+ DKGT+TTF IVN+PFLGQ+IKPPYVAAYVLLDGADIPF
Sbjct 177 YVPPRGANPTDGMPTKEQVEVSDKGTITTFCIVNVPFLGQQIKPPYVAAYVLLDGADIPF 236
Query 249 LHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYKHHL 303
LHL+ DVDA +VRMGMRVEAVW+P+E W + N+ +FRP+GEPDA+YD+YKHHL
Sbjct 237 LHLILDVDAAEVRMGMRVEAVWRPKEEWEYSLRNVSHFRPSGEPDADYDSYKHHL 291
>gi|312191035|gb|ADQ43400.1| hypothetical protein ro04685 [Rhodococcus rhodochrous]
Length=323
Score = 382 bits (981), Expect = 4e-104, Method: Compositional matrix adjust.
Identities = 193/302 (64%), Positives = 220/302 (73%), Gaps = 6/302 (1%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GPT+ F T LR RI+G RGSDGRV VPP E+D VT+EPL++ V V GTV SWTW
Sbjct 28 GPTVGAFVTGLRDGRILGARGSDGRVLVPPPEFDAVTHEPLTDFVEVGQTGTVVSWTWNA 87
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
EPL GQP DRPFAWALI+LDGADT L+HAVDV A P I TG RV WA + G I D
Sbjct 88 EPLPGQPFDRPFAWALIRLDGADTTLLHAVDV--ASPDEIGTGLRVRVRWAAERTGKIHD 145
Query 122 IACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGAR 181
I F GE V A E PVTMI TP+ L +H+AS EES YLR + +GK++G R
Sbjct 146 IEAFEPGEATLTVQAADGE----PVTMITTPVDLHYRHSASPEESWYLRGLKEGKIIGGR 201
Query 182 TGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVLL 241
TG GKVY PP GA P G PT E VE+PDKG VTTF IVN+PF+GQ+IKPPYVAAYVLL
Sbjct 202 TGPGGKVYVPPRGASPTDGVPTKEPVEVPDKGIVTTFCIVNVPFMGQQIKPPYVAAYVLL 261
Query 242 DGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYKH 301
DGADIPFLHL+ + DA +VRMGMRVEA W+PRE W + NIEYFRPTGEPDA+YDT+KH
Sbjct 262 DGADIPFLHLILECDASEVRMGMRVEAKWRPREEWDHTLRNIEYFRPTGEPDADYDTFKH 321
Query 302 HL 303
HL
Sbjct 322 HL 323
>gi|300784463|ref|YP_003764754.1| hypothetical protein AMED_2557 [Amycolatopsis mediterranei U32]
gi|299793977|gb|ADJ44352.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340525884|gb|AEK41089.1| hypothetical protein RAM_12995 [Amycolatopsis mediterranei S699]
Length=325
Score = 375 bits (963), Expect = 5e-102, Method: Compositional matrix adjust.
Identities = 190/309 (62%), Positives = 222/309 (72%), Gaps = 11/309 (3%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GP L RF ALR RRI G+RGSDGRVHVPPVEYDPVT E LSE VPV+ GTV SW+W P
Sbjct 21 GPVLGRFVNALRDRRIEGIRGSDGRVHVPPVEYDPVTAEQLSEFVPVAEEGTVVSWSWCP 80
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
PL GQPL+RPFAWAL+KLDGADT ++HAVD G P IH+G RV WAD+ VG I D
Sbjct 81 RPLDGQPLNRPFAWALVKLDGADTPMLHAVDAGE--PGNIHSGQRVRVRWADEVVGHIRD 138
Query 122 IACFALGETAE-------PVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQ 174
IA F + + P A + E A PV++++TP+ L+ H+AS EES YLR +A+
Sbjct 139 IAYFLPVDAEDTTPTQPAPPVADREEGA--PVSVVITPVHLKYLHSASPEESTYLRGLAE 196
Query 175 GKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPY 234
GKL+G R GKVY PP GA P G PT+E VELPD G VTTF IVN+PFLGQRIKPPY
Sbjct 197 GKLIGQRCPACGKVYIPPRGACPTDGVPTTEEVELPDTGIVTTFCIVNVPFLGQRIKPPY 256
Query 235 VAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA 294
VAAY+LLDGADI FLHLV A V+MGMRV A WKPR+ W ++NI +F PTGEPDA
Sbjct 257 VAAYILLDGADIAFLHLVLGCAAEDVKMGMRVRAAWKPRDEWWTSLENISHFEPTGEPDA 316
Query 295 NYDTYKHHL 303
Y+T+ HHL
Sbjct 317 AYETFAHHL 325
>gi|302525688|ref|ZP_07278030.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302434583|gb|EFL06399.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=327
Score = 372 bits (956), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 186/309 (61%), Positives = 222/309 (72%), Gaps = 11/309 (3%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GP L RF ALR RRI GVRGSDGRVHVPPVEYDP T +PL+E VPV + GTV SW+W
Sbjct 23 GPVLGRFVNALRERRIEGVRGSDGRVHVPPVEYDPATADPLTEFVPVGTEGTVVSWSWCA 82
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
+PL GQPL RPFAW L+KLDGADT L+HA+D G+ P +H G RV WA + VG I D
Sbjct 83 DPLDGQPLSRPFAWVLVKLDGADTSLLHALDAGS--PDNVHIGQRVRVRWAGETVGHIRD 140
Query 122 IACFALGET-------AEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQ 174
IA F + A P A + E A PV++I+TP+ L+ QH+AS EES YLR +A+
Sbjct 141 IAYFLPADAPDTTPTEAPPPVAEREEGA--PVSVIITPVHLKYQHSASPEESRYLRGLAE 198
Query 175 GKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPY 234
G+++G R + GKVY PP GA P G PT++ VELPD G VTTF IVN+PFLGQRIKPPY
Sbjct 199 GRMLGQRCPECGKVYIPPRGACPVDGVPTTDEVELPDTGIVTTFCIVNVPFLGQRIKPPY 258
Query 235 VAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDA 294
VAAY+LLDGADI FLHLV A +VRMGMRV A W+PRE W ++NI +F PTGEPDA
Sbjct 259 VAAYILLDGADIAFLHLVLGCAAEEVRMGMRVRASWRPREEWWTSLENISHFEPTGEPDA 318
Query 295 NYDTYKHHL 303
Y+T+ HHL
Sbjct 319 EYETFAHHL 327
>gi|333918657|ref|YP_004492238.1| hypothetical protein AS9A_0986 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480878|gb|AEF39438.1| hypothetical protein AS9A_0986 [Amycolicicoccus subflavus DQS3-9A1]
Length=329
Score = 359 bits (921), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 181/310 (59%), Positives = 214/310 (70%), Gaps = 9/310 (2%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP LSRF TAL R+I+G++GSDGRVHVPPVEYDPVT EPL+E V V + GTV +W+W
Sbjct 22 LGPVLSRFMTALAQRQILGIKGSDGRVHVPPVEYDPVTAEPLTEFVEVGTEGTVLTWSWC 81
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+P+ GQP+ +PFAWALI+LDGAD L+HAV+V + PS I TG RV WA+ P G I
Sbjct 82 PKPVEGQPIQQPFAWALIRLDGADAGLLHAVNVPS--PSDIRTGMRVQVQWAEAPTGHIR 139
Query 121 DIACF-------ALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIA 173
DIA F + P D D VT I+TPIQL HT S EES YLRA+A
Sbjct 140 DIAYFVPTDPGTSAAAPQAPPPPESKRDDEDRVTTIITPIQLAYDHTVSAEESRYLRALA 199
Query 174 QGKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPP 233
GKL+G R + G+VY PP GA P G PT+ VELPD G VTTF IVN+PFLGQRI PP
Sbjct 200 DGKLIGQRCAECGQVYIPPRGACPVDGVPTTTEVELPDTGIVTTFCIVNVPFLGQRITPP 259
Query 234 YVAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPD 293
YV AYVLLDGADI FLHLV D+ VRMGMRV AVW+P+ W + NI+YF PTGE D
Sbjct 260 YVVAYVLLDGADIAFLHLVRGCDSADVRMGMRVRAVWRPKAEWQTSLSNIDYFTPTGERD 319
Query 294 ANYDTYKHHL 303
A +T+ HL
Sbjct 320 APIETFARHL 329
>gi|296141489|ref|YP_003648732.1| hypothetical protein Tpau_3818 [Tsukamurella paurometabola DSM
20162]
gi|296029623|gb|ADG80393.1| protein of unknown function DUF35 [Tsukamurella paurometabola
DSM 20162]
Length=320
Score = 355 bits (912), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 177/307 (58%), Positives = 216/307 (71%), Gaps = 12/307 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP LS F T LR RRIVG R + GRVHVPP+E+DP T+ PL+++VPVS GTV SW+W
Sbjct 22 LGPVLSAFMTNLRDRRIVGTRDAAGRVHVPPLEFDPDTHAPLTDVVPVSDTGTVESWSWN 81
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+ GQP DRPFA+ALI+LDGADT L+HA+DV P+ + TG RV A W P GAI
Sbjct 82 AHPVDGQPFDRPFAYALIRLDGADTSLLHALDV--TDPADVSTGMRVRARWVADPTGAIG 139
Query 121 DIACFALGETAEPVAAHKTED----ARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGK 176
DIA F EP H D A DP+T++ TP++L ++H+AS E+ YLRA+A+G+
Sbjct 140 DIAAF------EPGEGHGVPDGTAAAEDPITILTTPVELHLEHSASVPETRYLRALAEGR 193
Query 177 LVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVA 236
L+G R GK G VY PP A P G PT+E V+LPD G VTTF +VN+PF GQRI PPYVA
Sbjct 194 LLGQRCGKCGNVYVPPRNACPIDGIPTTEEVDLPDTGVVTTFCVVNVPFQGQRITPPYVA 253
Query 237 AYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANY 296
AYVL+DGADIPFLHLV + +VRMGMRV A WKPRE W NI +F PTGEPDA Y
Sbjct 254 AYVLIDGADIPFLHLVLGCEPAEVRMGMRVRASWKPREEWTCSPGNISHFEPTGEPDAPY 313
Query 297 DTYKHHL 303
+Y+ HL
Sbjct 314 SSYEKHL 320
>gi|119716957|ref|YP_923922.1| hypothetical protein Noca_2732 [Nocardioides sp. JS614]
gi|119537618|gb|ABL82235.1| protein of unknown function DUF35 [Nocardioides sp. JS614]
Length=319
Score = 346 bits (888), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 168/302 (56%), Positives = 213/302 (71%), Gaps = 2/302 (0%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GP L RF T LR R+VG R SDGRV VPP E+DPV++E ++E V V+ GTV SWTW P
Sbjct 20 GPVLGRFLTGLRDGRVVGARTSDGRVVVPPPEFDPVSHEAVTEFVEVAPTGTVTSWTWVP 79
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
EP+ GQP DRPFA+AL+ LDGADT +HA+D+ A P + TG RV WA++ VGAITD
Sbjct 80 EPVPGQPFDRPFAFALVTLDGADTPFLHALDL--ASPDQVSTGMRVRVRWAEERVGAITD 137
Query 122 IACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGAR 181
IAC+ + V DPVT ++TP+ L+ + AS EESA+ R + +G++VG R
Sbjct 138 IACWEALDAVVEVRGEARLATTDPVTGVITPVSLDYLYAASPEESAFYRGLNEGRIVGQR 197
Query 182 TGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVLL 241
KVY PP A P+ G PT+E VE+ GT+TTF +VN+PFLGQ+I PPYV+AYVLL
Sbjct 198 CPACQKVYVPPRSACPSDGTPTAEEVEVAQTGTITTFCVVNVPFLGQKITPPYVSAYVLL 257
Query 242 DGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYKH 301
DGADI LHL+ V A +VRMGMRV+AVWKP E W ++NI++F PTGEPDA+YDTY+
Sbjct 258 DGADIAVLHLILGVPADEVRMGMRVKAVWKPEEEWTYSLENIDHFEPTGEPDADYDTYRQ 317
Query 302 HL 303
HL
Sbjct 318 HL 319
>gi|343928237|ref|ZP_08767691.1| hypothetical protein GOALK_111_00060 [Gordonia alkanivorans NBRC
16433]
gi|343761831|dbj|GAA14617.1| hypothetical protein GOALK_111_00060 [Gordonia alkanivorans NBRC
16433]
Length=347
Score = 346 bits (887), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 183/310 (60%), Positives = 216/310 (70%), Gaps = 10/310 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP LS+F ALR RIVG +GSDG V VPPVE+DPVT + +E+V VS+VGTV SWTW
Sbjct 41 LGPVLSQFALALRDGRIVGSKGSDGAVSVPPVEFDPVTGQQSTEIVEVSTVGTVTSWTWH 100
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P+ GQPLD+PFAWALIKLDGADT L+HAV V + PS I TG RVHA ++ +G I
Sbjct 101 DAPVPGQPLDKPFAWALIKLDGADTTLLHAVSVDS--PSEISTGLRVHAVFSAARIGRID 158
Query 121 DIACFALGETAEPVAAHKTEDA----RDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGK 176
DIA FA GE+ + A T DA + +I TP+ EI H+A+ EES YL + GK
Sbjct 159 DIAYFAPGESTD-AAPENTADAPKGADTGLVVIPTPVTTEITHSANEEESVYLEGLKAGK 217
Query 177 LVGARTGK---TGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPP 233
L+G R G G+VYFPP G PA G E VEL G VTTF IVN+PF GQRIKPP
Sbjct 218 LIGTRIGSGVDEGRVYFPPRGVSPADGSRAVERVELAHTGIVTTFCIVNVPFQGQRIKPP 277
Query 234 YVAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPD 293
YVAAYVLLDGADIPFLHL+ D +A VRMGMRV+AVW P + W I NI +F PTGEPD
Sbjct 278 YVAAYVLLDGADIPFLHLILDCEAADVRMGMRVKAVWLPEDEWEYSIGNISHFAPTGEPD 337
Query 294 ANYDTYKHHL 303
A+Y+TYK HL
Sbjct 338 ADYETYKDHL 347
>gi|262203823|ref|YP_003275031.1| hypothetical protein Gbro_3964 [Gordonia bronchialis DSM 43247]
gi|262087170|gb|ACY23138.1| protein of unknown function DUF35 [Gordonia bronchialis DSM 43247]
Length=329
Score = 345 bits (885), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 180/306 (59%), Positives = 218/306 (72%), Gaps = 5/306 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP LS+F ALR RIVG SDG V VPPVE+DP T P SE+V V++ GTV +W+WQ
Sbjct 26 LGPVLSQFALALRDGRIVGSANSDGTVSVPPVEFDPTTGAPTSELVDVATTGTVTTWSWQ 85
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PEP+A QPLDRPFAWALI+LDGADT ++HAV V +A A++TG RVHA W+ G I
Sbjct 86 PEPVAAQPLDRPFAWALIRLDGADTAILHAVAVDSA--DAMNTGMRVHAVWSAARTGRID 143
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIA FA G+TA+ + E++ D +I TPI EI H+A+ EES YL + GKL+G+
Sbjct 144 DIAHFAPGDTAQTAPDNTAENSEDTDVIITTPITTEIIHSANEEESVYLEGLKAGKLIGS 203
Query 181 RTGK---TGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAA 237
R G G+VYFPP PA G + E VELPD G VTTF IVN+PF GQ+IKPPYVAA
Sbjct 204 RIGSGVDAGRVYFPPRAVSPADGSRSVERVELPDTGIVTTFCIVNVPFRGQQIKPPYVAA 263
Query 238 YVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYD 297
YVLLDGADIPFLHL+ D +A VRMGMRV+AVW P + W I NI +F PTGE DA+Y+
Sbjct 264 YVLLDGADIPFLHLILDCEAADVRMGMRVKAVWAPEDEWEYSIGNISHFAPTGEDDADYE 323
Query 298 TYKHHL 303
TYK HL
Sbjct 324 TYKDHL 329
>gi|326331660|ref|ZP_08197948.1| hypothetical protein NBCG_03099 [Nocardioidaceae bacterium Broad-1]
gi|325950459|gb|EGD42511.1| hypothetical protein NBCG_03099 [Nocardioidaceae bacterium Broad-1]
Length=317
Score = 343 bits (881), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 175/306 (58%), Positives = 213/306 (70%), Gaps = 12/306 (3%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQP 61
GP + RF T LR IVG R DGRV VPP EYDPVTY ++E V + GTV SWTW
Sbjct 20 GPVIGRFLTGLRDATIVGGRLGDGRVAVPPPEYDPVTYRAVTEFVELPDTGTVTSWTWVS 79
Query 62 EPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAITD 121
EP+AGQP +PFA+ALI +DGADT +HAV+V A P I TG RV A WA++ G++TD
Sbjct 80 EPVAGQPFQKPFAYALITIDGADTPWLHAVEV--ASPDDIETGMRVRARWAEERTGSVTD 137
Query 122 IACFA----LGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKL 177
+ A ET P D V +IVTP+ L+ Q+ AS EES++ R +A+G++
Sbjct 138 LVFVADDGNAPETGTPAGGT------DDVGLIVTPVSLDYQYAASPEESSFFRGLAEGRI 191
Query 178 VGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAA 237
VG R K KVY PP GA P G PTSE +EL D GTVTTF +VN+PFLGQRIKPPYV+A
Sbjct 192 VGQRCPKCRKVYVPPRGACPTDGVPTSEEIELSDVGTVTTFCVVNVPFLGQRIKPPYVSA 251
Query 238 YVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYD 297
YVLLDGADI HL+ +V A +VRMGMRV+AVWKPRE WG I+NI +F PTGEPDA++D
Sbjct 252 YVLLDGADIALQHLILEVPAEEVRMGMRVKAVWKPREEWGTSIENISHFAPTGEPDADFD 311
Query 298 TYKHHL 303
+YKHHL
Sbjct 312 SYKHHL 317
>gi|311744230|ref|ZP_07718034.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311312403|gb|EFQ82316.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=319
Score = 336 bits (862), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 177/304 (59%), Positives = 217/304 (72%), Gaps = 6/304 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GPTLS F + LR R+++G SDG V VPP EYDP T EP++EM V+ G V SW W
Sbjct 21 LGPTLSDFMSGLRNRQVLGGVLSDGSVVVPPPEYDPHTLEPVTEMRRVADEGVVQSWVWV 80
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
EP+ QPLDRPFA+ALI LDGAD L+HAVD G+ P I TG RV A WA++ GAIT
Sbjct 81 SEPVRDQPLDRPFAFALIVLDGADQPLLHAVDAGS--PDQISTGLRVRARWAEETAGAIT 138
Query 121 DIACFA-LGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVG 179
DI F LG EP A A +PVT IV+P++L + AS EESA+ R +A+G+++G
Sbjct 139 DIRWFEPLG--TEPAPATDAGTA-EPVTGIVSPVRLAYDYAASPEESAFFRGLAEGRILG 195
Query 180 ARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYV 239
R KVY PP GA P G PT++ VELPD GTVTTF IVN+PFLGQ+I+PPYV+AYV
Sbjct 196 QRCPTCHKVYVPPRGACPVDGVPTTDEVELPDHGTVTTFCIVNVPFLGQKIEPPYVSAYV 255
Query 240 LLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTY 299
LLDGADI FLHL+ VDA VRMG+RV+AVWKPR+ WG I+NI +F PTGEPDA ++TY
Sbjct 256 LLDGADIAFLHLILGVDAADVRMGLRVKAVWKPRDEWGTTIENISHFEPTGEPDAGFETY 315
Query 300 KHHL 303
+ HL
Sbjct 316 QQHL 319
>gi|326383119|ref|ZP_08204808.1| hypothetical protein SCNU_09281 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198255|gb|EGD55440.1| hypothetical protein SCNU_09281 [Gordonia neofelifaecis NRRL
B-59395]
Length=326
Score = 333 bits (855), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 174/305 (58%), Positives = 211/305 (70%), Gaps = 5/305 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP LS+F ALR RIVG RGSDGRV PP E+DPV+ P + +V V+SVGTV SW+WQ
Sbjct 25 LGPVLSQFALALRDGRIVGSRGSDGRVTTPPAEFDPVSGAPTTGLVDVASVGTVESWSWQ 84
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P PL GQ LDRPFA+ALIKLDG+DT L+H VD A PS + GARVHA W G IT
Sbjct 85 PRPLDGQALDRPFAFALIKLDGSDTSLVHVVD--AADPSQLSVGARVHAVWRAARSGVIT 142
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEI--QHTASHEESAYLRAIAQGKLV 178
DIA F+LGE A E D ++ + H+A+ ES YL + GKL+
Sbjct 143 DIAHFSLGEAPSDAPAATGEMNVDDAGHVIITTPITTDIMHSAAESESWYLEGLKAGKLI 202
Query 179 GARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAY 238
G R +TG+VYFPP PA G PT E VEL D GTVTTF IVN+PFLGQ+IKPPYVAAY
Sbjct 203 GGRV-QTGEVYFPPRYVSPADGSPTVERVELADSGTVTTFCIVNVPFLGQQIKPPYVAAY 261
Query 239 VLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDT 298
VLLDGADIPFLHL+ D A +VRMGMRV+AVW+P W + NI +F P+GEPDA++++
Sbjct 262 VLLDGADIPFLHLILDTPAEEVRMGMRVKAVWRPESEWDHTMRNISHFAPSGEPDADFES 321
Query 299 YKHHL 303
Y++HL
Sbjct 322 YRNHL 326
>gi|145595167|ref|YP_001159464.1| hypothetical protein Strop_2642 [Salinispora tropica CNB-440]
gi|145304504|gb|ABP55086.1| protein of unknown function DUF35 [Salinispora tropica CNB-440]
Length=329
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 168/303 (56%), Positives = 202/303 (67%), Gaps = 5/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP L RF T LR RR++G R +DGRVHVPP+EYDP T+ P++E+VPV GTV SWTW
Sbjct 32 LGPVLGRFMTGLRDRRVLGARTADGRVHVPPLEYDPATHAPVTELVPVPHTGTVTSWTWT 91
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PL GQPLDRPF WAL++LDGADT L+HAVD GT ++ TG RV WA Q G I
Sbjct 92 DRPLDGQPLDRPFGWALVRLDGADTALLHAVDAGTR--ESMRTGMRVRIRWAAQRSGHIR 149
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACF + P DPVT I TPI+L HT S EES YLRA+A+G+L+G
Sbjct 150 DIACFEPDQGPPPTLDDTISG--DPVTGITTPIRLSYTHTTSAEESRYLRALAEGRLLGQ 207
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
R KVY PP PA G PT E V + D+GT+TTF +VN+PF GQR+ PPYV A VL
Sbjct 208 RCPACRKVYVPPRVC-PADGVPTEEEVPVRDRGTITTFCVVNVPFAGQRLDPPYVVAQVL 266
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIP HL+ +VRMGMRV AVW+ W +NI +FRPT EPDA Y++Y+
Sbjct 267 LDGADIPIPHLILGPATSEVRMGMRVAAVWREPTTWSTTPENIAHFRPTDEPDAPYESYQ 326
Query 301 HHL 303
HL
Sbjct 327 EHL 329
>gi|159038412|ref|YP_001537665.1| hypothetical protein Sare_2839 [Salinispora arenicola CNS-205]
gi|157917247|gb|ABV98674.1| protein of unknown function DUF35 [Salinispora arenicola CNS-205]
Length=319
Score = 325 bits (834), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 166/303 (55%), Positives = 206/303 (68%), Gaps = 6/303 (1%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP L +F T LR RR++G R SDGRVHVPP+EYDP T+ P++E+VPV GTV SWTW
Sbjct 23 LGPVLGQFMTGLRDRRVLGARTSDGRVHVPPLEYDPATHAPVTELVPVQPTGTVTSWTWT 82
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
PL GQPLDRPF WALI+LDG+DT L+HAVD AG ++ TG RV WA + G I
Sbjct 83 ERPLDGQPLDRPFGWALIRLDGSDTPLLHAVD---AGRESMRTGMRVRIRWATRRSGHIR 139
Query 121 DIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLVGA 180
DIACF + +P DPVT++ TPI+L HT S EES YLRA+A+G+L+G
Sbjct 140 DIACFEPVQAPDP--GVDPAAGGDPVTVMTTPIRLSYTHTTSAEESRYLRALAEGRLLGQ 197
Query 181 RTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAYVL 240
R KVY PP PA G PT + V + D GTVTT+ +VN+PF GQR+ PPYV A +L
Sbjct 198 RCPVCRKVYVPPRVC-PADGVPTEDEVPVRDHGTVTTYCVVNVPFAGQRLDPPYVVAQIL 256
Query 241 LDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDTYK 300
LDGADIP HL+ + +VRMGMRV AVW+ E W +NI +FRPTGEPDA Y++Y+
Sbjct 257 LDGADIPIPHLILGLPTSEVRMGMRVAAVWRDPETWSTTPENIAHFRPTGEPDAPYESYQ 316
Query 301 HHL 303
HL
Sbjct 317 EHL 319
>gi|319950792|ref|ZP_08024680.1| hypothetical protein ES5_14353 [Dietzia cinnamea P4]
gi|319435549|gb|EFV90781.1| hypothetical protein ES5_14353 [Dietzia cinnamea P4]
Length=323
Score = 290 bits (742), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 157/311 (51%), Positives = 192/311 (62%), Gaps = 13/311 (4%)
Query 2 GPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSS----VGTVASW 57
GP L FFT LR RR+VG R S G VH+PPVE+DP T L+E V V S G V +W
Sbjct 17 GPVLGAFFTGLRERRLVGNRDSRGTVHLPPVEFDPHTRRALTESVEVGSGSAIEGLVVAW 76
Query 58 TWQPEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVG 117
TW P P PLD PFAWAL++ DGADT ++ + + GP A+ TG RV WA + G
Sbjct 77 TWVPAPTEVNPLDTPFAWALVRFDGADTAML--LPLAADGPEAVSTGMRVRLRWAAERTG 134
Query 118 AITDIACFALGETAEPVAAHKTED-----ARDPVTMIVTPIQLEIQHTASHEESAYLRAI 172
I DIAC + PV A +D DPVT++VTPI L + HTA ES YLRAI
Sbjct 135 TIHDIACVVPADA--PVDAGVDDDEVPQQTDDPVTIVVTPIGLSVTHTAGPAESEYLRAI 192
Query 173 AQGKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKP 232
QGK++G R +VY PP P+ G E+VE+ D GTVTTF IVN+PF GQ+I P
Sbjct 193 VQGKVLGRRRSNGPEVYVPPRDYCPSDGVAMGEYVEVSDIGTVTTFGIVNVPFAGQQITP 252
Query 233 PYVAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEP 292
PYV AY+LLDG+D+P HLV +A +VRMGMRV AVW P E + I +F PTG+P
Sbjct 253 PYVTAYILLDGSDVPIQHLVLGCEASEVRMGMRVRAVWNPEEGRPASMKAIAHFEPTGDP 312
Query 293 DANYDTYKHHL 303
DA+ TY HL
Sbjct 313 DADPATYSKHL 323
>gi|182440626|ref|YP_001828345.1| hypothetical protein SGR_6833 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178469142|dbj|BAG23662.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length=315
Score = 273 bits (697), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 150/304 (50%), Positives = 189/304 (63%), Gaps = 12/304 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R ++GVR DG V VPPVEYDPVT L ++V V+ GTV +W W
Sbjct 19 LGPVQSAFLTGLRERTVLGVRTEDGTVLVPPVEYDPVTANELRDLVEVAPTGTVTTWAWN 78
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P P QPLD PFAW L++LDGA T L+H +D GP A+ TG RV WA GAIT
Sbjct 79 PSPRRDQPLDTPFAWVLVRLDGAGTALLHVLDA--PGPDAVRTGMRVRVRWAADRTGAIT 136
Query 121 DIACFALGETAEPVAAHKTE---DARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKL 177
DIACF E+ EP AA T + DPVT IVTP +L+ HT +SAY++A+ + +
Sbjct 137 DIACFEPYES-EPGAAEPTPHSGEFSDPVTGIVTPARLDYVHTPGRAQSAYIKALEERRT 195
Query 178 VGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAA 237
VG R KVY PP GA P G T+E VE+ +GTVTTF IVNI I+ PYV A
Sbjct 196 VGERCPACRKVYVPPRGACPTCGVATAEQVEVGPRGTVTTFCIVNIKAKNLDIEVPYVYA 255
Query 238 YVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYD 297
++ LDGAD+ ++ + QVRMG+RVE VW G +++++RPTGEPDA+YD
Sbjct 256 HIALDGADLALHGRIAGIPYDQVRMGLRVEPVWSE------GARHVDHYRPTGEPDADYD 309
Query 298 TYKH 301
TYK
Sbjct 310 TYKE 313
>gi|326781301|ref|ZP_08240566.1| protein of unknown function DUF35 [Streptomyces cf. griseus XylebKG-1]
gi|326661634|gb|EGE46480.1| protein of unknown function DUF35 [Streptomyces griseus XylebKG-1]
Length=315
Score = 271 bits (694), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 150/305 (50%), Positives = 188/305 (62%), Gaps = 14/305 (4%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R ++GVR DG V VPPVEYDPVT L ++V V+ GTV +W W
Sbjct 19 LGPVQSAFLTGLRERTVLGVRTDDGTVLVPPVEYDPVTANELRDLVEVAPTGTVTTWAWN 78
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P P QPLD PFAW L++LDGA T L+H +D GP A+ TG RV WA GAIT
Sbjct 79 PSPRRDQPLDTPFAWVLVRLDGAGTALLHVLDA--PGPDAVRTGMRVRVRWAADRTGAIT 136
Query 121 DIACFALGE----TAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGK 176
DIACF E TAEP T + DPVT IVTP +L+ HT +SAY++A+ + +
Sbjct 137 DIACFEPYESEPGTAEPTP--HTGEFTDPVTGIVTPARLDYVHTPGRAQSAYIKALEERR 194
Query 177 LVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVA 236
VG R KVY PP GA P G T+E VE+ +GTVTTF IVNI I+ PYV
Sbjct 195 TVGERCPACRKVYVPPRGACPTCGVATTEQVEVGPRGTVTTFCIVNIKAKNLDIEVPYVY 254
Query 237 AYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANY 296
A++ LDGAD+ ++ + QVRMG+RVE VW G +++++RPTGEPDA+Y
Sbjct 255 AHIALDGADLALHGRIAGIPYDQVRMGLRVEPVWSE------GARHVDHYRPTGEPDADY 308
Query 297 DTYKH 301
DTYK
Sbjct 309 DTYKE 313
>gi|345013190|ref|YP_004815544.1| hypothetical protein Strvi_5753 [Streptomyces violaceusniger
Tu 4113]
gi|344039539|gb|AEM85264.1| protein of unknown function DUF35 [Streptomyces violaceusniger
Tu 4113]
Length=319
Score = 271 bits (694), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 153/305 (51%), Positives = 188/305 (62%), Gaps = 11/305 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R ++GVR +DGRV +PPVEYDPVT + LS++V V+ GTV +W W
Sbjct 24 LGPVQSAFLTGLRERTVLGVRTTDGRVLMPPVEYDPVTADELSDLVEVAPTGTVTTWAWN 83
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P P GQPLD PFAW L++LDGADT L+HAVD GP A+ TG RV WA + VGAIT
Sbjct 84 PAPRRGQPLDTPFAWVLVRLDGADTALLHAVDA--PGPDAVRTGMRVRIRWAGERVGAIT 141
Query 121 DIACFALGETAEPVAA--HKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLV 178
DIACF + AE A H E DPVT IV P +L+ + +S YL+A+A
Sbjct 142 DIACFEPYDGAEGGEAVPHNGE-FTDPVTGIVAPARLDYTYAPGRAQSRYLKALAGRTTQ 200
Query 179 GARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAY 238
G R KVY PP GA P G T E VE+ +GTVTTF IVNI I+ PYV A+
Sbjct 201 GERCPSCRKVYVPPRGACPTCGVATDEQVEVGPRGTVTTFCIVNIKARNLDIEVPYVYAH 260
Query 239 VLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDT 298
+ LDGA + V + QVRMG+RVE VW R+ +++RPTGEPDA+YDT
Sbjct 261 IALDGAGLALHGRVGGIPYDQVRMGLRVEPVWSEASRYP------DHYRPTGEPDADYDT 314
Query 299 YKHHL 303
YK L
Sbjct 315 YKELL 319
>gi|29827785|ref|NP_822419.1| hypothetical protein SAV_1244 [Streptomyces avermitilis MA-4680]
gi|15824259|dbj|BAB69415.1| hypothetical protein [Streptomyces avermitilis]
gi|29604886|dbj|BAC68954.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=306
Score = 271 bits (694), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 150/306 (50%), Positives = 193/306 (64%), Gaps = 11/306 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R ++GV+ DGR VPPVEYDPVT E + ++V V+ GTV +W W
Sbjct 9 LGPVQSAFLTGLRERVLLGVKTGDGRTLVPPVEYDPVTAEEIHDLVEVAPTGTVTTWAWN 68
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P GQPLD PFAW L++LDGADT L+HA+D AGP A+H+G RV WA Q GAIT
Sbjct 69 HAPRRGQPLDTPFAWVLVRLDGADTALLHALDA--AGPDAVHSGLRVRVRWAAQRSGAIT 126
Query 121 DIACFALGETAEPVAAHKT-EDAR--DPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKL 177
DIACF ++ + AA T D R DPVT IV P +L+ ++ +SA+L A+A+ +
Sbjct 127 DIACFEPYDSGDDAAAEPTGHDGRFADPVTGIVAPARLDYVYSPGRAQSAHLDALAEQRT 186
Query 178 VGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAA 237
VG R KVY PP GA P G TSE VE+ +GTVTT+ IVNI I+ PYV A
Sbjct 187 VGERCPSCRKVYVPPRGACPTCGVATSEAVEVGPRGTVTTYCIVNIKAKNLDIEVPYVYA 246
Query 238 YVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYD 297
++ LDGAD+ + + QVRMG+RVE VW R + +++RPTGEPDA+Y+
Sbjct 247 HIALDGADLALHGRIGGIPYDQVRMGLRVEPVWTDGGR------HPDHYRPTGEPDADYE 300
Query 298 TYKHHL 303
TYK L
Sbjct 301 TYKELL 306
>gi|297197503|ref|ZP_06914900.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|297146762|gb|EDY59662.2| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=305
Score = 267 bits (683), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/305 (50%), Positives = 188/305 (62%), Gaps = 12/305 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R I+GV+ DGR VPPVEYDPVT E + ++V V GTV +W W
Sbjct 11 LGPVQSAFLTGLRERVILGVKTRDGRTLVPPVEYDPVTAEEIRDLVAVGVTGTVTTWAWN 70
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
P GQPLDRPFAW L+KLDGADT L+HA+D GP A+ TG RV WA++ GAIT
Sbjct 71 HAPRRGQPLDRPFAWVLVKLDGADTALLHALD--APGPDAVRTGMRVRVRWAEERTGAIT 128
Query 121 DIACFA--LGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLV 178
DIACF G+ +E VA H E DPV IV +L+ ++ ++AY+ A+A+ + V
Sbjct 129 DIACFEPYDGDDSE-VAVHAGE-FEDPVHGIVAQARLDYTYSPGRAQTAYINALAERRAV 186
Query 179 GARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAY 238
G R KVY PP GA P G T+E VE+ GTVTTF IVNI I+ PYV A+
Sbjct 187 GERCPSCRKVYVPPRGACPTCGVATAEQVEVGPSGTVTTFCIVNIKAKNLDIEVPYVYAH 246
Query 239 VLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDT 298
+ LDGAD+ + + QVRMG+RVE VW R+ +++RPTGEPDA YDT
Sbjct 247 IALDGADLALHGRIGGIPYDQVRMGLRVEPVWTEGGRYP------DHYRPTGEPDAEYDT 300
Query 299 YKHHL 303
YK L
Sbjct 301 YKELL 305
>gi|328880326|emb|CCA53565.1| hypothetical protein SVEN_0278 [Streptomyces venezuelae ATCC
10712]
Length=320
Score = 262 bits (670), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 145/305 (48%), Positives = 188/305 (62%), Gaps = 11/305 (3%)
Query 1 VGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSEMVPVSSVGTVASWTWQ 60
+GP S F T LR R ++GVR G V VPPVEYDP T L ++V V + GTV +W W
Sbjct 25 LGPVQSAFLTGLRERTVLGVRTGTGEVLVPPVEYDPATAAELRDLVEVGATGTVTTWAWN 84
Query 61 PEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIHTGARVHAHWADQPVGAIT 120
EP GQPL PFAW L++LDGADT L+HA+D GP A+ TG RV WAD+ GAIT
Sbjct 85 HEPRPGQPLATPFAWVLVRLDGADTALLHALDA--PGPHAVRTGMRVRVRWADERAGAIT 142
Query 121 DIACFALGETAEPVAAHKTEDA--RDPVTMIVTPIQLEIQHTASHEESAYLRAIAQGKLV 178
DIACF ++ +PVA + D DPVT IV P +L+ ++ ++ YLRA+A+ + V
Sbjct 143 DIACFEPHDS-DPVAEPRPHDGLFADPVTGIVAPARLDYTYSPGGAQTRYLRALAERRTV 201
Query 179 GARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTFAIVNIPFLGQRIKPPYVAAY 238
G R KVY PP GA P G T++ VE+ +GTVTT+ IVNI I+ PYV A+
Sbjct 202 GERCPSCSKVYVPPRGACPTCGVATTDQVEVGPRGTVTTYCIVNIKAKNLDIEVPYVYAH 261
Query 239 VLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERWGLGIDNIEYFRPTGEPDANYDT 298
+ LDGA + + + QVRMG+RVE VW R+ +++RPTGEPDA+YDT
Sbjct 262 IALDGAGLALHGRIGGIPYDQVRMGLRVEPVWSDDGRYP------DHYRPTGEPDADYDT 315
Query 299 YKHHL 303
YK L
Sbjct 316 YKELL 320
Lambda K H
0.319 0.137 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 508884486504
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40