BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3031
Length=526
                                                                  Score     E
Sequences producing significant alignments:                       (Bits)  Value
gi|15610168|ref|NP_217547.1| hypothetical protein Rv3031 [Mycoba... 1058 0.0
gi|289448707|ref|ZP_06438451.1| conserved hypothetical protein [... 1056 0.0
gi|298526499|ref|ZP_07013908.1| conserved hypothetical protein [... 1056 0.0
gi|340628023|ref|YP_004746475.1| hypothetical protein MCAN_30561... 1051 0.0
gi|308232321|ref|ZP_07664056.1| hypothetical protein TMAG_03204 ... 1043 0.0
gi|289575740|ref|ZP_06455967.1| conserved hypothetical protein [... 1033 0.0
gi|240169582|ref|ZP_04748241.1| hypothetical protein MkanA1_0973... 937 0.0
gi|296171246|ref|ZP_06852650.1| family 57 glycosyl hydrolase [My... 936 0.0
gi|183981696|ref|YP_001849987.1| hypothetical protein MMAR_1682 ... 933 0.0
gi|118617516|ref|YP_905848.1| hypothetical protein MUL_1920 [Myc... 933 0.0
gi|342858147|ref|ZP_08714802.1| hypothetical protein MCOL_04716 ... 932 0.0
gi|15827918|ref|NP_302181.1| hypothetical protein ML1714 [Mycoba... 920 0.0
gi|254776299|ref|ZP_05217815.1| glycosyl hydrolase, family prote... 918 0.0
gi|41409161|ref|NP_961997.1| hypothetical protein MAP3063 [Mycob... 917 0.0
gi|118464392|ref|YP_883040.1| glycosyl hydrolase, family protein... 916 0.0
gi|254821747|ref|ZP_05226748.1| glycosyl hydrolase, family prote... 914 0.0
gi|336459159|gb|EGO38106.1| hypothetical protein MAPs_05970 [Myc... 913 0.0
gi|108798836|ref|YP_639033.1| glycoside hydrolase family protein... 862 0.0
gi|126434436|ref|YP_001070127.1| glycoside hydrolase family prot... 860 0.0
gi|120403089|ref|YP_952918.1| glycoside hydrolase family protein... 858 0.0
gi|118469272|ref|YP_886692.1| glycosyl hydrolase, family protein... 856 0.0
gi|145224840|ref|YP_001135518.1| glycoside hydrolase family prot... 850 0.0
gi|315445171|ref|YP_004078050.1| hypothetical protein Mspyr1_360... 850 0.0
gi|333991391|ref|YP_004524005.1| hypothetical protein JDM601_275... 818 0.0
gi|169630446|ref|YP_001704095.1| hypothetical protein MAB_3365 [... 807 0.0
gi|229493420|ref|ZP_04387209.1| glycosyl hydrolase, family prote... 717 0.0
gi|226305858|ref|YP_002765818.1| hypothetical protein RER_23710 ... 716 0.0
gi|312140520|ref|YP_004007856.1| glycosyl hydrolase family 57 [R... 713 0.0
gi|325675860|ref|ZP_08155544.1| family 57 glycosyl hydrolase [Rh... 713 0.0
gi|226365922|ref|YP_002783705.1| hypothetical protein ROP_65130 ... 704 0.0
gi|111023421|ref|YP_706393.1| hypothetical protein RHA1_ro06460 ... 700 0.0
gi|289751702|ref|ZP_06511080.1| conserved hypothetical protein [... 692 0.0
gi|326381510|ref|ZP_08203204.1| hypothetical protein SCNU_01125 ... 681 0.0
gi|296393812|ref|YP_003658696.1| hypothetical protein Srot_1401 ... 673 0.0
gi|262203152|ref|YP_003274360.1| hypothetical protein Gbro_3262 ... 672 0.0
gi|343926882|ref|ZP_08766375.1| hypothetical protein GOALK_072_0... 663 0.0
gi|54026252|ref|YP_120494.1| hypothetical protein nfa42810 [Noca... 657 0.0
gi|317508605|ref|ZP_07966264.1| glycosyl hydrolase [Segniliparus... 657 0.0
gi|333920888|ref|YP_004494469.1| Family 57 glycosyl hydrolase [A... 647 0.0
gi|289759155|ref|ZP_06518533.1| conserved hypothetical protein [... 642 0.0
gi|296140613|ref|YP_003647856.1| glycoside hydrolase family prot... 638 0.0
gi|2414527|emb|CAB16416.1| hypothetical protein MLCB637.01c [Myc... 593 2e-167
gi|325000548|ref|ZP_08121660.1| glycoside hydrolase family 57 [P... 556 5e-156
gi|300783571|ref|YP_003763862.1| hypothetical protein AMED_1649 ... 555 7e-156
gi|302524922|ref|ZP_07277264.1| glycoside hydrolase, family 57 [... 549 4e-154
gi|257056805|ref|YP_003134637.1| hypothetical protein Svir_28290... 546 2e-153
gi|256380050|ref|YP_003103710.1| glycoside hydrolase family prot... 544 1e-152
gi|331695603|ref|YP_004331842.1| hypothetical protein Psed_1754 ... 533 2e-149
gi|134102636|ref|YP_001108297.1| glycoside hydrolase family prot... 532 7e-149
gi|336179611|ref|YP_004584986.1| hypothetical protein FsymDg_378... 436 5e-120
>gi|15610168|ref|NP_217547.1| hypothetical protein Rv3031 [Mycobacterium tuberculosis H37Rv]
gi|15842594|ref|NP_337631.1| hypothetical protein MT3115 [Mycobacterium tuberculosis CDC1551]
gi|31794209|ref|NP_856702.1| hypothetical protein Mb3057 [Mycobacterium bovis AF2122/97]
40 more sequence titles
Length=526
Score = 1058 bits (2735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/526 (99%), Positives = 526/526 (100%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV
Sbjct 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
>gi|289448707|ref|ZP_06438451.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289421665|gb|EFD18866.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=526
Score = 1056 bits (2732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/526 (99%), Positives = 526/526 (100%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVW+GAKVADLVQLNSEV
Sbjct 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWNGAKVADLVQLNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
>gi|298526499|ref|ZP_07013908.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298496293|gb|EFI31587.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=526
Score = 1056 bits (2732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/526 (99%), Positives = 525/526 (99%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAY PGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYVPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV
Sbjct 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
>gi|340628023|ref|YP_004746475.1| hypothetical protein MCAN_30561 [Mycobacterium canettii CIPT
140010059]
gi|340006213|emb|CCC45387.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=526
Score = 1051 bits (2717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/526 (99%), Positives = 524/526 (99%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALD+FATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDDFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRL HRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLGHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDP+RADRAVDVHVADFVDVVRNRLL ESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPKRADRAVDVHVADFVDVVRNRLLCESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV
Sbjct 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
>gi|308232321|ref|ZP_07664056.1| hypothetical protein TMAG_03204 [Mycobacterium tuberculosis SUMu001]
gi|308369938|ref|ZP_07666828.1| hypothetical protein TMBG_03182 [Mycobacterium tuberculosis SUMu002]
gi|308371213|ref|ZP_07667113.1| hypothetical protein TMCG_01446 [Mycobacterium tuberculosis SUMu003]
23 more sequence titles
Length=519
Score = 1043 bits (2696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/519 (99%), Positives = 519/519 (100%), Gaps = 0/519 (0%)
Query 8 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 67
+PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM
Sbjct 1 MPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 60
Query 68 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 127
TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE
Sbjct 61 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 120
Query 128 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 187
CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL
Sbjct 121 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 180
Query 188 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 247
ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV
Sbjct 181 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 240
Query 248 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 307
AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA
Sbjct 241 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 300
Query 308 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 367
DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP
Sbjct 301 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 360
Query 368 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 427
AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT
Sbjct 361 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 420
Query 428 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 487
IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT
Sbjct 421 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 480
Query 488 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 481 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 519
>gi|289575740|ref|ZP_06455967.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289540171|gb|EFD44749.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=525
Score = 1033 bits (2671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 515/526 (98%), Positives = 519/526 (99%), Gaps = 1/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERADRAVDVHVADFVDV + + ER+GRPAHVIAAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPERADRAVDVHVADFVDVFPGQRIG-PERVGRPAHVIAAFDTELFGHWWYEGPTWLQ 359
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV
Sbjct 360 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 419
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 420 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 479
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
Sbjct 480 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 525
>gi|240169582|ref|ZP_04748241.1| hypothetical protein MkanA1_09732 [Mycobacterium kansasii ATCC
12478]
Length=526
Score = 937 bits (2423), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/526 (88%), Positives = 490/526 (94%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N + S VPG+FTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL +VL ALA E+R
Sbjct 1 MNATVSRVPGMFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLFRVLRALAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLGMTPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR +SKSA Y SCTPEAL
Sbjct 61 GLVTLGMTPVVNAQLDDPYCLDGMHHWLANWRLRATEAASVRCLPRSKSASYQSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFG REC +A +AL +FAT WRHGGSPLLRGLIDAG VELLGGPLAHPFQPLL PRLRE
Sbjct 121 RAFGTRECVEADQALQDFATLWRHGGSPLLRGLIDAGAVELLGGPLAHPFQPLLNPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADA LR+A RP GIWAPECAYAPG+E DYA AGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAWLRMAARPTGIWAPECAYAPGLEHDYAAAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNV SE KA
Sbjct 241 VGDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVSSEAKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERADRAVD+HVADFVD+VR RL++ESERIGRPAHV+AAFDTELFGHWWYEGPTWL
Sbjct 301 PYDPERADRAVDIHVADFVDLVRGRLIAESERIGRPAHVVAAFDTELFGHWWYEGPTWLA 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALPAAGVRVGTL DA+ DGFVG+ VELPPSSWGSGKDWQVW+G KVADLVQLNSEV
Sbjct 361 RVLRALPAAGVRVGTLRDALTDGFVGEAVELPPSSWGSGKDWQVWNGEKVADLVQLNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTT+DKALAQTASLDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTVDKALAQTASLDGPIPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALA+GRRD A+RLA+GWNRADGLFGALDARRLP+
Sbjct 481 HLHAHATREIAGALASGRRDNAQRLADGWNRADGLFGALDARRLPR 526
>gi|296171246|ref|ZP_06852650.1| family 57 glycosyl hydrolase [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894214|gb|EFG73971.1| family 57 glycosyl hydrolase [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=526
Score = 936 bits (2419), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/526 (88%), Positives = 488/526 (93%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+ TS VPG+FTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPL++VL LADE+R
Sbjct 1 MTTSRDRVPGMFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLMRVLTTLADEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR A +S+SA Y +CTPEAL
Sbjct 61 GLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAAEAASVRSAPRSRSAGYQACTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRECA+A RAL+ FAT WRHGGSPLLR LIDAGTVELLGGPLAHPFQPLL PRLRE
Sbjct 121 RALGIRECAEADRALEEFATHWRHGGSPLLRRLIDAGTVELLGGPLAHPFQPLLTPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQ R+AHRP GIWAPECAYAPGME DYA AGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQQRIAHRPGGIWAPECAYAPGMERDYAAAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPS+ KA
Sbjct 241 VGDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSDAKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERAD AVD HV DFV VVRNRL SESERIGRPAHV+AAFDTELFGHWWYEGPTWLQ
Sbjct 301 PYDPERADHAVDTHVDDFVGVVRNRLASESERIGRPAHVVAAFDTELFGHWWYEGPTWLQ 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRA+PAAGVRVGTL+DAIADG VG V LPPSSWGSGKDWQVW+G KVADLVQLN+EV
Sbjct 361 RVLRAMPAAGVRVGTLTDAIADGLVGSAVALPPSSWGSGKDWQVWAGDKVADLVQLNTEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTT+DKALAQ ASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALTTVDKALAQRASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALA+GRRD A RLA+GWNRADGLFGALDARRLP+
Sbjct 481 HLHAHATREIAGALASGRRDAAGRLADGWNRADGLFGALDARRLPR 526
>gi|183981696|ref|YP_001849987.1| hypothetical protein MMAR_1682 [Mycobacterium marinum M]
gi|183175022|gb|ACC40132.1| conserved protein [Mycobacterium marinum M]
Length=531
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/525 (88%), Positives = 487/525 (93%), Gaps = 0/525 (0%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL +VL LADE R
Sbjct 7 TNSPDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLFRVLRTLADEGRRG 66
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
LITLGMTPVVNAQLDDPYCL+G+HHWLANW+LRA EA SVR + SKSA + SCTPEALR
Sbjct 67 LITLGMTPVVNAQLDDPYCLDGMHHWLANWRLRAAEATSVRSSPASKSAKHLSCTPEALR 126
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
FG RECA+A +ALD+FAT WRHG SPLLR L+DAGTVELLGGPLAHPFQPLL PRLREF
Sbjct 127 DFGTRECAEADQALDDFATAWRHGASPLLRALLDAGTVELLGGPLAHPFQPLLNPRLREF 186
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
ALREGLADA+LR+AH P GIWAPECAYAPGME DYA A VSHFMVDGPSLHGDTALGRPV
Sbjct 187 ALREGLADARLRMAHSPSGIWAPECAYAPGMEHDYAAAAVSHFMVDGPSLHGDTALGRPV 246
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
G TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNV SE KAP
Sbjct 247 GDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVASEAKAP 306
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
YDP+RADRA+DVHVADFV+VVR RL++ES+RIGRPAHV+AAFDTELFGHWWYEGPTWLQR
Sbjct 307 YDPQRADRAIDVHVADFVEVVRGRLIAESQRIGRPAHVVAAFDTELFGHWWYEGPTWLQR 366
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLRALPAAGVRVGTL DA+A+GFVG PVELPPSSWGSGKDWQVW+G KV+DLVQLNSEVV
Sbjct 367 VLRALPAAGVRVGTLHDAMANGFVGAPVELPPSSWGSGKDWQVWNGPKVSDLVQLNSEVV 426
Query 422 DTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 481
DTALTT+DKALAQTASLDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH
Sbjct 427 DTALTTVDKALAQTASLDGPIPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 486
Query 482 LHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
LHAHATREIAGALA+GRRD A+RLA+GWNRADGLFGALDARRLPK
Sbjct 487 LHAHATREIAGALASGRRDNAQRLAQGWNRADGLFGALDARRLPK 531
>gi|118617516|ref|YP_905848.1| hypothetical protein MUL_1920 [Mycobacterium ulcerans Agy99]
gi|118569626|gb|ABL04377.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=531
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/525 (88%), Positives = 486/525 (93%), Gaps = 0/525 (0%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL +VL LADE R
Sbjct 7 TNSPDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLFRVLRTLADEGRRG 66
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
LITLGMTPVVNAQLDDPYCL+G+HHWLANW+LRA EA SVR A SKSA + SCTPEALR
Sbjct 67 LITLGMTPVVNAQLDDPYCLDGMHHWLANWRLRAAEATSVRSAPASKSAKHLSCTPEALR 126
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
FG RECA+A +ALD+FA WRHG SPLLR LIDAGTVELLGGPLAHPFQPLL PRLREF
Sbjct 127 DFGTRECAEADQALDDFAAAWRHGASPLLRALIDAGTVELLGGPLAHPFQPLLNPRLREF 186
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
ALREGLADA+LR+AH P GIWAPECAYAPGME DYA A VSHFMVDGPSLHGDTALGRPV
Sbjct 187 ALREGLADARLRMAHSPSGIWAPECAYAPGMEHDYAAAAVSHFMVDGPSLHGDTALGRPV 246
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
G TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNV SE KAP
Sbjct 247 GDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVASEAKAP 306
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
YDP+RADRA+DVHVADFV+VVR RL++ES+RIGRPAHV+AAFDTELFGHWWYEGPTWLQR
Sbjct 307 YDPQRADRAIDVHVADFVEVVRGRLIAESQRIGRPAHVVAAFDTELFGHWWYEGPTWLQR 366
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLRALPAAGVRVGTL DA+A+GFVG PVELPPSSWGSGKDWQVW+G KV+DLVQLNSEVV
Sbjct 367 VLRALPAAGVRVGTLHDAMANGFVGAPVELPPSSWGSGKDWQVWNGPKVSDLVQLNSEVV 426
Query 422 DTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 481
DTALTT+DKALAQTASLDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH
Sbjct 427 DTALTTVDKALAQTASLDGPIPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 486
Query 482 LHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
LHAHATREIAGALA+GRRD A+RLA+GWNRADGLFGALDARRLPK
Sbjct 487 LHAHATREIAGALASGRRDNAQRLAQGWNRADGLFGALDARRLPK 531
>gi|342858147|ref|ZP_08714802.1| hypothetical protein MCOL_04716 [Mycobacterium colombiense CECT
3035]
gi|342133851|gb|EGT87031.1| hypothetical protein MCOL_04716 [Mycobacterium colombiense CECT
3035]
Length=526
Score = 932 bits (2408), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/526 (88%), Positives = 488/526 (93%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
++ S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAA+YLPLL+VL LA E+R
Sbjct 1 MSASQDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAASYLPLLRVLHTLAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
+ITLG+TPVVNAQLDDPYCL+G+HHWLANW+LR EA SVR A +SKSA Y SCTPEAL
Sbjct 61 GVITLGVTPVVNAQLDDPYCLDGMHHWLANWRLRGLEATSVRSAPRSKSAGYQSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE +A RALD+FATRWRHGGSPLLR L+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIRESDEAQRALDDFATRWRHGGSPLLRSLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQ RLAHRP GIWAPECAYAPGME DYA AGV+HFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQARLAHRPGGIWAPECAYAPGMEHDYAAAGVTHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNV SE KA
Sbjct 241 VGDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVSSEDKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERAD AVD HVADFVDVVRNRLL+ESERIGRPAHV+AAFDTELFGHWWYEGPTWL+
Sbjct 301 PYDPERADHAVDTHVADFVDVVRNRLLAESERIGRPAHVVAAFDTELFGHWWYEGPTWLE 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALP AGVRVGTL+DAIADGFVGDPV LPPSSWGSGKDWQVW+G +VADLV LNSEV
Sbjct 361 RVLRALPEAGVRVGTLTDAIADGFVGDPVALPPSSWGSGKDWQVWAGDQVADLVALNSEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VD AL+T+DKAL+QTA LDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSA DYARYRA
Sbjct 421 VDMALSTVDKALSQTAPLDGPIPRDHVADQILRETLLTVSSDWPFMVSKDSATDYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIA ALA+GRRDTA+RLAEGWNRADGLFGALDARRLP+
Sbjct 481 HLHAHATREIADALASGRRDTAQRLAEGWNRADGLFGALDARRLPR 526
>gi|15827918|ref|NP_302181.1| hypothetical protein ML1714 [Mycobacterium leprae TN]
gi|221230395|ref|YP_002503811.1| hypothetical protein MLBr_01714 [Mycobacterium leprae Br4923]
gi|13093471|emb|CAC30667.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933502|emb|CAR71809.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=522
Score = 920 bits (2379), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/526 (87%), Positives = 483/526 (92%), Gaps = 4/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSA+PVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVL LADENRH
Sbjct 1 MNTSANPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLHTLADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR +SK P+C P+AL
Sbjct 61 RLITLGVTPVVNAQLDDPYCLDGMHHWLANWRLRATEAASVRSGSESK----PACAPQAL 116
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFG REC +A RAL+ FAT WRHGGSPLLR LIDAGTVELLGGPLAHPFQPL+APRLRE
Sbjct 117 RAFGARECIEAQRALEYFATLWRHGGSPLLRSLIDAGTVELLGGPLAHPFQPLIAPRLRE 176
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FAL EGL DA LRLAHRP GIWAPECAYAPGME DY+ AG++HFMVDGPSLHGDTALGRP
Sbjct 177 FALHEGLDDAWLRLAHRPTGIWAPECAYAPGMEHDYSAAGITHFMVDGPSLHGDTALGRP 236
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG T VVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTG NVPSE KA
Sbjct 237 VGDTAVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGCNVPSEAKA 296
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDP+ AD+ VD HV DFV VVRNRL +ESERIGRPAHV+AAFDTELFGHWWYEGP WLQ
Sbjct 297 PYDPDHADKVVDAHVDDFVGVVRNRLFTESERIGRPAHVVAAFDTELFGHWWYEGPIWLQ 356
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALP AGVRVGTLSDA+A GFVG+ V LPPSSWGSGKDWQVW+G KVADLVQLN+EV
Sbjct 357 RVLRALPTAGVRVGTLSDALAGGFVGNTVTLPPSSWGSGKDWQVWAGDKVADLVQLNNEV 416
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTT+DK LAQT SLDG LPR+HVADQ+LRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 417 VDTALTTVDKVLAQTTSLDGLLPRNHVADQLLRETLLTVSSDWPFMVSKDSAADYARYRA 476
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALA+GRRDTA +LA+GWNRADGLFGALDARRLP+
Sbjct 477 HLHAHATREIAGALASGRRDTAAQLADGWNRADGLFGALDARRLPR 522
>gi|254776299|ref|ZP_05217815.1| glycosyl hydrolase, family protein 57 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=534
Score = 918 bits (2372), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/534 (87%), Positives = 492/534 (93%), Gaps = 8/534 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N+S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLL+VL LA E+R
Sbjct 1 MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRVLDTLAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR A +SKSA Y +CTPEAL
Sbjct 61 GLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAASVRSAPRSKSAGYQACTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE A+A RALD+FATRWRHGGSPLLR L+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIRESAEAERALDDFATRWRHGGSPLLRRLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHR----PKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTA 236
FALREGLADA LRL R P GIWAPECAYAPG+E DYA AGV+HFMVDGPSLHGDTA
Sbjct 181 FALREGLADAALRLGARKGRGPGGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTA 240
Query 237 LGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPS 296
LGRPVG TDVVAFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDHLTGLKPARVTGRNVPS
Sbjct 241 LGRPVGDTDVVAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS 300
Query 297 EQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGP 356
E KAPYDP+RAD AVD+HVADFVDVVRNRL SESERIGRPAHV+AAFDTELFGHWWYEGP
Sbjct 301 EGKAPYDPDRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTELFGHWWYEGP 360
Query 357 TWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQL 416
TWL RVLRALP AGVRVGTL DAIA GFVGDPV+LPPSSWGSGKDWQVW+G +VADLVQL
Sbjct 361 TWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSWGSGKDWQVWAGDQVADLVQL 420
Query 417 NSEVVDTALTTIDKALAQT----ASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSA 472
NSEVVDTAL+T+DKAL+Q A+LDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSA
Sbjct 421 NSEVVDTALSTVDKALSQAGSQPAALDGPVPRDHVADQILRETLLTVSSDWPFMVSKDSA 480
Query 473 ADYARYRAHLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
ADYARYRAHLHAHATREIAGALA+GRRDTA+RLA+GWNRADGLFGALDARRLP+
Sbjct 481 ADYARYRAHLHAHATREIAGALASGRRDTAQRLADGWNRADGLFGALDARRLPR 534
>gi|41409161|ref|NP_961997.1| hypothetical protein MAP3063 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397981|gb|AAS05611.1| hypothetical protein MAP_3063 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=534
Score = 917 bits (2369), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/534 (87%), Positives = 491/534 (92%), Gaps = 8/534 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N+S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLL+VL LA E+R
Sbjct 1 MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRVLDTLAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR A +SKSA Y +CTPEAL
Sbjct 61 GLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAASVRSAPRSKSAGYQACTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE A+A RALD+FATRWRHGGSPLLR L+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIRESAEAERALDDFATRWRHGGSPLLRRLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHR----PKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTA 236
FALREGLADA LRL R P GIWAPECAYAPG+E DYA AGV+HFMVDGPSLHGDTA
Sbjct 181 FALREGLADAALRLGARKGRGPGGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTA 240
Query 237 LGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPS 296
LGRPVG TDVVAFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDHLTGLKPARVTGRNVPS
Sbjct 241 LGRPVGDTDVVAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS 300
Query 297 EQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGP 356
E KAPYDP+RAD AVD+HVADFVDVVRNRL SESERIGRPAHV+AAFDTELFGHWWYEGP
Sbjct 301 ESKAPYDPQRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTELFGHWWYEGP 360
Query 357 TWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQL 416
TWL RVLRALP AGVRVGTL DAIA GFVGDPV+LPPSSWGSGKDWQVW+G +VADLVQL
Sbjct 361 TWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSWGSGKDWQVWAGDQVADLVQL 420
Query 417 NSEVVDTALTTIDKALAQT----ASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSA 472
NSEVVDTAL+T+DKAL+Q A+LDGP+PRD VADQILRETLLTVSSDWPFMVSKDSA
Sbjct 421 NSEVVDTALSTVDKALSQAGSQPAALDGPVPRDRVADQILRETLLTVSSDWPFMVSKDSA 480
Query 473 ADYARYRAHLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
ADYARYRAHLHAHATREIAGALA+GRRDTA+RLA+GWNRADGLFGALDARRLP+
Sbjct 481 ADYARYRAHLHAHATREIAGALASGRRDTAQRLADGWNRADGLFGALDARRLPR 534
>gi|118464392|ref|YP_883040.1| glycosyl hydrolase, family protein 57 [Mycobacterium avium 104]
gi|118165679|gb|ABK66576.1| glycosyl hydrolase, family protein 57 [Mycobacterium avium 104]
Length=534
Score = 916 bits (2367), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/534 (87%), Positives = 491/534 (92%), Gaps = 8/534 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N+S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLL+VL LA E+R
Sbjct 1 MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRVLDTLAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR A +SKSA Y +CTPEAL
Sbjct 61 GLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAASVRSAPRSKSAGYQACTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE A+A RALD+FATRWRHGGSPLLR L+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIRESAEAERALDDFATRWRHGGSPLLRRLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHR----PKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTA 236
FALREGLADA LRL R P GIWAPECAYAPG+E DYA AGV+HFMVDGPSLHGDTA
Sbjct 181 FALREGLADAALRLGARKGRGPGGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTA 240
Query 237 LGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPS 296
LGRPVG TDVVAFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDHLTGLKPARVTGRNVPS
Sbjct 241 LGRPVGDTDVVAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS 300
Query 297 EQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGP 356
E KAPYDP+RAD AVD+HVADFVDVVRNRL SESERIGRPAHV+AAFDTELFGHWWYEGP
Sbjct 301 EGKAPYDPDRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTELFGHWWYEGP 360
Query 357 TWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQL 416
WL RVLRALP AGVRVGTL DAIA GFVGDPV+LPPSSWGSGKDWQVW+G +VADLVQL
Sbjct 361 RWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSWGSGKDWQVWAGDQVADLVQL 420
Query 417 NSEVVDTALTTIDKALAQT----ASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSA 472
NSEVVDTAL+T+DKAL+Q A+LDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSA
Sbjct 421 NSEVVDTALSTVDKALSQAGSQPAALDGPVPRDHVADQILRETLLTVSSDWPFMVSKDSA 480
Query 473 ADYARYRAHLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
ADYARYRAHLHAHATREIAGALA+GRRDTA+RLA+GWNRADGLFGALDARRLP+
Sbjct 481 ADYARYRAHLHAHATREIAGALASGRRDTAQRLADGWNRADGLFGALDARRLPR 534
>gi|254821747|ref|ZP_05226748.1| glycosyl hydrolase, family protein 57 [Mycobacterium intracellulare
ATCC 13950]
Length=526
Score = 914 bits (2363), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/526 (87%), Positives = 488/526 (93%), Gaps = 0/526 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N++ VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPL++VL LA E+R
Sbjct 1 MNSTPDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLMRVLNTLAGEDRQ 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
LITLG+TPVVNAQLDDPYCL G+HHWLANW+LRA EAASVR +SKSA Y SCTPEAL
Sbjct 61 GLITLGVTPVVNAQLDDPYCLGGMHHWLANWRLRAAEAASVRSIPRSKSAGYQSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE A+A AL++FAT WRHGGS LRGL+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIREEAEAENALEDFATLWRHGGSAPLRGLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADA RLAH P GIWAPECAYAPG+E DYA AGV+HFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAGQRLAHTPTGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG TDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSE KA
Sbjct 241 VGDTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEGKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERAD AVD+HVADFV+ VRNRL SES RIGRPAHVIAAFDTELFGHWWYEGPTWL+
Sbjct 301 PYDPERADHAVDIHVADFVETVRNRLTSESARIGRPAHVIAAFDTELFGHWWYEGPTWLE 360
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALP AGVRVGTL+DAIADGFVG PV LPPSSWGSGKDWQVW+G +VADLVQLN+EV
Sbjct 361 RVLRALPEAGVRVGTLTDAIADGFVGSPVALPPSSWGSGKDWQVWAGDQVADLVQLNNEV 420
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTAL+T+DKAL+QTA+LDGP+PRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
Sbjct 421 VDTALSTVDKALSQTATLDGPIPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALA+GRRDTA+RLAEGWNRADGLFGALDARRLP+
Sbjct 481 HLHAHATREIAGALASGRRDTAQRLAEGWNRADGLFGALDARRLPR 526
>gi|336459159|gb|EGO38106.1| hypothetical protein MAPs_05970 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=534
Score = 913 bits (2360), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/534 (86%), Positives = 490/534 (92%), Gaps = 8/534 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+N+S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLL+VL LA E+R
Sbjct 1 MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRVLDTLAGEDRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR A +SKSA Y +CTPEAL
Sbjct 61 GLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAASVRSAPRSKSAGYQACTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RA GIRE A+A RALD+FATRWRHGGSPLLR L+DAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RALGIRESAEAERALDDFATRWRHGGSPLLRRLLDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHR----PKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTA 236
FALREGLADA LRL R P GIWAPECAYAPG+E YA AGV+HFMVDGPSLHGDTA
Sbjct 181 FALREGLADAALRLGARKGRGPGGIWAPECAYAPGLEHYYAAAGVTHFMVDGPSLHGDTA 240
Query 237 LGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPS 296
LGRPVG TDVVAFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDHLTGLKPARVTGRNVPS
Sbjct 241 LGRPVGDTDVVAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS 300
Query 297 EQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGP 356
E KAPYDP+RAD AVD+HVADFVDVVRNRL SESERIGRPAHV+AAFDTELFGHWWYEGP
Sbjct 301 ESKAPYDPQRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTELFGHWWYEGP 360
Query 357 TWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQL 416
TWL RVLRALP AGVRVGTL DAIA GFVGDPV+LPPSSWGSGKDWQVW+G +VADLVQL
Sbjct 361 TWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSWGSGKDWQVWAGDQVADLVQL 420
Query 417 NSEVVDTALTTIDKALAQT----ASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSA 472
NSEVVDTAL+T+DKAL+Q A+LDGP+PRD VADQILRETLLTVSSDWPFMVSKDSA
Sbjct 421 NSEVVDTALSTVDKALSQAGSQPAALDGPVPRDRVADQILRETLLTVSSDWPFMVSKDSA 480
Query 473 ADYARYRAHLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
ADYARYRAHLHAHATREIAGALA+GRRDTA+RLA+GWNRADGLFGALDARRLP+
Sbjct 481 ADYARYRAHLHAHATREIAGALASGRRDTAQRLADGWNRADGLFGALDARRLPR 534
>gi|108798836|ref|YP_639033.1| glycoside hydrolase family protein [Mycobacterium sp. MCS]
gi|119867951|ref|YP_937903.1| glycoside hydrolase family protein [Mycobacterium sp. KMS]
gi|108769255|gb|ABG07977.1| (1->4)-alpha-D-glucan branching enzyme [Mycobacterium sp.
MCS]
gi|119694040|gb|ABL91113.1| (1->4)-alpha-D-glucan branching enzyme [Mycobacterium sp.
KMS]
Length=521
Score = 862 bits (2226), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/519 (82%), Positives = 459/519 (89%), Gaps = 4/519 (0%)
Query 8 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 67
VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+A+YLPL++VL LA E R L+TLGM
Sbjct 7 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSASYLPLMRVLRRLAGEGRDHLLTLGM 66
Query 68 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 127
TPVV AQLDDPYCL G+H WLANWQLRA EAA++R + S P+CTPEALRAFG+RE
Sbjct 67 TPVVTAQLDDPYCLTGMHSWLANWQLRALEAATLR----ASSDTTPACTPEALRAFGVRE 122
Query 128 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 187
+A AL+ FAT WRHGGSPLLR L+DAGTVELLGGPLAHPFQPLL PRLREFALREGL
Sbjct 123 QGEAELALEEFATLWRHGGSPLLRELVDAGTVELLGGPLAHPFQPLLNPRLREFALREGL 182
Query 188 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 247
ADA R AH P+GIWAPECAYAPGME DYA AGV HFMVDGPSLHGDTALGRPVG + VV
Sbjct 183 ADAGQRFAHTPRGIWAPECAYAPGMEADYAAAGVGHFMVDGPSLHGDTALGRPVGHSGVV 242
Query 248 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 307
AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDH+TGLKPARVTGRNVPS KAPY+P+RA
Sbjct 243 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHVTGLKPARVTGRNVPSSAKAPYEPDRA 302
Query 308 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 367
D A+D HVADFV VVR RL ESERIGRPAHV+AAFDTELFGHWWYEGP WL RVLRALP
Sbjct 303 DAAIDAHVADFVQVVRRRLTDESERIGRPAHVVAAFDTELFGHWWYEGPEWLARVLRALP 362
Query 368 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 427
AGVRVGTLSDA+ GFVG PV+LPPSSWGSGKDWQVW+G +V D V+LN+EVVDTAL+T
Sbjct 363 EAGVRVGTLSDAVDGGFVGAPVDLPPSSWGSGKDWQVWAGDQVTDFVRLNAEVVDTALST 422
Query 428 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 487
+DKAL Q AS+ P PRD VADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT
Sbjct 423 VDKALTQRASVGSPTPRDTVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 482
Query 488 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
REIA ALAAGRR+ A+RLA+GWNRADGLFGALDARRLP+
Sbjct 483 REIADALAAGRREQAQRLADGWNRADGLFGALDARRLPR 521
>gi|126434436|ref|YP_001070127.1| glycoside hydrolase family protein [Mycobacterium sp. JLS]
gi|126234236|gb|ABN97636.1| (1->4)-alpha-D-glucan branching enzyme [Mycobacterium sp.
JLS]
Length=521
Score = 860 bits (2222), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/519 (82%), Positives = 458/519 (89%), Gaps = 4/519 (0%)
Query 8 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 67
VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+A+YLPL +VL LA E R L+TLGM
Sbjct 7 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSASYLPLTRVLRRLAGEGRDHLLTLGM 66
Query 68 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 127
TPVV AQLDDPYCL G+H WLANWQLRA EAA++R + S P+CTPEALRAFG+RE
Sbjct 67 TPVVTAQLDDPYCLTGMHSWLANWQLRALEAATLR----ASSDTTPACTPEALRAFGVRE 122
Query 128 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 187
+A AL+ FAT WRHGGSPLLR L+DAGTVELLGGPLAHPFQPLL PRLREFALREGL
Sbjct 123 QGEAELALEEFATLWRHGGSPLLRELVDAGTVELLGGPLAHPFQPLLNPRLREFALREGL 182
Query 188 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 247
ADA R AH P+GIWAPECAYAPGME DYA AGV HFMVDGPSLHGDTALGRPVG + VV
Sbjct 183 ADAGQRFAHTPRGIWAPECAYAPGMEADYAAAGVGHFMVDGPSLHGDTALGRPVGHSGVV 242
Query 248 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 307
AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDH+TGLKPARVTGRNVPS KAPY+P+RA
Sbjct 243 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHVTGLKPARVTGRNVPSSAKAPYEPDRA 302
Query 308 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 367
D A+D HVADFV VVR RL ESERIGRPAHV+AAFDTELFGHWWYEGP WL RVLRALP
Sbjct 303 DAAIDAHVADFVQVVRRRLTDESERIGRPAHVVAAFDTELFGHWWYEGPEWLARVLRALP 362
Query 368 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 427
AGVRVGTLSDA+ GFVG PV+LPPSSWGSGKDWQVW+G +V D V+LN+EVVDTAL+T
Sbjct 363 EAGVRVGTLSDAVDGGFVGAPVDLPPSSWGSGKDWQVWAGDQVTDFVRLNAEVVDTALST 422
Query 428 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 487
+DKAL Q AS+ P PRD VADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT
Sbjct 423 VDKALTQRASVGSPTPRDTVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 482
Query 488 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
REIA ALAAGRR+ A+RLA+GWNRADGLFGALDARRLP+
Sbjct 483 REIADALAAGRREQAQRLADGWNRADGLFGALDARRLPR 521
>gi|120403089|ref|YP_952918.1| glycoside hydrolase family protein [Mycobacterium vanbaalenii
PYR-1]
gi|119955907|gb|ABM12912.1| (1->4)-alpha-D-glucan branching enzyme [Mycobacterium vanbaalenii
PYR-1]
Length=532
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/521 (82%), Positives = 453/521 (87%), Gaps = 4/521 (0%)
Query 5 ASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLIT 64
A PVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+A YLPL++VL LA ENR LIT
Sbjct 14 AEPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSACYLPLVRVLRTLAAENRRHLIT 73
Query 65 LGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFG 124
LG+TPVV AQLDDPYCL G+ HWLANWQLRA EAA++R A S P+ P+ALR FG
Sbjct 74 LGVTPVVAAQLDDPYCLQGMQHWLANWQLRAVEAATMRTA----SGASPASEPKALRQFG 129
Query 125 IRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALR 184
RE +A R L F WRHG SP LR L+D +ELLGGPLAHPFQPLL PRLREFALR
Sbjct 130 SREHTEAERELAEFDALWRHGASPALRELLDGEVIELLGGPLAHPFQPLLNPRLREFALR 189
Query 185 EGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKT 244
EGLADA R AH P GIWAPECAY+PGME YA AGVSHFMVDGPSLHGDTALGRPVG +
Sbjct 190 EGLADAAHRFAHSPTGIWAPECAYSPGMEAGYADAGVSHFMVDGPSLHGDTALGRPVGDS 249
Query 245 DVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDP 304
VVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDH TGLKPARVTGR VPSE+KAPYDP
Sbjct 250 GVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHATGLKPARVTGRAVPSEEKAPYDP 309
Query 305 ERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLR 364
RAD AVDVHVADFV+ VR RL +ES RIGRPAHV+AAFDTELFGHWWYEGP WL+RVLR
Sbjct 310 ARADAAVDVHVADFVETVRQRLSAESARIGRPAHVVAAFDTELFGHWWYEGPVWLERVLR 369
Query 365 ALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTA 424
ALPAAGVRVGTLSDAIADGFVG PVELPPSSWGSGKDWQVWSG +VADLVQLNSEVVDTA
Sbjct 370 ALPAAGVRVGTLSDAIADGFVGSPVELPPSSWGSGKDWQVWSGEQVADLVQLNSEVVDTA 429
Query 425 LTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHA 484
L+T+DKALA+ SLD P PRD VADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHA
Sbjct 430 LSTVDKALARGGSLDAPTPRDFVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHA 489
Query 485 HATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
HATREI+ ALA+GR D A+RLAEGWNRADGLFGALDARRLP
Sbjct 490 HATREISAALASGRDDHAQRLAEGWNRADGLFGALDARRLP 530
>gi|118469272|ref|YP_886692.1| glycosyl hydrolase, family protein 57 [Mycobacterium smegmatis
str. MC2 155]
gi|118170559|gb|ABK71455.1| glycosyl hydrolase, family protein 57 [Mycobacterium smegmatis
str. MC2 155]
Length=514
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/525 (81%), Positives = 456/525 (87%), Gaps = 13/525 (2%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
+ SA PVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPL++VL LA E R
Sbjct 3 DASAEPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLVRVLRTLAAEGRSH 62
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
LI+LGMTPVV AQLDDPYCL G+HHWLANWQLRA EA ++R + LR
Sbjct 63 LISLGMTPVVTAQLDDPYCLTGMHHWLANWQLRALEATTLRDRK-------------GLR 109
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
FG E + AA A+++F T WRHG SPLLR LIDA T+ELLGGPL+HPFQPLL PRLREF
Sbjct 110 EFGSHELSLAAEAMEDFTTHWRHGASPLLRELIDAETIELLGGPLSHPFQPLLNPRLREF 169
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
ALREGLAD+Q R AH P GIWAPECAYAPGME Y AGV HFMVDGPSLHGDTALGRPV
Sbjct 170 ALREGLADSQHRFAHTPTGIWAPECAYAPGMETGYGAAGVGHFMVDGPSLHGDTALGRPV 229
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
G++DV+AFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDH TGLKPARVTGRNVPSEQKAP
Sbjct 230 GESDVIAFGRDLQVSYRVWSPKSGYPGHGAYRDFHTYDHTTGLKPARVTGRNVPSEQKAP 289
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
Y+PERADRAVDVHVADFV+VVR RLL ES+RIGRPAHV+AAFDTELFGHWW+EGP WL+R
Sbjct 290 YEPERADRAVDVHVADFVEVVRRRLLDESQRIGRPAHVVAAFDTELFGHWWHEGPVWLER 349
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLRALP AGVRVGTL+DA ADGFVG PVELPPSSWGSGKDWQVWSG KVADLVQLNSEVV
Sbjct 350 VLRALPQAGVRVGTLADAAADGFVGTPVELPPSSWGSGKDWQVWSGQKVADLVQLNSEVV 409
Query 422 DTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 481
D AL T+DKAL Q A++ P+ RD VADQILRETLLTVSSDWPFMVSKDSAA+YARYRAH
Sbjct 410 DNALATVDKALTQHATVGSPVTRDRVADQILRETLLTVSSDWPFMVSKDSAAEYARYRAH 469
Query 482 LHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
LHAHATREI+ ALAAGRR+ A RLA+GWN+ADGLFGALDARRLP+
Sbjct 470 LHAHATREISDALAAGRREQAERLADGWNKADGLFGALDARRLPR 514
>gi|145224840|ref|YP_001135518.1| glycoside hydrolase family protein [Mycobacterium gilvum PYR-GCK]
gi|145217326|gb|ABP46730.1| (1->4)-alpha-D-glucan branching enzyme [Mycobacterium gilvum
PYR-GCK]
Length=531
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/520 (81%), Positives = 457/520 (88%), Gaps = 3/520 (0%)
Query 6 SPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITL 65
S VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL++VL LA ENR LITL
Sbjct 13 SSVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLIRVLRTLAAENRRHLITL 72
Query 66 GMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGI 125
G+TPVV AQLDDPYCL G++HWLANWQLRA EA++ R S+ P+ P+ALR FG+
Sbjct 73 GITPVVAAQLDDPYCLQGMNHWLANWQLRALEASTTR---SSEPGAPPASQPQALRQFGV 129
Query 126 RECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALRE 185
RE +A R L F WRHG SP+ R L+D+ T+ELLGGPLAHPFQPLL PRLREFALRE
Sbjct 130 REYDEAERELGEFDALWRHGASPVFRELLDSQTIELLGGPLAHPFQPLLNPRLREFALRE 189
Query 186 GLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTD 245
GLADA R AHRP GIWAPECAYAPGME+ Y AGV+HFMVDGPSL GDT+LGRPVG +D
Sbjct 190 GLADAHARFAHRPSGIWAPECAYAPGMEIGYDDAGVTHFMVDGPSLRGDTSLGRPVGGSD 249
Query 246 VVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPE 305
VVAFGRDLQVSYRVWSPKSGYPGH+AYRDFHTYDH TGLKPARVTGRNVPS+ KAPYDP
Sbjct 250 VVAFGRDLQVSYRVWSPKSGYPGHSAYRDFHTYDHATGLKPARVTGRNVPSDAKAPYDPA 309
Query 306 RADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRA 365
RAD AVDVHVADFVD VR RL SESERIGRPAHV+AAFDTELFGHWWYEGP WL+RVLRA
Sbjct 310 RADAAVDVHVADFVDTVRRRLASESERIGRPAHVVAAFDTELFGHWWYEGPVWLERVLRA 369
Query 366 LPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTAL 425
LPAAGVRVGTLSDAIA+G+VG PV+LPPSSWGSGKDWQVWSG +VADLVQLNSEVVD AL
Sbjct 370 LPAAGVRVGTLSDAIAEGYVGAPVDLPPSSWGSGKDWQVWSGDQVADLVQLNSEVVDAAL 429
Query 426 TTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAH 485
+T+DKALA A+L P PRD VADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAH
Sbjct 430 STVDKALAHGAALGSPTPRDFVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAH 489
Query 486 ATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
ATREI+ ALA+GRR+ A++LA+GWNRADGLFGALDARRLP
Sbjct 490 ATREISDALASGRREHAQQLADGWNRADGLFGALDARRLP 529
>gi|315445171|ref|YP_004078050.1| hypothetical protein Mspyr1_36070 [Mycobacterium sp. Spyr1]
gi|315263474|gb|ADU00216.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=531
Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/518 (81%), Positives = 455/518 (88%), Gaps = 3/518 (0%)
Query 8 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 67
VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL++VL LA ENR LITLG+
Sbjct 15 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLIRVLRTLAAENRRHLITLGI 74
Query 68 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 127
TPVV AQLDDPYCL G++HWLANWQLRA EA++ R S+ P+ P+ALR FG+RE
Sbjct 75 TPVVAAQLDDPYCLQGMNHWLANWQLRALEASTTR---SSEPGAPPASQPQALRQFGVRE 131
Query 128 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 187
+A R L F WRHG SPL R L+D T+ELLGGPLAHPFQPLL PRLREFALREGL
Sbjct 132 YEEAERELGEFDALWRHGASPLFRELLDNQTIELLGGPLAHPFQPLLNPRLREFALREGL 191
Query 188 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 247
ADA R AHRP GIWAPECAYAPGME+ Y AGV+HFMVDGPSL GDT+LGRPVG +DVV
Sbjct 192 ADAHTRFAHRPSGIWAPECAYAPGMEIGYDDAGVTHFMVDGPSLRGDTSLGRPVGGSDVV 251
Query 248 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 307
AFGRDLQVSYRVWSPKSGYPGH+AYRDFHTYDH TGLKPARVTGRNVPS+ KAPYDP RA
Sbjct 252 AFGRDLQVSYRVWSPKSGYPGHSAYRDFHTYDHATGLKPARVTGRNVPSDAKAPYDPARA 311
Query 308 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 367
D AVDVHVADFVD VR RL +ESERIGRPAHV+AAFDTELFGHWWYEGP WL+RVLRALP
Sbjct 312 DAAVDVHVADFVDTVRRRLAAESERIGRPAHVVAAFDTELFGHWWYEGPVWLERVLRALP 371
Query 368 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 427
AAGVRVGTLSDAIA+G+VG PV+LPPSSWGSGKDWQVWSG +VADLVQLNSEVVD AL+T
Sbjct 372 AAGVRVGTLSDAIAEGYVGAPVDLPPSSWGSGKDWQVWSGDQVADLVQLNSEVVDAALST 431
Query 428 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 487
+DKALA A+L P PRD VADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT
Sbjct 432 VDKALAHGAALGSPTPRDFVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 491
Query 488 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
REI+ ALA+GRR+ A++LA+GWNRADGLFGALDARRLP
Sbjct 492 REISDALASGRREHAQQLADGWNRADGLFGALDARRLP 529
>gi|333991391|ref|YP_004524005.1| hypothetical protein JDM601_2751 [Mycobacterium sp. JDM601]
gi|333487359|gb|AEF36751.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=509
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/524 (79%), Positives = 443/524 (85%), Gaps = 16/524 (3%)
Query 3 TSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRL 62
T VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSW+AAYLPL++VL LA E R L
Sbjct 2 TEKPSVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWSAAYLPLMRVLRTLAAEGRRNL 61
Query 63 ITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRA 122
+TLG+TPVV AQLDDPYCL+G+HHWLANWQLRA +A ++ P+ LR
Sbjct 62 LTLGVTPVVAAQLDDPYCLDGMHHWLANWQLRATQATTL---------------PD-LRD 105
Query 123 FGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFA 182
FG ECA A +ALD+FA WRHG SPLLR L AGTVE+LGGPLAHPFQPLLAPRLREFA
Sbjct 106 FGRHECAKAEQALDDFALLWRHGASPLLRELTQAGTVEMLGGPLAHPFQPLLAPRLREFA 165
Query 183 LREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVG 242
LREGLAD + RLA RP GIWAPECAYAPGME YA AGV+HFMVDGPSLHGDTALGRPVG
Sbjct 166 LREGLADTRARLAQRPTGIWAPECAYAPGMEAGYAAAGVTHFMVDGPSLHGDTALGRPVG 225
Query 243 KTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPY 302
+ V+AFGRDLQVSYRVWSPKSGYPGH AYRDFHTYDH TGLKP+RVTGRNV E KAPY
Sbjct 226 DSGVIAFGRDLQVSYRVWSPKSGYPGHGAYRDFHTYDHRTGLKPSRVTGRNVEPEHKAPY 285
Query 303 DPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRV 362
DP+RA+ AVD HVADFV VVR RL ESERIGRPAHVIAAFDTELFGHWW+EGP WL RV
Sbjct 286 DPQRAEAAVDTHVADFVKVVRTRLQDESERIGRPAHVIAAFDTELFGHWWHEGPQWLARV 345
Query 363 LRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVD 422
LRALP AG+RVGTLSDAI GFVGD VELPPSSWGSGKDWQVWSG KVADLVQLNSEVV+
Sbjct 346 LRALPEAGIRVGTLSDAIDAGFVGDSVELPPSSWGSGKDWQVWSGEKVADLVQLNSEVVE 405
Query 423 TALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHL 482
TAL +DKA+A A+ GP PRD VADQILRE LLTVSSDWPFMVSKDSAA+YARYRA L
Sbjct 406 TALGAVDKAMAAAATPSGPSPRDRVADQILREALLTVSSDWPFMVSKDSAAEYARYRAQL 465
Query 483 HAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HAHATREIA ALA+G RD A RLAEGWNRADGLFGALDARRLP+
Sbjct 466 HAHATREIADALASGHRDAAERLAEGWNRADGLFGALDARRLPR 509
>gi|169630446|ref|YP_001704095.1| hypothetical protein MAB_3365 [Mycobacterium abscessus ATCC 19977]
gi|169242413|emb|CAM63441.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=520
Score = 807 bits (2084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/526 (76%), Positives = 439/526 (84%), Gaps = 6/526 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+ S PVPGLFT+VLHTHLPWLA+HGRWPVGEEWLYQSW+AAYLPL +VL LA E R
Sbjct 1 MTPSKKPVPGLFTMVLHTHLPWLANHGRWPVGEEWLYQSWSAAYLPLFKVLRTLAAEGRE 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPVV AQLDDP+CL G+H WLANWQLRA EA+++ + + + +PE L
Sbjct 61 NLLTLGITPVVAAQLDDPHCLTGLHSWLANWQLRAFEASTIS---STDAEPGTASSPEML 117
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFG+RE A ALD F T WRHGGS LR LIDA +ELLGGPLAHPFQPLL PRLRE
Sbjct 118 RAFGVREYQTATAALDEFDTYWRHGGSGPLRNLIDAKAIELLGGPLAHPFQPLLHPRLRE 177
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADA R H P GIWAPECAYAPGME YA AGV HFMVDGPSL GDT+LGR
Sbjct 178 FALREGLADAAQRFGHAPTGIWAPECAYAPGMEEGYAAAGVKHFMVDGPSLQGDTSLGRT 237
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG +DVVAFGRDL VSYRVWSPKSGYPGHAAYRDFHTYDH TGLKPARVTGRNVPSE KA
Sbjct 238 VGDSDVVAFGRDLAVSYRVWSPKSGYPGHAAYRDFHTYDHDTGLKPARVTGRNVPSESKA 297
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPERA+ A+D+HV DFVD VR RL+ ES RIGRPAHVIAAFDTEL+GHWWYEGPTWL+
Sbjct 298 PYDPERANAAIDIHVRDFVDTVRQRLVDESSRIGRPAHVIAAFDTELYGHWWYEGPTWLE 357
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLRALP AG++VGTL A G+VG+P +LP SSWGSGKDW VW+G KVADLVQLN+EV
Sbjct 358 RVLRALPEAGIQVGTLEQAREQGYVGEPFDLPASSWGSGKDWHVWNGEKVADLVQLNTEV 417
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALT +DKAL + + P R+ VADQILRE LLTVSSDWPFMVSKDSAADYARYRA
Sbjct 418 VDTALTAVDKALNEQPA---PHTRNRVADQILREALLTVSSDWPFMVSKDSAADYARYRA 474
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
HLHAHATREIAGALA+GR D A RLA+GWNRADGLFGALDARRLP+
Sbjct 475 HLHAHATREIAGALASGRHDVASRLADGWNRADGLFGALDARRLPR 520
>gi|229493420|ref|ZP_04387209.1| glycosyl hydrolase, family protein 57 [Rhodococcus erythropolis
SK121]
gi|229319736|gb|EEN85568.1| glycosyl hydrolase, family protein 57 [Rhodococcus erythropolis
SK121]
Length=519
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/525 (69%), Positives = 413/525 (79%), Gaps = 16/525 (3%)
Query 3 TSASPV--PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
++A+PV PG+F LVLH+HLPWLA+HGRWPVGEEW+YQSWAA+Y+PL L LADE R
Sbjct 2 SNATPVTEPGMFALVLHSHLPWLANHGRWPVGEEWIYQSWAASYIPLAAALRRLADEGRS 61
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPV+ AQLDDP+CL G+HHWL NWQ+RA EAA + A A
Sbjct 62 HLLTLGITPVLAAQLDDPHCLAGMHHWLGNWQIRAHEAAGMPDA--------------AH 107
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
R G RE +A AL +F T W+HG SP+ R LID ELLGGPLAHPFQPLL PRLR
Sbjct 108 RELGAREHRASAAALADFETHWQHGASPVFRDLIDREAFELLGGPLAHPFQPLLDPRLRA 167
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
F+LREGLADA R H P GIW PEC Y PGME YA AGV+HFMVDGP+L GDT+LGRP
Sbjct 168 FSLREGLADAHARWNHTPTGIWGPECGYTPGMERGYAEAGVTHFMVDGPALRGDTSLGRP 227
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
V ++DVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDH TGLKP+RVTGR V S K
Sbjct 228 VRESDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHETGLKPSRVTGRTVDSADKL 287
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDPE A AVD HVADFV+ VR RL SES RIGR A V+AAFDTELFGHWW+EGP WL+
Sbjct 288 PYDPELAAAAVDKHVADFVETVRARLRSESTRIGRDALVVAAFDTELFGHWWHEGPQWLE 347
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
++LRALP AG+RVGTL+DA A G+VG+PV+L SSWGSGKDW+VW+G +V+DLVQLNSEV
Sbjct 348 KLLRALPEAGIRVGTLADAKASGYVGEPVQLEDSSWGSGKDWRVWAGDQVSDLVQLNSEV 407
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTT+DK+ + + P R+ V DQ+LRETL+TVSSDW FMVSKDSAA YAR RA
Sbjct 408 VDTALTTVDKSRGRDNAPGRPELRNRVNDQVLRETLMTVSSDWAFMVSKDSAAGYARERA 467
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
H HAHATREI+ AL +GR A RLAEGWNRADGLF LDARRLP
Sbjct 468 HKHAHATREISEALVSGRDAVAERLAEGWNRADGLFPGLDARRLP 512
>gi|226305858|ref|YP_002765818.1| hypothetical protein RER_23710 [Rhodococcus erythropolis PR4]
gi|226184975|dbj|BAH33079.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=522
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/525 (69%), Positives = 413/525 (79%), Gaps = 16/525 (3%)
Query 3 TSASPV--PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
++A+PV PG+F LVLH+HLPWLA+HGRWPVGEEW+YQSWAA+Y+PL L LADE R
Sbjct 2 SNATPVTEPGMFALVLHSHLPWLANHGRWPVGEEWIYQSWAASYIPLAAALRRLADEGRS 61
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
L+TLG+TPV+ AQLDDP+CL G+HHWL NWQ+RA EAA + A A
Sbjct 62 HLLTLGITPVLAAQLDDPHCLAGMHHWLGNWQIRAHEAAGMPDA--------------AH 107
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
R G RE +A AL++F T W+HG SP+ R LID ELLGGPLAHPFQPLL PRLR
Sbjct 108 RELGAREHRASAAALEDFETHWQHGASPVFRDLIDREAFELLGGPLAHPFQPLLDPRLRA 167
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
F+LREGLADA R H P GIW PEC Y PGME YA AGV+HFMVDGP+L GDT+LGRP
Sbjct 168 FSLREGLADAHARWNHTPTGIWGPECGYTPGMERGYAEAGVTHFMVDGPALRGDTSLGRP 227
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
V ++DVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDH TGLKP+RVTGR V S K
Sbjct 228 VRESDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHETGLKPSRVTGRTVDSADKL 287
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PY+PE A AVD HVADFV+ VR RL SES RIGR A V+AAFDTELFGHWW+EGP WL+
Sbjct 288 PYNPELAAAAVDKHVADFVETVRARLRSESARIGRDALVVAAFDTELFGHWWHEGPQWLE 347
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
++LRALP AG+RVGTL+DA G+VG+PV+L SSWGSGKDW+VW+G +V+DLVQLNSEV
Sbjct 348 KLLRALPEAGIRVGTLADAKESGYVGEPVQLEDSSWGSGKDWRVWAGDQVSDLVQLNSEV 407
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
VDTALTT+DK+ + + P R+ V DQ+LRETL+TVSSDW FMVSKDSAA YAR RA
Sbjct 408 VDTALTTVDKSRGRDNAPGRPELRNRVNDQVLRETLMTVSSDWAFMVSKDSAAGYARERA 467
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
H HAHATREI+ AL +GR A RLAEGWNRADGLF LDARRLP
Sbjct 468 HKHAHATREISEALVSGRDAVAERLAEGWNRADGLFPGLDARRLP 512
>gi|312140520|ref|YP_004007856.1| glycosyl hydrolase family 57 [Rhodococcus equi 103S]
gi|311889859|emb|CBH49176.1| putative glycosyl hydrolase family 57 [Rhodococcus equi 103S]
Length=519
Score = 713 bits (1841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/517 (71%), Positives = 415/517 (81%), Gaps = 16/517 (3%)
Query 9 PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMT 68
PG+F LVLH+HLPWLA+HGRWPVGEEWLYQSWAA YLP+ VL LA E R++TLG+T
Sbjct 6 PGMFCLVLHSHLPWLANHGRWPVGEEWLYQSWAATYLPVTAVLRRLAAEGHSRMLTLGVT 65
Query 69 PVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIREC 128
PV+ AQLDDP+CL+G+HHWL NWQ+RA EAA + PS + + L G RE
Sbjct 66 PVLAAQLDDPHCLDGMHHWLGNWQIRAHEAAGM-----------PSASHKEL---GAREH 111
Query 129 ADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLA 188
+A AL +F TRWRHGGS +LR LIDA +ELLGGPLAHPFQPLL RLR F+L EGLA
Sbjct 112 RASAAALADFETRWRHGGSAVLRELIDAEAIELLGGPLAHPFQPLLDERLRAFSLTEGLA 171
Query 189 DAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVA 248
DA R H P GIWAPEC Y PGME YA+AGVSHFMVDGP+L GDT+ GRPV +DVVA
Sbjct 172 DAHARWGHTPAGIWAPECGYTPGMERGYASAGVSHFMVDGPALRGDTSAGRPVWGSDVVA 231
Query 249 FGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERAD 308
FGRDL+VSYRVWSPK+GYPGH AYRDFHTYDH TGLKPARVTGR+VPSE+KAPYDPE A
Sbjct 232 FGRDLEVSYRVWSPKTGYPGHPAYRDFHTYDHDTGLKPARVTGRSVPSEKKAPYDPESAA 291
Query 309 RAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPA 368
A+D HV DFV+ VR RL SES+RIG+ A V+AAFDTELFGHWWYEGP WL++VLRALP
Sbjct 292 AALDKHVDDFVETVRRRLRSESDRIGKGALVVAAFDTELFGHWWYEGPQWLEKVLRALPE 351
Query 369 AGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTI 428
AG+RVGTL+DA A G+VG+PVEL SSWGSGKDW+VW+G +V DLVQLN+EVV TAL TI
Sbjct 352 AGIRVGTLADARASGYVGEPVELQDSSWGSGKDWRVWAGDQVQDLVQLNAEVVQTALDTI 411
Query 429 DKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATR 488
DK ++ A+ P R+ V DQ+LRETL+TVSSDW FMVSKDSAA YAR RAH HAHA R
Sbjct 412 DK--SRDANPGRPELRNRVHDQMLRETLMTVSSDWAFMVSKDSAAGYARDRAHKHAHALR 469
Query 489 EIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
EIA A+A+GR D ARRLA+GWN ADGLF ALDARRLP
Sbjct 470 EIADAVASGRDDLARRLADGWNAADGLFPALDARRLP 506
>gi|325675860|ref|ZP_08155544.1| family 57 glycosyl hydrolase [Rhodococcus equi ATCC 33707]
gi|325553831|gb|EGD23509.1| family 57 glycosyl hydrolase [Rhodococcus equi ATCC 33707]
Length=527
Score = 713 bits (1840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/524 (70%), Positives = 416/524 (80%), Gaps = 16/524 (3%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
T S PG+F LVLH+HLPWLA+HGRWPVGEEWLYQSWAA YLP+ VL LA E R
Sbjct 7 ETVTSREPGMFCLVLHSHLPWLANHGRWPVGEEWLYQSWAATYLPVTAVLRRLAAEGHSR 66
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
++TLG+TPV+ AQLDDP+CL+G+HHWL NWQ+RA EAA + PS + + L
Sbjct 67 MLTLGVTPVLAAQLDDPHCLDGMHHWLGNWQIRAHEAAGM-----------PSASHKEL- 114
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
G RE +A AL +F TRWRHGGS +LR LIDA +ELLGGPLAHPFQPLL RLR F
Sbjct 115 --GAREHRASAAALADFETRWRHGGSAVLRELIDAEAIELLGGPLAHPFQPLLDERLRAF 172
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
+L EGLADA R H P GIWAPEC Y PGME YA+AGVSHFMVDGP+L GDT+ GRPV
Sbjct 173 SLTEGLADAHARWGHTPAGIWAPECGYTPGMERGYASAGVSHFMVDGPALRGDTSAGRPV 232
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
+DVVAFGRDL+VSYRVWSPK+GYPGH YRDFHTYDH TGLKPARVTGR+VPSE+KAP
Sbjct 233 WGSDVVAFGRDLEVSYRVWSPKTGYPGHPDYRDFHTYDHDTGLKPARVTGRSVPSEKKAP 292
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
YDPE A A+D HV DFV+ VR RL SES+RIG+ A V+AAFDTELFGHWWYEGP WL++
Sbjct 293 YDPESAAAALDKHVDDFVETVRRRLRSESDRIGKGALVVAAFDTELFGHWWYEGPQWLEK 352
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLRALP AG+RVGTL+DA A G+VG+PVEL SSWGSGKDW+VW+G +V DLVQLN+EVV
Sbjct 353 VLRALPEAGIRVGTLADARASGYVGEPVELQDSSWGSGKDWRVWAGDQVQDLVQLNAEVV 412
Query 422 DTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 481
TAL TIDK ++ A+ P R+ V DQ+LRETL+TVSSDW FMVSKDSAA YAR RAH
Sbjct 413 QTALDTIDK--SRDANPGRPELRNRVHDQMLRETLMTVSSDWAFMVSKDSAAGYARDRAH 470
Query 482 LHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
HAHA REIA A+A+GR D ARRLA+GWN ADGLF ALDARRLP
Sbjct 471 KHAHALREIADAVASGRDDLARRLADGWNAADGLFPALDARRLP 514
>gi|226365922|ref|YP_002783705.1| hypothetical protein ROP_65130 [Rhodococcus opacus B4]
gi|226244412|dbj|BAH54760.1| hypothetical protein [Rhodococcus opacus B4]
Length=532
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/517 (69%), Positives = 400/517 (78%), Gaps = 14/517 (2%)
Query 9 PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMT 68
PG+F LVLH+HLPWLA+HGRWPVGEEWLYQSWAA+YLPL +L L+DE R L+TLG+T
Sbjct 16 PGMFCLVLHSHLPWLANHGRWPVGEEWLYQSWAASYLPLTAMLRRLSDEGRSHLLTLGIT 75
Query 69 PVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIREC 128
PV+ AQLDDP+CL G+HHWL NWQ+RA EAA + +A R G RE
Sbjct 76 PVLAAQLDDPHCLAGMHHWLGNWQIRAHEAAGM--------------PDDAHRELGAREH 121
Query 129 ADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLA 188
+A AL +F RWRHGGSP+LR L+D ELLGGPLAHPFQPLL PRLR F+LREGLA
Sbjct 122 RASAAALADFELRWRHGGSPVLRELLDREAFELLGGPLAHPFQPLLDPRLRAFSLREGLA 181
Query 189 DAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVA 248
DA R P GIW PEC Y PGME YA AGV+HFMVDGP+L GDT+LGRPV +DVVA
Sbjct 182 DAAARWNCTPTGIWGPECGYTPGMETGYAEAGVTHFMVDGPALRGDTSLGRPVRDSDVVA 241
Query 249 FGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERAD 308
FGRDLQVSYRVWSPKSGYPGH AYRDFHTYDH TGLKPARVTGR V S KAPYDP A
Sbjct 242 FGRDLQVSYRVWSPKSGYPGHGAYRDFHTYDHATGLKPARVTGRTVESADKAPYDPALAT 301
Query 309 RAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPA 368
A D HVADFV VR RL ES RIGRPA V+AAFDTELFGHWW+EGP WL++VLRALP
Sbjct 302 AAADRHVADFVATVRRRLRDESARIGRPALVVAAFDTELFGHWWHEGPEWLEKVLRALPE 361
Query 369 AGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTI 428
AG+RVGTL DA G+VG+PV+L SSWGSGKDW+VW+G +V+DLVQLN+EVV TAL T+
Sbjct 362 AGIRVGTLDDARNQGYVGEPVQLENSSWGSGKDWRVWAGDQVSDLVQLNTEVVATALDTV 421
Query 429 DKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATR 488
DK A+ P R+ V DQ+LRE L+TVSSDW FMVSKDSAA YAR RAH HAHATR
Sbjct 422 DKYRDADAAPGRPALRNRVNDQMLREALMTVSSDWAFMVSKDSAAGYARDRAHKHAHATR 481
Query 489 EIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
EIA A++AG+ A RLAEGWNRADGLF LDARRLP
Sbjct 482 EIAAAVSAGKDAAASRLAEGWNRADGLFPGLDARRLP 518
>gi|111023421|ref|YP_706393.1| hypothetical protein RHA1_ro06460 [Rhodococcus jostii RHA1]
gi|110822951|gb|ABG98235.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=532
Score = 700 bits (1807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/517 (69%), Positives = 400/517 (78%), Gaps = 14/517 (2%)
Query 9 PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMT 68
PG+F LVLH+HLPWLA+HGRWPVGEEWLYQSWAA+YLPL +L L+DE R L+TLG+T
Sbjct 16 PGMFCLVLHSHLPWLANHGRWPVGEEWLYQSWAASYLPLTAMLRRLSDEGRSHLLTLGIT 75
Query 69 PVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIREC 128
PV+ AQLDDP+CL G+HHWL NWQ+RA EAA + +A R G RE
Sbjct 76 PVLAAQLDDPHCLAGMHHWLGNWQIRAHEAAGM--------------PDDAHRDLGAREH 121
Query 129 ADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLA 188
+A AL +F RWRHGGSP+LR L+D ELLGGPLAHPFQPLL PRLR F+L EGLA
Sbjct 122 RASAAALADFELRWRHGGSPVLRDLLDREAFELLGGPLAHPFQPLLDPRLRAFSLHEGLA 181
Query 189 DAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVA 248
DA R P GIW PEC Y PGME YA AGV+HFMVDGP+L GDT+LGRPV ++DVVA
Sbjct 182 DAAARWYCTPTGIWGPECGYTPGMETGYAEAGVTHFMVDGPALRGDTSLGRPVRESDVVA 241
Query 249 FGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERAD 308
FGRDLQVSYRVWSPKSGYPGH AYRDFHTYDH TGLKPARVTGR V S KAPYDP A
Sbjct 242 FGRDLQVSYRVWSPKSGYPGHGAYRDFHTYDHATGLKPARVTGRTVDSADKAPYDPALAT 301
Query 309 RAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPA 368
AVD HVADFV VR RL ES+R GRPA V+AAFDTELFGHWW+EGP WL++VLRALP
Sbjct 302 AAVDRHVADFVATVRKRLRDESQRNGRPALVVAAFDTELFGHWWHEGPEWLEKVLRALPE 361
Query 369 AGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTI 428
AG+RVGTL DA G+VG+PV+L SSWGSGKDW+VW+G +V+DLVQLN+EVV TAL T+
Sbjct 362 AGIRVGTLDDARRQGYVGEPVQLENSSWGSGKDWRVWAGDQVSDLVQLNAEVVATALDTV 421
Query 429 DKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATR 488
DK + P R+ V DQILRE L+TVSSDW FMVSKDSAA YAR RAH HAHATR
Sbjct 422 DKTHDADTAPGRPALRNRVNDQILREALMTVSSDWAFMVSKDSAAGYARDRAHKHAHATR 481
Query 489 EIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
EIA A++AG+ TA RLAEGWN ADGLF LDARRLP
Sbjct 482 EIAAAVSAGKDATANRLAEGWNHADGLFPGLDARRLP 518
>gi|289751702|ref|ZP_06511080.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289692289|gb|EFD59718.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=340
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/340 (99%), Positives = 340/340 (100%), Gaps = 0/340 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVI 340
PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVI
Sbjct 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVI 340
>gi|326381510|ref|ZP_08203204.1| hypothetical protein SCNU_01125 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199757|gb|EGD56937.1| hypothetical protein SCNU_01125 [Gordonia neofelifaecis NRRL
B-59395]
Length=512
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/523 (65%), Positives = 396/523 (76%), Gaps = 19/523 (3%)
Query 4 SASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLI 63
+ VPG FTLVLH+HLPWLA+H RWPVGEEWLYQSWA Y P+ L LAD+ ++
Sbjct 2 TVDKVPGQFTLVLHSHLPWLANHSRWPVGEEWLYQSWAHCYQPVFAALRRLADDGFSDVL 61
Query 64 TLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEA-LRA 122
+LG+TPV+ AQLDDP+C + ++ WLA+WQLRA EAA PE R
Sbjct 62 SLGVTPVLAAQLDDPHCASSMYEWLADWQLRAMEAA---------------IAPEPDRRR 106
Query 123 FGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFA 182
G RE AA AL +F T+WRHGGSP +R L+ +G +ELLGGPLAHPFQPLL RLR+F+
Sbjct 107 MGHREFRSAAAALHDFETQWRHGGSPQIRPLVSSGIIELLGGPLAHPFQPLLDSRLRQFS 166
Query 183 LREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVG 242
L EGLADA+LR RP GIWAPECAY PGME +Y AGV HFMVDGP+LHGDTALGRPVG
Sbjct 167 LSEGLADAELRWGTRPTGIWAPECAYTPGMEQEYRAAGVGHFMVDGPTLHGDTALGRPVG 226
Query 243 KTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPY 302
+ VVAFGRDL VSYRVWSPKSGYPGHAAYRDFHTYDH TG KP RVTG+NVP E K PY
Sbjct 227 DSGVVAFGRDLTVSYRVWSPKSGYPGHAAYRDFHTYDHQTGFKPVRVTGKNVPGEAKKPY 286
Query 303 DPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRV 362
DP D +D HV DF+D +R RL+SESERIGRPAHV+AAFDTEL+GHWW+EGP WL+R+
Sbjct 287 DPSLVDPVIDKHVDDFIDCIRARLISESERIGRPAHVVAAFDTELYGHWWHEGPIWLERL 346
Query 363 LRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVD 422
+R LP AGV VG+L+ A +G+VG+PVEL SSWGSGKDW+VW G +V LVQLNSEV D
Sbjct 347 MRRLPEAGVTVGSLATARRNGYVGEPVELDDSSWGSGKDWRVWEGPQVQHLVQLNSEVTD 406
Query 423 TALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHL 482
TAL ++DK + RD +ADQILRETL+TV SDWPFM+SKD+AA YA+ RA+
Sbjct 407 TALDSLDKRRGHAPGVPA---RDGIADQILRETLMTVQSDWPFMISKDTAAGYAQDRAYK 463
Query 483 HAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
HAHATREI A AG+ AR+LA+GW AD LFGALDARRLP
Sbjct 464 HAHATREICAAADAGKEGRARQLADGWRHADNLFGALDARRLP 506
>gi|296393812|ref|YP_003658696.1| hypothetical protein Srot_1401 [Segniliparus rotundus DSM 44985]
gi|296180959|gb|ADG97865.1| Domain of unknown function DUF1957 [Segniliparus rotundus DSM
44985]
Length=522
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/517 (66%), Positives = 399/517 (78%), Gaps = 8/517 (1%)
Query 8 VPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGM 67
VPG+F+LVLHTHLPW+ HHGRWPVGEEWLYQ+WA +YLPL +L L + R L++L +
Sbjct 11 VPGMFSLVLHTHLPWVVHHGRWPVGEEWLYQAWAQSYLPLFSLLRRLGERGRANLLSLSL 70
Query 68 TPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRE 127
TPV+ AQLDDP L G+ HWLANW LRA+EAA V+ + + + +PEALR G E
Sbjct 71 TPVLAAQLDDPSALAGMRHWLANWALRADEAAVVQ---STGARPGTASSPEALRRLGAHE 127
Query 128 CADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGL 187
+A AL F + W HGGSP++R L+D+G +ELLGGPLAH F PLL PRLREFALREGL
Sbjct 128 RREAGAALAEFESNWLHGGSPVVRDLVDSGVIELLGGPLAHSFSPLLHPRLREFALREGL 187
Query 188 ADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVV 247
ADAQ R RP GIWAPECA+ PGM +YA AGV F++DGP++HG+T L RPV TDV+
Sbjct 188 ADAQARGLGRPGGIWAPECAFTPGMGAEYAQAGVRRFLLDGPAMHGETTLARPVEGTDVL 247
Query 248 AFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERA 307
AFGRDL V+YRVWSPKSGYPG AYRDFHTYDHL GLKPARVTG +VP E+KAPYDP RA
Sbjct 248 AFGRDLAVTYRVWSPKSGYPGDGAYRDFHTYDHLVGLKPARVTGHHVPGEEKAPYDPARA 307
Query 308 DRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALP 367
AV V DFV VR RL +ES R GRPA V+AAFDTELFGHWW+EGP WL++VL ALP
Sbjct 308 QIAVRRDVDDFVQTVRERLSAESARTGRPALVVAAFDTELFGHWWHEGPQWLEQVLTALP 367
Query 368 AAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT 427
AAGVRVGTL++A A+GFVG+ ++ SWGSGKDW+VWSG +VADLV+LN++V +TAL T
Sbjct 368 AAGVRVGTLAEAAANGFVGERLDPADCSWGSGKDWRVWSGPQVADLVELNADVAETALRT 427
Query 428 IDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHAT 487
+DK LA T L RD V+DQILRETLLTV+SDW F+VSKD+AA YAR RAHLHAHA
Sbjct 428 VDKRLAGTQ-----LFRDRVSDQILRETLLTVASDWAFLVSKDTAAQYARDRAHLHAHAV 482
Query 488 REIAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
REIA A +G A RLA+GWN AD LFG LDARRL
Sbjct 483 REIAQAAESGHEGKAVRLAQGWNIADSLFGQLDARRL 519
>gi|262203152|ref|YP_003274360.1| hypothetical protein Gbro_3262 [Gordonia bronchialis DSM 43247]
gi|262086499|gb|ACY22467.1| Domain of unknown function DUF1957 [Gordonia bronchialis DSM
43247]
Length=522
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/520 (68%), Positives = 397/520 (77%), Gaps = 21/520 (4%)
Query 6 SPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITL 65
S VPG FTLVLH+HLPWLA+HGRWPVGEEWLYQSWAA+YLP+++VL LAD+ H ++L
Sbjct 7 SAVPGQFTLVLHSHLPWLANHGRWPVGEEWLYQSWAASYLPVVEVLNRLADDGFHDQLSL 66
Query 66 GMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRA-FG 124
G+TPV+ AQLDDP+C +H WLA+WQLRA A + +P+ +RA G
Sbjct 67 GITPVLAAQLDDPHCCASMHTWLADWQLRACAA---------------TASPDPVRAEAG 111
Query 125 IRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALR 184
RE AA ALD+F TRWRHG +P++R L AG +E+LGGPLAHPF PLL PRLR F+L
Sbjct 112 RREFVAAATALDDFETRWRHGAAPVIRALAGAGVIEVLGGPLAHPFAPLLDPRLRRFSLS 171
Query 185 EGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKT 244
EGLADA+ R P GIWAPECA+APGME +Y AGV HFMVDGP+LHGDTALGRPVG T
Sbjct 172 EGLADARARWGFDPAGIWAPECAFAPGMEDEYQRAGVRHFMVDGPTLHGDTALGRPVGDT 231
Query 245 DVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDP 304
DVVA+GRDL VSYRVWSPKSGYPGHAAYRDFHTYDH TGLK +RVTGR+VP + K PYDP
Sbjct 232 DVVAYGRDLSVSYRVWSPKSGYPGHAAYRDFHTYDHETGLKSSRVTGRSVPGDAKKPYDP 291
Query 305 ERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLR 364
+R D +D HV DFVD VR RL +ESERIGRPA V+AAFDTELFGHWW+EGP WL+RVLR
Sbjct 292 DRVDAVIDRHVDDFVDHVRRRLQAESERIGRPALVVAAFDTELFGHWWFEGPRWLERVLR 351
Query 365 ALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTA 424
LP AGVRVGTL+ A +GFVGD V LP SSWGSGKDW+VW G +V LV LN+EV +TA
Sbjct 352 RLPEAGVRVGTLAQAAQNGFVGDAVTLPASSWGSGKDWRVWDGPQVRHLVDLNAEVTETA 411
Query 425 LTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHA 484
L T+ K L G R VADQILRETLLTV+SDWPFMVSKD+AA YA RAH HA
Sbjct 412 LDTVTKLLEG-----GRGERHRVADQILRETLLTVASDWPFMVSKDTAAHYAVQRAHTHA 466
Query 485 HATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
HATREI A GR A RLA GW RAD LF LDARRL
Sbjct 467 HATREICDAAMRGRDADAARLAAGWQRADNLFAGLDARRL 506
>gi|343926882|ref|ZP_08766375.1| hypothetical protein GOALK_072_01040 [Gordonia alkanivorans NBRC
16433]
gi|343763242|dbj|GAA13301.1| hypothetical protein GOALK_072_01040 [Gordonia alkanivorans NBRC
16433]
Length=517
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/524 (65%), Positives = 388/524 (75%), Gaps = 20/524 (3%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+ S VPG FT VLH+HLPWLA+HGRWPVGEEWLYQSWA++Y+PLL VL LAD+
Sbjct 1 MTISTPEVPGQFTFVLHSHLPWLANHGRWPVGEEWLYQSWASSYIPLLDVLERLADDGLT 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
+TLG+TPV+ AQLDDP+C + ++ WLA+WQLRA EAA+ S AD
Sbjct 61 DQLTLGITPVLAAQLDDPHCASSMYEWLADWQLRATEAAT------SNDADRADA----- 109
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
G RE +A RAL+ F T+WRHG SPL+R + +G E+LGGPLAHPF PLL PRLR
Sbjct 110 ---GRREYLEAQRALETFETKWRHGASPLIRRVASSGVAEILGGPLAHPFAPLLDPRLRT 166
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
F LREGL DA R P GIWAPECA+APGME +YA AGV HFMVDGP+LHGDTALGRP
Sbjct 167 FFLREGLHDAHARWGFDPAGIWAPECAFAPGMEAEYARAGVGHFMVDGPTLHGDTALGRP 226
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG + V+A+GRDL VSYRVWSPKSGYPGHAAYRDFHTYDH +GLK RVTGR+VP ++K
Sbjct 227 VGDSGVIAYGRDLTVSYRVWSPKSGYPGHAAYRDFHTYDHESGLKRFRVTGRSVPGDEKK 286
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDP D +D HV DFV VR RL+ ES RIGR A V+AAFDTELFGHWW+EGP WL+
Sbjct 287 PYDPAHVDAVIDRHVEDFVGHVRERLIEESTRIGRDALVVAAFDTELFGHWWHEGPIWLE 346
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
RVLR LP GV+VGTL+ A G VG+PV+LP SSWGSGKDW+VW+G +V LV LN EV
Sbjct 347 RVLRRLPEVGVKVGTLAGATRSGLVGEPVDLPASSWGSGKDWRVWNGPQVQHLVTLNEEV 406
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
+TAL + K LA G R VADQI+RETLLTVSSDWPFMVSKD+AA YA RA
Sbjct 407 TETALDAVAKLLA------GSGERSRVADQIIRETLLTVSSDWPFMVSKDTAAQYAVSRA 460
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
H HAHATREI+ A GR D A LA+ W RAD LF ALDARRL
Sbjct 461 HTHAHATREISDAALRGRDDAAATLAQNWARADNLFPALDARRL 504
>gi|54026252|ref|YP_120494.1| hypothetical protein nfa42810 [Nocardia farcinica IFM 10152]
gi|54017760|dbj|BAD59130.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=536
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/524 (67%), Positives = 393/524 (75%), Gaps = 19/524 (3%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
+ S+ VPG FTLVLH+HLPWLAHHGRWPVGEEWLYQSWAA+YLP+++VL LA E R
Sbjct 3 SVSSQQVPGQFTLVLHSHLPWLAHHGRWPVGEEWLYQSWAASYLPVVEVLRTLAAEGRSH 62
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
L+TLG+TPV+ AQLDDP+CL G+HHWL NWQLRA+EA++
Sbjct 63 LLTLGITPVLAAQLDDPHCLAGMHHWLGNWQLRADEASAAGNV----------------- 105
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
A G E A AL F WRHG +P+ R L+DA VELLGGPLAHPFQPLL PRLR F
Sbjct 106 ALGRHEHRLATAALAEFEQHWRHGAAPVWRSLVDAEVVELLGGPLAHPFQPLLHPRLRAF 165
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
L EGLADA+ R RP GIWAPEC Y PGME YA AGV+HF+VDGP+L GD++LGRPV
Sbjct 166 QLSEGLADARHRWGTRPTGIWAPECGYTPGMERGYAAAGVTHFLVDGPALRGDSSLGRPV 225
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
+DVVAFGRDL VSYRVWSPKSGYPGHAAYRDFH YDH TGLKP+RVTG+ V KAP
Sbjct 226 HDSDVVAFGRDLHVSYRVWSPKSGYPGHAAYRDFHHYDHATGLKPSRVTGKTVAGPDKAP 285
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
YDPE A AV V DFV VR RL++ES+RIGRPA V+AAFDTELFGHWW+EGP WL +
Sbjct 286 YDPELAAAAVAGDVEDFVRTVRARLIAESDRIGRPALVVAAFDTELFGHWWHEGPQWLAQ 345
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLRALP AGV VGTL+DA A GFVG+PV L SSWGSGKDW+VW+G +V DLV+LN EVV
Sbjct 346 VLRALPEAGVTVGTLADARARGFVGEPVPLADSSWGSGKDWRVWAGDQVRDLVELNDEVV 405
Query 422 DTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAH 481
AL T+DK A A P RD VADQ+LRE +LTVSSDW FMVSKDSAA YAR RAH
Sbjct 406 GLALDTVDKMRA--ADPGAPALRDPVADQLLREAILTVSSDWAFMVSKDSAAGYARDRAH 463
Query 482 LHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
HAHA REIA A+ AG+ AR+LA W+ ADG F +DARRLP
Sbjct 464 QHAHAVREIAAAVGAGQVAKARQLAARWSAADGFFPGVDARRLP 507
>gi|317508605|ref|ZP_07966264.1| glycosyl hydrolase [Segniliparus rugosus ATCC BAA-974]
gi|316253097|gb|EFV12508.1| glycosyl hydrolase [Segniliparus rugosus ATCC BAA-974]
Length=509
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/514 (65%), Positives = 391/514 (77%), Gaps = 8/514 (1%)
Query 11 LFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTPV 70
+F VLHTHLPW+ +HGRWPVGEEWLYQ+WA +YLPL +L L + R ++L +TPV
Sbjct 1 MFCFVLHTHLPWVVNHGRWPVGEEWLYQAWAQSYLPLFAMLRRLGETGRTNALSLSLTPV 60
Query 71 VNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECAD 130
+ AQLDDP+ L + HWLANW LRA+EAA VR + + + +PEALR G E
Sbjct 61 LAAQLDDPHALTSMRHWLANWALRADEAAVVR---STGARPGTASSPEALRRLGNYERRQ 117
Query 131 AARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLADA 190
A AL F + W HGGSP++RGL+D+G VELLGGPLAH F PLL PRLREF LREGLADA
Sbjct 118 ADAALAEFESAWLHGGSPVVRGLVDSGVVELLGGPLAHSFSPLLHPRLREFTLREGLADA 177
Query 191 QLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAFG 250
+ R RP+GIWAPECA+ PGM +YA AGV F++DGP++HGDT+L RPV TDV+AFG
Sbjct 178 RARGFARPRGIWAPECAFTPGMGEEYARAGVERFLLDGPAMHGDTSLARPVEGTDVLAFG 237
Query 251 RDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADRA 310
RDL V+YRVWSPKSGYPG AYRDFHTYDHL GLKPARVTG +VP E+KAPYDP RA A
Sbjct 238 RDLSVTYRVWSPKSGYPGDGAYRDFHTYDHLVGLKPARVTGHHVPEEEKAPYDPARAQIA 297
Query 311 VDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAAG 370
V VADFV VR+RL +ES R GRPA V+AAFDTELFGHWW+EGP WL++VL ALP AG
Sbjct 298 VRRDVADFVQTVRDRLAAESARTGRPALVVAAFDTELFGHWWHEGPEWLEQVLTALPEAG 357
Query 371 VRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTIDK 430
VRVGTL +A A GFVG+ ++ SWGSGKDW+VWSG +VADLV+LN++VVDTAL T+DK
Sbjct 358 VRVGTLGEAAACGFVGERLDPEDCSWGSGKDWRVWSGPQVADLVELNADVVDTALRTVDK 417
Query 431 ALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATREI 490
LAQ RD V+DQI+RE LLTVSSDW F+VSKD+AA YAR RAHLHAHATREI
Sbjct 418 RLAQPQMF-----RDRVSDQIIREALLTVSSDWAFLVSKDTAAQYARGRAHLHAHATREI 472
Query 491 AGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
A A +G A +LA GW AD LFG LDARRL
Sbjct 473 AEAAESGHEARAAQLASGWRIADSLFGQLDARRL 506
>gi|333920888|ref|YP_004494469.1| Family 57 glycosyl hydrolase [Amycolicicoccus subflavus DQS3-9A1]
gi|333483109|gb|AEF41669.1| Family 57 glycosyl hydrolase [Amycolicicoccus subflavus DQS3-9A1]
Length=535
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/526 (64%), Positives = 394/526 (75%), Gaps = 14/526 (2%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
N S VPG+F LVLH+HLPWLAHHGRWPVGEEWLYQSWAA Y P+ +VL LADE R R
Sbjct 13 NHSGGSVPGMFCLVLHSHLPWLAHHGRWPVGEEWLYQSWAATYQPVFRVLRRLADEGRSR 72
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
L++LG+TPV+ AQLDDPY L G++ WL NWQLR+ EAA + AR+ S + +P +LR
Sbjct 73 LVSLGVTPVLAAQLDDPYTLAGMYDWLGNWQLRSHEAA-LATARECGSG---AVSPASLR 128
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
G RE +A A+++F WRHGG+ L+R L DAG +ELLGGPL HPFQPLL P LR F
Sbjct 129 RLGGREHRASAAAVEDFEQNWRHGGAGLIRSLTDAGAIELLGGPLTHPFQPLLDPELRSF 188
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
AL++GL DA+ R +P GIWAPECAY G+E Y +GVSHF+VDGPSL GDT+LGRPV
Sbjct 189 ALQQGLRDARQRFGTQPTGIWAPECAYTQGLESLYQRSGVSHFLVDGPSLRGDTSLGRPV 248
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
G + VVAFGRDL VSYRVWSPKSGYPGH YRDFH+Y H TGLKPARVT +N P+E+KAP
Sbjct 249 GTSGVVAFGRDLHVSYRVWSPKSGYPGHRDYRDFHSYHHPTGLKPARVTSKNTPAERKAP 308
Query 302 YDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQR 361
YDP RA A+D H DFV+ VR RL ES RIGRPA V+AAFDTELFGHWW+EGP WL+
Sbjct 309 YDPHRAATALDEHADDFVETVRRRLADESARIGRPALVVAAFDTELFGHWWHEGPEWLEL 368
Query 362 VLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVV 421
VLR LP AG++VGTL DA+ G+VG PV+LP SSWGSGKDW+VW+G V DLVQLN+EV
Sbjct 369 VLRRLPEAGIQVGTLQDAMDGGYVGAPVQLPDSSWGSGKDWRVWAGDDVQDLVQLNAEVA 428
Query 422 DTALTTIDKALAQTASLDGPLP---RDHVADQILRETLLTVSSDWPFMVSKDSAADYARY 478
+ ALT + K LP RD V+DQI+RE LL ++SDW FMV+KDSA DYAR
Sbjct 429 ELALTAVKKRAEH-------LPRSTRDTVSDQIIREALLCLASDWAFMVTKDSAVDYARD 481
Query 479 RAHLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
RAH HAHA REIA A +G AR LA GW ADGLF ALD+R L
Sbjct 482 RAHKHAHAAREIAAAALSGDEARARALAAGWQHADGLFPALDSRVL 527
>gi|289759155|ref|ZP_06518533.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|289714719|gb|EFD78731.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
Length=322
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/318 (99%), Positives = 315/318 (99%), Gaps = 0/318 (0%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH
Sbjct 1 MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL
Sbjct 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE
Sbjct 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
Sbjct 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA
Sbjct 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
Query 301 PYDPERADRAVDVHVADF 318
PYDPERADRAVDVH F
Sbjct 301 PYDPERADRAVDVHCCRF 318
>gi|296140613|ref|YP_003647856.1| glycoside hydrolase family protein [Tsukamurella paurometabola
DSM 20162]
gi|296028747|gb|ADG79517.1| glycoside hydrolase family 57 [Tsukamurella paurometabola DSM
20162]
Length=512
Score = 638 bits (1645), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/517 (65%), Positives = 385/517 (75%), Gaps = 22/517 (4%)
Query 9 PGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMT 68
PG+ +VLH+HLPWLAHHG WPVGEEWLYQSWAA+YL + +VL LADE R +TLG+T
Sbjct 8 PGMIAVVLHSHLPWLAHHGAWPVGEEWLYQSWAASYLRVAEVLDRLADEGRTEQLTLGIT 67
Query 69 PVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIREC 128
PV+ AQLDDPYCL +HH+L NWQLRA EAA + + SA+ G RE
Sbjct 68 PVLAAQLDDPYCLEAMHHYLGNWQLRAHEAA---LSGRPASAE-----------LGAREH 113
Query 129 ADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLA 188
+AA AL F ++WRHGG+P+LR L DAGTVELLGGPLAHPFQPLL PRLRE +LREGLA
Sbjct 114 RNAAHALRLFESQWRHGGTPVLRRLADAGTVELLGGPLAHPFQPLLHPRLRELSLREGLA 173
Query 189 DAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVA 248
+ RL P GIWAPECAYAPGME Y AGV+HFMVDGP+L GDTAL RPVG + V+A
Sbjct 174 YGKARLGQDPAGIWAPECAYAPGMEHGYQAAGVTHFMVDGPTLGGDTALARPVGDSTVLA 233
Query 249 FGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERAD 308
FGRDL VSYRVWSPKSGYPGHAAYRDF+ +DH TGLKP+RVTG+NVP ++KAPYDP+RAD
Sbjct 234 FGRDLPVSYRVWSPKSGYPGHAAYRDFYAHDHSTGLKPSRVTGKNVPEQEKAPYDPQRAD 293
Query 309 RAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPA 368
A+D HVADFV+ V RL SES RIGR A V+AAFDTELFGHWWYEGP WL+RVLRALP
Sbjct 294 AAIDKHVADFVETVVQRLRSESSRIGRDALVVAAFDTELFGHWWYEGPIWLERVLRALPE 353
Query 369 AGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTI 428
AGVRVGTL A G+VG P E+ SSWGSGKDW+VW G +V + V+LN E+ T L +
Sbjct 354 AGVRVGTLRSAAEAGYVGAPREIAASSWGSGKDWRVWEGPQVREFVELNEEITTTTLRVL 413
Query 429 DKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATR 488
DK + RD V+DQ+L+E L+ + SDWPFMVSKD+AA YAR RA+ HAHA R
Sbjct 414 DKRAGRQ--------RDRVSDQVLQEALMCLQSDWPFMVSKDTAAAYARDRAYKHAHALR 465
Query 489 EIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
EIA A G A LA W +AD F LDARRLP
Sbjct 466 EIADAAERGDEARAAMLARSWRQADDPFPGLDARRLP 502
>gi|2414527|emb|CAB16416.1| hypothetical protein MLCB637.01c [Mycobacterium leprae]
Length=338
Score = 593 bits (1529), Expect = 2e-167, Method: Compositional matrix adjust.
Identities = 290/342 (85%), Positives = 309/342 (91%), Gaps = 4/342 (1%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NTSA+PVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVL LADENRH
Sbjct 1 MNTSANPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLHTLADENRH 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
RLITLG+TPVVNAQLDDPYCL+G+HHWLANW+LRA EAASVR +SK P+C P+AL
Sbjct 61 RLITLGVTPVVNAQLDDPYCLDGMHHWLANWRLRATEAASVRSGSESK----PACAPQAL 116
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
RAFG REC +A RAL+ FAT WRHGGSPLLR LIDAGTVELLGGPLAHPFQPL+APRLRE
Sbjct 117 RAFGARECIEAQRALEYFATLWRHGGSPLLRSLIDAGTVELLGGPLAHPFQPLIAPRLRE 176
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FAL EGL DA LRLAHRP GIWAPECAYAPGME DY+ AG++HFMVDGPSLHGDTALGRP
Sbjct 177 FALHEGLDDAWLRLAHRPTGIWAPECAYAPGMEHDYSAAGITHFMVDGPSLHGDTALGRP 236
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG T VVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTG NVPSE KA
Sbjct 237 VGDTAVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGCNVPSEAKA 296
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAA 342
PYDP+ AD+ VD HV DFV VVRNRL +ESERIGRPAHV+AA
Sbjct 297 PYDPDHADKVVDAHVDDFVGVVRNRLFTESERIGRPAHVVAA 338
>gi|325000548|ref|ZP_08121660.1| glycoside hydrolase family 57 [Pseudonocardia sp. P1]
Length=515
Score = 556 bits (1432), Expect = 5e-156, Method: Compositional matrix adjust.
Identities = 306/516 (60%), Positives = 344/516 (67%), Gaps = 19/516 (3%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F LVLH+HLP LA HGRWPVGEEWLYQSWA +YLP++ L LA E R L TLG+TP
Sbjct 7 GTFCLVLHSHLPLLARHGRWPVGEEWLYQSWAQSYLPVVATLRELAAEGRGGLATLGVTP 66
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDPYCL GVH WL W LRA AA L E
Sbjct 67 VLAAQLDDPYCLRGVHDWLGGWTLRAHSAAG------------------RLPDLAAHEHR 108
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
+ A F WRHGGSP LR L D+G VELLGGP AHPFQPLL PR+R F+LR GLAD
Sbjct 109 LSTTATAEFEAHWRHGGSPALRSLRDSGAVELLGGPAAHPFQPLLDPRVRRFSLRAGLAD 168
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
LRL RP GIWAPEC YAPGME DYATAGV F+VDGP+L GDTAL RPVG +DV+
Sbjct 169 HALRLGSRPAGIWAPECGYAPGMEHDYATAGVGRFLVDGPALRGDTALARPVGDSDVLCV 228
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRDL V+YRVWSP+ GYPG A YRDFHT+DH +GLKPARVTGR VPSE+K PY P+ A R
Sbjct 229 GRDLDVTYRVWSPRKGYPGSAEYRDFHTWDHASGLKPARVTGRRVPSERKKPYSPDLAAR 288
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
AV DFVD V RL + GRPA +AAFDTELFGHWW EGP WL VLR LP A
Sbjct 289 AVQRDARDFVDTVVARLRALRGTTGRPALTVAAFDTELFGHWWQEGPDWLAAVLRLLPEA 348
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTID 429
GV V TL AI DG VGDPV LP SWGSGKDW+VW+G +VAD+V N +V L +D
Sbjct 349 GVAVRTLGGAIDDGLVGDPVTLPECSWGSGKDWRVWAGPQVADVVARNDDVQRDLLAGVD 408
Query 430 KALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATRE 489
L L P RD +AD ++ + L +SSDW FMVSKDSAADYAR R+ HA E
Sbjct 409 TVLPGDGPLLRPDARDPLADLLVDQALHALSSDWAFMVSKDSAADYARSRSATHAGRVAE 468
Query 490 IAGALAAGRRDTARRLAEGWNRADG-LFGALDARRL 524
++G L AGRR A R A W A +FG +DAR L
Sbjct 469 LSGLLRAGRRRAAGRRAAHWTDATAPVFGHVDARDL 504
>gi|300783571|ref|YP_003763862.1| hypothetical protein AMED_1649 [Amycolatopsis mediterranei U32]
gi|299793085|gb|ADJ43460.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524961|gb|AEK40166.1| hypothetical protein RAM_08380 [Amycolatopsis mediterranei S699]
Length=501
Score = 555 bits (1430), Expect = 7e-156, Method: Compositional matrix adjust.
Identities = 296/515 (58%), Positives = 355/515 (69%), Gaps = 20/515 (3%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F LV+H+HLPWL HHG WPVGEEWLYQ+WA +YLP++++L ADE R ++TLG+TP
Sbjct 4 GTFCLVVHSHLPWLPHHGSWPVGEEWLYQAWAHSYLPMVELLERFADEGRRDVLTLGVTP 63
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDPY + H WL +WQLRA+ AA++ LR E
Sbjct 64 VLAAQLDDPYSIRAFHDWLGHWQLRAQHAATLWRG------------DPLLRELAAAEHR 111
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
A RA + TRWRHG SP+LR L+D T+ELLGGPLAHPFQPLL PR+REFAL GLAD
Sbjct 112 TAVRAAEELGTRWRHGFSPVLRSLVDNSTIELLGGPLAHPFQPLLDPRVREFALNAGLAD 171
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
LR+ RP+GIWAPEC YAPGME DYA AGV F+VDGPSL G+T RPVG TDV+AF
Sbjct 172 TALRVGTRPEGIWAPECGYAPGMENDYAAAGVRRFLVDGPSLRGETWAARPVGGTDVLAF 231
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRDL+V+YRVWSPK+GYPGH+AYRDFHT+ H GLK ARVTG+ V + KAPYDP A
Sbjct 232 GRDLEVTYRVWSPKAGYPGHSAYRDFHTWQHEVGLKAARVTGKTVEPQDKAPYDPALAAD 291
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
+ HV DFV+ V RL S + GR A V+AA+DTELFGHWW+EGP WL+ VLRALP A
Sbjct 292 VLATHVKDFVETVVTRLRSLKRQHGREALVVAAYDTELFGHWWHEGPAWLEGVLRALPEA 351
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTID 429
GVRV TL A+ G VG+P++LP SSWGSGKDW+VW G +V D+V N+ + D L +
Sbjct 352 GVRVTTLEGAMEAGHVGEPIDLPSSSWGSGKDWRVWDGEQVKDVVAANAALQDRLLDLV- 410
Query 430 KALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATRE 489
L +TA RD VADQ + E +L + SDW FMV+KDSAADYAR RA +H
Sbjct 411 AGLDRTA-------RDTVADQAVAEAMLALESDWAFMVTKDSAADYARRRAAVHTERFDA 463
Query 490 IAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
+AG L G R A LA + DG FG LDAR L
Sbjct 464 LAGLLRRGDRARAVELAAAYRADDGPFGHLDARAL 498
>gi|302524922|ref|ZP_07277264.1| glycoside hydrolase, family 57 [Streptomyces sp. AA4]
gi|302433817|gb|EFL05633.1| glycoside hydrolase, family 57 [Streptomyces sp. AA4]
Length=509
Score = 549 bits (1415), Expect = 4e-154, Method: Compositional matrix adjust.
Identities = 291/526 (56%), Positives = 355/526 (68%), Gaps = 20/526 (3%)
Query 1 LNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRH 60
+NT G F LVLH+HLPWL HHG WPVGEEWLYQ+W +YLP++ +L A E R
Sbjct 1 MNTRNPASEGTFCLVLHSHLPWLPHHGSWPVGEEWLYQAWTHSYLPVVDLLRRFAAEGRR 60
Query 61 RLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEAL 120
++TLGMTPV+ AQLDDPY + H W+ +WQLR + A+++ L
Sbjct 61 DVLTLGMTPVLAAQLDDPYAIRACHDWVGHWQLRTQHASTLWRG------------DPLL 108
Query 121 RAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLRE 180
R E A A TRWRHG SP+LR L+D+ T+ELLGGPLAHPFQPLL R+R
Sbjct 109 RDLAAAEHQAAMHATAELETRWRHGFSPILRSLVDSDTIELLGGPLAHPFQPLLDQRVRR 168
Query 181 FALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP 240
FAL+ GL D LR+ +GIWAPEC YAPGME DY AGV F+VDGPSLHGDT+ P
Sbjct 169 FALQAGLEDTALRIGRAQEGIWAPECGYAPGMERDYHDAGVRRFLVDGPSLHGDTSAAHP 228
Query 241 VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKA 300
VG +DVV FGRDL+V+YRVWSPK+GYPGHAAYRDFHT+ H GLK +RVTG+ V + KA
Sbjct 229 VGDSDVVCFGRDLEVTYRVWSPKAGYPGHAAYRDFHTWAHEVGLKASRVTGKTVEPQDKA 288
Query 301 PYDPERADRAVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
PYDP A + HV DFV+ V RL S E+ GR + V+AA+DTELFGHWW+EGP WL+
Sbjct 289 PYDPALATDVLGSHVKDFVETVVARLRSLREQHGRESLVVAAYDTELFGHWWHEGPAWLE 348
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
+LRALP AGVRV TL A+ G +G+PV+LP SSWGSGKDW+VW G +VAD+VQ N+ +
Sbjct 349 GILRALPEAGVRVTTLRGALEAGHLGEPVQLPASSWGSGKDWRVWDGEQVADMVQDNTAL 408
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
L +DK T RD VADQ + E ++ +SSDW FMV+KDSAADYAR RA
Sbjct 409 QTRLLDLVDKQDGTT--------RDTVADQAVAEAMMALSSDWAFMVTKDSAADYARRRA 460
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
+H +AG L +G R A LAE + RADG FG LDAR L +
Sbjct 461 RVHTERFDALAGLLRSGDRARALALAEEYRRADGPFGHLDARALRR 506
>gi|257056805|ref|YP_003134637.1| hypothetical protein Svir_28290 [Saccharomonospora viridis DSM
43017]
gi|256586677|gb|ACU97810.1| uncharacterized conserved protein [Saccharomonospora viridis
DSM 43017]
Length=505
Score = 546 bits (1408), Expect = 2e-153, Method: Compositional matrix adjust.
Identities = 299/516 (58%), Positives = 353/516 (69%), Gaps = 25/516 (4%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F LVLH+HLPWL HHG WPVGEEWLYQ+WA +Y+PL+ +L ADE R ++TLG+TP
Sbjct 6 GTFCLVLHSHLPWLPHHGNWPVGEEWLYQAWAHSYVPLVDLLHRFADEGRRDVLTLGVTP 65
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDPY L WL +W LRA+ AAS+ + L E
Sbjct 66 VLAAQLDDPYALRSFSEWLGHWTLRAQHAASLWRGH------------DLLSDLASAEYR 113
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
A R L +RWRHG SPLLR L+D+ +ELLGGP HPFQPLL R+R FALR GLAD
Sbjct 114 AAHRVLAEAESRWRHGFSPLLRSLVDSQVIELLGGPATHPFQPLLDQRVRAFALRTGLAD 173
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
LR+ HRP GIWAPEC YAPGME DYA AGV F+VDGPSL GDTA R VG TDVV F
Sbjct 174 TTLRIGHRPDGIWAPECGYAPGMEDDYAAAGVHRFLVDGPSLRGDTAAARTVGSTDVVCF 233
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRDL+V+YRVWSPK+GYPGH+AYRDFHT+ H G+KP+RVTG+++ E KAPYDP A
Sbjct 234 GRDLEVTYRVWSPKAGYPGHSAYRDFHTWAHEVGIKPSRVTGKHIAPEDKAPYDPALAAD 293
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
A+ +HV DFV+ V RL ER GRP+ V+AA+DTELFGHWWYEGP WL+ VLRALP A
Sbjct 294 ALRLHVKDFVETVVQRLRQLRERHGRPSLVVAAYDTELFGHWWYEGPQWLEAVLRALPEA 353
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTT-I 428
GVRV TL A+ G +G PVELP SSWGSGKDW+VW G +VAD+V+ N+ + L +
Sbjct 354 GVRVTTLRGALEAGHLGGPVELPASSWGSGKDWRVWDGEQVADMVEANAALQRRLLALPV 413
Query 429 DKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATR 488
+ A+ RD VADQ + E LL +SSDW FMV+KDSAADYAR RA H
Sbjct 414 NDAV-----------RDPVADQAVTEALLALSSDWAFMVTKDSAADYARRRAREHTERFD 462
Query 489 EIAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
+A AL AG D AR LAE + A FG LDAR L
Sbjct 463 RLAEALRAGHPD-ARTLAERYRAASLPFGHLDAREL 497
>gi|256380050|ref|YP_003103710.1| glycoside hydrolase family protein [Actinosynnema mirum DSM 43827]
gi|255924353|gb|ACU39864.1| glycoside hydrolase family 57 [Actinosynnema mirum DSM 43827]
Length=499
Score = 544 bits (1402), Expect = 1e-152, Method: Compositional matrix adjust.
Identities = 300/522 (58%), Positives = 340/522 (66%), Gaps = 35/522 (6%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F LVLH+HLPWLAHHG WPVGEEWLYQ+WA +YLP++ +L A E R ++TLG+TP
Sbjct 6 GTFCLVLHSHLPWLAHHGAWPVGEEWLYQAWAHSYLPVVDLLERFAAEGRRDVLTLGVTP 65
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDPY L GVH WL NWQLRA AA L RE
Sbjct 66 VLAAQLDDPYSLRGVHDWLGNWQLRAHGAAP------------------RLPELAAREHR 107
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
++A AL+ F RWRHG SPLLR L+D+G VELLGGP AHPFQPLL RLR FAL GL D
Sbjct 108 ESAVALERFEGRWRHGFSPLLRPLVDSGVVELLGGPAAHPFQPLLDERLRAFALETGLRD 167
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
LRL RP GIWAPEC YAPGME YA AGV F+VDGPSL T R VG +DVV F
Sbjct 168 TALRLGSRPAGIWAPECGYAPGMERGYAAAGVRRFLVDGPSLGNQTWAARRVGDSDVVCF 227
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRDL+VSYRVWSP+ GYPGH YRDFHTYDH +GLKP+RVTG V + K PYDPE A
Sbjct 228 GRDLEVSYRVWSPRVGYPGHPDYRDFHTYDHASGLKPSRVTGVEVAPQDKRPYDPEAARA 287
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
AV H DFV V RL GR A V+AA+DTEL+GHWW+EGP WL+ VLRALP A
Sbjct 288 AVRGHTEDFVGAVVARLRELRAAHGREALVVAAYDTELYGHWWHEGPLWLESVLRALPEA 347
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTID 429
GVRV TL A+ G V VE+P SSWG+GKDW W G +VAD V N+E+ L
Sbjct 348 GVRVTTLRGAVEAGHVVGSVEVPASSWGAGKDWNTWGGPQVADFVHANAELQRELLGL-- 405
Query 430 KALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATRE 489
L G + RD VADQ +RE LL++SSDW FMV+KDSAADYARYRA HA RE
Sbjct 406 -------ELGGAV-RDPVADQAVREALLSLSSDWAFMVTKDSAADYARYRAKAHAERFRE 457
Query 490 IAGALAA-------GRRDTARRLAEGWNRADGLFGALDARRL 524
+A + A GR D AR A G FG LDAR L
Sbjct 458 LAVEIRAARGGSDLGRVDRARGRAAELRALSGPFGHLDARAL 499
>gi|331695603|ref|YP_004331842.1| hypothetical protein Psed_1754 [Pseudonocardia dioxanivorans
CB1190]
gi|326950292|gb|AEA23989.1| Domain of unknown function DUF1957 [Pseudonocardia dioxanivorans
CB1190]
Length=524
Score = 533 bits (1374), Expect = 2e-149, Method: Compositional matrix adjust.
Identities = 300/515 (59%), Positives = 343/515 (67%), Gaps = 20/515 (3%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F LVLH+HLPWLAHHGRWPVGEEWLYQSWA AYLP++ VL LA E R L+TLG+TP
Sbjct 6 GTFCLVLHSHLPWLAHHGRWPVGEEWLYQSWAHAYLPVVDVLERLAAEGRRDLLTLGVTP 65
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDP+CL G+H WL W LRA EAA+ P+ R RE
Sbjct 66 VLAAQLDDPHCLRGMHDWLGGWLLRAHEAATR--------------LPDLAR----REHR 107
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
A AL F RWRHGG+P+LRGL DAG +ELLGGP HPF PLL R+R FAL GL D
Sbjct 108 AATAALRGFEARWRHGGAPVLRGLADAGALELLGGPATHPFAPLLDDRVRAFALDAGLVD 167
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
R RP IWAPEC YAPGME YA AGV F+VDGP+LHGDTAL RPVG + V+
Sbjct 168 HTRRFGVRPAAIWAPECGYAPGMETGYAAAGVDRFVVDGPALHGDTALLRPVGDSGVLVA 227
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRDL V+YRVWSP++GYPGH YRDFHT+DH +GLKP+RVTG++V + K PYDP+RA
Sbjct 228 GRDLDVTYRVWSPRAGYPGHPDYRDFHTFDHRSGLKPSRVTGKHVAPQDKRPYDPDRAAA 287
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
A+ ADFV VVR RL R+GRP +AAFDTELFGHWW+EGP WL+ VLRALP A
Sbjct 288 ALARDAADFVGVVRERLTGIRARLGRPGLAVAAFDTELFGHWWHEGPAWLEAVLRALPEA 347
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTID 429
GVR TL+ A+ G VG PVELP SSWG GKDW VW G VADLV+ V L T
Sbjct 348 GVRATTLAGAVDAGLVGTPVELPRSSWGLGKDWHVWDGDPVADLVRRGQAVQRDLLATAH 407
Query 430 KALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATRE 489
A A A RD + D + + LL +SSDW FMVS DSAADYAR RAH HA
Sbjct 408 PAPAHPAPAHATAARDPLRDALAEQALLALSSDWAFMVSHDSAADYARSRAHGHADRVAT 467
Query 490 IAGALAAGRRDTARRLAEGWNRADGLFGALDARRL 524
+A LAAG R A+ W G FG +DARRL
Sbjct 468 LARLLAAGDRRAAQAAVASWE--PGPFGHVDARRL 500
>gi|134102636|ref|YP_001108297.1| glycoside hydrolase family protein [Saccharopolyspora erythraea
NRRL 2338]
gi|291004680|ref|ZP_06562653.1| glycoside hydrolase family protein [Saccharopolyspora erythraea
NRRL 2338]
gi|133915259|emb|CAM05372.1| glycoside hydrolase, family 57 [Saccharopolyspora erythraea NRRL
2338]
Length=493
Score = 532 bits (1370), Expect = 7e-149, Method: Compositional matrix adjust.
Identities = 282/516 (55%), Positives = 342/516 (67%), Gaps = 25/516 (4%)
Query 10 GLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTP 69
G F +VLH+HLPWLA HGRWPVGEEWLYQ+WA++YLP++ +L LA E ++T G+TP
Sbjct 2 GDFAMVLHSHLPWLAGHGRWPVGEEWLYQAWASSYLPVVGMLERLAAEGHRDVLTFGLTP 61
Query 70 VVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECA 129
V+ AQLDDP+CL G HHWL NW+LRAEEAA AR S D + EC
Sbjct 62 VLAAQLDDPHCLRGFHHWLGNWRLRAEEAA----ARWRGSGDELAAA----------ECR 107
Query 130 DAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREFALREGLAD 189
A+RAL+ F RWRHG SP+LR L+D+ VEL+GGP HPFQPLL PRLREFALR GLAD
Sbjct 108 AASRALEEFELRWRHGASPVLRPLVDSDVVELIGGPATHPFQPLLDPRLREFALRLGLAD 167
Query 190 AQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPVGKTDVVAF 249
+LRL P+GIWAPECAY PG+E +YA GV F+VDG +L D + VG+TD+V F
Sbjct 168 TRLRLGSAPEGIWAPECAYGPGVEREYAATGVRRFLVDGSALD-DVSAAHTVGRTDIVCF 226
Query 250 GRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADR 309
GRD+ +S VWS GYP H AYRDFHT+DH GLKP+R+TG++V E K PY A
Sbjct 227 GRDVSISDLVWS-DGGYPSHPAYRDFHTFDHAVGLKPSRITGKDVAPEDKQPYALGPAAA 285
Query 310 AVDVHVADFVDVVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAA 369
D H DFV VR RL ER GR V+AAFDTELFGHWW+EGP WL+RVLR LP A
Sbjct 286 VADEHADDFVAAVRQRLEEHRERGGRQGLVVAAFDTELFGHWWFEGPRWLERVLRTLPEA 345
Query 370 GVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTID 429
GVRV TL A+A G +G+PVELP +SWG G DW+VW+G VA++ N + + L
Sbjct 346 GVRVTTLRGALAVGHLGEPVELPETSWGVGGDWRVWAGEPVAEIADANQRLQERVLRRAP 405
Query 430 KALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATRE 489
+A A DHV DQ+LRET+L ++SDW FMV+ D+ A YAR RAH H
Sbjct 406 RAGAAR---------DHVLDQLLRETVLALASDWAFMVTHDAGAGYARERAHDHTSRAHR 456
Query 490 IAGALAAGRRDTARRLAEGWNRADGLFGALDARRLP 525
+ L +GR A LA DG+FG LDAR LP
Sbjct 457 LLDLLESGRYADADALARELRGRDGVFGHLDARLLP 492
>gi|336179611|ref|YP_004584986.1| hypothetical protein FsymDg_3788 [Frankia symbiont of Datisca
glomerata]
gi|334860591|gb|AEH11065.1| Domain of unknown function DUF1957 [Frankia symbiont of Datisca
glomerata]
Length=522
Score = 436 bits (1121), Expect = 5e-120, Method: Compositional matrix adjust.
Identities = 272/526 (52%), Positives = 330/526 (63%), Gaps = 29/526 (5%)
Query 2 NTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHR 61
T A PV G F LVLH+HLPW+AH G WPVGEEWLYQ+W AYLPLL +L LAD +
Sbjct 24 RTPAEPV-GTFCLVLHSHLPWVAHAGAWPVGEEWLYQAWTGAYLPLLDLLERLADTGQRN 82
Query 62 LITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALR 121
++TLG+TPV+ A LDDPYCL+G+ W+A+WQLRA+ A + +
Sbjct 83 VLTLGVTPVLAAMLDDPYCLSGLARWVADWQLRAQGALV-----------------DGVP 125
Query 122 AFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVELLGGPLAHPFQPLLAPRLREF 181
G E A A AL +RWRHG SPLLR L DAG VELLGGP +HP PLL R+
Sbjct 126 GAG-HEGALATAALRAVESRWRHGASPLLRRLADAGVVELLGGPASHPVLPLLDERIVPV 184
Query 182 ALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRPV 241
L GL DA LRL RP+GIWAPECAY PG+E YA AGV+HF+VDGP++ G+TA V
Sbjct 185 QLGAGLDDATLRLGTRPRGIWAPECAYRPGLERHYAAAGVTHFVVDGPTVGGETAGAYDV 244
Query 242 GKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAP 301
+ V+AF RDL VSY VWSP +GYP A YRDFHT+DH +GL+ +RVT + P +KAP
Sbjct 245 AGSGVLAFPRDLGVSYEVWSPTAGYPRGAWYRDFHTFDHGSGLRVSRVTSVDTPPGRKAP 304
Query 302 YDPERADRAVDVHVADFVDVVRNRLLS-ESERIGRPAHVIAAFDTELFGHWWYEGPTWLQ 360
Y+P A RA A FV V+R RL + R GR A V+ AFDTELFGH W+EGP +L+
Sbjct 305 YEPAAARRAAADDAARFVGVLRRRLTELAAARDGRRALVVCAFDTELFGHHWHEGPAFLE 364
Query 361 RVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGKDWQVWSGAKVADLVQLNSEV 420
VL LP AG++ TLS A A G V +LP SWG GKD+ VW A V DL +L +E
Sbjct 365 AVLTLLPDAGIQAATLSQAAAAGHVEGARDLPAGSWGRGKDFHVW--ADVTDLAKLAAE- 421
Query 421 VDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA 480
V+ L + A A P RD V DQ+ RE LT +SDW F V+ DSAADYAR RA
Sbjct 422 VNGRLVEVVTAAAH------PAERDDVLDQMAREAFLTSASDWAFCVTHDSAADYARSRA 475
Query 481 HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK 526
+ HA +A + A R A RLA + D F ALDARRL +
Sbjct 476 YTHAGRFDALADTVHARDRLAATRLARHYRTLDHPFPALDARRLTR 521
Lambda K H
0.320 0.136 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 1156240501672
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40