BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0141c
Length=136
Score E
Sequences producing significant alignments: (Bits) Value
gi|308231473|ref|ZP_07412563.2| hypothetical protein TMAG_01275 ... 283 8e-75
gi|15607283|ref|NP_214655.1| hypothetical protein Rv0141c [Mycob... 283 9e-75
gi|340625176|ref|YP_004743628.1| hypothetical protein MCAN_01441... 277 3e-73
gi|183980382|ref|YP_001848673.1| hypothetical protein MMAR_0351 ... 216 9e-55
gi|118619826|ref|YP_908158.1| hypothetical protein MUL_4769 [Myc... 215 2e-54
gi|240168145|ref|ZP_04746804.1| hypothetical protein MkanA1_0244... 213 1e-53
gi|254822365|ref|ZP_05227366.1| hypothetical protein MintA_20701... 186 7e-46
gi|296167332|ref|ZP_06849734.1| conserved hypothetical protein [... 185 2e-45
gi|118465210|ref|YP_884272.1| hypothetical protein MAV_5155 [Myc... 184 3e-45
gi|41409656|ref|NP_962492.1| hypothetical protein MAP3558c [Myco... 184 3e-45
gi|342859426|ref|ZP_08716080.1| hypothetical protein MCOL_11118 ... 184 5e-45
gi|333988730|ref|YP_004521344.1| hypothetical protein JDM601_009... 151 4e-35
gi|111223008|ref|YP_713802.1| Delta-5-3-ketosteroid isomerase-li... 48.1 4e-04
gi|337764079|emb|CCB72788.1| Delta-5-3-ketosteroid isomerase-lik... 48.1 4e-04
gi|239814324|ref|YP_002943234.1| hypothetical protein Vapar_1317... 47.0 0.001
gi|312197768|ref|YP_004017829.1| limonene-12-epoxide hydrolase [... 44.7 0.005
gi|332528246|ref|ZP_08404251.1| hypothetical protein HGR_00065 [... 43.9 0.009
gi|167647193|ref|YP_001684856.1| ketosteroid isomerase-like prot... 43.5 0.010
gi|329765414|ref|ZP_08256994.1| Ketosteroid isomerase-related pr... 43.1 0.012
gi|256389606|ref|YP_003111170.1| Limonene-12-epoxide hydrolase [... 43.1 0.013
gi|334345520|ref|YP_004554072.1| hypothetical protein Sphch_1886... 43.1 0.015
gi|334342799|ref|YP_004555403.1| hypothetical protein Sphch_3294... 42.7 0.020
gi|256390623|ref|YP_003112187.1| Limonene-12-epoxide hydrolase [... 42.4 0.025
gi|342857663|ref|ZP_08714319.1| hypothetical protein MCOL_02265 ... 42.0 0.030
gi|340344716|ref|ZP_08667848.1| Ketosteroid isomerase-related pr... 40.4 0.085
gi|71280932|ref|YP_269245.1| hypothetical protein CPS_2529 [Colw... 40.4 0.086
gi|111022377|ref|YP_705349.1| hypothetical protein RHA1_ro05411 ... 40.4 0.087
gi|334342811|ref|YP_004555415.1| hypothetical protein Sphch_3310... 40.0 0.10
gi|254168902|ref|ZP_04875742.1| aspartyl-tRNA synthetase [Acidul... 40.0 0.10
gi|269126944|ref|YP_003300314.1| hypothetical protein Tcur_2725 ... 39.7 0.14
gi|325921558|ref|ZP_08183405.1| ketosteroid isomerase-like prote... 39.7 0.16
gi|254166888|ref|ZP_04873742.1| aspartyl-tRNA synthetase [Acidul... 39.3 0.19
gi|146299901|ref|YP_001194492.1| hypothetical protein Fjoh_2145 ... 39.3 0.20
gi|94495237|ref|ZP_01301818.1| hypothetical protein SKA58_02050 ... 39.3 0.22
gi|13474406|ref|NP_105974.1| hypothetical protein mlr5282 [Mesor... 39.3 0.22
gi|338973295|ref|ZP_08628662.1| hypothetical protein CSIRO_1745 ... 38.9 0.27
gi|111224856|ref|YP_715650.1| putative terpene synthesis protein... 38.9 0.27
gi|94498797|ref|ZP_01305344.1| hypothetical protein SKA58_10240 ... 38.5 0.32
gi|260775727|ref|ZP_05884623.1| hypothetical protein VIC_001108 ... 38.5 0.34
gi|217979484|ref|YP_002363631.1| hypothetical protein Msil_3364 ... 38.5 0.35
gi|325971539|ref|YP_004247730.1| aspartate kinase [Spirochaeta s... 38.5 0.37
gi|300115442|ref|YP_003762017.1| hypothetical protein Nwat_2961 ... 38.1 0.49
gi|325921561|ref|ZP_08183408.1| ketosteroid isomerase-like prote... 37.7 0.53
gi|337762648|emb|CCB71356.1| Ketosteroid isomerase-like protein ... 37.7 0.54
gi|254822841|ref|ZP_05227842.1| putative terpene synthesis prote... 37.7 0.55
gi|288919425|ref|ZP_06413758.1| Limonene-12-epoxide hydrolase [F... 37.7 0.56
gi|254500900|ref|ZP_05113051.1| CAIB/BAIF family [Labrenzia alex... 37.4 0.72
gi|322371794|ref|ZP_08046337.1| hypothetical protein ZOD2009_198... 37.0 0.89
gi|334338636|ref|YP_004543616.1| hypothetical protein Desru_0024... 37.0 0.97
gi|27377616|ref|NP_769145.1| hypothetical protein blr2505 [Brady... 37.0 1.0
>gi|308231473|ref|ZP_07412563.2| hypothetical protein TMAG_01275 [Mycobacterium tuberculosis SUMu001]
gi|308370323|ref|ZP_07421081.2| hypothetical protein TMCG_02351 [Mycobacterium tuberculosis SUMu003]
gi|308373895|ref|ZP_07434029.2| hypothetical protein TMFG_03101 [Mycobacterium tuberculosis SUMu006]
12 more sequence titles
Length=137
Score = 283 bits (723), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK
Sbjct 2 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 61
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ
Sbjct 62 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 121
Query 121 SLAEVYPVACATPGRC 136
SLAEVYPVACATPGRC
Sbjct 122 SLAEVYPVACATPGRC 137
>gi|15607283|ref|NP_214655.1| hypothetical protein Rv0141c [Mycobacterium tuberculosis H37Rv]
gi|15839522|ref|NP_334559.1| hypothetical protein MT0149 [Mycobacterium tuberculosis CDC1551]
gi|31791319|ref|NP_853812.1| hypothetical protein Mb0146c [Mycobacterium bovis AF2122/97]
63 more sequence titles
Length=136
Score = 283 bits (723), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK
Sbjct 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ
Sbjct 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
Query 121 SLAEVYPVACATPGRC 136
SLAEVYPVACATPGRC
Sbjct 121 SLAEVYPVACATPGRC 136
>gi|340625176|ref|YP_004743628.1| hypothetical protein MCAN_01441 [Mycobacterium canettii CIPT
140010059]
gi|340003366|emb|CCC42485.1| hypothetical protein MCAN_01441 [Mycobacterium canettii CIPT
140010059]
Length=136
Score = 277 bits (709), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 133/136 (98%), Positives = 135/136 (99%), Gaps = 0/136 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK
Sbjct 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
QVFEVNIELIRCVNEGETVVVEGHC+GVSADRTRYDSPFVCIFETRDGMI+SLREYSDTQ
Sbjct 61 QVFEVNIELIRCVNEGETVVVEGHCEGVSADRTRYDSPFVCIFETRDGMIVSLREYSDTQ 120
Query 121 SLAEVYPVACATPGRC 136
SLAEVYPVACAT GRC
Sbjct 121 SLAEVYPVACATQGRC 136
>gi|183980382|ref|YP_001848673.1| hypothetical protein MMAR_0351 [Mycobacterium marinum M]
gi|183173708|gb|ACC38818.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=133
Score = 216 bits (550), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 100/133 (76%), Positives = 117/133 (88%), Gaps = 1/133 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMFLQS+CEGGDLDEGFALLS+DFTYWSI+TR+ +K RRAVE+RK
Sbjct 1 MTPFDDPQAELAWMFLQSMCEGGDLDEGFALLSDDFTYWSIITRSAFEKDALRRAVEQRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
+VFE+NI+L RCVNEGETVV+EG +G++A RYDSPFVCIFET+DG+I+SLREYSDTQ
Sbjct 61 KVFEINIDLRRCVNEGETVVIEGQSEGMTATGVRYDSPFVCIFETKDGLIVSLREYSDTQ 120
Query 121 SLAEVYPVACATP 133
S A V+P A A P
Sbjct 121 SFAAVFP-AGANP 132
>gi|118619826|ref|YP_908158.1| hypothetical protein MUL_4769 [Mycobacterium ulcerans Agy99]
gi|118571936|gb|ABL06687.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=133
Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 100/133 (76%), Positives = 116/133 (88%), Gaps = 1/133 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMFLQS CEGGDLDEGFALLS+DFTYWSI+TR+ +K RRAVE+RK
Sbjct 1 MTPFDDPQAELAWMFLQSTCEGGDLDEGFALLSDDFTYWSIITRSAFEKDALRRAVEQRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
+VFE+NI+L RCVNEGETVV+EG +G++A RYDSPFVCIFET+DG+I+SLREYSDTQ
Sbjct 61 KVFEINIDLRRCVNEGETVVIEGQSEGMTATGVRYDSPFVCIFETKDGLIVSLREYSDTQ 120
Query 121 SLAEVYPVACATP 133
S A V+P A A P
Sbjct 121 SFAAVFP-AGANP 132
>gi|240168145|ref|ZP_04746804.1| hypothetical protein MkanA1_02442 [Mycobacterium kansasii ATCC
12478]
Length=127
Score = 213 bits (541), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/126 (80%), Positives = 112/126 (89%), Gaps = 0/126 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQAELAWMF+QS+CEGGDLDEGFALLS+DFTYWSI TR+ +DK T RRAVERRK
Sbjct 1 MTPFDDPQAELAWMFMQSMCEGGDLDEGFALLSDDFTYWSIATRSAIDKATLRRAVERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
+ EVNI+LIRCVNEGETVVVEG + V+A TRYDSPFVCIFE DG I+SLREYSDTQ
Sbjct 61 KRSEVNIDLIRCVNEGETVVVEGRVEAVAAGGTRYDSPFVCIFECHDGQIVSLREYSDTQ 120
Query 121 SLAEVY 126
SLA+V+
Sbjct 121 SLAKVF 126
>gi|254822365|ref|ZP_05227366.1| hypothetical protein MintA_20701 [Mycobacterium intracellulare
ATCC 13950]
Length=132
Score = 186 bits (473), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 86/127 (68%), Positives = 103/127 (82%), Gaps = 0/127 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQ ELAWMFLQS+ +GGDLDEGFALL +DFT+W++ TRT DK TFRRAVERRK
Sbjct 1 MTPFDDPQGELAWMFLQSISDGGDLDEGFALLRDDFTFWTLYTRTACDKDTFRRAVERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
Q E+ I+L+RCVNEGETVVVE G ++D T YDSPFVCIF+TR G+I+S+R Y DT+
Sbjct 61 QDLELVIDLVRCVNEGETVVVEAQATGTTSDGTSYDSPFVCIFDTRAGLIVSMRLYGDTR 120
Query 121 SLAEVYP 127
+ A P
Sbjct 121 ATASALP 127
>gi|296167332|ref|ZP_06849734.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897276|gb|EFG76880.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=139
Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/127 (67%), Positives = 103/127 (82%), Gaps = 0/127 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQ ELAWMFLQS +GGDLDEGFALLS+DFTYWS+VTR+ DK+TFRR VERRK
Sbjct 9 MTPFDDPQGELAWMFLQSTSDGGDLDEGFALLSDDFTYWSLVTRSTCDKQTFRRTVERRK 68
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
+ E+ I+L+RCVNEGETVVVE +G + YDSPFVCIF+TR+G+I+S+R YSDT+
Sbjct 69 RELELTIDLVRCVNEGETVVVEAQANGTTTSGVEYDSPFVCIFDTREGLIVSMRLYSDTR 128
Query 121 SLAEVYP 127
+ P
Sbjct 129 AHEAALP 135
>gi|118465210|ref|YP_884272.1| hypothetical protein MAV_5155 [Mycobacterium avium 104]
gi|118166497|gb|ABK67394.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=133
Score = 184 bits (468), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 84/127 (67%), Positives = 101/127 (80%), Gaps = 0/127 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQ ELAWMFLQS EGGDLDEGFALLS+DF+YW++ TR DK T RRA+ERRK
Sbjct 2 MTPFDDPQGELAWMFLQSTTEGGDLDEGFALLSDDFSYWTLYTRAACDKDTLRRAIERRK 61
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
Q E+ I+L+RCVNEGETVVVE G ++ YDSPFVCIF+TRDG+I+S+R YSDT+
Sbjct 62 QDLELTIDLVRCVNEGETVVVEAQAAGTTSAGVEYDSPFVCIFDTRDGLIVSMRVYSDTR 121
Query 121 SLAEVYP 127
++ P
Sbjct 122 AVDNAMP 128
>gi|41409656|ref|NP_962492.1| hypothetical protein MAP3558c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254777497|ref|ZP_05219013.1| hypothetical protein MaviaA2_22901 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41398488|gb|AAS06108.1| hypothetical protein MAP_3558c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459905|gb|EGO38817.1| ketosteroid isomerase-like protein [Mycobacterium avium subsp.
paratuberculosis S397]
Length=132
Score = 184 bits (468), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 84/127 (67%), Positives = 101/127 (80%), Gaps = 0/127 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQ ELAWMFLQS EGGDLDEGFALLS+DF+YW++ TR DK T RRA+ERRK
Sbjct 1 MTPFDDPQGELAWMFLQSTTEGGDLDEGFALLSDDFSYWTLYTRAACDKDTLRRAIERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
Q E+ I+L+RCVNEGETVVVE G ++ YDSPFVCIF+TRDG+I+S+R YSDT+
Sbjct 61 QDLELTIDLVRCVNEGETVVVEAQAAGTTSAGVEYDSPFVCIFDTRDGLIVSMRVYSDTR 120
Query 121 SLAEVYP 127
++ P
Sbjct 121 AVDNAMP 127
>gi|342859426|ref|ZP_08716080.1| hypothetical protein MCOL_11118 [Mycobacterium colombiense CECT
3035]
gi|342133667|gb|EGT86870.1| hypothetical protein MCOL_11118 [Mycobacterium colombiense CECT
3035]
Length=132
Score = 184 bits (466), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/127 (67%), Positives = 100/127 (79%), Gaps = 0/127 (0%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTPFDDPQ ELAWMFLQS +GGDLDEGFALLS+DFTYW++ TRT K FRR VERRK
Sbjct 1 MTPFDDPQGELAWMFLQSTSDGGDLDEGFALLSDDFTYWTLYTRTPCGKDAFRRGVERRK 60
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
Q E+ I+L+RCVNEGETVVVE G + YDSPFVCIF+TRDG+I+S+R YSDT+
Sbjct 61 QELELTIDLVRCVNEGETVVVEAQATGTTTAGVEYDSPFVCIFDTRDGLIVSMRLYSDTR 120
Query 121 SLAEVYP 127
++A P
Sbjct 121 AVATALP 127
>gi|333988730|ref|YP_004521344.1| hypothetical protein JDM601_0090 [Mycobacterium sp. JDM601]
gi|333484698|gb|AEF34090.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=129
Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 70/127 (56%), Positives = 90/127 (71%), Gaps = 0/127 (0%)
Query 2 TPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRKQ 61
TPFDDPQAELAWMFLQ LC DLDE F L+S+DF+YWSI+TR +D R+ VE R+
Sbjct 3 TPFDDPQAELAWMFLQGLCGDDDLDEIFTLVSDDFSYWSILTRQAIDADALRQQVEERRH 62
Query 62 VFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQS 121
+ ++L RC+NEGETVV+E H D V+AD YDSP V I + DG I+S+REY++T+
Sbjct 63 GLRITLDLTRCINEGETVVIEAHGDCVTADGGHYDSPLVFIVDCHDGRIVSVREYTETRF 122
Query 122 LAEVYPV 128
EV +
Sbjct 123 AGEVLGI 129
>gi|111223008|ref|YP_713802.1| Delta-5-3-ketosteroid isomerase-like protein (fragment) [Frankia
alni ACN14a]
gi|111150540|emb|CAJ62240.1| Delta-5-3-ketosteroid isomerase-like protein (Fragment) (partial
match) [Frankia alni ACN14a]
Length=125
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/61 (43%), Positives = 36/61 (60%), Gaps = 0/61 (0%)
Query 65 VNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
++IEL + EG VV G +A YD+ + IFE RDG I+S+REY+DT +A
Sbjct 62 LHIELTDVLAEGPRVVATWISRGTAAGGAPYDNRCLGIFEVRDGRIVSVREYADTDHVAR 121
Query 125 V 125
V
Sbjct 122 V 122
>gi|337764079|emb|CCB72788.1| Delta-5-3-ketosteroid isomerase-like protein (Partial match)
[Streptomyces cattleya NRRL 8057]
Length=139
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/54 (47%), Positives = 32/54 (60%), Gaps = 0/54 (0%)
Query 67 IELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
I L V +GE VV E GV+A YD+ + +F RDG I S+REY+DTQ
Sbjct 73 ITLTNVVADGEQVVAEWTSRGVAAGGRAYDNACLGVFTVRDGRITSVREYTDTQ 126
>gi|239814324|ref|YP_002943234.1| hypothetical protein Vapar_1317 [Variovorax paradoxus S110]
gi|239800901|gb|ACS17968.1| protein of unknown function DUF1486 [Variovorax paradoxus S110]
Length=138
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/47 (49%), Positives = 29/47 (62%), Gaps = 0/47 (0%)
Query 75 EGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQS 121
EG+TV+VE A+ RYD+ + IFE RDG I +REY DTQ
Sbjct 85 EGDTVIVELRLQATLANGRRYDNDYCFIFELRDGRIARVREYMDTQK 131
>gi|312197768|ref|YP_004017829.1| limonene-12-epoxide hydrolase [Frankia sp. EuI1c]
gi|311229104|gb|ADP81959.1| limonene-12-epoxide hydrolase [Frankia sp. EuI1c]
Length=136
Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/61 (37%), Positives = 32/61 (53%), Gaps = 0/61 (0%)
Query 65 VNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
V L R V +GE V E + YD+ + +F+ RDG+I+S+REY DT +
Sbjct 72 VTQTLHRIVADGEYAVAEWTSHARARSGAAYDNDYAVLFQVRDGLIVSVREYCDTSYMKR 131
Query 125 V 125
V
Sbjct 132 V 132
>gi|332528246|ref|ZP_08404251.1| hypothetical protein HGR_00065 [Hylemonella gracilis ATCC 19624]
gi|332042266|gb|EGI78587.1| hypothetical protein HGR_00065 [Hylemonella gracilis ATCC 19624]
Length=151
Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/114 (27%), Positives = 53/114 (47%), Gaps = 10/114 (8%)
Query 22 GGDLDEGFALLSNDFTYWSIVTRTELD-------KKTFRRAVERRKQVFEVNIEL--IRC 72
DL+ +++D T+W I + E K R R E +++ + C
Sbjct 19 ASDLEGALDTMADDATWW-IPGKKERSPSAGLYPKDKIGRLFHRMVNALENGLKMTVVFC 77
Query 73 VNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVY 126
+ EG+ V +E +G + +Y + + + RDG I+S+REY DTQ +V+
Sbjct 78 IAEGDHVALEVVSEGDLKNGRQYRQEYHMLLKFRDGKIVSVREYLDTQHANDVW 131
>gi|167647193|ref|YP_001684856.1| ketosteroid isomerase-like protein [Caulobacter sp. K31]
gi|167349623|gb|ABZ72358.1| ketosteroid isomerase-like protein [Caulobacter sp. K31]
Length=134
Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/119 (32%), Positives = 61/119 (52%), Gaps = 15/119 (12%)
Query 23 GDLDEGFALLSNDFTYWSIVTRTEL-----DKKTFRRAVERRKQVFEV----NIELIRCV 73
GDLD F+ +++D WS+V + ++ A E +Q+ E+ +I R +
Sbjct 19 GDLDGLFSTMTDD-VQWSVVGPPDAFPLFGERSGREGAAEYFRQLGEILDITSISPNRFL 77
Query 74 NEGETVVVEGHCDGV---SADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVYPVA 129
++G+TVVV GH DGV + + R + +V +F RD I S +E+ DT L + A
Sbjct 78 SDGDTVVVLGHTDGVMRKTGEPVR--TQWVHVFTLRDDQIASYQEFVDTTVLMKAQMAA 134
>gi|329765414|ref|ZP_08256994.1| Ketosteroid isomerase-related protein [Candidatus Nitrosoarchaeum
limnia SFB1]
gi|329138320|gb|EGG42576.1| Ketosteroid isomerase-related protein [Candidatus Nitrosoarchaeum
limnia SFB1]
Length=134
Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/46 (40%), Positives = 29/46 (64%), Gaps = 0/46 (0%)
Query 79 VVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
VVV G DGVS + ++ PF ++E ++ II R+++DTQ + E
Sbjct 86 VVVLGIYDGVSKNDKKFQVPFSHVYEIKENKIIKFRQFTDTQRIHE 131
>gi|256389606|ref|YP_003111170.1| Limonene-12-epoxide hydrolase [Catenulispora acidiphila DSM 44928]
gi|256355832|gb|ACU69329.1| Limonene-12-epoxide hydrolase [Catenulispora acidiphila DSM 44928]
Length=137
Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/61 (35%), Positives = 32/61 (53%), Gaps = 0/61 (0%)
Query 65 VNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
V I L+ +G+ V+ E +A+ YD+ +F DG I S+REY+DT +A
Sbjct 72 VVIALVHAFADGDQVLAEWTSKATAANGATYDNRCAAVFTVEDGKITSVREYADTHHVAA 131
Query 125 V 125
V
Sbjct 132 V 132
>gi|334345520|ref|YP_004554072.1| hypothetical protein Sphch_1886 [Sphingobium chlorophenolicum
L-1]
gi|334102142|gb|AEG49566.1| hypothetical protein Sphch_1886 [Sphingobium chlorophenolicum
L-1]
Length=139
Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/110 (30%), Positives = 48/110 (44%), Gaps = 7/110 (6%)
Query 22 GGDLDEGFALLSNDFTYWSIVTRTELD----KKTFRRAVERRKQVFEVNIELI--RCVNE 75
G+LD LLS D W + + KK FR + F + I + E
Sbjct 19 AGNLDHAHELLSEDLV-WEVAGNPPISGTFGKKEFRANNLKILDTFVAWPKWIVDSAIVE 77
Query 76 GETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEV 125
GE V + H + +Y + F+ + RDG I+ + EY DTQ +AE+
Sbjct 78 GEKVAILSHSAADTPGGFQYRNKFMMLCTVRDGKILHVIEYMDTQHVAEL 127
>gi|334342799|ref|YP_004555403.1| hypothetical protein Sphch_3294 [Sphingobium chlorophenolicum
L-1]
gi|334103474|gb|AEG50897.1| hypothetical protein Sphch_3294 [Sphingobium chlorophenolicum
L-1]
Length=144
Score = 42.7 bits (99), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/68 (31%), Positives = 36/68 (53%), Gaps = 1/68 (1%)
Query 62 VFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDS-PFVCIFETRDGMIISLREYSDTQ 120
V + + V++G+TVV+E DG ++ RYD +V I + +DG + +REY D
Sbjct 77 VARTSFNITNMVSDGDTVVIEFFIDGYTSAGARYDKVEYVSIVQLKDGKVTQVREYMDAL 136
Query 121 SLAEVYPV 128
E + +
Sbjct 137 KAKEAHSL 144
>gi|256390623|ref|YP_003112187.1| Limonene-12-epoxide hydrolase [Catenulispora acidiphila DSM 44928]
gi|256356849|gb|ACU70346.1| Limonene-12-epoxide hydrolase [Catenulispora acidiphila DSM 44928]
Length=133
Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/57 (34%), Positives = 30/57 (53%), Gaps = 0/57 (0%)
Query 64 EVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
E+ +++ EGE V+VE Y + +F RDG I+++REY+DTQ
Sbjct 68 EIGVKVTNVFGEGEQVLVEWDSWATGKTGRPYHEKNIGVFVVRDGKIVAMREYADTQ 124
>gi|342857663|ref|ZP_08714319.1| hypothetical protein MCOL_02265 [Mycobacterium colombiense CECT
3035]
gi|342134996|gb|EGT88162.1| hypothetical protein MCOL_02265 [Mycobacterium colombiense CECT
3035]
Length=154
Score = 42.0 bits (97), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 25/116 (22%), Positives = 51/116 (44%), Gaps = 3/116 (2%)
Query 6 DPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRKQVFEV 65
D LAW+ + L G++ +G ++ T+W +VTR + + ++ +
Sbjct 5 DTATALAWLPWR-LISQGEISQGLGTFDDEGTWWDMVTRQNHPMAAMKAVLAEILEIVPM 63
Query 66 NIELIRCVNEGETVVVEGHCDGVSADRTRYDS--PFVCIFETRDGMIISLREYSDT 119
+ + + EG V + G ++ Y++ FV +I+++REY DT
Sbjct 64 RFDFVDSIVEGNHVALMVESHGAVSETAIYNNVYTFVTELAADRNVIVAVREYVDT 119
>gi|340344716|ref|ZP_08667848.1| Ketosteroid isomerase-related protein-like protein [Nitrosopumilus
sp. MY1]
gi|339519857|gb|EGP93580.1| Ketosteroid isomerase-related protein-like protein [Nitrosopumilus
sp. MY1]
Length=127
Score = 40.4 bits (93), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 13/54 (25%), Positives = 32/54 (60%), Gaps = 0/54 (0%)
Query 71 RCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
+ ++ + V+V G G+S ++D PF +++ +D I+ R+++DT+ + +
Sbjct 71 QFLDAADNVIVIGRYQGISKKGNKFDIPFSHVYQIKDNKIMQFRQFTDTKKIHD 124
>gi|71280932|ref|YP_269245.1| hypothetical protein CPS_2529 [Colwellia psychrerythraea 34H]
gi|71146672|gb|AAZ27145.1| conserved hypothetical protein [Colwellia psychrerythraea 34H]
Length=134
Score = 40.4 bits (93), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 29/109 (27%), Positives = 47/109 (44%), Gaps = 9/109 (8%)
Query 27 EGFALLSNDFTYWSIVTR-------TELDKKTFRRAVERRKQVFEVNIELIRC--VNEGE 77
EG L N+ W ++ R E+DK + +F + L G+
Sbjct 23 EGVLSLLNNAAIWRVMGREGGLPLSGEMDKIAVADLMNTVNNIFPKGMTLTPSGWTASGD 82
Query 78 TVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVY 126
V E G +D T Y++ + +FE +D II++REY DT + V+
Sbjct 83 RVAAEVESYGQKSDGTIYNNMYHFLFEIKDEQIITIREYMDTLHVKSVF 131
>gi|111022377|ref|YP_705349.1| hypothetical protein RHA1_ro05411 [Rhodococcus jostii RHA1]
gi|110821907|gb|ABG97191.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=133
Score = 40.4 bits (93), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 29/112 (26%), Positives = 52/112 (47%), Gaps = 9/112 (8%)
Query 23 GDLDEGFALLSNDFTYWSIVTRTE-----LDKKTFRRAVERRKQVFE---VNIELIRCVN 74
GD+++ L D TYW + R + +K+ + V+E + I +
Sbjct 21 GDVEQILDTLHPDATYW-VSGRIDGMSGSYNKEGLGALLGGVTSVYENGALEITPTSAIA 79
Query 75 EGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVY 126
+G+ V VE + Y + + +FE RDG I+S++EY DTQ +++
Sbjct 80 QGDKVAVEATSYAKLKNGKVYQNAYHFLFEFRDGKILSVKEYMDTQHAYDIF 131
>gi|334342811|ref|YP_004555415.1| hypothetical protein Sphch_3310 [Sphingobium chlorophenolicum
L-1]
gi|334103486|gb|AEG50909.1| hypothetical protein Sphch_3310 [Sphingobium chlorophenolicum
L-1]
Length=142
Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/103 (25%), Positives = 48/103 (47%), Gaps = 5/103 (4%)
Query 23 GDLDEGFALLSNDFTYW---SIVTRTELDKKTFRRAVERRKQVFEVNIELI--RCVNEGE 77
GD+ LLS+D +W + + K FR+ E +F+ + + + EG+
Sbjct 20 GDIGYAHELLSDDLEWWVAGDLPFSGTITKPVFRKNNEMLVGLFKTFPKWVADSAIAEGD 79
Query 78 TVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
+ H +G + +Y + + + RDG+I+ + EY DT+
Sbjct 80 KAALICHSEGETKGGFQYRNKYHMLVTVRDGLIVRVEEYMDTK 122
>gi|254168902|ref|ZP_04875742.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
gi|197622166|gb|EDY34741.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
Length=428
Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/97 (35%), Positives = 48/97 (50%), Gaps = 15/97 (15%)
Query 53 RRAVERRKQV---FEVNIELIRCVNEGETVVVEGHCDGVSAD-RTRYDSPFV-------- 100
R V++ KQ FE+ E I +N E+ + G D VSAD TR ++ F+
Sbjct 74 RGKVQKTKQAKLGFEILPEEIEVLNRAESPLPLGVADKVSADLETRIENRFLDLRKREVL 133
Query 101 CIFETRDGMIISLREYSDTQSLAEVYP---VACATPG 134
+FE R ++ +REY Q+ EV+ VA AT G
Sbjct 134 AVFEIRSTILQGIREYLLNQNFVEVHTPKIVATATEG 170
>gi|269126944|ref|YP_003300314.1| hypothetical protein Tcur_2725 [Thermomonospora curvata DSM 43183]
gi|268311902|gb|ACY98276.1| protein of unknown function DUF1486 [Thermomonospora curvata
DSM 43183]
Length=148
Score = 39.7 bits (91), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/114 (26%), Positives = 50/114 (44%), Gaps = 2/114 (1%)
Query 10 ELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELD-KKTFRRAVERRKQVFEVNIE 68
E+ F +L E GD++ + + D W + E D +++ + R+ V +V E
Sbjct 29 EIVSAFFAAL-ESGDIEAIREIYAPDIVVWHNYDQVEQDLEQSLKVLKAFRRTVSDVRYE 87
Query 69 LIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSL 122
+IR + G+ V+ + G +D P + RDG I + EY D L
Sbjct 88 IIRQADTGDGVLQQHILHGRLSDGRNIAIPAAIYLQVRDGRITRIEEYLDPSPL 141
>gi|325921558|ref|ZP_08183405.1| ketosteroid isomerase-like protein [Xanthomonas gardneri ATCC
19865]
gi|325547956|gb|EGD18963.1| ketosteroid isomerase-like protein [Xanthomonas gardneri ATCC
19865]
Length=138
Score = 39.7 bits (91), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/111 (31%), Positives = 51/111 (46%), Gaps = 13/111 (11%)
Query 23 GDLDEGFALLSNDFTY------WSIV-TRTELD--KKTFRRAVERRKQVFEVNIELIRCV 73
GD AL++ND + W + T LD RRA E + E + V
Sbjct 20 GDSQALLALVANDIEWIIPGEDWPLAGTHRGLDGVADALRRASEEVSFSYP---EPLEYV 76
Query 74 NEGETVVVEGHCDG-VSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLA 123
+G+ V+V G G ++A ++ FV R+G + +REY DTQ+LA
Sbjct 77 AQGDRVLVIGFAKGTITATNRTFEDYFVFAITVRNGKLAHIREYIDTQALA 127
>gi|254166888|ref|ZP_04873742.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
gi|289596142|ref|YP_003482838.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
gi|197624498|gb|EDY37059.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
gi|289533929|gb|ADD08276.1| aspartyl-tRNA synthetase [Aciduliprofundum boonei T469]
Length=428
Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/94 (35%), Positives = 47/94 (50%), Gaps = 15/94 (15%)
Query 56 VERRKQV---FEVNIELIRCVNEGETVVVEGHCDGVSAD-RTRYDSPFV--------CIF 103
V++ KQ FE+ E I +N E+ + G D VSAD TR ++ F+ +F
Sbjct 77 VQKTKQAKLGFEILPEEIEVLNRAESPLPLGVADKVSADLETRIENRFLDLRKREVLAVF 136
Query 104 ETRDGMIISLREYSDTQSLAEVYP---VACATPG 134
E R ++ +REY Q+ EV+ VA AT G
Sbjct 137 EIRSTILQGIREYLLNQNFVEVHTPKIVATATEG 170
>gi|146299901|ref|YP_001194492.1| hypothetical protein Fjoh_2145 [Flavobacterium johnsoniae UW101]
gi|146154319|gb|ABQ05173.1| hypothetical protein Fjoh_2145 [Flavobacterium johnsoniae UW101]
Length=122
Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 47/116 (41%), Gaps = 5/116 (4%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
MTP E F+ S D A L++D + + K+ F + +E
Sbjct 1 MTPNKQTVNEYMAAFMVS-----DHARILACLTDDVVWEMPGIYQHVGKEAFDKEIENDN 55
Query 61 QVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREY 116
V I++I+ + E V+ EG G + + D+ F +FE +G I L Y
Sbjct 56 FVGSPTIQIIKLIEENNIVIAEGAVQGNMKNGGKLDAVFCDVFEMENGKIKKLTSY 111
>gi|94495237|ref|ZP_01301818.1| hypothetical protein SKA58_02050 [Sphingomonas sp. SKA58]
gi|94425503|gb|EAT10523.1| hypothetical protein SKA58_02050 [Sphingomonas sp. SKA58]
Length=130
Score = 39.3 bits (90), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/115 (30%), Positives = 56/115 (49%), Gaps = 11/115 (9%)
Query 20 CEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVE-----RRKQVFEVNIELIRCVN 74
E GD++ AL D +W I+ ++ + F +V+ +K+ EV I
Sbjct 18 VEEGDVETIEALQHPDCQWW-ILGHGDMSRADFIASVKGGLLSAQKRTAEV----IAITA 72
Query 75 EGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVYPVA 129
EG+ V VE + V DR Y + + + RDG I+S +EY DT+++AE +
Sbjct 73 EGDRVAVEVRGEMVFPDRV-YRNDYHNLLIIRDGQIVSGKEYMDTRAVAEAFSAG 126
>gi|13474406|ref|NP_105974.1| hypothetical protein mlr5282 [Mesorhizobium loti MAFF303099]
gi|14025159|dbj|BAB51760.1| mlr5282 [Mesorhizobium loti MAFF303099]
Length=148
Score = 39.3 bits (90), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 23/81 (29%), Positives = 40/81 (50%), Gaps = 1/81 (1%)
Query 56 VERRKQVFEVNIELIRCVNEGETVVVEGHCDG-VSADRTRYDSPFVCIFETRDGMIISLR 114
+E + E + E V +G+ V+V G G + A ++ +V RDG + ++R
Sbjct 58 LETASKSIETSTEPQEFVAQGDRVLVVGFARGKIKATNKVFEDDWVFAITVRDGRLTNIR 117
Query 115 EYSDTQSLAEVYPVACATPGR 135
EY DTQ+LA + + P +
Sbjct 118 EYVDTQALARASQMDASGPAQ 138
>gi|338973295|ref|ZP_08628662.1| hypothetical protein CSIRO_1745 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338233604|gb|EGP08727.1| hypothetical protein CSIRO_1745 [Bradyrhizobiaceae bacterium
SG-6C]
Length=142
Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/53 (34%), Positives = 30/53 (57%), Gaps = 0/53 (0%)
Query 73 VNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEV 125
+ EG+ VVVE D V+ RYD+ + +++ +G I ++EY DT + V
Sbjct 77 IAEGDFVVVEAKGDNVTRSGVRYDNDYCMVWKVENGRITQIKEYCDTALVERV 129
>gi|111224856|ref|YP_715650.1| putative terpene synthesis protein [Frankia alni ACN14a]
gi|111152388|emb|CAJ64124.1| putative terpene synthesis protein [Frankia alni ACN14a]
Length=147
Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 36/116 (32%), Positives = 54/116 (47%), Gaps = 7/116 (6%)
Query 23 GDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRKQVFEVN---IELIRCVNEGETV 79
G++D A + D TY V E + A E +Q + + E++ V++ V
Sbjct 23 GNMDRLLAQFAEDATYQMSVPGREALRGRELIATELARQAGDYSDCVCEILTVVSDDRHV 82
Query 80 VVEGHCDGVSA--DRTRYDSPFVCIFETRD-GMIISLREYSDTQSLAEVYPVACAT 132
V E D V+ D TR +P + IFE D G+I++ REY D SL+ V T
Sbjct 83 VTE-RIDHVTMLHDGTRVSNPLLAIFEVNDDGLIVAWREYWDALSLSHRMGVDPGT 137
>gi|94498797|ref|ZP_01305344.1| hypothetical protein SKA58_10240 [Sphingomonas sp. SKA58]
gi|94421764|gb|EAT06818.1| hypothetical protein SKA58_10240 [Sphingomonas sp. SKA58]
Length=142
Score = 38.5 bits (88), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/130 (27%), Positives = 57/130 (44%), Gaps = 9/130 (6%)
Query 4 FDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVT----RTELDKKTFRRAVERR 59
+P+A +A FL S G D++ L+ D T+W + +DK T + +
Sbjct 1 MSEPRAIVA-AFLDSFSSG-DVEAILEYLTEDATWWVLGNVHGMSGTIDKATLGSLLRKV 58
Query 60 KQVF---EVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREY 116
K ++ + I ++EG V E D Y + + F+ I S+REY
Sbjct 59 KPLYLDGALTITPSSMISEGRQVSAEATSHARLCDGRVYANAYHFRFDLDGDRICSVREY 118
Query 117 SDTQSLAEVY 126
SDTQ + E++
Sbjct 119 SDTQHMLEIF 128
>gi|260775727|ref|ZP_05884623.1| hypothetical protein VIC_001108 [Vibrio coralliilyticus ATCC
BAA-450]
gi|260608143|gb|EEX34312.1| hypothetical protein VIC_001108 [Vibrio coralliilyticus ATCC
BAA-450]
Length=118
Score = 38.5 bits (88), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 16/47 (35%), Positives = 26/47 (56%), Gaps = 0/47 (0%)
Query 76 GETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSL 122
G+T+V EG D + D + YD+ F I +DG+I + ++D L
Sbjct 72 GDTLVCEGQVDYIRLDGSAYDAKFATILTLKDGLIYQYKIFADVSGL 118
>gi|217979484|ref|YP_002363631.1| hypothetical protein Msil_3364 [Methylocella silvestris BL2]
gi|217504860|gb|ACK52269.1| protein of unknown function DUF1486 [Methylocella silvestris
BL2]
Length=137
Score = 38.5 bits (88), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/79 (28%), Positives = 38/79 (49%), Gaps = 1/79 (1%)
Query 56 VERRKQVFEVNIELIRCVNEGETVVVEGHCDG-VSADRTRYDSPFVCIFETRDGMIISLR 114
+E + E + E V +G+ V+V G G + A + ++ RDG + ++R
Sbjct 58 LETASKSMETSTEPREFVAQGDRVLVVGFARGKIKATNKTFQDDWIFAITVRDGRLTNIR 117
Query 115 EYSDTQSLAEVYPVACATP 133
EY DTQ+LA + + P
Sbjct 118 EYVDTQALARASQMDASGP 136
>gi|325971539|ref|YP_004247730.1| aspartate kinase [Spirochaeta sp. Buddy]
gi|324026777|gb|ADY13536.1| aspartate kinase [Spirochaeta sp. Buddy]
Length=798
Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 34/130 (27%), Positives = 60/130 (47%), Gaps = 18/130 (13%)
Query 1 MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRK 60
+ P D + EL+ + L + +G D + ++L F+ W+++ T L + ++ +ER K
Sbjct 39 LAPLRDTEFELSSLLLSA--KGRD-ERLWSLQEQRFSSWTMLVETILSVPSGKKVLERIK 95
Query 61 QVFEVNIE-------LIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISL 113
Q F +IE L+ V+ G +E CD AD +C + T G+ +
Sbjct 96 QGF-ADIEDILRSVWLVEEVSSGVQRYLEKLCDSWIAD-------LLCHYATTSGITADM 147
Query 114 REYSDTQSLA 123
EY S+A
Sbjct 148 AEYGAVSSIA 157
>gi|300115442|ref|YP_003762017.1| hypothetical protein Nwat_2961 [Nitrosococcus watsonii C-113]
gi|299541379|gb|ADJ29696.1| conserved hypothetical protein [Nitrosococcus watsonii C-113]
Length=130
Score = 38.1 bits (87), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 17/48 (36%), Positives = 28/48 (59%), Gaps = 1/48 (2%)
Query 73 VNEGETVVVEGHCDG-VSADRTRYDSPFVCIFETRDGMIISLREYSDT 119
V +G TVVV G G A + +PF +++ +DG ++S +Y+DT
Sbjct 74 VADGSTVVVLGQYSGSYKATGKSFKAPFAHVWKFKDGKVVSFHQYTDT 121
>gi|325921561|ref|ZP_08183408.1| ketosteroid isomerase-like protein [Xanthomonas gardneri ATCC
19865]
gi|325547959|gb|EGD18966.1| ketosteroid isomerase-like protein [Xanthomonas gardneri ATCC
19865]
Length=122
Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 21/52 (41%), Positives = 29/52 (56%), Gaps = 1/52 (1%)
Query 73 VNEGETVVVEGHCDG-VSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLA 123
V +G+ V+V G G V A ++ FV RDG + +REY DTQ+LA
Sbjct 60 VAQGDRVLVIGLATGKVKATNKPFEDHFVFAITVRDGKVAHIREYVDTQALA 111
>gi|337762648|emb|CCB71356.1| Ketosteroid isomerase-like protein [Streptomyces cattleya NRRL
8057]
Length=137
Score = 37.7 bits (86), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/57 (32%), Positives = 28/57 (50%), Gaps = 0/57 (0%)
Query 64 EVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
E+ + + EG+ V+VE Y + F RDG I+++REY+DTQ
Sbjct 70 EIGLRVTNVFGEGDQVLVEWDSWATGKTGRAYHEKNIGAFVVRDGRIVTMREYADTQ 126
>gi|254822841|ref|ZP_05227842.1| putative terpene synthesis protein [Mycobacterium intracellulare
ATCC 13950]
Length=147
Score = 37.7 bits (86), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/109 (31%), Positives = 51/109 (47%), Gaps = 6/109 (5%)
Query 21 EGGDLDEGFALLSNDFTYWSIVTRTEL--DKKTFRRAVERRKQVFEVNIE---LIRCVNE 75
E D+D AL++ DF + V + + + R +E+ + +E + V++
Sbjct 25 ESMDIDAMTALMAEDFVWQLHVPLSPVVRGRDAARAELEKHNTLSTGMVEGSEIRTIVSD 84
Query 76 GETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAE 124
G+TVVVE D S + IFE RDG I REY DT +A+
Sbjct 85 GDTVVVE-RVDVNSMNGVAVTFYVTAIFEVRDGAIAHWREYWDTAHVAQ 132
>gi|288919425|ref|ZP_06413758.1| Limonene-12-epoxide hydrolase [Frankia sp. EUN1f]
gi|288349213|gb|EFC83457.1| Limonene-12-epoxide hydrolase [Frankia sp. EUN1f]
Length=154
Score = 37.7 bits (86), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 28/108 (26%), Positives = 45/108 (42%), Gaps = 4/108 (3%)
Query 24 DLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRKQVFE----VNIELIRCVNEGETV 79
D + +D YW + L K + V R K V + + IR V+EG+TV
Sbjct 25 DWEHLHVFFGDDSVYWDVPAGPTLAAKGPKGIVARAKSVLDSLATFENDQIRTVSEGDTV 84
Query 80 VVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQSLAEVYP 127
+ E H + P + + +DG+I ++Y D Q+L P
Sbjct 85 MTEHHEIWRFSSGEEIVLPCLSVQVVQDGVIALWKDYWDMQTLMGPAP 132
>gi|254500900|ref|ZP_05113051.1| CAIB/BAIF family [Labrenzia alexandrii DFL-11]
gi|222436971|gb|EEE43650.1| CAIB/BAIF family [Labrenzia alexandrii DFL-11]
Length=381
Score = 37.4 bits (85), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 32/118 (28%), Positives = 51/118 (44%), Gaps = 10/118 (8%)
Query 8 QAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTRTELDKKTFRRAVERRKQ-----V 62
QA+ M L E G GF L+ D T + + +RRA+ + Q +
Sbjct 127 QADSGMMSLTGFPETGPTRIGFMLV--DVT-TGVSAAYAIAAALYRRAMTGKGQFLDVAM 183
Query 63 FEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDTQ 120
++ ++L+ C +G +V GH + + + P FET DGMI+ SD Q
Sbjct 184 YDTALQLMCC--QGADYLVRGHLPPLMGNESPMTQPTAGTFETLDGMILLATLTSDQQ 239
>gi|322371794|ref|ZP_08046337.1| hypothetical protein ZOD2009_19873 [Haladaptatus paucihalophilus
DX253]
gi|320548679|gb|EFW90350.1| hypothetical protein ZOD2009_19873 [Haladaptatus paucihalophilus
DX253]
Length=128
Score = 37.0 bits (84), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 23/71 (33%), Positives = 34/71 (48%), Gaps = 3/71 (4%)
Query 49 KKTFRRAVERRKQVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDG 108
+ F A+ER + +E R ++ G+TVV G + D R +SPF + E DG
Sbjct 57 ENVFVPAMERFESF---RVEPDRFIDGGDTVVAVGTFHATTEDGERIESPFAHVNELHDG 113
Query 109 MIISLREYSDT 119
I Y+DT
Sbjct 114 HITRFVNYTDT 124
>gi|334338636|ref|YP_004543616.1| hypothetical protein Desru_0024 [Desulfotomaculum ruminis DSM
2154]
gi|334089990|gb|AEG58330.1| hypothetical protein Desru_0024 [Desulfotomaculum ruminis DSM
2154]
Length=133
Score = 37.0 bits (84), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 31/127 (25%), Positives = 61/127 (49%), Gaps = 17/127 (13%)
Query 5 DDPQAELAWMFLQSLCEGG---DLDEGFALLSNDFTYWSIVTRTELDKK--TFRRAVERR 59
++ ELA F++++ +G +L+ + +N Y +++T+ +++ + A +
Sbjct 2 ENKNVELAIEFIKAVSDGKIGEELNNFYDESANQIEYPNLLTKKIVERNLDAIKDASIKG 61
Query 60 KQVFEV-NIELIRCVNEGETVVVEGHCDG---------VSADRTRYDSPFVCIFETRDGM 109
KQV + N E+I+ + G TV++E G + D + + F FE RDG
Sbjct 62 KQVISIQNYEIIKAYSCGNTVIIEAIWTGRLAIPLGKLKAGDEMK--AYFAQFFEFRDGK 119
Query 110 IISLREY 116
I+ R Y
Sbjct 120 IVKQRNY 126
>gi|27377616|ref|NP_769145.1| hypothetical protein blr2505 [Bradyrhizobium japonicum USDA 110]
gi|27350761|dbj|BAC47770.1| blr2505 [Bradyrhizobium japonicum USDA 110]
Length=159
Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 17/49 (35%), Positives = 27/49 (56%), Gaps = 0/49 (0%)
Query 71 RCVNEGETVVVEGHCDGVSADRTRYDSPFVCIFETRDGMIISLREYSDT 119
R + + + VVVE D V+ + RYD+ + +F G I +REY D+
Sbjct 92 RFIADSDIVVVEAKGDNVTPEGVRYDNDYCLVFRLEGGRIKEIREYCDS 140
Lambda K H
0.323 0.138 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128858389450
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40