BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1065
Length=188
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608205|ref|NP_215581.1| hypothetical protein Rv1065 [Mycoba... 374 4e-102
gi|297633597|ref|ZP_06951377.1| hypothetical protein MtubK4_0571... 372 2e-101
gi|289761205|ref|ZP_06520583.1| LOW QUALITY PROTEIN: conserved h... 312 1e-83
gi|339294062|gb|AEJ46173.1| hypothetical protein CCDC5079_0983 [... 309 1e-82
gi|342861658|ref|ZP_08718304.1| hypothetical protein MCOL_22331 ... 294 4e-78
gi|118616104|ref|YP_904436.1| hypothetical protein MUL_0216 [Myc... 292 2e-77
gi|240170300|ref|ZP_04748959.1| hypothetical protein MkanA1_1338... 279 1e-73
gi|183984372|ref|YP_001852663.1| hypothetical protein MMAR_4401 ... 270 6e-71
gi|41407112|ref|NP_959948.1| hypothetical protein MAP1014 [Mycob... 267 5e-70
gi|118463236|ref|YP_880438.1| cysteine dioxygenase type I superf... 267 6e-70
gi|254823412|ref|ZP_05228413.1| cysteine dioxygenase type I supe... 264 4e-69
gi|296169932|ref|ZP_06851541.1| cysteine dioxygenase type I fami... 261 3e-68
gi|108801132|ref|YP_641329.1| cysteine dioxygenase type I [Mycob... 252 2e-65
gi|169628275|ref|YP_001701924.1| hypothetical protein MAB_1182 [... 246 9e-64
gi|145222621|ref|YP_001133299.1| cysteine dioxygenase type I [My... 245 2e-63
gi|120405636|ref|YP_955465.1| cysteine dioxygenase type I [Mycob... 238 3e-61
gi|118468952|ref|YP_889526.1| cysteine dioxygenase type I [Mycob... 230 8e-59
gi|333989653|ref|YP_004522267.1| hypothetical protein JDM601_101... 229 1e-58
gi|312138063|ref|YP_004005399.1| cysteine dioxygenase [Rhodococc... 221 4e-56
gi|226304125|ref|YP_002764083.1| cysteine dioxygenase [Rhodococc... 218 3e-55
gi|54022373|ref|YP_116615.1| hypothetical protein nfa4090 [Nocar... 218 5e-55
gi|226363744|ref|YP_002781526.1| cysteine dioxygenase [Rhodococc... 216 1e-54
gi|111021390|ref|YP_704362.1| cysteine dioxygenase [Rhodococcus ... 216 2e-54
gi|317507886|ref|ZP_07965584.1| cysteine dioxygenase type I [Seg... 203 1e-50
gi|343926937|ref|ZP_08766430.1| putative cysteine dioxygenase [G... 199 2e-49
gi|296392945|ref|YP_003657829.1| cysteine dioxygenase type I [Se... 197 5e-49
gi|262203211|ref|YP_003274419.1| cysteine dioxygenase type I [Go... 195 3e-48
gi|229494302|ref|ZP_04388065.1| cysteine dioxygenase type I [Rho... 194 4e-48
gi|296141813|ref|YP_003649056.1| cysteine dioxygenase type I [Ts... 170 9e-41
gi|331694427|ref|YP_004330666.1| cysteine dioxygenase type I [Ps... 131 4e-29
gi|325002467|ref|ZP_08123579.1| cysteine dioxygenase type I [Pse... 125 3e-27
gi|297560351|ref|YP_003679325.1| cysteine dioxygenase type I [No... 121 4e-26
gi|269124738|ref|YP_003298108.1| cysteine dioxygenase type I [Th... 99.4 2e-19
gi|311743303|ref|ZP_07717110.1| cysteine dioxygenase type I fami... 96.7 1e-18
gi|311899554|dbj|BAJ31962.1| putative cysteine dioxygenase [Kita... 92.0 4e-17
gi|291299203|ref|YP_003510481.1| cysteine dioxygenase type I [St... 87.8 8e-16
gi|302867356|ref|YP_003835993.1| cysteine dioxygenase type I [Mi... 86.3 2e-15
gi|302524883|ref|ZP_07277225.1| cysteine dioxygenase [Streptomyc... 85.9 2e-15
gi|300783535|ref|YP_003763826.1| cysteine dioxygenase [Amycolato... 85.9 3e-15
gi|299137588|ref|ZP_07030769.1| cysteine dioxygenase type I [Aci... 83.6 1e-14
gi|256392269|ref|YP_003113833.1| cysteine dioxygenase type I [Ca... 83.2 2e-14
gi|330468584|ref|YP_004406327.1| cysteine dioxygenase type i [Ve... 80.1 2e-13
gi|320105306|ref|YP_004180896.1| cysteine dioxygenase type I [Te... 79.7 2e-13
gi|326332280|ref|ZP_08198560.1| cysteine dioxygenase type I fami... 79.3 3e-13
gi|297200127|ref|ZP_06917524.1| cysteine dioxygenase [Streptomyc... 79.0 3e-13
gi|182438295|ref|YP_001826014.1| putative cysteine dioxygenase [... 78.6 5e-13
gi|297157872|gb|ADI07584.1| putative cysteine dioxygenase [Strep... 77.4 8e-13
gi|29831584|ref|NP_826218.1| cysteine dioxygenase [Streptomyces ... 77.4 8e-13
gi|239988188|ref|ZP_04708852.1| putative cysteine dioxygenase [S... 77.4 1e-12
gi|345015663|ref|YP_004818017.1| cysteine dioxygenase type I [St... 77.4 1e-12
>gi|15608205|ref|NP_215581.1| hypothetical protein Rv1065 [Mycobacterium tuberculosis H37Rv]
gi|15840498|ref|NP_335535.1| hypothetical protein MT1095 [Mycobacterium tuberculosis CDC1551]
gi|31792256|ref|NP_854749.1| hypothetical protein Mb1094 [Mycobacterium bovis AF2122/97]
70 more sequence titles
Length=188
Score = 374 bits (960), Expect = 4e-102, Method: Compositional matrix adjust.
Identities = 187/188 (99%), Positives = 188/188 (100%), Gaps = 0/188 (0%)
Query 1 VVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI 60
+VMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI
Sbjct 1 MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI 60
Query 61 HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG 120
HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG
Sbjct 61 HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG 120
Query 121 FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTEL 180
FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTEL
Sbjct 121 FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTEL 180
Query 181 TDQPEGSG 188
TDQPEGSG
Sbjct 181 TDQPEGSG 188
>gi|297633597|ref|ZP_06951377.1| hypothetical protein MtubK4_05713 [Mycobacterium tuberculosis
KZN 4207]
gi|297730583|ref|ZP_06959701.1| hypothetical protein MtubKR_05798 [Mycobacterium tuberculosis
KZN R506]
gi|313657911|ref|ZP_07814791.1| hypothetical protein MtubKV_05793 [Mycobacterium tuberculosis
KZN V2475]
Length=186
Score = 372 bits (954), Expect = 2e-101, Method: Compositional matrix adjust.
Identities = 186/186 (100%), Positives = 186/186 (100%), Gaps = 0/186 (0%)
Query 3 MPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHG 62
MPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHG
Sbjct 1 MPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHG 60
Query 63 DEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFP 122
DEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFP
Sbjct 61 DEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFP 120
Query 123 LGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTD 182
LGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTD
Sbjct 121 LGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTD 180
Query 183 QPEGSG 188
QPEGSG
Sbjct 181 QPEGSG 186
>gi|289761205|ref|ZP_06520583.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis GM 1503]
gi|289708711|gb|EFD72727.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis GM 1503]
Length=190
Score = 312 bits (800), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/157 (99%), Positives = 157/157 (100%), Gaps = 0/157 (0%)
Query 1 VVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI 60
+VMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI
Sbjct 1 MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRI 60
Query 61 HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG 120
HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG
Sbjct 61 HGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAG 120
Query 121 FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSP 157
FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSP
Sbjct 121 FPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSP 157
>gi|339294062|gb|AEJ46173.1| hypothetical protein CCDC5079_0983 [Mycobacterium tuberculosis
CCDC5079]
Length=153
Score = 309 bits (792), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 152/153 (99%), Positives = 153/153 (100%), Gaps = 0/153 (0%)
Query 36 VLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLS 95
+LGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLS
Sbjct 1 MLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLS 60
Query 96 GSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAY 155
GSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAY
Sbjct 61 GSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAY 120
Query 156 SPPLTAMSYYEITERNTLRRQRTELTDQPEGSG 188
SPPLTAMSYYEITERNTLRRQRTELTDQPEGSG
Sbjct 121 SPPLTAMSYYEITERNTLRRQRTELTDQPEGSG 153
>gi|342861658|ref|ZP_08718304.1| hypothetical protein MCOL_22331 [Mycobacterium colombiense CECT
3035]
gi|342130792|gb|EGT84088.1| hypothetical protein MCOL_22331 [Mycobacterium colombiense CECT
3035]
Length=196
Score = 294 bits (753), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 147/186 (80%), Positives = 161/186 (87%), Gaps = 9/186 (4%)
Query 11 AVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWL 70
+ PS GPTRLRV DLL ATDQAADDVL GRCDHLLP GG+P+++RW+TRIHGDEELD+WL
Sbjct 10 SAPSAGPTRLRVPDLLYATDQAADDVLSGRCDHLLPPGGIPESRRWFTRIHGDEELDVWL 69
Query 71 ISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVV 130
ISWVPGQPTELHDHGGSLGALTV+SGSLNEYRWDGR LRRRRLD+GDQAGFPLGWVHDVV
Sbjct 70 ISWVPGQPTELHDHGGSLGALTVVSGSLNEYRWDGRALRRRRLDSGDQAGFPLGWVHDVV 129
Query 131 WAPRPIGGPDAAGM---------AVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELT 181
WAPRP+ P + + V PTLSVHAYSPPLTAMSYYEIT+R TLRR RTELT
Sbjct 130 WAPRPVTVPVSLPVAGSPGAAAAPVRPTLSVHAYSPPLTAMSYYEITDRKTLRRDRTELT 189
Query 182 DQPEGS 187
DQPEG+
Sbjct 190 DQPEGA 195
>gi|118616104|ref|YP_904436.1| hypothetical protein MUL_0216 [Mycobacterium ulcerans Agy99]
gi|118568214|gb|ABL02965.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=191
Score = 292 bits (747), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/177 (82%), Positives = 155/177 (88%), Gaps = 2/177 (1%)
Query 9 TTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDI 68
T P+PGPTRLRV DLL ATDQ ADDVL GRCDHLLP+GGVP RW+TR+HGD+ELD+
Sbjct 15 TVTSPAPGPTRLRVPDLLHATDQVADDVLSGRCDHLLPEGGVPDDGRWFTRVHGDDELDV 74
Query 69 WLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHD 128
WLISWVPG TELHDHGGSLGALTVLSGSLNE+RWDG RLRRRRLDAGDQAGFPLGWVHD
Sbjct 75 WLISWVPGHATELHDHGGSLGALTVLSGSLNEFRWDGTRLRRRRLDAGDQAGFPLGWVHD 134
Query 129 VVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
VVWAPRP P AA + PTLSVHAYSPPLTAMSYYE+T+RNTLRR+RTELTD PE
Sbjct 135 VVWAPRPAAEPIAAPL--PPTLSVHAYSPPLTAMSYYEVTDRNTLRRKRTELTDHPE 189
>gi|240170300|ref|ZP_04748959.1| hypothetical protein MkanA1_13388 [Mycobacterium kansasii ATCC
12478]
Length=192
Score = 279 bits (714), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 150/192 (79%), Positives = 166/192 (87%), Gaps = 5/192 (2%)
Query 1 VVMPLVT-PTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTR 59
+ MPL+T P A P PGPTRLRV DLL ATDQ ADDVL GR DHLLP GG+P+T+RW+ R
Sbjct 1 MAMPLLTSPAVASPFPGPTRLRVPDLLHATDQVADDVLSGRYDHLLPRGGLPETERWFAR 60
Query 60 IHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQA 119
+HGD++LDIWLISWVPG TELHDHGGS+GALTVLSGSLNEYRWDGRRLRRRRLDAGDQA
Sbjct 61 VHGDDDLDIWLISWVPGHATELHDHGGSIGALTVLSGSLNEYRWDGRRLRRRRLDAGDQA 120
Query 120 GFPLGWVHDVVWAPR--PIGGPDAAGMA--VAPTLSVHAYSPPLTAMSYYEITERNTLRR 175
GFPLGWVHDVVWAPR P+ P ++ +APTLSVHAYSPPLTAMSYYE+TERNTLRR
Sbjct 121 GFPLGWVHDVVWAPRKAPVTEPAVEPLSSPIAPTLSVHAYSPPLTAMSYYEVTERNTLRR 180
Query 176 QRTELTDQPEGS 187
+RTELTDQPE S
Sbjct 181 RRTELTDQPEKS 192
>gi|183984372|ref|YP_001852663.1| hypothetical protein MMAR_4401 [Mycobacterium marinum M]
gi|183177698|gb|ACC42808.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=164
Score = 270 bits (691), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 133/162 (83%), Positives = 143/162 (89%), Gaps = 2/162 (1%)
Query 24 DLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHD 83
DLL ATDQ ADDVL GRCDHLLP+GGVP RW+TR+HGD+ELD+WLISWVPG TELHD
Sbjct 3 DLLHATDQVADDVLSGRCDHLLPEGGVPDDGRWFTRVHGDDELDVWLISWVPGHATELHD 62
Query 84 HGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAG 143
HGGSLGALTVLSGSLNE+RWDG RLRRRRLDAGDQAGFPLGWVHDVVWAPRP P AA
Sbjct 63 HGGSLGALTVLSGSLNEFRWDGTRLRRRRLDAGDQAGFPLGWVHDVVWAPRPAAEPIAA- 121
Query 144 MAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
+ PTLSVHAYSPPLTAMSYYE+T+ NTLRR+RTELTD PE
Sbjct 122 -PLPPTLSVHAYSPPLTAMSYYEVTDHNTLRRKRTELTDHPE 162
>gi|41407112|ref|NP_959948.1| hypothetical protein MAP1014 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254774075|ref|ZP_05215591.1| cysteine dioxygenase type I superfamily protein [Mycobacterium
avium subsp. avium ATCC 25291]
gi|41395463|gb|AAS03331.1| hypothetical protein MAP_1014 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=195
Score = 267 bits (683), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 147/174 (85%), Positives = 156/174 (90%), Gaps = 4/174 (2%)
Query 18 TRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQ 77
TRLRV DLL ATDQAADDVL GRCDHLLP GG+P ++RW+TRIHGDEELD+WLISWVPG
Sbjct 22 TRLRVPDLLYATDQAADDVLSGRCDHLLPPGGIPASRRWFTRIHGDEELDVWLISWVPGH 81
Query 78 PTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIG 137
PTELHDHGGSLGALTV+SGSLNEYRWDGR LRRRRLDAGDQAGFPLGWVHDVVWAPRP+
Sbjct 82 PTELHDHGGSLGALTVVSGSLNEYRWDGRALRRRRLDAGDQAGFPLGWVHDVVWAPRPVS 141
Query 138 GPDA----AGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGS 187
GP + A APTLSVHAYSPPLTAMSYY+ITER TLRRQRTELTDQPEGS
Sbjct 142 GPVSRRAVAAAQAAPTLSVHAYSPPLTAMSYYDITERKTLRRQRTELTDQPEGS 195
>gi|118463236|ref|YP_880438.1| cysteine dioxygenase type I superfamily protein [Mycobacterium
avium 104]
gi|118164523|gb|ABK65420.1| cysteine dioxygenase type I superfamily protein [Mycobacterium
avium 104]
gi|336461464|gb|EGO40334.1| Cysteine dioxygenase type I [Mycobacterium avium subsp. paratuberculosis
S397]
Length=197
Score = 267 bits (682), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 147/174 (85%), Positives = 156/174 (90%), Gaps = 4/174 (2%)
Query 18 TRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQ 77
TRLRV DLL ATDQAADDVL GRCDHLLP GG+P ++RW+TRIHGDEELD+WLISWVPG
Sbjct 24 TRLRVPDLLYATDQAADDVLSGRCDHLLPPGGIPASRRWFTRIHGDEELDVWLISWVPGH 83
Query 78 PTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIG 137
PTELHDHGGSLGALTV+SGSLNEYRWDGR LRRRRLDAGDQAGFPLGWVHDVVWAPRP+
Sbjct 84 PTELHDHGGSLGALTVVSGSLNEYRWDGRALRRRRLDAGDQAGFPLGWVHDVVWAPRPVS 143
Query 138 GPDA----AGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGS 187
GP + A APTLSVHAYSPPLTAMSYY+ITER TLRRQRTELTDQPEGS
Sbjct 144 GPVSRRAVAAAQAAPTLSVHAYSPPLTAMSYYDITERKTLRRQRTELTDQPEGS 197
>gi|254823412|ref|ZP_05228413.1| cysteine dioxygenase type I superfamily protein [Mycobacterium
intracellulare ATCC 13950]
Length=172
Score = 264 bits (675), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 142/169 (85%), Positives = 153/169 (91%), Gaps = 5/169 (2%)
Query 24 DLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHD 83
DLL ATDQAADDVL GRCDHLLP+GG+P++QRW+TRIHGDEELD+WLISWVPG PTELHD
Sbjct 3 DLLHATDQAADDVLSGRCDHLLPEGGIPESQRWFTRIHGDEELDVWLISWVPGHPTELHD 62
Query 84 HGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI-----GG 138
HGGSLGALTV+SGSLNEYRWDGR LRRRRLDAGDQAGFPLGWVHDVVWAPRP+ G
Sbjct 63 HGGSLGALTVVSGSLNEYRWDGRALRRRRLDAGDQAGFPLGWVHDVVWAPRPVTVPVTGL 122
Query 139 PDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGS 187
P A+ PTLSVHAYSPPLTAMSYY+IT+RNTLRRQRTELTDQPEGS
Sbjct 123 PGASAGPAQPTLSVHAYSPPLTAMSYYDITDRNTLRRQRTELTDQPEGS 171
>gi|296169932|ref|ZP_06851541.1| cysteine dioxygenase type I family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895396|gb|EFG75101.1| cysteine dioxygenase type I family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=191
Score = 261 bits (668), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/192 (73%), Positives = 154/192 (81%), Gaps = 8/192 (4%)
Query 1 VVMPLVTPTTAVPSP------GPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQ 54
+ +PL P + P P GPTRLRV DLL ATD+AADDVL GRCDHLLP GGVP ++
Sbjct 1 MSVPLAVPAASRPRPFSSPSAGPTRLRVPDLLHATDRAADDVLSGRCDHLLPPGGVPDSR 60
Query 55 RWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLD 114
RW+TRIHGDEELD+WLISWVPG TELHDHGGSLGALTV+SGSLNE+RWDGR LR+RRLD
Sbjct 61 RWFTRIHGDEELDVWLISWVPGHHTELHDHGGSLGALTVVSGSLNEFRWDGRALRQRRLD 120
Query 115 AGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLR 174
AGDQAGFPLGWVHDV P P + P+LSVHAYSPPLTAMSYY+IT RN LR
Sbjct 121 AGDQAGFPLGWVHDV--VWAPRPVPVPVSVPARPSLSVHAYSPPLTAMSYYQITGRNRLR 178
Query 175 RQRTELTDQPEG 186
RQRTELTDQPEG
Sbjct 179 RQRTELTDQPEG 190
>gi|108801132|ref|YP_641329.1| cysteine dioxygenase type I [Mycobacterium sp. MCS]
gi|119870264|ref|YP_940216.1| cysteine dioxygenase type I [Mycobacterium sp. KMS]
gi|126436961|ref|YP_001072652.1| cysteine dioxygenase type I [Mycobacterium sp. JLS]
gi|108771551|gb|ABG10273.1| cysteine dioxygenase type I [Mycobacterium sp. MCS]
gi|119696353|gb|ABL93426.1| cysteine dioxygenase type I [Mycobacterium sp. KMS]
gi|126236761|gb|ABO00162.1| cysteine dioxygenase type I [Mycobacterium sp. JLS]
Length=176
Score = 252 bits (643), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 133/185 (72%), Positives = 145/185 (79%), Gaps = 10/185 (5%)
Query 3 MPLVTPTTAVPS-PGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIH 61
M + T AVP+ PTRLR+ DLL ATD+ ADDVL GR DHLLP GGVP RWYTR+H
Sbjct 1 MSVHTLAPAVPAVSAPTRLRLPDLLHATDRGADDVLNGRYDHLLPRGGVPTDDRWYTRLH 60
Query 62 GDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGF 121
GD+ELDIWLISWVP + TELHDHGGSLGALTVLSGSL+E RWDG LR+RRL AGDQA F
Sbjct 61 GDDELDIWLISWVPERSTELHDHGGSLGALTVLSGSLSETRWDGEGLRQRRLAAGDQAAF 120
Query 122 PLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELT 181
PLGWVHDVVWAP G PTLSVHAYSPPLTAMSYYE+T+R TLRR RTELT
Sbjct 121 PLGWVHDVVWAPDTTTG---------PTLSVHAYSPPLTAMSYYEVTDRKTLRRNRTELT 171
Query 182 DQPEG 186
+ PEG
Sbjct 172 ESPEG 176
>gi|169628275|ref|YP_001701924.1| hypothetical protein MAB_1182 [Mycobacterium abscessus ATCC 19977]
gi|169240242|emb|CAM61270.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=190
Score = 246 bits (629), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 124/170 (73%), Positives = 139/170 (82%), Gaps = 1/170 (0%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PTRLR+ DLLR TD+ ADD L GR DHLLP GG+P +RW TRIH D+ELD+WLISWVP
Sbjct 19 PTRLRLPDLLRITDEGADDALHGRFDHLLPAGGLPVDERWATRIHADDELDVWLISWVPD 78
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GSLGALTVLSGSL+EYRWDG +L RRRLDAGDQAGFPLGWVHDV+ AP +
Sbjct 79 KSTELHDHCGSLGALTVLSGSLHEYRWDGSQLVRRRLDAGDQAGFPLGWVHDVMRAPLKL 138
Query 137 GGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
G + PTLSVHAYSPPLTAMSYYE+T+ NTLRR RT LTD+PEG
Sbjct 139 SGAPVPAES-GPTLSVHAYSPPLTAMSYYEVTQANTLRRSRTILTDEPEG 187
>gi|145222621|ref|YP_001133299.1| cysteine dioxygenase type I [Mycobacterium gilvum PYR-GCK]
gi|315443086|ref|YP_004075965.1| Cysteine dioxygenase type I [Mycobacterium sp. Spyr1]
gi|145215107|gb|ABP44511.1| cysteine dioxygenase type I [Mycobacterium gilvum PYR-GCK]
gi|315261389|gb|ADT98130.1| Cysteine dioxygenase type I [Mycobacterium sp. Spyr1]
Length=172
Score = 245 bits (626), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/171 (73%), Positives = 141/171 (83%), Gaps = 3/171 (1%)
Query 16 GPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVP 75
PTRLR ADLL TD+ ADD+LGG DH+LP GG P T+RW+TR+HG+EELD+WLISWVP
Sbjct 4 APTRLRPADLLHVTDRFADDILGGDYDHVLPAGGPPTTERWFTRLHGNEELDVWLISWVP 63
Query 76 GQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRP 135
TELHDHGGSLGALTV+SG+L E RWDG LR RRL AGDQA FPLGWVHDVVWA
Sbjct 64 DCSTELHDHGGSLGALTVVSGALRETRWDGSALRDRRLVAGDQAAFPLGWVHDVVWAR-- 121
Query 136 IGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
G G+A APTLSVHAYSPPLTAMSYY++T+RNTLRR+RT+LTD+PEG
Sbjct 122 -DGVTVGGIAPAPTLSVHAYSPPLTAMSYYDVTDRNTLRRKRTQLTDKPEG 171
>gi|120405636|ref|YP_955465.1| cysteine dioxygenase type I [Mycobacterium vanbaalenii PYR-1]
gi|119958454|gb|ABM15459.1| cysteine dioxygenase type I [Mycobacterium vanbaalenii PYR-1]
Length=171
Score = 238 bits (607), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 120/170 (71%), Positives = 134/170 (79%), Gaps = 4/170 (2%)
Query 16 GPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVP 75
PTRLR ADLL TD+ ADDVLGG DH+LP G+P +RW+TR+HG +ELD+WLISWV
Sbjct 4 APTRLRPADLLHVTDRFADDVLGGDYDHVLPAAGLPTAERWFTRLHGTDELDVWLISWVS 63
Query 76 GQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRP 135
+ TELHDHGGSLGALTV+SG+L E RWDG LR RRL AGDQA FPLGWVHDVVWA
Sbjct 64 NRSTELHDHGGSLGALTVVSGTLRETRWDGEALRERRLVAGDQAAFPLGWVHDVVWARES 123
Query 136 IGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
I G G PTLSVHAYSPPLTAMSYYE+T +NTLRR RTELTD+PE
Sbjct 124 IRGGGTPG----PTLSVHAYSPPLTAMSYYEVTTQNTLRRNRTELTDKPE 169
>gi|118468952|ref|YP_889526.1| cysteine dioxygenase type I [Mycobacterium smegmatis str. MC2
155]
gi|118170239|gb|ABK71135.1| cysteine dioxygenase type I superfamily protein [Mycobacterium
smegmatis str. MC2 155]
Length=182
Score = 230 bits (586), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 120/182 (66%), Positives = 137/182 (76%), Gaps = 11/182 (6%)
Query 5 LVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDE 64
L T T PTRLR+ DLL TD+AAD VL GR D LL D +P+ +RWYTR+ G++
Sbjct 12 LGTSATVPAVSAPTRLRLPDLLNTTDRAADAVLSGRYDRLLRD--LPEDERWYTRLDGND 69
Query 65 ELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLG 124
ELD+WLISWVP + TELHDHGGSLGALTV+SG+L E RWDG LR RRL AG QA FPLG
Sbjct 70 ELDVWLISWVPDRSTELHDHGGSLGALTVVSGALTETRWDGEALRHRRLSAGSQAAFPLG 129
Query 125 WVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQP 184
WVHDVV AP P+ + PTLSVHAYSPPLTAMSYYE+T++NTLRR RTELTD P
Sbjct 130 WVHDVVRAPGPV---------IGPTLSVHAYSPPLTAMSYYEVTQQNTLRRSRTELTDAP 180
Query 185 EG 186
EG
Sbjct 181 EG 182
>gi|333989653|ref|YP_004522267.1| hypothetical protein JDM601_1013 [Mycobacterium sp. JDM601]
gi|333485621|gb|AEF35013.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=176
Score = 229 bits (585), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/174 (68%), Positives = 129/174 (75%), Gaps = 10/174 (5%)
Query 14 SPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISW 73
+P P LR+ DLL+ TD AAD VL GR +HLLP G+P RW+TRIHGDE LDIWLISW
Sbjct 13 APRPRSLRLPDLLQTTDLAADAVLDGRYEHLLPTSGLPTDSRWFTRIHGDERLDIWLISW 72
Query 74 VPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAP 133
PG TELHDHG SLGALTVLSGSL+E+ WDG +L RRRLDAGDQA F GWVHDVVWAP
Sbjct 73 APGHATELHDHGDSLGALTVLSGSLDEFHWDGTQLARRRLDAGDQASFSRGWVHDVVWAP 132
Query 134 RPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGS 187
G PTLSVHAYSPPL MSYY++ NTLRRQRTELT+ PE S
Sbjct 133 SVAG----------PTLSVHAYSPPLVEMSYYDVAPDNTLRRQRTELTEHPEAS 176
>gi|312138063|ref|YP_004005399.1| cysteine dioxygenase [Rhodococcus equi 103S]
gi|325675036|ref|ZP_08154723.1| cysteine dioxygenase type I family protein [Rhodococcus equi
ATCC 33707]
gi|311887402|emb|CBH46714.1| cysteine dioxygenase [Rhodococcus equi 103S]
gi|325554622|gb|EGD24297.1| cysteine dioxygenase type I family protein [Rhodococcus equi
ATCC 33707]
Length=194
Score = 221 bits (563), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 116/170 (69%), Positives = 127/170 (75%), Gaps = 7/170 (4%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PTRLR ADLLR TDQ A DV+ GR DHLLP P +RW TR+ D+++D+WLISWVP
Sbjct 29 PTRLRPADLLRITDQGAADVIEGRHDHLLP-AAFPTHERWSTRLSSDDDVDVWLISWVPD 87
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GS GALTVLSGSL EYRW LRRR LDAGDQA FPLGWVHDV+ AP P
Sbjct 88 KSTELHDHAGSFGALTVLSGSLAEYRWTDGDLRRRTLDAGDQAAFPLGWVHDVMRAPGP- 146
Query 137 GGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
A + PTLSVHAYSPPLTAMSYYE+TE LRR RTELTD PEG
Sbjct 147 -----ATDSTEPTLSVHAYSPPLTAMSYYEVTEHGALRRTRTELTDLPEG 191
>gi|226304125|ref|YP_002764083.1| cysteine dioxygenase [Rhodococcus erythropolis PR4]
gi|226183240|dbj|BAH31344.1| putative cysteine dioxygenase [Rhodococcus erythropolis PR4]
Length=172
Score = 218 bits (556), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 113/170 (67%), Positives = 129/170 (76%), Gaps = 2/170 (1%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PTRLR ADLLR TD+ A+ VL GR DHL+PD P RW TR+H D+++D+WLISWVP
Sbjct 2 PTRLRPADLLRLTDEGANGVLDGRFDHLIPDA-FPTLDRWSTRLHADDDVDVWLISWVPE 60
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GS GALTVLSGSL E+RW G L R+LDAGDQA FPLGWVHDVV +
Sbjct 61 RNTELHDHAGSFGALTVLSGSLTEFRWAGDALVERQLDAGDQASFPLGWVHDVVRSTDAP 120
Query 137 GGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
G P + APTLSVHAYSPPLTAMS+YE+T+ TLRR RTELTD PEG
Sbjct 121 GAPIEIS-SDAPTLSVHAYSPPLTAMSFYEVTDHRTLRRTRTELTDLPEG 169
>gi|54022373|ref|YP_116615.1| hypothetical protein nfa4090 [Nocardia farcinica IFM 10152]
gi|54013881|dbj|BAD55251.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=185
Score = 218 bits (554), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/177 (65%), Positives = 131/177 (75%), Gaps = 6/177 (3%)
Query 11 AVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGV-PQTQRWYTRIHGDEELDIW 69
AV PTRLR ADLLR TD+ A+DVL GR DHLLP GG P +RW TR+ D+E+D+W
Sbjct 12 AVAPALPTRLRPADLLRLTDEGAEDVLDGRYDHLLPAGGAWPTEERWATRLRADDEVDVW 71
Query 70 LISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDV 129
LISW P + TELHDH GSLGALTVLSG+L+E RW G LR R L AGDQA FP+GWVH+V
Sbjct 72 LISWTPAKTTELHDHAGSLGALTVLSGALSELRWTGTELRARTLSAGDQAAFPIGWVHEV 131
Query 130 VWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
+ AP I + PTLSVHAYSPPLTAMSYYEIT + TLRR RT LTD+PEG
Sbjct 132 MRAPAAI-----EPVTAEPTLSVHAYSPPLTAMSYYEITGQGTLRRTRTVLTDEPEG 183
>gi|226363744|ref|YP_002781526.1| cysteine dioxygenase [Rhodococcus opacus B4]
gi|226242233|dbj|BAH52581.1| putative cysteine dioxygenase [Rhodococcus opacus B4]
Length=177
Score = 216 bits (551), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 116/171 (68%), Positives = 131/171 (77%), Gaps = 8/171 (4%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PTRLR ADLLR TDQ A +VL GR D LLP P +RW TR++ D+++D+WLISWVP
Sbjct 11 PTRLRPADLLRITDQGASEVLDGRHDVLLPQSW-PIDERWSTRLYSDDDVDVWLISWVPD 69
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GS GALTVLSG+L+E+RW G RLR R LDAGDQA FPLGWVHDVV A
Sbjct 70 RNTELHDHAGSFGALTVLSGALSEFRWAGDRLRHRTLDAGDQASFPLGWVHDVVRA---- 125
Query 137 GGPDAAGM-AVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
PDA G V PTLSVHAYSPPL+AMSYYE+T+ TLRR RTELTD PEG
Sbjct 126 --PDAPGAEVVTPTLSVHAYSPPLSAMSYYEVTDHGTLRRTRTELTDLPEG 174
>gi|111021390|ref|YP_704362.1| cysteine dioxygenase [Rhodococcus jostii RHA1]
gi|110820920|gb|ABG96204.1| possible cysteine dioxygenase [Rhodococcus jostii RHA1]
Length=177
Score = 216 bits (549), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/170 (67%), Positives = 129/170 (76%), Gaps = 6/170 (3%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PTRLR ADLLR TDQ A +VL GR D LLP P +RW TR++ D+++D+WLISWVP
Sbjct 11 PTRLRPADLLRITDQGASEVLDGRHDVLLPQSW-PTDERWSTRLYSDDDVDVWLISWVPD 69
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GS GALTVLSG+L+E+RW G RLR R L+AGDQA FPLGWVHDVV AP
Sbjct 70 RNTELHDHAGSFGALTVLSGALSEFRWAGDRLRHRTLEAGDQASFPLGWVHDVVRAPDAP 129
Query 137 GGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
G V PTLSVHAYSPPL+AMSYYE+T+ TLRR RTELTD PEG
Sbjct 130 GAE-----VVTPTLSVHAYSPPLSAMSYYEVTDHGTLRRTRTELTDLPEG 174
>gi|317507886|ref|ZP_07965584.1| cysteine dioxygenase type I [Segniliparus rugosus ATCC BAA-974]
gi|316253815|gb|EFV13187.1| cysteine dioxygenase type I [Segniliparus rugosus ATCC BAA-974]
Length=183
Score = 203 bits (516), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 108/173 (63%), Positives = 122/173 (71%), Gaps = 2/173 (1%)
Query 15 PGPTRLRVADLLRATDQAADDVLGGRCDHLLP-DGGVPQTQRWYTRIHGDEELDIWLISW 73
P PTRL ADLLR T AD V G HLLP GG P RW ++ D++LD+W ISW
Sbjct 6 PLPTRLNTADLLRITADVADQVRDGAWSHLLPPAGGWPTDDRWCRQLFADDDLDVWAISW 65
Query 74 VPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAP 133
VP + TELHDHGGSLGALTV+ G+L E+RW G RLR RRL +G QA FPLGWVHDV WA
Sbjct 66 VPDRTTELHDHGGSLGALTVVDGALAEWRWTGDRLRERRLASGAQAAFPLGWVHDVTWAA 125
Query 134 RPIGGPDAAGM-AVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
A G A+AP LSVHAYSPPLT MSYYE+TER++LRR R ELTD PE
Sbjct 126 SGTSALTADGTGAIAPALSVHAYSPPLTVMSYYEVTERHSLRRVRAELTDIPE 178
>gi|343926937|ref|ZP_08766430.1| putative cysteine dioxygenase [Gordonia alkanivorans NBRC 16433]
gi|343763297|dbj|GAA13356.1| putative cysteine dioxygenase [Gordonia alkanivorans NBRC 16433]
Length=197
Score = 199 bits (505), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/169 (64%), Positives = 117/169 (70%), Gaps = 9/169 (5%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PT LR ADLLR TDQ DVL GR D LLP T RW TR++ D++LD+WLISW PG
Sbjct 38 PTHLRPADLLRITDQGVADVLDGRHDALLPTEW-DTTHRWSTRLYADDDLDVWLISWTPG 96
Query 77 QPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPI 136
+ TELHDH GSLGALTVLSGSL EY W G L R LDAGDQA FPLGWVHDVV P
Sbjct 97 EATELHDHAGSLGALTVLSGSLREYHWTGDDLAVRVLDAGDQAAFPLGWVHDVVKNP--- 153
Query 137 GGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
PTLSVHAYSPPLTAMSYYE+ + LRR RT LTD+PE
Sbjct 154 -----TTQVAGPTLSVHAYSPPLTAMSYYEVADAGHLRRTRTILTDEPE 197
>gi|296392945|ref|YP_003657829.1| cysteine dioxygenase type I [Segniliparus rotundus DSM 44985]
gi|296180092|gb|ADG96998.1| cysteine dioxygenase type I [Segniliparus rotundus DSM 44985]
Length=190
Score = 197 bits (502), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 106/175 (61%), Positives = 121/175 (70%), Gaps = 4/175 (2%)
Query 15 PGPTRLRVADLLRATDQAADDVLGGRCDHLLP-DGGVPQTQRWYTRIHGDEELDIWLISW 73
P PTRL ADLLR T A+ + G HLLP GG P RW ++ D+ELD+W ISW
Sbjct 12 PLPTRLSTADLLRTTADVAEQIKDGAWAHLLPPAGGWPTDDRWCRQLFADDELDVWAISW 71
Query 74 VPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAP 133
VP + TELHDHGGSLGALTV+ G+L E+RW G RLR RRL AG QA F LGWVHDV WA
Sbjct 72 VPDRTTELHDHGGSLGALTVVDGALAEWRWTGSRLRERRLGAGAQAAFALGWVHDVTWAQ 131
Query 134 ---RPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
P+ AA + AP LSVHAYSPPLT MSYYE+TE+ +LRR R ELTD PE
Sbjct 132 PGVSPLVKGAAAPGSTAPALSVHAYSPPLTVMSYYEVTEQQSLRRVRAELTDVPE 186
>gi|262203211|ref|YP_003274419.1| cysteine dioxygenase type I [Gordonia bronchialis DSM 43247]
gi|262086558|gb|ACY22526.1| cysteine dioxygenase type I [Gordonia bronchialis DSM 43247]
Length=195
Score = 195 bits (496), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/177 (64%), Positives = 120/177 (68%), Gaps = 1/177 (0%)
Query 9 TTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDI 68
TT S PTRLR ADLLR TDQ DVL G D LLP P RW TRIH D ++D+
Sbjct 20 TTRRRSHLPTRLRPADLLRITDQCVADVLDGMHDALLPTEWDP-VHRWATRIHTDNDVDV 78
Query 69 WLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHD 128
WLISW PG+ TELHDH GSLGALTVLSGSL EY W G L R L GDQA FPLGWVHD
Sbjct 79 WLISWTPGESTELHDHAGSLGALTVLSGSLREYHWTGDDLAVRILGEGDQAAFPLGWVHD 138
Query 129 VVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPE 185
V+ P P A TLSVHAYSPPLTAMSYY++TE LRR RTELTDQPE
Sbjct 139 VMRNPPPADAAPADAEMSPVTLSVHAYSPPLTAMSYYDVTEDGALRRTRTELTDQPE 195
>gi|229494302|ref|ZP_04388065.1| cysteine dioxygenase type I [Rhodococcus erythropolis SK121]
gi|229318664|gb|EEN84522.1| cysteine dioxygenase type I [Rhodococcus erythropolis SK121]
Length=152
Score = 194 bits (494), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 98/151 (65%), Positives = 114/151 (76%), Gaps = 2/151 (1%)
Query 36 VLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLS 95
+L GR DHL+PD P RW TR+H D+++D+WLISWVP + TELHDH GS GALTVLS
Sbjct 1 MLDGRFDHLIPDT-FPTLDRWSTRLHADDDVDVWLISWVPERNTELHDHAGSFGALTVLS 59
Query 96 GSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAY 155
GSL E+RW G L R+LDAGDQA FPLGWVHDVV + G P + APTLS+HAY
Sbjct 60 GSLTEFRWAGDALVERQLDAGDQASFPLGWVHDVVRSTDAPGAPIEIS-SDAPTLSIHAY 118
Query 156 SPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
SPPLTAMS+YE+T+ TLRR RTELTD PEG
Sbjct 119 SPPLTAMSFYEVTDHRTLRRTRTELTDLPEG 149
>gi|296141813|ref|YP_003649056.1| cysteine dioxygenase type I [Tsukamurella paurometabola DSM 20162]
gi|296029947|gb|ADG80717.1| cysteine dioxygenase type I [Tsukamurella paurometabola DSM 20162]
Length=294
Score = 170 bits (431), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 89/151 (59%), Positives = 105/151 (70%), Gaps = 3/151 (1%)
Query 36 VLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLS 95
V G DHLLP P +RW R+ GD+++D+WLISWVP + TELHDH GSLGALTV+
Sbjct 19 VHAGDYDHLLP-SAWPAGERWAARLWGDDDVDVWLISWVPERSTELHDHAGSLGALTVVD 77
Query 96 GSLNEYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAY 155
G+L E WDG LR RR+D G QA F GWVHDV P A+G A +PTLSVHAY
Sbjct 78 GALAERSWDGEGLRERRIDPGGQAAFDRGWVHDVT--RHPDAHDAASGEASSPTLSVHAY 135
Query 156 SPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
SPPLTAMSYYE++ LRR R+ELTD+PE
Sbjct 136 SPPLTAMSYYEVSPNGRLRRVRSELTDEPEA 166
>gi|331694427|ref|YP_004330666.1| cysteine dioxygenase type I [Pseudonocardia dioxanivorans CB1190]
gi|326949116|gb|AEA22813.1| cysteine dioxygenase type I [Pseudonocardia dioxanivorans CB1190]
Length=175
Score = 131 bits (330), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 88/185 (48%), Positives = 107/185 (58%), Gaps = 23/185 (12%)
Query 3 MPLVTPTTAVPS---PGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTR 59
M L P A+P+ PGP +ADL T + A +V G L + + +RWY R
Sbjct 1 MLLHAPGPALPARALPGPAGYELADLTALTRRVASEVRAG-----LHEVQIDPLRRWYRR 55
Query 60 IHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRR--LRRRRLDAGD 117
+HGD+ +D+WLISW Q ELHDH GSLGALTV+SG L E W LR R L AG
Sbjct 56 LHGDDFVDVWLISWATEQAAELHDHAGSLGALTVVSGRLTEEFWAASTGGLRSRTLHAGR 115
Query 118 QAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQR 177
GF LG VH+V P DAA +SVHAYSPPLTAMSYY++T LRR R
Sbjct 116 SVGFGLGHVHEV---SNP--SADAA-------VSVHAYSPPLTAMSYYDVTG-GRLRRTR 162
Query 178 TELTD 182
+ELT+
Sbjct 163 SELTE 167
>gi|325002467|ref|ZP_08123579.1| cysteine dioxygenase type I [Pseudonocardia sp. P1]
Length=190
Score = 125 bits (314), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/186 (46%), Positives = 100/186 (54%), Gaps = 26/186 (13%)
Query 11 AVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLL-PDGGVPQTQRWYTRIHGDEELDIW 69
A P PT + DL T + A DV GR ++ PD +RWY + D +D+W
Sbjct 23 ARPVAAPTPYDLQDLQELTREIAADVRAGRHGVVVDPD------RRWYRLLRSDGLVDVW 76
Query 70 LISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDG-RRLRRRRLDAGDQAGFPLGWVHD 128
LISW Q ELHDH GS+GALTV+SG+L E RW G LR R L G A FPLG VHD
Sbjct 77 LISWATEQIAELHDHAGSIGALTVVSGTLTERRWGGPAGLRTRTLRHGRGAAFPLGHVHD 136
Query 129 VVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITE------RNTLRRQRTELTD 182
V A A +SVHAYSPPL+AMSYYE+ + LRR RTEL
Sbjct 137 V------------ANTADEAAVSVHAYSPPLSAMSYYEVEDVPATAGHQRLRRSRTELVQ 184
Query 183 QPEGSG 188
+G G
Sbjct 185 PGQGVG 190
>gi|297560351|ref|YP_003679325.1| cysteine dioxygenase type I [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296844799|gb|ADH66819.1| cysteine dioxygenase type I [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=178
Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/132 (53%), Positives = 79/132 (60%), Gaps = 13/132 (9%)
Query 54 QRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRL 113
RW R+ D+ D+WLISW P Q T LHDH GSLGALTV++G L E WD LR R L
Sbjct 44 NRWSVRLRADDHTDVWLISWTPDQSTRLHDHAGSLGALTVVAGDLVERYWDA-GLRERAL 102
Query 114 DAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTL 173
G FPLG VHDVV A + +P +SVHAYSPPLTAM YYE+ L
Sbjct 103 PDGGGGRFPLGHVHDVVNA------------SDSPAVSVHAYSPPLTAMHYYEVGGDGAL 150
Query 174 RRQRTELTDQPE 185
RR R+ LT PE
Sbjct 151 RRTRSVLTTDPE 162
>gi|269124738|ref|YP_003298108.1| cysteine dioxygenase type I [Thermomonospora curvata DSM 43183]
gi|268309696|gb|ACY96070.1| cysteine dioxygenase type I [Thermomonospora curvata DSM 43183]
Length=152
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/130 (44%), Positives = 71/130 (55%), Gaps = 16/130 (12%)
Query 54 QRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRL 113
+RWY R+H DE ++WL+SW+PGQ T LHDHGGS GA V GSL+EY RR L
Sbjct 38 ERWYERLHHDEHHEVWLLSWMPGQSTGLHDHGGSRGAFAVALGSLDEYDLHTRRT----L 93
Query 114 DAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTL 173
G F +H+V A AP +SVH YSPPLT+M+ Y++T L
Sbjct 94 TVGQFREFGADHIHEV------------ANTTQAPAVSVHVYSPPLTSMNRYDLTPAGRL 141
Query 174 RRQRTELTDQ 183
R E DQ
Sbjct 142 VRLAVERADQ 151
>gi|311743303|ref|ZP_07717110.1| cysteine dioxygenase type I family protein [Aeromicrobium marinum
DSM 15272]
gi|311313371|gb|EFQ83281.1| cysteine dioxygenase type I family protein [Aeromicrobium marinum
DSM 15272]
Length=179
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/179 (38%), Positives = 85/179 (48%), Gaps = 25/179 (13%)
Query 15 PGPTR------LRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDI 68
P P R L +A+L+ T A DV G L RW+ R+H D ++DI
Sbjct 5 PAPARKSAATPLSLAELVGLTTAVAADVRAG-----LYAVEADVDHRWHVRLHRDAQVDI 59
Query 69 WLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGR-RLRRRRLDAGDQAGFPLGWVH 127
WLISW Q T+LHDHGGS GA TV+ G+L E W G L G+ F +VH
Sbjct 60 WLISWTTEQGTQLHDHGGSAGAFTVVEGALTESVWTGVGELHDNERSTGETVRFGEHYVH 119
Query 128 DVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEG 186
DV A A +SVHAYS PL M++Y++ L R + TD PE
Sbjct 120 DV------------RNTAAATAVSVHAYSTPLERMNFYDVA-GGRLERLASVWTDDPEA 165
>gi|311899554|dbj|BAJ31962.1| putative cysteine dioxygenase [Kitasatospora setae KM-6054]
Length=195
Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/116 (44%), Positives = 65/116 (57%), Gaps = 14/116 (12%)
Query 54 QRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDG--RRLRRR 111
RWY R+ E+ ++W+ISW+PGQ T HDHGGS GA TV G L E G L R
Sbjct 76 NRWYERLELAEDYEVWVISWLPGQSTGFHDHGGSRGAFTVALGELEELALAGPEHGLTVR 135
Query 112 RLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEI 167
RL AG + F ++HDV A P +++HAYSPPL+ MS+YE+
Sbjct 136 RLSAGSERAFGPQYLHDV------------RNTAQGPAVTLHAYSPPLSEMSHYEL 179
>gi|291299203|ref|YP_003510481.1| cysteine dioxygenase type I [Stackebrandtia nassauensis DSM 44728]
gi|290568423|gb|ADD41388.1| cysteine dioxygenase type I [Stackebrandtia nassauensis DSM 44728]
Length=153
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/132 (41%), Positives = 70/132 (54%), Gaps = 16/132 (12%)
Query 51 PQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRR--L 108
PQ QRWY R+H + ++WL++W+PGQ TELHDHGGS GA TV+SG L E+ L
Sbjct 31 PQ-QRWYHRMHVGDGYEVWLLTWLPGQETELHDHGGSAGAFTVVSGELTEFTPSATSAGL 89
Query 109 RRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEIT 168
L +G F ++H V P +SVHAY P LT M YE+T
Sbjct 90 STWTLRSGQGHRFGARFIHKVT------------NRGTEPAISVHAYGPALTIMRRYELT 137
Query 169 ERNTLRRQRTEL 180
E + LR E+
Sbjct 138 E-SGLRMANVEM 148
>gi|302867356|ref|YP_003835993.1| cysteine dioxygenase type I [Micromonospora aurantiaca ATCC 27029]
gi|315506239|ref|YP_004085126.1| cysteine dioxygenase type i [Micromonospora sp. L5]
gi|302570215|gb|ADL46417.1| cysteine dioxygenase type I [Micromonospora aurantiaca ATCC 27029]
gi|315412858|gb|ADU10975.1| cysteine dioxygenase type I [Micromonospora sp. L5]
Length=150
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/115 (42%), Positives = 61/115 (54%), Gaps = 12/115 (10%)
Query 54 QRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRL 113
QRWY R+ D+ ++W +SW+PGQ T+LHDHGGS GA V++G L E G RLR RL
Sbjct 32 QRWYARLDADDAHEVWALSWLPGQATDLHDHGGSAGAFLVVAGVLTEETVSGGRLRPHRL 91
Query 114 DAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEIT 168
AG F + VH V P +SVH Y P LT M+ Y +
Sbjct 92 AAGAGRRFGVRHVHQVT------------NRGDEPAVSVHVYRPALTRMTRYHLV 134
>gi|302524883|ref|ZP_07277225.1| cysteine dioxygenase [Streptomyces sp. AA4]
gi|302433778|gb|EFL05594.1| cysteine dioxygenase [Streptomyces sp. AA4]
Length=199
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/115 (42%), Positives = 65/115 (57%), Gaps = 13/115 (11%)
Query 52 QTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSL-NEYRWDGRRLRR 110
+ QRW+ R+ + +++WL+SW+PGQ T+ HDHGG+ G+ +VL G L EYR+ G +RR
Sbjct 78 EDQRWWARLALTDGVELWLLSWLPGQHTKPHDHGGASGSFSVLQGELGEEYRYPGGPVRR 137
Query 111 RRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYY 165
R AG GF G H V+ G+ P SVHAYSPPL Y
Sbjct 138 RTHTAGQGIGFGAGRAHQVL------------GVGSEPAASVHAYSPPLVPTREY 180
>gi|300783535|ref|YP_003763826.1| cysteine dioxygenase [Amycolatopsis mediterranei U32]
gi|299793049|gb|ADJ43424.1| cysteine dioxygenase [Amycolatopsis mediterranei U32]
gi|340524921|gb|AEK40126.1| cysteine dioxygenase [Amycolatopsis mediterranei S699]
Length=181
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/115 (43%), Positives = 66/115 (58%), Gaps = 13/115 (11%)
Query 52 QTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSL-NEYRWDGRRLRR 110
+ +RW+ R+ + +++WL+SW+PGQ T+ HDHGG+ G+ TVL G L EYR+ G +RR
Sbjct 60 EDRRWWARLALTDGVELWLLSWLPGQYTKPHDHGGASGSFTVLQGELGEEYRYPGGPIRR 119
Query 111 RRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYY 165
R AG GF G H V G+ P+ SVHAYSPPL A Y
Sbjct 120 RTHVAGQGLGFGAGRAHQVT------------GLGDRPSASVHAYSPPLVATREY 162
>gi|299137588|ref|ZP_07030769.1| cysteine dioxygenase type I [Acidobacterium sp. MP5ACTX8]
gi|298600229|gb|EFI56386.1| cysteine dioxygenase type I [Acidobacterium sp. MP5ACTX8]
Length=325
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/133 (40%), Positives = 71/133 (54%), Gaps = 19/133 (14%)
Query 55 RWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLD 114
RWY R++ + DIW+ISW+PGQ T HDHG S GA V +G L E+R G + R +
Sbjct 58 RWYERLYHGPDHDIWVISWMPGQSTGFHDHGESAGAFVVATGILEEHR-PGEQT--RVIP 114
Query 115 AGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNTLR 174
G F + HDV A ++AP +S+HAYSPPLT M+ YE+ +
Sbjct 115 PGHPRAFGSEYAHDVRNA------------SLAPAISIHAYSPPLTDMNEYELEGNQLVP 162
Query 175 R----QRTELTDQ 183
R +R E +Q
Sbjct 163 RESVSERAETLNQ 175
>gi|256392269|ref|YP_003113833.1| cysteine dioxygenase type I [Catenulispora acidiphila DSM 44928]
gi|256358495|gb|ACU71992.1| cysteine dioxygenase type I [Catenulispora acidiphila DSM 44928]
Length=165
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/154 (40%), Positives = 78/154 (51%), Gaps = 22/154 (14%)
Query 17 PTRLRVADLLRATDQAADDVLGGRCDHLLPDGGVPQTQRWYTRIHGDEELDIWLISWVPG 76
PT+LR DL RA D L H P QRW+TR+ +++WL+SW+PG
Sbjct 30 PTQLR--DLTRALADVHGDRLRPLVRHTEP-------QRWWTRLALTRGVEVWLLSWLPG 80
Query 77 QPTELHDHGGSLGALTVLSGSLN-EYRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRP 135
Q T+ HDHGG+ G+ VLSG + E+R+ G + RRL GD GF H V R
Sbjct 81 QGTKPHDHGGAAGSFAVLSGEVQEEHRYPGGPIGVRRLQVGDALGFGGDRAHIV----RQ 136
Query 136 IGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITE 169
G + P +VHAYSPPL YE E
Sbjct 137 TG--------IRPAATVHAYSPPLLPTREYESLE 162
>gi|330468584|ref|YP_004406327.1| cysteine dioxygenase type i [Verrucosispora maris AB-18-032]
gi|328811555|gb|AEB45727.1| cysteine dioxygenase type i [Verrucosispora maris AB-18-032]
Length=154
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/115 (41%), Positives = 58/115 (51%), Gaps = 12/115 (10%)
Query 53 TQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRR 112
RWY R+ D++ ++W +SW+PGQ T+LHDHGGS GA V SG L E G RLR
Sbjct 35 ASRWYARLAADDDHEVWALSWLPGQGTDLHDHGGSSGAFLVCSGVLTEETVSGGRLRPHL 94
Query 113 LDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEI 167
LDAG F VH V P +SVH Y P L M+ Y +
Sbjct 95 LDAGSGRRFGPRHVHVVT------------NRHAEPAVSVHVYRPALRRMTRYHL 137
>gi|320105306|ref|YP_004180896.1| cysteine dioxygenase type I [Terriglobus saanensis SP1PR4]
gi|319923827|gb|ADV80902.1| cysteine dioxygenase type I [Terriglobus saanensis SP1PR4]
Length=322
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/113 (40%), Positives = 60/113 (54%), Gaps = 15/113 (13%)
Query 55 RWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLD 114
RWY R++ + DIW ISW+PGQ T HDHG S GA V +G L E+R + L +
Sbjct 58 RWYERLYHGPDYDIWAISWMPGQSTGFHDHGESSGAFVVATGILQEHRHGEQPL---AIP 114
Query 115 AGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEI 167
G F + HDV + +AP +S+HAYSPPL M+ YE+
Sbjct 115 PGQPRTFGPDYTHDV------------RNVYLAPAISIHAYSPPLNEMNEYEL 155
>gi|326332280|ref|ZP_08198560.1| cysteine dioxygenase type I family protein [Nocardioidaceae bacterium
Broad-1]
gi|325949986|gb|EGD42046.1| cysteine dioxygenase type I family protein [Nocardioidaceae bacterium
Broad-1]
Length=146
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/117 (41%), Positives = 63/117 (54%), Gaps = 13/117 (11%)
Query 51 PQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRR 110
P+T R + +H D+ +++WLI+W PG T HDHG + A TVL+GSL E+ W G L+
Sbjct 28 PETGREFHLLHRDDAVEVWLIAWAPGASTGFHDHGTATTAFTVLTGSLVEHNWLG-GLQL 86
Query 111 RRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEI 167
+ GD G VHDV R +G P LS+HAY+P L AM Y
Sbjct 87 ADVGPGDARAHAAGHVHDV----RNVGS--------RPALSLHAYAPRLDAMHNYHF 131
>gi|297200127|ref|ZP_06917524.1| cysteine dioxygenase [Streptomyces sviceus ATCC 29083]
gi|197713423|gb|EDY57457.1| cysteine dioxygenase [Streptomyces sviceus ATCC 29083]
Length=169
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/115 (44%), Positives = 61/115 (54%), Gaps = 20/115 (17%)
Query 53 TQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRR-- 110
T RWY R+ ++WL+SWVPGQ + LHDHG S G LTVL G+L E R R
Sbjct 56 TSRWYHRLRTGPGYEVWLLSWVPGQGSGLHDHGRSSGVLTVLEGTLTE------RTERST 109
Query 111 RRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYY 165
R L AG Q F G+VH+VV A+ P +S+H Y P LT M Y
Sbjct 110 RALGAGSQRVFAPGYVHEVV------------NDALEPAVSLHVYYPGLTEMPMY 152
>gi|182438295|ref|YP_001826014.1| putative cysteine dioxygenase [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|326778946|ref|ZP_08238211.1| cysteine dioxygenase type I [Streptomyces cf. griseus XylebKG-1]
gi|178466811|dbj|BAG21331.1| putative cysteine dioxygenase [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|326659279|gb|EGE44125.1| cysteine dioxygenase type I [Streptomyces griseus XylebKG-1]
Length=166
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/108 (44%), Positives = 58/108 (54%), Gaps = 16/108 (14%)
Query 55 RWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLD 114
RWY R+H ++WL+SWVPGQ + LHDHG S G LTVL G L E G R L
Sbjct 58 RWYHRLHQGPGYEVWLLSWVPGQGSGLHDHGLSAGVLTVLEGRLTERTESG----ARSLG 113
Query 115 AGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAM 162
AG Q F G+VH+VV ++ P +S+H Y P LT M
Sbjct 114 AGAQRAFGPGYVHEVV------------NDSLEPAVSLHVYYPGLTEM 149
>gi|297157872|gb|ADI07584.1| putative cysteine dioxygenase [Streptomyces bingchenggensis BCW-1]
Length=177
Score = 77.4 bits (189), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/133 (39%), Positives = 65/133 (49%), Gaps = 16/133 (12%)
Query 53 TQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRR 112
T RWY R+ ++WL+SWVPGQ + HDHG S G LTVL G L E G R
Sbjct 54 TSRWYHRLRTGPGYEVWLLSWVPGQGSGAHDHGASSGVLTVLEGELTERVGHG---ERHS 110
Query 113 LDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAMSYYEITERNT 172
L AG Q F G+VHDVV A+ P +S+H Y P LT M + ++
Sbjct 111 LRAGAQRVFAPGYVHDVV------------NDALEPAVSLHIYFPGLTDMPMHP-SQDAV 157
Query 173 LRRQRTELTDQPE 185
R + D P+
Sbjct 158 RERAEGRVPDAPD 170
>gi|29831584|ref|NP_826218.1| cysteine dioxygenase [Streptomyces avermitilis MA-4680]
gi|29608700|dbj|BAC72753.1| putative cysteine dioxygenase [Streptomyces avermitilis MA-4680]
Length=175
Score = 77.4 bits (189), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/110 (44%), Positives = 59/110 (54%), Gaps = 16/110 (14%)
Query 53 TQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRR 112
T RWY R+ ++WL+SWVPGQ + LHDHG S G LTVL G+L E G R
Sbjct 56 TSRWYHRLRTGPGYEVWLLSWVPGQGSGLHDHGRSSGVLTVLEGALTERTERG----TRA 111
Query 113 LDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAM 162
L AG Q F G+VH+VV A+ P +S+H Y P LT M
Sbjct 112 LGAGAQRVFAPGYVHEVV------------NDALEPAVSLHVYYPGLTEM 149
>gi|239988188|ref|ZP_04708852.1| putative cysteine dioxygenase [Streptomyces roseosporus NRRL
11379]
gi|291445170|ref|ZP_06584560.1| cysteine dioxygenase [Streptomyces roseosporus NRRL 15998]
gi|291348117|gb|EFE75021.1| cysteine dioxygenase [Streptomyces roseosporus NRRL 15998]
Length=166
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/108 (43%), Positives = 58/108 (54%), Gaps = 16/108 (14%)
Query 55 RWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRRLD 114
RWY R+H ++WL+SWVPGQ + HDHG S G LTVL G L E+ G R L
Sbjct 58 RWYHRLHQGPGYEVWLLSWVPGQGSGRHDHGLSAGVLTVLEGELTEHTERG----TRSLG 113
Query 115 AGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAM 162
AG Q F G+VH+VV ++ P +S+H Y P LT M
Sbjct 114 AGAQRSFAPGYVHEVV------------NDSLEPAVSLHIYYPGLTEM 149
>gi|345015663|ref|YP_004818017.1| cysteine dioxygenase type I [Streptomyces violaceusniger Tu 4113]
gi|344042012|gb|AEM87737.1| cysteine dioxygenase type I [Streptomyces violaceusniger Tu 4113]
Length=177
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/110 (45%), Positives = 58/110 (53%), Gaps = 14/110 (12%)
Query 53 TQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYRWDGRRLRRRR 112
T RWY R+ ++WL+SWVPGQ + HDHG S G LTVL G L E G R R
Sbjct 54 TSRWYHRLRTGPGYEVWLLSWVPGQGSGAHDHGRSSGVLTVLQGELTERV--GTRGVRHA 111
Query 113 LDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLTAM 162
L AG Q F G+VHDVV A+ P +S+H Y P LT M
Sbjct 112 LRAGAQRVFAPGYVHDVV------------NDALEPAVSLHIYFPGLTEM 149
Lambda K H
0.318 0.138 0.444
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 183812957610
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40