BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3647c
Length=192
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610783|ref|NP_218164.1| hypothetical protein Rv3647c [Mycob... 372 2e-101
gi|340628619|ref|YP_004747071.1| hypothetical protein MCAN_36671... 371 3e-101
gi|118619393|ref|YP_907725.1| hypothetical protein MUL_4223 [Myc... 323 1e-86
gi|240173036|ref|ZP_04751694.1| hypothetical protein MkanA1_2723... 315 3e-84
gi|342862015|ref|ZP_08718659.1| hypothetical protein MCOL_24110 ... 312 1e-83
gi|296166703|ref|ZP_06849128.1| conserved hypothetical protein [... 307 6e-82
gi|254821834|ref|ZP_05226835.1| hypothetical protein MintA_18012... 305 2e-81
gi|41406522|ref|NP_959358.1| hypothetical protein MAP0424 [Mycob... 304 5e-81
gi|118464759|ref|YP_879798.1| hypothetical protein MAV_0517 [Myc... 302 1e-80
gi|254773486|ref|ZP_05215002.1| hypothetical protein MaviaA2_022... 301 3e-80
gi|15827004|ref|NP_301267.1| hypothetical protein ML0199 [Mycoba... 298 2e-79
gi|120406345|ref|YP_956174.1| hypothetical protein Mvan_5397 [My... 293 7e-78
gi|145221985|ref|YP_001132663.1| hypothetical protein Mflv_1393 ... 292 1e-77
gi|108801760|ref|YP_641957.1| hypothetical protein Mmcs_4797 [My... 292 2e-77
gi|169627592|ref|YP_001701241.1| hypothetical protein MAB_0488 [... 281 2e-74
gi|118470880|ref|YP_890378.1| hypothetical protein MSMEG_6158 [M... 276 8e-73
gi|333992572|ref|YP_004525186.1| hypothetical protein JDM601_393... 266 9e-70
gi|336460893|gb|EGO39777.1| hypothetical protein MAPs_36180 [Myc... 246 1e-63
gi|289748161|ref|ZP_06507539.1| conserved hypothetical protein [... 242 2e-62
gi|312138018|ref|YP_004005354.1| hypothetical protein REQ_05450 ... 228 4e-58
gi|226363672|ref|YP_002781454.1| hypothetical protein ROP_42620 ... 223 8e-57
gi|111021328|ref|YP_704300.1| hypothetical protein RHA1_ro04352 ... 223 1e-56
gi|226304001|ref|YP_002763959.1| hypothetical protein RER_05120 ... 213 2e-53
gi|229494796|ref|ZP_04388552.1| conserved hypothetical protein [... 211 3e-53
gi|262200588|ref|YP_003271796.1| hypothetical protein Gbro_0574 ... 207 8e-52
gi|343926495|ref|ZP_08766000.1| hypothetical protein GOALK_060_0... 193 1e-47
gi|326383465|ref|ZP_08205152.1| hypothetical protein SCNU_11031 ... 192 1e-47
gi|289571884|ref|ZP_06452111.1| conserved hypothetical protein [... 187 6e-46
gi|134096989|ref|YP_001102650.1| hypothetical protein SACE_0376 ... 171 5e-41
gi|302530816|ref|ZP_07283158.1| conserved hypothetical protein [... 128 3e-28
gi|300790600|ref|YP_003770891.1| hypothetical protein AMED_8796 ... 120 1e-25
gi|256374445|ref|YP_003098105.1| hypothetical protein Amir_0290 ... 114 9e-24
gi|258650984|ref|YP_003200140.1| hypothetical protein Namu_0737 ... 111 7e-23
gi|319948997|ref|ZP_08023097.1| hypothetical protein ES5_06342 [... 109 2e-22
gi|159039924|ref|YP_001539177.1| hypothetical protein Sare_4409 ... 105 3e-21
gi|330465253|ref|YP_004402996.1| hypothetical protein VAB18032_0... 101 5e-20
gi|145596539|ref|YP_001160836.1| hypothetical protein Strop_4028... 95.9 2e-18
gi|331694280|ref|YP_004330519.1| hypothetical protein Psed_0394 ... 89.0 3e-16
gi|238062264|ref|ZP_04606973.1| hypothetical protein MCAG_03230 ... 87.4 9e-16
gi|315501238|ref|YP_004080125.1| hypothetical protein ML5_0422 [... 72.4 3e-11
gi|325002271|ref|ZP_08123383.1| hypothetical protein PseP1_26078... 71.6 5e-11
gi|302864953|ref|YP_003833590.1| hypothetical protein Micau_0447... 70.1 1e-10
gi|336460799|gb|EGO39684.1| hypothetical protein MAPs_36190 [Myc... 58.2 6e-07
gi|300865169|ref|ZP_07109993.1| hypothetical protein OSCI_149002... 39.3 0.28
gi|156937060|ref|YP_001434856.1| hypothetical protein Igni_0265 ... 38.1 0.69
gi|87121122|ref|ZP_01077013.1| transcriptional regulatory protei... 35.8 3.1
gi|302916299|ref|XP_003051960.1| hypothetical protein NECHADRAFT... 35.8 3.3
gi|153010755|ref|YP_001371969.1| glycosyl transferase family pro... 34.3 9.5
>gi|15610783|ref|NP_218164.1| hypothetical protein Rv3647c [Mycobacterium tuberculosis H37Rv]
gi|15843259|ref|NP_338296.1| hypothetical protein MT3750 [Mycobacterium tuberculosis CDC1551]
gi|31794817|ref|NP_857310.1| hypothetical protein Mb3671c [Mycobacterium bovis AF2122/97]
74 more sequence titles
Length=192
Score = 372 bits (954), Expect = 2e-101, Method: Compositional matrix adjust.
Identities = 191/192 (99%), Positives = 192/192 (100%), Gaps = 0/192 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL
Sbjct 1 MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA
Sbjct 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP
Sbjct 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
Query 181 DGAEAWVQWPRT 192
DGAEAWVQWPRT
Sbjct 181 DGAEAWVQWPRT 192
>gi|340628619|ref|YP_004747071.1| hypothetical protein MCAN_36671 [Mycobacterium canettii CIPT
140010059]
gi|340006809|emb|CCC45998.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=192
Score = 371 bits (953), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 190/192 (99%), Positives = 192/192 (100%), Gaps = 0/192 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL
Sbjct 1 MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
VPEVARTDENTPLVRTA+DPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA
Sbjct 61 VPEVARTDENTPLVRTAIDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP
Sbjct 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
Query 181 DGAEAWVQWPRT 192
DGAEAWVQWPRT
Sbjct 181 DGAEAWVQWPRT 192
>gi|118619393|ref|YP_907725.1| hypothetical protein MUL_4223 [Mycobacterium ulcerans Agy99]
gi|183985107|ref|YP_001853398.1| hypothetical protein MMAR_5139 [Mycobacterium marinum M]
gi|118571503|gb|ABL06254.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183178433|gb|ACC43543.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=192
Score = 323 bits (827), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 167/191 (88%), Positives = 174/191 (92%), Gaps = 0/191 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFFAAESVPPAV DLSGVLA GQ+V+VG GARLSVVV ESWRA ALAEM++EAGL
Sbjct 1 MSQLSFFAAESVPPAVEDLSGVLAASGQVVMVGAGARLSVVVGESWRAEALAEMMREAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
VPE+ TDE+TPLVRTAVDP L IAAEWTRGAVKTVPPRWLPGPRELRAW LAAGSPEA
Sbjct 61 VPEITHTDEDTPLVRTAVDPRLRAIAAEWTRGAVKTVPPRWLPGPRELRAWALAAGSPEA 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRG PALRISGRRRLSRLVENVGEPP
Sbjct 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGGHPALRISGRRRLSRLVENVGEPP 180
Query 181 DGAEAWVQWPR 191
GAEA QWPR
Sbjct 181 PGAEALAQWPR 191
>gi|240173036|ref|ZP_04751694.1| hypothetical protein MkanA1_27236 [Mycobacterium kansasii ATCC
12478]
Length=192
Score = 315 bits (806), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 163/191 (86%), Positives = 173/191 (91%), Gaps = 0/191 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFFAAESVPPAV DLSGVLA GQIV+VG GARLSVVV+E WRA ALAEM+++AGL
Sbjct 1 MSQLSFFAAESVPPAVDDLSGVLAASGQIVIVGTGARLSVVVSELWRAVALAEMMRDAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
V E+ARTDE+TPLVRTA DP L IAA WTRGAVKTVPPRWLPGPRELR WTLAAGSPEA
Sbjct 61 VAEIARTDEDTPLVRTAADPTLRPIAAAWTRGAVKTVPPRWLPGPRELRTWTLAAGSPEA 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
DRYLLGLDPHAPDT+SPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENVGEPP
Sbjct 121 DRYLLGLDPHAPDTYSPLASALMRVGIAPTLIGTRGARPALRISGRRRLSRLVENVGEPP 180
Query 181 DGAEAWVQWPR 191
D +A QWPR
Sbjct 181 DSPDALAQWPR 191
>gi|342862015|ref|ZP_08718659.1| hypothetical protein MCOL_24110 [Mycobacterium colombiense CECT
3035]
gi|342130555|gb|EGT83864.1| hypothetical protein MCOL_24110 [Mycobacterium colombiense CECT
3035]
Length=199
Score = 312 bits (800), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 164/198 (83%), Positives = 175/198 (89%), Gaps = 7/198 (3%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC-------GARLSVVVAESWRASALAE 53
+SQLSFF AESVPPAVADLSGVLA GQIV VG GARLSVVV +SWRASALA+
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVTVGATGESRVAGARLSVVVDQSWRASALAD 60
Query 54 MIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTL 113
MI+EAGLV E++RTDE+TPLVRTAVDP L +AAEWTRGAVKTVPPRWLPGPRELRAWTL
Sbjct 61 MIREAGLVAEISRTDEDTPLVRTAVDPSLSTLAAEWTRGAVKTVPPRWLPGPRELRAWTL 120
Query 114 AAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLV 173
AAG+PE + YLL LDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLV
Sbjct 121 AAGNPEGEHYLLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLV 180
Query 174 ENVGEPPDGAEAWVQWPR 191
ENVGEPPDGAEA +WPR
Sbjct 181 ENVGEPPDGAEALSRWPR 198
>gi|296166703|ref|ZP_06849128.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897968|gb|EFG77549.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=196
Score = 307 bits (786), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 161/195 (83%), Positives = 171/195 (88%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVG----CGARLSVVVAESWRASALAEMIQ 56
+SQLSFF AESVPPAVADLSGVLA GQIV+VG GARLSVVV ++WRA+ALAEMI+
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGGPEAHGARLSVVVDQAWRAAALAEMIR 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
EAGL PE+ RTDE+TPLVRTAV P L +AAEWTRGAVKTVPPRWLPGPRELRAWTLAAG
Sbjct 61 EAGLAPEIGRTDEDTPLVRTAVTPALVSLAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
PE D YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENV
Sbjct 121 HPEGDHYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GE PDG +A WPR
Sbjct 181 GEAPDGVDASSVWPR 195
>gi|254821834|ref|ZP_05226835.1| hypothetical protein MintA_18012 [Mycobacterium intracellulare
ATCC 13950]
Length=196
Score = 305 bits (781), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/195 (83%), Positives = 171/195 (88%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC----GARLSVVVAESWRASALAEMIQ 56
+SQLSFF AESVPPAVADLSGVLA GQIV VG GARLSVVV WRA+ALA+MI+
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVTVGGAEAQGARLSVVVDAPWRAAALADMIR 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
EAGL E+ RTDE+TPLVRTAVDP L +AAEWTRGAVKTVPPRWLPGPRELRAWTLAAG
Sbjct 61 EAGLAAEIGRTDEDTPLVRTAVDPSLSTLAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+PE + YLL LDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENV
Sbjct 121 NPEGEHYLLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GEPPDGAEA +WPR
Sbjct 181 GEPPDGAEALSRWPR 195
>gi|41406522|ref|NP_959358.1| hypothetical protein MAP0424 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394871|gb|AAS02741.1| hypothetical protein MAP_0424 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=196
Score = 304 bits (778), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 159/195 (82%), Positives = 169/195 (87%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC----GARLSVVVAESWRASALAEMIQ 56
+SQLSFF AESVPPAVADLSGVLA GQIV+VG GARLSVVV +WRA ALA+MI
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARLSVVVDHTWRAEALADMIS 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
EAGLV E+ RTDE+TPLVRTAVDP L +AAEWTRGAVKTVPPRWLPGPRELRAWTLAAG
Sbjct 61 EAGLVAEIGRTDEDTPLVRTAVDPALSPLAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+PE + Y+L LDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENV
Sbjct 121 NPEGEHYVLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GEPPD EA WPR
Sbjct 181 GEPPDSPEASAHWPR 195
>gi|118464759|ref|YP_879798.1| hypothetical protein MAV_0517 [Mycobacterium avium 104]
gi|118166046|gb|ABK66943.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=196
Score = 302 bits (774), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 158/195 (82%), Positives = 168/195 (87%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC----GARLSVVVAESWRASALAEMIQ 56
+SQLSFF AESVPPAVADLSGVLA GQIV+VG GARLSVVV +WRA ALA+MI
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARLSVVVDHTWRAEALADMIS 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
EAGLV E+ RTDE+TPLVRTAVDP L +A EWTRGAVKTVPPRWLPGPRELRAWTLAAG
Sbjct 61 EAGLVAEIGRTDEDTPLVRTAVDPALSPLAVEWTRGAVKTVPPRWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+PE + Y+L LDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENV
Sbjct 121 NPEGEHYVLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GEPPD EA WPR
Sbjct 181 GEPPDSPEASAHWPR 195
>gi|254773486|ref|ZP_05215002.1| hypothetical protein MaviaA2_02245 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=196
Score = 301 bits (771), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 157/195 (81%), Positives = 168/195 (87%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC----GARLSVVVAESWRASALAEMIQ 56
+SQLSFF AESVPPAVADLSGVLA GQIV+VG GARLSVVV +WRA ALA+MI
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARLSVVVDHTWRAEALADMIS 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
EAGLV E+ RTDE+TPLVRTAVDP L +A EWTRGAVKTVPPRWLPGPRELRAWTLAAG
Sbjct 61 EAGLVAEIGRTDEDTPLVRTAVDPALSPLAVEWTRGAVKTVPPRWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+PE + Y+L LDPHAPDTHSPLASALMRVGIAPTLIGTRG RPALRISGRRRLSRLVENV
Sbjct 121 NPEGEHYVLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GEPP+ EA WPR
Sbjct 181 GEPPNSPEASAHWPR 195
>gi|15827004|ref|NP_301267.1| hypothetical protein ML0199 [Mycobacterium leprae TN]
gi|221229482|ref|YP_002502898.1| hypothetical protein MLBr_00199 [Mycobacterium leprae Br4923]
gi|3097242|emb|CAA18819.1| hypothetical protein MLCB2548.32c [Mycobacterium leprae]
gi|13092551|emb|CAC29707.1| ML0199 [Mycobacterium leprae]
gi|219932589|emb|CAR70292.1| unnamed protein product [Mycobacterium leprae Br4923]
Length=200
Score = 298 bits (764), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 160/200 (80%), Positives = 172/200 (86%), Gaps = 8/200 (4%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC-------GARLSVVVAESWRASALAE 53
+SQLSFF AES+ PA+ADL+GVLA GQIV+V ARLSVVV + WRASALAE
Sbjct 1 MSQLSFFTAESLLPAIADLAGVLAASGQIVVVSASGQSPAPAARLSVVVDQLWRASALAE 60
Query 54 MIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTL 113
MI EAGLVPE++RT+E+TPLVRTAVDPLLC IAAEWTRGAVKTVPPRWLPGPRELRAW L
Sbjct 61 MISEAGLVPEISRTEEDTPLVRTAVDPLLCPIAAEWTRGAVKTVPPRWLPGPRELRAWIL 120
Query 114 AAGSPE-ADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRL 172
AAG PE A+RYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTR RPALRISGRRRLSRL
Sbjct 121 AAGVPEAANRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRSGRPALRISGRRRLSRL 180
Query 173 VENVGEPPDGAEAWVQWPRT 192
+ENVGEPPD AEA WPR
Sbjct 181 LENVGEPPDWAEALALWPRV 200
>gi|120406345|ref|YP_956174.1| hypothetical protein Mvan_5397 [Mycobacterium vanbaalenii PYR-1]
gi|119959163|gb|ABM16168.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=222
Score = 293 bits (751), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 151/195 (78%), Positives = 167/195 (86%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG----ARLSVVVAESWRASALAEMIQ 56
VSQLSFF+AESVPP +ADL+G+LA GQ+VLVG ARLSVVV + WRA LAEMI+
Sbjct 27 VSQLSFFSAESVPPTIADLTGILAAAGQVVLVGGARDQAARLSVVVDQVWRAEGLAEMIE 86
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
+AGL E++RTDE++PLVRTAVD L IA EWTRGAVKTVPP+WLPGPRELRAWTLAAG
Sbjct 87 DAGLAAEISRTDEDSPLVRTAVDTRLVAIATEWTRGAVKTVPPQWLPGPRELRAWTLAAG 146
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
PE DRYLLGLDPHAPDTHS LASA+MRVGIAPTLIGTRG+RPALRISGRRRL RLVENV
Sbjct 147 RPEDDRYLLGLDPHAPDTHSALASAMMRVGIAPTLIGTRGSRPALRISGRRRLLRLVENV 206
Query 177 GEPPDGAEAWVQWPR 191
GEPPD A A QWP+
Sbjct 207 GEPPDDAAALTQWPQ 221
>gi|145221985|ref|YP_001132663.1| hypothetical protein Mflv_1393 [Mycobacterium gilvum PYR-GCK]
gi|315446275|ref|YP_004079154.1| hypothetical protein Mspyr1_47790 [Mycobacterium sp. Spyr1]
gi|145214471|gb|ABP43875.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315264578|gb|ADU01320.1| hypothetical protein Mspyr1_47790 [Mycobacterium sp. Spyr1]
Length=199
Score = 292 bits (748), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 152/195 (78%), Positives = 165/195 (85%), Gaps = 4/195 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVL----VGCGARLSVVVAESWRASALAEMIQ 56
+SQLSFF+AESVPPA+ADL+G+LAGPGQ+VL G ARLSVVV WRA ALAEMI
Sbjct 1 MSQLSFFSAESVPPAIADLTGILAGPGQVVLRGGAEGQAARLSVVVEARWRADALAEMIA 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
+ GL PE+ RTDE PLVRTA D L IA +WTRGAVKTVPP+WLPGPRELRAWTLAAG
Sbjct 61 DVGLEPEITRTDEGHPLVRTAADVRLVAIAVDWTRGAVKTVPPQWLPGPRELRAWTLAAG 120
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+PEADRYLLGLDPHAPDTH LASA+MRVGIAPTLIGTRG+RPALRISGRRRLSRLVENV
Sbjct 121 TPEADRYLLGLDPHAPDTHPALASAMMRVGIAPTLIGTRGSRPALRISGRRRLSRLVENV 180
Query 177 GEPPDGAEAWVQWPR 191
GEPP EA QWPR
Sbjct 181 GEPPAAVEALAQWPR 195
>gi|108801760|ref|YP_641957.1| hypothetical protein Mmcs_4797 [Mycobacterium sp. MCS]
gi|119870911|ref|YP_940863.1| hypothetical protein Mkms_4883 [Mycobacterium sp. KMS]
gi|126437747|ref|YP_001073438.1| hypothetical protein Mjls_5183 [Mycobacterium sp. JLS]
gi|108772179|gb|ABG10901.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119697000|gb|ABL94073.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126237547|gb|ABO00948.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=224
Score = 292 bits (748), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 151/196 (78%), Positives = 169/196 (87%), Gaps = 4/196 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG----ARLSVVVAESWRASALAEMIQ 56
VSQLSFF+AE+VPPAVADL+G+LA PGQ+VLVG G ARLSVVV + WRA ALAEMI
Sbjct 28 VSQLSFFSAEAVPPAVADLTGLLAAPGQVVLVGSGREQGARLSVVVEDLWRAEALAEMIT 87
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
+AGL E++RTDENTPLVRTAV+P L IAAEWTRGAVKTVPP+WLPGPRELRAWTLA+G
Sbjct 88 DAGLGAEISRTDENTPLVRTAVEPRLVAIAAEWTRGAVKTVPPQWLPGPRELRAWTLASG 147
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENV 176
+ E + YLLGLDPHAPDTHSPLASA+MR+GIAPTLIGTRG+RPALRISGRRRL+RLVE V
Sbjct 148 TREPNGYLLGLDPHAPDTHSPLASAMMRIGIAPTLIGTRGSRPALRISGRRRLTRLVETV 207
Query 177 GEPPDGAEAWVQWPRT 192
GEPP A WP T
Sbjct 208 GEPPQDVAALSHWPST 223
>gi|169627592|ref|YP_001701241.1| hypothetical protein MAB_0488 [Mycobacterium abscessus ATCC 19977]
gi|169239559|emb|CAM60587.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=212
Score = 281 bits (720), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/194 (75%), Positives = 162/194 (84%), Gaps = 3/194 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
VSQLSFF+AESVPP V DL+G+LAGPGQ+V+ G GAR+SVVV + WRA ALAEMI E GL
Sbjct 18 VSQLSFFSAESVPPEVTDLAGLLAGPGQVVVSGAGARISVVVDQPWRALALAEMITETGL 77
Query 61 VPEVARTD---ENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGS 117
E+ T+ EN PLVRTA+DP + IA EWTRGAVKTVP +WLPG RELRAW LAAGS
Sbjct 78 QAEIGHTETGTENHPLVRTAIDPAILPIAREWTRGAVKTVPAQWLPGARELRAWVLAAGS 137
Query 118 PEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVG 177
PEADRYLLGLDPHAPDTHSPLA+ALMRVGIAPTLIGTRG PALRISGRRRL RL+EN+G
Sbjct 138 PEADRYLLGLDPHAPDTHSPLAAALMRVGIAPTLIGTRGANPALRISGRRRLGRLLENIG 197
Query 178 EPPDGAEAWVQWPR 191
EPP +A+ WPR
Sbjct 198 EPPGDTDAFRVWPR 211
>gi|118470880|ref|YP_890378.1| hypothetical protein MSMEG_6158 [Mycobacterium smegmatis str.
MC2 155]
gi|118172167|gb|ABK73063.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=197
Score = 276 bits (707), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 147/196 (75%), Positives = 163/196 (84%), Gaps = 5/196 (2%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG----ARLSVVVAESWRASALAEMIQ 56
+SQLSFF+AESVPPAV DL+G+LA PGQI++VG G AR+SVVV E WRA LAEMI+
Sbjct 1 MSQLSFFSAESVPPAVTDLTGMLAAPGQILVVGGGGHPTARISVVVDELWRAHGLAEMIE 60
Query 57 EAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAG 116
+AGL E+ART+ENTPLVRT +D L +A WTRGAVKTVPP WLPG RELRAWTLAAG
Sbjct 61 QAGLTAEIARTEENTPLVRTTMDVRLVPLARAWTRGAVKTVPPEWLPGSRELRAWTLAAG 120
Query 117 SPEA-DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVEN 175
+PEA DRYLLGLDPHAPDTH LASA+MRVGIAPTLIGTRG+ PALRISGRRRL RLVEN
Sbjct 121 TPEADDRYLLGLDPHAPDTHPVLASAMMRVGIAPTLIGTRGSHPALRISGRRRLLRLVEN 180
Query 176 VGEPPDGAEAWVQWPR 191
VGEPP A QWPR
Sbjct 181 VGEPPGDVAALAQWPR 196
>gi|333992572|ref|YP_004525186.1| hypothetical protein JDM601_3932 [Mycobacterium sp. JDM601]
gi|333488540|gb|AEF37932.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=192
Score = 266 bits (681), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 138/180 (77%), Positives = 156/180 (87%), Gaps = 0/180 (0%)
Query 12 VPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVPEVARTDENT 71
+PP+VADL+GVLAGPGQIV++G ARLSVVV WRA ALAE+I EAGL+PE+ RT+E+T
Sbjct 1 MPPSVADLAGVLAGPGQIVVMGAEARLSVVVDAQWRAVALAELITEAGLLPEITRTEEDT 60
Query 72 PLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYLLGLDPHA 131
PLVRTAVD L +A WTRGAVKTVPP+W+PGPRELRAWTLAAG+ EADRYLLGLDPHA
Sbjct 61 PLVRTAVDSRLRALAQAWTRGAVKTVPPQWVPGPRELRAWTLAAGAAEADRYLLGLDPHA 120
Query 132 PDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAEAWVQWPR 191
PDT +PLASALMRVGIAPTLIG RG+ PALRI+GRRRL+RLVENVGE P EA QWPR
Sbjct 121 PDTFAPLASALMRVGIAPTLIGIRGSHPALRITGRRRLARLVENVGESPPVPEALTQWPR 180
>gi|336460893|gb|EGO39777.1| hypothetical protein MAPs_36180 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=153
Score = 246 bits (628), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 125/152 (83%), Positives = 134/152 (89%), Gaps = 0/152 (0%)
Query 40 VVVAESWRASALAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPP 99
+VV +WRA ALA+MI EAGLV E+ RTDE+TPLVRTAVDP L +AAEWTRGAVKTVPP
Sbjct 1 MVVDHTWRAEALADMISEAGLVAEIGRTDEDTPLVRTAVDPALSPLAAEWTRGAVKTVPP 60
Query 100 RWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRP 159
RWLPGPRELRAWTLAAG+PE + Y+L LDPHAPDTHSPLASALMRVGIAPTLIGTRG RP
Sbjct 61 RWLPGPRELRAWTLAAGNPEGEHYVLALDPHAPDTHSPLASALMRVGIAPTLIGTRGGRP 120
Query 160 ALRISGRRRLSRLVENVGEPPDGAEAWVQWPR 191
ALRISGRRRLSRLVENVGEPPD EA WPR
Sbjct 121 ALRISGRRRLSRLVENVGEPPDSPEASAHWPR 152
>gi|289748161|ref|ZP_06507539.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289688748|gb|EFD56177.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=128
Score = 242 bits (618), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 124/128 (97%), Positives = 124/128 (97%), Gaps = 0/128 (0%)
Query 65 ARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYL 124
ARTDENT TAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYL
Sbjct 1 ARTDENTRWCGTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYL 60
Query 125 LGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAE 184
LGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAE
Sbjct 61 LGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAE 120
Query 185 AWVQWPRT 192
AWVQWPRT
Sbjct 121 AWVQWPRT 128
>gi|312138018|ref|YP_004005354.1| hypothetical protein REQ_05450 [Rhodococcus equi 103S]
gi|325675219|ref|ZP_08154904.1| hypothetical protein HMPREF0724_12686 [Rhodococcus equi ATCC
33707]
gi|311887357|emb|CBH46668.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325553925|gb|EGD23602.1| hypothetical protein HMPREF0724_12686 [Rhodococcus equi ATCC
33707]
Length=194
Score = 228 bits (581), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 118/193 (62%), Positives = 142/193 (74%), Gaps = 2/193 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
++QLSFF+AES+PPAV DL G+LA GQ+ + GAR+SVVV WRA A+A ++ EA L
Sbjct 1 MAQLSFFSAESMPPAVTDLGGLLAAQGQVAVSKDGARVSVVVDSLWRAEAIATLMAEADL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
PE+ ++E PLVRTA P L +AA WTRGAVK+VPP W+PG RE RAW LAAG EA
Sbjct 61 EPEIGTSEEGRPLVRTASVPHLIDLAARWTRGAVKSVPPGWIPGAREQRAWVLAAGRVEA 120
Query 121 D--RYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
D RYLLGLDPHAPDTH LA +LMR G+APT++G RG+ P LRISGRRRL L EN+GE
Sbjct 121 DGQRYLLGLDPHAPDTHVVLAQSLMRAGVAPTIVGIRGSTPGLRISGRRRLMHLAENIGE 180
Query 179 PPDGAEAWVQWPR 191
PD +A WP
Sbjct 181 APDDPDARRNWPH 193
>gi|226363672|ref|YP_002781454.1| hypothetical protein ROP_42620 [Rhodococcus opacus B4]
gi|226242161|dbj|BAH52509.1| hypothetical protein [Rhodococcus opacus B4]
Length=194
Score = 223 bits (569), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 117/193 (61%), Positives = 144/193 (75%), Gaps = 2/193 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFF+AE++PPAV DL G+LA GQ+V G AR+S+VV WRA A+AE+I +AGL
Sbjct 1 MSQLSFFSAEAMPPAVTDLCGLLAATGQVVTSGGRARISIVVDAQWRAEAIAELIAQAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
E+ R+DE +PLVRTA L +A +WTRGAVK VP W+P R+LR W LA+G EA
Sbjct 61 EVEITRSDEGSPLVRTASVVDLRPLADQWTRGAVKAVPSGWVPSGRQLRVWALASGRGEA 120
Query 121 --DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
+R++LGLDPHAPDTH+PLA ALMR GIAPTLIGTRG+ P LRISGRRRL RLVE++GE
Sbjct 121 EGERFVLGLDPHAPDTHAPLAQALMRAGIAPTLIGTRGSGPGLRISGRRRLGRLVESIGE 180
Query 179 PPDGAEAWVQWPR 191
P + WP
Sbjct 181 APGNVDDRTGWPH 193
>gi|111021328|ref|YP_704300.1| hypothetical protein RHA1_ro04352 [Rhodococcus jostii RHA1]
gi|110820858|gb|ABG96142.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=194
Score = 223 bits (567), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/193 (61%), Positives = 143/193 (75%), Gaps = 2/193 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFF+AES+PPAV DL G+LA GQ+V AR+S+VV WRA A+AE+I +AGL
Sbjct 1 MSQLSFFSAESMPPAVTDLCGLLAATGQVVTSAGRARISIVVDAQWRAEAIAELITQAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
E+ R+DE +PLVRTA L +A +WTRGAVK VP W+P R+LR W LA+G EA
Sbjct 61 EVEITRSDEGSPLVRTASVVDLRPLADQWTRGAVKAVPSGWVPSGRQLRVWALASGRSEA 120
Query 121 --DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
+R++LGLDPHAPDTH+PLA ALMR GIAPTLIGTRG+ P LRISGRRRL RLVE++GE
Sbjct 121 EGERFVLGLDPHAPDTHAPLAQALMRAGIAPTLIGTRGSGPGLRISGRRRLGRLVESIGE 180
Query 179 PPDGAEAWVQWPR 191
P + WP
Sbjct 181 APGNLDDRTGWPH 193
>gi|226304001|ref|YP_002763959.1| hypothetical protein RER_05120 [Rhodococcus erythropolis PR4]
gi|226183116|dbj|BAH31220.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=206
Score = 213 bits (541), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/193 (56%), Positives = 140/193 (73%), Gaps = 2/193 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
VSQLSFF+AES+PPAV DL+G+LAGPGQ+V AR+S+VV WRA A+AE+I + GL
Sbjct 13 VSQLSFFSAESIPPAVTDLAGMLAGPGQVVTSEDRARISIVVDRDWRAQAVAELIAQCGL 72
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPE- 119
EV R++E +PLVRT P L ++ +WT+GAVK VP W+P R+LR W +AAG E
Sbjct 73 GAEVTRSEEGSPLVRTQSTPALLPLSVQWTKGAVKAVPVGWVPNSRQLRVWAVAAGRLEE 132
Query 120 -ADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
+R++ GLDPHA +TH+PLA ALMRVGIAPT +G R P LR+SG++RL++LVE +GE
Sbjct 133 GGERFVFGLDPHAKETHAPLAQALMRVGIAPTQLGNRTPGPGLRVSGKKRLTKLVEYLGE 192
Query 179 PPDGAEAWVQWPR 191
P + V WP
Sbjct 193 APKHVDTSVAWPH 205
>gi|229494796|ref|ZP_04388552.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318292|gb|EEN84157.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=194
Score = 211 bits (538), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 106/193 (55%), Positives = 140/193 (73%), Gaps = 2/193 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQLSFF+AES+PPAV DL+G+LAGPGQ+V AR+S+VV WRA A+AE+I + GL
Sbjct 1 MSQLSFFSAESIPPAVTDLAGMLAGPGQVVTSEDRARISIVVDRDWRAQAVAELIAQCGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPE- 119
EV R++E +PLVRT P L ++ +WT+GAVK VP W+P R+LR W +AAG E
Sbjct 61 GAEVTRSEEGSPLVRTQSTPALLPLSVQWTKGAVKAVPVGWVPNSRQLRVWAVAAGRLEE 120
Query 120 -ADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
+R++ GLDPHA +TH+PLA ALMRVGIAPT +G R P LR+SG++RL++LVE +GE
Sbjct 121 GGERFVFGLDPHAKETHAPLAQALMRVGIAPTQLGNRTPGPGLRVSGKKRLTKLVEYLGE 180
Query 179 PPDGAEAWVQWPR 191
P + V WP
Sbjct 181 APKHVDTSVAWPH 193
>gi|262200588|ref|YP_003271796.1| hypothetical protein Gbro_0574 [Gordonia bronchialis DSM 43247]
gi|262083935|gb|ACY19903.1| hypothetical protein Gbro_0574 [Gordonia bronchialis DSM 43247]
Length=199
Score = 207 bits (526), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 113/191 (60%), Positives = 132/191 (70%), Gaps = 2/191 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
V QLSF++AE+ PA DL+G+LA GQ G R+S+VV WRA + E + AGL
Sbjct 7 VGQLSFYSAETEQPAYDDLAGLLAAHGQSARSDSGTRVSIVVPARWRAEHIVEEMTAAGL 66
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
E A +DE TPL RTA P L + WT GAVK VP W P PR LR W LAAG PE
Sbjct 67 TAESATSDEGTPLARTAACPELDALHRAWTSGAVKAVPAGWTPTPRVLRLWVLAAGRPEG 126
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTR-GTRPALRISGRRRLSRLVENVGEP 179
DRYLLGLDP+APDTHSPLA+ALMRVGIAPTL+G R G PALR++GRRRL+RLVE +G+P
Sbjct 127 DRYLLGLDPYAPDTHSPLATALMRVGIAPTLVGARSGHPPALRVAGRRRLTRLVEYIGDP 186
Query 180 PDGAEAWVQWP 190
P A A WP
Sbjct 187 PSAA-ATADWP 196
>gi|343926495|ref|ZP_08766000.1| hypothetical protein GOALK_060_01590 [Gordonia alkanivorans NBRC
16433]
gi|343763733|dbj|GAA12926.1| hypothetical protein GOALK_060_01590 [Gordonia alkanivorans NBRC
16433]
Length=194
Score = 193 bits (490), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/190 (55%), Positives = 129/190 (68%), Gaps = 1/190 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+ QLSFF+AE+ PA +DL+G+LA GQ V G R+S+VV + WRA + E ++ +GL
Sbjct 1 MGQLSFFSAETEEPAYSDLAGLLAAHGQAVRSDSGTRVSIVVRDRWRAEQIVEEMRASGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
EV +DE TPL RTA L + W+ GAVK +P W+P R LR W +A+G +
Sbjct 61 DAEVTTSDEGTPLARTAACHELDALHLAWSAGAVKAMPTGWIPSYRALRLWVIASGHSDE 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
RY LGLDPHAPDTH+ LA+ALMRVGIAPTL+GTRG PALRI+G RRL RL E VG PP
Sbjct 121 GRYQLGLDPHAPDTHAALATALMRVGIAPTLVGTRGHSPALRIAGHRRLVRLHEYVGPPP 180
Query 181 DGAEAWVQWP 190
+ A A WP
Sbjct 181 NAA-AVPDWP 189
>gi|326383465|ref|ZP_08205152.1| hypothetical protein SCNU_11031 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197871|gb|EGD55058.1| hypothetical protein SCNU_11031 [Gordonia neofelifaecis NRRL
B-59395]
Length=191
Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 100/190 (53%), Positives = 131/190 (69%), Gaps = 1/190 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
+SQ+S F+AE PA+ADL+G+LA GQ V GAR+SVVVA+ WRA + I+ AGL
Sbjct 1 MSQMSLFSAEIEDPAIADLAGLLAAQGQSVHTSWGARVSVVVADEWRAEEICAEIRGAGL 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
E+ ++E PL RT +P + + W+ GAVK VP W P P LR WTLA+G P+
Sbjct 61 EAEILTSEEGRPLARTEANPRITALHRAWSAGAVKAVPEGWTPTPHALRLWTLASGRPDG 120
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
YLLGLDPHAPDTH+PL+++LMR+GIAPTL+G +G ALR+S R+R++RL E VG P
Sbjct 121 AHYLLGLDPHAPDTHAPLSTSLMRIGIAPTLVGVKGGAHALRVSSRKRITRLAETVGIAP 180
Query 181 DGAEAWVQWP 190
+GA V WP
Sbjct 181 EGAPDGV-WP 189
>gi|289571884|ref|ZP_06452111.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289545638|gb|EFD49286.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=109
Score = 187 bits (475), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 94/94 (100%), Positives = 94/94 (100%), Gaps = 0/94 (0%)
Query 99 PRWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTR 158
PRWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTR
Sbjct 16 PRWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTR 75
Query 159 PALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT 192
PALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT
Sbjct 76 PALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT 109
>gi|134096989|ref|YP_001102650.1| hypothetical protein SACE_0376 [Saccharopolyspora erythraea NRRL
2338]
gi|291006266|ref|ZP_06564239.1| hypothetical protein SeryN2_17248 [Saccharopolyspora erythraea
NRRL 2338]
gi|133909612|emb|CAL99724.1| hypothetical protein SACE_0376 [Saccharopolyspora erythraea NRRL
2338]
Length=194
Score = 171 bits (433), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 97/185 (53%), Positives = 121/185 (66%), Gaps = 2/185 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG--ARLSVVVAESWRASALAEMIQEA 58
+ QLSFF+AE+ P +ADL+G+L GPGQ V G G ARLSVVV ++WRA +L +
Sbjct 1 MDQLSFFSAEARHPRIADLAGLLCGPGQAVGFGRGTAARLSVVVDDAWRARSLVLACADR 60
Query 59 GLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSP 118
G+ E+ R+DE PLVRTA L +A W RGAVK+VP + P LR W L AG
Sbjct 61 GVDAELGRSDEGRPLVRTAFRADLTELARHWLRGAVKSVPADFAPDGCALRLWALTAGRL 120
Query 119 EADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
E YLLGLDPHAP+TH PL +AL R G+ IG R PALR++G+RR++RL E VG
Sbjct 121 EPGGYLLGLDPHAPETHEPLVAALARSGLPARFIGARAGGPALRVTGKRRIARLAELVGP 180
Query 179 PPDGA 183
PDGA
Sbjct 181 VPDGA 185
>gi|302530816|ref|ZP_07283158.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302439711|gb|EFL11527.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=233
Score = 128 bits (322), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 84/198 (43%), Positives = 114/198 (58%), Gaps = 11/198 (5%)
Query 4 LSFFAAESVPPAVADLSGVLAGPGQIVLVG-CGARLSVVVAESWRASALAEMIQEAGLVP 62
+S F+AE+ P + DL+G+L GQI G ARLSV+V E WRA LA + G
Sbjct 5 ISLFSAEATGPGLPDLAGLLCCQGQITGFGRTAARLSVLVDEPWRARVLARECRSRGADA 64
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTR---------GAVKTVPPRWLPGPRELRAWTL 113
+VA + +P VRT+ L G+A +W R + K VP + LR W L
Sbjct 65 QVAVAECGSPQVRTSFRVDLLGLAEQWLRPGHTGPTEDDSGKAVPGGFRLSGAMLRMWAL 124
Query 114 AAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLV 173
AAG PE YLLG+DP AP TH L + L +G+ L+G + +PA+R+SGRR+L+ L+
Sbjct 125 AAGRPEPGGYLLGVDPLAPGTHEELLTVLAPLGVHARLLGPKAEQPAVRVSGRRKLAGLL 184
Query 174 ENVGEPPDGAEA-WVQWP 190
E +GEPP GAEA W + P
Sbjct 185 ELIGEPPAGAEAVWPELP 202
>gi|300790600|ref|YP_003770891.1| hypothetical protein AMED_8796 [Amycolatopsis mediterranei U32]
gi|299800114|gb|ADJ50489.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532289|gb|AEK47494.1| hypothetical protein RAM_45135 [Amycolatopsis mediterranei S699]
Length=216
Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/197 (41%), Positives = 106/197 (54%), Gaps = 12/197 (6%)
Query 4 LSFFAAESVPPAVADLSGVLAGPGQIVLVG-CGARLSVVVAESWRASALAEMIQEAGLVP 62
+S F+AE+ P + DL+G+L GQI G ARLSVVV E WRA LA ++ G
Sbjct 5 ISLFSAEASGPGLGDLAGLLCCHGQITGFGRTAARLSVVVEEPWRAHVLAGELRCRGADA 64
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAV---------KTVPPRWLPGPRELRAWTL 113
+V++ D P VRT+ L +A +W R K VP + LR W L
Sbjct 65 QVSKADCGRPQVRTSFRVDLLPLALQWLREGCAGPVEDDSGKAVPDGFRLSGAMLRMWAL 124
Query 114 AAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLV 173
A G P YLLG+DP AP H L AL +G+ L G + PA++++G+RRL L+
Sbjct 125 AGGRPGTQGYLLGVDPLAPGMHERLVEALTPLGVPAKLTGPKAEVPAVKVTGKRRLEALL 184
Query 174 ENVGEPPDGAEAWVQWP 190
E +GEPP GAEA WP
Sbjct 185 ELIGEPPPGAEA--AWP 199
>gi|256374445|ref|YP_003098105.1| hypothetical protein Amir_0290 [Actinosynnema mirum DSM 43827]
gi|255918748|gb|ACU34259.1| hypothetical protein Amir_0290 [Actinosynnema mirum DSM 43827]
Length=210
Score = 114 bits (284), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 80/199 (41%), Positives = 102/199 (52%), Gaps = 18/199 (9%)
Query 2 SQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG--ARLSVVVAESWRASALAEMIQEAG 59
QLSF++AE+ P V DL+G+L GPG+++ G ARL+ V+A+ WR AL + E G
Sbjct 3 QQLSFYSAEARRPGVDDLAGLLCGPGRVLGFARGRAARLTAVLADPWRGPALVAALAERG 62
Query 60 LVPEVAR----------TDENTP------LVRTAVDPLLCGIAAEWTRGAVKTVPPRWLP 103
+ E D P VRT L +AA W K VP + P
Sbjct 63 VQAESGAPEPVGDPEPPADGQEPGAQPPVQVRTPFRTDLAPLAAHWLLAGAKVVPRGFTP 122
Query 104 GPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRI 163
LR W L +G YLLGLDP APDTH PL +AL G+ L+ + PALR+
Sbjct 123 HGGVLRLWALTSGRWVEPGYLLGLDPDAPDTHEPLRAALASAGLPAALLTPKSGGPALRV 182
Query 164 SGRRRLSRLVENVGEPPDG 182
+GRRRL RL E VG P G
Sbjct 183 TGRRRLERLSELVGRAPTG 201
>gi|258650984|ref|YP_003200140.1| hypothetical protein Namu_0737 [Nakamurella multipartita DSM
44233]
gi|258554209|gb|ACV77151.1| hypothetical protein Namu_0737 [Nakamurella multipartita DSM
44233]
Length=192
Score = 111 bits (277), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 73/191 (39%), Positives = 105/191 (55%), Gaps = 2/191 (1%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
++QLS ++A+ P DL G+LA G++ G RL + +A+ WRASAL + +
Sbjct 1 MTQLSLWSADLTAPVGEDLGGLLAADGRLEEGDDGVRLIIPLADPWRASALVRECRVRDV 60
Query 61 VPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEA 120
+ TDE+ +RT P+L + W G K +P +R W +A+G P
Sbjct 61 DAHI-ETDEHVTELRTDPAPVLAELRERWVDGPDKVMPAGLELSAGLIRCWVIASGRPAP 119
Query 121 DRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
YLLGLDP P+ H PLA+ +G+A +++G RG PA+RI G RR SRL E VG PP
Sbjct 120 VGYLLGLDPRTPELHQPLAAVCAAMGLAGSILGPRGGGPAVRIVGHRRCSRLAEMVGTPP 179
Query 181 DGAEAWVQWPR 191
A A Q+P+
Sbjct 180 PEAPAG-QFPQ 189
>gi|319948997|ref|ZP_08023097.1| hypothetical protein ES5_06342 [Dietzia cinnamea P4]
gi|319437338|gb|EFV92358.1| hypothetical protein ES5_06342 [Dietzia cinnamea P4]
Length=210
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/194 (40%), Positives = 103/194 (54%), Gaps = 9/194 (4%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
V+QLSFFAA+ P +DL GVLA GQ L G A++SV + +WRA A ++ +AGL
Sbjct 16 VTQLSFFAADDHVPDPSDLEGVLAARGQSTLAGEVAQVSVALDAAWRADAFEAILAQAGL 75
Query 61 VPEVARTD-ENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPE 119
P + D + V TA +L + W RGAV VP W P LR W L AG
Sbjct 76 DPMRSDPDPDGRCTVSTARTSVLAPVVRRWRRGAVTAVPEGWTPSAGALRIWVLTAGHIT 135
Query 120 ADRYL-----LGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVE 174
+ GL+ HAP L +AL RVGI T +G++G P LR+ + +RL +
Sbjct 136 DTGVVELGIDAGLEHHAP-RRDALRAALERVGIRTTYVGSKGGGPLLRLGTAKARARLAQ 194
Query 175 NVGEPPDG--AEAW 186
++G PP G AE W
Sbjct 195 DIGAPPAGVPAEHW 208
>gi|159039924|ref|YP_001539177.1| hypothetical protein Sare_4409 [Salinispora arenicola CNS-205]
gi|157918759|gb|ABW00187.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=236
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/178 (42%), Positives = 98/178 (56%), Gaps = 1/178 (0%)
Query 3 QLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVP 62
QL+ F AE+ PAVADL+G+LAGP + ++G ARL+VVV ++WR L + GL
Sbjct 48 QLALFGAEATDPAVADLAGLLAGPAEASVMGGTARLAVVVDDAWRVHVLIAELDARGLPA 107
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADR 122
A + VRT+ +L + A+W G K PP + R LR W +AAG+
Sbjct 108 SWAAVGDGRHTVRTSYTRVLKPLVAQWLHGPAKHPPPGFHLDGRGLRLWLVAAGAVAESG 167
Query 123 YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
LL L P A SP+ +AL VG+ P + PA RISGRR L+R E VG+PP
Sbjct 168 VLLRLGPAAHRRVSPVGAALAAVGL-PAVPEPAPDGPAYRISGRRPLNRFAELVGDPP 224
>gi|330465253|ref|YP_004402996.1| hypothetical protein VAB18032_06360 [Verrucosispora maris AB-18-032]
gi|328808224|gb|AEB42396.1| hypothetical protein VAB18032_06360 [Verrucosispora maris AB-18-032]
Length=260
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/176 (40%), Positives = 93/176 (53%), Gaps = 0/176 (0%)
Query 3 QLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVP 62
QL FF AE+ P+VADL+G+LAGPG++ +G ARLSVVV WR L + + G+
Sbjct 71 QLVFFGAETAEPSVADLAGLLAGPGEVHRMGGTARLSVVVDAGWRVHVLVAELAQRGVRA 130
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADR 122
T++ V+TA + +AA W RG + P + R LR W AAG +
Sbjct 131 TWTPTEDQRYAVQTAYTRAIVPLAAAWLRGPTQQPPAGFQLDGRRLRLWLAAAGVVDPPE 190
Query 123 YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
LL L P S + +AL G+ L+ PA RISGRRR+ RL E VGE
Sbjct 191 ILLHLGGVDPGRWSVVGAALTAAGLVGELVEPGAGGPAYRISGRRRVLRLAELVGE 246
>gi|145596539|ref|YP_001160836.1| hypothetical protein Strop_4028 [Salinispora tropica CNB-440]
gi|145305876|gb|ABP56458.1| hypothetical protein Strop_4028 [Salinispora tropica CNB-440]
Length=238
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/178 (39%), Positives = 95/178 (54%), Gaps = 1/178 (0%)
Query 3 QLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVP 62
QL+FF AE+ PAVAD++G+LAGP I ++G ARL+VVV ++WR L ++ L
Sbjct 50 QLTFFGAEAAEPAVADVAGLLAGPADISVMGGTARLAVVVDDAWRVHVLVAELEARHLPT 109
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADR 122
A VRTA +L + A W G K P + R LR W +AAG+
Sbjct 110 SWAAAGGGRHTVRTAYTRVLKPLVAAWLNGPAKHPPDAFHLDGRGLRLWLVAAGAVMDSD 169
Query 123 YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPP 180
LL L P A + + +AL VG+ P + + A RI+GRR L+R E VG+PP
Sbjct 170 VLLRLGPAAHQRVASVGAALAAVGL-PAVPESGPDGLAYRITGRRLLNRFAELVGDPP 226
>gi|331694280|ref|YP_004330519.1| hypothetical protein Psed_0394 [Pseudonocardia dioxanivorans
CB1190]
gi|326948969|gb|AEA22666.1| hypothetical protein Psed_0394 [Pseudonocardia dioxanivorans
CB1190]
Length=211
Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/187 (42%), Positives = 100/187 (54%), Gaps = 14/187 (7%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG--ARLSVVVAESWRASALAEMIQEA 58
V+QLS F+AE+ P ADL+G+L GPG+I G G ARLS+ VA++ RA A+
Sbjct 4 VAQLSLFSAEARPVRRADLAGLLCGPGRIARFGSGTTARLSLQVADAGRARAVRAAAAAT 63
Query 59 GLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAV----------KTVPPRWLPGPREL 108
G+ E D+ T +R+A L +A W K VP + L
Sbjct 64 GVRLEATPADDGTVALRSAFRCDLVALAKAWAGSDAAGGAAAAADRKVVPDDFQLDGSLL 123
Query 109 RAWTLAAG-SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRR 167
R W LAAG + E YLL LDP AP TH PLA+A R GI+P +G PALRISG
Sbjct 124 RLWALAAGRADERGGYLLALDPLAPHTHRPLAAAAYRAGISPARVGG-DDHPALRISGAA 182
Query 168 RLSRLVE 174
R+ RLV+
Sbjct 183 RVRRLVD 189
>gi|238062264|ref|ZP_04606973.1| hypothetical protein MCAG_03230 [Micromonospora sp. ATCC 39149]
gi|237884075|gb|EEP72903.1| hypothetical protein MCAG_03230 [Micromonospora sp. ATCC 39149]
Length=205
Score = 87.4 bits (215), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 67/179 (38%), Positives = 94/179 (53%), Gaps = 1/179 (0%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGL 60
V QLS F AE+ P+VADL+G+LAGPG++ +G ARLSVV+ +WR L + G+
Sbjct 13 VRQLSLFGAEAADPSVADLAGLLAGPGEVSRMGGTARLSVVLDSAWRVHVLVAELGRRGV 72
Query 61 VPEVARTDENTPLVRTAV-DPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPE 119
T + LVRT+ L A VK P + R LR W AAG+ +
Sbjct 73 AATWEATADGRHLVRTSYASTLAPLALAWLAAEDVKRPPAGFHLNGRRLRLWVAAAGAAD 132
Query 120 ADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGE 178
+LL L P+ +AL VG+ L+ + PA RI+GRRRL+RL + +G+
Sbjct 133 PPGFLLRLGATDERCWGPVGAALAAVGLPAVLLDAQAGGPAYRITGRRRLARLADLIGD 191
>gi|315501238|ref|YP_004080125.1| hypothetical protein ML5_0422 [Micromonospora sp. L5]
gi|315407857|gb|ADU05974.1| hypothetical protein ML5_0422 [Micromonospora sp. L5]
Length=202
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/188 (40%), Positives = 99/188 (53%), Gaps = 2/188 (1%)
Query 3 QLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVP 62
Q S F+ E+ PA+ADL+G+LAGPG++ +G AR+SVVV +WR L + G+
Sbjct 13 QPSLFSTEAADPALADLAGLLAGPGEVGRMGGTARISVVVDAAWRVHVLVAELGARGVPA 72
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADR 122
T++ VRTA +L +A W RGAVK P R+ R LR W AAG+ E
Sbjct 73 SWEPTEDGRHRVRTAYTSMLAPLARAWLRGAVKRPPARFHLDGRRLRLWAAAAGTAEPAG 132
Query 123 YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDG 182
+ L L P + + +AL VG+ + PA RI G RR+SRL E VGE P
Sbjct 133 FRLRLGPADEQSWPVVRAALAAVGLPAAFVEPDEGGPAFRIGG-RRMSRLAELVGERPAT 191
Query 183 AEAWVQWP 190
A WP
Sbjct 192 APV-ADWP 198
>gi|325002271|ref|ZP_08123383.1| hypothetical protein PseP1_26078 [Pseudonocardia sp. P1]
Length=199
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 96/198 (49%), Gaps = 13/198 (6%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCG--ARLSVVVAESWRASALAEMIQEA 58
+ Q+S F+AE+ P + DL+G+L GPG+I G G AR V + R ALA +
Sbjct 1 MPQMSLFSAEARPAGLTDLAGLLCGPGRIERFGAGDTARFDVPLPFEGRERALAALAAAR 60
Query 59 GLVPEVARTDENTPLVRTAVDPLLCGIAAEW-TRGAVKTVPPRW-LPGPRELRAWTLAAG 116
G+ + +R+A L +A W T K VPP + L G A
Sbjct 61 GVTLAPGASG-----MRSAFRRDLVPLARTWCTPDGRKQVPPDFQLDGAALRLWALAAGV 115
Query 117 SPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLI--GTRGT-RPALRISGRRRLSRLV 173
+LL LDPHAP TH PL +A R G+ P + G G PALR+ G RR++RLV
Sbjct 116 GDLRGGHLLLLDPHAPWTHGPLIAAATRAGLPPARLATGEHGAPGPALRLHGTRRMARLV 175
Query 174 ENVGEPPDGAEAWVQWPR 191
E VG P +WPR
Sbjct 176 ELVGPAPS-TLGTSEWPR 192
>gi|302864953|ref|YP_003833590.1| hypothetical protein Micau_0447 [Micromonospora aurantiaca ATCC
27029]
gi|302567812|gb|ADL44014.1| hypothetical protein Micau_0447 [Micromonospora aurantiaca ATCC
27029]
Length=202
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/188 (39%), Positives = 98/188 (53%), Gaps = 2/188 (1%)
Query 3 QLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVP 62
Q S F+ E+ PA+ADL+G+LAGPG++ +G AR+SVVV +WR L + G+
Sbjct 13 QPSLFSTEAADPALADLAGLLAGPGEVGRMGGTARISVVVDAAWRVHVLVAELGARGVPA 72
Query 63 EVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADR 122
T++ VRTA +L +A W RG VK P R+ R LR W AAG+ E
Sbjct 73 SWEPTEDGRHRVRTAYTSMLAPLARAWLRGGVKRPPARFHLDGRRLRLWAAAAGTAEPAG 132
Query 123 YLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDG 182
+ L L P + + +AL VG+ + PA RI G RR+SRL E VGE P
Sbjct 133 FRLRLGPADEPSWPVVRAALAAVGLPAAFVEPDEGGPAFRIGG-RRMSRLAELVGERPAT 191
Query 183 AEAWVQWP 190
A WP
Sbjct 192 APV-ADWP 198
>gi|336460799|gb|EGO39684.1| hypothetical protein MAPs_36190 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=42
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/42 (77%), Positives = 34/42 (81%), Gaps = 4/42 (9%)
Query 1 VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGC----GARL 38
+SQLSFF AESVPPAVADLSGVLA GQIV+VG GARL
Sbjct 1 MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARL 42
>gi|300865169|ref|ZP_07109993.1| hypothetical protein OSCI_1490029 [Oscillatoria sp. PCC 6506]
gi|300336859|emb|CBN55143.1| hypothetical protein OSCI_1490029 [Oscillatoria sp. PCC 6506]
Length=313
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 21/66 (32%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query 125 LGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPAL--RISG-RRRLSRLVENVGEPPD 181
+ LDP+ D H+ L S + P I + L + +G R L+RL E +G+P +
Sbjct 68 IALDPNLADVHANLGSLYANLEQWPEAIASYQQALTLQPKFAGVYRNLARLFEQIGKPEE 127
Query 182 GAEAWV 187
GA+ W
Sbjct 128 GADFWY 133
>gi|156937060|ref|YP_001434856.1| hypothetical protein Igni_0265 [Ignicoccus hospitalis KIN4/I]
gi|156566044|gb|ABU81449.1| protein of unknown function DUF885 [Ignicoccus hospitalis KIN4/I]
Length=482
Score = 38.1 bits (87), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 23/69 (34%), Positives = 38/69 (56%), Gaps = 10/69 (14%)
Query 119 EADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPA---LRISGRRRLSRLVEN 175
E D++L GL+ A + + P+ +L TL+ RG + + L GR++ +VE
Sbjct 173 EYDKWLDGLE--ADEGYQPMGESLF-----STLLRVRGIKASAEELEALGRKKAKEIVEE 225
Query 176 VGEPPDGAE 184
+GEPP+G E
Sbjct 226 LGEPPEGKE 234
>gi|87121122|ref|ZP_01077013.1| transcriptional regulatory protein [Marinomonas sp. MED121]
gi|86163614|gb|EAQ64888.1| transcriptional regulatory protein [Marinomonas sp. MED121]
Length=283
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/130 (25%), Positives = 66/130 (51%), Gaps = 8/130 (6%)
Query 38 LSVVVAESWRASALAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTV 97
+ V ++E++RA + I + ++ ++ T E PL R D L+ + ++ G K+V
Sbjct 122 IRVGLSETFRAQVTSGEI-DLAVLAQIPPTGEGQPLYR---DKLVWLASEDFHLGTHKSV 177
Query 98 PPRWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGT 157
P +P P R +AA + + L L+ H+ H + SA++ G+A T++ +
Sbjct 178 PLALVPSPCLYRKTAIAALDKQNMPWQLALNCHS---HEAIKSAVIS-GLAVTVLTEKDL 233
Query 158 RPALRISGRR 167
RP +++ ++
Sbjct 234 RPGMKVLTQK 243
>gi|302916299|ref|XP_003051960.1| hypothetical protein NECHADRAFT_5957 [Nectria haematococca mpVI
77-13-4]
gi|256732899|gb|EEU46247.1| hypothetical protein NECHADRAFT_5957 [Nectria haematococca mpVI
77-13-4]
Length=262
Score = 35.8 bits (81), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/59 (38%), Positives = 31/59 (53%), Gaps = 5/59 (8%)
Query 130 HAPD-THSPLASALMRVGIAPTLIGTRGTRP----ALRISGRRRLSRLVENVGEPPDGA 183
H P+ S +A+A + +G++PT+I T G RP L + GRR L VG P A
Sbjct 8 HTPEFIKSKMAAAAIVLGLSPTIIATLGVRPQETAVLSVVGRRHLLAFALAVGSPALNA 66
>gi|153010755|ref|YP_001371969.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC
49188]
gi|151562643|gb|ABS16140.1| glycosyl transferase family 2 [Ochrobactrum anthropi ATCC 49188]
Length=753
Score = 34.3 bits (77), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 35/119 (30%), Positives = 54/119 (46%), Gaps = 31/119 (26%)
Query 73 LVRTAVDPLLCGIAAEW--TRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYLLGLDPH 130
L+ TAVDP+L I+A + T G+V +PPR++ G + AW + +LL
Sbjct 171 LLETAVDPMLGEISAIYGITPGSVTVIPPRFVVGRQARNAW-------QPCHFLL----- 218
Query 131 APDTHSPLASALMRVGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAEAWVQW 189
+ L S+L+ V G +G A+R +L GE D A+ W +W
Sbjct 219 --EAAQDLPSSLLLV-----FTGQKGV--AVR--------KLAATTGEQTDFAKWWTKW 260
Lambda K H
0.318 0.134 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 192573564720
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40