BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1502
Length=299
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608640|ref|NP_216018.1| hypothetical protein Rv1502 [Mycoba... 608 2e-172
gi|15840966|ref|NP_336003.1| hypothetical protein MT1551 [Mycoba... 605 2e-171
gi|339294477|gb|AEJ46588.1| hypothetical protein CCDC5079_1398 [... 603 7e-171
gi|294993249|ref|ZP_06798940.1| hypothetical protein Mtub2_01757... 602 3e-170
gi|183982331|ref|YP_001850622.1| hypothetical protein MMAR_2318 ... 452 3e-125
gi|31792700|ref|NP_855193.1| hypothetical protein Mb1541 [Mycoba... 388 6e-106
gi|289442953|ref|ZP_06432697.1| LOW QUALITY PROTEIN: conserved h... 381 8e-104
gi|149177703|ref|ZP_01856304.1| hypothetical protein PM8797T_276... 300 2e-79
gi|337754878|ref|YP_004647389.1| hypothetical protein F7308_0862... 271 7e-71
gi|119898969|ref|YP_934182.1| hypothetical protein azo2678 [Azoa... 253 3e-65
gi|154253754|ref|YP_001414578.1| hypothetical protein Plav_3317 ... 243 4e-62
gi|330808321|ref|YP_004352783.1| hypothetical protein PSEBR_a157... 233 2e-59
gi|289569531|ref|ZP_06449758.1| hypothetical protein TBJG_03720 ... 229 3e-58
gi|116250569|ref|YP_766407.1| hypothetical protein RL0798 [Rhizo... 220 2e-55
gi|167582755|ref|ZP_02375629.1| hypothetical protein BthaT_31721... 215 6e-54
gi|31792699|ref|NP_855192.1| hypothetical protein Mb1540 [Mycoba... 215 7e-54
gi|83720742|ref|YP_443709.1| hypothetical protein BTH_I3215 [Bur... 215 7e-54
gi|291613078|ref|YP_003523235.1| hypothetical protein Slit_0608 ... 210 2e-52
gi|150017451|ref|YP_001309705.1| hypothetical protein Cbei_2593 ... 203 3e-50
gi|323140032|ref|ZP_08075042.1| hypothetical protein Met49242DRA... 199 5e-49
gi|152994861|ref|YP_001339696.1| hypothetical protein Mmwyl1_082... 198 8e-49
gi|295148979|gb|ADF80978.1| hypothetical protein [Vibrio cholerae] 196 3e-48
gi|86147247|ref|ZP_01065562.1| hypothetical protein MED222_17818... 196 4e-48
gi|344923648|ref|ZP_08777109.1| hypothetical protein COdytL_0323... 188 8e-46
gi|146298083|ref|YP_001192674.1| hypothetical protein Fjoh_0319 ... 182 4e-44
gi|170722913|ref|YP_001750601.1| hypothetical protein PputW619_3... 177 1e-42
gi|124010089|ref|ZP_01694749.1| conserved hypothetical protein [... 176 6e-42
gi|296161428|ref|ZP_06844234.1| conserved hypothetical protein [... 174 1e-41
gi|237742770|ref|ZP_04573251.1| conserved hypothetical protein [... 170 3e-40
gi|242400024|ref|YP_002995449.1| hypothetical protein TSIB_2053 ... 166 3e-39
gi|284040023|ref|YP_003389953.1| hypothetical protein Slin_5182 ... 166 5e-39
gi|336315407|ref|ZP_08570318.1| hypothetical protein Rhein_1693 ... 160 2e-37
gi|227112550|ref|ZP_03826206.1| hypothetical protein PcarbP_0627... 159 5e-37
gi|50120668|ref|YP_049835.1| hypothetical protein ECA1735 [Pecto... 157 2e-36
gi|253688943|ref|YP_003018133.1| hypothetical protein PC1_2566 [... 157 2e-36
gi|149915503|ref|ZP_01904030.1| hypothetical protein RAZWK3B_057... 155 1e-35
gi|167627484|ref|YP_001677984.1| hypothetical protein Fphi_1258 ... 153 3e-35
gi|149925690|ref|ZP_01913954.1| hypothetical protein LMED105_056... 152 4e-35
gi|296101726|ref|YP_003611872.1| hypothetical protein ECL_01362 ... 152 9e-35
gi|119504992|ref|ZP_01627069.1| hypothetical protein MGP2080_052... 149 4e-34
gi|336315406|ref|ZP_08570317.1| Putative glycosylase [Rheinheime... 146 3e-33
gi|83951253|ref|ZP_00959986.1| hypothetical protein ISM_09125 [R... 146 4e-33
gi|254373349|ref|ZP_04988837.1| predicted protein [Francisella t... 144 2e-32
gi|336322666|ref|YP_004602633.1| hypothetical protein Flexsi_037... 143 3e-32
gi|299134512|ref|ZP_07027705.1| conserved hypothetical protein [... 142 5e-32
gi|253688942|ref|YP_003018132.1| hypothetical protein PC1_2565 [... 142 9e-32
gi|229916907|ref|YP_002885553.1| hypothetical protein EAT1b_1180... 141 1e-31
gi|119504993|ref|ZP_01627070.1| hypothetical protein MGP2080_052... 141 1e-31
gi|186477054|ref|YP_001858524.1| hypothetical protein Bphy_2303 ... 139 6e-31
gi|227329195|ref|ZP_03833219.1| hypothetical protein PcarcW_1840... 138 1e-30
>gi|15608640|ref|NP_216018.1| hypothetical protein Rv1502 [Mycobacterium tuberculosis H37Rv]
gi|148661297|ref|YP_001282820.1| hypothetical protein MRA_1513 [Mycobacterium tuberculosis H37Ra]
gi|167969312|ref|ZP_02551589.1| hypothetical protein MtubH3_15310 [Mycobacterium tuberculosis
H37Ra]
11 more sequence titles
Length=299
Score = 608 bits (1569), Expect = 2e-172, Method: Compositional matrix adjust.
Identities = 299/299 (100%), Positives = 299/299 (100%), Gaps = 0/299 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI
Sbjct 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
Query 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE 240
RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE
Sbjct 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE 240
Query 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN
Sbjct 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
>gi|15840966|ref|NP_336003.1| hypothetical protein MT1551 [Mycobacterium tuberculosis CDC1551]
gi|121637435|ref|YP_977658.1| hypothetical protein BCG_1566 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|148822724|ref|YP_001287478.1| hypothetical protein TBFG_11533 [Mycobacterium tuberculosis F11]
57 more sequence titles
Length=299
Score = 605 bits (1560), Expect = 2e-171, Method: Compositional matrix adjust.
Identities = 298/299 (99%), Positives = 298/299 (99%), Gaps = 0/299 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI
Sbjct 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
Query 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE 240
RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRP VVRDAGVYRMWFCARGAKYRIYCATSE
Sbjct 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPCVVRDAGVYRMWFCARGAKYRIYCATSE 240
Query 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN
Sbjct 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
>gi|339294477|gb|AEJ46588.1| hypothetical protein CCDC5079_1398 [Mycobacterium tuberculosis
CCDC5079]
Length=299
Score = 603 bits (1556), Expect = 7e-171, Method: Compositional matrix adjust.
Identities = 297/299 (99%), Positives = 298/299 (99%), Gaps = 0/299 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI
Sbjct 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
Query 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE 240
RYAQSRDGVHWEKQDRVHIDT+GSDNSAACRP VVRDAGVYRMWFCARGAKYRIYCATSE
Sbjct 181 RYAQSRDGVHWEKQDRVHIDTNGSDNSAACRPCVVRDAGVYRMWFCARGAKYRIYCATSE 240
Query 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN
Sbjct 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
>gi|294993249|ref|ZP_06798940.1| hypothetical protein Mtub2_01757 [Mycobacterium tuberculosis
210]
Length=299
Score = 602 bits (1552), Expect = 3e-170, Method: Compositional matrix adjust.
Identities = 297/299 (99%), Positives = 297/299 (99%), Gaps = 0/299 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI
Sbjct 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVI 180
Query 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE 240
RYAQSRDGVHWEKQD VHIDTSGSDNSAACRP VVRDAGVYRMWFCARGAKYRIYCATSE
Sbjct 181 RYAQSRDGVHWEKQDCVHIDTSGSDNSAACRPCVVRDAGVYRMWFCARGAKYRIYCATSE 240
Query 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN
Sbjct 241 DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN 299
>gi|183982331|ref|YP_001850622.1| hypothetical protein MMAR_2318 [Mycobacterium marinum M]
gi|183175657|gb|ACC40767.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=302
Score = 452 bits (1163), Expect = 3e-125, Method: Compositional matrix adjust.
Identities = 215/302 (72%), Positives = 252/302 (84%), Gaps = 3/302 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M WRKLGRIF PSGELDW+R+HA+ PV EW++GDIFRIYFS RD QNRSSIGSV+VDLA
Sbjct 1 MPWRKLGRIFVPSGELDWARTHASQPVAEWVDGDIFRIYFSTRDDQNRSSIGSVVVDLAA 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GGK+L+I EP+L PGA GMFDDCGVS+GSIV GDTR LYY GWNLAVTVPWKN IG+A
Sbjct 61 GGKVLEISPEPVLGPGALGMFDDCGVSMGSIVPVGDTRFLYYMGWNLAVTVPWKNAIGLA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGT---DEIP 177
IS+AG PF+RWSTFPVV LDE DP+S+SYPWVI+D YRMWYGSN+ W + T D +P
Sbjct 121 ISQAGGPFKRWSTFPVVPLDEGDPYSISYPWVIRDDDKYRMWYGSNVRWEQKTKNMDGLP 180
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCA 237
HVI+ A+S D +HWEKQ+ V IDT+G D+ AA RP VVRD G+YRMW+CARGA+Y IY A
Sbjct 181 HVIKSAESIDAIHWEKQELVAIDTAGCDDIAAARPCVVRDPGLYRMWYCARGAQYSIYHA 240
Query 238 TSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVL 297
SEDG+ W QLGKD GID SP WD++ + YPCVFDH+GQRF++YSGDGYGRTGFGLAVL
Sbjct 241 VSEDGVIWTQLGKDNGIDASPGEWDANSVGYPCVFDHKGQRFLIYSGDGYGRTGFGLAVL 300
Query 298 EN 299
++
Sbjct 301 DD 302
>gi|31792700|ref|NP_855193.1| hypothetical protein Mb1541 [Mycobacterium bovis AF2122/97]
gi|31618290|emb|CAD96208.1| HYPOTHETICAL PROTEIN [SECOND PART] [Mycobacterium bovis AF2122/97]
Length=189
Score = 388 bits (996), Expect = 6e-106, Method: Compositional matrix adjust.
Identities = 187/189 (99%), Positives = 188/189 (99%), Gaps = 0/189 (0%)
Query 111 VPWKNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG 170
+PWKNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG
Sbjct 1 MPWKNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG 60
Query 171 EGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA 230
EGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRP VVRDAGVYRMWFCARGA
Sbjct 61 EGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPCVVRDAGVYRMWFCARGA 120
Query 231 KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRT 290
KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRT
Sbjct 121 KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRT 180
Query 291 GFGLAVLEN 299
GFGLAVLEN
Sbjct 181 GFGLAVLEN 189
>gi|289442953|ref|ZP_06432697.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
gi|289750065|ref|ZP_06509443.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_02791 [Mycobacterium
tuberculosis T92]
gi|289415872|gb|EFD13112.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
gi|289690652|gb|EFD58081.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_02791 [Mycobacterium
tuberculosis T92]
Length=217
Score = 381 bits (978), Expect = 8e-104, Method: Compositional matrix adjust.
Identities = 184/186 (99%), Positives = 185/186 (99%), Gaps = 0/186 (0%)
Query 114 KNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGT 173
+NTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGT
Sbjct 32 ENTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGT 91
Query 174 DEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYR 233
DEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRP VVRDAGVYRMWFCARGAKYR
Sbjct 92 DEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPCVVRDAGVYRMWFCARGAKYR 151
Query 234 IYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFG 293
IYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFG
Sbjct 152 IYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFG 211
Query 294 LAVLEN 299
LAVLEN
Sbjct 212 LAVLEN 217
>gi|149177703|ref|ZP_01856304.1| hypothetical protein PM8797T_27607 [Planctomyces maris DSM 8797]
gi|148843521|gb|EDL57883.1| hypothetical protein PM8797T_27607 [Planctomyces maris DSM 8797]
Length=302
Score = 300 bits (768), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 141/301 (47%), Positives = 199/301 (67%), Gaps = 2/301 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W+KLG++FAP DW SHAA PV + + ++R+Y S RD N+SSI + ++
Sbjct 1 MKWKKLGQVFAPDHHYDWMVSHAANPVADQLSDSLYRVYSSCRDKNNKSSIYHIDFNINQ 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
KIL+I PIL PG G FDD GV++ +V + LYY GWNL VTVPW+N+IG+A
Sbjct 61 PDKILNISKTPILSPGDPGYFDDSGVTVTGLVTVDKIKYLYYLGWNLGVTVPWRNSIGLA 120
Query 121 ISEA-GAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTD-EIPH 178
IS++ G F ++S P++ + DP S+SYPW++ + G ++MWYGSNL W + D
Sbjct 121 ISDSTGCIFTKYSPAPIIDRNSVDPLSISYPWILHENGIWKMWYGSNLEWDQDNDCAFKF 180
Query 179 VIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCAT 238
I+YA+S +G++W + + I +D A RP V+ D G+Y+MW+ RG YRI A
Sbjct 181 CIKYAESENGINWRRDGIIAITFKSADEYALARPCVINDNGIYKMWYSYRGISYRIGYAE 240
Query 239 SEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLE 298
S+DG+ W +L ++ GIDVS WDS+MIEYP VFDH+ R+MLY+G+ YG+TGFGLAVLE
Sbjct 241 SDDGINWTRLDEEVGIDVSKTGWDSEMIEYPHVFDHKSNRYMLYNGNAYGKTGFGLAVLE 300
Query 299 N 299
+
Sbjct 301 S 301
>gi|337754878|ref|YP_004647389.1| hypothetical protein F7308_0862 [Francisella sp. TX077308]
gi|336446483|gb|AEI35789.1| hypothetical protein F7308_0862 [Francisella sp. TX077308]
Length=308
Score = 271 bits (694), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 131/304 (44%), Positives = 197/304 (65%), Gaps = 7/304 (2%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVGG 62
W+K+G+IF P DW SHA++P E I+ D+F+IYFS R+ QN SSIG V++++
Sbjct 4 WKKIGKIFEPYNNYDWMISHASVPFAENIQNDLFKIYFSCRNKQNESSIGYVVININKPN 63
Query 63 KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI- 121
+I+++ EP+L G G FDD GV I+ D + LYY GWNL VTVP++N+IG+A+
Sbjct 64 EIIEVSKEPVLERGELGAFDDSGVMGCCILNNQDNKYLYYIGWNLGVTVPFRNSIGLAVS 123
Query 122 SEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHV-- 179
S+AG F+R P++ +P ++ V++D G +++WY S W + ++I H
Sbjct 124 SDAGDTFKRMFNGPIIDRSRDEPHFVASNCVLKDEGIFKIWYLSCTEWIKIDEKIMHKYH 183
Query 180 IRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK----YRIY 235
I+YA+S+DG++W+++ + ID A P V+++ G+Y+MWF +RG K YRI
Sbjct 184 IKYAESKDGINWDREGTIAIDYKDEYEYAISVPRVIKEDGIYKMWFSSRGTKDIPTYRIK 243
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLA 295
A S+DG+ W + +D DVS WDSDM+ YP +FDH +R+MLY+G+ YG+TGFGLA
Sbjct 244 YAESKDGINWIRKDEDVCFDVSEREWDSDMLCYPFIFDHNNKRYMLYNGNDYGKTGFGLA 303
Query 296 VLEN 299
VLEN
Sbjct 304 VLEN 307
>gi|119898969|ref|YP_934182.1| hypothetical protein azo2678 [Azoarcus sp. BH72]
gi|119671382|emb|CAL95295.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length=306
Score = 253 bits (645), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 135/303 (45%), Positives = 184/303 (61%), Gaps = 7/303 (2%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVGG 62
W+KLG+IF + DW SHA +P+ + +EGD++RIYFS RD +NR G + VD+
Sbjct 2 WKKLGKIFCAEQQSDWLYSHAMIPIADQVEGDLYRIYFSSRDKRNRGHGGFLEVDMLNPT 61
Query 63 KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI- 121
K+L + +P+L PG G FDD G SIV G +L+YYTG NL VTV +N+IG+A
Sbjct 62 KVLRVHPDPVLEPGDLGCFDDSGALPNSIVNVGGRKLMYYTGINLGVTVKIRNSIGLAEW 121
Query 122 SEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHV-- 179
+E+ F + PV+ P ++ P V + G +R W+ S + W + E H
Sbjct 122 NESAQCFHKLFRGPVIDRTRDLPHFVATPEVQYEAGRFRAWFTSCVRWEQDPSEAKHFYH 181
Query 180 IRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK----YRIY 235
+ YA+S DGV WE+ V I+ A P V++DA +YRMWFC+R K YRI
Sbjct 182 LEYAESVDGVEWERDGTVAIEFRDHHEYALGVPRVLKDADMYRMWFCSRATKDCPTYRIR 241
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLA 295
ATS DG+ W + + GIDVS WDS+MI YP VFDH G+RFMLY+G+GYG+TGFG+A
Sbjct 242 YATSSDGVKWTRHDEQVGIDVSKSGWDSEMICYPFVFDHAGRRFMLYNGNGYGKTGFGIA 301
Query 296 VLE 298
V E
Sbjct 302 VWE 304
>gi|154253754|ref|YP_001414578.1| hypothetical protein Plav_3317 [Parvibaculum lavamentivorans
DS-1]
gi|154157704|gb|ABS64921.1| conserved hypothetical protein [Parvibaculum lavamentivorans
DS-1]
Length=313
Score = 243 bits (619), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 133/305 (44%), Positives = 180/305 (60%), Gaps = 8/305 (2%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDL-AVG 61
WR+LGR+ AP W +SHA+ P + +YFS RD +RSS+ SV + L G
Sbjct 8 WRRLGRVIAPEASAPWWQSHASYPTALVRSDGLIDVYFSVRDATSRSSLASVTLSLDGEG 67
Query 62 GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI 121
+ P P+L PG RG FD GVS+G ++ + + YY GW++ V+VP+ N IG+A
Sbjct 68 FQRESAPKGPLLGPGMRGAFDADGVSVGCVIEKDNELIAYYLGWSVGVSVPFSNFIGIAT 127
Query 122 S--EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHV 179
+ A F R PV+ DPF+L YPWV++ G YRMWYGS+L WGE E+ HV
Sbjct 128 APRTGDAVFRRREIVPVIGRSAVDPFTLGYPWVMRSGSEYRMWYGSHLAWGEVGLEMKHV 187
Query 180 IRYAQSRDGVHWEKQDRVHIDTSGSDNS---AACRPYVVRDA-GVYRMWFCARGAKYRIY 235
I+ A+S DG W +V I G+++ A RP VV +A G++ MW+ R Y +
Sbjct 188 IKEAKSSDGFSWSAIGKVAIPLKGAEDPQEFAVSRPSVVAEADGIWSMWYARRRPGYELG 247
Query 236 CATSED-GLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGL 294
A S+D G TW++ + SPD WD YPCVFDH G+R+MLY+G+GYGRTGFGL
Sbjct 248 FAISDDEGATWQRQDERIAWTGSPDDWDDREQTYPCVFDHHGRRYMLYNGNGYGRTGFGL 307
Query 295 AVLEN 299
AVLE
Sbjct 308 AVLET 312
>gi|330808321|ref|YP_004352783.1| hypothetical protein PSEBR_a1578 [Pseudomonas brassicacearum
subsp. brassicacearum NFM421]
gi|327376429|gb|AEA67779.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
Length=307
Score = 233 bits (595), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/304 (44%), Positives = 174/304 (58%), Gaps = 9/304 (2%)
Query 2 AWRKLGRIFAPSGELDWSR--SHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLA 59
W+KLGR++ P E + SHAA P+P + D+FR++FS RD NRSS+G+V +D+
Sbjct 4 TWQKLGRLYTPENEKRHPKLLSHAANPLPVHLHNDVFRVFFSARDCDNRSSVGAVDIDIE 63
Query 60 VGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGV 119
I + P P L G G F GVSIG+ A + + + + GW W+ +G
Sbjct 64 QRIVIKEHPL-PFLEHGPAGSFHADGVSIGNCYIANEVQYMLFMGWQSPDNQHWRGDVGR 122
Query 120 AISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGT-YRMWYGSNLGWGEGTDEIPH 178
I A S P ++ DE DP SLSYPWV+++G Y MWYGS W G E+ H
Sbjct 123 LIVNADTTLTLESDLPFMSTDEIDPISLSYPWVLKNGNNGYDMWYGSTKTWDSGNGEMIH 182
Query 179 VIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDA-GVYRMWFCAR---GAKYRI 234
VI A S DG +W + + + I A RP V ++ G MWF R G YRI
Sbjct 183 VINSAHSHDGNNWHR-NGLAIPFEVGVAQAFSRPTVAKNNLGGLEMWFSYRSGTGDTYRI 241
Query 235 YCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGL 294
AT++ G WR ++ GIDVSPD WDS+MIEYP VFDH+ R+MLY+G+ YG+TGFGL
Sbjct 242 GYATTDGGTQWRLALEEAGIDVSPDGWDSEMIEYPFVFDHKHNRYMLYNGNSYGKTGFGL 301
Query 295 AVLE 298
AVLE
Sbjct 302 AVLE 305
>gi|289569531|ref|ZP_06449758.1| hypothetical protein TBJG_03720 [Mycobacterium tuberculosis T17]
gi|289543285|gb|EFD46933.1| hypothetical protein TBJG_03720 [Mycobacterium tuberculosis T17]
Length=116
Score = 229 bits (585), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 114/115 (99%), Positives = 114/115 (99%), Gaps = 0/115 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKN 115
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWK
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKT 115
>gi|116250569|ref|YP_766407.1| hypothetical protein RL0798 [Rhizobium leguminosarum bv. viciae
3841]
gi|115255217|emb|CAK06292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length=313
Score = 220 bits (561), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/303 (44%), Positives = 171/303 (57%), Gaps = 7/303 (2%)
Query 2 AWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVG 61
W K +F PSG RSHAA P+ +EGD +R++FSGRD +NRSS+G+V ++L +
Sbjct 4 TWVKTDLLFKPSGLHPKLRSHAANPLALHLEGDTYRVFFSGRDSENRSSVGAVDINL-LT 62
Query 62 GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI 121
+++ +P L GA G F + G+SIG+ AG R + + GW W+ IG
Sbjct 63 REVVHEHKQPFLVHGAGGSFFEAGISIGNCYYAGVQRYMLFMGWQRPPGGHWRGDIGRIK 122
Query 122 SEAGAPFERWSTFPVVALDERDPFSLSYPWVIQ-DGGTYRMWYGSNLGWGEGTDEIPHVI 180
E + +A DE D SLSYPWV + D +RMWYGS + W G E+ HVI
Sbjct 123 VRPDLTLELDADVAFMASDEEDSVSLSYPWVEKTDSEKFRMWYGSTVTWDAGNGEMLHVI 182
Query 181 RYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGV-YRMWFCAR---GAKYRIYC 236
+ A SRDG W K+ V I A RP V+ DA YRMWF R G YRI
Sbjct 183 KSASSRDGHIWHKEG-VAIPYEIGRAQAFSRPTVLIDAACGYRMWFSYRSGQGEAYRIGY 241
Query 237 ATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAV 296
+ S DG+ W GIDVS + WDS MIEYP VF H +MLY+GDGYG+TGFGLAV
Sbjct 242 SESRDGIAWILKLDQVGIDVSENGWDSAMIEYPYVFRHEDNTYMLYNGDGYGKTGFGLAV 301
Query 297 LEN 299
L++
Sbjct 302 LDD 304
>gi|167582755|ref|ZP_02375629.1| hypothetical protein BthaT_31721 [Burkholderia thailandensis
TXDOH]
Length=319
Score = 215 bits (548), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 115/307 (38%), Positives = 164/307 (54%), Gaps = 12/307 (3%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
+ W K G I+ L W+ SHA +P ++GD R+ FS RD NRS I + V
Sbjct 4 IHWEKRGLIYTVDARLPWATSHAQIPTAAGLKGDALRLLFSSRDADNRSGIARLDVRAGD 63
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++LD+ A+P+L PGA G FDDCG S+V LYY GWN+ T+P+ N +G+A
Sbjct 64 PSQVLDVKADPVLPPGALGAFDDCGTMPSSVVERDGVHYLYYIGWNVRNTIPYHNAVGLA 123
Query 121 ISE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--EGTDEIP 177
ISE G + R PV+ +P+ V + G +R WY + GW G E
Sbjct 124 ISEDGGETYRRLFEGPVMDRTAEEPYFCGTTCVRIENGIWRNWYLACTGWSIVAGKPEPR 183
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCAR--------- 228
+ ++YA+SRDG+HWE+ R+ ID D R V D YRMWFC R
Sbjct 184 YHLKYAESRDGIHWERTGRIAIDYLSDDEGGLARASVHHDGSRYRMWFCKRSHIAYRENS 243
Query 229 GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYG 288
YR+ A S DG+ W ++ ++ +DVS WD+ M+ YP V + G+ ++ Y+G+G+G
Sbjct 244 SVSYRMGYAESADGIVWDRMDEEAALDVSETGWDAFMVAYPEVVEIGGRLYLFYNGNGFG 303
Query 289 RTGFGLA 295
TGFG A
Sbjct 304 ATGFGYA 310
>gi|31792699|ref|NP_855192.1| hypothetical protein Mb1540 [Mycobacterium bovis AF2122/97]
gi|31618289|emb|CAD96207.1| HYPOTHETICAL PROTEIN [FIRST PART] [Mycobacterium bovis AF2122/97]
Length=116
Score = 215 bits (547), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 108/115 (94%), Positives = 108/115 (94%), Gaps = 0/115 (0%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV
Sbjct 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKN 115
GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWN P K
Sbjct 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNSLSPCPGKT 115
>gi|83720742|ref|YP_443709.1| hypothetical protein BTH_I3215 [Burkholderia thailandensis E264]
gi|167620870|ref|ZP_02389501.1| hypothetical protein BthaB_31486 [Burkholderia thailandensis
Bt4]
gi|83654567|gb|ABC38630.1| conserved hypothetical protein [Burkholderia thailandensis E264]
Length=319
Score = 215 bits (547), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 114/307 (38%), Positives = 164/307 (54%), Gaps = 12/307 (3%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
+ W K G ++ L W+ SHA +P ++GD R+ FS RD NRS I + V
Sbjct 4 IHWEKRGLVYTVDARLPWATSHAQIPTAAGVKGDALRLLFSSRDADNRSGIARLDVRAGD 63
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++LD+ A+P+L PGA G FDDCG S+V LYY GWN+ T+P+ N +G+A
Sbjct 64 PSQVLDVKADPVLPPGALGAFDDCGTMPSSVVERDGVHYLYYIGWNVRNTIPYHNAVGLA 123
Query 121 ISE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--EGTDEIP 177
ISE G + R PV+ +P+ V + G +R WY + GW G E
Sbjct 124 ISEDGGETYRRLFEGPVMDRTAEEPYFCGTTCVRIENGIWRNWYLACTGWSIVAGKPEPR 183
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCAR--------- 228
+ ++YA+SRDG+HWE+ R+ ID D R V D YRMWFC R
Sbjct 184 YHLKYAESRDGIHWERTGRIAIDYLSDDEGGLARASVHHDGSRYRMWFCKRSHTAYRENS 243
Query 229 GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYG 288
YR+ A S DG+ W ++ ++ +DVS WD+ M+ YP V + G+ ++ Y+G+G+G
Sbjct 244 SVSYRMGYAESADGIVWDRMDEEAALDVSETGWDAFMVAYPEVVEIGGRLYLFYNGNGFG 303
Query 289 RTGFGLA 295
TGFG A
Sbjct 304 ATGFGYA 310
>gi|291613078|ref|YP_003523235.1| hypothetical protein Slit_0608 [Sideroxydans lithotrophicus ES-1]
gi|291583190|gb|ADE10848.1| conserved hypothetical protein [Sideroxydans lithotrophicus ES-1]
Length=315
Score = 210 bits (535), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/309 (40%), Positives = 170/309 (56%), Gaps = 12/309 (3%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W K G I++ L W+R+HA +P + ++ + R+ FS RD NRS I + VD
Sbjct 4 MCWNKRGLIYSVDERLPWARTHAQIPTVDVLDDERLRVLFSSRDETNRSLIARMDVDARN 63
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
IL I AEPIL G G FDDCG+ +IV G + LYY GWN+ TVP+ N++G+A
Sbjct 64 PSTILAIQAEPILPLGRPGTFDDCGMMPSAIVDRGGQKYLYYIGWNVRNTVPYHNSVGLA 123
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--EGTDEIP 177
+S + G + R PV+ +P+ + + + G +R WY S GW EG E
Sbjct 124 VSDDGGETYRRMFEGPVMDRTAEEPYFCATTCIRIENGIWRNWYLSCTGWEMVEGRMEPR 183
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWF---------CAR 228
+ ++YA+S DG+HW ++ RV ID + R V +D +YRMW+ AR
Sbjct 184 YHLKYAESHDGIHWRREGRVAIDYASPAEGGIVRASVRKDGLLYRMWYSYRSHADYRSAR 243
Query 229 GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYG 288
YRI A S DGL W +L GI S + WDS M+ YP V D +R+M Y+G+G+G
Sbjct 244 ANSYRIGYAESGDGLVWTRLDDMAGIVPSAEGWDSFMLAYPEVVDVGSRRYMFYNGNGFG 303
Query 289 RTGFGLAVL 297
+TGFG A L
Sbjct 304 QTGFGYAEL 312
>gi|150017451|ref|YP_001309705.1| hypothetical protein Cbei_2593 [Clostridium beijerinckii NCIMB
8052]
gi|149903916|gb|ABR34749.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB
8052]
Length=303
Score = 203 bits (516), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/302 (36%), Positives = 164/302 (55%), Gaps = 4/302 (1%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W K G IF+P GE +W +S+A +P + I D RIYF+ D + IG + VD+
Sbjct 1 MKWNKQGLIFSPKGEFEWMQSYALIPTADIISNDTIRIYFATLDREMYGRIGYIDVDMLN 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
I +I +P+L G G FDD GV+ I+ + + LYY GW VP+ G+A
Sbjct 61 LKNIKNISEKPVLDIGDIGTFDDSGVNPSCILTVDNKKYLYYYGWQRCERVPYMLFAGLA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEI--PH 178
SE G F + S PV+ + +P+ S +I + ++ WY S + W D+ +
Sbjct 121 TSEDGENFTKISKVPVLDRTKEEPYLRSATSIIVEDNIFKCWYVSAINWILVNDKSYPKY 180
Query 179 VIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK--YRIYC 236
VI+YA S DG+ W ++ I RP+VV++ +Y+MW+ R Y+I
Sbjct 181 VIKYAYSYDGIEWISENHTCISFKNEYEYGFGRPWVVKENDMYKMWYSIRSTNEPYKIGF 240
Query 237 ATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAV 296
ATS++GL W +L ++ GI+ S WDS+MI YP + + +M Y+G+ +G+TGFG A+
Sbjct 241 ATSKNGLDWTRLDEEAGIEKSESGWDSEMICYPNIVKFNSKTYMFYNGNRHGKTGFGYAI 300
Query 297 LE 298
LE
Sbjct 301 LE 302
>gi|323140032|ref|ZP_08075042.1| hypothetical protein Met49242DRAFT_4430 [Methylocystis sp. ATCC
49242]
gi|322394710|gb|EFX97301.1| hypothetical protein Met49242DRAFT_4430 [Methylocystis sp. ATCC
49242]
Length=305
Score = 199 bits (506), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 111/304 (37%), Positives = 169/304 (56%), Gaps = 5/304 (1%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M WRKLGRIFAP G W+RS+A +P E ++ D R+Y++ D + IG + +D
Sbjct 1 MKWRKLGRIFAPDGSRRWARSYAIIPTAELVDDDRLRVYYASIDEERNGRIGVLELDARN 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
IL +P+L G G FDD GV+ ++V++ ++YY GW VP+ GVA
Sbjct 61 PTHILHDRPDPVLDIGELGCFDDSGVNPSALVQSEVGAVMYYIGWQRCERVPYMLFAGVA 120
Query 121 ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQD-GGTYRMWYGSNLGWGE-GTDEIP- 177
F+R P++ E +PF S ++++ G+YR WY S WG G + P
Sbjct 121 RRGEDGVFQRLRRTPILDRTETEPFVRSATTILREPDGSYRCWYVSAHRWGYVGEKQYPE 180
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFC--ARGAKYRIY 235
++IR +S DG++W + + I+ S RP+V++D +Y+MW+ +R YR+
Sbjct 181 YIIRTTRSDDGLNWSRDSVIAINFSNPSEFGFGRPWVIKDGSLYKMWYSIRSRTEPYRLG 240
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLA 295
A SEDG++W + + S D WD +MI YPCV D G R++ Y+G+ +G TGFG+A
Sbjct 241 YAESEDGVSWARQDHRMQLMRSEDGWDQEMICYPCVIDASGGRYLFYNGNSHGATGFGVA 300
Query 296 VLEN 299
VLE
Sbjct 301 VLEK 304
>gi|152994861|ref|YP_001339696.1| hypothetical protein Mmwyl1_0829 [Marinomonas sp. MWYL1]
gi|150835785|gb|ABR69761.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
Length=304
Score = 198 bits (504), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 109/298 (37%), Positives = 167/298 (57%), Gaps = 7/298 (2%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVGG 62
W K G I+AP + +HA+ +P +I D++R++FSGR+ +N+SS+G D+ V
Sbjct 4 WSKQGLIYAPLKIDEMLSTHASNALPIFISDDVYRVFFSGRNSENKSSVGWFDFDI-VKQ 62
Query 63 KILDIPAEPILRPGARG-MFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI 121
+IL I E L + + G+S+G + G +Y+ W + W+ +G
Sbjct 63 EILYICDETFLSCSEKSRKYYSHGISLGCYLHDGVDIYVYFMAWQIEGNNHWRGDVGRFC 122
Query 122 SEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHVIR 181
+ + P + DE DP SLSYP++++D G +RMWYGS + W E+ HVI+
Sbjct 123 LDQSKKLKYVDDTPYMISDEEDPVSLSYPFILKDDGLFRMWYGSTISWDSPNGEMVHVIK 182
Query 182 YAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCAR---GAKYRIYCAT 238
YA S+DGV+W+K + I A RP V++ AG+Y MWF R G+ YRI A
Sbjct 183 YATSKDGVNWDKHG-IAIPFELGVAQAFSRPCVIKRAGIYHMWFSYRSGDGSTYRIGYAK 241
Query 239 SEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAV 296
S D + W + D G+ S D WDS+M+ YP +F H+ + +MLY+G+ +G+TG GLAV
Sbjct 242 SIDAINW-DVDFDSGVAPSKDGWDSEMVCYPYIFSHKEKVYMLYNGNAHGKTGIGLAV 298
>gi|295148979|gb|ADF80978.1| hypothetical protein [Vibrio cholerae]
Length=309
Score = 196 bits (499), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/306 (38%), Positives = 162/306 (53%), Gaps = 8/306 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDI-FRIYFSGRDGQNRSSIGSVIVDLA 59
M W+K+G ++ P + W++ +A LPVPE+IE + RIYF D +N I + VD
Sbjct 1 MKWKKMGLVYRPRRKQPWNQKYAILPVPEFIENENRIRIYFGSTDNENFGRISYIEVDAD 60
Query 60 VGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGV 119
KIL +P+L G G FDDCGV +V+ + LLY G+ V VP+ G+
Sbjct 61 EPTKILYEHQKPVLDLGREGTFDDCGVVPSCLVQKEECSLLYTVGFQRCVKVPYMLFAGL 120
Query 120 AISEAGAP--FERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMW--YGSNLGWGEGTDE 175
A+ E P +R+S P++ P S PWV+ + G YRMW YG+ EG
Sbjct 121 AMFEKNEPATMKRYSEAPILERTPERPISQGAPWVLYENGKYRMWHWYGTKWIEVEGKPF 180
Query 176 IPHVIRYAQSRDGVHWEKQDRVHI-DTSGSDNSAACRPYVVRDAGVYRMWFCARGAK--Y 232
I + I YA+S DG W D V + A RP V + Y MW+ R K Y
Sbjct 181 IDYHIGYAESDDGYTWSMTDNVCLAPIKELGEFAVARPCVFKQGETYHMWYSVRLEKKMY 240
Query 233 RIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGF 292
RI ATS+DGL W + D G++VS D WDS+M+ YP V + +G+ M ++G+ G TGF
Sbjct 241 RIAYATSKDGLKWIRHTGDFGLEVSDDGWDSEMMCYPAVIEVKGRLLMFFNGNNNGETGF 300
Query 293 GLAVLE 298
G+A E
Sbjct 301 GVAEAE 306
>gi|86147247|ref|ZP_01065562.1| hypothetical protein MED222_17818 [Vibrio sp. MED222]
gi|85834962|gb|EAQ53105.1| hypothetical protein MED222_17818 [Vibrio sp. MED222]
Length=318
Score = 196 bits (498), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 114/308 (38%), Positives = 170/308 (56%), Gaps = 13/308 (4%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIE-GDIFRIYFSGRDGQNRSSIGSVIVDLAVG 61
W K+G I+ P+ + WS +HA PV ++IE +I R+YFS R+ S V ++
Sbjct 9 WEKVGLIYKPNNTIPWSVTHAQAPVADYIEDKNIIRVYFSTRNIDGLSLPTFVDLNADNP 68
Query 62 GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI 121
+I+ I P+L G+ G FDD GV +V GD R LYY GWN+ + + N++G+AI
Sbjct 69 LEIIHINESPLLDLGSLGTFDDRGVMPSWVVNRGDERWLYYIGWNVRDNISYHNSVGLAI 128
Query 122 -SEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--EGTDEIPH 178
S + F R+S P+ D ++P+ + V+ D G ++ WY S GW G E +
Sbjct 129 ASSSDDKFVRFSEGPLWDRDWKEPYFSASTCVLFDDGVWKNWYLSCTGWKVVNGKSEPRY 188
Query 179 VIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARG--------- 229
I+YA+S DG++W ++ +V ID + + + VV++ G YRMWF R
Sbjct 189 HIKYAESEDGINWVREGKVAIDYKNEEEAGIVKASVVKENGRYRMWFSYRNFTNYRTDPK 248
Query 230 AKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGR 289
A YRI A S+DG+ W + GID+S WDS+MI YP V + M Y+G+G+GR
Sbjct 249 ASYRIGYAESKDGIVWNRNDDLAGIDISASGWDSEMIAYPHVIKVKESYLMFYNGNGFGR 308
Query 290 TGFGLAVL 297
+GFG A L
Sbjct 309 SGFGYARL 316
>gi|344923648|ref|ZP_08777109.1| hypothetical protein COdytL_03235 [Candidatus Odyssella thessalonicensis
L13]
Length=308
Score = 188 bits (478), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/308 (35%), Positives = 168/308 (55%), Gaps = 15/308 (4%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W K G IF G+ + ++H +P+ E + D +RIYFS RD RS + V+
Sbjct 1 MGWVKKGLIFKAQGQYPFMQTHTQVPLVEVVNADRWRIYFSTRDNLGRSRPTYIEVNAHN 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++L EP+L G G FDDCGV +I+ G +L+YYTGWN+ TVP+ N+IG+A
Sbjct 61 PLEVLYCHPEPLLELGEIGTFDDCGVMATAIINQGSRKLMYYTGWNVRNTVPYHNSIGLA 120
Query 121 ISE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTD--EIP 177
ISE G F+R+S P++A ++P+ + V+++ +RMWY GW E
Sbjct 121 ISEDGGKTFQRFSQGPLLASTYKEPYFVGLATVLKE-DKWRMWYSCCTGWHNHRQKPEAI 179
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA------- 230
+ I YA+S +G+ W + +V +D + + V ++ +Y M FC R
Sbjct 180 YRIHYAESDNGIDWHRSGQVALDYFNKEGNGLSVSSVFKEDDLYHMVFCYRKPFDYHTNP 239
Query 231 --KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYG 288
Y+I A SEDGL W++ +++ + +S WDS M+ YP + + ++ Y+G+ +G
Sbjct 240 LNSYKIGYAISEDGLRWQR--QEDILSLSEQGWDSFMLAYPFMLPQKDAFYLFYNGNDFG 297
Query 289 RTGFGLAV 296
++G GLAV
Sbjct 298 KSGLGLAV 305
>gi|146298083|ref|YP_001192674.1| hypothetical protein Fjoh_0319 [Flavobacterium johnsoniae UW101]
gi|146152501|gb|ABQ03355.1| hypothetical protein Fjoh_0319 [Flavobacterium johnsoniae UW101]
Length=309
Score = 182 bits (463), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/311 (34%), Positives = 177/311 (57%), Gaps = 21/311 (6%)
Query 3 WRKLGRIFAPSG-ELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRS---SIGSVIVDL 58
W+K G +F S + D+ +SHA++P +E ++FRIYFS R+ +S I +++ +
Sbjct 2 WKKKGLLFNVSHYKNDFIKSHASIPFAYHVEENMFRIYFSSRNEAGKSFPFYINAIVNNG 61
Query 59 AVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIG 118
+ +++ PIL G G FDD G+ ++++ D L+YY GWN +TV ++ +IG
Sbjct 62 NI--EVISDVVGPILELGRLGTFDDSGIMPSCLIKSNDKLLMYYIGWNPQITVSYRLSIG 119
Query 119 VAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIP 177
+AIS + G F+++S P+ + +P+ + P++I + ++MWY S GW E + P
Sbjct 120 LAISYDNGLTFQKFSEGPICDRNISEPYFNTAPYIIIENNVWKMWYISCTGW-EIINNYP 178
Query 178 ---HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK--- 231
+ ++YA+S DG++WE++ + +D A RP V+++ Y M+F R
Sbjct 179 EPSYHVKYAESDDGINWERKGTISLDYD-EKAKALGRPCVLKEDNKYVMYFSYRNTSEYR 237
Query 232 ------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGD 285
Y++ A S DG+ W + +D GI +S WDS M+EY VF+H G +MLY+G+
Sbjct 238 TSSQDGYKLGLALSYDGVIWEKKYEDVGIALSNFGWDSQMMEYCHVFEHMGFTYMLYNGN 297
Query 286 GYGRTGFGLAV 296
+G+ GFG AV
Sbjct 298 DFGKEGFGYAV 308
>gi|170722913|ref|YP_001750601.1| hypothetical protein PputW619_3750 [Pseudomonas putida W619]
gi|169760916|gb|ACA74232.1| conserved hypothetical protein [Pseudomonas putida W619]
Length=307
Score = 177 bits (450), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/306 (37%), Positives = 169/306 (56%), Gaps = 16/306 (5%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
MAW+KLGR F P + DW SHA +P ++ D+ R++ S R+ +S +V +D
Sbjct 1 MAWKKLGRTFDP--DKDWVGSHAQVPTALVLD-DVIRVFISTRNSAGKSLCYAVDLDKQD 57
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++ EP L GA G FDD GV ++ LYY+GWN +TVP+ N +GVA
Sbjct 58 PRTVVARHREPCLGFGAPGTFDDEGVMPSYALKKDGRTYLYYSGWNQRLTVPYHNAMGVA 117
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGE--GTDEIP 177
+S + G FE+ P++ +P+ P V+ D G ++MWY S W E G E
Sbjct 118 VSDDDGLHFEKLFEGPIMDRTATEPYLAVTPTVLFDQGLWKMWYVSGTRWLEVDGKYEPL 177
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK------ 231
+VI+YA+S+DG + + ++ S + A RP V+++ G+++MW+C+R ++
Sbjct 178 YVIKYAESKDGFEFTRFAPQCLE-SRFETEAFSRPCVIKENGIFKMWYCSRASQDYRNGA 236
Query 232 --YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGR 289
YRI A S DG TW + D GI S + WDS M YP +F + +MLY+G+ +G
Sbjct 237 GSYRIRYAESPDGRTWTR-HDDAGIAPSAEGWDSLMTCYPFIFQSGERTYMLYNGNRFGT 295
Query 290 TGFGLA 295
+GFGLA
Sbjct 296 SGFGLA 301
>gi|124010089|ref|ZP_01694749.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123983857|gb|EAY24262.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length=298
Score = 176 bits (445), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 108/303 (36%), Positives = 169/303 (56%), Gaps = 19/303 (6%)
Query 3 WRKLGRIFAPSGELDWSRSH-AALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVG 61
W+KLG I+ +R H A+P+ +I I RI+FS RD N+S ++ DL
Sbjct 4 WQKLGLIY--------NRQHYQAVPLAHFIAPHIIRIFFSTRDLANQSLPCAIDYDLHQQ 55
Query 62 GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAI 121
+ + E L G GMFD G+ +++ G+ +YY GWN +VP++N IG+ I
Sbjct 56 KVVNEFKIEVPL--GNLGMFDQNGIMPTALLDQGNELWMYYIGWNTGGSVPFRNAIGLLI 113
Query 122 SE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW-GEGTDEIPHV 179
S+ G F++ + P++ DP ++ V+ + G YRM+Y S + W + T E+ H
Sbjct 114 SKDGGHTFQKHAQGPLLDRCVYDPCFVASNCVLAEEGFYRMYYLSCVQWQAQPTGEVQHY 173
Query 180 --IRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA----KYR 233
I+YA+S +G+ W+++ +V I A P V+++AG Y+MW+ R + YR
Sbjct 174 YHIKYAESANGIDWKREGKVAIGFKNEYEYAISVPRVIKEAGRYKMWYSYRASAHTTTYR 233
Query 234 IYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFG 293
I A S DGL W + + G+DVS + WDS MI YP +F +R+MLY+G+ YG++G G
Sbjct 234 IGYAESVDGLDWVRKDELVGLDVSAEGWDSQMICYPEIFTFEHKRYMLYNGNEYGKSGIG 293
Query 294 LAV 296
LAV
Sbjct 294 LAV 296
>gi|296161428|ref|ZP_06844234.1| conserved hypothetical protein [Burkholderia sp. Ch1-1]
gi|295888243|gb|EFG68055.1| conserved hypothetical protein [Burkholderia sp. Ch1-1]
Length=306
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/307 (35%), Positives = 164/307 (54%), Gaps = 13/307 (4%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M WRKLG ++ P+G+L W+RS+A+ P P +++ RIY GRD + IG V VD
Sbjct 1 MEWRKLGVVWCPNGDLWWARSYASCPTPLFLDDGTLRIYVQGRDEKGIGRIGFVDVDAGD 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGV-SIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGV 119
++L + ++P+L G G FDD GV + R T +YY G+ + + ++ G+
Sbjct 61 PTRVLRVSSDPVLDVGVPGAFDDNGVFQTCVLARPDGTLAMYYVGFEICHQIRYRLLTGL 120
Query 120 AIS-EAGAPFERWSTFPVVALDERDPFSLSY---PWVIQDGGTYRMWYGSNLGWG--EGT 173
AIS + G F+R P++ ER P L + P+V+ +GG YRMWY + W EG
Sbjct 121 AISRDGGETFQRLRATPIL---ERSPDELYFRCGPFVMAEGGVYRMWYIAGSEWETLEGK 177
Query 174 DEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK-- 231
+ +RY +S DG+ W + + RPY+VR Y+M++ R
Sbjct 178 AMPVYDLRYLESEDGIVWPDAGSRVFELNRDVEHGVGRPYIVRKRDGYQMFYSIRKKPPL 237
Query 232 -YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRT 290
YR+ A S DGL W ++ + G+DVS WD++ IEY V + + F Y+G+ +G T
Sbjct 238 GYRMGYAESPDGLHWTRMDEQLGLDVSASGWDNETIEYSAVVNVGDKTFCFYNGNDFGGT 297
Query 291 GFGLAVL 297
GFG+A L
Sbjct 298 GFGVAEL 304
>gi|237742770|ref|ZP_04573251.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13]
gi|229430418|gb|EEO40630.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13]
Length=301
Score = 170 bits (430), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 106/303 (35%), Positives = 164/303 (55%), Gaps = 9/303 (2%)
Query 1 MAWRKLGRIFA--PSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDL 58
M W+KLG+IF + +W+ SH+A PV + D R+YFS RD + +S++GS +
Sbjct 1 MKWKKLGKIFEIDEKNKKNWNASHSANPVCIKLNSDEIRVYFSTRDTEGKSNVGSFDYSM 60
Query 59 AVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIG 118
KI+DI +P++ G+ D G+ IG+I+ D + +YY W + W+ I
Sbjct 61 K-ENKIIDINEKPVMLHGSGEEVDSSGIGIGNIIEILDEKYMYYMAWQVPQGQHWRGDIA 119
Query 119 VA-ISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIP 177
A + R F + ++ D SLSYP++I++ +Y MWYGS W G E+
Sbjct 120 RAKLDLENNVMVRDDDFLMTVNNDIDKVSLSYPFLIKENNSYYMWYGSTDTWDFGNGEML 179
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK--YRIY 235
H+I A S DG ++K+ + I A RP V++ +RMW+ RG K Y+I
Sbjct 180 HIINLAISEDGEKFDKRKKC-IPYEIGKAQAFSRPVVIKWKDKWRMWYSYRGNKDKYKIG 238
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLA 295
A +++ W K+ S WDS+M+ YP VF++ + +MLY+G+GYG+TG GLA
Sbjct 239 YAEADNLDKWEV--KESNFYCSESGWDSEMVCYPYVFEYNDKLYMLYNGNGYGKTGIGLA 296
Query 296 VLE 298
VLE
Sbjct 297 VLE 299
>gi|242400024|ref|YP_002995449.1| hypothetical protein TSIB_2053 [Thermococcus sibiricus MM 739]
gi|242266418|gb|ACS91100.1| hypothetical protein TSIB_2053 [Thermococcus sibiricus MM 739]
Length=308
Score = 166 bits (421), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/308 (33%), Positives = 160/308 (52%), Gaps = 10/308 (3%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M WRK+GRI+AP GE W + A P P ++ + R+Y RD + S IG V V
Sbjct 1 MKWRKMGRIYAPKGEKPWMQHSAMTPTPILLDDETIRVYVGFRDNEGVSRIGYVDVKADN 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++LDI EP+L G G FDD G+ +G +V+ + +YY G+ L + G+A
Sbjct 61 PSRVLDISMEPVLDIGIPGAFDDNGMILGDVVKYKNKIRMYYVGFQLVKKAKFLAFSGLA 120
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGE--GTDEIP 177
IS + G F R S P++ +R+ + + V+ + G +R+WY + W G
Sbjct 121 ISKDEGYTFRRISNAPILDRIDRELYIRAIHSVLFENGKWRIWYAAGNKWEYIGGKPYPS 180
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK------ 231
+ IRY +S+DG+ +E++ + + + RP V + Y M F +G K
Sbjct 181 YDIRYIESKDGITFERKPGTIVIPNNNTEYRIGRPRVYKFNEKYYM-FYTKGVKRGNHFD 239
Query 232 YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTG 291
Y A S DG+ W + + GI SP WDS+M+ YP + + + +M Y+G+G G++G
Sbjct 240 YLPGFAESFDGIHWVRKDHEIGITPSPRGWDSEMLCYPSLIQYEDKIYMFYNGNGMGKSG 299
Query 292 FGLAVLEN 299
FG A+LE+
Sbjct 300 FGYAILES 307
>gi|284040023|ref|YP_003389953.1| hypothetical protein Slin_5182 [Spirosoma linguale DSM 74]
gi|283819316|gb|ADB41154.1| conserved hypothetical protein [Spirosoma linguale DSM 74]
Length=315
Score = 166 bits (419), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 110/314 (36%), Positives = 172/314 (55%), Gaps = 23/314 (7%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W+K G ++ P G +SR+HA +P ++ D R+YFS RD + S++ V ++
Sbjct 1 MTWQKKGLVYKPDGSKPFSRTHAQVPFGFPMQ-DKVRVYFSTRDEHSASAVSFVELNPDN 59
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
++ + +P L+ GA GMFD+ G + GD LYYTGWN + T ++ +IG+A
Sbjct 60 LSEVTYVHDKPCLQKGAVGMFDETGTMPSWFLPVGDEIWLYYTGWNKSETASYRLSIGLA 119
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQ----DGGT-YRMWYGS--NLGWGEG 172
IS + G FER T P++ D ++ P V++ DG +RMWY S + G
Sbjct 120 ISRDGGLTFERKYTGPLLDRSIYDQVWIAQPCVMREEQPDGSIRWRMWYLSCTKIEVING 179
Query 173 TDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSD--NSAACRPYVVRDAGVYRMWFCARGA 230
E + ++YA+S DG+ W++ V + G D A RP V +D +Y+M+F R A
Sbjct 180 HPEPFYDVKYAESEDGIDWKRTGHVCV---GYDEFTDAIGRPTVYKDGDLYKMYFSYRNA 236
Query 231 ---------KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFML 281
YRI A S+DG++W + + GI+ S + WDS M++Y +F H+ Q ML
Sbjct 237 TNYRTDVERSYRIGYAESKDGISWERKDELAGIERSAEGWDSVMMDYCHIFKHQDQWIML 296
Query 282 YSGDGYGRTGFGLA 295
Y+G+G+G +GFG A
Sbjct 297 YNGNGFGASGFGYA 310
>gi|336315407|ref|ZP_08570318.1| hypothetical protein Rhein_1693 [Rheinheimera sp. A13L]
gi|335880384|gb|EGM78272.1| hypothetical protein Rhein_1693 [Rheinheimera sp. A13L]
Length=318
Score = 160 bits (405), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 111/322 (35%), Positives = 170/322 (53%), Gaps = 32/322 (9%)
Query 1 MAWRKLGRIFAPS------GELDWSRSHAALPVPEWIEGDIFRIYFSGR----DGQNRSS 50
M W+KLG+IF P+ G ++++S AL +++ RIYF + +G+ S
Sbjct 1 MKWQKLGKIFDPTTVVLADGCTEFAKSPQALVFEDFV-----RIYFCAQKKTANGKYLSF 55
Query 51 IGSVIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVT 110
V D ++ KIL + + I++PG G FD+ G+ S+ R + L Y +GW+ +
Sbjct 56 PQYVDFDKSLN-KILALSEQSIIQPGELGHFDEHGIFPFSVTRDDKSILAYTSGWSRRTS 114
Query 111 VPWKNTIGVAIS-EAGAPFERWSTF-PVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLG 168
V +IG+A S + GA FE++ PV+A +P ++ +V++ G+Y MWY
Sbjct 115 VSVDMSIGLARSTDQGASFEKYGAGGPVMAASHNEPMMVADAFVLKVNGSYHMWYIFGSH 174
Query 169 W----GEGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMW 224
W +G E + I YA S DG+ W++ + ID D A P V+ AG Y M+
Sbjct 175 WQKKTADGAAERFYKIAYAHSSDGITWQRTGQTIIDERIPDECQAL-PTVIYAAGKYHMY 233
Query 225 FCARGA---------KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHR 275
FC R A YR+ A SED + W + GID+S WDS+M+ YP +F+
Sbjct 234 FCYRSAYDFRQNSKNAYRLGYAWSEDLIRWTRDDNLAGIDLSDSGWDSEMMCYPNLFESD 293
Query 276 GQRFMLYSGDGYGRTGFGLAVL 297
GQ F+LY+G+ +GR GFGLA L
Sbjct 294 GQIFLLYNGNEFGRYGFGLARL 315
>gi|227112550|ref|ZP_03826206.1| hypothetical protein PcarbP_06277 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length=319
Score = 159 bits (402), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 109/315 (35%), Positives = 156/315 (50%), Gaps = 18/315 (5%)
Query 1 MAWRKLGRIFAPSGELD--WSRSHAALPVPEWIEGDIFRIYFSGR---DGQNRSSIGSVI 55
W+KLG++F P D W + A P + D R+YFS R D Q S
Sbjct 2 FKWKKLGKVFTPQDVNDRLWLKEFAQAPA-TLVFDDFVRVYFSCRPPADEQGMYVSYSAW 60
Query 56 VDLAVGG--KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPW 113
VDL K+L + PIL G G FD+ G S+VR + YY GW +VP+
Sbjct 61 VDLDRNNLFKVLRVSENPILPLGQSGEFDEFGTYPVSVVRDENIFRAYYAGWTRCESVPF 120
Query 114 KNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--E 171
IG+A S+ G F + P+++ +PF +S P V + G ++++Y + W +
Sbjct 121 NVAIGMATSDDGDVFRKAGPGPIISYSPEEPFVMSGPKVRRFNGEWQLFYIAGRRWKLVD 180
Query 172 GTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK 231
G E + IR A S DG++W K ++ I + ++ A P V Y M+FC R ++
Sbjct 181 GRAEPVYKIRMAVSSDGINWRKLNKDLISSRIEEDEAQASPDVFYANCKYHMFFCYRYSE 240
Query 232 --------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYS 283
YRI A S D + W + + G+DVS WDS+MI YP VF+ G+ +M Y
Sbjct 241 HYRGKKHGYRIGYAWSSDLIDWHRDDEKAGVDVSETGWDSEMISYPHVFELDGKVYMAYL 300
Query 284 GDGYGRTGFGLAVLE 298
GD GR GFGLA LE
Sbjct 301 GDQVGRYGFGLAQLE 315
>gi|50120668|ref|YP_049835.1| hypothetical protein ECA1735 [Pectobacterium atrosepticum SCRI1043]
gi|49611194|emb|CAG74640.1| conserved hypothetical protein [Pectobacterium atrosepticum SCRI1043]
Length=319
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/315 (35%), Positives = 155/315 (50%), Gaps = 18/315 (5%)
Query 1 MAWRKLGRIFAPS--GELDWSRSHAALPVPEWIEGDIFRIYFSGR---DGQNRSSIGSVI 55
W+KLG++F P W + A P + D R+YFS R D S
Sbjct 2 FQWKKLGKVFTPQDINNRPWLKEFAQAPA-TLVFDDFVRVYFSCRPPVDEHGMYVSYSAW 60
Query 56 VDLAVGG--KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPW 113
VDL +L + +PIL G G FD+ G S+VR + YY GW +VP+
Sbjct 61 VDLDRNNLFNVLRVSEKPILPLGQSGEFDEFGTYPVSVVRDENIFRAYYAGWTRCESVPF 120
Query 114 KNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--E 171
IG+AIS+ G F + P+++ +PF +S P + + G ++++Y + W +
Sbjct 121 NVAIGMAISDDGDVFRKAGPGPIISYSPEEPFVMSGPKIRRFNGEWQLFYIAGRRWKLVD 180
Query 172 GTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK 231
G E + IR A S DG +W K ++ I + ++ A P V G Y M+FC R ++
Sbjct 181 GRAEPVYKIRMAVSSDGTNWRKINKDLISSRIEEDEAQASPDVFYANGRYHMFFCYRYSE 240
Query 232 --------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYS 283
YRI A S D + W + + GIDVS WDS+MI YP VF+ G+ +M Y
Sbjct 241 HYRGKKHGYRIGYAWSSDLIDWHRDDEKVGIDVSETGWDSEMISYPHVFELDGKVYMAYL 300
Query 284 GDGYGRTGFGLAVLE 298
GD GR GFGLA LE
Sbjct 301 GDQVGRYGFGLAQLE 315
>gi|253688943|ref|YP_003018133.1| hypothetical protein PC1_2566 [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|251755521|gb|ACT13597.1| conserved hypothetical protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
Length=319
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/315 (34%), Positives = 155/315 (50%), Gaps = 18/315 (5%)
Query 1 MAWRKLGRIFAPSGELD--WSRSHAALPVPEWIEGDIFRIYFSGR-----DGQNRSSIGS 53
W+KLG++F P D W + A P + D R+YFS R G S
Sbjct 2 FQWKKLGKVFTPQEINDRPWLKEFAQAPA-TLVFDDFVRVYFSCRPPADEHGMYVSYSAW 60
Query 54 VIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPW 113
V +D +L + PIL G G FD+ G S+VR + YY GW +VP+
Sbjct 61 VDLDRHNLFNVLRVSETPILPLGQSGEFDEFGTYPVSVVRDENIFRAYYAGWTRCESVPF 120
Query 114 KNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWG--E 171
IG+A S+ G F + P+++ +PF +S P + + G ++++Y + W +
Sbjct 121 NVAIGMATSDDGDVFRKAGPGPIISYSPEEPFVMSGPKIRRFNGEWQLFYIAGRRWKRVD 180
Query 172 GTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK 231
G E + IR A S DG++W K ++ I + ++ A P V G Y M+FC R ++
Sbjct 181 GRAEPVYKIRMALSSDGINWRKINKDLISSRIEEDEAQASPDVFYANGKYHMFFCYRYSE 240
Query 232 --------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYS 283
YRI A S D + W + + GIDVS WDS+MI YP VF+ G+ +M Y
Sbjct 241 HYRGKKHGYRIGYAWSSDLIDWHRDDEKAGIDVSETGWDSEMISYPHVFELDGKVYMAYL 300
Query 284 GDGYGRTGFGLAVLE 298
GD GR GFGLA LE
Sbjct 301 GDQVGRYGFGLAQLE 315
>gi|149915503|ref|ZP_01904030.1| hypothetical protein RAZWK3B_05792 [Roseobacter sp. AzwK-3b]
gi|149810792|gb|EDM70633.1| hypothetical protein RAZWK3B_05792 [Roseobacter sp. AzwK-3b]
Length=322
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/316 (35%), Positives = 153/316 (49%), Gaps = 19/316 (6%)
Query 1 MAWRKLGRIFAPSGE--LDWSRSHAALPVPEWIEGDIFRIYFSGR-----DGQNRSSIGS 53
W+KLG++F P W A P I D+ R+YFS R +G S
Sbjct 5 FQWQKLGKVFDPRAYSCRPWLACFAQAPA-TLIFDDVVRVYFSCRPQPDANGHFTSYSSW 63
Query 54 VIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPW 113
V +D +++ + P+LR G G FD+ G S+++ D L YY GW +VP+
Sbjct 64 VDLDRTDLTRVVRVADAPVLRLGETGTFDEFGTYPISVIKTEDGVLAYYAGWTRCESVPF 123
Query 114 KNTIGVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW--G 170
IG A+S + GA FE+ P++ +PF +S P + + G TY ++Y + W
Sbjct 124 NVAIGAALSRDGGAHFEKLGQGPIIGYSPDEPFVMSGPKIRKFGETYYLFYIAGTKWVLH 183
Query 171 EGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA 230
+G E + IR A S DG +W K IDT ++ A P V G Y M+FC R +
Sbjct 184 KGRPEPVYRIRMAMSDDGRNWVKHGHHLIDTVVEEDEAQASPDVHFHDGRYHMFFCYRYS 243
Query 231 K--------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLY 282
YRI A S D TW++ G+ S WDS+M+ YP VF GQ +M Y
Sbjct 244 TDYRGHARGYRIGYAHSADLRTWQRDDSLCGMHPSETGWDSEMVSYPHVFQVDGQTYMAY 303
Query 283 SGDGYGRTGFGLAVLE 298
G+ GR GFGLA LE
Sbjct 304 LGNEVGREGFGLARLE 319
>gi|167627484|ref|YP_001677984.1| hypothetical protein Fphi_1258 [Francisella philomiragia subsp.
philomiragia ATCC 25017]
gi|167597485|gb|ABZ87483.1| conserved hypothetical protein [Francisella philomiragia subsp.
philomiragia ATCC 25017]
Length=304
Score = 153 bits (387), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/303 (34%), Positives = 142/303 (47%), Gaps = 7/303 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGD-IFRIYFSGRDGQNRSSIGSVIVDLA 59
M W K G I P W++ + LP P +IE D + R++F D N I + +D
Sbjct 1 MKWEKKGLIHRPKSNASWNKKYDILPTPYFIEKDNVIRVFFGTTDDMNFGRITFIDIDAD 60
Query 60 VGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGV 119
++ + ++ G G FDDCGV SI+R D +Y G+ V P+ G+
Sbjct 61 NPLNVVYEHDDYVVDLGRDGTFDDCGVVPSSIIRKNDRYYMYTVGFQRTVKTPYMLFAGL 120
Query 120 AISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDE--IP 177
S F R S P++ S P VI D G Y+MW+ W ++ +
Sbjct 121 LESSDLRSFSRVSESPILPRVGLRCISQGAPCVIFDEGMYKMWHWYATKWIHVNNKKFMD 180
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNS-AACRPYVVRDAGVYRMWFCARGAK--YRI 234
+ I YA+S DGV W D + S N RP+V +D GVY M++ R YRI
Sbjct 181 YHIGYAESTDGVSWNMHDEYCLKPEQSLNEFGVARPWVFKDDGVYHMYYSTRYVDKLYRI 240
Query 235 YCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGL 294
A S DGL W + + DVS WDS+MI YP V + + +M Y+G+ G TGFG
Sbjct 241 SYAYSFDGLKWIRTNQIP-FDVSDKGWDSEMICYPSVLKVKNKLYMFYNGNNNGETGFGY 299
Query 295 AVL 297
A +
Sbjct 300 AEM 302
>gi|149925690|ref|ZP_01913954.1| hypothetical protein LMED105_05682 [Limnobacter sp. MED105]
gi|149825807|gb|EDM85015.1| hypothetical protein LMED105_05682 [Limnobacter sp. MED105]
Length=305
Score = 152 bits (385), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/305 (33%), Positives = 148/305 (49%), Gaps = 8/305 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W+K G I+ PSG W+++ A P P + D R+Y RD Q S IG V VD
Sbjct 1 MKWQKKGHIYGPSGTPSWAQNSALTPTPILLNPDTIRVYAGFRDSQGVSRIGFVDVDSNN 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
GK+L + P L G G FDD GV +G +++ G++ +YY G+ L + G+A
Sbjct 61 PGKVLRVSETPALDIGQPGAFDDNGVILGDVIKVGESLHMYYVGFQLVAKAKFLAFSGLA 120
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIP-- 177
+S + G F+R S PV+ + + V + +R WY + GW E D P
Sbjct 121 VSTDGGDSFKRVSAAPVLDRANEGIYIRAIHSVHLENSRFRAWYACDDGW-ELIDGKPYP 179
Query 178 -HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFC--ARGAKYRI 234
+ IR+ S +G+H++++ + I SG D RP V G M F Y
Sbjct 180 RYQIRHVSSANGIHFDQETQPCIPLSG-DEYRIGRPRVFFVKGQRYMHFTWGTPQGDYFP 238
Query 235 YCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGL 294
A S+DGL W ++ GI ++P WDS + YP + + M Y+G+ G GFG
Sbjct 239 GLAKSDDGLHWTRIDDQLGISLAPQGWDSKHLCYPALLQVNDKTLMFYNGNNMGLEGFGW 298
Query 295 AVLEN 299
A LE+
Sbjct 299 AELES 303
>gi|296101726|ref|YP_003611872.1| hypothetical protein ECL_01362 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gi|295056185|gb|ADF60923.1| hypothetical protein ECL_01362 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length=314
Score = 152 bits (383), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 108/311 (35%), Positives = 152/311 (49%), Gaps = 19/311 (6%)
Query 6 LGRIFAPS--GELDWSRSHAALPVPEWIEGDIFRIYFSGR---DGQNRSSIGSVIVDLAV 60
+G++F P L W + A P I D R+YFS R D Q + S VDLA
Sbjct 1 MGKVFTPQEVTHLPWLKEFAQAPA-TLIFDDFVRVYFSCRPPADEQGKYVSYSAWVDLAR 59
Query 61 GG--KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIG 118
+L + EPIL G G FD+ G S++R D +Y GW +VP+ IG
Sbjct 60 DDLFHVLRVAREPILPLGGYGEFDEFGTYPVSVMRDNDVVKAWYAGWTRCESVPFNVAIG 119
Query 119 VAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWY--GSNLGWGEGTDE 175
+A+S + G F + P + +PF +S P + + ++++Y G W +G E
Sbjct 120 MAVSHDQGETFVKAGPGPAIGYSPDEPFVMSGPKIRRFNNQWQLFYIAGRKWKWVDGRAE 179
Query 176 IPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK---- 231
+ IR A S DG++W K ++ I + ++ A P V G Y M+FC R +
Sbjct 180 PVYKIRMATSDDGINWTKLNKDLIPSRIEEDEAQASPDVFYANGKYHMFFCYRYSAHYRG 239
Query 232 ----YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGY 287
YRI A S D +TW + GIDVS WD++MI YP VF+ G +M Y GD
Sbjct 240 KQNGYRIGYAWSLDMITWHRDDSKAGIDVSASGWDAEMISYPHVFELDGTIYMAYLGDQV 299
Query 288 GRTGFGLAVLE 298
GR GFGLA LE
Sbjct 300 GRYGFGLAQLE 310
>gi|119504992|ref|ZP_01627069.1| hypothetical protein MGP2080_05240 [marine gamma proteobacterium
HTCC2080]
gi|119459278|gb|EAW40376.1| hypothetical protein MGP2080_05240 [marine gamma proteobacterium
HTCC2080]
Length=320
Score = 149 bits (377), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 106/314 (34%), Positives = 150/314 (48%), Gaps = 19/314 (6%)
Query 3 WRKLGRIFAPSG--ELDWSRSHAALPVPEWIEGDIFRIYFSGR-----DGQNRSSIGSVI 55
W+KLG++F P W + A P E D R+YFS R GQ S V
Sbjct 4 WKKLGKVFTPQKIKGRPWLKEFAQAPATLIFE-DFVRVYFSCRPARDESGQYVSYSAYVD 62
Query 56 VDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKN 115
+D + + PIL G G FD+ G S++R D YY GW + P+
Sbjct 63 LDRENLFNVRAVSESPILPLGGLGEFDEFGSYPVSVIRESDGVRAYYGGWTRCSSTPYTV 122
Query 116 TIGVAISE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW--GEG 172
IG A SE G F+R P+++ +PF LS P + + G +++Y + +GW +G
Sbjct 123 AIGHAFSEDGGKSFKRAGPGPILSQTPHEPFVLSGPKIRRFGDEQQLFYVAGIGWEMHDG 182
Query 173 TDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA-- 230
E + IR A S++G +W++ R I + P V+R Y M+FC +
Sbjct 183 RAESIYRIRVATSKNGSNWKRDGRDLIPIKLDEKECQASPDVIRADNKYHMFFCYKHGVD 242
Query 231 ------KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSG 284
YRI A S+ + W + GIDVS + WDS+ I YP VF+ G FMLY G
Sbjct 243 FRNSSRGYRIGYAYSDTLVDWIRRDDLAGIDVSLEGWDSESIAYPHVFELDGNYFMLYLG 302
Query 285 DGYGRTGFGLAVLE 298
+ GR GFGLA+LE
Sbjct 303 NEVGRYGFGLAILE 316
>gi|336315406|ref|ZP_08570317.1| Putative glycosylase [Rheinheimera sp. A13L]
gi|335880383|gb|EGM78271.1| Putative glycosylase [Rheinheimera sp. A13L]
Length=324
Score = 146 bits (369), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/319 (33%), Positives = 154/319 (49%), Gaps = 22/319 (6%)
Query 1 MAWRKLGRIFAPS---GELDWSRSHAALPVPEWIEGDIFRIYFSGR-----DGQNRSSIG 52
+ W KLG +F P DW + A P + I R++F R + Q S
Sbjct 3 LTWEKLGLVFDPELIPERPDWMVNFAQAPNVVIFDSFI-RVFFCCRPKPDENKQFVSYCA 61
Query 53 SVIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVP 112
V +D K+L+I +P+L G G FD+ G S+ + Y GW +VP
Sbjct 62 FVDLDKTDLFKVLNISQKPLLSLGDLGTFDEFGTYPVSVTEDSGELIAIYGGWQRCESVP 121
Query 113 WKNTIGVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW-- 169
+ ++G+A S + G F + PV++ +PF ++ P + + T+ + Y + W
Sbjct 122 FNISLGLARSHDKGVSFTKHGPGPVLSHSPNEPFIVTSPKLRKYNDTWYLAYTAGRKWIL 181
Query 170 -GEGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCAR 228
EG EI + +R A S+D V+W + DR ID+ D+ A P + AG Y M+FC R
Sbjct 182 DEEGRPEIIYKMRMATSKDLVNWTRLDRDIIDSKLGDDEAQACPDIFYAAGKYHMFFCYR 241
Query 229 ---------GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRF 279
YRI A+S D W + +DVS WDS+M+ YP VF+ G +
Sbjct 242 QGLDFRSNKNNSYRIGYASSVDLQQWHRDDSKVDLDVSETGWDSEMVAYPTVFELDGTVY 301
Query 280 MLYSGDGYGRTGFGLAVLE 298
MLY+G+G G+TGFGLA L
Sbjct 302 MLYAGNGNGKTGFGLAKLH 320
>gi|83951253|ref|ZP_00959986.1| hypothetical protein ISM_09125 [Roseovarius nubinhibens ISM]
gi|83839152|gb|EAP78448.1| hypothetical protein ISM_09125 [Roseovarius nubinhibens ISM]
Length=311
Score = 146 bits (368), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/304 (33%), Positives = 147/304 (49%), Gaps = 7/304 (2%)
Query 2 AWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVG 61
AWRKLGRIF PSGELDW++ PVP + D RIY RD + S IG + VD A
Sbjct 5 AWRKLGRIFCPSGELDWAQHSFMTPVPLQVNADTIRIYGGMRDRKGISRIGWIEVDRARP 64
Query 62 GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRL-LYYTGWNLAVTVPWKNTIGVA 120
+ D+ + P++ G GMFDD G+ +G ++R D R+ +YY G+ L V + G+A
Sbjct 65 TVLRDVGSMPVIALGDPGMFDDNGMILGDLLRLEDGRIRMYYVGFQLVQQVKFLAFTGLA 124
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGE-GTDEIPH 178
S + G F+R P++ E+ PF + ++ G YR W W + G P
Sbjct 125 ESTDGGLSFQRLQKHPILDRAEQAPFINALHSILPVEGGYRAWISCGQRWQDIGGRVFPQ 184
Query 179 VIRYA-QSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCA--RGAKYRIY 235
+ S DG+H++ + D RP R Y M + +Y +
Sbjct 185 YNCWTVTSPDGIHFDMETATPTLDVTGDEYRIGRPRANRTTDGYEMRVTSDTLAKQYATF 244
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFD-HRGQRFMLYSGDGYGRTGFGL 294
A S DG+ W + +E WD +M YP D +G+ ++ ++G+ G TG G+
Sbjct 245 LAKSSDGVNWTRTTVEELPRGEAGDWDDEMTCYPARIDTDQGESYLFFNGNNMGETGVGV 304
Query 295 AVLE 298
AVL+
Sbjct 305 AVLD 308
>gi|254373349|ref|ZP_04988837.1| predicted protein [Francisella tularensis subsp. novicida GA99-3549]
gi|151571075|gb|EDN36729.1| predicted protein [Francisella novicida GA99-3549]
Length=303
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/302 (31%), Positives = 148/302 (50%), Gaps = 7/302 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W+K G IF + W S A P P + D R Y RD + S IG + +D
Sbjct 1 MKWQKKGLIFKNEFKKGWRYSSALQPTP-LVFDDKIRFYVGFRDEKGVSRIGFIDLDKKD 59
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
KIL I P+L G G FD+ GV +I+R + +YY G+ L V + G+A
Sbjct 60 PKKILKISDTPVLDIGPDGAFDEFGVVPSAIIRYDNKVYMYYAGYQLGKKVRFLVLSGLA 119
Query 121 ISE-AGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDEIPHV 179
IS+ G F+R P+ +++ V + ++ WYG + +G + V
Sbjct 120 ISDDNGETFKRIKKVPIFERTDKEMLFRVPHTVRFEENKFKFWYGGGSHFEQGKQKTLPV 179
Query 180 --IRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWF--CARGAKYRIY 235
+RY +S DG+ + + +I + + RP+V++ Y M++ + Y++
Sbjct 180 YDVRYLESIDGISIPSEGK-NIISLKENEYRVGRPFVIKRNSKYLMFYGYSSENKPYQLG 238
Query 236 CATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLA 295
A S+DG+ W +L + GI++S WDS+M+ YPCV D + ++ Y+G+ YG GFG A
Sbjct 239 YAESKDGINWIRLDDNVGIELSATGWDSEMMAYPCVVDINDKTYLFYNGNNYGADGFGYA 298
Query 296 VL 297
L
Sbjct 299 EL 300
>gi|336322666|ref|YP_004602633.1| hypothetical protein Flexsi_0377 [Flexistipes sinusarabici DSM
4947]
gi|336106247|gb|AEI14065.1| hypothetical protein Flexsi_0377 [Flexistipes sinusarabici DSM
4947]
Length=248
Score = 143 bits (361), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 76/233 (33%), Positives = 128/233 (55%), Gaps = 5/233 (2%)
Query 3 WRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVGG 62
WRKLGRI+ S DW SH P P ++ + RIYF RD NR+ + V+
Sbjct 2 WRKLGRIYTVSKHSDWEWSHTHKPTPFLVDENTLRIYFGVRDKSNRTRTTFIDVNPENPL 61
Query 63 KILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTR-LLYYTGWNLAVTVPWKNTIGVAI 121
+I+ +P+L G G FDD G ++ +++ + ++YY GWN + +VP +N+IG+A
Sbjct 62 EIIYEHHKPVLDLGPLGAFDDLGANVSCVLKNEKSEVIMYYYGWNTSTSVPARNSIGIAK 121
Query 122 S-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTD--EIPH 178
S + G FE+ P++ + +P+ + P+V+ G Y+MWY S W D EI +
Sbjct 122 SLDGGLTFEKMFVGPIMDRTKYEPYFTTAPFVLFKDGVYQMWYTSGTEWKLINDKPEICY 181
Query 179 VIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK 231
I+YA S+DG+ W+++++ I ++ R V+++ +Y+MW+ R K
Sbjct 182 HIKYATSKDGIEWKRENQSCI-IPQNEYEITARGSVIKEDEIYKMWYSKRSIK 233
>gi|299134512|ref|ZP_07027705.1| conserved hypothetical protein [Afipia sp. 1NLS2]
gi|298591259|gb|EFI51461.1| conserved hypothetical protein [Afipia sp. 1NLS2]
Length=302
Score = 142 bits (359), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 93/303 (31%), Positives = 146/303 (49%), Gaps = 8/303 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M + K+G +F SG DW SH +P ++ R+YF+ RD IG VD
Sbjct 1 MRFEKVGVVFDASGRADWMNSHTYVPTALLLDDSTIRVYFASRDKDQVGRIGWFDVDANE 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVA 120
K++ P L G G FDD GV+ S+ + D LYY GW L + G+A
Sbjct 61 PTKVIGFSDRPCLDIGDDGCFDDNGVTPLSVFKDHDGIRLYYAGWQLTPKARYMLFTGLA 120
Query 121 IS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGE--GTDEIP 177
IS + G F R+ PV+ + S +++ GG Y++WY + G+ G
Sbjct 121 ISKDGGNTFRRYQKSPVLDRSPSELVVRSGAHIMKHGGLYKIWYAAGSGFVNISGKQVPT 180
Query 178 HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCAR--GAKYRIY 235
+ + YA+S DG+ W + + I+ D RP ++ +++ +R YRI
Sbjct 181 YHLAYAESEDGITWPDKGILSIEPQAPDEYGFGRPGMLIRGDELNIFYSSRTFSKGYRIG 240
Query 236 CATSEDGLTWRQLGKDE-GIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGL 294
A S+DG TW + +D G++ S WDS+M + + + + M Y+G+ +GRTG GL
Sbjct 241 YARSDDGRTWTR--QDHLGLNTSAFGWDSEMTCFASIVETQAGTLMFYNGNDFGRTGIGL 298
Query 295 AVL 297
AV+
Sbjct 299 AVI 301
>gi|253688942|ref|YP_003018132.1| hypothetical protein PC1_2565 [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|251755520|gb|ACT13596.1| conserved hypothetical protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
Length=322
Score = 142 bits (357), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 100/321 (32%), Positives = 159/321 (50%), Gaps = 29/321 (9%)
Query 1 MAWRKLGRIFAPS------GELDWSRSHAALPVPEWIEGDIFRIYFSGR--DGQNRSSIG 52
+ WRK G I++P G +++S AL + D RIYFS R D +N I
Sbjct 2 LTWRKHGLIYSPQAHPPLIGGAGYAQSPQAL-----VFDDFVRIYFSTREIDEKNNKFIS 56
Query 53 SV-IVDLAVG-GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVT 110
V VD+ +IL++ A P++ G FD+ G+ +++R D + + TGWN V+
Sbjct 57 RVSYVDMDKNLQEILNVSAAPVIDHAELGTFDEHGIFPFNVLRHNDAVMAWTTGWNRRVS 116
Query 111 VPWKNTIGVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW 169
V +IG+AIS + G F+R +T PV++ +P+ + +V+ G + MWY +GW
Sbjct 117 VSVDTSIGLAISRDGGNTFQRHATGPVMSASLHEPYLVGDAFVLHLEGRFHMWYIYGVGW 176
Query 170 GEGTDEIP----HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWF 225
+ P + I +A S DG+ W ++ + I D+ P V++ Y M F
Sbjct 177 KRQQSDSPPDRVYKIAHAVSDDGIDWVRESKPIIADRLGDDECQALPTVIKVGNRYHMIF 236
Query 226 CAR---------GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRG 276
C R G YR+ A S+D +TW + P WDS+M YP +F
Sbjct 237 CYRECFDFRLGAGRGYRLGYAWSDDLMTWHRDDSQVPAISGPGEWDSEMQCYPHLFRCDE 296
Query 277 QRFMLYSGDGYGRTGFGLAVL 297
+ ++LY+G+ +G+ GFGLA L
Sbjct 297 KVYLLYNGNAFGKEGFGLAEL 317
>gi|229916907|ref|YP_002885553.1| hypothetical protein EAT1b_1180 [Exiguobacterium sp. AT1b]
gi|229468336|gb|ACQ70108.1| conserved hypothetical protein [Exiguobacterium sp. AT1b]
Length=312
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/315 (34%), Positives = 153/315 (49%), Gaps = 20/315 (6%)
Query 1 MAWRKLGRIF--APSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSV-IVD 57
M W+KLG IF AP G S A P E D RIYFS R+ V VD
Sbjct 1 MKWKKLGHIFDPAPYGFFGKYSSFAQSPQALVFE-DFVRIYFSTREPDGDMFKSHVRYVD 59
Query 58 LAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTI 117
+ ++D+ + I+ G RG FD+ G+ + R D Y +GW+ +V + I
Sbjct 60 MTRDFNVIDVSTDEIIPLGKRGTFDEHGIFPFHVTRTRDGLYGYTSGWSRRDSVAVETGI 119
Query 118 GVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTD-- 174
G+++S + G FER PV++ +PF + P+V+ Y M+Y W EG D
Sbjct 120 GLSVSRDEGETFERLGDGPVLSASIEEPFLVGDPFVVTRE-KYYMYYIYGTTWKEGPDGV 178
Query 175 -EIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGA--- 230
E + I A S DG +++ R+ D ++ A P V G Y M FC R
Sbjct 179 QERTYKIALATSEDGQTFKRHGRIVSDVIADESQAL--PTVFEADGRYHMIFCFRDTFGF 236
Query 231 ------KYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSG 284
YR+ A SE+ L W + G + S D WD+DM YP VF+ G+ ++LY+G
Sbjct 237 RTDPLRGYRLGYAYSENLLDWTRDDAALGFERSSDGWDADMECYPHVFEWEGRHYLLYNG 296
Query 285 DGYGRTGFGLAVLEN 299
+ +GR GFG+A+LE+
Sbjct 297 NEFGRHGFGVAILED 311
>gi|119504993|ref|ZP_01627070.1| hypothetical protein MGP2080_05245 [marine gamma proteobacterium
HTCC2080]
gi|119459279|gb|EAW40377.1| hypothetical protein MGP2080_05245 [marine gamma proteobacterium
HTCC2080]
Length=317
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/315 (33%), Positives = 151/315 (48%), Gaps = 18/315 (5%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIE-GDIFRIYFSGR--DGQNRSSIGSVIVD 57
M + KLG+IF+P ++ P+ I D RI+FS R DG++ VD
Sbjct 1 MEFEKLGKIFSPKDHNLFTNLGEFAQSPQAIVFDDRVRIFFSTREKDGEHTFKSHPCYVD 60
Query 58 LAVG-GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNT 116
+ +IL + P++ G RG FD+ G+ S AGD + TGW V+V +
Sbjct 61 FDLTFSRILGVADRPLIGLGDRGCFDEHGIFPLSPFFAGDKVYAFTTGWTRRVSVSTDSG 120
Query 117 IGVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGEGTDE 175
+G+AIS + G FE++ PV+ +PF +S +VI+ G Y MWY W
Sbjct 121 VGLAISRDRGRTFEKYGRGPVLGPSVDEPFLVSDGYVIEHEGQYHMWYIYGQRWITKVPG 180
Query 176 IP----HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAK 231
P + I +A S D + W++ I + P V+ + GV+ M +C R A
Sbjct 181 APPDRVYKIAHATSHDLITWKRSGIPIIADQLDRDECQALPSVINNEGVFIMAYCYRHAT 240
Query 232 ---------YRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLY 282
YR+ CA S D W+ + D + + WD DM YPC+F GQ +MLY
Sbjct 241 SFRHDSNRGYRLGCAVSTDLTNWKVEDLELVGDSNNNPWDVDMQCYPCLFKLSGQIYMLY 300
Query 283 SGDGYGRTGFGLAVL 297
+G+ +GR GFGLA L
Sbjct 301 NGNEFGRHGFGLARL 315
>gi|186477054|ref|YP_001858524.1| hypothetical protein Bphy_2303 [Burkholderia phymatum STM815]
gi|184193513|gb|ACC71478.1| conserved hypothetical protein [Burkholderia phymatum STM815]
Length=307
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 94/305 (31%), Positives = 154/305 (51%), Gaps = 9/305 (2%)
Query 1 MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAV 60
M W K G ++ + A +P P I+ RI+ + DG N V VD +
Sbjct 1 MQWLKRGLVYRTDQDAPAGTVRAMVPTPLLIDDRTIRIFLTVCDGDNVGRPYFVDVDASD 60
Query 61 GGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGD-TRLLYYTGWNLAVTVPWKNTIGV 119
KI+ P++R GA G FD+ G+ I+R D T ++YY+G+ + +V +K +G+
Sbjct 61 PTKIIGKSTGPLMRTGAPGAFDERGIVCAQILRNTDGTLMMYYSGFERSDSVRYKIFMGL 120
Query 120 AIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGWGE-GTDEIP 177
A S + G F R P++ E + P+VI Y+MWY + W G E+P
Sbjct 121 AKSVDNGESFVRVQDSPILGPTEAESMFRCAPFVIATERGYQMWYTAGSSWEVVGGKEVP 180
Query 178 -HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRD-AGVYRMWFCARG---AKY 232
+ ++Y +S DG+ W + V G D RP++ + G Y++++ R A Y
Sbjct 181 RYSLKYLESTDGIDWASEG-VPCMRFGPDEHGIGRPWITKSPEGKYQLYYSVRRISLAAY 239
Query 233 RIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGF 292
R+ A S++GL W ++ G+DVSP S+DSD + Y + + + + Y+G+G+GR GF
Sbjct 240 RLGYAESDNGLDWNRMDDQLGLDVSPGSFDSDGMSYTALINAGDKTYCFYNGNGFGRDGF 299
Query 293 GLAVL 297
+A L
Sbjct 300 AVAEL 304
>gi|227329195|ref|ZP_03833219.1| hypothetical protein PcarcW_18407 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length=322
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/321 (31%), Positives = 157/321 (49%), Gaps = 29/321 (9%)
Query 1 MAWRKLGRIFAPS------GELDWSRSHAALPVPEWIEGDIFRIYFSGR--DGQNRSSIG 52
+ WRK G I++P G +++S AL + D RIYFS R D +N I
Sbjct 2 LTWRKHGLIYSPQAHPPLIGGAGYAQSPQAL-----VYDDFVRIYFSTREIDEKNNKFIS 56
Query 53 SV-IVDLAVG-GKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYYTGWNLAVT 110
V VD+ +IL + P++ G FD+ G+ +++R D + + TGWN V+
Sbjct 57 RVSYVDMDKNLQEILKVSPAPVIAHAELGTFDEHGIFPFNVLRHNDVVMAWTTGWNRRVS 116
Query 111 VPWKNTIGVAIS-EAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMWYGSNLGW 169
V +IG+AIS + G F+R +T PV++ +P+ + +V+ G + MWY +GW
Sbjct 117 VSVDTSIGLAISRDGGNTFQRHATGPVMSASLHEPYLVGDAFVLHIEGRFHMWYIYGVGW 176
Query 170 GEGTDEIP----HVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWF 225
+ + P + I +A S DG+ W ++ + I D+ P V++ Y M F
Sbjct 177 KKQQSDSPPDRIYKIAHAVSDDGIDWVRESKPIIADRLGDDECQALPTVIKVGNRYHMIF 236
Query 226 CAR---------GAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRG 276
C R G YR+ A S+D +TW + WDS+M YP +F
Sbjct 237 CYRECFDFRLGAGRGYRLGYAWSDDLITWHRDDTQVPAISESGEWDSEMQCYPHLFQCDE 296
Query 277 QRFMLYSGDGYGRTGFGLAVL 297
+ ++LY+G+ +G+ GFGLA L
Sbjct 297 KVYLLYNGNAFGKEGFGLAEL 317
Lambda K H
0.321 0.140 0.468
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 502027544144
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40