BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2267c
Length=388
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609404|ref|NP_216783.1| hypothetical protein Rv2267c [Mycob... 801 0.0
gi|308232083|ref|ZP_07663997.1| hypothetical protein TMAG_00462 ... 739 0.0
gi|339295182|gb|AEJ47293.1| hypothetical protein CCDC5079_2103 [... 733 0.0
gi|340625605|ref|YP_004744057.1| hypothetical protein MCAN_05811... 630 1e-178
gi|240170114|ref|ZP_04748773.1| hypothetical protein MkanA1_1242... 613 1e-173
gi|168700662|ref|ZP_02732939.1| hypothetical protein GobsU_14152... 330 3e-88
gi|283782111|ref|YP_003372866.1| hypothetical protein Psta_4359 ... 319 4e-85
gi|327540657|gb|EGF27229.1| hypothetical protein RBWH47_00635 [R... 285 1e-74
gi|32474669|ref|NP_867663.1| hypothetical protein RB7157 [Rhodop... 284 2e-74
gi|87311169|ref|ZP_01093292.1| hypothetical protein DSM3645_1611... 264 2e-68
gi|149176793|ref|ZP_01855404.1| hypothetical protein PM8797T_151... 261 2e-67
gi|296121097|ref|YP_003628875.1| hypothetical protein Plim_0831 ... 240 4e-61
gi|325107012|ref|YP_004268080.1| hypothetical protein Plabr_0431... 223 4e-56
gi|325107656|ref|YP_004268724.1| hypothetical protein Plabr_1084... 219 6e-55
gi|332707922|ref|ZP_08427927.1| hypothetical protein LYNGBM3L_75... 216 6e-54
gi|303278906|ref|XP_003058746.1| predicted protein [Micromonas p... 184 2e-44
gi|308802137|ref|XP_003078382.1| unnamed protein product [Ostreo... 181 2e-43
gi|307592182|ref|YP_003899773.1| hypothetical protein Cyan7822_5... 178 1e-42
gi|326427251|gb|EGD72821.1| hypothetical protein PTSG_12190 [Sal... 176 7e-42
gi|332880316|ref|ZP_08447994.1| hypothetical protein HMPREF9074_... 172 7e-41
gi|258647598|ref|ZP_05735067.1| conserved hypothetical protein [... 172 1e-40
gi|330998081|ref|ZP_08321909.1| hypothetical protein HMPREF9442_... 169 5e-40
gi|255078840|ref|XP_002503000.1| predicted protein [Micromonas s... 168 1e-39
gi|145344489|ref|XP_001416764.1| predicted protein [Ostreococcus... 167 4e-39
gi|326435248|gb|EGD80818.1| hypothetical protein PTSG_01404 [Sal... 166 9e-39
gi|307109301|gb|EFN57539.1| hypothetical protein CHLNCDRAFT_1431... 162 7e-38
gi|325279282|ref|YP_004251824.1| hypothetical protein Odosp_0560... 162 1e-37
gi|326434796|gb|EGD80366.1| hypothetical protein PTSG_10621 [Sal... 160 3e-37
gi|333031249|ref|ZP_08459310.1| hypothetical protein Bcop_2162 [... 157 2e-36
gi|189463425|ref|ZP_03012210.1| hypothetical protein BACCOP_0414... 157 4e-36
gi|77165206|ref|YP_343731.1| sulfotransferase [Nitrosococcus oce... 155 1e-35
gi|254425194|ref|ZP_05038912.1| hypothetical protein S7335_5357 ... 149 1e-33
gi|339441104|ref|YP_004707109.1| hypothetical protein CXIVA_0040... 148 1e-33
gi|254883656|ref|ZP_05256366.1| conserved hypothetical protein [... 148 1e-33
gi|150003008|ref|YP_001297752.1| hypothetical protein BVU_0415 [... 148 1e-33
gi|294775643|ref|ZP_06741151.1| conserved hypothetical protein [... 148 2e-33
gi|237707998|ref|ZP_04538479.1| conserved hypothetical protein [... 147 4e-33
gi|212690546|ref|ZP_03298674.1| hypothetical protein BACDOR_0002... 146 5e-33
gi|159030655|emb|CAO88325.1| unnamed protein product [Microcysti... 143 5e-32
gi|166365918|ref|YP_001658191.1| hypothetical protein MAE_31770 ... 142 7e-32
gi|325300102|ref|YP_004260019.1| hypothetical protein Bacsa_3017... 141 2e-31
gi|167763506|ref|ZP_02435633.1| hypothetical protein BACSTE_0188... 140 3e-31
gi|218260668|ref|ZP_03475864.1| hypothetical protein PRABACTJOHN... 140 5e-31
gi|154492288|ref|ZP_02031914.1| hypothetical protein PARMER_0192... 139 6e-31
gi|116073267|ref|ZP_01470529.1| hypothetical protein RS9916_3249... 139 6e-31
gi|224540135|ref|ZP_03680674.1| hypothetical protein BACCELL_050... 139 9e-31
gi|256839695|ref|ZP_05545204.1| conserved hypothetical protein [... 139 9e-31
gi|198274257|ref|ZP_03206789.1| hypothetical protein BACPLE_0039... 138 1e-30
gi|299149160|ref|ZP_07042221.1| conserved hypothetical protein [... 137 3e-30
gi|315919130|ref|ZP_07915370.1| conserved hypothetical protein [... 137 3e-30
>gi|15609404|ref|NP_216783.1| hypothetical protein Rv2267c [Mycobacterium tuberculosis H37Rv]
gi|15841760|ref|NP_336797.1| hypothetical protein MT2329 [Mycobacterium tuberculosis CDC1551]
gi|31793446|ref|NP_855939.1| hypothetical protein Mb2290c [Mycobacterium bovis AF2122/97]
66 more sequence titles
Length=388
Score = 801 bits (2070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/388 (100%), Positives = 388/388 (100%), Gaps = 0/388 (0%)
Query 1 MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSC 60
MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSC
Sbjct 1 MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSC 60
Query 61 LGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHF 120
LGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHF
Sbjct 61 LGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHF 120
Query 121 LLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEE 180
LLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEE
Sbjct 121 LLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEE 180
Query 181 YLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHI 240
YLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHI
Sbjct 181 YLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHI 240
Query 241 VRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFY 300
VRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFY
Sbjct 241 VRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFY 300
Query 301 ELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVD 360
ELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVD
Sbjct 301 ELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVD 360
Query 361 EHWGEIIDRYGYDRHTPEPARLRPAVGG 388
EHWGEIIDRYGYDRHTPEPARLRPAVGG
Sbjct 361 EHWGEIIDRYGYDRHTPEPARLRPAVGG 388
>gi|308232083|ref|ZP_07663997.1| hypothetical protein TMAG_00462 [Mycobacterium tuberculosis SUMu001]
gi|308369674|ref|ZP_07666781.1| hypothetical protein TMBG_00820 [Mycobacterium tuberculosis SUMu002]
gi|308372193|ref|ZP_07427738.2| hypothetical protein TMDG_00750 [Mycobacterium tuberculosis SUMu004]
11 more sequence titles
Length=359
Score = 739 bits (1907), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/359 (100%), Positives = 359/359 (100%), Gaps = 0/359 (0%)
Query 30 MRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHW 89
MRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHW
Sbjct 1 MRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHW 60
Query 90 RTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHH 149
RTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHH
Sbjct 61 RTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHH 120
Query 150 PQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFR 209
PQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFR
Sbjct 121 PQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFR 180
Query 210 RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTF 269
RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTF
Sbjct 181 RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTF 240
Query 270 DGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFEC 329
DGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFEC
Sbjct 241 DGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFEC 300
Query 330 YLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG 388
YLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG
Sbjct 301 YLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG 359
>gi|339295182|gb|AEJ47293.1| hypothetical protein CCDC5079_2103 [Mycobacterium tuberculosis
CCDC5079]
gi|339298802|gb|AEJ50912.1| hypothetical protein CCDC5180_2075 [Mycobacterium tuberculosis
CCDC5180]
Length=356
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/356 (99%), Positives = 356/356 (100%), Gaps = 0/356 (0%)
Query 33 LIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTG 92
+IRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTG
Sbjct 1 MIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTG 60
Query 93 TTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQE 152
TTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQE
Sbjct 61 TTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQE 120
Query 153 DEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRK 212
DEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRK
Sbjct 121 DEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRK 180
Query 213 TVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGL 272
TVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGL
Sbjct 181 TVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGL 240
Query 273 DDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLP 332
DDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLP
Sbjct 241 DDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLP 300
Query 333 RLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG 388
RLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG
Sbjct 301 RLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG 356
>gi|340625605|ref|YP_004744057.1| hypothetical protein MCAN_05811 [Mycobacterium canettii CIPT
140010059]
gi|340003795|emb|CCC42921.1| unnamed protein product [Mycobacterium canettii CIPT 140010059]
Length=388
Score = 630 bits (1624), Expect = 1e-178, Method: Compositional matrix adjust.
Identities = 307/389 (79%), Positives = 340/389 (88%), Gaps = 2/389 (0%)
Query 1 MKALRSSSRLSRWR-EWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNS 59
M+ALR S+ L WR EWAAPLW+GC+FSAWMRLLIRNRFAVH SRWHF VLYT LS ++S
Sbjct 1 MRALRPSA-LRAWRQEWAAPLWIGCSFSAWMRLLIRNRFAVHWSRWHFVVLYTVLSALHS 59
Query 60 CLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHH 119
LGLWQK++FG+RVA+TVI +PPIFIVGHWRTGTTLLHELLV+D+RHTGPT YECL PHH
Sbjct 60 YLGLWQKVLFGKRVAKTVIVEPPIFIVGHWRTGTTLLHELLVLDERHTGPTSYECLVPHH 119
Query 120 FLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYE 179
FLLTEW AP EFLVSKHR MDNM+LSL HPQEDEFV CM G PS YLTIAFPNRPPQ
Sbjct 120 FLLTEWIAPLAEFLVSKHRVMDNMELSLRHPQEDEFVLCMLGQPSLYLTIAFPNRPPQDL 179
Query 180 EYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIH 239
YLDLEQ+ REL WK++LFRFVQQVYFRRRK VILKNP HSFRIKVLL++FPQAKFIH
Sbjct 180 RYLDLEQLTSRELAAWKQSLFRFVQQVYFRRRKRVILKNPPHSFRIKVLLDLFPQAKFIH 239
Query 240 IVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRF 299
IVRDPYVVYPST+HL K+LYR HGLQ+PTF GLD++V+STYVDLYRKLDEGR+LVDP+RF
Sbjct 240 IVRDPYVVYPSTVHLRKSLYRKHGLQRPTFAGLDEQVLSTYVDLYRKLDEGRKLVDPSRF 299
Query 300 YELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIV 359
YELRYEDLI DPE QLRRLY HL LG FE YLPRLR+YLADHA+Y+TNSY+LT EQRAIV
Sbjct 300 YELRYEDLIADPEEQLRRLYDHLELGGFERYLPRLRRYLADHAEYQTNSYELTAEQRAIV 359
Query 360 DEHWGEIIDRYGYDRHTPEPARLRPAVGG 388
+ WGE+IDRYGY TPEPA LRP GG
Sbjct 360 TQRWGEVIDRYGYGHPTPEPAHLRPMAGG 388
>gi|240170114|ref|ZP_04748773.1| hypothetical protein MkanA1_12426 [Mycobacterium kansasii ATCC
12478]
Length=400
Score = 613 bits (1581), Expect = 1e-173, Method: Compositional matrix adjust.
Identities = 292/362 (81%), Positives = 321/362 (89%), Gaps = 0/362 (0%)
Query 11 SRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFG 70
S W EWAAPLW+GCNFSAW RLLI NRFAVH SRWHFAVLYTFLS+VNS LG+ Q+ G
Sbjct 29 SWWHEWAAPLWIGCNFSAWTRLLIHNRFAVHWSRWHFAVLYTFLSVVNSVLGVCQQATLG 88
Query 71 RRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYV 130
RRVAETV+ADPP+FIVGHWRTGTTLLHELL++DD HT PTGYECLAP HFLLTEWFA +V
Sbjct 89 RRVAETVVADPPVFIVGHWRTGTTLLHELLILDDHHTAPTGYECLAPQHFLLTEWFARWV 148
Query 131 EFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPR 190
FLV HR MDNM+LSL HPQEDEF+WC+QGLPSPYL IAFPNRP +E Y+DLEQ+ PR
Sbjct 149 GFLVPTHRPMDNMELSLQHPQEDEFIWCVQGLPSPYLAIAFPNRPLAHERYVDLEQLTPR 208
Query 191 ELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPS 250
ELE WKRTLFRFVQQ+YFRRRKTVILKNP HSFRIKVLL+VFPQAKFIHIVRDPYVVYPS
Sbjct 209 ELEAWKRTLFRFVQQLYFRRRKTVILKNPIHSFRIKVLLDVFPQAKFIHIVRDPYVVYPS 268
Query 251 TIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGD 310
TIHLHKA RIH LQ+PTF GLDDKV+STYVDLYRKL+EGR+LV P+RFYELRYEDLI D
Sbjct 269 TIHLHKAFTRIHALQRPTFAGLDDKVLSTYVDLYRKLEEGRKLVAPSRFYELRYEDLIAD 328
Query 311 PEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRY 370
PEGQL RLY+HLGLGDFE PRLR+Y A+ ADY+TN+YQLT EQRA V +HWGE+IDRY
Sbjct 329 PEGQLCRLYEHLGLGDFERLRPRLRRYFAERADYETNTYQLTAEQRATVTQHWGEVIDRY 388
Query 371 GY 372
GY
Sbjct 389 GY 390
>gi|168700662|ref|ZP_02732939.1| hypothetical protein GobsU_14152 [Gemmata obscuriglobus UQM 2246]
Length=380
Score = 330 bits (846), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 166/362 (46%), Positives = 229/362 (64%), Gaps = 3/362 (0%)
Query 14 REWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRV 73
REWA LW GC+ W+RLL N +AV W+ A + + S+ N+ L G RV
Sbjct 20 REWAPRLWEGCDLFTWLRLLKDNGYAVQPPYWYIAAIVSANSVTNTVLRWCLNAAHGNRV 79
Query 74 AETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLT-EWFAPYVEF 132
ET + +PPIF++GHWRTGTTLLHELL+ D R P +C P H LLT + F Y +
Sbjct 80 RETKL-EPPIFVIGHWRTGTTLLHELLIRDTRFGFPDMQDCFNPQHALLTNQLFKRYASW 138
Query 133 LVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPREL 192
L+ R MDNM PQEDEF + GLP+ Y AFP+R P+ LDL + P++L
Sbjct 139 LLPDKRPMDNMPFGWERPQEDEFALALLGLPTTYTDFAFPDREPKDRGALDLSGLTPKQL 198
Query 193 EIWKRTLFRFVQQVYFR-RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPST 251
WKR RF+Q+V R K ++LK+P H+ R+ VLL+VFP AKF+HIVRDP V+PST
Sbjct 199 ARWKRVFVRFLQEVTVRIGGKRLVLKSPPHTARVPVLLDVFPDAKFVHIVRDPRAVFPST 258
Query 252 IHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDP 311
++L K L R HGLQ+PTF GL++KV+ + +Y +LDE R L P +F ELRYEDL+ +P
Sbjct 259 VNLWKTLARGHGLQRPTFPGLEEKVLREFRVIYDRLDEARPLFKPGQFAELRYEDLVREP 318
Query 312 EGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYG 371
L ++Y L +G +E P++ +Y +A+Y+ N + LT Q+A++ E WG++I RYG
Sbjct 319 VAALEQVYTTLEIGGYEAVRPKIEEYQRQNANYERNKFTLTDAQQALIAERWGDVIRRYG 378
Query 372 YD 373
Y+
Sbjct 379 YE 380
>gi|283782111|ref|YP_003372866.1| hypothetical protein Psta_4359 [Pirellula staleyi DSM 6068]
gi|283440564|gb|ADB19006.1| hypothetical protein Psta_4359 [Pirellula staleyi DSM 6068]
Length=393
Score = 319 bits (818), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 152/371 (41%), Positives = 227/371 (62%), Gaps = 1/371 (0%)
Query 5 RSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLW 64
R ++ + W+ W G W +L I++ F +H RW AVL ++ VNS L LW
Sbjct 23 RKQPKIHSYPFWSPRFWHGMRAGDWWKLCIKHGFRIHPIRWPMAVLLGMITPVNSILRLW 82
Query 65 QKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTE 124
Q+ +G R+ T I +PP+FI+GHWR+GTT LHE++ D+R PT Y+C APHHFLLTE
Sbjct 83 QRAQYGSRIDRTRIEEPPVFIIGHWRSGTTFLHEVMHQDERFYSPTTYQCFAPHHFLLTE 142
Query 125 WF-APYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLD 183
W A Y +L+ + R MDNM PQEDEF G P+PYL AFPN PP E+LD
Sbjct 143 WLIAGYGGWLMPRQRPMDNMATGWERPQEDEFALLTLGAPTPYLRCAFPNDPPPAVEFLD 202
Query 184 LEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRD 243
+E V P + + + + F + + FR +K ++LK+P H+ RI++L ++FP A+FIHIVR+
Sbjct 203 MEGVDPADEKKFSEAMIEFSKLITFRSQKQLLLKSPPHTGRIELLSKLFPGARFIHIVRN 262
Query 244 PYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELR 303
PY ++ ST+ L ++L + LQ P GL++ V+ +Y+ ++ R +DP E++
Sbjct 263 PYSLFSSTVRLWQSLDAVQSLQMPKHKGLEEFVLMCLTRMYQGYEKQRAKIDPAMIVEVK 322
Query 304 YEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHW 363
YEDL+ P +L R+Y L L E P++ ++L + DY+TN ++L E R +V EHW
Sbjct 323 YEDLVKSPMTELERIYGALKLPSIEGAKPKIEKFLTEQKDYQTNKHELDEESRKLVREHW 382
Query 364 GEIIDRYGYDR 374
G D+YGY++
Sbjct 383 GFYFDKYGYEK 393
>gi|327540657|gb|EGF27229.1| hypothetical protein RBWH47_00635 [Rhodopirellula baltica WH47]
Length=390
Score = 285 bits (729), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/367 (41%), Positives = 213/367 (59%), Gaps = 2/367 (0%)
Query 8 SRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKI 67
++L+ + ++ W G +AW RLL F + SR + + + VN+ L Q +
Sbjct 23 AKLNSYPFYSPRFWHGMRPAAWWRLLRSGSFEISPSRIPMVISVSLTTFVNTLLTWLQNV 82
Query 68 VFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWF- 126
+F RR+ E + PP+FIVGHWR+GTTLLHEL+V D+R + P+ ++C AP HFL+T+WF
Sbjct 83 LFARRLREAELHGPPVFIVGHWRSGTTLLHELMVRDERFSSPSTFQCFAPSHFLVTQWFF 142
Query 127 APYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQ 186
+ +L+ R MDNMD PQEDEF GLPSPY IAFP R EYLDL
Sbjct 143 RKFASWLLPGKRPMDNMDAGWERPQEDEFALMNLGLPSPYRRIAFPRRKQVDMEYLDLID 202
Query 187 VAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
V+ + E W TL F+ +V + +++K+PTH+ RI L FPQAKF+HI RDP
Sbjct 203 VSNEDRETWLSTLRSFLLRVSVSTNRPLVIKSPTHTGRIGHLARAFPQAKFVHITRDPRS 262
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
++PST L ++L + LQ +GLD+ V++ +Y R +D ++RYED
Sbjct 263 LFPSTCRLWRSLDEVQSLQTSDEEGLDEYVLTCLTKMYDSFHADRPEIDEHHIIDIRYED 322
Query 307 LIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLAD-HADYKTNSYQLTVEQRAIVDEHWGE 365
LI DP G LR +Y+ L L DF+ ++ + + H YKTN +QL +Q ++ + W +
Sbjct 323 LITDPVGTLRTIYESLRLSDFDTVSEDIQDWANNEHQQYKTNKHQLDPDQEKLLLDRWSD 382
Query 366 IIDRYGY 372
DRYGY
Sbjct 383 YFDRYGY 389
>gi|32474669|ref|NP_867663.1| hypothetical protein RB7157 [Rhodopirellula baltica SH 1]
gi|32445208|emb|CAD75210.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length=413
Score = 284 bits (727), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 146/367 (40%), Positives = 213/367 (59%), Gaps = 2/367 (0%)
Query 8 SRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKI 67
++L+ + ++ W G +AW RLL F + SR + + + VN+ L Q +
Sbjct 46 AKLNSYPFYSPRFWHGMRPAAWWRLLRSGSFEISPSRIPMVISVSLTTFVNTLLTWLQNV 105
Query 68 VFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWF- 126
+F RR+ E + PP+FIVGHWR+GTTLLHEL+V D+R + P+ ++C AP HFL+T+WF
Sbjct 106 LFARRLREAELHGPPVFIVGHWRSGTTLLHELMVRDERFSSPSTFQCFAPSHFLVTQWFF 165
Query 127 APYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQ 186
+ +L+ R MDNMD PQEDEF GLPSPY IAFP R EYLDL
Sbjct 166 RKFASWLLPGKRPMDNMDAGWERPQEDEFALMNLGLPSPYRRIAFPRRKQVDMEYLDLID 225
Query 187 VAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
V+ + E W TL F+ +V + +++K+PTH+ RI L FPQAKF+HI RDP
Sbjct 226 VSNEDRETWLSTLRSFLLRVSVSTNRPLVIKSPTHTGRIGHLARAFPQAKFVHITRDPRS 285
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
++PST L ++L + LQ +GLD+ V++ +Y R +D ++RYE+
Sbjct 286 LFPSTCRLWRSLDEVQSLQTSDEEGLDEYVLTCLAKMYDSFHADRPEIDEHHIIDIRYEN 345
Query 307 LIGDPEGQLRRLYQHLGLGDFECYLPRLRQYL-ADHADYKTNSYQLTVEQRAIVDEHWGE 365
LI DP G LR +Y+ L L DF+ ++ + +H YKTN +QL +Q ++ + W +
Sbjct 346 LIADPVGTLRTIYESLRLSDFDTVSEDIQDWADNEHRQYKTNKHQLDPDQEKLLLDRWSD 405
Query 366 IIDRYGY 372
DRYGY
Sbjct 406 YFDRYGY 412
>gi|87311169|ref|ZP_01093292.1| hypothetical protein DSM3645_16110 [Blastopirellula marina DSM
3645]
gi|87286077|gb|EAQ77988.1| hypothetical protein DSM3645_16110 [Blastopirellula marina DSM
3645]
Length=391
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 136/359 (38%), Positives = 203/359 (57%), Gaps = 2/359 (0%)
Query 16 WAAP-LWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVA 74
W +P +W G F WMRLL R+ FA+H R AVL T ++ NS Q + G ++A
Sbjct 19 WYSPRIWHGMRFRPWMRLLARHHFALHPLRIGMAVLVTPFTVFNSLAYRLQLALHGEKIA 78
Query 75 ETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLT-EWFAPYVEFL 133
P +FIVGHWR+GTT LHEL+ +D+ +T P+ +C P FLL ++ + + F+
Sbjct 79 AATPHTPMVFIVGHWRSGTTFLHELMSLDEAYTSPSTIQCFGPCQFLLIGDFVSRWFNFI 138
Query 134 VSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELE 193
+ R MDNM + PQEDEF G PSPY +AFP+ P + E+LD+E + +L
Sbjct 139 MPSTRPMDNMKVGWSKPQEDEFALLALGAPSPYYRMAFPDHPAEGTEFLDMEGIDEADLA 198
Query 194 IWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIH 253
W+ TL +FV+ + +R K +ILK+PTH+ RI +L E++P AKFIHI R+P V+ ST
Sbjct 199 KWRETLDQFVRMITVQRDKPIILKSPTHTGRIGLLSEMYPDAKFIHIARNPLEVFASTER 258
Query 254 LHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEG 313
L + + I Q P + + +Y + + P R E RYED++ DP G
Sbjct 259 LWQTMDEIQSFQHPKNPQYRQYIFDCFDRMYGGYFRDVDKLGPDRLVETRYEDIVADPVG 318
Query 314 QLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
+L ++Y LGLGDFE P++ A+ ++ N +Q+ + A + W + +RYGY
Sbjct 319 ELEKIYAALGLGDFEQVRPQMEAATAESRSFQRNKHQMEDDLAAEIYRRWSQYFERYGY 377
>gi|149176793|ref|ZP_01855404.1| hypothetical protein PM8797T_15101 [Planctomyces maris DSM 8797]
gi|148844434|gb|EDL58786.1| hypothetical protein PM8797T_15101 [Planctomyces maris DSM 8797]
Length=387
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/362 (38%), Positives = 216/362 (60%), Gaps = 13/362 (3%)
Query 20 LWVGCNFSAWMRLL-IRNRFAVHHSRWHFA---VLYTFLSMVNSCLGLWQKIVFGRRVAE 75
+W G F+ +++L+ +R R RW + LS+ NS + + +++ R+V +
Sbjct 18 VWSGIGFTNFVKLMSLRPRI-----RWSGLGRLISSGILSVSNSFFSMLENLIYSRKVKK 72
Query 76 TVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYV-EFLV 134
T + +PP+FI+GHWR+GTTLLH L+ DD+ P L P HFLLTE +V + L+
Sbjct 73 TQL-EPPVFIIGHWRSGTTLLHNLMSKDDQFIYPNMGAMLFPSHFLLTERVLKHVVKHLL 131
Query 135 SKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEI 194
K R MDNM ++ PQEDE + L SPYL I F ++P Y Y +L+Q+ PRE I
Sbjct 132 PKQRPMDNMPVTWDLPQEDETSIMLLHLMSPYLAITFSDQPEVYNRYYELDQLTPRETSI 191
Query 195 WKRTLFRFVQQVYFRR--RKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTI 252
WK+T F++++ ++ K ++LK+PTH+FRI LLE+FP A+F++I RDPY VY ST+
Sbjct 192 WKKTFLYFMKKLTYKAGANKHILLKSPTHTFRIPFLLEMFPDARFVYIYRDPYKVYNSTL 251
Query 253 HLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPE 312
HL K ++ +G + L++ + + YV+ + R++V + +E+R+EDL DP
Sbjct 252 HLRKTMFGDNGFAPLDMEKLEEDMSNIYVNHLNVYERDRKIVPEGQLHEVRFEDLEEDPV 311
Query 313 GQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
G+LR++Y+HL L FE ++ YL D YK N Y++ Q + E W + + +GY
Sbjct 312 GELRKVYEHLNLSGFEGLEQNMQPYLKDQKSYKKNKYEMDAAQEKKIYERWQKAFEMFGY 371
Query 373 DR 374
+R
Sbjct 372 ER 373
>gi|296121097|ref|YP_003628875.1| hypothetical protein Plim_0831 [Planctomyces limnophilus DSM
3776]
gi|296013437|gb|ADG66676.1| hypothetical protein Plim_0831 [Planctomyces limnophilus DSM
3776]
Length=437
Score = 240 bits (612), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 120/355 (34%), Positives = 199/355 (57%), Gaps = 2/355 (0%)
Query 20 LWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIA 79
+W G F ++L+ + R +H+SR V F+ NS + +++GR++ +T +
Sbjct 71 VWHGLTFGGLLQLMAK-RPRMHYSRALRLVSLFFICPFNSIYSMISGLIYGRKIQQTQVT 129
Query 80 DPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEW-FAPYVEFLVSKHR 138
PPIFI+GHWR+GTTLLH L+ +D + T P Y+ + P HFLLTE + + K R
Sbjct 130 KPPIFILGHWRSGTTLLHNLMTLDSQFTYPNLYQVMYPQHFLLTESVISKLAAPFLPKTR 189
Query 139 AMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRT 198
MDNM PQEDE ++ SPYL +AFPN Y D+ ++P + WKR+
Sbjct 190 PMDNMPAGWKLPQEDEVALLIETQLSPYLMVAFPNERKYYGHTFDVRHMSPGDQAKWKRS 249
Query 199 LFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKAL 258
L FV+++ R K +++K+P+H++R+ LLE+FP A+F++I RDPY V+ S++HL + +
Sbjct 250 LVNFVKKLTVRADKPIVMKSPSHTYRVATLLELFPDARFVYIHRDPYAVFSSSLHLRRTM 309
Query 259 YRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRL 318
Y + +P+ + L + T + +E R+++ E+RY DL P Q++R+
Sbjct 310 YMENSFIEPSEEMLYQDTLETLDTCLKTYEETRDMIPEKNLVEIRYTDLEAHPVEQMQRV 369
Query 319 YQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYD 373
Y+ LG ++ P + ++YK N + + E R ++ + D+YGYD
Sbjct 370 YETLGFDGWDRMKPIFEREAQAMSEYKKNRFIMDDETRQMIYSRLKDFFDKYGYD 424
>gi|325107012|ref|YP_004268080.1| hypothetical protein Plabr_0431 [Planctomyces brasiliensis DSM
5305]
gi|324967280|gb|ADY58058.1| hypothetical protein Plabr_0431 [Planctomyces brasiliensis DSM
5305]
Length=404
Score = 223 bits (568), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/359 (37%), Positives = 189/359 (53%), Gaps = 11/359 (3%)
Query 22 VGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADP 81
G S W RLL N F V W A T S+V S L W + R V ++ +P
Sbjct 36 AGVRCSDWWRLLAANDFYVSPRFWGKAAHLTVSSLVTSPLS-WLEGYLYRPVLDSTAVEP 94
Query 82 PIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWF-APYVEFLVSKHRAM 140
P+F++G WR+GTT LH LL D+R P Y+ + P F L+ W+ P + + + R M
Sbjct 95 PLFVLGSWRSGTTFLHNLLSQDERFAAPDLYQTMYPRTFRLSRWWWEPMLRMGLPRKRFM 154
Query 141 DNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLF 200
DN++ S P EDE + S L FP +YE YL E RE +K L
Sbjct 155 DNVEQSFSEPAEDEMAIGILSRRSNMLAWTFPRNEARYERYLTFEGTTEREQAEFKNALK 214
Query 201 RFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYR 260
FV++V R + +ILK+P H+ RI++LLE FP+AKF+HI R PY V+ S H+ + +
Sbjct 215 YFVRKVQQRAGRPLILKSPNHTARIRLLLETFPEAKFLHIRRHPYNVFRSFRHMARQVIP 274
Query 261 IHGLQQPTFDGLDDKVVSTYVDLYRKLDEG----RELVDPTRFYELRYEDLIGDPEGQLR 316
+ GLQ+ D +D+ +V LYRKL+E R+L+ R +E+ YEDL P ++
Sbjct 275 VWGLQKYNDDAIDEMIVR----LYRKLNEAYFAQRDLIPAGRLHEIAYEDLAAAPRAKVE 330
Query 317 RLYQHLGLGDFECYLPRLRQYLADHADYKTNSY-QLTVEQRAIVDEHWGEIIDRYGYDR 374
+Y+ L L DF P L YL + +Y+ N + + E R I+ WG DR+ Y+R
Sbjct 331 EIYEALNLPDFRQMKPALDAYLGEVGEYRKNRHADIPAETREILHREWGFCFDRWNYER 389
>gi|325107656|ref|YP_004268724.1| hypothetical protein Plabr_1084 [Planctomyces brasiliensis DSM
5305]
gi|324967924|gb|ADY58702.1| hypothetical protein Plabr_1084 [Planctomyces brasiliensis DSM
5305]
Length=375
Score = 219 bits (558), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 129/374 (35%), Positives = 205/374 (55%), Gaps = 3/374 (0%)
Query 1 MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSC 60
M S+++ R + +W G + R L + +H S+ H + + N+
Sbjct 1 MGKSTSTNKPVRHSQRGLVIWHGMRMRDF-RKLRKVGAELHWSQLHRILPTLGMLPYNTV 59
Query 61 LGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHF 120
+ + + +++AET + PP+F++GHWR+GTTLLH LL +DDR T P Y+C+ PHHF
Sbjct 60 MEKVEGWRYEKKLAETEVK-PPLFVLGHWRSGTTLLHNLLTLDDRFTYPNLYQCIFPHHF 118
Query 121 LLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYE 179
L TE A +LV K R MDNM+ PQEDE + SPY +AF +YE
Sbjct 119 LSTEKAMAGLTSWLVPKRRPMDNMETGWKLPQEDELALLLTTTYSPYRNLAFQGHRERYE 178
Query 180 EYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIH 239
+Y D + P+E E WK + RF++++ R K +I K+P H++R+++L E+FP AKF++
Sbjct 179 DYFDFKSADPQEREQWKAAMMRFMKKITLRTGKPIITKSPGHTYRVEILREMFPDAKFVY 238
Query 240 IVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRF 299
I R PY V STIHL +++ + L + + D+ V Y R +E ++ +
Sbjct 239 IHRHPYDVIRSTIHLRAVMFQTNALGKINLENHDELVYQAYEQCIRTYEEDKQNIPEGHL 298
Query 300 YELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIV 359
YEL+YE+ D G + ++Y +L L DFE P++ QY+A +YK N + +V
Sbjct 299 YELKYEEFEKDLLGHMHKVYDNLQLPDFEHVRPKIEQYVAGQKEYKKNVFPTDAALAEVV 358
Query 360 DEHWGEIIDRYGYD 373
+ ++D+YGYD
Sbjct 359 NTRMKFVLDKYGYD 372
>gi|332707922|ref|ZP_08427927.1| hypothetical protein LYNGBM3L_75560 [Lyngbya majuscula 3L]
gi|332353309|gb|EGJ32844.1| hypothetical protein LYNGBM3L_75560 [Lyngbya majuscula 3L]
Length=277
Score = 216 bits (550), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/259 (45%), Positives = 158/259 (62%), Gaps = 2/259 (0%)
Query 14 REWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRV 73
+ W LW G +F AW RLL +N FAV R H AV T S+ N+ L Q++ +GRRV
Sbjct 20 KPWMPKLWHGMDFFAWWRLLRKNHFAVEWRRAHTAVAVTGFSVANTSLRWLQELCYGRRV 79
Query 74 AETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAP-YVEF 132
T I DP IFI+GH+RTGTTLLHEL+ +D+R T PT YEC +P+HFLLTE F + F
Sbjct 80 RATEIQDP-IFIIGHYRTGTTLLHELIALDERLTFPTTYECFSPNHFLLTEAFVSRFFGF 138
Query 133 LVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPREL 192
L+ R DNM PQEDE +G +PY AFPN P Y DL + +
Sbjct 139 LLPAKRLQDNMHQGWGRPQEDESALLNRGAATPYARCAFPNHAPPYPGAEDLRTLPREQR 198
Query 193 EIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTI 252
E W + L +F++QV + R + +++K+P H+ R+ LLE+FP+A+F++ VR+P V+ ST
Sbjct 199 EQWMQVLEQFLRQVTYLRPRPIVVKSPLHTCRVPTLLEMFPRARFLYTVREPQAVFSSTC 258
Query 253 HLHKALYRIHGLQQPTFDG 271
L + +Y G Q+P + G
Sbjct 259 KLWRVIYENQGFQKPNYVG 277
>gi|303278906|ref|XP_003058746.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459906|gb|EEH57201.1| predicted protein [Micromonas pusilla CCMP1545]
Length=350
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 106/332 (32%), Positives = 173/332 (53%), Gaps = 7/332 (2%)
Query 49 VLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTG 108
+ T +S+VN+ + +++GR +A + D P+FI+GH RTGTT LH LL D
Sbjct 18 IFLTIMSLVNTIGAIADSVLYGRAIASQELNDEPVFILGHPRTGTTHLHNLLSRDPSFAF 77
Query 109 PTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWC-MQGLPSPYL 167
T + P FL W AP++ ++ R MDNM LS PQEDE + G SPY+
Sbjct 78 ATTFSVGFPSGFLSCRWLAPFMGAIMDDTRPMDNMALSHDTPQEDEVATNQLSGGASPYM 137
Query 168 TIAFPNRPPQYEEYLDL-EQVAPRELEIWKRTLFRFVQQVYFR---RRKTVILKNPTHSF 223
+ FP R + + + + + E+ WK + F+++ + +RK ++LK+P H+
Sbjct 138 PLMFPKREALFRRWYSMRDGASSAEIARWKESFLYFLRKTQYAAGGKRKRLLLKSPVHTA 197
Query 224 RIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDL 283
R+ VL E+FP+A+F+ I R+PY V+ S +H+ A Y Q P+ + + + ++ L
Sbjct 198 RVDVLREMFPKAQFVFIHRNPYEVFQSAVHMADAYYWQCYFQVPSAEDVQEFILYQGEYL 257
Query 284 YRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLG-LGDFECYLPRLRQYLADHA 342
+ + V +E+R+++L DP G LR LY LG +F P + Y
Sbjct 258 HDAYERDIRKVKKGNKHEVRFDELNKDPLGTLRALYDALGWSANFASIRPAIESYAGSLR 317
Query 343 DYKTNSY-QLTVEQRAIVDEHWGEIIDRYGYD 373
D+K N++ +L+ E + +V WG GYD
Sbjct 318 DFKMNAHARLSEEAKEVVRARWGNWFKDLGYD 349
>gi|308802137|ref|XP_003078382.1| unnamed protein product [Ostreococcus tauri]
gi|116056834|emb|CAL53123.1| unnamed protein product [Ostreococcus tauri]
Length=385
Score = 181 bits (458), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 115/348 (34%), Positives = 186/348 (54%), Gaps = 14/348 (4%)
Query 30 MRLLIRNRFAVHHSRWHFAVLY-TFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGH 88
M +L R+R A+ +R V + LSMVN+ L ++ R T I D P+FI+GH
Sbjct 29 MEMLWRHRDAIDWTRSMVRVGFLATLSMVNAVWALVDGALWVR-WRRTRIRDDPVFIIGH 87
Query 89 WRTGTTLLHELLVVDDRHTGP-TGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSL 147
RTGTT H L +D+ G T ++ P+ FL +EW +E ++ + R MDNM+L++
Sbjct 88 PRTGTTHAHNTLAMDEGRFGTCTTFDVGFPNGFLTSEWTKGALELMMDETRPMDNMELTM 147
Query 148 HHPQEDEFVW-CMQGLPSPYLTIAFPNRPPQYEEYLDLEQ------VAPRELEIWKRTLF 200
PQEDE + G SPY I F ++ ++ +L + + P EL+ WK
Sbjct 148 SSPQEDELATNILSGGASPYAAIMFMTEEERFRKFYELREDHEEYPIEPSELKRWKSAFL 207
Query 201 RFVQQVYFRR--RKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKAL 258
FV+++ ++R K ++LK+P H+ R+++L E+FP+A FI + R PY V+ S +++
Sbjct 208 TFVKKLQYKRGEDKRLLLKSPVHTARVRLLREMFPRASFIFMSRHPYDVFRSAVNMADKY 267
Query 259 YRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRL 318
Y ++PT + + ++ L+ + YE+R+EDL + EG +R+L
Sbjct 268 YWQCYFKEPTVAQVLEFILKQGEILHDAYIRDAAELPAEALYEIRFEDLDANLEGTMRKL 327
Query 319 YQHLGLGDFECYL-PRLRQYLADHADYKTNSY-QLTVEQRAIVDEHWG 364
Y+H G DFE L P+LR Y ++K NS+ +L E + IV W
Sbjct 328 YEHFGWDDFEDALAPKLRDYSESLRNFKKNSFSELDEETKKIVQRRWA 375
>gi|307592182|ref|YP_003899773.1| hypothetical protein Cyan7822_5853 [Cyanothece sp. PCC 7822]
gi|306985827|gb|ADN17707.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length=377
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/361 (31%), Positives = 188/361 (53%), Gaps = 9/361 (2%)
Query 19 PLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTF-LSMVNSCLGLWQKIVFGRRVAETV 77
PL G + R++I NR +++ LY F L + + ++++++F ++A T
Sbjct 13 PLGYGS-LRNFFRVIIANRGV--DTQYFIKFLYAFFLCLSGIPVRIFERVIFDHKIASTT 69
Query 78 IADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLL---TEWFAPYVEFLV 134
I PP+FI+GHWR+GTT LH L++ D +P FL ++ AP +E L+
Sbjct 70 IDYPPVFILGHWRSGTTYLHNLMIQDSNFAFVPSIYSYSPEMFLSLNSKKFMAPLLEALL 129
Query 135 SKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEI 194
R MDN+ S+H P+E+E+ S Y FP Q E L Q R L+
Sbjct 130 PNQRPMDNVAYSIHVPEEEEYAIGNMMPLSFYNGWMFPKYLRQNFERSVLFQGLSRSLKA 189
Query 195 -WKRTLFRFVQQV-YFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTI 252
W++ + +++ +F + K +++KNP ++ RI LL++FPQ+KFI+I R+PY VY ST
Sbjct 190 EWEKVYIKILKKTTFFSQGKRLLIKNPANTARIDTLLKLFPQSKFIYIYRNPYDVYSSTK 249
Query 253 HLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPE 312
++ L + LQQ + + ++D + Y L + E ++ + E++YED +G+
Sbjct 250 LFYEKLMPTYALQQISEEYIEDCIFDFYEQLINQYLESKQNIPLGNIIEIKYEDFLGNEM 309
Query 313 GQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
L ++Y L DFE QY+ + Y N + L + +D+ WG I+++GY
Sbjct 310 MYLNKIYTQFNLPDFEEKSQVFLQYVHSKSKYIKNQHSLDRDLVKKIDQRWGFFIEQWGY 369
Query 373 D 373
D
Sbjct 370 D 370
>gi|326427251|gb|EGD72821.1| hypothetical protein PTSG_12190 [Salpingoeca sp. ATCC 50818]
Length=413
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 113/365 (31%), Positives = 197/365 (54%), Gaps = 28/365 (7%)
Query 29 WMRLLIRNRFAVH-HSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVG 87
W+R L R R + W + TF S+V++ + + I+ G ++ I P+F++G
Sbjct 50 WIRFLWRFRSIITWRVYWRRILAVTFASIVSTAFAIIEWILNGAKIRNAAINKRPVFVLG 109
Query 88 HWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFL------------VS 135
H R+GTTLLH LL T + C P F++ + F+ ++
Sbjct 110 HPRSGTTLLHNLL-----SENTTDFFC--PTTFIV----GLHKSFIWRYNLRHKHGQHLT 158
Query 136 KHRAMDNMDLSLHHPQEDEFVWCMQGLP-SPYLTIAFPNRPPQYEEYLDLEQVAPRELEI 194
K R MD++ L++ PQEDEF + S Y + F + + ++Y+ L+ V RE +
Sbjct 159 KTRPMDDVALNIDTPQEDEFAYLRSTAGVSMYASFIFMSHSEELKKYIRLKDVDQRERDE 218
Query 195 WKRTLFRFVQQV-YFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIH 253
K + FV+++ + + ++LK+P+H+ ++K+LLE+FP A+F++I R+PY VY STI+
Sbjct 219 HKSAIMDFVRRLSVMAKGRRLLLKSPSHTGKVKLLLELFPDAQFVYIHRNPYRVYRSTIN 278
Query 254 LHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEG 313
L L + L PT +++ V + Y +L+ E R+L+ E+ Y++L D G
Sbjct 279 LFDKLLWYNFLSMPTNAQMNEFVFAMYEELFAGYMEDRKLIPKHNLVEISYDELQADKIG 338
Query 314 QLRRLYQHLGLGDFECY-LPRLRQYLADHADYKTNSYQ-LTVEQRAIVDEHWGEIIDRYG 371
+R++Y+ LG DFE LP+L+++L + D++ N ++ LT QR ++ WG + +G
Sbjct 339 TIRKVYEQLGWPDFETVALPKLKEHLNEIRDFQKNVFEPLTSAQRDAINRRWGAAFEAFG 398
Query 372 YDRHT 376
YD T
Sbjct 399 YDMET 403
>gi|332880316|ref|ZP_08447994.1| hypothetical protein HMPREF9074_03768 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332681761|gb|EGJ54680.1| hypothetical protein HMPREF9074_03768 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length=371
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 101/324 (32%), Positives = 166/324 (52%), Gaps = 10/324 (3%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q F ++ P+FI+GHWR+GTT +H + DD T Y+ + PH
Sbjct 50 SLLAPIQDRRFEEKLGAYEFDHDPVFILGHWRSGTTFVHNIFAQDDNFCYTTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQ 177
+ + +F + +L+ R DNM+L+ PQE+EF S Y FP + +
Sbjct 110 LMMFGQPFFKKTMGWLMPNKRPTDNMELAPDLPQEEEFALSNMMPYSFYDFWFFPQKWQE 169
Query 178 Y-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTV-----ILKNPTHSFRIKVLLEV 231
Y ++YL E + EL+++K T FV+ + R T + KNP H+ R+K L+E+
Sbjct 170 YCDKYLTFENITKEELQVFKET---FVKLMKISRYCTTGGDVYLSKNPPHTGRVKALVEM 226
Query 232 FPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGR 291
FP AKFI+++R+PY V+ ST + LQ + + ++ ++ TY LYR +E +
Sbjct 227 FPNAKFIYLMRNPYTVFESTRSFFTNTIKPLELQHISDEEMEKNILLTYTKLYRAYEEQK 286
Query 292 ELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQL 351
+ V +E+++ED D G + Y+ LG+ +F+C +RQY YK N Y+
Sbjct 287 KYVPEGNLFEVKFEDFEADAFGTTKLAYEKLGIREFDCAEAAIRQYTDRKKGYKKNKYEY 346
Query 352 TVEQRAIVDEHWGEIIDRYGYDRH 375
+V+E+WG + + Y+ H
Sbjct 347 KPRTIQLVNENWGYALKDWDYEIH 370
>gi|258647598|ref|ZP_05735067.1| conserved hypothetical protein [Prevotella tannerae ATCC 51259]
gi|260852406|gb|EEX72275.1| conserved hypothetical protein [Prevotella tannerae ATCC 51259]
Length=369
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 100/320 (32%), Positives = 164/320 (52%), Gaps = 5/320 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ F +++AE + P+FI+GHWR+GTT +H +L D T Y+ + PH
Sbjct 50 SLLAPLQEKRFQKKLAEKPLEHAPVFILGHWRSGTTFVHNVLSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ HR DNM+L++ PQE+EF +P Y F P R
Sbjct 110 LMMFGQSFFKQTMSWLMPSHRPTDNMELAVDLPQEEEFT-MTNMMPYTYYNFWFLPQRMR 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y + +L E ++ EL ++ T + ++ + T L KNP H+ R++ L+ +FP
Sbjct 169 EYADRFLCFENISEEELRTFEETFVKIIKISLWNTGGTQFLSKNPPHTGRVRELVRMFPD 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST R LQ L D ++ Y L+RK + +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIRPLQLQDIAETELVDNILYVYEKLHRKYQSEKAFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
ELR+ED + Q LYQ L + +E ++QY Y+ N Y E
Sbjct 289 PAGNLVELRFEDFESNAYAQTEMLYQKLSIPGWEEAQAAIKQYTDAKKGYQKNKYAYKPE 348
Query 355 QRAIVDEHWGEIIDRYGYDR 374
A+V+ HWG+I++ + Y++
Sbjct 349 TVALVNRHWGDIVEHWNYEK 368
>gi|330998081|ref|ZP_08321909.1| hypothetical protein HMPREF9442_03016 [Paraprevotella xylaniphila
YIT 11841]
gi|329569170|gb|EGG50961.1| hypothetical protein HMPREF9442_03016 [Paraprevotella xylaniphila
YIT 11841]
Length=371
Score = 169 bits (429), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 100/324 (31%), Positives = 165/324 (51%), Gaps = 10/324 (3%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q F ++ P+FI+GHWR+GTT +H + DD T Y+ + PH
Sbjct 50 SLLAPIQDRRFEEKLGAYEFDHDPVFILGHWRSGTTFVHNIFAQDDNFCYTTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQ 177
+ + +F + +L+ R DNM+L+ PQE+EF S Y FP + +
Sbjct 110 LMMFGQPFFKKTMGWLMPDKRPTDNMELAPDLPQEEEFALSNMMPYSFYDFWFFPQKWQE 169
Query 178 Y-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTV-----ILKNPTHSFRIKVLLEV 231
Y ++YL E + EL+++K T FV+ + R T + KNP H+ R+K L+E+
Sbjct 170 YCDKYLTFENITKEELQVFKET---FVKLMKISRYCTTGGDVYLSKNPPHTGRVKALVEM 226
Query 232 FPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGR 291
FP AKFI+++R+PY V+ ST + LQ + + ++ ++ TY LYR +E +
Sbjct 227 FPNAKFIYLMRNPYTVFESTRSFFSNTIKPLELQHISDEEMEKNILLTYTKLYRAYEEQK 286
Query 292 ELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQL 351
+ V +E+++ED D G + Y+ LG+ +F C +R+Y YK N Y+
Sbjct 287 KYVPEGNLFEVKFEDFEADAFGTTKLAYEKLGIREFHCAEAAIRRYTDRKKGYKKNKYEY 346
Query 352 TVEQRAIVDEHWGEIIDRYGYDRH 375
+V+E+WG + + Y+ H
Sbjct 347 KPRTIQLVNENWGYALKDWDYEIH 370
>gi|255078840|ref|XP_002503000.1| predicted protein [Micromonas sp. RCC299]
gi|226518266|gb|ACO64258.1| predicted protein [Micromonas sp. RCC299]
Length=380
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/360 (31%), Positives = 182/360 (51%), Gaps = 9/360 (2%)
Query 23 GCNFSAWMRLLIRNRFAV--HHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIAD 80
G W RLL R R+ + + W + T L+ +N+ + I++ ++ + D
Sbjct 22 GVTLLQWARLL-RARWTQIDYLTYWPRLIFLTLLAALNTIGAIADWILYDAKIRAQELND 80
Query 81 PPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAM 140
P+F++GH RTGTT LH LL D R ++ P FL T W AP++ ++ R M
Sbjct 81 EPVFVLGHPRTGTTHLHNLLSKDPRFAYANTFQVGFPSSFLSTSWLAPHMGLIMDSTRPM 140
Query 141 DNMDLSLHHPQEDEF-VWCMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTL 199
DNM L+ PQEDE V + SPY + F R P++ ++ D + + W+ +
Sbjct 141 DNMALAWDTPQEDEVAVNQLSSGASPYAPLLFMRREPEFRKFYDFDDCDADDFARWRDSF 200
Query 200 FRFVQQVYFR---RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHK 256
F++++ F + K ++LK+P H+ R+K+L E+FP+A FI + R PY V+ S + +
Sbjct 201 VYFLRKIQFAAGGKHKRLLLKSPVHTARVKLLKEMFPKATFIFVHRHPYEVFKSAVTMAD 260
Query 257 ALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLR 316
Y LQ+P + + + ++ L+RK E V R E+ +E++ + L
Sbjct 261 RYYWQCYLQKPRVEDVQEFILYQGELLHRKYTEDVRGVSEARKMEVSFEEVTENTVTALS 320
Query 317 RLYQHLGLG-DFECYLPRLRQYLADHADYKTNSY-QLTVEQRAIVDEHWGEIIDRYGYDR 374
++Y+ LG G DF + P + Y D+K N + +L + RA+V E W D GY R
Sbjct 321 QVYKALGWGKDFARFKPVVEAYSQSLRDFKMNEHKELGEDARAVVRERWKAWFDDLGYAR 380
>gi|145344489|ref|XP_001416764.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576990|gb|ABO95057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length=389
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/370 (29%), Positives = 180/370 (49%), Gaps = 24/370 (6%)
Query 23 GCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIV------FGRRVAET 76
G W R L H R AV + M +C+ L + R T
Sbjct 25 GVTLVGWARTLW------AHGRSIDAVAFAPRLMFLTCMALANTLAAIADGALRPRWGRT 78
Query 77 VIADPPIFIVGHWRTGTTLLHELLVVDD-RHTGPTGYECLAPHHFLLTEWFAPYVEFLVS 135
+ D P+F++GH RTGTT LH +L D+ R T ++ P FL + + PY+ ++
Sbjct 79 KVRDDPVFVLGHPRTGTTHLHNILAKDETRFAAATTFDVGFPSGFLSSGFVKPYLAKMMD 138
Query 136 KHRAMDNMDLSLHHPQEDEFVWC-MQGLPSPYLTIAFPNRPPQYEEYLDLEQ------VA 188
R MDNM L++ PQEDE + G SPY + F ++ +Y +L + +
Sbjct 139 STRPMDNMALTMDTPQEDELATNQLSGCASPYAPLMFMRDEAKFRKYYELREDHDEYPIE 198
Query 189 PRELEIWKRTLFRFVQQVYFR--RRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
ELE WK F+ ++ ++ K ++LK+P H+ R++VL ++FP+A+F+ I R PY
Sbjct 199 RAELEAWKSAFMTFMTKLQYKHGEHKRLVLKSPVHAARVEVLRKLFPRAQFVFISRHPYD 258
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
V+ S +++ Y LQ+PT + + ++ L+ + + +E R++D
Sbjct 259 VFRSAVNMADKYYWQCFLQRPTVADVQEFILKQGEILHDAYVRDSKSLPREALFETRFDD 318
Query 307 LIGDPEGQLRRLYQHLGLGDF-ECYLPRLRQYLADHADYKTNSY-QLTVEQRAIVDEHWG 364
L DP G L ++Y+H G F E P L++Y AD+K NS+ +L+ + + +++ W
Sbjct 319 LDADPVGTLSKIYKHFGWDGFDETVAPVLKEYATSLADFKKNSFAELSDDAKEVINSRWA 378
Query 365 EIIDRYGYDR 374
Y++
Sbjct 379 RWFTDLNYEK 388
>gi|326435248|gb|EGD80818.1| hypothetical protein PTSG_01404 [Salpingoeca sp. ATCC 50818]
Length=407
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 108/365 (30%), Positives = 187/365 (52%), Gaps = 6/365 (1%)
Query 22 VGCNFSAWMRLLIRNRFAVHHSRWHFAVLY-TFLSMVNSCLGLWQKIVFGRRVAETVIAD 80
+G W+ +L + +A+ + F VL+ TF++ +NS L + + F R+ VI
Sbjct 37 LGVTLGPWLTVLWKYGYAIEWKHYWFRVLFLTFMACLNSTLSFLEWLFFRHRIRSAVINR 96
Query 81 PPIFIVGHWRTGTTLLHELLVVDD-RHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRA 139
P+FI+GH RTGTT LH L+ +DD PT +LL + ++S R
Sbjct 97 RPVFILGHPRTGTTHLHNLISLDDDEFFAPTTLAAGFSAAYLLLHPVRHLLSGVLSDTRP 156
Query 140 MDNMDLSLHHPQEDEFVWCMQG-LPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRT 198
MDNM L+ PQEDE + L S Y + F P++ +Y ++ V+ E + +
Sbjct 157 MDNMALTFDVPQEDELSYTQSTPLLSMYSPLVFMTEEPKFRKYFRMQDVSQDEKKRYTDV 216
Query 199 LFRFVQQVYFRRR-KTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKA 257
+ F+Q++ + + +LK+PTH+ +++ LLE+FP+A+FI+I R PY V+ S +++
Sbjct 217 MLAFLQKLAVHAQGRRFVLKSPTHTAKVRFLLELFPEAQFIYIHRHPYRVFRSAMNMADK 276
Query 258 LYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRR 317
Y L PT + + + V+ Y +L+ E R L+ E+ +++L P + R
Sbjct 277 TYWYSYLATPTNEQVAEFVMHQYEELFDAYMEDRSLIPEGNLVEVSFDELQQQPLQTMER 336
Query 318 LYQHLGLGDFECYL-PRLRQYLADHADYKTNSYQ-LTVEQRAIVDEHWGEIIDRYGYDRH 375
+Y L F+ + P+L++YL +K N+++ LT QR V+ W + +GY
Sbjct 337 IYTTLQWTGFDDRVKPKLQRYLKSLRGFKKNAFETLTDTQRQEVNRRWRKSFKAFGYTMQ 396
Query 376 TPEPA 380
+ A
Sbjct 397 EKQGA 401
>gi|307109301|gb|EFN57539.1| hypothetical protein CHLNCDRAFT_143160 [Chlorella variabilis]
Length=436
Score = 162 bits (411), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 109/342 (32%), Positives = 172/342 (51%), Gaps = 15/342 (4%)
Query 46 HFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDR 105
H A + ++ +NS L L +++GR VA + P+ I+GH RTGTT +H LL +D +
Sbjct 57 HRAAFLSLMACLNSLLSLVDSLLYGRAVAAQQLHPQPVIILGHPRTGTTHIHNLLALDPQ 116
Query 106 HTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEF-VWCMQGLPS 164
P FL E F + LV R MD M LSL P EDE V + G S
Sbjct 117 FAYARTLHAGFPASFLALERFKWLLAGLVDDTRPMDFMPLSLDTPAEDEIAVSALTGTVS 176
Query 165 PYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRR-----------KT 213
Y+ + F +++ + E + E + W+ +L F++++ RR K
Sbjct 177 AYMPLVFMRDRHRFDAFYTFEGASEAEFDSWRSSLLWFLKKLEQRRGLPQVTLRWGGCKP 236
Query 214 VILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLD 273
+++K+P H+ R+K+LL++FP+A+F+++ RDP + S H+ Y LQ+PT +
Sbjct 237 LLIKSPVHTARLKLLLKLFPRARFVYVHRDPLSTFQSAAHMANTYYWYCYLQRPTDAAVT 296
Query 274 DKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPR 333
D ++ + LYR R+LV P E+ + +L DP G LRRLY L LGDF+ P
Sbjct 297 DFILEQFSLLYRIFTADRKLVPPGNLVEVSFAELDSDPLGTLRRLYTSLDLGDFQAVRPA 356
Query 334 LRQYLA--DHADYKTNSYQ-LTVEQRAIVDEHWGEIIDRYGY 372
+Y + + +K N ++ L+ E R V W +GY
Sbjct 357 FERYCGGLEMSGFKKNKHRPLSPELRRRVQHLWDPFYREFGY 398
>gi|325279282|ref|YP_004251824.1| hypothetical protein Odosp_0560 [Odoribacter splanchnicus DSM
20712]
gi|324311091|gb|ADY31644.1| hypothetical protein Odosp_0560 [Odoribacter splanchnicus DSM
20712]
Length=369
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/318 (29%), Positives = 163/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
SCL Q + +R+ + I P+FI+GHWR+GTT +H +L D T Y+ + PH
Sbjct 50 SCLKPIQDRRYDKRLKDQAINMEPVFILGHWRSGTTFVHNVLAHDKHFGYTTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 MMMWGQPMFKKTMAWLMPDKRPTDNMELNVDLPQEEEFALS-NMMPCSYYDFWFLPQNML 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y + +L ++ P E +++ T + ++ + + + L KNP H+ ++K +LE+FP
Sbjct 169 EYCDRFLTMKTATPEEHRMFRETFLKLIKISLWNTQGSQFLSKNPPHTGKVKEILEMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST LQ+ + + L+ ++ Y LYRK +E ++L+
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIIPLQLQKISPEELEKNILEVYTRLYRKYEEDKKLI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D ++Y+ L + FE + YL YK N+Y+
Sbjct 289 PAGNLIEIKFEDFEADALAMTEKIYRTLAIPGFEAAKADIAAYLDKKKGYKKNAYKYETR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V++HW + ++ Y
Sbjct 349 TVELVEKHWDYALKQWDY 366
>gi|326434796|gb|EGD80366.1| hypothetical protein PTSG_10621 [Salpingoeca sp. ATCC 50818]
Length=404
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/361 (30%), Positives = 180/361 (50%), Gaps = 6/361 (1%)
Query 22 VGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTF-LSMVNSCLGLWQKIVFGRRVAETVIAD 80
VG + W ++++ + + + F VL+ F ++ VN+ L + + G R ++ VI
Sbjct 34 VGMRLAQWWKVVVGHWRDIDWRHYWFRVLFLFIMACVNTVLTGLEYVFHGHRTSDVVINK 93
Query 81 PPIFIVGHWRTGTTLLHELLVVD-DRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRA 139
P+F++GH R+GTTLLH L ++ D+ PT + + L + +V R
Sbjct 94 RPVFLLGHNRSGTTLLHNLFSLNTDQFRVPTTFSVGFSAIYFLLYPIRRVMNSIVDPSRP 153
Query 140 MDNMDLSLHHPQEDEFVWCMQG-LPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRT 198
MDN+ LS+ PQEDE + L SPY FP Y +Y + V E +
Sbjct 154 MDNLPLSMDVPQEDELAYNQSTPLLSPYANNIFPREADHYHKYFRMIDVPAEERARYMEL 213
Query 199 LFRFVQQVYFRRR-KTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKA 257
V+Q+ + + K+P H+ ++K+LLE FP A+F+ I R+PY V+ S +HL
Sbjct 214 FRAMVKQLSVHAEGRRLCFKSPPHTAKVKLLLEEFPDAQFVFIHRNPYRVFRSMLHLADN 273
Query 258 LYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRR 317
L+ LQ + L + +++ Y ++ E R+L+ E+ +++L D +RR
Sbjct 274 LWGHSTLQTASDARLLETILTMYEVVHDAYLEDRKLIPKGNLVEISFDELQRDKIATMRR 333
Query 318 LYQHLGLGDFE-CYLPRLRQYLADHADYKTNSY-QLTVEQRAIVDEHWGEIIDRYGYDRH 375
+Y+ L +G FE LP L ++ + +YK N++ LT QR IV+ W +GY
Sbjct 334 IYESLKIGGFEKSALPALEAHVKEIKNYKKNAFVGLTDAQRRIVNTRWARFFTAFGYKMQ 393
Query 376 T 376
T
Sbjct 394 T 394
>gi|333031249|ref|ZP_08459310.1| hypothetical protein Bcop_2162 [Bacteroides coprosuis DSM 18011]
gi|332741846|gb|EGJ72328.1| hypothetical protein Bcop_2162 [Bacteroides coprosuis DSM 18011]
Length=369
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/315 (30%), Positives = 169/315 (54%), Gaps = 11/315 (3%)
Query 65 QKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTE 124
Q F +++A +++ P+FI+GHWR+GTT +H +L D R T Y+ + PH + +
Sbjct 56 QNKRFDKKLANIPLSEDPVFILGHWRSGTTFVHNVLSCDKRFGYNTTYQTVFPHLMMWGQ 115
Query 125 -WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQY----- 178
+F + FL+ R DNM+L++ PQE+EF +P Y F PQY
Sbjct 116 TFFKGNMSFLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFL---PQYMQEYA 171
Query 179 EEYLDLEQVAPRELEIWKRTLFRFVQ-QVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKF 237
++YL ++ EL+I++ T + ++ ++ + + + KNP H+ R+K L+++FP AKF
Sbjct 172 DKYLLFNDISENELQIFEETFKKLIKISLWNTKGEQFLSKNPPHTGRVKELIKMFPNAKF 231
Query 238 IHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPT 297
I+++R+PY V ST + LQ + + ++ ++S Y LY + + + L+
Sbjct 232 IYLMRNPYTVLESTRSFFTNTIQPLKLQDISNEEIEKNIISIYAKLYHQYEAEKHLIPEG 291
Query 298 RFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRA 357
E+++ED D G +++Y+ L L F+ ++ Y+ + YK N Y+
Sbjct 292 NLIEVKFEDFEADAMGMTQKIYESLNLKGFDEAKGAIQNYVGEKKGYKKNKYKYDDRTIK 351
Query 358 IVDEHWGEIIDRYGY 372
+V+E+WG + ++GY
Sbjct 352 LVEENWGFALKQWGY 366
>gi|189463425|ref|ZP_03012210.1| hypothetical protein BACCOP_04144 [Bacteroides coprocola DSM
17136]
gi|189429854|gb|EDU98838.1| hypothetical protein BACCOP_04144 [Bacteroides coprocola DSM
17136]
Length=368
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 164/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLKPLQDKRYEKLLANQPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKHMQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + ++ EL++++ T + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDISDEELKVFEETFTKLIKISLWNTHGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + + L + ++S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDISNEQLQENILSVYAKLYHKYEADKKFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED + + +YQ L + F+ P + Y+ YK N YQ E
Sbjct 289 PEGNLVEVKFEDYEKNAFDLTQEIYQKLSIPGFDEARPAIEAYVNKKKGYKKNQYQYKPE 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+++W +D++GY
Sbjct 349 TVELVEKNWSFALDQWGY 366
>gi|77165206|ref|YP_343731.1| sulfotransferase [Nitrosococcus oceani ATCC 19707]
gi|254433203|ref|ZP_05046711.1| hypothetical protein NOC27_134 [Nitrosococcus oceani AFC27]
gi|76883520|gb|ABA58201.1| possible sulfotransferase [Nitrosococcus oceani ATCC 19707]
gi|207089536|gb|EDZ66807.1| hypothetical protein NOC27_134 [Nitrosococcus oceani AFC27]
Length=336
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/308 (34%), Positives = 155/308 (51%), Gaps = 3/308 (0%)
Query 69 FGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLL-TEWFA 127
+ RRV IA P+FIVGHWR+GTT L LL D + + T + P +LL +E
Sbjct 28 YHRRVERQEIAPDPLFIVGHWRSGTTHLQNLLNCDPQFSCVTLLQAGMPREYLLLSEGVK 87
Query 128 PYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQ-YEEYLDLEQ 186
++ L+ R MDN+ ++ P E+E S Y FP + ++E + +
Sbjct 88 RWLGRLLPSTRLMDNVSIAADVPWEEELALAAASRYSFYHVSFFPRSMERIFDEAVMFDS 147
Query 187 VAPRELEIWKRTLFRFVQQV-YFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPY 245
V + W RF+Q V Y + + ++LKNP ++ RI++L + FP+A+FIHI R+PY
Sbjct 148 VPQAAIRKWWTGYLRFLQMVQYDQPGRRLLLKNPANTARIRLLKKRFPKAQFIHIHRNPY 207
Query 246 VVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYE 305
V+ S++HL+ GLQ + V+++Y L R E RE++ T E+ +
Sbjct 208 KVFVSSVHLYLQAQNAWGLQSTDRQRVVAHVLASYPQLMRAYFEQREVLAETDLAEVSFA 267
Query 306 DLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGE 365
L P L +Y L L FE +PR R YL Y+ N +LT +RA V W +
Sbjct 268 SLQKAPLETLESIYCRLDLTGFEEAVPRFRAYLERQKGYRKNRLELTESERAAVATCWRD 327
Query 366 IIDRYGYD 373
I GY+
Sbjct 328 IFTGLGYE 335
>gi|254425194|ref|ZP_05038912.1| hypothetical protein S7335_5357 [Synechococcus sp. PCC 7335]
gi|196192683|gb|EDX87647.1| hypothetical protein S7335_5357 [Synechococcus sp. PCC 7335]
Length=367
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 91/301 (31%), Positives = 153/301 (51%), Gaps = 9/301 (2%)
Query 79 ADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFL-LTEWFAPYVEFLVSKH 137
+ PPIFI+GHWR+GTT LH +L + + P FL L P +E + K
Sbjct 65 SKPPIFIIGHWRSGTTFLHSVLSQSPQFAYTSPLAVGLPWDFLTLGNALRPILEGALPKD 124
Query 138 RAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLD----LEQVAPRELE 193
R +D + ++ PQEDE L S Y + FP Q+ ++ + E E+
Sbjct 125 RFIDRVPVNPDSPQEDEIALASMQLLSFYQGLYFPK---QFAKHFNAGIFFEGCTDIEMT 181
Query 194 IWKRTLFRFVQQVYFRR-RKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTI 252
W++ + F +++ + + +++KNP ++ R+K L E++P+AKFIHI R+PY+VY ST+
Sbjct 182 EWQQAMVLFCKKLQIQNPHQQLLIKNPVYTARVKKLRELWPKAKFIHIYRNPYIVYRSTL 241
Query 253 HLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPE 312
+ + L+R LQ +++ V+ +Y + + F ELR+E +P
Sbjct 242 NFYDKLFRELSLQSFEQVPVEEIVLESYPKMIEAAQRETRALPTQDFVELRFETFETNPV 301
Query 313 GQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
QL ++Y L L +E LP ++YL Y+ N Y + V + W ++DR+GY
Sbjct 302 EQLEKIYDRLELTGWEEDLPHFQRYLESQKHYRKNDYAFPADMIERVRDRWQPLLDRWGY 361
Query 373 D 373
+
Sbjct 362 E 362
>gi|339441104|ref|YP_004707109.1| hypothetical protein CXIVA_00400 [Clostridium sp. SY8519]
gi|338900505|dbj|BAK46007.1| hypothetical protein CXIVA_00400 [Clostridium sp. SY8519]
Length=370
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/357 (29%), Positives = 166/357 (47%), Gaps = 9/357 (2%)
Query 22 VGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADP 81
+GC W+ LL R+ +R A TF+ + + L +K+++ RR+ T +
Sbjct 12 MGCTLGNWIALL-RDNPITRENRPQ-AAFMTFVISLLTPPALAEKLIYDRRIKATRLKKD 69
Query 82 PIFIVGHWRTGTTLLHELLVVDDRHT--GPTGYECLAPHHFLLTEWFAPYVEFLVSKHRA 139
PI+IVG WR+GTT L LL D + P + LL Y+ + R
Sbjct 70 PIYIVGFWRSGTTFLQNLLTRDPQFAWFDPVNTVTFN-NSILLRPILEKYMNVFLKGARP 128
Query 140 MDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPN--RPPQYEEYLDLEQVAPRELEIWKR 197
MDN++ + P E+ F + + FP+ R +Y E + + + R+ W+R
Sbjct 129 MDNLEYTTDLPMEEVFAQATISTQAISHMLVFPDGGRGTKYIETAFISEQSSRKKRQWRR 188
Query 198 TLFRFVQQVYF-RRRKTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHK 256
+++ F + K ++LK+P ++ RI L + +P AKFI+I R PY + PSTI++
Sbjct 189 AYDYILKKATFVKDGKQLLLKSPENTCRIDALKKCYPAAKFINIFRHPYALIPSTINMFT 248
Query 257 ALYRIHGLQQPT-FDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQL 315
L P + ++D + +YRK E + P ++RYED DPE L
Sbjct 249 KEMDNFCLNTPAPREVIEDVSIDLCARVYRKAIHELEEMKPEDHIDIRYEDFCQDPEAYL 308
Query 316 RRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
R++YQ L L + P YL +Y+ N +QL R +++ D YGY
Sbjct 309 RKIYQQLQLEGYAEARPYFEDYLDSQKNYQKNHFQLEDRIRRKINDRLDFYFDYYGY 365
>gi|254883656|ref|ZP_05256366.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA]
gi|319642206|ref|ZP_07996866.1| hypothetical protein HMPREF9011_02466 [Bacteroides sp. 3_1_40A]
gi|254836449|gb|EET16758.1| hypothetical protein BSFG_02905 [Bacteroides sp. 4_3_47FAA]
gi|317386192|gb|EFV67111.1| hypothetical protein HMPREF9011_02466 [Bacteroides sp. 3_1_40A]
Length=368
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 164/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQEKRYRKLLADKPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL ++ EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFNDISDEELKVFEDIFTKLIKISLWNTGGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + + L+ + + L+ V+S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRNFFTNTIQPLKLEDISPEALEQNVLSIYTKLYHKYEADKQFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D +Y+ L + FE P + QY+ YK N Y+
Sbjct 289 PEGNLMEVKFEDFEADAMAMTEHIYKSLSIPGFEAAAPAISQYIGGKKGYKKNKYKYDDR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+E+W +D++GY
Sbjct 349 TVRLVEENWKFALDQWGY 366
>gi|150003008|ref|YP_001297752.1| hypothetical protein BVU_0415 [Bacteroides vulgatus ATCC 8482]
gi|149931432|gb|ABR38130.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length=368
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 164/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQEKRYRKLLADKPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL ++ EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFNDISDEELKVFEDIFTKLIKISLWNTGGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + + L+ + + L+ V+S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRNFFTNTIQPLKLEDISPEALEQNVLSIYAKLYHKYEADKQFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D +Y+ L + FE P + QY+ YK N Y+
Sbjct 289 PEGNLMEVKFEDFEADAMAMTEHIYKSLSIPGFEAAAPAISQYIGGKKGYKKNKYKYDDR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+E+W +D++GY
Sbjct 349 TVRLVEENWKFALDQWGY 366
>gi|294775643|ref|ZP_06741151.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294450487|gb|EFG18979.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length=368
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/318 (29%), Positives = 164/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQEKRYRKLLADKPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL ++ EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFNDISDEELKVFEDIFTKLIKISLWNTGGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + + L+ + + L+ ++S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRNFFTNTIQPLKLEDISPEALEQNILSVYAKLYHKYEADKQFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D +Y+ L + FE P + QY+ YK N Y+
Sbjct 289 PEGNLMEVKFEDFEADAMAMTEHIYKSLSIPGFEAAAPAISQYIGGKKGYKKNKYKYDDR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+E+W +D++GY
Sbjct 349 TVRLVEENWKFALDQWGY 366
>gi|237707998|ref|ZP_04538479.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|237725270|ref|ZP_04555751.1| conserved hypothetical protein [Bacteroides sp. D4]
gi|265754216|ref|ZP_06089405.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|229436536|gb|EEO46613.1| hypothetical protein BSEG_02754 [Bacteroides dorei 5_1_36/D4]
gi|229457984|gb|EEO63705.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|263234925|gb|EEZ20480.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length=368
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 163/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQEKRYRKLLADKSLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL ++ EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFNDISDEELKVFEDIFTKLIKISLWNTGGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + + L+ + + L+ V+S Y LY K + + +
Sbjct 229 AKFIYLMRNPYTVFESTRNFFTNTIQPLKLEDISPETLEQNVLSIYAKLYHKYEADKRFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D +Y+ L + FE P + QY+ YK N Y+
Sbjct 289 PEGNLMEVKFEDFEADAMAMTEYIYKSLSIPGFEAAAPAISQYIGGKKGYKKNKYKYNDR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+E+W +D++GY
Sbjct 349 TVRLVEENWKFALDQWGY 366
>gi|212690546|ref|ZP_03298674.1| hypothetical protein BACDOR_00028 [Bacteroides dorei DSM 17855]
gi|212666895|gb|EEB27467.1| hypothetical protein BACDOR_00028 [Bacteroides dorei DSM 17855]
Length=368
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 163/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q+ + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQEKRYRKLLADKSLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL ++ EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFNDISDEELKVFEDIFTKLIKISLWNTGGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + + L+ + + L+ V+S Y LY K + + +
Sbjct 229 AKFIYLMRNPYTVFESTRNFFTNTIQPLKLEDISPETLEQNVLSIYAKLYHKYEADKRFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D +Y+ L + FE P + QY+ YK N Y+
Sbjct 289 PEGNLMEVKFEDFEADAMAMTEYIYKSLSIPGFETAAPAISQYIGGKKGYKKNKYKYNDR 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+E+W +D++GY
Sbjct 349 TVRLVEENWKFALDQWGY 366
>gi|159030655|emb|CAO88325.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length=365
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/359 (28%), Positives = 168/359 (47%), Gaps = 13/359 (3%)
Query 23 GCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPP 82
G N S +RL + N + A L +++ ++I+ V P
Sbjct 10 GSNLSTLLRLFLTNG-GIDRPNLAPATLALAVTLARLPFSTLERILMTGFYERRVQVKAP 68
Query 83 IFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFL-LTEWFAPYVEFLVSKHRAMD 141
IFIVG+WR+GTT LH LL + + P L + F P +E + R +D
Sbjct 69 IFIVGYWRSGTTHLHNLLGQSEHFGYISPLAVGLPWDILGIVRLFQPLLELALPSDRHVD 128
Query 142 NMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNRPPQYEEYLD----LEQVAPRELEIWKR 197
N+ ++ + PQED S Y + FP R ++ + D + + +E+ W+R
Sbjct 129 NVAVTPNSPQEDSIALASMIPLSYYHGLYFPQR---FQYHFDRGVFFQGCSEKEIANWQR 185
Query 198 TLFRFVQQVYFRRR-KTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHK 256
+++V ++ K ++LKNP ++ I L ++P AKFIHI R+PY+V+PST H
Sbjct 186 WHTHLLKKVSIHQKGKQLLLKNPVYTAHIARLRAIWPDAKFIHIYRNPYLVFPSTRHFFT 245
Query 257 ALYRIHGLQ---QPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEG 313
+ LQ + D ++ ++ +Y + L + F E+R+EDL +P
Sbjct 246 RILPELALQPYDNLSIDVIEQAILKSYPLMLNSLLGDSANLPTDSFVEIRFEDLEKEPLT 305
Query 314 QLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
Q+ ++Y L L D + +PR +Y++ YK N+Y + +V+ HW I R+ Y
Sbjct 306 QIEKIYDQLQLPDLKISMPRFEKYISSLQGYKKNNYPPEPKAIELVESHWLPFIQRWNY 364
>gi|166365918|ref|YP_001658191.1| hypothetical protein MAE_31770 [Microcystis aeruginosa NIES-843]
gi|166088291|dbj|BAG02999.1| hypothetical protein MAE_31770 [Microcystis aeruginosa NIES-843]
Length=365
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 99/356 (28%), Positives = 164/356 (47%), Gaps = 7/356 (1%)
Query 23 GCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPP 82
G N S +RL + N + A L +++ ++I+ V P
Sbjct 10 GSNLSTLLRLFLTNG-GIDRPNLAPATLAMAVTLARLPFSTLERILITGFYERGVQVKAP 68
Query 83 IFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFL-LTEWFAPYVEFLVSKHRAMD 141
IFIVG+WR+GTT LH LL + + P L + F P +E + R +D
Sbjct 69 IFIVGYWRSGTTHLHNLLGQSEHFGYISPLAVGLPWDILGIVRLFQPLLELALPSDRHVD 128
Query 142 NMDLSLHHPQEDEFVWCMQGLPSPYLTIAFPNR-PPQYEEYLDLEQVAPRELEIWKRTLF 200
N+ ++ PQED S Y + FP R ++ + + + E+ W+R
Sbjct 129 NVAVTPDSPQEDSIALASMIPLSYYHGLYFPQRFQYHFQRGVFFQGCSEGEIATWQRWHT 188
Query 201 RFVQQVYFRRR-KTVILKNPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALY 259
+++V +R K +++KNP ++ I L ++P AKFIHI R+PY+V+PST H +
Sbjct 189 HLLKKVSIHQRGKQLLIKNPVYTAHIAKLRAIWPDAKFIHIYRNPYLVFPSTRHFFTRIL 248
Query 260 RIHGLQ---QPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLR 316
LQ + D ++ ++ +Y + L + F E+R+EDL P Q+
Sbjct 249 PELALQSYDNLSTDEIEQVILKSYPPMINSLLRDSADLPADSFVEIRFEDLEKTPLEQIE 308
Query 317 RLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGY 372
++Y L L D + +PR +Y+A YK N+Y + +V+ HW I R+ Y
Sbjct 309 KIYGQLQLPDLKIAMPRFEKYIASLQGYKKNNYPPDAKAIELVESHWLPFIQRWNY 364
>gi|325300102|ref|YP_004260019.1| hypothetical protein Bacsa_3017 [Bacteroides salanitronis DSM
18170]
gi|324319655|gb|ADY37546.1| hypothetical protein Bacsa_3017 [Bacteroides salanitronis DSM
18170]
Length=369
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 95/318 (30%), Positives = 160/318 (51%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 SVLKPLQDKKYEKLLASKPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFA-LTNMMPYTYYNFWFLPKHMQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL E + EL++++ T + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFEDITNDELKVFEETFTKLIKISLWNTNGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + + L + ++S Y LY K + + +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIKPLQLQSISPEELQENILSVYAKLYHKYEADKRFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+R+ED + + +YQ L L FE P + Y+ YK N YQ E
Sbjct 289 PEGNLVEVRFEDYEKNAFDLTQEIYQKLSLPGFEEARPAIEAYVNKKKGYKKNKYQYKPE 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V++HW +D + Y
Sbjct 349 TVELVEKHWRFALDEWNY 366
>gi|167763506|ref|ZP_02435633.1| hypothetical protein BACSTE_01880 [Bacteroides stercoris ATCC
43183]
gi|167698800|gb|EDS15379.1| hypothetical protein BACSTE_01880 [Bacteroides stercoris ATCC
43183]
Length=368
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/319 (29%), Positives = 159/319 (50%), Gaps = 5/319 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQDKRYEKLLADKPLEHDPVFILGHWRSGTTFVHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKHQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + ++ EL++++ T R ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDISEAELKVFEETFTRLIKISLWNTHGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ T L+ ++S Y LY K + + +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDITPAELEQNILSAYAKLYHKYEADKASI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+++ED D G +Y L + F + QY+ YK N Y+
Sbjct 289 PAGNLIEVKFEDFEADAMGMTEHIYDALSIPGFADARTAIEQYVGGKKGYKKNKYKYDDR 348
Query 355 QRAIVDEHWGEIIDRYGYD 373
+V ++WG + ++ Y+
Sbjct 349 TVQLVQDNWGFALKQWNYE 367
>gi|218260668|ref|ZP_03475864.1| hypothetical protein PRABACTJOHN_01528 [Parabacteroides johnsonii
DSM 18315]
gi|218224418|gb|EEC97068.1| hypothetical protein PRABACTJOHN_01528 [Parabacteroides johnsonii
DSM 18315]
Length=367
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/306 (30%), Positives = 155/306 (51%), Gaps = 5/306 (1%)
Query 71 RRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTE-WFAPY 129
R++A+ + P+FI+GHWR+GTT +H + D T Y+ + P+ L + +F
Sbjct 61 RKIADKPLEMDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPNLMLWGQPFFKKN 120
Query 130 VEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTI-AFPNRPPQY-EEYLDLEQV 187
+ FL+ R DNM+L + PQE+EF +P Y FP +Y + YL + +
Sbjct 121 MAFLMPDKRPTDNMELKVDLPQEEEFALA-NMMPYTYYNFWFFPKHMLEYCDRYLLFDNI 179
Query 188 APRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
+ E +++K T + ++ + T L KNP H+ R+K L+E+FP AKFI++ R+PY
Sbjct 180 SEHERKVFKETFLKLIKISLWNTNGTQFLSKNPPHTGRVKTLVEMFPNAKFIYLKRNPYT 239
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
V+ ST + LQ+ + + ++ + Y L+ K +E + L+ E+++ED
Sbjct 240 VFESTRSFFTNTIQPLRLQEISNEQIESNFIEVYRRLFYKYEEQKHLIPEGNLVEVKFED 299
Query 307 LIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEI 366
D +Y+ L L FE + +YL YK N Y+ +V+E+WG
Sbjct 300 FEQDAFAMTEDIYKKLNLPGFEESKAEIEKYLGKKKGYKKNQYKYDDRTVQLVEENWGMA 359
Query 367 IDRYGY 372
+ +GY
Sbjct 360 LKEWGY 365
>gi|154492288|ref|ZP_02031914.1| hypothetical protein PARMER_01922 [Parabacteroides merdae ATCC
43184]
gi|154087513|gb|EDN86558.1| hypothetical protein PARMER_01922 [Parabacteroides merdae ATCC
43184]
Length=367
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 91/306 (30%), Positives = 155/306 (51%), Gaps = 5/306 (1%)
Query 71 RRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTE-WFAPY 129
R++A+ + P+FI+GHWR+GTT +H + D T Y+ + P+ L + +F
Sbjct 61 RKIADKPLEMDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPNLMLWGQPFFKKN 120
Query 130 VEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTI-AFPNRPPQY-EEYLDLEQV 187
+ FL+ R DNM+L + PQE+EF +P Y FP +Y + YL + +
Sbjct 121 MAFLMPDKRPTDNMELKVDLPQEEEFALA-NMMPYTYYNFWFFPKHMLEYCDRYLLFDNI 179
Query 188 APRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
+ E E++K T + ++ + + + L KNP H+ R+K L+E+FP AKFI++ R+PY
Sbjct 180 SEHEREVFKETFLKLIKISLWNTKGSQFLSKNPPHTGRVKTLVEMFPNAKFIYLKRNPYT 239
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
V+ ST + LQ + + ++ + Y L+ K +E + L+ E+++ED
Sbjct 240 VFESTRSFFTNTIQPLRLQDISNEQIESNFIEVYRRLFYKYEEQKHLIPEGNLVEVKFED 299
Query 307 LIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEI 366
D +Y+ L L FE + +YL YK N Y+ +V+E+WG
Sbjct 300 FEQDAFAMTEDIYKKLNLPGFEESKAEIEKYLGKKKGYKKNQYKYDDRTVRLVEENWGMA 359
Query 367 IDRYGY 372
+ +GY
Sbjct 360 LKEWGY 365
>gi|116073267|ref|ZP_01470529.1| hypothetical protein RS9916_32492 [Synechococcus sp. RS9916]
gi|116068572|gb|EAU74324.1| hypothetical protein RS9916_32492 [Synechococcus sp. RS9916]
Length=346
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 151/325 (47%), Gaps = 4/325 (1%)
Query 39 AVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHE 98
A+ SRW + +V L Q ++ R+ + D PI IVGHWR+GTT LH+
Sbjct 4 AMQPSRWLVGLQLVLPGVVLEPLAWLQVLILRTRLRALQVPDDPIVIVGHWRSGTTFLHQ 63
Query 99 LLVVDDRHTGPTGYECLAPH-HFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVW 157
LL VD + +AP LL W AP ++ +S+ R +D + S PQEDE
Sbjct 64 LLSVDPQTATARNSFTVAPQVAVLLKPWLAPVLQRWMSRTRPIDAVPWSALDPQEDEIGL 123
Query 158 CMQGLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILK 217
+ +AFP P + L A + ++ T ++ + R +++K
Sbjct 124 ARLTPDTNMAGVAFPQHYPHHFRRCVLASTADFQQQLLHFTRLTWLHDGAGKTR--LLIK 181
Query 218 NPTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLD-DKV 276
N H+ R+ +LL +FP+A+F+ + R+P S + + ++L + GLQ P + ++
Sbjct 182 NSAHTARVALLLRMFPKARFVLLKREPIASIRSLVQVKQSLAHLVGLQAPLDEVAQVEET 241
Query 277 VSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQ 336
+ + L + R L+ P + E+ Y DLI DP R+Y+ L L + P + Q
Sbjct 242 TAAHRALMHAFERSRSLIPPGQLVEVAYGDLIADPLAATERIYRELNLSGWHLAQPAIAQ 301
Query 337 YLADHADYKTNSYQLTVEQRAIVDE 361
A Y+ QL++ A + E
Sbjct 302 RAAMAQSYQAQPVQLSLAAEARLQE 326
>gi|224540135|ref|ZP_03680674.1| hypothetical protein BACCELL_05048 [Bacteroides cellulosilyticus
DSM 14838]
gi|224518243|gb|EEF87348.1| hypothetical protein BACCELL_05048 [Bacteroides cellulosilyticus
DSM 14838]
Length=368
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 79/273 (29%), Positives = 141/273 (52%), Gaps = 5/273 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + +R+A + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQNGRYEKRLASQPLEHDPVFILGHWRSGTTFVHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + + +EL++++ + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDITEKELKVFEEVFIKLIKISLWNTNGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + D ++ ++S Y LY K + + +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDISNDEIEKNILSIYAKLYHKYEADKSCI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDF 327
E+++ED D G ++Y+ L + F
Sbjct 289 PAGNLMEVKFEDFEADAMGMTEQIYRGLSIPGF 321
>gi|256839695|ref|ZP_05545204.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256738625|gb|EEU51950.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length=367
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 96/306 (32%), Positives = 151/306 (50%), Gaps = 5/306 (1%)
Query 71 RRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTE-WFAPY 129
+++A+ + P+FI+GHWR+GTT +H + D T Y+ + PH L + +F
Sbjct 61 KKLADKPLEMDPLFILGHWRSGTTFVHNIFACDKHFGYTTTYQTVFPHLMLWGQPFFKKN 120
Query 130 VEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTI-AFPNRPPQY-EEYLDLEQV 187
+ FL+ R DNM+L + PQE+EF +P Y FP R +Y + YL +
Sbjct 121 MAFLMPDKRPTDNMELKVDLPQEEEFALS-NMMPYTYYNFWFFPKRWMEYCDRYLLFNDI 179
Query 188 APRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQAKFIHIVRDPYV 246
E I+ T R V+ + T L KNP H+ R+K LLE+FP AKFI++ R+PY
Sbjct 180 TEEERRIFMDTFMRLVKVSLWNTNGTQYLSKNPPHTGRVKTLLEMFPNAKFIYLKRNPYT 239
Query 247 VYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYED 306
V+ ST + LQ T + ++ + Y L+ K +E + L+ E+++ED
Sbjct 240 VFESTRSFFTNTIQPLRLQDITNEQIEANFIEVYRRLFYKYEEEKHLIPEGNLVEVKFED 299
Query 307 LIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEI 366
D G +Y L L F+ + +YL YK N Y+ +V+E+WG
Sbjct 300 FEKDAFGMTENIYGSLNLPGFKESKADIEKYLGKKKGYKKNQYKYEDRTVRLVEENWGMA 359
Query 367 IDRYGY 372
+ +GY
Sbjct 360 LKEWGY 365
>gi|198274257|ref|ZP_03206789.1| hypothetical protein BACPLE_00397 [Bacteroides plebeius DSM 17135]
gi|198272932|gb|EDY97201.1| hypothetical protein BACPLE_00397 [Bacteroides plebeius DSM 17135]
Length=368
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/318 (29%), Positives = 164/318 (52%), Gaps = 5/318 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A+ + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 STLAPLQDKRYEKLLADKPLEHDPVFILGHWRSGTTFMHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALA-NMMPYTYYNFWFLPKHMQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + ++ EL++++ T + ++ + T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDISEAELKVFEETFTKLIKISLWNTHGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + + L + ++S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDISNEELQENILSVYAKLYHKYEADKKFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRLRQYLADHADYKTNSYQLTVE 354
E+R+ED + + +YQ L + FE + Y+ YK N YQ E
Sbjct 289 PEGNLVEVRFEDYETNAYDMTQEIYQKLQIPGFEDARADIEAYVNKKKGYKKNKYQYKPE 348
Query 355 QRAIVDEHWGEIIDRYGY 372
+V+++W ++++GY
Sbjct 349 TVELVEKNWSFALEQWGY 366
>gi|299149160|ref|ZP_07042221.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|336417232|ref|ZP_08597558.1| hypothetical protein HMPREF1017_04666 [Bacteroides ovatus 3_8_47FAA]
gi|298512827|gb|EFI36715.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|335936430|gb|EGM98360.1| hypothetical protein HMPREF1017_04666 [Bacteroides ovatus 3_8_47FAA]
Length=368
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/273 (30%), Positives = 141/273 (52%), Gaps = 5/273 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 SPLASLQDRRYEKLLANQPLEHDPVFILGHWRSGTTFVHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALS-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + + EL++++ + ++ + R T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDITDAELKVFEEVFTKLIKISLWNTRGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + + L++ ++S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDVSNEQLEENILSIYAKLYHKYESDKKFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDF 327
E+++ED D G +YQ L + F
Sbjct 289 PEGNLMEVKFEDFEADAMGMTENIYQSLSIPGF 321
>gi|315919130|ref|ZP_07915370.1| conserved hypothetical protein [Bacteroides sp. D2]
gi|313693005|gb|EFS29840.1| conserved hypothetical protein [Bacteroides sp. D2]
Length=368
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/273 (30%), Positives = 141/273 (52%), Gaps = 5/273 (1%)
Query 59 SCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPH 118
S L Q + + +A + P+FI+GHWR+GTT +H + D T Y+ + PH
Sbjct 50 SPLASLQDRRYEKLLANQPLEHDPVFILGHWRSGTTFVHNVFSCDKHFGYNTTYQTVFPH 109
Query 119 HFLLTE-WFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQGLPSPYLTIAF-PNRPP 176
+ + +F + +L+ R DNM+L++ PQE+EF +P Y F P
Sbjct 110 LMMWGQPFFKKNMSWLMPDKRPTDNMELAVDLPQEEEFALS-NMMPYTYYNFWFLPKYQQ 168
Query 177 QY-EEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVIL-KNPTHSFRIKVLLEVFPQ 234
+Y ++YL + + EL++++ + ++ + R T L KNP H+ R+K L+++FP
Sbjct 169 EYADKYLLFDDITDAELKVFEEVFTKLIKISLWNTRGTQFLSKNPPHTGRVKELVKMFPN 228
Query 235 AKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELV 294
AKFI+++R+PY V+ ST + LQ + + L++ ++S Y LY K + ++ +
Sbjct 229 AKFIYLMRNPYTVFESTRSFFTNTIQPLKLQDVSNEQLEENILSIYAKLYHKYESDKKFI 288
Query 295 DPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDF 327
E+++ED D G +YQ L + F
Sbjct 289 PEGNLMEVKFEDFEADAMGMTENIYQSLSIPGF 321
Lambda K H
0.326 0.142 0.467
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 758906400850
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40