BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2075c
Length=487
                                                                    Score     E
Sequences producing significant alignments:                         (Bits)  Value
gi|15609212|ref|NP_216591.1| hypothetical protein Rv2075c [Mycob... 979 0.0
gi|289443579|ref|ZP_06433323.1| hypothetical exported protein [M... 976 0.0
gi|340627086|ref|YP_004745538.1| hypothetical protein MCAN_20981... 974 0.0
gi|308403485|ref|ZP_07493829.2| hypothetical exported protein [M... 954 0.0
gi|298525578|ref|ZP_07012987.1| conserved hypothetical protein [... 951 0.0
gi|289762232|ref|ZP_06521610.1| hypothetical exported or envelop... 759 0.0
gi|183983058|ref|YP_001851349.1| hypothetical protein MMAR_3058 ... 723 0.0
gi|240172464|ref|ZP_04751123.1| hypothetical protein MkanA1_2433... 702 0.0
gi|31793257|ref|NP_855750.1| hypothetical protein Mb2100c [Mycob... 520 3e-145
gi|304310136|ref|YP_003809734.1| hypothetical protein HDN1F_0485... 171 3e-40
gi|83643085|ref|YP_431520.1| QXW lectin repeat-containing protei... 158 2e-36
gi|254447517|ref|ZP_05060983.1| QXW lectin repeat protein [gamma... 152 2e-34
gi|87119030|ref|ZP_01074928.1| hypothetical protein MED121_12210... 129 1e-27
gi|94500127|ref|ZP_01306661.1| protein containing QXW lectin rep... 113 8e-23
gi|45658205|ref|YP_002291.1| hypothetical protein LIC12359 [Lept... 107 3e-21
gi|24214075|ref|NP_711556.1| hypothetical protein LA_1375 [Lepto... 107 4e-21
gi|90412318|ref|ZP_01220323.1| hypothetical protein P3TCK_09798 ... 94.7 3e-17
gi|54302545|ref|YP_132538.1| hypothetical protein PBPRB0866 [Pho... 93.2 1e-16
gi|54302747|ref|YP_132740.1| hypothetical protein PBPRB1068 [Pho... 92.8 1e-16
gi|72129125|ref|XP_800669.1| PREDICTED: hypothetical protein [St... 81.6 3e-13
gi|325189698|emb|CCA24181.1| conserved hypothetical protein [Alb... 77.0 8e-12
gi|115373732|ref|ZP_01461026.1| conserved hypothetical protein [... 75.5 2e-11
gi|301119405|ref|XP_002907430.1| conserved hypothetical protein ... 73.9 5e-11
gi|298708683|emb|CBJ26170.1| conserved unknown protein [Ectocarp... 69.7 1e-09
gi|312219199|emb|CBX99143.1| similar to lectin C-type domain con... 65.5 2e-08
gi|301094668|ref|XP_002896438.1| conserved hypothetical protein ... 65.1 3e-08
gi|299115957|emb|CBN75962.1| conserved unknown protein [Ectocarp... 61.2 4e-07
gi|156357276|ref|XP_001624147.1| predicted protein [Nematostella... 60.8 5e-07
gi|301100928|ref|XP_002899553.1| conserved hypothetical protein ... 57.8 4e-06
gi|307103812|gb|EFN52069.1| hypothetical protein CHLNCDRAFT_1393... 57.4 5e-06
gi|156370135|ref|XP_001628327.1| predicted protein [Nematostella... 56.6 1e-05
gi|325181739|emb|CCA16195.1| conserved hypothetical protein [Alb... 53.9 7e-05
gi|212537327|ref|XP_002148819.1| Lectin C-type domain protein [P... 52.0 2e-04
gi|302532317|ref|ZP_07284659.1| predicted protein [Streptomyces ... 50.8 5e-04
gi|302836171|ref|XP_002949646.1| hypothetical protein VOLCADRAFT... 50.4 6e-04
gi|299115958|emb|CBN75963.1| conserved unknown protein [Ectocarp... 50.4 7e-04
gi|339468640|gb|EGP83740.1| hypothetical protein MYCGRDRAFT_7603... 50.4 7e-04
gi|87118774|ref|ZP_01074673.1| hypothetical protein MED121_17144... 50.1 8e-04
gi|224123376|ref|XP_002330300.1| predicted protein [Populus tric... 49.7 0.001
gi|326469631|gb|EGD93640.1| hypothetical protein TESG_01181 [Tri... 48.9 0.002
gi|326478842|gb|EGE02852.1| lectin C-type domain containing prot... 48.5 0.002
gi|242809580|ref|XP_002485399.1| Lectin C-type domain protein [T... 48.5 0.002
gi|169620399|ref|XP_001803611.1| hypothetical protein SNOG_13399... 48.5 0.003
gi|302655206|ref|XP_003019396.1| Lectin C-type domain protein [T... 48.5 0.003
gi|294817167|ref|ZP_06775809.1| glycoside hydrolase family prote... 48.1 0.003
gi|327303498|ref|XP_003236441.1| hypothetical protein TERG_03486... 48.1 0.003
gi|302509228|ref|XP_003016574.1| Lectin C-type domain protein [A... 48.1 0.003
gi|189198319|ref|XP_001935497.1| lectin C-type domain containing... 48.1 0.003
gi|291451795|ref|ZP_06591185.1| chitinase [Streptomyces albus J1... 48.1 0.003
gi|302804570|ref|XP_002984037.1| hypothetical protein SELMODRAFT... 47.8 0.004
>gi|15609212|ref|NP_216591.1| hypothetical protein Rv2075c [Mycobacterium tuberculosis H37Rv]
gi|15841564|ref|NP_336601.1| hypothetical protein MT2135 [Mycobacterium tuberculosis CDC1551]
gi|148661889|ref|YP_001283412.1| hypothetical protein MRA_2089 [Mycobacterium tuberculosis H37Ra]
44 more sequence titles
Length=487
Score = 979 bits (2530), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/487 (100%), Positives = 487/487 (100%), Gaps = 0/487 (0%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA
Sbjct 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL
Sbjct 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG
Sbjct 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ
Sbjct 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
Query 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW
Sbjct 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
Query 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP
Sbjct 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG
Sbjct 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
Query 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW
Sbjct 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
Query 481 VHYLLPP 487
VHYLLPP
Sbjct 481 VHYLLPP 487
>gi|289443579|ref|ZP_06433323.1| hypothetical exported protein [Mycobacterium tuberculosis T46]
gi|289570185|ref|ZP_06450412.1| hypothetical exported protein [Mycobacterium tuberculosis T17]
gi|289745349|ref|ZP_06504727.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
11 more sequence titles
Length=487
Score = 976 bits (2522), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/487 (99%), Positives = 486/487 (99%), Gaps = 0/487 (0%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA
Sbjct 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL
Sbjct 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG
Sbjct 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ
Sbjct 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
Query 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW
Sbjct 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
Query 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP
Sbjct 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
PKV AMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG
Sbjct 361 PKVQAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
Query 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW
Sbjct 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
Query 481 VHYLLPP 487
VHYLLPP
Sbjct 481 VHYLLPP 487
>gi|340627086|ref|YP_004745538.1| hypothetical protein MCAN_20981 [Mycobacterium canettii CIPT
140010059]
gi|340005276|emb|CCC44430.1| putative hypothetical exported or envelope protein [Mycobacterium
canettii CIPT 140010059]
Length=487
Score = 974 bits (2517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/487 (99%), Positives = 485/487 (99%), Gaps = 0/487 (0%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA
Sbjct 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL
Sbjct 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG
Sbjct 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAY SVVATLDQ
Sbjct 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYASVVATLDQ 240
Query 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW
Sbjct 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
Query 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP
Sbjct 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
PKV AMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG
Sbjct 361 PKVQAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
Query 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW
Sbjct 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
Query 481 VHYLLPP 487
VHYLLPP
Sbjct 481 VHYLLPP 487
>gi|308403485|ref|ZP_07493829.2| hypothetical exported protein [Mycobacterium tuberculosis SUMu012]
gi|308365702|gb|EFP54553.1| hypothetical exported protein [Mycobacterium tuberculosis SUMu012]
Length=475
Score = 954 bits (2467), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/475 (100%), Positives = 475/475 (100%), Gaps = 0/475 (0%)
Query 13 MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV 72
MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV
Sbjct 1 MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV 60
Query 73 PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS 132
PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS
Sbjct 61 PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS 120
Query 133 FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV 192
FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV
Sbjct 121 FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV 180
Query 193 EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY 252
EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY
Sbjct 181 EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY 240
Query 253 RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG 312
RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG
Sbjct 241 RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG 300
Query 313 YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVN 372
YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVN
Sbjct 301 YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVN 360
Query 373 LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG 432
LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG
Sbjct 361 LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG 420
Query 433 RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP 487
RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP
Sbjct 421 RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP 475
>gi|298525578|ref|ZP_07012987.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298495372|gb|EFI30666.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|339294998|gb|AEJ47109.1| hypothetical protein CCDC5079_1919 [Mycobacterium tuberculosis
CCDC5079]
gi|339298622|gb|AEJ50732.1| hypothetical protein CCDC5180_1895 [Mycobacterium tuberculosis
CCDC5180]
Length=475
Score = 951 bits (2459), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/475 (99%), Positives = 474/475 (99%), Gaps = 0/475 (0%)
Query 13 MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV 72
MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV
Sbjct 1 MGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFADAVAAECRRVGV 60
Query 73 PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS 132
PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS
Sbjct 61 PDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDS 120
Query 133 FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV 192
FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV
Sbjct 121 FTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTV 180
Query 193 EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY 252
EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY
Sbjct 181 EPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIY 240
Query 253 RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG 312
RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG
Sbjct 241 RPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSG 300
Query 313 YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVN 372
YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKV AMTDCGVN
Sbjct 301 YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVQAMTDCGVN 360
Query 373 LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG 432
LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG
Sbjct 361 LFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAG 420
Query 433 RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP 487
RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP
Sbjct 421 RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP 475
>gi|289762232|ref|ZP_06521610.1| hypothetical exported or envelope protein [Mycobacterium tuberculosis
GM 1503]
gi|289709738|gb|EFD73754.1| hypothetical exported or envelope protein [Mycobacterium tuberculosis
GM 1503]
Length=392
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/390 (97%), Positives = 382/390 (98%), Gaps = 3/390 (0%)
Query 99 HRTARFQDALQDPVPLRET-QWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRAL 157
HR +R + ++PVPLRE QWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRAL
Sbjct 5 HRGSRMR--CKNPVPLRENLQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRAL 62
Query 158 ELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVI 217
ELDLHYLP LEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVI
Sbjct 63 ELDLHYLPCLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVI 122
Query 218 LLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIR 277
LLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIR
Sbjct 123 LLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIR 182
Query 278 ASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYE 337
ASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYE
Sbjct 183 ASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYE 242
Query 338 DSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDE 397
DSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDE
Sbjct 243 DSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDE 302
Query 398 PRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADF 457
PRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADF
Sbjct 303 PRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADF 362
Query 458 TLPRTGNQNARLHAVAGPAGGAWVHYLLPP 487
TLPRTGNQNARLHAVAGPAGGAWVHYLLPP
Sbjct 363 TLPRTGNQNARLHAVAGPAGGAWVHYLLPP 392
>gi|183983058|ref|YP_001851349.1| hypothetical protein MMAR_3058 [Mycobacterium marinum M]
gi|183176384|gb|ACC41494.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=489
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/487 (73%), Positives = 393/487 (81%), Gaps = 0/487 (0%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MP +RW++ A ++G VVL T PVAAD V APPSPTA CD ISP+A+PCVALGK
Sbjct 1 MPGSRWMKGAVVIGVFGVVLTTIPPVAADTSGVVAPPSPTAPCDAISPIAVPCVALGKAT 60
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DA AECRRVG+ DA CVLPLAH+VTQAAR AYLQSWVHR A+FQ ALQD +PLR+ QWL
Sbjct 61 DAFGAECRRVGIADAHCVLPLAHKVTQAARGAYLQSWVHRVAQFQYALQDELPLRQAQWL 120
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLS+SFT SHADSNQQLSLAQQLDIDVRALELDLHY+ RL+ G GVTVCHG
Sbjct 121 GTHNSFNSLSESFTPSHADSNQQLSLAQQLDIDVRALELDLHYIRRLDLVGGRGVTVCHG 180
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
LGP ANLGCT EP VLP+IANWL P H+++VILLYLED+LK+A AY S V TLD
Sbjct 181 LGPDKANLGCTTEPAFGNVLPEIANWLGTPAHSDQVILLYLEDELKDARAYASAVGTLDG 240
Query 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
VLRR DG+SLIYRPNPA+RA +GCV LPL++SR ++RASGA+ V+VGSCA GW++ VF+W
Sbjct 241 VLRRPDGSSLIYRPNPAQRAADGCVRLPLNLSRNDVRASGAQVVVVGSCASGWASDVFNW 300
Query 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
GVE+E GS SGYR YPACDATYG G YA RLVRYYEDSTL +AL PTRPP +P+AL P
Sbjct 301 DGVEVEKGSTSGYRAYPACDATYGAGTYASRLVRYYEDSTLVSALVKPTRPPTDPEALAP 360
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
K AM DCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAG C LQ DGRWV+A C
Sbjct 361 AKAKAMIDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGECTLQDRDGRWVSAPCT 420
Query 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
D HPAAC AAG W VT V FAGA LACTA GADF LPR+G+QNARLHAV+ GGAW
Sbjct 421 DAHPAACVTAAGTWAVTSTAVTFAGAPLACTAAGADFALPRSGDQNARLHAVSSSVGGAW 480
Query 481 VHYLLPP 487
V Y L P
Sbjct 481 VRYTLSP 487
>gi|240172464|ref|ZP_04751123.1| hypothetical protein MkanA1_24330 [Mycobacterium kansasii ATCC
12478]
Length=502
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/485 (72%), Positives = 388/485 (80%), Gaps = 5/485 (1%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MP RWL A ++ ++ +IT APV AD + SPTA CD +SP+AIPCVAL KFA
Sbjct 1 MPGTRWLHRAVVVSVTSLTVITPAPVIADPSEQT---SPTAPCDAVSPIAIPCVALNKFA 57
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DAVAAECRRVG+ DA C LPLAH+VTQAARDAYLQSWVHRTA+FQ AL DP+P+ + QWL
Sbjct 58 DAVAAECRRVGIADAHCALPLAHKVTQAARDAYLQSWVHRTAQFQYALADPLPISQAQWL 117
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLSDSFT+SHADSNQQLSLAQQLDIDVR LELDLHYLPRLE G VTVCHG
Sbjct 118 GTHNSFNSLSDSFTLSHADSNQQLSLAQQLDIDVRGLELDLHYLPRLELLGKREVTVCHG 177
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
L P N NLGCT EP L VLPQI NWLN PGHT+EVILLYLED+L++A+AY S +ATL+
Sbjct 178 LAPNNGNLGCTNEPPLTAVLPQIKNWLNIPGHTDEVILLYLEDELRDATAYSSALATLED 237
Query 241 VLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW 300
LRR DG SLIY P+PA RATNGCVPLPL SR ++RA+GA+ VLV SC P WSA VF W
Sbjct 238 TLRRPDGQSLIYHPDPAGRATNGCVPLPLQTSRNDVRAAGAQVVLVSSCIPNWSADVFTW 297
Query 301 SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTP 360
G E+ESGS GY+PYP CD TYG VYA +LVRYYEDSTL +AL PTRPPANP ALTP
Sbjct 298 KGPEVESGSTPGYQPYPTCDVTYGSDVYATKLVRYYEDSTLVSALTKPTRPPANPAALTP 357
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCG 420
KV AMTDCGVNLFGFDQLLPEDGRIQA+LWSWAPDEPR AG+C LQ GRWVAA C
Sbjct 358 AKVQAMTDCGVNLFGFDQLLPEDGRIQATLWSWAPDEPRPTAGSCTLQAPTGRWVAAPCA 417
Query 421 DPHPAACRDAAGRWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAW 480
DPHPAACR+AAG WT+TP PV F A LAC A+ A+F LPRTGNQNA+LHA A AGGAW
Sbjct 418 DPHPAACRNAAGTWTLTPNPVSFDQAQLACAAVNAEFALPRTGNQNAQLHAAA--AGGAW 475
Query 481 VHYLL 485
+ Y L
Sbjct 476 LCYPL 480
>gi|31793257|ref|NP_855750.1| hypothetical protein Mb2100c [Mycobacterium bovis AF2122/97]
gi|121637959|ref|YP_978183.1| hypothetical protein BCG_2093c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990453|ref|YP_002645140.1| putative hypothetical exported or envelope protein [Mycobacterium
bovis BCG str. Tokyo 172]
8 more sequence titles
Length=262
Score = 520 bits (1338), Expect = 3e-145, Method: Compositional matrix adjust.
Identities = 259/259 (100%), Positives = 259/259 (100%), Gaps = 0/259 (0%)
Query 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA
Sbjct 1 MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVAIPCVALGKFA 60
Query 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL
Sbjct 61 DAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWL 120
Query 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG
Sbjct 121 GTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHG 180
Query 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ
Sbjct 181 LGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQ 240
Query 241 VLRRADGTSLIYRPNPARR 259
VLRRADGTSLIYRPNPARR
Sbjct 241 VLRRADGTSLIYRPNPARR 259
>gi|304310136|ref|YP_003809734.1| hypothetical protein HDN1F_04850 [gamma proteobacterium HdN1]
gi|301795869|emb|CBL44068.1| hypothetical protein HDN1F_04850 [gamma proteobacterium HdN1]
Length=496
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/397 (31%), Positives = 181/397 (46%), Gaps = 38/397 (9%)
Query 97 WVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRA 156
W+ + Q L D PL + +L THNS N+ + S+ D NQ+LSL QQL +R+
Sbjct 91 WLQNALQLQRHLDDREPLATSSFLMTHNSANAAAYRTVFSYIDPNQKLSLGQQLGAGIRS 150
Query 157 LELDLHYLPRLEG---HGAPGVTVCHGLGPKNANLGCT-VEPLLATVLPQIANWLNAPGH 212
+ELD+H + G + +CHG +N +LGC+ + +L+ + ++ +WL +
Sbjct 151 IELDVHQFFSMRGWPWQWKKRILLCHG---QNNHLGCSPYDRVLSAGIDEVKDWLKKEEN 207
Query 213 TEEVILLYLEDQLKN--ASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLD 270
+EVI++Y ED + A ++V A L G S IYRP C +P+
Sbjct 208 RQEVIVIYFEDHVDGNYAELVDAVAARL--------GDS-IYRPTSGAN----CEGIPMQ 254
Query 271 VSREEIRASGARAVLVGS-----CAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGR 325
VS+++I A+G + +L+G + GW+ F G L + + R
Sbjct 255 VSKQDILAAGKQVLLMGGSEVCRSSHGWNTWAFAGVGDRLNGYPTGDLAQVSDTNCQFDR 314
Query 326 GVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGR 385
Y VR+YED T+ ++L A P V + CGVNL GFD+ P D R
Sbjct 315 SFYDRYWVRFYEDRTVISSLF------AKPDRFGAEDVERLQKCGVNLIGFDRFSPTDAR 368
Query 386 IQASLWSWAPDEPRA-GAGACALQGADGRWVAASCGDPHPAACR-DAAGRWTVTPAPVVF 443
+A +WSW +P A ACAL +GR+ A +C + ACR W +T + +
Sbjct 369 TRAYVWSWDEGQPAAISDAACALSQVNGRFSANACSEVARYACRVSGTHEWRITESAGTW 428
Query 444 AGAALAC---TAIGADFTLPRTGNQNARLHAVAGPAG 477
C T A F P G N L AG
Sbjct 429 QEGKFLCAHETGGEAVFVTPTNGYDNQSLQNAKAQAG 465
>gi|83643085|ref|YP_431520.1| QXW lectin repeat-containing protein [Hahella chejuensis KCTC
2396]
gi|83631128|gb|ABC27095.1| protein containing QXW lectin repeats [Hahella chejuensis KCTC
2396]
Length=550
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/404 (31%), Positives = 189/404 (47%), Gaps = 47/404 (11%)
Query 93 YLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDI 152
+ SW +R Q L PL + THNS+NS + + S+ D N SL QLD+
Sbjct 25 FRNSWTYRALTHQRTLDLGEPLGRANFPYTHNSYNSSAYANLGSYWDPNHIYSLVDQLDM 84
Query 153 DVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCT-VEPLLATVLPQIANWLNAPG 211
+RALELD+HY + +CHG N + GC+ + L ++A WL G
Sbjct 85 GIRALELDVHYT-------YGDLKLCHG---ANDHTGCSAFDRRFEDGLKEVATWLRQDG 134
Query 212 HTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDV 271
+ EV+++YLE+ + Y+ VA L++ + LIY+P C LP+++
Sbjct 135 NRGEVLIIYLEEHVD--GRYDDAVAALNRQM-----GDLIYKPGS-------CATLPMNI 180
Query 272 SREEIRASGARAVLV-GSC-APGWSAAVFDWSGVELESGSNSGYRPYPACDA-TYGRGVY 328
S+ +I SG + +L+ G+C + W+ V+++ G + N + PYP C Y
Sbjct 181 SKADILNSGRQVLLIGGNCGSDAWAQTVYNY-GFPTD---NDHFHPYPECRTDKYDLNFV 236
Query 329 AWRLVRYYEDST-LATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQ 387
LVR +EDST L+ +P PQ +TP + C + + G DQL D R+
Sbjct 237 QNNLVRIFEDSTRLSDVFGDP------PQPITPELMAQAARCSLGVVGLDQLKAFDERMT 290
Query 388 ASLWSWAPDEPRAGAGA--CALQGADGRWVAASCGDPHPAACRDAAGR-WTVTPAPVVFA 444
A++WSW +EP CA Q +GR+ A+C + P AC W VT + ++
Sbjct 291 AAVWSWDQNEPNNANNNEHCAEQWGNGRFNDAACTNARPFACYSKTHDAWAVTQSNAIWE 350
Query 445 GAALAC-TAIGAD--FTLPRTGNQNARLHAVAGPAGGA--WVHY 483
C G D F P+ G QN L G A W++Y
Sbjct 351 QGEFFCQQEFGGDYRFATPKNGYQNQLLQNAKAEQGYANVWLNY 394
>gi|254447517|ref|ZP_05060983.1| QXW lectin repeat protein [gamma proteobacterium HTCC5015]
gi|198262860|gb|EDY87139.1| QXW lectin repeat protein [gamma proteobacterium HTCC5015]
Length=433
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 118/396 (30%), Positives = 185/396 (47%), Gaps = 51/396 (12%)
Query 97 WVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRA 156
W + L PLR+ ++GTHNS+NS + + + D NQ S+ QLD+ R
Sbjct 37 WQRTALDLERDLDKAAPLRQATFVGTHNSYNSSAYADITRYIDPNQNQSIRAQLDMGARF 96
Query 157 LELDLHYLPRLEGHGAP---------GVTVCHGLGPKNANLGC-TVEPLLATVLPQIANW 206
LE D+H + + HG+P + +CHG ++ +LGC + + L ++ ++
Sbjct 97 LEFDVHMTNKFDTHGSPWAWEWTSNDQLLLCHG---QSNHLGCSSADRYFRDGLNELRDF 153
Query 207 LNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVP 266
+ A + +EV+LLY+ED + A+ S + LD + + +YRP+ + +GC
Sbjct 154 IAA--NRDEVVLLYIEDHMDGEYAWASDI--LDNSIGQ-----YLYRPS---QHGSGCQG 201
Query 267 LPLDVSREEIRASGARAVLV--GSCAPGWSAAVFDWSGVELESGSNSGYRPY---PACDA 321
LP +++++I SG V++ G C+ W G N R CD
Sbjct 202 LPNQLTKQDILNSGRNVVVITGGGCSGNAQYDARVW-------GQNFNTRNTANAANCDG 254
Query 322 TYGRGVYAWRLVRYYEDST-LATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLL 380
R + LVRYYED T L+ A NP P +T + + CG N+ GFD+L
Sbjct 255 L-SRSGHDSALVRYYEDRTNLSAAFGNPGEP------ITTGNIEQLLACGANVIGFDKLD 307
Query 381 PEDGRIQASLWSWAPDEPR--AGAGACALQGADGRWVAASCGDPHPAACRDAA-GRWTVT 437
+DGR++ ++WSW +EP GA CA DGR+ C ACR A W VT
Sbjct 308 EDDGRLERAIWSWGYNEPNNYNGAEDCAESRNDGRFNDIGCSAVRRFACRQAGTHNWYVT 367
Query 438 PAPVVFAGAALAC---TAIGADFTLPRTGNQNARLH 470
++ A C TA F +P +N +L+
Sbjct 368 NGSGSWSQGASTCANETAGQYQFAVPGNAFENNQLN 403
>gi|87119030|ref|ZP_01074928.1| hypothetical protein MED121_12210 [Marinomonas sp. MED121]
gi|86165421|gb|EAQ66688.1| hypothetical protein MED121_12210 [Marinomonas sp. MED121]
Length=411
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/400 (27%), Positives = 177/400 (45%), Gaps = 63/400 (15%)
Query 91 DAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNS---LSDSFTVS--HADSNQQLS 145
D + SW+ +T +Q AL + P+ + LGTHN++NS S +F+V + D Q+ S
Sbjct 36 DDFNASWLGQTMDYQRALDNYAPIIDNNILGTHNTYNSEVYTSCNFSVGCRYLDPQQKYS 95
Query 146 LAQQLDIDVRALELDLHYLPRLEGHGA--PGVTVCHGLGPKNANLGCTVEPLLATV-LPQ 202
+ QL + R +ELD+H+ ++E + + +CHG C++ AT +
Sbjct 96 IKDQLRMGARFIELDVHWTTKMESLFSYPKRLLLCHGF--------CSINDKYATEGFNE 147
Query 203 IANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATN 262
I +WL + EVI+LY+ED + + + + L+ R D IY P++
Sbjct 148 IKSWLASSESQGEVIILYIEDDSE--GHHSDLYSQLND--RFGDK---IY---PSQ---- 193
Query 263 GCVPLPLDVSREEIRASGARAVL--VGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACD 320
GC +P +++ ++ A G + +L G C+ + A ++G+ N G
Sbjct 194 GCGNIPDTLTKAQVLAQGKQIILWKDGGCSSNANMANLAFTGL-----GNVG-------- 240
Query 321 ATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLL 380
R +ED+T +A ++LT V G N+ D ++
Sbjct 241 -------------RIWEDATTLGTIA--EFFDGGIKSLTANDVSTAFATGANIVNLDDMV 285
Query 381 PEDGRIQASLWSWAPDEPRAGAGA--CALQGADGRWVAASCGDPHPAACRDAAGRWTV-T 437
DGRI+A+ WSW +EP G CA+Q +GRW +C AC D++G W V
Sbjct 286 MNDGRIEAAAWSWDNNEPNNSGGNQDCAVQWENGRWDDNNCAASFAFACEDSSGNWFVPD 345
Query 438 PAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAG 477
++ C +G F++P N +L G
Sbjct 346 NLTGTWSQGPSVCANLGGTFSMPTNSQSNQKLKLAKESVG 385
>gi|94500127|ref|ZP_01306661.1| protein containing QXW lectin repeats [Oceanobacter sp. RED65]
gi|94427700|gb|EAT12676.1| protein containing QXW lectin repeats [Oceanobacter sp. RED65]
Length=416
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 92/403 (23%), Positives = 168/403 (42%), Gaps = 56/403 (13%)
Query 93 YLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDI 152
+ SW + Q + PL E +G+HNS+NS + D Q +S+ QL +
Sbjct 44 FQNSWAGKALAHQRNMDANKPLAENNIIGSHNSYNSRKYRNATRYLDPQQIVSIYDQLRL 103
Query 153 DVRALELDLHYLPRLEG---HGAPGVTVCH---GLGPKNANLGCTV-EPLLATVLPQIAN 205
R +ELD H+ G + +CH G+ + ++GC++ + + + ++A
Sbjct 104 GARFIELDAHWTAHTHGWPWQWGTDLLLCHSGIGVDVGDLHVGCSLTDRRVEDGIAEVAR 163
Query 206 WLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCV 265
W+N + +EVI+LY ED + L V+ + G ++ A+ GC
Sbjct 164 WINE--NPKEVIILYFEDHT------DGRHQELFNVINKQLGANIY--------ASQGCK 207
Query 266 PLPLDVSREEIRASGARAVLV--GSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATY 323
+P +++ ++ ASG + ++ G C+ ++ SN + +
Sbjct 208 AIPNTLTKNQVLASGKQVIVWKDGGCSGN-------------QNMSNMAFTSLGDIN--- 251
Query 324 GRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPED 383
R +ED T A+ + + + + A + G N+ D + D
Sbjct 252 ----------RIWEDRTSIGAIGAFFTNGSVKKIESEDVIQAFKNGG-NIVNLDDMTHSD 300
Query 384 GRIQASLWSWAPDEPR--AGAGACALQGADGRWVAASCGDPHPAACR-DAAGRWTVTPAP 440
R+ A++WSW +EP G CALQ +GRW SC + H AC+ + W ++
Sbjct 301 DRLSAAIWSWDVNEPNNWGGNQDCALQWENGRWDDTSCSNQHFFACQHNETQEWNISTYQ 360
Query 441 VVFAGAALACTAIGA-DFTLPRTGNQNARLHAVAGPAGGAWVH 482
+ AC+ +G F+ P +N +L G W++
Sbjct 361 DAWQAGQQACSLLGNYRFSTPSNSLENEKLKTAKGGISHVWLN 403
>gi|45658205|ref|YP_002291.1| hypothetical protein LIC12359 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|45601447|gb|AAS70928.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
Length=440
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/348 (28%), Positives = 154/348 (45%), Gaps = 40/348 (11%)
Query 101 TARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELD 160
TA+ + + +PL + GTH+S+NS + SNQ ++ QL + R LEL+
Sbjct 43 TAQRKVQVNMNLPLNRALFFGTHDSYNSSA----YRRNPSNQTYTITDQLRLGARYLELE 98
Query 161 LHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPL-LATVLPQIANWLNAPGHTEEVILL 219
+H+ G+ + +C G + N GC L L +I+ W+ P + EV++L
Sbjct 99 VHWTTGRSGNKE--LLLCRGSNLNDHN-GCYRYDLTFEAGLNEISQWIQKPENQNEVLIL 155
Query 220 YLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRAS 279
Y++D+ +E V+ + L GT L+YR + +R N P+ + + ++++
Sbjct 156 YIKDR------FEGHVSEFMRTLSSKLGT-LLYR-HQSRDCLNQS-PMVMPKLEDMVKST 206
Query 280 GARAVLVGSCAPGWSAAVFDWSGVELE-----SGSNSGYRPYPACDATYGRGVYAWRLVR 334
R L + +S + D G S SG+R YP C+ + R Y LVR
Sbjct 207 NHRIFLTSNNC--YSPELSDTWGYYFRKDPFVSFQPSGFRGYPDCN--FSRETYHNSLVR 262
Query 335 YYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSW- 393
Y D+ A + T + +M C VNLFGFDQ + ++WSW
Sbjct 263 VYNDTIARNA-------NDRGGSFTNSNIQSMLACEVNLFGFDQFNANFAK--QAVWSWD 313
Query 394 -APDEP--RAGAGACALQGADGRWVAASCGDPHPAACRD-AAGRWTVT 437
A ++P R CA +GRW C AC+D G W +T
Sbjct 314 SATNQPLNREDQEHCARISVNGRWSTHHCDMNLKFACKDRNTGNWIIT 361
>gi|24214075|ref|NP_711556.1| hypothetical protein LA_1375 [Leptospira interrogans serovar
Lai str. 56601]
gi|24194954|gb|AAN48574.1| hypothetical protein LA_1375 [Leptospira interrogans serovar
Lai str. 56601]
Length=440
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 95/348 (28%), Positives = 154/348 (45%), Gaps = 40/348 (11%)
Query 101 TARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELD 160
TA+ + + +PL + GTH+S+NS + SNQ ++ QL + R LEL+
Sbjct 43 TAQRKVQVNMNLPLNRALFFGTHDSYNSSA----YRRNPSNQTYTITDQLRLGARYLELE 98
Query 161 LHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPL-LATVLPQIANWLNAPGHTEEVILL 219
+H+ G+ + +C G + N GC L L +I+ W+ P + EV++L
Sbjct 99 VHWTTGRSGNKE--LLLCRGSNLNDHN-GCYRYDLTFEAGLNEISQWIQKPENQNEVLIL 155
Query 220 YLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRAS 279
Y++D+ +E V+ + L GT L+YR + +R N P+ + + ++++
Sbjct 156 YIKDR------FEGHVSEFMRTLSSKLGT-LLYR-HQSRDCLNQS-PMVMPKLEDMVKST 206
Query 280 GARAVLVGSCAPGWSAAVFDWSGVELE-----SGSNSGYRPYPACDATYGRGVYAWRLVR 334
R L + +S + D G S SG+R YP C+ + R Y L+R
Sbjct 207 NHRIFLTSNNC--YSPELSDTWGYYFRKDPFVSFQPSGFRGYPDCN--FSRETYHNSLIR 262
Query 335 YYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSW- 393
Y D+ A + T + +M C VNLFGFDQ + ++WSW
Sbjct 263 VYNDTIARNA-------NDRGGSFTNSNIQSMLACEVNLFGFDQFNANFAK--QAVWSWD 313
Query 394 -APDEP--RAGAGACALQGADGRWVAASCGDPHPAACRD-AAGRWTVT 437
A ++P R CA +GRW C AC+D G W +T
Sbjct 314 SATNQPLNREDQEHCARISVNGRWSTHHCDMNLKFACKDRNTGNWIIT 361
>gi|90412318|ref|ZP_01220323.1| hypothetical protein P3TCK_09798 [Photobacterium profundum 3TCK]
gi|90326809|gb|EAS43202.1| hypothetical protein P3TCK_09798 [Photobacterium profundum 3TCK]
Length=559
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/401 (25%), Positives = 164/401 (41%), Gaps = 78/401 (19%)
Query 105 QDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYL 164
Q+ L P+ + W+GTHNS+NS D + S A+ NQ S+ +QL+ VRA+E+D+
Sbjct 203 QNELVTYSPIYKATWMGTHNSYNS-GDYYWAS-ANPNQSTSIIEQLESGVRAIEIDV--- 257
Query 165 PRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNA-PGHTEEVILLYLED 223
G T+ H + + V+ +I NWL PG + I + E
Sbjct 258 --------VGRTLKHKVDTSGTS--------FVRVMSEIKNWLRVNPG---QFIYVKFEH 298
Query 224 QLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARA 283
KN + V + + ++++R A NGC P ++ +++ G +
Sbjct 299 SSKNEGYEQDVAREIIETF-----GNMVFRD-----AVNGCNYAPESLTTKQLLDDGKQI 348
Query 284 VLVGSCAPGWSAAVFDWSGVELESGSNSGYR--------PYPACDATYGRG----VYAWR 331
+ F ++G + G+N+ YR P + D Y G + AW
Sbjct 349 MF------------FAFNG---DCGNNTDYRSVIWNRMGPETSDDHDYAAGCPSSLPAWD 393
Query 332 LVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPE--DGRIQAS 389
L R+ + + R L +V +CG+N G DQ LP+ DG I
Sbjct 394 LGRF-------STIVEDKRGWVWDHYLPVSQVRPALECGINFIGRDQFLPDDADGYIANH 446
Query 390 LWSW--APDEPRAGAGACALQ-GADG--RWVAASCGDPHPAACRDAAGRWTVTPAPVVFA 444
++SW + PR G L G+DG + AS + +PA C + G+ T V +
Sbjct 447 IFSWRNGLETPRVGRQHVKLSVGSDGYAHFATASQSEQYPALCMNREGQIQATSQAVSYD 506
Query 445 GAALACTAIGAD--FTLPRTGNQNARLHAVAGPAGGAWVHY 483
A C+ AD FT+P + + W++Y
Sbjct 507 QAQATCSNEFADSRFTVPTNARELSLFVKSVNEGAQFWMNY 547
>gi|54302545|ref|YP_132538.1| hypothetical protein PBPRB0866 [Photobacterium profundum SS9]
gi|46915967|emb|CAG22738.1| hypothetical protein PBPRB0866 [Photobacterium profundum SS9]
Length=620
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/401 (26%), Positives = 162/401 (41%), Gaps = 78/401 (19%)
Query 105 QDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYL 164
Q+ L P+ + W+GTHNS+NS D + S A NQ S+ +QL+ VRA+E+D+
Sbjct 264 QNELVTYSPIYKATWMGTHNSYNS-GDYYWAS-AKPNQSTSIVEQLESGVRAIEIDV--- 318
Query 165 PRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNA-PGHTEEVILLYLED 223
G T+ H + + V+ +I NWL PG + I + E
Sbjct 319 --------VGRTLKHKVDTSGTS--------FVRVMSEIKNWLRVNPG---KFIYVKFEH 359
Query 224 QLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARA 283
KN + V + + ++++R A NGC P ++ +++ G +
Sbjct 360 SRKNEGYEQDVAREIIETF-----GNMVFRD-----AGNGCNYAPESLTTKQLLDDGKQI 409
Query 284 VLVGSCAPGWSAAVFDWSGVELESGSNSGYR--------PYPACDATYGRG----VYAWR 331
+ F ++G + G+N+ YR P + D Y G + AW
Sbjct 410 MF------------FAFNG---DCGNNTDYRSVIWNRMGPETSDDHDYAAGCPSSLPAWD 454
Query 332 LVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPE--DGRIQAS 389
L R+ + + R A L +V +CG+N G DQ LP+ DG I
Sbjct 455 LGRF-------STIVEDKRGWAWDHYLPVSQVRPALECGINFIGRDQFLPDDADGYIANH 507
Query 390 LWSW--APDEPRAGAGACALQ-GADG--RWVAASCGDPHPAACRDAAGRWTVTPAPVVFA 444
++SW + P G L G+DG + AS + +PA C D G+ T V +
Sbjct 508 IFSWRNGLETPSVGRQHVKLSVGSDGYAHFATASQSEQYPALCMDREGQLQATSQAVSYD 567
Query 445 GAALACTAIGAD--FTLPRTGNQNARLHAVAGPAGGAWVHY 483
A C+ AD FT+P + W++Y
Sbjct 568 QAQATCSNEFADSRFTVPTNARALSLFAKSVNEGDQFWMNY 608
>gi|54302747|ref|YP_132740.1| hypothetical protein PBPRB1068 [Photobacterium profundum SS9]
gi|46916171|emb|CAG22940.1| hypothetical protein PBPRB1068 [Photobacterium profundum SS9]
Length=620
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/401 (25%), Positives = 162/401 (41%), Gaps = 78/401 (19%)
Query 105 QDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYL 164
Q+ L P+ + W+GTHNS+NS D + S A NQ S+ +QL+ VRA+E+D+
Sbjct 264 QNELVTYSPIYKATWMGTHNSYNS-GDYYWAS-AKPNQSTSIVEQLESGVRAIEIDV--- 318
Query 165 PRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNA-PGHTEEVILLYLED 223
G T+ H + + V+ +I NWL PG + I + E
Sbjct 319 --------VGRTLKHKVDTSGTS--------FVRVMSEIKNWLRVNPG---KFIYVKFEH 359
Query 224 QLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARA 283
KN + V + + ++++R A NGC P ++ +++ G +
Sbjct 360 SRKNEGYEQDVAREIIETF-----GNMVFRD-----AGNGCNYAPESLTTKQLLDDGKQI 409
Query 284 VLVGSCAPGWSAAVFDWSGVELESGSNSGYR--------PYPACDATYGRG----VYAWR 331
+ F ++G + G+N+ YR P + D Y G + AW
Sbjct 410 MF------------FAFNG---DCGNNTDYRSVIWNRMGPETSDDHDYAAGCPSSLPAWD 454
Query 332 LVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPE--DGRIQAS 389
L R+ + + R A L +V +CG+N G DQ LP+ DG I
Sbjct 455 LGRF-------STIVEDKRGWAWDHYLPVSQVRPALECGINFIGRDQFLPDDADGYIANH 507
Query 390 LWSW--APDEPRAGAGACALQ-GADG--RWVAASCGDPHPAACRDAAGRWTVTPAPVVFA 444
++SW + P G L G+DG + AS + +PA C D G+ T V +
Sbjct 508 IFSWRNGLETPSVGRQHVKLSVGSDGYAHFATASQSEQYPALCMDRDGQLQATSQAVSYD 567
Query 445 GAALACTAIGAD--FTLPRTGNQNARLHAVAGPAGGAWVHY 483
C+ AD FT+P + + W++Y
Sbjct 568 QTQATCSNEFADSRFTVPTNARELSLFAKSVNEGDQFWMNY 608
>gi|72129125|ref|XP_800669.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
gi|115974394|ref|XP_001183258.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
Length=479
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/401 (23%), Positives = 157/401 (40%), Gaps = 60/401 (14%)
Query 97 WVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADS---------------N 141
W+ + Q LQ + L +HNSF + + D+ N
Sbjct 55 WMQYALKTQRELQIDFTFDQFIMLDSHNSFQARAYGLRYGANDTCVWPPPYPENCTSIAN 114
Query 142 QQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLL-ATVL 200
+ ++ QL++ +R +E+D Y +GA + VCH LG C + +L A +L
Sbjct 115 HEFTIVDQLNLGMRGIEIDNWYC-----YGA--MRVCH-LGTHEYLGVCEADHMLFADLL 166
Query 201 PQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRA 260
I +WL+ P + +E+I LY ++ E ++ ++R A GT ++ P+ R
Sbjct 167 SDIGDWLDQPENQDEIIRLYFNEKEDQGHDDE-----VNAMIRDAFGTRVL-TPSDLRDT 220
Query 261 TNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELE------SGSNSGYR 314
G P + ++R G ++ A G + F V + + +
Sbjct 221 YGGSWP-----TIRKMREDGKHVLI----AAGGTYGFFTHGDVYIHPLYFDADLRTNLFT 271
Query 315 PYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLF 374
PYP C G +VR Y DS L L + + C +
Sbjct 272 PYPDC-----SGRNDTNIVRVYSDS-LNFPLNEKGYYSGEETVGSIKDLTEYVKCRIQYP 325
Query 375 GFDQLLPEDGRIQASLWSWAPDEPRA--GAGACA-LQGADGRW-VAASCGDPHPAACRDA 430
D + P+ I+ +W+WA +P +C L+G D RW V++ C + H AC+
Sbjct 326 TLDMINPD--LIKTGVWTWAEGQPSGELSYDSCVMLKGTDHRWYVSSDCSENHYYACQHD 383
Query 431 AGR--WTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARL 469
WTV+ ++ C G F++P G + +L
Sbjct 384 NDHEVWTVSDEAGPYSTTGDVCPQ-GYSFSIPHNGYRKQKL 423
>gi|325189698|emb|CCA24181.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325192084|emb|CCA26548.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length=727
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 99/442 (23%), Positives = 163/442 (37%), Gaps = 96/442 (21%)
Query 94 LQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTV-----------------S 136
+ W+ RT +Q AL PL E Q GTHNS +LSD + + S
Sbjct 282 INDWLRRTLAYQRALTYKAPLCEAQLPGTHNSAITLSDGYGLRDKAMNAYAFNTPQKPWS 341
Query 137 HADSNQQ-LSLAQQLDIDVRALELDLHYLPR--LEGHGAPGVT---------------VC 178
+ +N Q LSL QLD VR LE+D H+ H G +
Sbjct 342 YIKTNNQALSLTDQLDSGVRFLEVDTHFFLNDFYSAHCGGGTNNIMNQFTFLKDFADQLS 401
Query 179 H-----------GLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLED--QL 225
H G P + + + + T + +I +W+ + +E ++LYL++ ++
Sbjct 402 HYGPVFWDQNLVGCYPSLSGISASKQVKTRTHIAEIRDWIEK--NKDEFLMLYLDNGVEI 459
Query 226 KNASAYESVVATLDQVLRRADGTSLIYRPNPARR-ATNGCVPLPLDVSREEIRASGARAV 284
N + L ++L D + + ++ A++G ++ ++ A G R +
Sbjct 460 TNFQKWNG----LHEILLENDFNKVFVPLSKLKQMASSGWSKTSIN----DLMAEGYRVL 511
Query 285 LVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGRG---------VYAWRLVRY 335
L+ + + ++ G L+ P D G G + + R
Sbjct 512 LLSNTETELFYEINNFCGKLLD-------LPADCPDKKIGNGEVSTPQDPVASSTKFTRM 564
Query 336 Y-EDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWA 394
Y E+ L + N NP+ LT +P C VN+ D L + +++A +WSW
Sbjct 565 YQEELRLFSINGNLKFSHGNPRFLTAETIPQSFKCNVNVIAPDML--DISKMEAMIWSWN 622
Query 395 PDEPR-AGAGACALQGADGRWVAASCGDPHPAACRDAAGR-WTVTP----APVVFAGAAL 448
DEPR G RW+ AC + W + P+ F A
Sbjct 623 VDEPRNVGPDTSVYMTESARWITGEKSLSDWKACFNKENMIWRIVKDLDDCPMQFVYEA- 681
Query 449 ACTAIGADFTLPRTGNQNARLH 470
P+ GNQN L
Sbjct 682 -----------PQNGNQNFLLQ 692
>gi|115373732|ref|ZP_01461026.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|310823620|ref|YP_003955978.1| hypothetical protein STAUR_6394 [Stigmatella aurantiaca DW4/3-1]
gi|115369279|gb|EAU68220.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|309396692|gb|ADO74151.1| uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length=496
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 113/471 (24%), Positives = 176/471 (38%), Gaps = 100/471 (21%)
Query 86 TQAARDAYLQSWVHRTARFQDA-LQDPVPLRETQWLGTHNSFNSLSDSFT------VSHA 138
TQ A + +W + AR Q LQ VPL Q LGTHNS ++ ++T +
Sbjct 40 TQEQPLALVDTWAAKAARIQQRDLQANVPLNRWQRLGTHNS--HVATTYTKCGAGFCYYV 97
Query 139 DSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLAT 198
+NQ SL+ QLD+ +R L LD++ G G VC G + G +
Sbjct 98 RANQHRSLSAQLDMGIRTLMLDVYDYGCQWGWG-----VCFG------HEGEQFVQWSVS 146
Query 199 VLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRR-----------ADG 247
+ +IA W+N P + +EV+ L LED + + + + R
Sbjct 147 LEDEIAQWINTPQNQDEVLFLILEDYFNDDARKRQFFSEIRYRFDRDYWPNANTPVGVTS 206
Query 248 TSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVL---------VGSCAPGWSAAVF 298
LI+RP R P P E+ G R V+ V A G++ +
Sbjct 207 GDLIFRPVDKERLFPSRWPTPA-----ELVQQGKRIVIAVKDRSKYEVSLSAEGYAGPMK 261
Query 299 DWSGVELESGSNSGYRPY------PACDAT----------------YGRGVYAWRLVRYY 336
DW G S P+ PA D G + ++
Sbjct 262 DWFFSVNSVGYPSVQYPWYSANFAPAFDGARCGSTDIKDGSGNTSPLGLQFTQFEELKIC 321
Query 337 EDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPE-----------DGR 385
+ + L + + P N + V A+ DCG ++ DQ +
Sbjct 322 DHFEACSGLYDTS--PFNKRL----DVKAVVDCGFSV-AMDQAEGDPSYTGQGYDYYSRT 374
Query 386 IQASLWSWAPDEPRAGAGA--CALQGADGRWVAASC-GDPHPAACRDA----------AG 432
++ ++WS+A EP G CA GRW SC G AC+ +
Sbjct 375 LKQAIWSFAEGEPNDAGGNEDCAQMTPGGRWNDLSCTGSSRRYACKKKDASCDPASCPSD 434
Query 433 RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHY 483
WTV+ + V+A + AC G F +P+ G +N +L G W+++
Sbjct 435 FWTVSSSAGVWANGSTACPQ-GYAFGVPQNGYENRKLRERIGNE-DVWLNF 483
>gi|301119405|ref|XP_002907430.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105942|gb|EEY63994.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length=411
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 87/383 (23%), Positives = 148/383 (39%), Gaps = 46/383 (12%)
Query 140 SNQQLSLAQQLDIDVRALELDLHY---------------------------LPRLEGHGA 172
++Q SL QL + VR +ELD+H+ + +L G G
Sbjct 6 NDQLFSLTDQLHMGVRFIELDVHWFDGDLHIAHCGGFKSKLLDGMIEVFNEIAKLLGTGI 65
Query 173 PGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYE 232
+ G P +++ + L L ++A WL+AP H +E ++++ +D+ N ++
Sbjct 66 EWDSETIGCKPSLSSIPSKEQRPLKEALKELATWLHAPEHKDEFLMVFFDDE-TNLMKWK 124
Query 233 SVVATLDQVLRRADGTSLIYRPNPARRATNG-CVPLPLDVSREEIRASGARAVLVGSCAP 291
V LD L+ I RP T + L V + + SG G
Sbjct 125 KVGKLLD-YLKDYFPEEEILRPIELAYDTKWPTIEELLRVGKRVVFMSGVDYFSDGEELL 183
Query 292 GWSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPT-R 350
V +W L R + G + + R E S + N +
Sbjct 184 FVKDTVCNWQEPPLPLAPFPACR-FNESKTNIGISDENFTIFRP-ETSEIEYGFLNADGQ 241
Query 351 PPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAGAGACALQGA 410
N L +P + CGVNL D + P+ R++A++W+ + + AL
Sbjct 242 LGINKNLLNEESLPGVAQCGVNLPSPDNITPK--RMEATIWAVSKGQELNPKQCVALMRE 299
Query 411 DGRWVAASCGDPHPA-ACRDAAG--RWTVTPAPVVFAGAALACTAIGA---DFTLPRTGN 464
W + C + AC D +W + A VV A AA+AC ++ + +++P +G
Sbjct 300 SKTWQSVECDTANLVPACVDVKNPRQWQLGSASVVEADAAIACASLASASMTYSVPASGY 359
Query 465 QNARLH-----AVAGPAGGAWVH 482
+N LH A GG W++
Sbjct 360 ENGLLHDQLVQNAASSIGGVWLN 382
>gi|298708683|emb|CBJ26170.1| conserved unknown protein [Ectocarpus siliculosus]
Length=375
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/377 (25%), Positives = 145/377 (39%), Gaps = 67/377 (17%)
Query 112 VPLRETQWLGTHNSFNSLSDSFTVSHAD----SNQQLSLAQQLD-IDVRALELDLHYLPR 166
+PL THNSF+ D +HA+ + Q S+ QL + VR LE+DLHY+
Sbjct 17 IPLSSKTLTATHNSFSH--DRNISTHANFEVVTAQVYSMTDQLSCLGVRGLEIDLHYIDE 74
Query 167 L--EGHGAPGVTVCHG------------------------LGPKNANLGCT-VEPLLATV 199
L EG + +CH + + + GC P
Sbjct 75 LAVEGDEESAIRMCHASEDVAEQMVDLCETFGWDVCEAADIFDYDEDTGCRPGAPTARAG 134
Query 200 LPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARR 259
++A+WL + EV+ L L+ L + E+V + V G +I+ P
Sbjct 135 FEEVASWLFLEENANEVLFLKLDSSLNDED--ETVSTIVSDVF----GEDVIFSPTDWEE 188
Query 260 ATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPAC 319
+ G P S E+ A G R ++ GS + S VF +E ++ C
Sbjct 189 FS-GSDDWP---SPAELVAMGTRVIIAGSES---STLVFSTDSDAVEVLISAEDFDTSEC 241
Query 320 DATYGRGVYAW-----RLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLF 374
G +W +V Y D +T P + +ALT + +CG
Sbjct 242 ADVEGFPEPSWYRVQGDMVEYRLDRANSTIFE---LVPGSDEALTSSMTDNVMNCGFTP- 297
Query 375 GFDQLLPEDGRIQASLWSWAPDEPRAGAG---ACALQGADGRW----VAASCGDPHPAAC 427
FD+L + +++++WSW D P G A + GRW + G+ H AC
Sbjct 298 TFDRLDAD--LMESTIWSWDTDRPETGFDLPRAAVISSDGGRWTDVETDSDSGERHSFAC 355
Query 428 RDAAGRWTVTPAPVVFA 444
R++ + P+ V F
Sbjct 356 RNSGT--VIFPSGVRFT 370
>gi|312219199|emb|CBX99143.1| similar to lectin C-type domain containing protein [Leptosphaeria
maculans]
Length=647
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/156 (32%), Positives = 75/156 (49%), Gaps = 23/156 (14%)
Query 338 DSTLATALANPTRPPANPQALTP-PKVPAMTDCGVNLFGFDQL---------LPEDGRIQ 387
+S+ A A+A +NP + P P + +T CG+ F + L LP +
Sbjct 395 NSSWAVAVAPSLDISSNPDLMVPVPSIANLTSCGLTAFLNETLAGATADKNPLPYAAYVH 454
Query 388 ASLWSWAPDEP-RAGAGA------CALQGAD---GRWVAASCGDPHPAACR--DAAGRWT 435
++LW+WAP EP A +G+ CA+ GRW A C D + AC+ W
Sbjct 455 STLWTWAPGEPANATSGSSNTANRCAVMTTSPYPGRWRVADCADKYHVACQVPSQPYNWQ 514
Query 436 VTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
++P +AGA +AC A+F++P T +NA L A
Sbjct 515 ISPDTTNYAGADMAC-GPDAEFSVPHTALENAHLLA 549
>gi|301094668|ref|XP_002896438.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109413|gb|EEY67465.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length=894
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 111/439 (26%), Positives = 170/439 (39%), Gaps = 99/439 (22%)
Query 67 CRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTARFQDALQDPVPLRETQWLGTHNS- 125
C R+ D+ V +V +A A ++ WV R +Q L L + THNS
Sbjct 378 CIRLDFCDSDDV---CSKVCEAGS-ALIEPWVARAITYQRNLTYSETLCYAELPATHNSV 433
Query 126 -------------FNS-LSDSFTVSHA-DSNQQLSLAQQLDIDVRALELDLHYLPR---- 166
FN+ L+ S S+ SNQ LSL+ QLD+ VR LELD+H+
Sbjct 434 ITQAHGYGNRDQLFNARLNASNAASYMRTSNQFLSLSDQLDLGVRFLELDVHFFASSLRS 493
Query 167 -------------------------LEGHGAPGV----TVCHGLGPKNANLGCTVEPLLA 197
L+ G + G P + + + L
Sbjct 494 AHCSDSGVAFVDDAASALVSSLESVLDASGQDSTVQWGSELVGCLPSLSGIRADEQRLHN 553
Query 198 TVLPQIANWLNAPGHTEEVILLYLE--DQLKNASAYESVVATLDQVLRRADGTSLIYRPN 255
L +IA WL++ H +++++LY E D++ S E+++ + L++ P+
Sbjct 554 ESLGEIATWLSS--HPDDLVVLYTEIGDEVGTYSQSEALLELYTTIFGD-----LLFSPS 606
Query 256 PARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVF-------DWSGVELESG 308
A L +E+ + G + +LV + P + +F W+ V S
Sbjct 607 DFDDAGGDWNGFTL----QELISQGKQVILVTT--PEANDQMFYMRELCAGWADVPSSST 660
Query 309 SNSGYRPYPACDATYGRGVYAWRLVRYYED----STLATALAN----PTRPPANPQALTP 360
SG +G + A LVR ++ +TL + + + P +
Sbjct 661 GASG--------TFFGESMNAGSLVRVFKSVLHYATLTESAMSGGGAEVDTASEPGHVNA 712
Query 361 PKVPAMTDCGVNLFGFDQLLPEDGRIQ-ASLWSWAPDEPRAG-AGACALQGADGRW--VA 416
+P D GVN+ D L DG I A +WSWA +EP A A L ADGRW VA
Sbjct 713 STLPVFVDAGVNILAPDGL---DGAIMTAMVWSWAKNEPDVDTATAVQLSAADGRWYGVA 769
Query 417 ASCGDPHPAACRDAAGRWT 435
S H AC R T
Sbjct 770 DSSSISH-VACVSNNNRTT 787
>gi|299115957|emb|CBN75962.1| conserved unknown protein [Ectocarpus siliculosus]
Length=481
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 91/348 (27%), Positives = 143/348 (42%), Gaps = 60/348 (17%)
Query 177 VCHGLGPKN--ANLGCT-VEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYES 233
+C LG ++ + GC+ P L +I+ WL+ P ++EE++ + +ED +
Sbjct 35 LCEALGIQDFGDDTGCSSTAPSLKETFDEISAWLDLPENSEELLFIKIEDYTGDN----- 89
Query 234 VVATLDQVLRRADGTSLIYRPNP--ARRATNGCVPLPLDVSREEIRASGARAVLVGSCAP 291
V+ L + GT +++ P +G P + E + + G R V G+
Sbjct 90 -VSLLPDYITTVFGTEIVFGPLDFIEWNRISGSEQWP---TTEYLVSQGKRLVF-GTNGE 144
Query 292 GWSAAVF------DWSGVELESG-SNSGYRPYPACDATYGRGVYAWRLVR--------YY 336
+ +F D G+ E S +R C +T R +W V+ Y
Sbjct 145 EDADTMFRISRENDADGLFHEDNISAVRFRTSAPC-STRTRSP-SWSRVQGEASTWVIEY 202
Query 337 EDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRI-QASLWSWAP 395
D TL L P A+ + A+ CG+ + FD++ D R+ +A++WSW
Sbjct 203 TDVTLYALL-----PEADEFFGASDAMDALK-CGL-IPTFDRM---DSRLLEATMWSWEE 252
Query 396 DEPRAGAG---ACALQGADGRWVAASCG-------DPHPAACRDAA----GRWTVTP-AP 440
EP A A + GRW + S D H ACR+ + G W V+ A
Sbjct 253 GEPHAYFSSPRAAVVHQETGRWTSGSASTSEEESDDIHSYACRNDSSGERGEWVVSSGAA 312
Query 441 VVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAG--GAWVHYLLP 486
F+ A L C + G F PRT +NA L G AWV+ L P
Sbjct 313 GYFSAAELVCLSQGLVFGCPRTAEENAALRVSMTDVGVKDAWVNLLSP 360
>gi|156357276|ref|XP_001624147.1| predicted protein [Nematostella vectensis]
gi|156210905|gb|EDO32047.1| predicted protein [Nematostella vectensis]
Length=406
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/407 (23%), Positives = 152/407 (38%), Gaps = 65/407 (15%)
Query 92 AYLQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSF-------------TVSHA 138
A ++ W+ + Q LQ + Q L HN+FN SD + V
Sbjct 21 ARVKPWLAFALKTQRELQSNASFEKYQMLAAHNAFNDRSDGYGEMDDCRWPPPYHGVCID 80
Query 139 DSNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCT-VEPLLA 197
+NQ+ S LD+ VRALE+D + G ++ H +A LGC+ +
Sbjct 81 FANQEFSFTDLLDMGVRALEIDPWWC-----FGKIRMSHAH----DHAYLGCSPWDREFH 131
Query 198 TVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPA 257
+ +IA W+ + +EV+ +YLED + ++ ++ + ++ G ++ PN
Sbjct 132 YGIQEIAEWIKR--NPKEVVRIYLEDSGSHTKGHDDLI---NGPIKDYLGDKVL-TPNDT 185
Query 258 RRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDWSGVELESG-----SNSG 312
NG P + E+R G V+ + +++ G+ + + +
Sbjct 186 LVYFNGRWP-----TVSEMRKLGKTVVVA-------TGNLYNHKGMYIHKSYWQEMTYNK 233
Query 313 YRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVN 372
+ C A + +R Y DST N T CGV
Sbjct 234 FLSQANCSAMGNNSI----PIRVYSDSTKYGPFWNGPWKTG-----TILNYMDFLKCGVT 284
Query 373 LFGFDQLLPEDGRIQASLWSWAPDEP--RAGAGACALQ-GADGRWVAASCGDPHPAACRD 429
DQ+ P + ++++WA EP + C L G D RW A C + H AC
Sbjct 285 YPAADQVNPH--LLATAVFTWAEGEPSTKLQTDTCVLLCGGDKRWHVADCSEKHHFACMS 342
Query 430 AAG--RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAG 474
+ +W ++ V G F LP+T + L G
Sbjct 343 SHDVFKWLIS---AVSGPYNEPICPDGYQFGLPQTARHSVILQEALG 386
>gi|301100928|ref|XP_002899553.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103861|gb|EEY61913.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length=775
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 101/423 (24%), Positives = 158/423 (38%), Gaps = 72/423 (17%)
Query 97 WVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSH----------------ADS 140
W+ T +Q L P Q THNS +L+D F +
Sbjct 375 WLKSTLAYQRNLAFSGPFCFAQIPATHNSAITLADGFGNRDQLFNKNLNPDKWWSYLKTN 434
Query 141 NQQLSLAQQLDIDVRALELDLHYLPR--LEGH----GAPGVTVCHG-LGPKNAN------ 187
NQ LS+ QLDI +R LE+D H+ GH G+ V G LG N
Sbjct 435 NQMLSMTDQLDIGIRFLEIDTHFFLNDLRTGHCGSLGSEAVAGFFGTLGKTLGNYGTYNW 494
Query 188 ----LGC---------TVEPLLATVLPQIANWLNAPGHTEEVILLYLED--QLKNASAYE 232
LGC + +PL + +I WLNA + E +++YL+ +K + +
Sbjct 495 GPELLGCFPSISGIKASEQPLTKDSMDEIKAWLNA--NPTEFVVVYLDTGADIKRSDKFG 552
Query 233 SVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPG 292
++ D + G SL+ P + + + +G + + + +
Sbjct 553 AI----DTLFTDTFGDSLV----PLKALDDLAKGKWTGGRINDFINAGHQVLALANTKTK 604
Query 293 WSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYA---WRLVRYYEDSTLATALA-NP 348
+ +++D E + A G +Y+ W +R + + +LA +
Sbjct 605 AAYSLYDMCTAEKDLTVEFIDDLPDAKRLINGLAIYSNTNW--IRSWSEQIRYISLAASG 662
Query 349 TRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRAG-AGACAL 407
P L +P VNL D + ++ A +WSWA EP A A L
Sbjct 663 ALTRKFPVFLDAESIPKYLRWNVNLIALDN--ADIAKMAALVWSWAEKEPSTTVADAYVL 720
Query 408 QGADGRWVAASCGDPHPAACRDAAGR-WTVTPAPVVFAGAALACTAIGADFTLPRTGNQN 466
+GRWVA++ AC D A W++ V FA A TA F P +Q
Sbjct 721 MDVNGRWVASTDAKKGSRACWDGAKLAWSI----VAFAKDCPAGTA----FKAPTDPSQT 772
Query 467 ARL 469
RL
Sbjct 773 RRL 775
>gi|307103812|gb|EFN52069.1| hypothetical protein CHLNCDRAFT_139316 [Chlorella variabilis]
Length=598
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 112/465 (25%), Positives = 162/465 (35%), Gaps = 127/465 (27%)
Query 94 LQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSF-----------------TVS 136
++ W+ + Q L +PL LGTHNS SL+D +
Sbjct 116 VEPWLAHAIKQQTKLVQTLPLCYQFLLGTHNSAISLADGYGNLDDYFRGFFKYIKWALPG 175
Query 137 HAD-----SNQQLSLAQQLDIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNAN---- 187
AD +NQ LSL QL + VRALELD H++ G + C GL N
Sbjct 176 FADAPLHTNNQLLSLTDQLRLGVRALELDTHWV-----GGVMRIAHCGGLHVPQLNKLIE 230
Query 188 -------------------LGC---------TVEPLLATVLPQIANWLNAPGHTEEVILL 219
LGC + LL + ++ +W+ + +E ++L
Sbjct 231 ALNFVARLLHRSIRWDTETLGCMPSLSSIPSMEQRLLTDAMQEVKDWMEESSNADEFLVL 290
Query 220 YLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLD---VSREEI 276
Y +DQ N + V LD +L P D + +++
Sbjct 291 YFDDQ-PNLKTWGVVGNLLDDILSV----------------------FPRDWIFSTEDKM 327
Query 277 RASGARAVLVG------SCAP---GWSAAVFDWSGVELES---------GSNSGYRPYPA 318
+G R +LV + P G A+ W+ L S P P
Sbjct 328 LEAGKRLMLVSGTDYGDTMEPLIFGRGKALCGWNEPPLASVDGTPECLINQQGMIEPQPL 387
Query 319 CDATYGRGVYAWRLVRYYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQ 378
D R V + L + A T P +A PP +T CG+N+ D
Sbjct 388 FDGMLTR-VISCELQYGPMNCDFAY---RGTNDPVFDEATLPP----VTGCGLNMPSPDL 439
Query 379 LLPEDGRIQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTP 438
L P+ R A++W+WAP P L G + S G A+ G W
Sbjct 440 LTPD--RAAATIWTWAPGHPFDPTSELGL----GDSQSTSAG--RNASLNATDGGW---- 487
Query 439 APVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAG--GAWV 481
V+ A GA+F LPR +N L A AG AW+
Sbjct 488 --VLDASLPRGSCPTGAEFDLPRHPRENYLLAAALQRAGHEAAWL 530
>gi|156370135|ref|XP_001628327.1| predicted protein [Nematostella vectensis]
gi|156215301|gb|EDO36264.1| predicted protein [Nematostella vectensis]
Length=356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/130 (33%), Positives = 56/130 (44%), Gaps = 17/130 (13%)
Query 121 GTHNS---FN----SLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLEGHG-A 172
GTHNS FN SH NQQ + QLD +R ++D Y+ + G
Sbjct 46 GTHNSGSGFNGHLYHWGGGLAGSHFFRNQQWNFTHQLDYGIRYFDIDTCYVGKGNGDWWK 105
Query 173 PGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYL----EDQLKNA 228
G CH +GP A V LL QI NW+ P H EVI++ E+
Sbjct 106 EGAWTCH-MGPAGAAFAGPVRQLL----NQIRNWMEKPEHRNEVIVIKFGRDVEESKNRK 160
Query 229 SAYESVVATL 238
+ YE ++ TL
Sbjct 161 NIYEDILKTL 170
>gi|325181739|emb|CCA16195.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length=376
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/185 (28%), Positives = 78/185 (43%), Gaps = 23/185 (12%)
Query 94 LQSWVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSH-------------ADS 140
+Q WV R R Q + Q +G+HNS S + F VS S
Sbjct 106 MQPWVSRALRLQRLATYRRDICTMQVIGSHNSAISRAYGFGVSDYPSNKNTTEDQYLNTS 165
Query 141 NQQLSLAQQLDIDVRALELDLHYLP---RLEGHGAPGVTVCHGLGPKNANLGCTVEPLLA 197
NQ S+ QL + VR +E+DLHY R+ GA G+ C P ++ + P +
Sbjct 166 NQFFSVLDQLQLGVRFIEVDLHYFGNDLRVAHCGAVGLIGCE---PSSSGIPTYDRPSVN 222
Query 198 TVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPA 257
VL +IA WL T++ + + + ++ V+ L ++ + IYRP+
Sbjct 223 NVLIEIATWLKKS--TDQFVFVLFDGD--TIFPQQNKVSILINYIKSHFVNTEIYRPSDK 278
Query 258 RRATN 262
R N
Sbjct 279 SRTEN 283
>gi|212537327|ref|XP_002148819.1| Lectin C-type domain protein [Penicillium marneffei ATCC 18224]
gi|210068561|gb|EEA22652.1| Lectin C-type domain protein [Penicillium marneffei ATCC 18224]
Length=562
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/157 (28%), Positives = 69/157 (44%), Gaps = 28/157 (17%)
Query 344 ALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQ---------ASLWSWA 394
ALA N + T + T CG++ D L + I+ +S+WSWA
Sbjct 313 ALATLDGISHNTSSSTALLLQNYTGCGISPVINDTLGGQTANIEVDPYRNMSISSMWSWA 372
Query 395 PDEPRAGAGA--------------CALQG--ADGRWVAASCGDPHPAACRDAAG--RWTV 436
DEPR + CA+ ++GRW A +C + + AACR + W +
Sbjct 373 VDEPRNVSSLPGFEDLGPNNDILRCAMLDPTSNGRWRAGNCSNAYRAACRVDSEPYSWVL 432
Query 437 TPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVA 473
+ F+ ++ C + G+ F +PRTG +N L+ A
Sbjct 433 SDRKQSFSDSSNICPS-GSSFDIPRTGLENTYLYHTA 468
>gi|302532317|ref|ZP_07284659.1| predicted protein [Streptomyces sp. C]
gi|302441212|gb|EFL13028.1| predicted protein [Streptomyces sp. C]
Length=401
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/158 (32%), Positives = 70/158 (45%), Gaps = 22/158 (13%)
Query 101 TARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELD 160
TAR+ D D E +L THNSF + DS + NQ S+ QLD VR L LD
Sbjct 107 TARWGDRRLD-----EAAFLTTHNSFTNYEDS---RWSSVNQSESVRAQLDNGVRGLSLD 158
Query 161 LHYLPR-----LEGHGA----PGVTVCHGLGPKNANLGCTV-EPLLATVLPQIANWLNAP 210
H+ R + G+ V +CHG A + + + ++L A
Sbjct 159 THWYERSTWLCVISFGSDCYPSDVYLCHGDCKTFAGATYALPRQSFHGTMQTVVDFLAA- 217
Query 211 GHTEEVILLYLEDQLKNASAYESV--VATLDQVLRRAD 246
H EE + ++LED + +S+ V LDQ+L R D
Sbjct 218 -HPEEFVTVFLEDYVSAGQLRQSLGRVRGLDQLLFRPD 254
>gi|302836171|ref|XP_002949646.1| hypothetical protein VOLCADRAFT_90100 [Volvox carteri f. nagariensis]
gi|300265005|gb|EFJ49198.1| hypothetical protein VOLCADRAFT_90100 [Volvox carteri f. nagariensis]
Length=3693
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 83/352 (24%), Positives = 136/352 (39%), Gaps = 72/352 (20%)
Query 97 WVHRTARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHAD----------------- 139
W+ +Q L PL Q LGTHNS +L+D + + H D
Sbjct 3095 WLRFAVDYQWRLSRKQPLCFAQLLGTHNSAITLADGYGM-HDDVYTQYLHYLGLASGSQR 3153
Query 140 ---SNQQLSLAQQLDIDVRALELDLH------YLPRLEGHGAP----------GVTVCHG 180
+NQ LSL QL++ VR LELD+H ++ G +P + G
Sbjct 3154 LMTNNQVLSLTDQLNLGVRFLELDVHWIQSDLHIAHCGGFHSPQLNALVAALSALAQLFG 3213
Query 181 LGPKN---ANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVAT 237
P LGC +P ++++ + E ++LYL++Q+ + + V
Sbjct 3214 HPPVEWDAETLGC--DPSMSSLPTRDQRTFVDALRESEFLVLYLDNQM-DLLRWGRVGTL 3270
Query 238 LDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAV 297
++QV+ T+LI P T +P E + G R +L+ G +
Sbjct 3271 MEQVMS-VIPTALIITPPELNNITQQRGSMP--SVDELVHVYGKRLLLMSGSDYGEEMSW 3327
Query 298 FDWSG---VELESGSNSGYRPYPACD-----ATYGRGVYAWRLVRYYEDSTLATALANPT 349
+S +++ G++ P C V A +L+R T N T
Sbjct 3328 LAFSHHNLCDMDEPLFRGFQGPPHCQFHNWHLDMDTPVMAGKLIR--------TPTCNLT 3379
Query 350 RPPANPQALTPPKVPAM--------TDCGVNLFGFDQLLPEDGRIQASLWSW 393
P N L +P + T CG+N+ DQ+ P+ +Q+ +WSW
Sbjct 3380 YGPYNCSMLRGDNIPQLDDHQLPEATSCGINVPAPDQITPQ--LVQSYIWSW 3429
>gi|299115958|emb|CBN75963.1| conserved unknown protein [Ectocarpus siliculosus]
Length=376
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 43/136 (32%), Positives = 60/136 (45%), Gaps = 19/136 (13%)
Query 350 RPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRA---GAGACA 406
RP A + A+ +CG+ + FD++ + ++A++WSW EP A A A
Sbjct 107 RPTAEEFFGAGDSMSAL-ECGL-IPTFDRM--DSSLLEATMWSWEEGEPHAYFSSARAAV 162
Query 407 LQGADGRWVAAS-------CGDPHPAACRDAA----GRWTVTP-APVVFAGAALACTAIG 454
GRW + S + H ACRD + G W V+ A F A L C A G
Sbjct 163 AHQETGRWTSGSALANEEESDEVHSYACRDDSSGERGEWVVSNGAAGCFGAAELVCLAQG 222
Query 455 ADFTLPRTGNQNARLH 470
F PRT +NA L
Sbjct 223 LVFGCPRTAKENAALR 238
>gi|339468640|gb|EGP83740.1| hypothetical protein MYCGRDRAFT_76036 [Mycosphaerella graminicola
IPO323]
Length=645
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/109 (33%), Positives = 47/109 (44%), Gaps = 21/109 (19%)
Query 388 ASLWSWAPDEPRAGAGACA--------LQGADGRWVAASCGDPHPAACRDAAGR----WT 435
A +WSWAP +P + + A L GRW A+ C AACR GR W
Sbjct 448 AGIWSWAPSQPINASSSMATANQRCAILNATSGRWSASDCDSSRHAACR--VGREPYVWR 505
Query 436 VTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYL 484
++ + AC DF +PRT +NA L +V W +YL
Sbjct 506 ISQDGAPYDRVEQACDEDDLDFDVPRTALENAHLLSV-------WRNYL 547
>gi|87118774|ref|ZP_01074673.1| hypothetical protein MED121_17144 [Marinomonas sp. MED121]
gi|86166408|gb|EAQ67674.1| hypothetical protein MED121_17144 [Marinomonas sp. MED121]
Length=738
Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/101 (32%), Positives = 44/101 (44%), Gaps = 4/101 (3%)
Query 386 IQASLWSWAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAG 445
I+ +WSW D P+ G+ ACAL G ASC AC D + W +T +
Sbjct 353 IKDFVWSWEKDYPQ-GSNACALSTTGGAVQDASCSADRVHACVDESRNWYLTNTAGEWQE 411
Query 446 AALACTAIGADFTLPRTGNQNARLHAVAGPA---GGAWVHY 483
C A+G F +P +NA L V A W++Y
Sbjct 412 GFAQCAALGYQFAMPYNPYENAALAKVKTEAQVSASVWLNY 452
>gi|224123376|ref|XP_002330300.1| predicted protein [Populus trichocarpa]
gi|222871335|gb|EEF08466.1| predicted protein [Populus trichocarpa]
Length=351
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/115 (29%), Positives = 56/115 (49%), Gaps = 14/115 (12%)
Query 112 VPLRETQWLGTHNSFNSLSD---SFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLE 168
+P + WL THNSF L D + ++ A +NQQ ++ QL+ +R LD++
Sbjct 71 LPFNQYTWLTTHNSFAKLGDRSATGSIILAPTNQQDTVTSQLNNGIRGFMLDMYDFQN-- 128
Query 169 GHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLED 223
+ +CH G N +P + VL +I +L A + E+I +++ED
Sbjct 129 -----DIWLCHSFGGNCYNF-TAFQPAI-NVLKEIQAFLEA--NPSEIITIFIED 174
>gi|326469631|gb|EGD93640.1| hypothetical protein TESG_01181 [Trichophyton tonsurans CBS 112818]
Length=594
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (31%), Positives = 43/98 (44%), Gaps = 15/98 (15%)
Query 388 ASLWSWAPDEPRAGAGACALQGAD------------GRWVAASCGDPHPAACR--DAAGR 433
++ WSWA EPR GA + D GRW A C D + ACR ++
Sbjct 392 STTWSWANGEPRNSTGASTPKTKDSFRCAAMHAVSSGRWHAHDCNDVYRVACRVGNSPHE 451
Query 434 WTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
W ++ + A AC F++PRT +N L+A
Sbjct 452 WIISKHATTYFNAEKACPD-STLFSVPRTALENTYLYA 488
>gi|326478842|gb|EGE02852.1| lectin C-type domain containing protein [Trichophyton equinum
CBS 127.97]
Length=594
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (31%), Positives = 43/98 (44%), Gaps = 15/98 (15%)
Query 388 ASLWSWAPDEPRAGAGACALQGAD------------GRWVAASCGDPHPAACR--DAAGR 433
++ WSWA EPR GA + D GRW A C D + ACR ++
Sbjct 392 STTWSWANGEPRNSTGASTPKTKDSFRCAAMHAVSSGRWHAHDCNDVYRVACRVGNSPHE 451
Query 434 WTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
W ++ + A AC F++PRT +N L+A
Sbjct 452 WIISKHATTYFNAEKACPD-STLFSVPRTALENTYLYA 488
>gi|242809580|ref|XP_002485399.1| Lectin C-type domain protein [Talaromyces stipitatus ATCC 10500]
gi|218716024|gb|EED15446.1| Lectin C-type domain protein [Talaromyces stipitatus ATCC 10500]
Length=559
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/101 (30%), Positives = 49/101 (49%), Gaps = 19/101 (18%)
Query 388 ASLWSWAPDEPRAGAGA--------------CALQG--ADGRWVAASCGDPHPAACRDAA 431
+++WSWA EPR + CA+ ++G W A +C D + AACR +
Sbjct 365 STMWSWAVGEPRNASSLPGYEEIAPSSDILRCAMMDPTSNGHWRAGNCSDTYRAACRVDS 424
Query 432 G--RWTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLH 470
W ++ + FA + C+ G+ F +PRTG +N L+
Sbjct 425 RPYSWVLSDSRQSFADSNKICSN-GSSFDVPRTGLENTYLY 464
>gi|169620399|ref|XP_001803611.1| hypothetical protein SNOG_13399 [Phaeosphaeria nodorum SN15]
gi|111058163|gb|EAT79283.1| hypothetical protein SNOG_13399 [Phaeosphaeria nodorum SN15]
Length=650
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/144 (28%), Positives = 57/144 (40%), Gaps = 23/144 (15%)
Query 353 ANPQALTP-PKVPAMTDCGVNLFGFDQL---------LPEDGRIQASLWSWAPDEPRA-- 400
+ P + P P V +T CG+ L LP + ++LW+WAP EP+
Sbjct 414 SRPDLMIPLPAVSNLTSCGITPLLNQTLGGTTADKNPLPYAAYVHSTLWTWAPGEPKNIT 473
Query 401 -----GAGACALQGA---DGRWVAASCGDPHPAACR--DAAGRWTVTPAPVVFAGAALAC 450
CA+ GRW C D + ACR W ++ + A AC
Sbjct 474 SGGDRSDSRCAVMTTSPYSGRWRVTDCKDRYRVACRVPGQIYNWQISSETSSYFDAVEAC 533
Query 451 TAIGADFTLPRTGNQNARLHAVAG 474
A +F +P T +NA L A G
Sbjct 534 RA-PYEFDVPHTALENAHLIAAIG 556
>gi|302655206|ref|XP_003019396.1| Lectin C-type domain protein [Trichophyton verrucosum HKI 0517]
gi|291183115|gb|EFE38751.1| Lectin C-type domain protein [Trichophyton verrucosum HKI 0517]
Length=594
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/98 (31%), Positives = 42/98 (43%), Gaps = 15/98 (15%)
Query 388 ASLWSWAPDEPRAGAGACALQGAD------------GRWVAASCGDPHPAACR--DAAGR 433
++ WSWA DEPR A + D GRW A C D + ACR +
Sbjct 392 STTWSWAKDEPRNSTRASTPKTKDSFRCAAMHAVSSGRWHAHDCNDVYRVACRVGNLPYE 451
Query 434 WTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
W ++ + A AC F++PRT +N L+A
Sbjct 452 WVISEHATTYFNAEKACPD-NTLFSVPRTALENTYLYA 488
>gi|294817167|ref|ZP_06775809.1| glycoside hydrolase family protein [Streptomyces clavuligerus
ATCC 27064]
gi|326446050|ref|ZP_08220784.1| hypothetical protein SclaA2_33517 [Streptomyces clavuligerus
ATCC 27064]
gi|294321982|gb|EFG04117.1| glycoside hydrolase family protein [Streptomyces clavuligerus
ATCC 27064]
Length=1089
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/139 (33%), Positives = 64/139 (47%), Gaps = 26/139 (18%)
Query 109 QDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYLPRLE 168
QDP L + +L THN+FN+ D F ++ NQ S+AQQL VR L LD+H E
Sbjct 189 QDP-RLDQVTFLTTHNAFNNPKDGFPLA---VNQSNSMAQQLSDGVRGLMLDIH-----E 239
Query 169 GHGAPGVTVCHGLGPKNANLGCTVEPL-LATVLPQIANWLNAPGHTEEVILLYLEDQLKN 227
GA V +CHG C + L L + +L + V+ +++ED K+
Sbjct 240 RDGA--VLMCHGT--------CEIGSKPLKDGLRDVVAFLET--NKNAVVTIFMEDYAKD 287
Query 228 ----ASAYESVVATLDQVL 242
A + V LD V
Sbjct 288 REKLAQQFVDVPGLLDLVF 306
>gi|327303498|ref|XP_003236441.1| hypothetical protein TERG_03486 [Trichophyton rubrum CBS 118892]
gi|326461783|gb|EGD87236.1| hypothetical protein TERG_03486 [Trichophyton rubrum CBS 118892]
Length=594
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/147 (26%), Positives = 59/147 (41%), Gaps = 27/147 (18%)
Query 348 PTRPPANPQALTPPKVPAMTDCGV---------NLFGFDQLLPEDGRIQASLWSWAPDEP 398
P+R +N +L ++ + CG+ N+ + P ++ WSWA DEP
Sbjct 346 PSRNTSNELSLLTRQLAS---CGISAIVNHTLFNVTADTDISPYQNVSFSTTWSWAQDEP 402
Query 399 RAGAGACALQGAD------------GRWVAASCGDPHPAACRDAAG--RWTVTPAPVVFA 444
R A + D GRW C D + ACR + W ++ +
Sbjct 403 RNSTRASTPKTKDSFRCAAMHAVSSGRWHTHDCNDVYRVACRVGSSPHEWVISEHATTYF 462
Query 445 GAALACTAIGADFTLPRTGNQNARLHA 471
A AC + F++PRT +N L+A
Sbjct 463 NAEKACPD-NSLFSVPRTALENTYLYA 488
>gi|302509228|ref|XP_003016574.1| Lectin C-type domain protein [Arthroderma benhamiae CBS 112371]
gi|291180144|gb|EFE35929.1| Lectin C-type domain protein [Arthroderma benhamiae CBS 112371]
Length=501
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/98 (31%), Positives = 42/98 (43%), Gaps = 15/98 (15%)
Query 388 ASLWSWAPDEPRAGAGACALQGAD------------GRWVAASCGDPHPAACR--DAAGR 433
++ WSWA DEPR A + D GRW A C D + ACR +
Sbjct 299 STTWSWAKDEPRNSTRASTPKTKDSFRCAAMHAVSSGRWHAHDCNDVYRVACRVGNLPYE 358
Query 434 WTVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
W ++ + A AC F++PRT +N L+A
Sbjct 359 WVISEHATTYFNAEKACPD-NTLFSVPRTALENTYLYA 395
>gi|189198319|ref|XP_001935497.1| lectin C-type domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187981445|gb|EDU48071.1| lectin C-type domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length=639
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/157 (28%), Positives = 67/157 (43%), Gaps = 24/157 (15%)
Query 338 DSTLATALANPTRPPANPQALTP-PKVPAMTDCGVNLFGFDQL---------LPEDGRIQ 387
+S+ A A P ANP +P P + +T CG+ F L LP +
Sbjct 389 NSSFALTPAPPLSIAANPDFASPIPSIANLTACGLTPFLNQTLANTTADKNPLPYAAYVH 448
Query 388 ASLWSWAPDEPRAGA--------GACALQGAD---GRWVAASCGDPHPAACRDAAG--RW 434
++LW++AP +P + C + RW +CG+ H AC D W
Sbjct 449 STLWTFAPGQPLNASDDSTDPSENRCVVMMRSPYPSRWRVTNCGEAHRVACHDPHKPYAW 508
Query 435 TVTPAPVVFAGAALACTAIGADFTLPRTGNQNARLHA 471
++ +A AA C A A+F++P T +NA L +
Sbjct 509 HISSDATPYANAASFC-ASPAEFSVPHTPLENAHLFS 544
>gi|291451795|ref|ZP_06591185.1| chitinase [Streptomyces albus J1074]
gi|291354744|gb|EFE81646.1| chitinase [Streptomyces albus J1074]
Length=408
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/119 (33%), Positives = 57/119 (48%), Gaps = 16/119 (13%)
Query 109 QDPVP----LRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELDLHYL 164
++P+P L + +L HN+ ++ D S A NQ +A+QLD VRAL LD H
Sbjct 122 REPMPANPTLADLTFLTAHNAMHNTEDQGRSSLAAPNQPHRVARQLDDGVRALMLDAH-- 179
Query 165 PRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVILLYLED 223
H V +CH + N C ATV IA++L+ E V+ ++LED
Sbjct 180 -----HANGRVRMCHAIPVLNP---CGSNADAATVFTAIADFLDR--DREAVVTVFLED 228
>gi|302804570|ref|XP_002984037.1| hypothetical protein SELMODRAFT_119480 [Selaginella moellendorffii]
gi|300148389|gb|EFJ15049.1| hypothetical protein SELMODRAFT_119480 [Selaginella moellendorffii]
Length=359
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/144 (28%), Positives = 70/144 (49%), Gaps = 19/144 (13%)
Query 100 RTARFQD-ALQDPVPLRETQWLGTHNSFN-----SLSDSFTVSHADSNQQLSLAQQLDID 153
RT F L + +P + WL THNSF+ SL+ + ++ NQ+ S+ QQL
Sbjct 30 RTQSFNVLGLNNSMPFNKYSWLTTHNSFSIKGSPSLTGTPILTF--DNQEDSVTQQLQNG 87
Query 154 VRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHT 213
VR L LD++ + +CH + N +P + T L +I +++ +
Sbjct 88 VRGLMLDMYDFMN-------DIWLCHSFQGQCQNFT-AFQPAINT-LREIETFMSQ--NP 136
Query 214 EEVILLYLEDQLKNASAYESVVAT 237
EVI +++ED ++ ++A ++ A
Sbjct 137 SEVITIFIEDYVRRSNAVSTLFAN 160
Lambda K H
0.319 0.134 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 1038195005428
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40