BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3471c
Length=177
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610607|ref|NP_217988.1| hypothetical protein Rv3471c [Mycob... 353 4e-96
gi|289555740|ref|ZP_06444950.1| conserved hypothetical protein [... 352 1e-95
gi|15843083|ref|NP_338120.1| hypothetical protein MT3577 [Mycoba... 352 2e-95
gi|31794647|ref|NP_857140.1| hypothetical protein Mb3500c [Mycob... 349 9e-95
gi|167969810|ref|ZP_02552087.1| hypothetical protein MtubH3_1800... 321 3e-86
gi|254234060|ref|ZP_04927385.1| conserved hypothetical protein [... 260 5e-68
gi|308232472|ref|ZP_07664097.1| hypothetical protein TMAG_03999 ... 226 8e-58
gi|333911254|ref|YP_004484987.1| Cupin 2 barrel domain-containin... 93.2 1e-17
gi|20092417|ref|NP_618492.1| hypothetical protein MA3617 [Methan... 92.8 2e-17
gi|220936297|ref|YP_002515196.1| hypothetical protein Tgr7_3140 ... 92.0 3e-17
gi|289191639|ref|YP_003457580.1| Cupin 2 conserved barrel domain... 91.7 4e-17
gi|334130461|ref|ZP_08504258.1| hypothetical protein METUNv1_012... 90.5 8e-17
gi|73669333|ref|YP_305348.1| hypothetical protein Mbar_A1827 [Me... 89.7 1e-16
gi|256811439|ref|YP_003128808.1| Cupin 2 conserved barrel domain... 89.4 2e-16
gi|15669814|ref|NP_248628.1| hypothetical protein MJ_1618 [Metha... 89.4 2e-16
gi|196232809|ref|ZP_03131659.1| Cupin 2 conserved barrel domain ... 88.2 4e-16
gi|218887715|ref|YP_002437036.1| cupin [Desulfovibrio vulgaris s... 86.7 1e-15
gi|289207320|ref|YP_003459386.1| cupin [Thioalkalivibrio sp. K90... 85.9 2e-15
gi|171910538|ref|ZP_02926008.1| Cupin family protein [Verrucomic... 85.5 3e-15
gi|226942299|ref|YP_002797372.1| hypothetical protein Avin_01320... 84.3 7e-15
gi|46579046|ref|YP_009854.1| cupin family protein [Desulfovibrio... 82.4 2e-14
gi|311232891|gb|ADP85745.1| Cupin 2 conserved barrel domain prot... 82.4 2e-14
gi|21226608|ref|NP_632530.1| hypothetical protein MM_0506 [Metha... 82.0 3e-14
gi|257093661|ref|YP_003167302.1| Cupin 2 barrel domain-containin... 81.6 4e-14
gi|120603370|ref|YP_967770.1| cupin [Desulfovibrio vulgaris DP4]... 81.3 5e-14
gi|119510105|ref|ZP_01629245.1| Cupin [Nodularia spumigena CCY94... 80.5 9e-14
gi|283778367|ref|YP_003369122.1| Cupin 2 barrel domain-containin... 80.1 1e-13
gi|332701417|ref|ZP_08421505.1| Cupin 2 conserved barrel domain ... 79.0 3e-13
gi|87306927|ref|ZP_01089073.1| Cupin region protein [Blastopirel... 78.2 5e-13
gi|297568172|ref|YP_003689516.1| Cupin 2 conserved barrel domain... 77.0 9e-13
gi|147919765|ref|YP_686489.1| hypothetical protein RCIX2009 [unc... 77.0 1e-12
gi|119897242|ref|YP_932455.1| phosphomannose protein [Azoarcus s... 76.6 1e-12
gi|320355102|ref|YP_004196441.1| Cupin 2 conserved barrel domain... 76.3 2e-12
gi|334118212|ref|ZP_08492302.1| Cupin 2 conserved barrel domain ... 75.5 3e-12
gi|91773006|ref|YP_565698.1| cupin family protein [Methanococcoi... 75.5 3e-12
gi|223936780|ref|ZP_03628690.1| Cupin 2 conserved barrel domain ... 75.1 4e-12
gi|218439791|ref|YP_002378120.1| cupin [Cyanothece sp. PCC 7424]... 73.9 8e-12
gi|20095078|ref|NP_614925.1| mannose-6-phosphate isomerase [Meth... 73.2 1e-11
gi|334111873|ref|ZP_08486140.1| Cupin 2 conserved barrel domain ... 73.2 2e-11
gi|344344355|ref|ZP_08775218.1| Cupin 2 conserved barrel domain ... 72.8 2e-11
gi|88604201|ref|YP_504379.1| CMP/dCMP deaminase, zinc-binding [M... 72.4 2e-11
gi|335419957|ref|ZP_08551000.1| Cupin 2 barrel domain-containing... 72.0 3e-11
gi|268323115|emb|CBH36703.1| conserved hypothetical protein, con... 71.6 4e-11
gi|284041664|ref|YP_003392004.1| cupin [Conexibacter woesei DSM ... 71.6 4e-11
gi|345130800|gb|EGW61702.1| Cupin 2 conserved barrel domain prot... 71.6 5e-11
gi|71907533|ref|YP_285120.1| cupin region [Dechloromonas aromati... 71.2 5e-11
gi|149177636|ref|ZP_01856237.1| hypothetical protein PM8797T_272... 70.9 8e-11
gi|15678380|ref|NP_275495.1| hypothetical protein MTH352 [Methan... 70.5 9e-11
gi|218782538|ref|YP_002433856.1| cupin [Desulfatibacillum alkeni... 69.7 1e-10
gi|302036369|ref|YP_003796691.1| hypothetical protein NIDE1004 [... 69.3 2e-10
>gi|15610607|ref|NP_217988.1| hypothetical protein Rv3471c [Mycobacterium tuberculosis H37Rv]
gi|121639391|ref|YP_979615.1| hypothetical protein BCG_3536c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|148663335|ref|YP_001284858.1| hypothetical protein MRA_3511 [Mycobacterium tuberculosis H37Ra]
42 more sequence titles
Length=177
Score = 353 bits (907), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 177/177 (100%), Positives = 177/177 (100%), Gaps = 0/177 (0%)
Query 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF
Sbjct 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
Query 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD
Sbjct 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
Query 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP
Sbjct 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
>gi|289555740|ref|ZP_06444950.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|297733129|ref|ZP_06962247.1| hypothetical protein MtubKR_18655 [Mycobacterium tuberculosis
KZN R506]
gi|289440372|gb|EFD22865.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=177
Score = 352 bits (903), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 176/177 (99%), Positives = 177/177 (100%), Gaps = 0/177 (0%)
Query 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF
Sbjct 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
Query 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
VAP+LSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD
Sbjct 61 VAPELSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
Query 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP
Sbjct 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
>gi|15843083|ref|NP_338120.1| hypothetical protein MT3577 [Mycobacterium tuberculosis CDC1551]
gi|13883428|gb|AAK47934.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
Length=184
Score = 352 bits (902), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/177 (100%), Positives = 177/177 (100%), Gaps = 0/177 (0%)
Query 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF
Sbjct 8 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 67
Query 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD
Sbjct 68 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 127
Query 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP
Sbjct 128 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 184
>gi|31794647|ref|NP_857140.1| hypothetical protein Mb3500c [Mycobacterium bovis AF2122/97]
gi|31620244|emb|CAD95687.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
Length=177
Score = 349 bits (895), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 176/177 (99%), Positives = 176/177 (99%), Gaps = 0/177 (0%)
Query 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF
Sbjct 1 MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPF 60
Query 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD
Sbjct 61 VAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD 120
Query 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCA GPAYLPERDQRMGEAAVIGAWP
Sbjct 121 ESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCAWGPAYLPERDQRMGEAAVIGAWP 177
>gi|167969810|ref|ZP_02552087.1| hypothetical protein MtubH3_18005 [Mycobacterium tuberculosis
H37Ra]
gi|254366080|ref|ZP_04982125.1| hypothetical protein TBHG_03414 [Mycobacterium tuberculosis str.
Haarlem]
gi|254552575|ref|ZP_05143022.1| hypothetical protein Mtube_19365 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|134151593|gb|EBA43638.1| hypothetical protein TBHG_03414 [Mycobacterium tuberculosis str.
Haarlem]
gi|339296291|gb|AEJ48402.1| hypothetical protein CCDC5079_3213 [Mycobacterium tuberculosis
CCDC5079]
gi|339299894|gb|AEJ52004.1| hypothetical protein CCDC5180_3167 [Mycobacterium tuberculosis
CCDC5180]
Length=161
Score = 321 bits (822), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/161 (99%), Positives = 161/161 (100%), Gaps = 0/161 (0%)
Query 17 VLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPFVAPDLSEIRVLVDRAT 76
+LQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPFVAPDLSEIRVLVDRAT
Sbjct 1 MLQATVALSAGHKPAFRGFVKDPPRARAHAAAMFVSNAREAEPFVAPDLSEIRVLVDRAT 60
Query 77 VGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAG 136
VGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAG
Sbjct 61 VGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAG 120
Query 137 VPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
VPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP
Sbjct 121 VPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 161
>gi|254234060|ref|ZP_04927385.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|308406167|ref|ZP_07669532.1| hypothetical protein TMLG_04113 [Mycobacterium tuberculosis SUMu012]
gi|124599589|gb|EAY58693.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|308364299|gb|EFP53150.1| hypothetical protein TMLG_04113 [Mycobacterium tuberculosis SUMu012]
gi|323717867|gb|EGB27057.1| hypothetical protein TMMG_03646 [Mycobacterium tuberculosis CDC1551A]
Length=129
Score = 260 bits (665), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 129/129 (100%), Positives = 129/129 (100%), Gaps = 0/129 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF
Sbjct 1 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMG 168
VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMG
Sbjct 61 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMG 120
Query 169 EAAVIGAWP 177
EAAVIGAWP
Sbjct 121 EAAVIGAWP 129
>gi|308232472|ref|ZP_07664097.1| hypothetical protein TMAG_03999 [Mycobacterium tuberculosis SUMu001]
gi|308370276|ref|ZP_07666908.1| hypothetical protein TMBG_03948 [Mycobacterium tuberculosis SUMu002]
gi|308371359|ref|ZP_07667148.1| hypothetical protein TMCG_02634 [Mycobacterium tuberculosis SUMu003]
19 more sequence titles
Length=113
Score = 226 bits (577), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 112/113 (99%), Positives = 113/113 (100%), Gaps = 0/113 (0%)
Query 65 LSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE 124
+SEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE
Sbjct 1 MSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE 60
Query 125 VGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 177
VGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP
Sbjct 61 VGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAAVIGAWP 113
>gi|333911254|ref|YP_004484987.1| Cupin 2 barrel domain-containing protein [Methanotorris igneus
Kol 5]
gi|333751843|gb|AEF96922.1| Cupin 2 conserved barrel domain protein [Methanotorris igneus
Kol 5]
Length=125
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/116 (39%), Positives = 66/116 (57%), Gaps = 0/116 (0%)
Query 52 SNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLS 111
S + +P+V D S IR L+ + G SLA A V G++T+ H+ ++EIY++L
Sbjct 9 SEYDKIKPYVTKDGSIIRELMHPSVHGDVKQSLAEAIVPVGSKTLLHKHHESEEIYYILE 68
Query 112 GRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRM 167
G+GL+++G+E EV GD + IP P KI +G VPL LC P Y E + M
Sbjct 69 GKGLMTLGNEKFEVKKGDTICIPPETPHKIENIGKVPLKILCCSFPPYSHEDTELM 124
>gi|20092417|ref|NP_618492.1| hypothetical protein MA3617 [Methanosarcina acetivorans C2A]
gi|19917673|gb|AAM06972.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length=121
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/111 (42%), Positives = 64/111 (58%), Gaps = 0/111 (0%)
Query 58 EPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVS 117
EP++ D S IR L+ A G + SLA A V AG ET+ HR + ++EIY + G G+++
Sbjct 11 EPYITKDGSIIRELMHPAVHGNSKQSLAEAIVPAGGETLLHRHRLSEEIYHITEGSGIMT 70
Query 118 VGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMG 168
+G E EV GD++ I G P ++R G L LC C PAY E + MG
Sbjct 71 LGSEEFEVRKGDSICIAPGTPHRVRNAGEEELKILCCCAPAYSHEDTELMG 121
>gi|220936297|ref|YP_002515196.1| hypothetical protein Tgr7_3140 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|219997607|gb|ACL74209.1| conserved hypothetical phosphomannose protein [Thioalkalivibrio
sulfidophilus HL-EbGr7]
Length=137
Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/103 (48%), Positives = 58/103 (57%), Gaps = 0/103 (0%)
Query 60 FVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVG 119
+ D SEIR L+ G + SLA A VA GAET HR T+EIY + G GL+ +G
Sbjct 11 YDTKDGSEIRELMHPDHHGNRAQSLAEAIVAPGAETRLHRHGRTEEIYHITRGEGLMRLG 70
Query 120 DESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
D++ V GD V IP G P IR G PL LCAC PAY E
Sbjct 71 DQTFAVTVGDTVCIPPGTPHNIRNTGETPLHILCACSPAYSHE 113
>gi|289191639|ref|YP_003457580.1| Cupin 2 conserved barrel domain protein [Methanocaldococcus sp.
FS406-22]
gi|288938089|gb|ADC68844.1| Cupin 2 conserved barrel domain protein [Methanocaldococcus sp.
FS406-22]
Length=123
Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/102 (41%), Positives = 61/102 (60%), Gaps = 0/102 (0%)
Query 58 EPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVS 117
+P++ D S IR L+ SLA A V G++T+ H+ ++EIY++L G+GL++
Sbjct 12 KPYITKDGSIIRELMHPNIYKNVKQSLAEAIVPVGSKTLLHKHHKSEEIYYILEGKGLMT 71
Query 118 VGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+G+E EV GDA+ IP P KI +G+VPL LC P Y
Sbjct 72 LGNEKFEVKKGDAILIPPETPHKIENIGNVPLKILCCSFPPY 113
>gi|334130461|ref|ZP_08504258.1| hypothetical protein METUNv1_01284 [Methyloversatilis universalis
FAM5]
gi|333444570|gb|EGK72519.1| hypothetical protein METUNv1_01284 [Methyloversatilis universalis
FAM5]
Length=128
Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/112 (46%), Positives = 64/112 (58%), Gaps = 1/112 (0%)
Query 60 FVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVG 119
+V D S IR L+ G + SLA ATV G T+ HR T+E+Y V +GRGL+++
Sbjct 17 YVTKDGSIIRELMHPQQHGGRAQSLAEATVPPGTRTLLHRHGLTEELYHVTAGRGLMTLS 76
Query 120 DESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGEAA 171
+ EVGPGD V IP G I LG PLT LC C PAY E D + +AA
Sbjct 77 ERRFEVGPGDTVLIPPGAAHCIETLGEAPLTLLCCCSPAYAHE-DTELLDAA 127
>gi|73669333|ref|YP_305348.1| hypothetical protein Mbar_A1827 [Methanosarcina barkeri str.
Fusaro]
gi|72396495|gb|AAZ70768.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length=121
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/108 (41%), Positives = 62/108 (58%), Gaps = 0/108 (0%)
Query 55 REAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRG 114
E EP++ D S IR L+ G ++ SLA ATV AG +T+ H+ T+EIY + G G
Sbjct 8 EEVEPYITKDSSIIRELMHPFVHGNSNQSLAEATVPAGGKTILHKHCLTEEIYHITDGSG 67
Query 115 LVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
++++G E +V G V I G P +I+ G+ PL LC C PAY E
Sbjct 68 IMTLGSEEFKVKKGYTVCISPGTPHRIQNTGNTPLKILCCCAPAYSHE 115
>gi|256811439|ref|YP_003128808.1| Cupin 2 conserved barrel domain protein [Methanocaldococcus fervens
AG86]
gi|256794639|gb|ACV25308.1| Cupin 2 conserved barrel domain protein [Methanocaldococcus fervens
AG86]
Length=123
Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/112 (40%), Positives = 63/112 (57%), Gaps = 0/112 (0%)
Query 58 EPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVS 117
+P++ D S IR L+ SLA A V G++T+ HR ++EIY++L G+GL++
Sbjct 12 KPYITKDGSIIRELMHPNIYEWVKQSLAEAIVPVGSKTLLHRHHKSEEIYYILEGKGLMT 71
Query 118 VGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMGE 169
+GDE EV GD + IP KI +GSVPL LC P Y E + + E
Sbjct 72 LGDEKFEVKEGDTILIPPKTDHKIENIGSVPLKILCCSYPPYSHEDTEILEE 123
>gi|15669814|ref|NP_248628.1| hypothetical protein MJ_1618 [Methanocaldococcus jannaschii DSM
2661]
gi|42559937|sp|Q59013.1|Y1618_METJA RecName: Full=Uncharacterized protein MJ1618
gi|1592216|gb|AAB99639.1| conserved hypothetical protein [Methanocaldococcus jannaschii
DSM 2661]
Length=125
Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/104 (39%), Positives = 61/104 (59%), Gaps = 0/104 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGL 115
+ +P++ D S IR L+ SLA A V G++T+ H+ ++EIY++L GRGL
Sbjct 13 KIKPYITKDGSIIRELLHPNIYKGVKQSLAEAIVPVGSKTLLHKHYTSEEIYYILEGRGL 72
Query 116 VSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+++ +E EV GD ++IP P KI +G+VPL LC P Y
Sbjct 73 MTLDNEKFEVKKGDTIYIPPKTPHKIENIGNVPLKILCCSYPPY 116
>gi|196232809|ref|ZP_03131659.1| Cupin 2 conserved barrel domain protein [Chthoniobacter flavus
Ellin428]
gi|196223008|gb|EDY17528.1| Cupin 2 conserved barrel domain protein [Chthoniobacter flavus
Ellin428]
Length=117
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/114 (44%), Positives = 61/114 (54%), Gaps = 2/114 (1%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M V N R PF D S IR ++DRA V + SLA A V AG T H + ++E YF
Sbjct 1 MTVLNLRTQPPFTTKDGSTIRSILDRANAPVQNQSLAEAQVPAGGATQRHYHKLSEEFYF 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
+L G G + + E VGPGDA+ IP G I A + L FLC C PAY E
Sbjct 61 ILEGNGEMEINGEKRTVGPGDAILIPPGAWHTIVARET--LRFLCCCAPAYAHE 112
>gi|218887715|ref|YP_002437036.1| cupin [Desulfovibrio vulgaris str. 'Miyazaki F']
gi|218758669|gb|ACL09568.1| Cupin 2 conserved barrel domain protein [Desulfovibrio vulgaris
str. 'Miyazaki F']
Length=138
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/104 (45%), Positives = 60/104 (58%), Gaps = 0/104 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGL 115
+A P+V D S IR L+ A G + SLA A V G T+ HR ++E+Y V +G+GL
Sbjct 22 DAAPYVTRDGSIIRELMHPAVHGNRNQSLAEAEVPPGCVTLLHRHPQSEELYHVTAGQGL 81
Query 116 VSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+++GD S VGPGD V I P +I G VPL LC C P Y
Sbjct 82 MTLGDASFAVGPGDTVHIAPSTPHRIANTGDVPLLVLCCCAPPY 125
>gi|289207320|ref|YP_003459386.1| cupin [Thioalkalivibrio sp. K90mix]
gi|288942951|gb|ADC70650.1| Cupin 2 conserved barrel domain protein [Thioalkalivibrio sp.
K90mix]
Length=118
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 58/100 (58%), Gaps = 0/100 (0%)
Query 60 FVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVG 119
+ D SEIR L+ + G A+ SLA A VA GA T HR T+E+Y + GRG + +G
Sbjct 11 YDTKDGSEIRELMHPDSHGNAAQSLAEAVVAPGATTHLHRHAQTEELYHITRGRGEMRLG 70
Query 120 DESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+E+ EV GD V I G P IR G+ PL LC+C P Y
Sbjct 71 EETFEVTVGDTVCIHPGTPHNIRNTGTEPLHILCSCAPPY 110
>gi|171910538|ref|ZP_02926008.1| Cupin family protein [Verrucomicrobium spinosum DSM 4136]
Length=122
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/114 (40%), Positives = 63/114 (56%), Gaps = 2/114 (1%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M ++ + PF D S IR ++DR V + SLA A++ G T H + ++E YF
Sbjct 6 MTINRLSDQPPFTTKDGSTIRSILDRTNAPVQNQSLAEASLPVGRATDRHYHKLSEEFYF 65
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
+L G G + + E+ EVGPGDA+ IPA +I A +V L FLC C P Y E
Sbjct 66 ILEGSGRMEIDGETREVGPGDAILIPANAWHQITA--TVDLRFLCCCAPPYAHE 117
>gi|226942299|ref|YP_002797372.1| hypothetical protein Avin_01320 [Azotobacter vinelandii DJ]
gi|226717226|gb|ACO76397.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
Length=122
Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/83 (52%), Positives = 57/83 (69%), Gaps = 1/83 (1%)
Query 81 SVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE-VGPGDAVWIPAGVPQ 139
++S+A ATVA G T HRL+ +E Y +L G+GLVS+GDE+ E VGPGD V IPAG+ Q
Sbjct 33 ALSIALATVAPGETTQSHRLRGVEERYLILHGQGLVSLGDEAPEPVGPGDLVLIPAGMAQ 92
Query 140 KIRALGSVPLTFLCACGPAYLPE 162
+I +G+ L F C C P + PE
Sbjct 93 RIANIGNGELAFYCICTPRFTPE 115
>gi|46579046|ref|YP_009854.1| cupin family protein [Desulfovibrio vulgaris str. Hildenborough]
gi|46448459|gb|AAS95113.1| cupin family protein [Desulfovibrio vulgaris str. Hildenborough]
Length=205
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/104 (45%), Positives = 56/104 (54%), Gaps = 0/104 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGL 115
+A P+V D S IR L+ A G A+ SLA A V G T+ H T+E+Y VL G G
Sbjct 93 DAPPYVTRDGSIIRELMHPAVHGNANQSLAEAEVPPGCATLRHTHPRTEELYHVLEGDGE 152
Query 116 VSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+++ D V PGD V IP G P IR G L LC C PAY
Sbjct 153 MALDDAVFAVAPGDTVCIPPGTPHSIRNTGVTSLRILCCCSPAY 196
>gi|311232891|gb|ADP85745.1| Cupin 2 conserved barrel domain protein [Desulfovibrio vulgaris
RCH1]
Length=150
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/104 (45%), Positives = 56/104 (54%), Gaps = 0/104 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGL 115
+A P+V D S IR L+ A G A+ SLA A V G T+ H T+E+Y VL G G
Sbjct 38 DAPPYVTRDGSIIRELMHPAVHGNANQSLAEAEVPPGCATLRHTHPRTEELYHVLEGDGE 97
Query 116 VSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+++ D V PGD V IP G P IR G L LC C PAY
Sbjct 98 MALDDAVFAVAPGDTVCIPPGTPHSIRNTGVTSLRILCCCSPAY 141
>gi|21226608|ref|NP_632530.1| hypothetical protein MM_0506 [Methanosarcina mazei Go1]
gi|20904886|gb|AAM30202.1| conserved protein [Methanosarcina mazei Go1]
Length=121
Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/110 (39%), Positives = 60/110 (55%), Gaps = 0/110 (0%)
Query 58 EPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVS 117
+P++ D S IR L+ A G + SLA ATV AG +T+ H+ +EIY + G G+++
Sbjct 11 DPYITKDGSIIRELMHPAIHGNSGQSLAEATVPAGGQTMLHKHAVAEEIYHITEGCGIMT 70
Query 118 VGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRM 167
+G + E+ GD V I G P +I G L LC C PAY E + M
Sbjct 71 LGGQEFEIRKGDTVCILPGTPHRIMNTGEKELKILCCCAPAYSHEDTELM 120
>gi|257093661|ref|YP_003167302.1| Cupin 2 barrel domain-containing protein [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257046185|gb|ACV35373.1| Cupin 2 conserved barrel domain protein [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length=230
Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/101 (41%), Positives = 59/101 (59%), Gaps = 0/101 (0%)
Query 59 PFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSV 118
P++ D SEIR L+ A G + SLA AT+ AG T+ HR + T+EIY + +G G +++
Sbjct 116 PYITKDGSEIRELMHPAVQGNRNQSLAEATLPAGTRTMLHRHRVTEEIYHITAGEGQMTL 175
Query 119 GDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
G + +VG GD + I G +I A + L LC C PAY
Sbjct 176 GADLFKVGVGDTICIAPGTAHRIEASTTGALVLLCCCSPAY 216
>gi|120603370|ref|YP_967770.1| cupin [Desulfovibrio vulgaris DP4]
gi|120563599|gb|ABM29343.1| Cupin 2, conserved barrel domain protein [Desulfovibrio vulgaris
DP4]
Length=150
Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 46/104 (45%), Positives = 56/104 (54%), Gaps = 0/104 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGL 115
+A P+V D S IR L+ A G A+ SLA A V G T+ H T+E+Y VL G G
Sbjct 38 DAPPYVTRDGSIIRELMHPAVHGNANQSLAEAEVPPGCATLRHIHPRTEELYHVLGGDGE 97
Query 116 VSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+++ D V PGD V IP G P IR G L LC C PAY
Sbjct 98 MALDDAVFAVAPGDTVCIPPGTPHSIRNTGVTSLRILCCCSPAY 141
>gi|119510105|ref|ZP_01629245.1| Cupin [Nodularia spumigena CCY9414]
gi|119465292|gb|EAW46189.1| Cupin [Nodularia spumigena CCY9414]
Length=119
Score = 80.5 bits (197), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 45/116 (39%), Positives = 67/116 (58%), Gaps = 4/116 (3%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLV--DRATVGVASVSLAHATVAAGAETVWHRLQATDEI 106
M V + + F+A D + +R L+ D+ + + SLAHAT+ G + H L T E+
Sbjct 1 MLVQKLNDCQEFIAGDNTILRELLHPDKQPLAL-RYSLAHATLPVGKTSQPHSL-TTSEV 58
Query 107 YFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
Y++L+G+G + + DE+ V PGDAV+IP Q IR GS PL F+C PA+ E
Sbjct 59 YYILNGQGEMHIDDETQIVEPGDAVYIPPNTRQFIRNSGSEPLVFICMVDPAWRKE 114
>gi|283778367|ref|YP_003369122.1| Cupin 2 barrel domain-containing protein [Pirellula staleyi DSM
6068]
gi|283436820|gb|ADB15262.1| Cupin 2 conserved barrel domain protein [Pirellula staleyi DSM
6068]
Length=119
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/111 (41%), Positives = 57/111 (52%), Gaps = 0/111 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M V N + PF+ D S+IR ++ + + SLA A V G T H + T+EIYF
Sbjct 1 MEVVNIQSTTPFITKDGSQIREILAHRNSSIRNQSLAEAVVYPGRWTAAHFHRVTEEIYF 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+L G GL+ + E V GDAV IP G I S PL LC C PAY
Sbjct 61 ILDGSGLMRLDGEERPVFTGDAVAIPPGKIHSILCTSSQPLKMLCCCAPAY 111
>gi|332701417|ref|ZP_08421505.1| Cupin 2 conserved barrel domain protein [Desulfovibrio africanus
str. Walvis Bay]
gi|332551566|gb|EGJ48610.1| Cupin 2 conserved barrel domain protein [Desulfovibrio africanus
str. Walvis Bay]
Length=122
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 59/105 (57%), Gaps = 1/105 (0%)
Query 56 EAEPFVAPDLSEIRVLVDRAT-VGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRG 114
+ EP+ D S +R L+ + G+ + SLA A V GA+T+ HR ++E+Y VLSG+G
Sbjct 10 QVEPYRTKDGSTVRELMHPSVHAGIRNQSLAEAVVQPGAKTLVHRHAKSEELYHVLSGQG 69
Query 115 LVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
++ + E V PGD V IP G P + G L LC C PAY
Sbjct 70 VMLLAGERFAVNPGDTVLIPPGTPHGLDNPGPDNLIILCCCAPAY 114
>gi|87306927|ref|ZP_01089073.1| Cupin region protein [Blastopirellula marina DSM 3645]
gi|87290300|gb|EAQ82188.1| Cupin region protein [Blastopirellula marina DSM 3645]
Length=120
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/111 (37%), Positives = 57/111 (52%), Gaps = 0/111 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M + N E F D SEIR L+ + SLA A + G +T+ H + T+EIY+
Sbjct 1 MDIYNLDEVPAFTTKDGSEIRELLAHRNSAIRQQSLAEARIPIGRQTIAHYHKLTEEIYY 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+ G +++ E V GDA+ IP G +I +G+V L LC C PAY
Sbjct 61 ITQGTAEMNIDGEVRNVTVGDAIAIPPGATHQITNIGAVELRLLCCCAPAY 111
>gi|297568172|ref|YP_003689516.1| Cupin 2 conserved barrel domain protein [Desulfurivibrio alkaliphilus
AHT2]
gi|296924087|gb|ADH84897.1| Cupin 2 conserved barrel domain protein [Desulfurivibrio alkaliphilus
AHT2]
Length=119
Score = 77.0 bits (188), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/117 (39%), Positives = 64/117 (55%), Gaps = 0/117 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M S+ R+ + ++ D SEIR L+ G A+ SLA A V GA T+ HR + ++EIY
Sbjct 1 MSRSSYRKIKAYITKDGSEIRELMHPGVHGNANQSLAEARVPPGALTLAHRHRVSEEIYH 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQ 165
+GRG++ +G+ + V GD V I G + G PL LCAC PAY E +
Sbjct 61 FTAGRGVMGLGETTFAVAAGDTVAISPGTTHWLENPGPGPLVVLCACSPAYSHEDTE 117
>gi|147919765|ref|YP_686489.1| hypothetical protein RCIX2009 [uncultured methanogenic archaeon
RC-I]
gi|110621885|emb|CAJ37163.1| conserved hypothetical protein [uncultured methanogenic archaeon
RC-I]
Length=119
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/113 (39%), Positives = 59/113 (53%), Gaps = 1/113 (0%)
Query 50 FVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFV 109
V N +P+ D S I L+ V VS+A A V +G ET H + EIY+V
Sbjct 1 MVQNRERVKPYTTKDGSTIWELLHPLKHPVMDVSVAEAYVESGNETRLHVHHESQEIYYV 60
Query 110 LSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
L G G++ +G+ +V GDA+ IP G P KI+A G + LC C P Y+ E
Sbjct 61 LDGEGIMWLGNRRIDVRTGDAILIPQGTPHKIKA-GEGGVRILCICAPPYMHE 112
>gi|119897242|ref|YP_932455.1| phosphomannose protein [Azoarcus sp. BH72]
gi|119669655|emb|CAL93568.1| conserved hypothetical phosphomannose protein [Azoarcus sp. BH72]
Length=119
Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 58/100 (58%), Gaps = 0/100 (0%)
Query 60 FVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVG 119
+V D SEIR L+ A G + SLA A VA G TV HR + ++E+Y V +G G++++G
Sbjct 11 YVTKDGSEIRELLHPALHGARNQSLAEAVVAPGMRTVLHRHRRSEELYHVTAGSGIMTLG 70
Query 120 DESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
++ V GD V IP G I+A + L LC C PAY
Sbjct 71 EDRFAVEVGDTVLIPPGTAHCIQAGEAAALHILCCCSPAY 110
>gi|320355102|ref|YP_004196441.1| Cupin 2 conserved barrel domain-containing protein [Desulfobulbus
propionicus DSM 2032]
gi|320123604|gb|ADW19150.1| Cupin 2 conserved barrel domain protein [Desulfobulbus propionicus
DSM 2032]
Length=124
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 49/82 (60%), Gaps = 1/82 (1%)
Query 81 SVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDE-SGEVGPGDAVWIPAGVPQ 139
+VS+A A VAAG T WH L+ E Y +L G GLV VGDE +V GD V IP G Q
Sbjct 25 AVSVARARVAAGQSTRWHGLEGIQERYLLLEGNGLVEVGDEPPRQVQAGDVVLIPPGCRQ 84
Query 140 KIRALGSVPLTFLCACGPAYLP 161
+I +G V L FL C P +LP
Sbjct 85 RITNIGHVDLLFLAVCTPRFLP 106
>gi|334118212|ref|ZP_08492302.1| Cupin 2 conserved barrel domain protein [Microcoleus vaginatus
FGP-2]
gi|333460197|gb|EGK88807.1| Cupin 2 conserved barrel domain protein [Microcoleus vaginatus
FGP-2]
Length=120
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/118 (37%), Positives = 65/118 (56%), Gaps = 4/118 (3%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLV--DRATVGVASVSLAHATVAAGAETVWHRLQATDEI 106
M + + F+A D +++R L+ D+ V + SLAHAT+ G + H L T E+
Sbjct 1 MLIQKLNACDEFIAGDGTQLRELLHPDKQAVDL-RYSLAHATLPPGQTSALHSL-TTSEV 58
Query 107 YFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERD 164
Y++LSG G + + DE+ V GDAV+IP Q I G+ PL F+C PA+ E +
Sbjct 59 YYILSGVGEMHIDDENQFVEAGDAVYIPPNAKQFIYNCGTEPLIFICIVDPAWRKEDE 116
>gi|91773006|ref|YP_565698.1| cupin family protein [Methanococcoides burtonii DSM 6242]
gi|91712021|gb|ABE51948.1| Cupin family protein [Methanococcoides burtonii DSM 6242]
Length=119
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/102 (41%), Positives = 54/102 (53%), Gaps = 0/102 (0%)
Query 58 EPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVS 117
EPF+ D S IR L+ G S+A A V G+ T+ HR +EIY + +G G ++
Sbjct 9 EPFITKDGSIIRELMHPDGGGSEKQSVAEAIVPMGSSTLAHRHPVAEEIYHITTGSGRMT 68
Query 118 VGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+ + EV GD V I +GV KI GS L LC C PAY
Sbjct 69 LDSDVFEVNAGDTVLINSGVSHKIENTGSEDLKILCCCSPAY 110
>gi|223936780|ref|ZP_03628690.1| Cupin 2 conserved barrel domain protein [bacterium Ellin514]
gi|223894631|gb|EEF61082.1| Cupin 2 conserved barrel domain protein [bacterium Ellin514]
Length=121
Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/111 (37%), Positives = 54/111 (49%), Gaps = 0/111 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M + N E + F D SEIR L+ + + SLA A V T H +EIYF
Sbjct 1 MDIRNIAEMQAFKTKDGSEIRELLAYRNSIIKNQSLAEARVPVNQSTQEHYHSKAEEIYF 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+ SG G + + +E +V GDA+ IP G K+ G L LC C PAY
Sbjct 61 ITSGVGRIRIENEVRDVKAGDAIAIPPGQKHKLWNTGQETLKLLCCCAPAY 111
>gi|218439791|ref|YP_002378120.1| cupin [Cyanothece sp. PCC 7424]
gi|218172519|gb|ACK71252.1| Cupin 2 conserved barrel domain protein [Cyanothece sp. PCC 7424]
Length=119
Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/118 (36%), Positives = 64/118 (55%), Gaps = 4/118 (3%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLV--DRATVGVASVSLAHATVAAGAETVWHRLQATDEI 106
M V E E F A D + +R L+ D+ + + SLAHA V G ++ H L T E+
Sbjct 1 MLVQKLAECEEFTAGDGTLLRELLHPDKQPIAL-RYSLAHAIVPVGQTSIVHSL-TTSEV 58
Query 107 YFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERD 164
Y+++SG+G + + +E V GDA++IP Q I G+ PL F+C PA+ E +
Sbjct 59 YYMISGKGEMHIDEEVQNVEAGDAIYIPPNAKQYIHNSGNEPLIFICIVDPAWRKEDE 116
>gi|20095078|ref|NP_614925.1| mannose-6-phosphate isomerase [Methanopyrus kandleri AV19]
gi|19888360|gb|AAM02855.1| Mannose-6-phosphate isomerase [Methanopyrus kandleri AV19]
Length=132
Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/114 (42%), Positives = 58/114 (51%), Gaps = 0/114 (0%)
Query 55 REAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRG 114
R++ P+V D S I +V V +VSLA A + G TV H DE+Y+VL GRG
Sbjct 6 RDSVPYVTLDGSLIYEVVRPEFSRVNTVSLAVAEIPPGESTVPHYHLDFDEVYWVLEGRG 65
Query 115 LVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRMG 168
+V VG S EV P D V IP G + GS L LC C P Y E +G
Sbjct 66 IVHVGSRSLEVHPEDCVEIPRGSVHWVENDGSETLRILCVCSPPYRHETTVTLG 119
>gi|334111873|ref|ZP_08486140.1| Cupin 2 conserved barrel domain protein [Methylomicrobium album
BG8]
gi|333597978|gb|EGL02807.1| Cupin 2 conserved barrel domain protein [Methylomicrobium album
BG8]
Length=139
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/83 (46%), Positives = 49/83 (60%), Gaps = 1/83 (1%)
Query 81 SVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGD-ESGEVGPGDAVWIPAGVPQ 139
+VS+A A VA G T WHR++ T E Y +L GRG V +G +VGPGD V IP PQ
Sbjct 33 AVSIARARVAPGVTTRWHRVRETAERYVILEGRGRVEIGSLPPQDVGPGDVVLIPPSCPQ 92
Query 140 KIRALGSVPLTFLCACGPAYLPE 162
+I +G+ L FL C P + E
Sbjct 93 RIANIGAGDLIFLAVCTPRFTNE 115
>gi|344344355|ref|ZP_08775218.1| Cupin 2 conserved barrel domain protein [Marichromatium purpuratum
984]
gi|343804025|gb|EGV21928.1| Cupin 2 conserved barrel domain protein [Marichromatium purpuratum
984]
Length=127
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/83 (49%), Positives = 49/83 (60%), Gaps = 1/83 (1%)
Query 81 SVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE-VGPGDAVWIPAGVPQ 139
VS+A A VA G T HRL AT E Y +L G G +++ D + + VGPGD V IPAG PQ
Sbjct 34 EVSIARARVAPGTATRLHRLAATTERYLILHGSGRIALDDGTDQDVGPGDLVRIPAGTPQ 93
Query 140 KIRALGSVPLTFLCACGPAYLPE 162
+I G L FL C P + PE
Sbjct 94 RIANTGMDDLIFLAICTPRFRPE 116
>gi|88604201|ref|YP_504379.1| CMP/dCMP deaminase, zinc-binding [Methanospirillum hungatei JF-1]
gi|88189663|gb|ABD42660.1| CMP/dCMP deaminase, zinc-binding protein [Methanospirillum hungatei
JF-1]
Length=274
Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/87 (43%), Positives = 52/87 (60%), Gaps = 1/87 (1%)
Query 83 SLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIR 142
S+AHA V G T+ HRL + E+Y++LSG G++ +GDE E+G G +IP G Q I
Sbjct 37 SIAHAQVPVGVTTLPHRLIRSSEVYYILSGTGIMHIGDEHMEIGEGQLAYIPPGKVQWIE 96
Query 143 ALGSVPLTFLCACGPAYLPERDQRMGE 169
G+ L FL C P + E D+ +GE
Sbjct 97 NTGTRDLIFLAICDPLW-REEDEVVGE 122
>gi|335419957|ref|ZP_08551000.1| Cupin 2 barrel domain-containing protein [Salinisphaera shabanensis
E1L3A]
gi|334895603|gb|EGM33771.1| Cupin 2 barrel domain-containing protein [Salinisphaera shabanensis
E1L3A]
Length=125
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/87 (44%), Positives = 50/87 (58%), Gaps = 1/87 (1%)
Query 76 TVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDESGE-VGPGDAVWIP 134
+V ++VS+A A VAAG T WHRL T E Y ++SG G+V VG++ + V GD VWIP
Sbjct 32 SVDDSAVSIARARVAAGVTTAWHRLHGTAERYLIISGEGIVEVGNDLRDTVTAGDVVWIP 91
Query 135 AGVPQKIRALGSVPLTFLCACGPAYLP 161
Q+I G L F C P + P
Sbjct 92 PDAAQRIINTGDDELLFYAICTPRFEP 118
>gi|268323115|emb|CBH36703.1| conserved hypothetical protein, containing cupin domain [uncultured
archaeon]
gi|268326055|emb|CBH39643.1| conserved hypothetical protein, containing cupin domain [uncultured
archaeon]
Length=122
Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/120 (33%), Positives = 61/120 (51%), Gaps = 3/120 (2%)
Query 49 MFVSNAREAEPFVAPD---LSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDE 105
M + + + E F A D L E+ V S+AHA V G T+ H+L+ + E
Sbjct 1 MLIKDIQNGEYFRAIDNTILCELLHPAKEDEVLNIRYSIAHAIVKPGETTLPHKLKTSTE 60
Query 106 IYFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQ 165
+Y+VL G G++ + +ES EV G A++IP Q I+ G+ L LC P + E ++
Sbjct 61 VYYVLDGEGIIHIDEESAEVHSGQAIYIPPNTKQYIQNRGNSDLKILCIVYPMWRIEDEK 120
>gi|284041664|ref|YP_003392004.1| cupin [Conexibacter woesei DSM 14684]
gi|283945885|gb|ADB48629.1| Cupin 2 conserved barrel domain protein [Conexibacter woesei
DSM 14684]
Length=126
Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 43/114 (38%), Positives = 56/114 (50%), Gaps = 0/114 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M ++ E EP+V D S IR + SLA AT+ G T H +A +E+Y
Sbjct 1 MHLARHAELEPYVTRDGSTIREWAGPGYSPARNQSLAEATLPPGRATTAHYHRAAEELYL 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPE 162
+GRG + VGD +V GD V IP G K+ G L +CAC PAY E
Sbjct 61 FTAGRGRLRVGDAERDVQSGDCVVIPPGAVHKLWNTGDDDLVLVCACSPAYSHE 114
>gi|345130800|gb|EGW61702.1| Cupin 2 conserved barrel domain protein [Dechlorosoma suillum
PS]
Length=130
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 0/100 (0%)
Query 60 FVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVG 119
+ D SEIR L+ G SLA ATV G T+ HR + ++E+Y V +G G++++G
Sbjct 15 YRTKDGSEIRELMHPDVHGNRQQSLAEATVPPGTRTLLHRHRLSEELYHVTAGHGVMTLG 74
Query 120 DESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
+ + GD V I G + G PL LCAC PAY
Sbjct 75 ERRFLIAVGDTVHIAPGTAHALENSGDQPLVVLCACSPAY 114
>gi|71907533|ref|YP_285120.1| cupin region [Dechloromonas aromatica RCB]
gi|71847154|gb|AAZ46650.1| Cupin region [Dechloromonas aromatica RCB]
Length=120
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/101 (39%), Positives = 52/101 (52%), Gaps = 0/101 (0%)
Query 59 PFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSV 118
P++ D SEIR L+ V SLA A V G T H+ T+EIY V G GL+++
Sbjct 12 PYITKDGSEIRELLHPNLHAVRHQSLAEAVVPPGTATQLHKHGVTEEIYHVTKGSGLMTL 71
Query 119 GDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
G +S + GD++ I G P + G L LC C PAY
Sbjct 72 GGDSFVIAVGDSIAIAPGTPHCVENTGPEALHILCCCAPAY 112
>gi|149177636|ref|ZP_01856237.1| hypothetical protein PM8797T_27272 [Planctomyces maris DSM 8797]
gi|148843454|gb|EDL57816.1| hypothetical protein PM8797T_27272 [Planctomyces maris DSM 8797]
Length=123
Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 39/81 (49%), Positives = 45/81 (56%), Gaps = 1/81 (1%)
Query 81 SVSLAHATVAAGAETVWHRLQATDEIYFVLSGRGLVSVGDE-SGEVGPGDAVWIPAGVPQ 139
VSLA A V G +T +HRL+ T E Y +LSG GLV VGD EV PGD V IP Q
Sbjct 32 KVSLARARVEPGKKTRFHRLKGTFERYIMLSGTGLVEVGDYPPTEVYPGDVVRIPPDTDQ 91
Query 140 KIRALGSVPLTFLCACGPAYL 160
I +G L F C P +L
Sbjct 92 SITNIGEDDLVFFVVCNPHFL 112
>gi|15678380|ref|NP_275495.1| hypothetical protein MTH352 [Methanothermobacter thermautotrophicus
str. Delta H]
gi|2621410|gb|AAB84858.1| conserved protein [Methanothermobacter thermautotrophicus str.
Delta H]
Length=131
Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 39/110 (36%), Positives = 58/110 (53%), Gaps = 1/110 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGV-ASVSLAHATVAAGAETVWHRLQATDEIY 107
M + N RE + F A D + + L+ G+ SLAHA + G + HRL+ + E+Y
Sbjct 1 MNIRNIRECDYFRAADGTLLCELLHPDNEGLDMGFSLAHAILRRGEASKPHRLRESVEVY 60
Query 108 FVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGP 157
+++ G + + DES V GDA+ IP+G Q I G L+FLC P
Sbjct 61 YIMEGEATMHIDDESFTVKEGDAIHIPSGAVQYIENTGKSELSFLCIVSP 110
>gi|218782538|ref|YP_002433856.1| cupin [Desulfatibacillum alkenivorans AK-01]
gi|218763922|gb|ACL06388.1| Cupin 2 conserved barrel domain protein [Desulfatibacillum alkenivorans
AK-01]
Length=123
Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/111 (35%), Positives = 54/111 (49%), Gaps = 0/111 (0%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQATDEIYF 108
M + P+V D + IR L+ A G SLA A +A G + H +AT+EIY
Sbjct 1 MIKTEYDHIAPYVTKDQTLIRELLHPAVHGEGRTSLAEAVLAPGLVSELHLHEATEEIYH 60
Query 109 VLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAY 159
G G++++G E +V G V IP G P ++ G + LC C P Y
Sbjct 61 YTQGSGVMTLGVEKLDVRRGSTVLIPPGTPHQVENTGDGDMRILCICSPPY 111
>gi|302036369|ref|YP_003796691.1| hypothetical protein NIDE1004 [Candidatus Nitrospira defluvii]
gi|300604433|emb|CBK40765.1| conserved protein of unknown function, RmlC-type Cupin [Candidatus
Nitrospira defluvii]
Length=120
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/122 (35%), Positives = 71/122 (59%), Gaps = 3/122 (2%)
Query 49 MFVSNAREAEPFVAPDLSEIRVLVDRATVGVA-SVSLAHATVAAGAETVWHRLQATDEIY 107
M ++ + F+A D + +R L+ A +A SLAH + G +++WHRLQ++ E+Y
Sbjct 1 MVNTHLQRCPEFLAGDHTRLRELLHPAKASLALGYSLAHGLLDPGQQSLWHRLQSS-EVY 59
Query 108 FVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYLPERDQRM 167
+ + GRG++ V +ES V G +++P G Q + G+ P+ FLC PA+ E D+ +
Sbjct 60 YFIGGRGIMKVEEESVVVEAGSVIYVPPGAKQSLVNNGTDPIEFLCLVDPAWRAE-DEAV 118
Query 168 GE 169
GE
Sbjct 119 GE 120
Lambda K H
0.318 0.132 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 152280848256
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40