BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2293c
Length=246
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609430|ref|NP_216809.1| hypothetical protein Rv2293c [Mycob... 489 1e-136
gi|306780325|ref|ZP_07418662.1| hypothetical protein TMBG_00838 ... 488 3e-136
gi|323719194|gb|EGB28339.1| hypothetical protein TMMG_01574 [Myc... 469 1e-130
gi|289758415|ref|ZP_06517793.1| LOW QUALITY PROTEIN: conserved h... 442 2e-122
gi|31793471|ref|NP_855964.1| hypothetical protein Mb2315c [Mycob... 442 3e-122
gi|289745565|ref|ZP_06504943.1| conserved hypothetical protein [... 421 4e-116
gi|240173118|ref|ZP_04751776.1| hypothetical protein MkanA1_2764... 318 3e-85
gi|296166267|ref|ZP_06848705.1| conserved hypothetical protein [... 207 1e-51
gi|16126505|ref|NP_421069.1| phosphorylase family protein [Caulo... 63.2 3e-08
gi|295690288|ref|YP_003593981.1| MTA/SAH nucleosidase [Caulobact... 60.5 2e-07
gi|340789394|ref|YP_004754859.1| menaquinone via futalosine step... 58.5 8e-07
gi|225449647|ref|XP_002262699.1| PREDICTED: hypothetical protein... 56.6 3e-06
gi|308159921|gb|EFO62436.1| 5'-methylthioadenosine nucleosidase,... 55.8 5e-06
gi|148906549|gb|ABR16427.1| unknown [Picea sitchensis] 55.1 9e-06
gi|326388005|ref|ZP_08209609.1| Adenosylhomocysteine nucleosidas... 55.1 1e-05
gi|319793906|ref|YP_004155546.1| mta/sah nucleosidase [Variovora... 54.3 1e-05
gi|332284458|ref|YP_004416369.1| methylthioadenosine nucleosidas... 54.3 2e-05
gi|239815640|ref|YP_002944550.1| adenosylhomocysteine nucleosida... 53.5 3e-05
gi|18416327|ref|NP_567699.1| phosphorylase family protein [Arabi... 53.1 4e-05
gi|21617914|gb|AAM66964.1| unknown [Arabidopsis thaliana] 53.1 4e-05
gi|225429975|ref|XP_002283985.1| PREDICTED: hypothetical protein... 53.1 4e-05
gi|338971870|ref|ZP_08627249.1| 5'-methylthioadenosine nucleosid... 52.8 4e-05
gi|296081863|emb|CBI20868.3| unnamed protein product [Vitis vini... 52.8 4e-05
gi|315498952|ref|YP_004087756.1| mta/sah nucleosidase [Asticcaca... 52.4 6e-05
gi|329849780|ref|ZP_08264626.1| MTA/SAH nucleosidase [Asticcacau... 51.6 1e-04
gi|255645394|gb|ACU23193.1| unknown [Glycine max] 51.6 1e-04
gi|253748444|gb|EET02552.1| 5'-methylthioadenosine nucleosidase,... 51.6 1e-04
gi|50086149|ref|YP_047659.1| phosphorylase family protein [Acine... 50.8 2e-04
gi|255550998|ref|XP_002516547.1| mta/sah nucleosidase, putative ... 50.4 2e-04
gi|121611340|ref|YP_999147.1| methylthioadenosine nucleosidase [... 50.4 2e-04
gi|242051491|ref|XP_002454891.1| hypothetical protein SORBIDRAFT... 50.1 3e-04
gi|326503550|dbj|BAJ86281.1| predicted protein [Hordeum vulgare ... 48.9 7e-04
gi|225445182|ref|XP_002280745.1| PREDICTED: hypothetical protein... 48.9 7e-04
gi|297803678|ref|XP_002869723.1| hypothetical protein ARALYDRAFT... 48.9 7e-04
gi|159117992|ref|XP_001709215.1| 5'-methylthioadenosine nucleosi... 48.9 7e-04
gi|297738794|emb|CBI28039.3| unnamed protein product [Vitis vini... 48.9 7e-04
gi|218196319|gb|EEC78746.1| hypothetical protein OsI_18961 [Oryz... 48.5 8e-04
gi|115462741|ref|NP_001054970.1| Os05g0228000 [Oryza sativa Japo... 48.5 8e-04
gi|115435464|ref|NP_001042490.1| Os01g0229800 [Oryza sativa Japo... 48.5 8e-04
gi|125525025|gb|EAY73139.1| hypothetical protein OsI_01011 [Oryz... 48.5 9e-04
gi|255586615|ref|XP_002533940.1| mta/sah nucleosidase, putative ... 48.5 9e-04
gi|147852653|emb|CAN83797.1| hypothetical protein VITISV_002973 ... 48.1 0.001
gi|145348229|ref|NP_194623.2| Phosphorylase-like protein protein... 48.1 0.001
gi|224119752|ref|XP_002331152.1| predicted protein [Populus tric... 48.1 0.001
gi|242051489|ref|XP_002454890.1| hypothetical protein SORBIDRAFT... 47.8 0.001
gi|226503841|ref|NP_001148584.1| LOC100282200 [Zea mays] >gi|195... 47.8 0.001
gi|302804997|ref|XP_002984250.1| hypothetical protein SELMODRAFT... 47.8 0.001
gi|302781292|ref|XP_002972420.1| hypothetical protein SELMODRAFT... 47.8 0.002
gi|109896877|ref|YP_660132.1| purine phosphorylases family prote... 47.4 0.002
gi|297803158|ref|XP_002869463.1| hypothetical protein ARALYDRAFT... 47.0 0.002
>gi|15609430|ref|NP_216809.1| hypothetical protein Rv2293c [Mycobacterium tuberculosis H37Rv]
gi|15841784|ref|NP_336821.1| hypothetical protein MT2350 [Mycobacterium tuberculosis CDC1551]
gi|148662115|ref|YP_001283638.1| hypothetical protein MRA_2310 [Mycobacterium tuberculosis H37Ra]
41 more sequence titles
Length=246
Score = 489 bits (1260), Expect = 1e-136, Method: Compositional matrix adjust.
Identities = 245/246 (99%), Positives = 246/246 (100%), Gaps = 0/246 (0%)
Query 1 VGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
+GAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA
Sbjct 1 MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
Query 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV
Sbjct 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
Query 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV
Sbjct 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
Query 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS
Sbjct 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
Query 241 RRRDPG 246
RRRDPG
Sbjct 241 RRRDPG 246
>gi|306780325|ref|ZP_07418662.1| hypothetical protein TMBG_00838 [Mycobacterium tuberculosis SUMu002]
gi|306793760|ref|ZP_07432062.1| hypothetical protein TMEG_02658 [Mycobacterium tuberculosis SUMu005]
gi|308326765|gb|EFP15616.1| hypothetical protein TMBG_00838 [Mycobacterium tuberculosis SUMu002]
gi|308337922|gb|EFP26773.1| hypothetical protein TMEG_02658 [Mycobacterium tuberculosis SUMu005]
Length=246
Score = 488 bits (1257), Expect = 3e-136, Method: Compositional matrix adjust.
Identities = 244/246 (99%), Positives = 246/246 (100%), Gaps = 0/246 (0%)
Query 1 VGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
+GAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAV+AHTALDA
Sbjct 1 MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVVAHTALDA 60
Query 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV
Sbjct 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
Query 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV
Sbjct 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
Query 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS
Sbjct 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
Query 241 RRRDPG 246
RRRDPG
Sbjct 241 RRRDPG 246
>gi|323719194|gb|EGB28339.1| hypothetical protein TMMG_01574 [Mycobacterium tuberculosis CDC1551A]
Length=236
Score = 469 bits (1208), Expect = 1e-130, Method: Compositional matrix adjust.
Identities = 235/236 (99%), Positives = 236/236 (100%), Gaps = 0/236 (0%)
Query 11 VAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR 70
+AAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR
Sbjct 1 MAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR 60
Query 71 YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG 130
YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG
Sbjct 61 YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG 120
Query 131 DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR 190
DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR
Sbjct 121 DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR 180
Query 191 QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFSRRRDPG 246
QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFSRRRDPG
Sbjct 181 QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFSRRRDPG 236
>gi|289758415|ref|ZP_06517793.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T85]
gi|340627297|ref|YP_004745749.1| hypothetical protein MCAN_23151 [Mycobacterium canettii CIPT
140010059]
gi|289713979|gb|EFD77991.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T85]
gi|326903905|gb|EGE50838.1| LOW QUALITY PROTEIN: hypothetical protein TBPG_01791 [Mycobacterium
tuberculosis W-148]
gi|340005487|emb|CCC44647.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=247
Score = 442 bits (1137), Expect = 2e-122, Method: Compositional matrix adjust.
Identities = 225/245 (92%), Positives = 227/245 (93%), Gaps = 0/245 (0%)
Query 1 VGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
+GAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA
Sbjct 1 MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
Query 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV
Sbjct 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
Query 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV
Sbjct 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
Query 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVF P P L F
Sbjct 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFGCQPCSAPDRSLGYTGNFF 240
Query 241 RRRDP 245
+ P
Sbjct 241 QAAGP 245
>gi|31793471|ref|NP_855964.1| hypothetical protein Mb2315c [Mycobacterium bovis AF2122/97]
gi|121638174|ref|YP_978398.1| hypothetical protein BCG_2309c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990668|ref|YP_002645355.1| hypothetical protein JTY_2303 [Mycobacterium bovis BCG str. Tokyo
172]
17 more sequence titles
Length=334
Score = 442 bits (1136), Expect = 3e-122, Method: Compositional matrix adjust.
Identities = 225/245 (92%), Positives = 227/245 (93%), Gaps = 0/245 (0%)
Query 1 VGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
+GAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA
Sbjct 1 MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
Query 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV
Sbjct 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
Query 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV
Sbjct 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
Query 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFS 240
PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVF P P L F
Sbjct 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFGCQPCSAPDRSLGYTGNFF 240
Query 241 RRRDP 245
+ P
Sbjct 241 QAAGP 245
>gi|289745565|ref|ZP_06504943.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|298525778|ref|ZP_07013187.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|289686093|gb|EFD53581.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|298495572|gb|EFI30866.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|339295199|gb|AEJ47310.1| hypothetical protein CCDC5079_2120 [Mycobacterium tuberculosis
CCDC5079]
gi|339298819|gb|AEJ50929.1| hypothetical protein CCDC5180_2092 [Mycobacterium tuberculosis
CCDC5180]
Length=324
Score = 421 bits (1083), Expect = 4e-116, Method: Compositional matrix adjust.
Identities = 215/235 (92%), Positives = 217/235 (93%), Gaps = 0/235 (0%)
Query 11 VAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR 70
+AAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR
Sbjct 1 MAAALSLGCGVAAADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRR 60
Query 71 YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG 130
YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG
Sbjct 61 YYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIG 120
Query 131 DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR 190
DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR
Sbjct 121 DVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGR 180
Query 191 QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHPIAHLAIPVTFSRRRDP 245
QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVF P P L F + P
Sbjct 181 QPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFGCQPCSAPDRSLGYTGNFFQAAGP 235
>gi|240173118|ref|ZP_04751776.1| hypothetical protein MkanA1_27646 [Mycobacterium kansasii ATCC
12478]
Length=337
Score = 318 bits (816), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 164/229 (72%), Positives = 183/229 (80%), Gaps = 4/229 (1%)
Query 5 LRHCLLVAAALSLGCGVAA----ADPGYVANVIPCEQRTLVLSAFPAEADAVLAHTALDA 60
LR + AALSLG + AD G A V PC+QRTL+LSAFPAEADAVLAHT LDA
Sbjct 5 LRAACALVAALSLGATAVSSNTRADAGGFATVSPCQQRTLILSAFPAEADAVLAHTTLDA 64
Query 61 NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGV 120
NPVVVA R +YLG+I+GK+VIVAMTGIGLVNAT+TTE A RFTC+S IA+ AV+FSGV
Sbjct 65 NPVVVAHHRHFYLGAIAGKRVIVAMTGIGLVNATHTTEIALDRFTCSSGIAVGAVVFSGV 124
Query 121 AGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTLSVVLDNINTLGNPVCLCRNV 180
AGGAGRT+IGDVAIPARWTLDNGATF+ VDPGMLA AQTL+V L ++N LGNP CLC++V
Sbjct 125 AGGAGRTAIGDVAIPARWTLDNGATFQAVDPGMLAAAQTLAVSLGSVNNLGNPACLCQHV 184
Query 181 PVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGGSVFAANPVVHP 229
P V LN LGRQ QL VGGDGSSSD NNGQAFPCIPNGG VF P P
Sbjct 185 PTVALNSLGRQTQLVVGGDGSSSDNNNGQAFPCIPNGGDVFGCQPCSAP 233
>gi|296166267|ref|ZP_06848705.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898362|gb|EFG77930.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=291
Score = 207 bits (527), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/194 (63%), Positives = 137/194 (71%), Gaps = 21/194 (10%)
Query 36 EQRTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATN 95
E+RTLVLSAFP EADAVL+HTALD +PVV+A R +YLG+I+GKKVIVAMTGIGLVNAT+
Sbjct 14 ERRTLVLSAFPGEADAVLSHTALDPDPVVIAGGRHFYLGAIAGKKVIVAMTGIGLVNATD 73
Query 96 TTETAFARFTCASSIAIAAVMFSGVAGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLA 155
TT A ARF I AV+FSGVAGGAGRT I DVA+PARWTLD+G TFR VDP ML
Sbjct 74 TTVVALARF------GIGAVVFSGVAGGAGRTRIADVAVPARWTLDHGRTFRPVDPDMLG 127
Query 156 TAQTLSVVLDNINTLGNPVCLCRNVPVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIP 215
A LSV L++ VVR L R+P++ VGGDG S D NNG AFPCIP
Sbjct 128 EAGKLSVALES---------------VVRRIDLRRRPRVVVGGDGCSWDNNNGVAFPCIP 172
Query 216 NGGSVFAANPVVHP 229
NGG VF P P
Sbjct 173 NGGDVFGCQPRTAP 186
>gi|16126505|ref|NP_421069.1| phosphorylase family protein [Caulobacter crescentus CB15]
gi|221235285|ref|YP_002517722.1| bark storage protein A [Caulobacter crescentus NA1000]
gi|13423779|gb|AAK24237.1| phosphorylase family protein [Caulobacter crescentus CB15]
gi|220964458|gb|ACL95814.1| bark storage protein A precursor [Caulobacter crescentus NA1000]
Length=299
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/131 (34%), Positives = 65/131 (50%), Gaps = 10/131 (7%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
R V+SAFP E A+ A TA + R+ G + GK V+V ++G+ +VNA TT
Sbjct 28 RVAVISAFPPEIGALNAATA--EQKAYEVNGVRFMTGKLEGKPVVVFLSGVSMVNAAMTT 85
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARWTLDNGATF-RGVDPGMLA 155
+ A RF I ++FSG+AGG + IGDV + +W + + F R D G
Sbjct 86 QMALERFN------ITRIVFSGIAGGVDESLDIGDVVVADQWAQNLESAFARETDKGFEV 139
Query 156 TAQTLSVVLDN 166
+ + L N
Sbjct 140 SPSIRTTTLAN 150
>gi|295690288|ref|YP_003593981.1| MTA/SAH nucleosidase [Caulobacter segnis ATCC 21756]
gi|295432191|gb|ADG11363.1| MTA/SAH nucleosidase [Caulobacter segnis ATCC 21756]
Length=297
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/135 (34%), Positives = 66/135 (49%), Gaps = 18/135 (13%)
Query 38 RTLVLSAFPAEADAVLAHT----ALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNA 93
R V+SAFP E A+ A T D + V ++ G + GK V+V ++G+ +VNA
Sbjct 28 RVAVISAFPPEIGALNAATQDQKTFDVHGV------KFMTGQMEGKPVVVFLSGVSMVNA 81
Query 94 TNTTETAFARFTCASSIAIAAVMFSGVAGGAGR-TSIGDVAIPARWTLDNGATF-RGVDP 151
TT+ A RF I ++FSG+AGG IGDV + RW + + F R D
Sbjct 82 AMTTQMAIDRFN------ITRIVFSGIAGGVDEGLDIGDVVVADRWAQNLESAFARETDK 135
Query 152 GMLATAQTLSVVLDN 166
G ++ + L N
Sbjct 136 GFEVSSSIRNSTLPN 150
>gi|340789394|ref|YP_004754859.1| menaquinone via futalosine step 2 [Collimonas fungivorans Ter331]
gi|340554661|gb|AEK64036.1| Menaquinone via futalosine step 2 [Collimonas fungivorans Ter331]
Length=352
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/107 (31%), Positives = 56/107 (53%), Gaps = 9/107 (8%)
Query 35 CEQRTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNAT 94
C+ R ++SAF AEAD +L T + + R+ G + G +V++ ++G+ +VNAT
Sbjct 47 CKPRIGIVSAFGAEADILLEQTQKKREWKINGN--RFTTGELRGNQVVIVLSGVSVVNAT 104
Query 95 NTTETAFARFTCASSIAIAAVMFSGVAGGAGRTS-IGDVAIPARWTL 140
T+ F + ++ SG+AGG S +GDV +P W +
Sbjct 105 MVTQLMMDHFR------VERLLMSGIAGGVDPASHVGDVTVPESWAM 145
>gi|225449647|ref|XP_002262699.1| PREDICTED: hypothetical protein [Vitis vinifera]
gi|296090373|emb|CBI40192.3| unnamed protein product [Vitis vinifera]
Length=340
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/102 (35%), Positives = 52/102 (51%), Gaps = 10/102 (9%)
Query 41 VLSAFPAEADAVLAHTALDANP---VVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
+++ FP E DA A + NP V RR+ +G + G+KVI G+G+VNA T
Sbjct 50 LITVFPPEEDAFFTTGAFEINPHHPFVDLSGRRFRIGKVHGRKVIYVRCGVGMVNAAAAT 109
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
+ F I ++ G+AG A + SIGDV IP ++
Sbjct 110 QQMLDLFD------IRGIVHFGIAGNANNSMSIGDVTIPKQF 145
>gi|308159921|gb|EFO62436.1| 5'-methylthioadenosine nucleosidase, S-adenosylhomocysteine nucleosidase
[Giardia lamblia P15]
Length=298
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 53/165 (33%), Positives = 80/165 (49%), Gaps = 33/165 (20%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGG-AGRT 127
R +Y+G ISG+ V+++ +GIG V A +T + F+C+ ++F GVAGG GR
Sbjct 38 RTFYIGKISGQTVVISKSGIGKVAAASTAAIMISVFSCSE------IIFLGVAGGIQGRA 91
Query 128 SIGDVAIPA---------RWTLDNGATFRG------VDPGMLATAQ-TLSVVLDNINTLG 171
+IGD+ + A R ++ F D + AQ + VL + TL
Sbjct 92 AIGDIVVSAAAIQHDFDGRPWVERSVVFSVGTCEIPADERLQTRAQAAVQTVLADDMTL- 150
Query 172 NPVCLCRNVPVVR--LNHLGRQPQLFVG----GDG-SSSDKNNGQ 209
V +VP+ R L LGR P+L +G GD SSD+ N +
Sbjct 151 --VLSDDSVPIGRRILAQLGRSPKLLIGSILSGDQFVSSDEMNKE 193
>gi|148906549|gb|ABR16427.1| unknown [Picea sitchensis]
Length=346
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/92 (32%), Positives = 51/92 (56%), Gaps = 7/92 (7%)
Query 48 EADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCA 107
E D +L ++ + + + RR+++GSI +VI+ M+G+G++N+ TT+ + F
Sbjct 59 EMDPLLQSSSFNVSKTLDVAGRRFHIGSIHKHRVIIVMSGLGMLNSGITTQLLLSFF--- 115
Query 108 SSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
+ V+ G+AG A + IGDV IP W
Sbjct 116 ---EVKGVVHYGIAGNANSSLHIGDVTIPRYW 144
>gi|326388005|ref|ZP_08209609.1| Adenosylhomocysteine nucleosidase [Novosphingobium nitrogenifigens
DSM 19370]
gi|326207506|gb|EGD58319.1| Adenosylhomocysteine nucleosidase [Novosphingobium nitrogenifigens
DSM 19370]
Length=312
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/117 (34%), Positives = 62/117 (53%), Gaps = 23/117 (19%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYL-------GSISGKKVIVAMTGIGL 90
RT+V++AF E + AL+ + V DRR + + G ++GK V++ +G+
Sbjct 32 RTVVMTAFSPE------YAALEGS---VEDRRSWTINGMTLLTGKLAGKPVVLMQSGVSE 82
Query 91 VNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARWTLDNGATF 146
VNA T+ RF A+ ++FSG+AGG + SIGDV +P +W A+F
Sbjct 83 VNAAMNTQMVVDRF------AVRRIVFSGIAGGVDPSLSIGDVVVPEQWGQYLEASF 133
>gi|319793906|ref|YP_004155546.1| mta/sah nucleosidase [Variovorax paradoxus EPS]
gi|315596369|gb|ADU37435.1| MTA/SAH nucleosidase [Variovorax paradoxus EPS]
Length=316
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/102 (35%), Positives = 54/102 (53%), Gaps = 9/102 (8%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
R V+SAF E +LA A V + + G++ GK V++ ++GI + NAT T
Sbjct 44 RIAVISAFEPELKLLLARVQGPAKHSV--NGVEFTTGTLEGKPVVLFLSGISMTNATMNT 101
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
+ RF S ++FSG+AGG + +GDV +PA+W
Sbjct 102 QLVLNRFRVTS------IVFSGIAGGVNPSLHVGDVTVPAQW 137
>gi|332284458|ref|YP_004416369.1| methylthioadenosine nucleosidase [Pusillimonas sp. T7-7]
gi|330428411|gb|AEC19745.1| methylthioadenosine nucleosidase [Pusillimonas sp. T7-7]
Length=311
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/102 (36%), Positives = 52/102 (51%), Gaps = 9/102 (8%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
R V+SAF E +L T D + + G + GK V++ ++GI + NA TT
Sbjct 36 RIAVISAFEPELSLLLDKT--DQTRQFSVNGITFTTGVLQGKPVVLFLSGISMTNAAMTT 93
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAG-RTSIGDVAIPARW 138
+ A RF I+ ++FSG+AGG IGDV + ARW
Sbjct 94 QLALDRFN------ISHIVFSGIAGGVNPDLHIGDVVVAARW 129
>gi|239815640|ref|YP_002944550.1| adenosylhomocysteine nucleosidase [Variovorax paradoxus S110]
gi|239802217|gb|ACS19284.1| Adenosylhomocysteine nucleosidase [Variovorax paradoxus S110]
Length=314
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/138 (30%), Positives = 64/138 (47%), Gaps = 14/138 (10%)
Query 7 HCLLVAAALSLG---CGVAAADPGYVANVIPCEQ--RTLVLSAFPAEADAVLAHTALDAN 61
LL A L+ G C PG + + R V+SAF E +L A
Sbjct 6 RSLLAAGVLAFGLAGCAGVYKSPGAADGSVRLDDTPRIAVVSAFQPELTLLLNRLQQPAK 65
Query 62 PVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVA 121
V + + G++ GK V++ ++GI + NAT T+ RF + ++FSG+A
Sbjct 66 HSV--NGVEFTTGTLEGKPVVLFLSGISMTNATMNTQLVLDRFR------VTHIVFSGIA 117
Query 122 GGAG-RTSIGDVAIPARW 138
GG + +GDV +PA+W
Sbjct 118 GGVNPQLHVGDVTVPAQW 135
>gi|18416327|ref|NP_567699.1| phosphorylase family protein [Arabidopsis thaliana]
gi|26453148|dbj|BAC43650.1| unknown protein [Arabidopsis thaliana]
gi|28950909|gb|AAO63378.1| At4g24340 [Arabidopsis thaliana]
gi|332659491|gb|AEE84891.1| phosphorylase family protein [Arabidopsis thaliana]
Length=338
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/116 (33%), Positives = 59/116 (51%), Gaps = 12/116 (10%)
Query 41 VLSAFPAEADAVLAHT--ALD-ANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
+++ F E DA L LD +P + RR+ +G I GKKV+ GIG+VNA T
Sbjct 55 LVTVFETEEDAFLGSVDFRLDPTHPFLDLSGRRFRIGKIHGKKVVYVRCGIGMVNAAAAT 114
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARWTLDNGATFRGVDPG 152
+ F+ + ++ G+AG + SIGDV+IP + T N + ++PG
Sbjct 115 QQMIDVFS------VKGIVHFGIAGNINNSMSIGDVSIPKQIT--NAGLWDWLNPG 162
>gi|21617914|gb|AAM66964.1| unknown [Arabidopsis thaliana]
Length=338
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/116 (33%), Positives = 59/116 (51%), Gaps = 12/116 (10%)
Query 41 VLSAFPAEADAVLAHT--ALD-ANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
+++ F E DA L LD +P + RR+ +G I GKKV+ GIG+VNA T
Sbjct 55 LVTVFETEEDAFLGSVDFRLDPTHPFLDLSGRRFRIGKIHGKKVVYVRCGIGMVNAAAAT 114
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARWTLDNGATFRGVDPG 152
+ F+ + ++ G+AG + SIGDV+IP + T N + ++PG
Sbjct 115 QQMIDVFS------VKGIVHFGIAGNINNSMSIGDVSIPKQIT--NAGLWDWLNPG 162
>gi|225429975|ref|XP_002283985.1| PREDICTED: hypothetical protein [Vitis vinifera]
Length=355
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/71 (41%), Positives = 41/71 (58%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ G+I+ KKVI+ MTG+ ++NA TT+ + F I V+ G+AG A
Sbjct 76 RRFRFGTIADKKVILVMTGLSMINAGITTQLLLSTFD------IEGVVHYGIAGNANPSL 129
Query 128 SIGDVAIPARW 138
+IGDV IP W
Sbjct 130 NIGDVTIPQYW 140
>gi|338971870|ref|ZP_08627249.1| 5'-methylthioadenosine nucleosidase/ S-adenosylhomocysteine nucleosidase
[Bradyrhizobiaceae bacterium SG-6C]
gi|338234764|gb|EGP09875.1| 5'-methylthioadenosine nucleosidase/ S-adenosylhomocysteine nucleosidase
[Bradyrhizobiaceae bacterium SG-6C]
Length=303
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/105 (34%), Positives = 57/105 (55%), Gaps = 15/105 (14%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRR---YYLGSISGKKVIVAMTGIGLVNAT 94
R + SAF E A+LA + PV + R + + G + G V++ ++G+ +VNA
Sbjct 31 RVALFSAFEPEWKALLA---IVEQPV--SHREKGVDFVTGRVEGHDVVLVLSGVSMVNAA 85
Query 95 NTTETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
TT+ A RF + A++FSG+AGG + +IGDV + +W
Sbjct 86 MTTQMAVDRFN------LRAILFSGIAGGVDPSLNIGDVVVSQQW 124
>gi|296081863|emb|CBI20868.3| unnamed protein product [Vitis vinifera]
Length=322
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/71 (41%), Positives = 41/71 (58%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ G+I+ KKVI+ MTG+ ++NA TT+ + F I V+ G+AG A
Sbjct 43 RRFRFGTIADKKVILVMTGLSMINAGITTQLLLSTFD------IEGVVHYGIAGNANPSL 96
Query 128 SIGDVAIPARW 138
+IGDV IP W
Sbjct 97 NIGDVTIPQYW 107
>gi|315498952|ref|YP_004087756.1| mta/sah nucleosidase [Asticcacaulis excentricus CB 48]
gi|315416964|gb|ADU13605.1| MTA/SAH nucleosidase [Asticcacaulis excentricus CB 48]
Length=306
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 86/200 (43%), Gaps = 42/200 (21%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
R V+SAF E A++ DA + + ++ G++ K V++ M+ + +VNA T
Sbjct 31 RIGVISAFEPEWQALIGRLK-DAQSYSL-NGMKFVTGTLEDKPVVLTMSSMSMVNAAMNT 88
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW-----TLDNGATFRG-VD 150
+ RF + ++FSG+AGG + IGDV PARW T+ T +G V
Sbjct 89 QLLIDRFH------VERIVFSGIAGGIDPSLKIGDVVAPARWAQSLETIMGRQTAKGFVK 142
Query 151 PGMLATA----QTLSVVLDNINTLG--------------NPVCLC--RNVPVVRLNH--- 187
P L A + +++ N +G +P L + L H
Sbjct 143 PDWLTWAPDGMEAYGMIIPNSVVVGSAKHAPSPKVWFEADPAMLAVAAKLEQTELTHCTA 202
Query 188 ----LGRQPQLFVGGDGSSS 203
L +PQL VGGD SS
Sbjct 203 DGRCLDHRPQLHVGGDAVSS 222
>gi|329849780|ref|ZP_08264626.1| MTA/SAH nucleosidase [Asticcacaulis biprosthecum C19]
gi|328841691|gb|EGF91261.1| MTA/SAH nucleosidase [Asticcacaulis biprosthecum C19]
Length=297
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/203 (26%), Positives = 87/203 (43%), Gaps = 37/203 (18%)
Query 30 ANVIPCEQ-RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGI 88
A +P ++ RT ++ AF E DA+L H ++A D Y G I + V++A++G+
Sbjct 19 AKAMPWDKPRTAIVLAFDPEWDALLPH--IEAAKTETHDGVTYVTGRIGDEPVVMALSGV 76
Query 89 GLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW-----TLDN 142
+VNA T+ + ++ ++ SG+AGG + +IGD+ + W TL
Sbjct 77 SMVNAAMNTQRLLGHYK------VSRIVMSGIAGGVDPSLNIGDIVVADAWSQPMETLTA 130
Query 143 GATFRGVDP-------------GMLATAQTLSVVLDNINTLGNP--VCLCRNVPVVRLNH 187
T G P GM Q +D N +P + L + + L
Sbjct 131 RETREGYIPPAWLWGMSAKPNFGMFLPRQVRIGTIDYDNFPADPALLALAHKLEPMPLKQ 190
Query 188 -------LGRQPQLFVGGDGSSS 203
L P++ VGG+G S+
Sbjct 191 CVSDTLCLDHAPKVVVGGEGVSA 213
>gi|255645394|gb|ACU23193.1| unknown [Glycine max]
Length=366
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/71 (41%), Positives = 41/71 (58%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRT- 127
RR+ G+I K VI+ MTG+ ++NA TT+ + FT + V+ G+AG A +
Sbjct 89 RRFRFGAIGDKPVILVMTGLSVINAAITTQLLLSLFT------VEGVVHYGIAGNANPSL 142
Query 128 SIGDVAIPARW 138
IGDVAIP W
Sbjct 143 HIGDVAIPQYW 153
>gi|253748444|gb|EET02552.1| 5'-methylthioadenosine nucleosidase, S-adenosylhomocysteine nucleosidase
[Giardia intestinalis ATCC 50581]
Length=263
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/173 (31%), Positives = 76/173 (44%), Gaps = 49/173 (28%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGG-AGRT 127
R +Y+G I+G+ V++A +GIG V A +T + F C+ ++F GVAGG GR
Sbjct 38 RTFYIGEINGQTVVIARSGIGKVAAASTATVMISVFRCSE------IVFLGVAGGIQGRA 91
Query 128 SIGDVAIPA---------RWTLDNGATFRG------VDPGMLATAQT---------LSVV 163
IGD+ I R ++ F D + + A T L++V
Sbjct 92 EIGDIVISTAAVQHDFDGRPWVERSVVFSVGKCEIPADEKLQSQAHTAVQAVLTDDLALV 151
Query 164 LDNINTLGNPVCLCRNVPVVR--LNHLGRQPQLFVG----GDGS-SSDKNNGQ 209
LD+ + PV R L LGR P+L G GD SSD+ N +
Sbjct 152 LDD-----------NSEPVGRQILTKLGRSPKLLTGSILSGDQFISSDETNRE 193
>gi|50086149|ref|YP_047659.1| phosphorylase family protein [Acinetobacter sp. ADP1]
gi|49532125|emb|CAG69837.1| putative phosphorylase family protein [Acinetobacter sp. ADP1]
Length=349
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/102 (29%), Positives = 53/102 (52%), Gaps = 9/102 (8%)
Query 38 RTLVLSAFPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
R +++AF EAD ++ T D ++ + R++ G++ G +V++ ++GI + NA TT
Sbjct 48 RLAIIAAFGQEADLLIESTK-DKKEYII-NGRKFVTGTLEGAEVVITLSGISMTNAAMTT 105
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAG-RTSIGDVAIPARW 138
+ F + ++FSG+AG +GDV I W
Sbjct 106 QMITDHFN------LKGIVFSGIAGSLNPDLHVGDVVIAKSW 141
>gi|255550998|ref|XP_002516547.1| mta/sah nucleosidase, putative [Ricinus communis]
gi|223544367|gb|EEF45888.1| mta/sah nucleosidase, putative [Ricinus communis]
Length=349
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/95 (36%), Positives = 48/95 (51%), Gaps = 8/95 (8%)
Query 46 PAEADAVLAHTALDA-NPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
P E + +L A N + RR+ G+I +KVI+ MTG+ L+NA TT+ + F
Sbjct 52 PYEMNPLLQSPNFTATNLTIDFSGRRFRFGTIGERKVILVMTGLSLINAGITTQLLISFF 111
Query 105 TCASSIAIAAVMFSGVAGGAG-RTSIGDVAIPARW 138
I V+ G+AG A +IGDV IP W
Sbjct 112 N------IEGVVHYGIAGNANPSLNIGDVTIPRYW 140
>gi|121611340|ref|YP_999147.1| methylthioadenosine nucleosidase [Verminephrobacter eiseniae
EF01-2]
gi|121555980|gb|ABM60129.1| methylthioadenosine nucleosidase [Verminephrobacter eiseniae
EF01-2]
Length=323
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/70 (38%), Positives = 41/70 (59%), Gaps = 7/70 (10%)
Query 70 RYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGR-TS 128
+ G ++GK V++ ++GI + NA T+ RF I+ +MFSG+AGG
Sbjct 79 EFSTGLLAGKPVVLFLSGISMTNAAMNTQRVLDRFD------ISHIMFSGIAGGVNPGLH 132
Query 129 IGDVAIPARW 138
IGDV++PA+W
Sbjct 133 IGDVSVPAQW 142
>gi|242051491|ref|XP_002454891.1| hypothetical protein SORBIDRAFT_03g000860 [Sorghum bicolor]
gi|241926866|gb|EES00011.1| hypothetical protein SORBIDRAFT_03g000860 [Sorghum bicolor]
Length=360
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/78 (34%), Positives = 42/78 (54%), Gaps = 7/78 (8%)
Query 62 PVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVA 121
P++ RR+ G+I G+ V++ MTG+ ++NA TT+ + F + ++ G+A
Sbjct 71 PILDVQGRRFRFGTIGGQNVVMVMTGLSMLNAGLTTQLLLSLFR------VKGIVHWGIA 124
Query 122 GGAGRT-SIGDVAIPARW 138
G A IGDV IP W
Sbjct 125 GNANEDLQIGDVTIPESW 142
>gi|326503550|dbj|BAJ86281.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length=361
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/71 (36%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGRT- 127
RR+ G+I G+ V++ MTG+ ++NA TT+ + F + ++ G+AG A
Sbjct 78 RRFRFGTIEGQSVVMVMTGLSMLNAGLTTQMLLSLFR------VKGIVHWGIAGNANEDL 131
Query 128 SIGDVAIPARW 138
IGDV IP W
Sbjct 132 QIGDVTIPEYW 142
>gi|225445182|ref|XP_002280745.1| PREDICTED: hypothetical protein [Vitis vinifera]
Length=338
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/95 (33%), Positives = 47/95 (50%), Gaps = 10/95 (10%)
Query 48 EADAVL---AHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
E D +L + A D P + RR+ +G + K VI+ MTG+ ++NA TT+ + F
Sbjct 45 EMDPLLQSPSFVADDKFPYLDFAGRRFRIGKLETKNVIIVMTGLSMLNAGLTTQLLLSLF 104
Query 105 TCASSIAIAAVMFSGVAGGAG-RTSIGDVAIPARW 138
+ V+ G+AG A IGDV +P W
Sbjct 105 ------KVNGVLHYGIAGNANPNLQIGDVTVPKYW 133
>gi|297803678|ref|XP_002869723.1| hypothetical protein ARALYDRAFT_354336 [Arabidopsis lyrata subsp.
lyrata]
gi|297315559|gb|EFH45982.1| hypothetical protein ARALYDRAFT_354336 [Arabidopsis lyrata subsp.
lyrata]
Length=244
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/117 (29%), Positives = 56/117 (48%), Gaps = 14/117 (11%)
Query 41 VLSAFPAEADAVLAHTALDANPV-----VVADRRRYYLGSISGKKVIVAMTGIGLVNATN 95
+++ F E +A L +P+ + R R+ +G I GKKV+ GIG+VNA
Sbjct 59 LVTVFETEENAFLGSVDFRPDPMHPFLDLSGRRFRFRIGKIHGKKVVYVRCGIGMVNAAA 118
Query 96 TTETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARWTLDNGATFRGVDP 151
T+ F + ++ G+AG + SIGDV+IP + + N + ++P
Sbjct 119 ATQQMIDVFN------VKGIVHFGIAGNINNSMSIGDVSIPKQ--ITNAGLWDWLNP 167
>gi|159117992|ref|XP_001709215.1| 5'-methylthioadenosine nucleosidase, S-adenosylhomocysteine nucleosidase
[Giardia lamblia ATCC 50803]
gi|157437331|gb|EDO81541.1| 5-methylthioadenosine nucleosidase, S-adenosylhomocysteine nucleosidase
[Giardia lamblia ATCC 50803]
Length=294
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/67 (42%), Positives = 40/67 (60%), Gaps = 7/67 (10%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGG-AGRT 127
R +Y G ISG+ V+++ +GIG V A +T + F C+ V+F GVAGG GR
Sbjct 38 RTFYTGEISGQTVVISKSGIGKVAAASTAAIMISVFGCSE------VIFLGVAGGIQGRA 91
Query 128 SIGDVAI 134
+IGDV +
Sbjct 92 AIGDVVV 98
>gi|297738794|emb|CBI28039.3| unnamed protein product [Vitis vinifera]
Length=329
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/95 (33%), Positives = 47/95 (50%), Gaps = 10/95 (10%)
Query 48 EADAVL---AHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
E D +L + A D P + RR+ +G + K VI+ MTG+ ++NA TT+ + F
Sbjct 36 EMDPLLQSPSFVADDKFPYLDFAGRRFRIGKLETKNVIIVMTGLSMLNAGLTTQLLLSLF 95
Query 105 TCASSIAIAAVMFSGVAGGAG-RTSIGDVAIPARW 138
+ V+ G+AG A IGDV +P W
Sbjct 96 ------KVNGVLHYGIAGNANPNLQIGDVTVPKYW 124
>gi|218196319|gb|EEC78746.1| hypothetical protein OsI_18961 [Oryza sativa Indica Group]
Length=258
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/71 (39%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ GSI KKVI+ MTG+G++N+ TT+ F + ++ G+AG A
Sbjct 87 RRFRFGSIGEKKVIIVMTGLGMLNSGVTTQLLLTLFD------VEGIVHFGIAGNADPDL 140
Query 128 SIGDVAIPARW 138
IGDV +P W
Sbjct 141 HIGDVTVPRYW 151
>gi|115462741|ref|NP_001054970.1| Os05g0228000 [Oryza sativa Japonica Group]
gi|50511386|gb|AAT77309.1| unknown protein [Oryza sativa Japonica Group]
gi|113578521|dbj|BAF16884.1| Os05g0228000 [Oryza sativa Japonica Group]
gi|215678730|dbj|BAG95167.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222630682|gb|EEE62814.1| hypothetical protein OsJ_17617 [Oryza sativa Japonica Group]
Length=360
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/71 (39%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ GSI KKVI+ MTG+G++N+ TT+ F + ++ G+AG A
Sbjct 87 RRFRFGSIGEKKVIIVMTGLGMLNSGVTTQLLLTLFD------VEGIVHFGIAGNADPDL 140
Query 128 SIGDVAIPARW 138
IGDV +P W
Sbjct 141 HIGDVTVPRYW 151
>gi|115435464|ref|NP_001042490.1| Os01g0229800 [Oryza sativa Japonica Group]
gi|113532021|dbj|BAF04404.1| Os01g0229800 [Oryza sativa Japonica Group]
Length=303
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/71 (36%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAGR-T 127
RR+ G+I G+ V++ MTG+ ++NA TT+ + F + ++ G+AG A
Sbjct 25 RRFRFGAIGGQNVVMVMTGLSMLNAGLTTQLLLSLFR------VKGIVHWGIAGNADEGL 78
Query 128 SIGDVAIPARW 138
IGDV IP W
Sbjct 79 QIGDVTIPEHW 89
>gi|125525025|gb|EAY73139.1| hypothetical protein OsI_01011 [Oryza sativa Indica Group]
Length=354
Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 30/96 (32%), Positives = 47/96 (49%), Gaps = 11/96 (11%)
Query 48 EADAVLAHTALDANPVVVA----DRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFAR 103
E D VL + P + + RR+ G+I G+ V++ MTG+ ++NA TT+ +
Sbjct 51 EMDPVLNSPSFTPTPTLPSYLDIQGRRFRFGAIGGQNVVMVMTGLSMLNAGLTTQLLLSL 110
Query 104 FTCASSIAIAAVMFSGVAGGAGR-TSIGDVAIPARW 138
F + ++ G+AG A IGDV IP W
Sbjct 111 FR------VKGIVHWGIAGNADEGLQIGDVTIPEHW 140
>gi|255586615|ref|XP_002533940.1| mta/sah nucleosidase, putative [Ricinus communis]
gi|223526095|gb|EEF28447.1| mta/sah nucleosidase, putative [Ricinus communis]
Length=317
Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 31/102 (31%), Positives = 49/102 (49%), Gaps = 10/102 (9%)
Query 41 VLSAFPAEADAVLAHTALD---ANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTT 97
+++ +P E ++ A A +P V RR+ +G I KKVI G+G+VNA T
Sbjct 49 LITVYPPEENSFFATGAFKPYGKHPFVDLSGRRFRIGKIREKKVIYVRCGVGMVNAAAAT 108
Query 98 ETAFARFTCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
+ F + ++ G+AG + SIGDV IP +
Sbjct 109 QQMVDMFD------VKGIVHFGIAGNVNNSMSIGDVTIPKEF 144
>gi|147852653|emb|CAN83797.1| hypothetical protein VITISV_002973 [Vitis vinifera]
Length=1618
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 25/71 (36%), Positives = 38/71 (54%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ +G + K VI+ MTG+ ++NA TT+ + F + V+ G+AG A
Sbjct 24 RRFRIGKLETKNVIIVMTGLSMLNAGLTTQLLLSLFK------VNGVLHYGIAGNANPNL 77
Query 128 SIGDVAIPARW 138
IGDV +P W
Sbjct 78 QIGDVTVPKYW 88
>gi|145348229|ref|NP_194623.2| Phosphorylase-like protein protein [Arabidopsis thaliana]
gi|332660165|gb|AEE85565.1| Phosphorylase-like protein protein [Arabidopsis thaliana]
Length=348
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/71 (37%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
R++ G +S + VI+ M+G+G+VNA TT+ + F + V+ G+AG A
Sbjct 77 RKFRFGKLSNQPVIIVMSGLGMVNAGVTTQLLVSLFR------LKGVLHYGIAGNADVNL 130
Query 128 SIGDVAIPARW 138
IGDV IP W
Sbjct 131 EIGDVTIPQYW 141
>gi|224119752|ref|XP_002331152.1| predicted protein [Populus trichocarpa]
gi|222873235|gb|EEF10366.1| predicted protein [Populus trichocarpa]
Length=351
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/71 (36%), Positives = 37/71 (53%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
RR+ G + +KVI+ MTG+ ++NA T+ F + V+ G+AG A +
Sbjct 82 RRFRFGKVENEKVIIVMTGLSMLNAGIATQLLLILFK------VKGVLHYGIAGNANPQL 135
Query 128 SIGDVAIPARW 138
IGDV IP W
Sbjct 136 QIGDVTIPQYW 146
>gi|242051489|ref|XP_002454890.1| hypothetical protein SORBIDRAFT_03g000850 [Sorghum bicolor]
gi|241926865|gb|EES00010.1| hypothetical protein SORBIDRAFT_03g000850 [Sorghum bicolor]
Length=362
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/95 (31%), Positives = 47/95 (50%), Gaps = 10/95 (10%)
Query 48 EADAVLAHTALDAN---PVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
E D VL+ + P++ RR+ G+I + V++ MTG+ ++NA TT+ + F
Sbjct 54 EMDPVLSSPDFKPSNNIPILDVQGRRFRFGTIGSQNVVMVMTGLSMLNAGLTTQLLLSLF 113
Query 105 TCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
+ ++ G+AG A IGDV IP W
Sbjct 114 R------VKGIVHWGIAGNANEDLQIGDVTIPESW 142
>gi|226503841|ref|NP_001148584.1| LOC100282200 [Zea mays]
gi|195620590|gb|ACG32125.1| vegetative storage protein PNI288 [Zea mays]
Length=365
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/78 (33%), Positives = 41/78 (53%), Gaps = 7/78 (8%)
Query 62 PVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVA 121
P++ RR+ G+I + V++ MTG+ ++NA TT+ + F + ++ G+A
Sbjct 75 PILDVQGRRFRFGTIGSQNVVMVMTGLSMLNAGLTTQLLLSLFR------VKGIVHWGIA 128
Query 122 GGAGRT-SIGDVAIPARW 138
G A IGDV IP W
Sbjct 129 GNANEDLQIGDVTIPESW 146
>gi|302804997|ref|XP_002984250.1| hypothetical protein SELMODRAFT_445786 [Selaginella moellendorffii]
gi|300148099|gb|EFJ14760.1| hypothetical protein SELMODRAFT_445786 [Selaginella moellendorffii]
Length=471
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/95 (32%), Positives = 49/95 (52%), Gaps = 10/95 (10%)
Query 48 EADAVLAHTALDANPVV-VADR--RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
E DA+ T +P + D RR+++G+I G KV++ + ++NA TT+ F
Sbjct 59 EMDAIRQQTIFAPHPYIPWLDLAGRRFHIGTIQGHKVVLVQSASAMLNAAITTQQLLDFF 118
Query 105 TCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
I V+ G+AG A + +IGD++IP W
Sbjct 119 D------INGVLHYGIAGSANDSLTIGDISIPKYW 147
>gi|302781292|ref|XP_002972420.1| hypothetical protein SELMODRAFT_441751 [Selaginella moellendorffii]
gi|300159887|gb|EFJ26506.1| hypothetical protein SELMODRAFT_441751 [Selaginella moellendorffii]
Length=489
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/95 (32%), Positives = 49/95 (52%), Gaps = 10/95 (10%)
Query 48 EADAVLAHTALDANPVV-VADR--RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARF 104
E DA+ T +P + D RR+++G+I G KV++ + ++NA TT+ F
Sbjct 59 EMDAIRQQTIFAPHPYIPWLDLAGRRFHIGTIQGHKVVLVQSASAMLNAAITTQQLLDFF 118
Query 105 TCASSIAIAAVMFSGVAGGAGRT-SIGDVAIPARW 138
I V+ G+AG A + +IGD++IP W
Sbjct 119 D------INGVLHYGIAGSANDSLTIGDISIPKYW 147
>gi|109896877|ref|YP_660132.1| purine phosphorylases family protein 1 [Pseudoalteromonas atlantica
T6c]
gi|109699158|gb|ABG39078.1| methylthioadenosine nucleosidase [Pseudoalteromonas atlantica
T6c]
Length=329
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/115 (30%), Positives = 57/115 (50%), Gaps = 11/115 (9%)
Query 37 QRTLVLSAFPAEADAVLAHTAL--DAN--PVVVADRRRYYLGSISGKKVIVAMTGIGLVN 92
+ T +++A+ E A+ A A DAN V +Y LG+ G+ +++ TGI + N
Sbjct 31 KWTAIVAAYEPEIKAIDAAIAKLPDANITKTVTFQGVKYQLGTYKGEPIVIFTTGISVPN 90
Query 93 ATNTTETAFARFTCASSIAIAAVMFSGVAGG-AGRTSIGDVAIPARWTLDNGATF 146
A T + A F I V+ G+AG + + GD+A+PARW + + +
Sbjct 91 AAMTMQMALDYF------PIDRVIMMGIAGAISSEFTPGDIAVPARWYFHDESVY 139
>gi|297803158|ref|XP_002869463.1| hypothetical protein ARALYDRAFT_491863 [Arabidopsis lyrata subsp.
lyrata]
gi|297315299|gb|EFH45722.1| hypothetical protein ARALYDRAFT_491863 [Arabidopsis lyrata subsp.
lyrata]
Length=348
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/71 (36%), Positives = 39/71 (55%), Gaps = 7/71 (9%)
Query 69 RRYYLGSISGKKVIVAMTGIGLVNATNTTETAFARFTCASSIAIAAVMFSGVAGGAG-RT 127
R++ G +S + V++ M+G+G+VNA TT+ + F + V+ G+AG A
Sbjct 77 RKFRFGKLSKQPVVIVMSGLGMVNAGVTTQLLVSLFR------LKGVLHYGIAGNADINL 130
Query 128 SIGDVAIPARW 138
IGDV IP W
Sbjct 131 EIGDVTIPQYW 141
Lambda K H
0.322 0.136 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 346350635400
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40