CA2391404A1 - Udp-glucose:aglycon-glucosyltransferase - Google Patents
Udp-glucose:aglycon-glucosyltransferase Download PDFInfo
- Publication number
- CA2391404A1 CA2391404A1 CA002391404A CA2391404A CA2391404A1 CA 2391404 A1 CA2391404 A1 CA 2391404A1 CA 002391404 A CA002391404 A CA 002391404A CA 2391404 A CA2391404 A CA 2391404A CA 2391404 A1 CA2391404 A1 CA 2391404A1
- Authority
- CA
- Canada
- Prior art keywords
- glucose
- udp
- glucosyltransferase
- aglycon
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 title claims abstract description 58
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims abstract description 57
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 46
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims abstract description 26
- 239000008103 glucose Substances 0.000 claims abstract description 26
- 230000001268 conjugating effect Effects 0.000 claims abstract description 22
- 150000003505 terpenes Chemical class 0.000 claims abstract description 19
- 108020004414 DNA Proteins 0.000 claims description 44
- GLZPCOQZEFWAFX-UHFFFAOYSA-N Geraniol Chemical compound CC(C)=CCCC(C)=CCO GLZPCOQZEFWAFX-UHFFFAOYSA-N 0.000 claims description 32
- 102000004169 proteins and genes Human genes 0.000 claims description 30
- 150000001413 amino acids Chemical class 0.000 claims description 25
- NNICRUQPODTGRU-UHFFFAOYSA-N mandelonitrile Chemical compound N#CC(O)C1=CC=CC=C1 NNICRUQPODTGRU-UHFFFAOYSA-N 0.000 claims description 24
- 125000000539 amino acid group Chemical group 0.000 claims description 23
- 238000006243 chemical reaction Methods 0.000 claims description 22
- HOOOPXDSCKBLFG-UHFFFAOYSA-N 4-hydroxymandelonitrile Chemical compound N#CC(O)C1=CC=C(O)C=C1 HOOOPXDSCKBLFG-UHFFFAOYSA-N 0.000 claims description 18
- 239000002299 complementary DNA Substances 0.000 claims description 18
- HOOOPXDSCKBLFG-QMMMGPOBSA-N p-hydroxy-mandelonitrile Natural products N#C[C@H](O)C1=CC=C(O)C=C1 HOOOPXDSCKBLFG-QMMMGPOBSA-N 0.000 claims description 17
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 claims description 16
- 238000000034 method Methods 0.000 claims description 16
- 102000053602 DNA Human genes 0.000 claims description 15
- ZSIAUFGUXNUGDI-UHFFFAOYSA-N hexan-1-ol Chemical compound CCCCCCO ZSIAUFGUXNUGDI-UHFFFAOYSA-N 0.000 claims description 15
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 13
- 230000009261 transgenic effect Effects 0.000 claims description 13
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 11
- FJKROLUGYXJWQN-UHFFFAOYSA-N 4-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 claims description 11
- GLZPCOQZEFWAFX-YFHOEESVSA-N Geraniol Natural products CC(C)=CCC\C(C)=C/CO GLZPCOQZEFWAFX-YFHOEESVSA-N 0.000 claims description 11
- 239000005792 Geraniol Substances 0.000 claims description 11
- 229940113087 geraniol Drugs 0.000 claims description 11
- BVJSUAQZOZWCKN-UHFFFAOYSA-N p-hydroxybenzyl alcohol Chemical compound OCC1=CC=C(O)C=C1 BVJSUAQZOZWCKN-UHFFFAOYSA-N 0.000 claims description 10
- 239000005711 Benzoic acid Substances 0.000 claims description 8
- 235000010233 benzoic acid Nutrition 0.000 claims description 8
- 108090000994 Catalytic RNA Proteins 0.000 claims description 7
- 102000053642 Catalytic RNA Human genes 0.000 claims description 7
- -1 N-hydroxy amino Chemical group 0.000 claims description 7
- 108091034117 Oligonucleotide Proteins 0.000 claims description 7
- 238000002955 isolation Methods 0.000 claims description 7
- WVDDGKGOMKODPV-ZQBYOMGUSA-N phenyl(114C)methanol Chemical compound O[14CH2]C1=CC=CC=C1 WVDDGKGOMKODPV-ZQBYOMGUSA-N 0.000 claims description 7
- 108091092562 ribozyme Proteins 0.000 claims description 7
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 6
- VMEHOTODTPXCKT-UHFFFAOYSA-N 2-hydroxy-2-methylbutanenitrile Chemical compound CCC(C)(O)C#N VMEHOTODTPXCKT-UHFFFAOYSA-N 0.000 claims description 6
- 108020005544 Antisense RNA Proteins 0.000 claims description 6
- 229960004365 benzoic acid Drugs 0.000 claims description 6
- 238000004587 chromatography analysis Methods 0.000 claims description 6
- 239000003184 complementary RNA Substances 0.000 claims description 6
- 229940090248 4-hydroxybenzoic acid Drugs 0.000 claims description 5
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 claims description 5
- GLZPCOQZEFWAFX-JXMROGBWSA-N Nerol Natural products CC(C)=CCC\C(C)=C\CO GLZPCOQZEFWAFX-JXMROGBWSA-N 0.000 claims description 5
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 150000002923 oximes Chemical class 0.000 claims description 5
- YQUVCSBJEUQKSH-UHFFFAOYSA-N protochatechuic acid Natural products OC(=O)C1=CC=C(O)C(O)=C1 YQUVCSBJEUQKSH-UHFFFAOYSA-N 0.000 claims description 5
- WKOLLVMJNQIZCI-UHFFFAOYSA-N vanillic acid Chemical compound COC1=CC(C(O)=O)=CC=C1O WKOLLVMJNQIZCI-UHFFFAOYSA-N 0.000 claims description 5
- TUUBOHWZSQXCSW-UHFFFAOYSA-N vanillic acid Natural products COC1=CC(O)=CC(C(O)=O)=C1 TUUBOHWZSQXCSW-UHFFFAOYSA-N 0.000 claims description 5
- MWOOGOJBHIARFG-UHFFFAOYSA-N vanillin Chemical compound COC1=CC(C=O)=CC=C1O MWOOGOJBHIARFG-UHFFFAOYSA-N 0.000 claims description 5
- FGQOOHJZONJGDT-UHFFFAOYSA-N vanillin Natural products COC1=CC(O)=CC(C=O)=C1 FGQOOHJZONJGDT-UHFFFAOYSA-N 0.000 claims description 5
- 235000012141 vanillin Nutrition 0.000 claims description 5
- ZCHHRLHTBGRGOT-SNAWJCMRSA-N (E)-hex-2-en-1-ol Chemical compound CCC\C=C\CO ZCHHRLHTBGRGOT-SNAWJCMRSA-N 0.000 claims description 4
- UFLHIIWVXFIJGU-ARJAWSKDSA-N (Z)-hex-3-en-1-ol Chemical compound CC\C=C/CCO UFLHIIWVXFIJGU-ARJAWSKDSA-N 0.000 claims description 4
- FXRLMCRCYDHQFW-UHFFFAOYSA-N 2,3,3,3-tetrafluoropropene Chemical compound FC(=C)C(F)(F)F FXRLMCRCYDHQFW-UHFFFAOYSA-N 0.000 claims description 4
- MWFMGBPGAXYFAR-UHFFFAOYSA-N 2-hydroxy-2-methylpropanenitrile Chemical compound CC(C)(O)C#N MWFMGBPGAXYFAR-UHFFFAOYSA-N 0.000 claims description 4
- 239000002253 acid Substances 0.000 claims description 4
- 239000000523 sample Substances 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims description 4
- 101710198130 NADPH-cytochrome P450 reductase Proteins 0.000 claims description 3
- 108700026244 Open Reading Frames Proteins 0.000 claims description 3
- 239000012614 Q-Sepharose Substances 0.000 claims description 3
- 150000003839 salts Chemical class 0.000 claims description 3
- FZENGILVLUJGJX-NSCUHMNNSA-N (E)-acetaldehyde oxime Chemical compound C\C=N\O FZENGILVLUJGJX-NSCUHMNNSA-N 0.000 claims description 2
- CKWCTBJSFBLMOH-UHFFFAOYSA-N 3-methylhex-2-en-1-ol Chemical compound CCCC(C)=CCO CKWCTBJSFBLMOH-UHFFFAOYSA-N 0.000 claims description 2
- 125000004093 cyano group Chemical group *C#N 0.000 claims description 2
- UFLHIIWVXFIJGU-UHFFFAOYSA-N hex-3-en-1-ol Natural products CCC=CCCO UFLHIIWVXFIJGU-UHFFFAOYSA-N 0.000 claims description 2
- 238000007857 nested PCR Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 150000002825 nitriles Chemical class 0.000 claims 2
- BQURYLHIDQYWFD-UHFFFAOYSA-N 3-methylhex-3-en-1-ol Chemical compound CCC=C(C)CCO BQURYLHIDQYWFD-UHFFFAOYSA-N 0.000 claims 1
- 108091028043 Nucleic acid sequence Proteins 0.000 claims 1
- QMVPMAAFGQKVCJ-UHFFFAOYSA-N citronellol Chemical compound OCCC(C)CCC=C(C)C QMVPMAAFGQKVCJ-UHFFFAOYSA-N 0.000 claims 1
- 229930182478 glucoside Natural products 0.000 abstract description 30
- 150000008131 glucosides Chemical class 0.000 abstract description 30
- 230000015572 biosynthetic process Effects 0.000 abstract description 14
- 238000012250 transgenic expression Methods 0.000 abstract description 3
- 241000196324 Embryophyta Species 0.000 description 49
- 108010055629 Glucosyltransferases Proteins 0.000 description 27
- 102000000340 Glucosyltransferases Human genes 0.000 description 27
- 230000000694 effects Effects 0.000 description 21
- 230000000515 cyanogenic effect Effects 0.000 description 20
- 239000000758 substrate Substances 0.000 description 20
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 18
- 229940024606 amino acid Drugs 0.000 description 18
- 239000011541 reaction mixture Substances 0.000 description 14
- 239000000872 buffer Substances 0.000 description 13
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- 240000006394 Sorghum bicolor Species 0.000 description 12
- 229940088598 enzyme Drugs 0.000 description 12
- 239000000047 product Substances 0.000 description 11
- 150000001875 compounds Chemical class 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 9
- 238000002869 basic local alignment search tool Methods 0.000 description 9
- 239000000796 flavoring agent Substances 0.000 description 8
- 235000019634 flavors Nutrition 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 108090000765 processed proteins & peptides Proteins 0.000 description 8
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 7
- 101100004924 Sorghum bicolor CYP79A1 gene Proteins 0.000 description 7
- NVLTYOJHPBMILU-YOVYLDAJSA-N (S)-4-hydroxymandelonitrile beta-D-glucoside Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@H](C#N)C1=CC=C(O)C=C1 NVLTYOJHPBMILU-YOVYLDAJSA-N 0.000 description 6
- ASDREVVGQFYRTH-UHFFFAOYSA-N 3-[[2-acetamido-4-[(4-amino-6-chloro-1,3,5-triazin-2-yl)amino]phenyl]diazenyl]naphthalene-1,5-disulfonic acid Chemical group C=1C=C(N=NC=2C=C3C(=CC=CC3=C(C=2)S(O)(=O)=O)S(O)(=O)=O)C(NC(=O)C)=CC=1NC1=NC(N)=NC(Cl)=N1 ASDREVVGQFYRTH-UHFFFAOYSA-N 0.000 description 6
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- 101100275669 Arabidopsis thaliana CYP71 gene Proteins 0.000 description 6
- 101100219305 Catharanthus roseus CYP71AY1 gene Proteins 0.000 description 6
- NVLTYOJHPBMILU-JAYDOHCTSA-N Dhurrin Natural products O([C@@H](C#N)c1ccc(O)cc1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 NVLTYOJHPBMILU-JAYDOHCTSA-N 0.000 description 6
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 238000005567 liquid scintillation counting Methods 0.000 description 6
- 230000037361 pathway Effects 0.000 description 6
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- NVLTYOJHPBMILU-UHFFFAOYSA-N taxiphyllin Natural products OC1C(O)C(O)C(CO)OC1OC(C#N)C1=CC=C(O)C=C1 NVLTYOJHPBMILU-UHFFFAOYSA-N 0.000 description 6
- DKVBOUDTNWVDEP-NJCHZNEYSA-N teicoplanin aglycone Chemical compound N([C@H](C(N[C@@H](C1=CC(O)=CC(O)=C1C=1C(O)=CC=C2C=1)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)OC=1C=C3C=C(C=1O)OC1=CC=C(C=C1Cl)C[C@H](C(=O)N1)NC([C@H](N)C=4C=C(O5)C(O)=CC=4)=O)C(=O)[C@@H]2NC(=O)[C@@H]3NC(=O)[C@@H]1C1=CC5=CC(O)=C1 DKVBOUDTNWVDEP-NJCHZNEYSA-N 0.000 description 6
- 238000005481 NMR spectroscopy Methods 0.000 description 5
- 239000007983 Tris buffer Substances 0.000 description 5
- 238000010828 elution Methods 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 5
- RGHHSNMVTDWUBI-UHFFFAOYSA-N 4-hydroxybenzaldehyde Chemical compound OC1=CC=C(C=O)C=C1 RGHHSNMVTDWUBI-UHFFFAOYSA-N 0.000 description 4
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 4
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- 240000006365 Vitis vinifera Species 0.000 description 4
- 235000014787 Vitis vinifera Nutrition 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 4
- 239000002243 precursor Substances 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 229960004441 tyrosine Drugs 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- WVDDGKGOMKODPV-UHFFFAOYSA-N Benzyl alcohol Chemical compound OCC1=CC=CC=C1 WVDDGKGOMKODPV-UHFFFAOYSA-N 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 102000051366 Glycosyltransferases Human genes 0.000 description 3
- 108700023372 Glycosyltransferases Proteins 0.000 description 3
- 239000006137 Luria-Bertani broth Substances 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 229930003935 flavonoid Natural products 0.000 description 3
- 235000017173 flavonoids Nutrition 0.000 description 3
- 150000002215 flavonoids Chemical class 0.000 description 3
- 229960003681 gluconolactone Drugs 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- IOVCWXUNBOPUCH-UHFFFAOYSA-M nitrite group Chemical group N(=O)[O-] IOVCWXUNBOPUCH-UHFFFAOYSA-M 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 229930000044 secondary metabolite Natural products 0.000 description 3
- 238000002864 sequence alignment Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000005382 thermal cycling Methods 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 235000013311 vegetables Nutrition 0.000 description 3
- WXTMDXOMEHJXQO-UHFFFAOYSA-N 2,5-dihydroxybenzoic acid Chemical compound OC(=O)C1=CC(O)=CC=C1O WXTMDXOMEHJXQO-UHFFFAOYSA-N 0.000 description 2
- WRMNZCZEMHIOCP-UHFFFAOYSA-N 2-phenylethanol Chemical compound OCCC1=CC=CC=C1 WRMNZCZEMHIOCP-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 2
- 102000018832 Cytochromes Human genes 0.000 description 2
- 108010052832 Cytochromes Proteins 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- QIGBRXMKCJKVMJ-UHFFFAOYSA-N Hydroquinone Chemical compound OC1=CC=C(O)C=C1 QIGBRXMKCJKVMJ-UHFFFAOYSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- 101100049938 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) exr-1 gene Proteins 0.000 description 2
- 101150053185 P450 gene Proteins 0.000 description 2
- REFJWTPEDVJJIY-UHFFFAOYSA-N Quercetin Chemical compound C=1C(O)=CC(O)=C(C(C=2O)=O)C=1OC=2C1=CC=C(O)C(O)=C1 REFJWTPEDVJJIY-UHFFFAOYSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- 235000007230 Sorghum bicolor Nutrition 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 235000009754 Vitis X bourquina Nutrition 0.000 description 2
- 235000012333 Vitis X labruscana Nutrition 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- WUADCCWRTIWANL-UHFFFAOYSA-N biochanin A Chemical compound C1=CC(OC)=CC=C1C1=COC2=CC(O)=CC(O)=C2C1=O WUADCCWRTIWANL-UHFFFAOYSA-N 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- VEVZSMAEJFVWIL-UHFFFAOYSA-O cyanidin cation Chemical compound [O+]=1C2=CC(O)=CC(O)=C2C=C(O)C=1C1=CC=C(O)C(O)=C1 VEVZSMAEJFVWIL-UHFFFAOYSA-O 0.000 description 2
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000002024 ethyl acetate extract Substances 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 125000004383 glucosinolate group Chemical group 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- 150000002338 glycosides Chemical class 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000005342 ion exchange Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000000329 molecular dynamics simulation Methods 0.000 description 2
- 229930003658 monoterpene Natural products 0.000 description 2
- 150000002773 monoterpene derivatives Chemical class 0.000 description 2
- 239000004570 mortar (masonry) Substances 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- MDHYEMXUFSJLGV-UHFFFAOYSA-N phenethyl acetate Chemical compound CC(=O)OCCC1=CC=CC=C1 MDHYEMXUFSJLGV-UHFFFAOYSA-N 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 229940030961 sorghum extract Drugs 0.000 description 2
- QAIPRVGONGVQAS-DUXPYHPUSA-N trans-caffeic acid Chemical compound OC(=O)\C=C\C1=CC=C(O)C(O)=C1 QAIPRVGONGVQAS-DUXPYHPUSA-N 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 235000014101 wine Nutrition 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- TVXJJNJGTDWFLD-UHFFFAOYSA-N (4-hydroxyphenyl)acetaldehyde oxime Chemical compound ON=CCC1=CC=C(O)C=C1 TVXJJNJGTDWFLD-UHFFFAOYSA-N 0.000 description 1
- ACEAELOMUCBPJP-UHFFFAOYSA-N (E)-3,4,5-trihydroxycinnamic acid Natural products OC(=O)C=CC1=CC(O)=C(O)C(O)=C1 ACEAELOMUCBPJP-UHFFFAOYSA-N 0.000 description 1
- XABCFXXGZPWJQP-BYPYZUCNSA-N (S)-3-aminoadipic acid Chemical compound OC(=O)C[C@@H](N)CCC(O)=O XABCFXXGZPWJQP-BYPYZUCNSA-N 0.000 description 1
- FTVWIRXFELQLPI-ZDUSSCGKSA-N (S)-naringenin Chemical compound C1=CC(O)=CC=C1[C@H]1OC2=CC(O)=CC(O)=C2C(=O)C1 FTVWIRXFELQLPI-ZDUSSCGKSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- BOBATFKNMFWLFG-UHFFFAOYSA-N 2-amino-2-cyano-n-methylacetamide Chemical compound CNC(=O)C(N)C#N BOBATFKNMFWLFG-UHFFFAOYSA-N 0.000 description 1
- SNDPXSYFESPGGJ-UHFFFAOYSA-N 2-aminopentanoic acid Chemical compound CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 1
- WLJVXDMOQOGPHL-PPJXEINESA-N 2-phenylacetic acid Chemical compound O[14C](=O)CC1=CC=CC=C1 WLJVXDMOQOGPHL-PPJXEINESA-N 0.000 description 1
- QCHPKSFMDHPSNR-UHFFFAOYSA-N 3-aminoisobutyric acid Chemical compound NCC(C)C(O)=O QCHPKSFMDHPSNR-UHFFFAOYSA-N 0.000 description 1
- YHXHKYRQLYQUIH-UHFFFAOYSA-N 4-hydroxymandelic acid Chemical compound OC(=O)C(O)C1=CC=C(O)C=C1 YHXHKYRQLYQUIH-UHFFFAOYSA-N 0.000 description 1
- 241000023308 Acca Species 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- 241001136792 Alle Species 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101000761642 Arabidopsis thaliana UDP-glycosyltransferase 74B1 Proteins 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- ZFSIGJMSVGZVGP-DHATWTDPSA-N Arg-Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)[C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZFSIGJMSVGZVGP-DHATWTDPSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000870659 Crassula perfoliata var. minor Species 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108091065810 E family Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- 102000016354 Glucuronosyltransferase Human genes 0.000 description 1
- 108010092364 Glucuronosyltransferase Proteins 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- PBWMCUAFLPMYPF-ZQINRCPSSA-N Ile-Trp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PBWMCUAFLPMYPF-ZQINRCPSSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- 101001018085 Lysobacter enzymogenes Lysyl endopeptidase Proteins 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 244000137850 Marrubium vulgare Species 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- PQNASZJZHFPQLE-LURJTMIESA-N N(6)-methyl-L-lysine Chemical compound CNCCCC[C@H](N)C(O)=O PQNASZJZHFPQLE-LURJTMIESA-N 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- BACYUWVYYTXETD-UHFFFAOYSA-N N-Lauroylsarcosine Chemical compound CCCCCCCCCCCC(=O)N(C)CC(O)=O BACYUWVYYTXETD-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- OLNLSTNFRUFTLM-BYPYZUCNSA-N N-ethyl-L-asparagine Chemical compound CCN[C@H](C(O)=O)CC(N)=O OLNLSTNFRUFTLM-BYPYZUCNSA-N 0.000 description 1
- YPIGGYHFMKJNKV-UHFFFAOYSA-N N-ethylglycine Chemical compound CC[NH2+]CC([O-])=O YPIGGYHFMKJNKV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 101710107606 Putative glycosyltransferase Proteins 0.000 description 1
- ZVOLCUVKHLEPEV-UHFFFAOYSA-N Quercetagetin Natural products C1=C(O)C(O)=CC=C1C1=C(O)C(=O)C2=C(O)C(O)=C(O)C=C2O1 ZVOLCUVKHLEPEV-UHFFFAOYSA-N 0.000 description 1
- QNVSXXGDAPORNA-UHFFFAOYSA-N Resveratrol Natural products OC1=CC=CC(C=CC=2C=C(O)C(O)=CC=2)=C1 QNVSXXGDAPORNA-UHFFFAOYSA-N 0.000 description 1
- HWTZYBCRDDUBJY-UHFFFAOYSA-N Rhynchosin Natural products C1=C(O)C(O)=CC=C1C1=C(O)C(=O)C2=CC(O)=C(O)C=C2O1 HWTZYBCRDDUBJY-UHFFFAOYSA-N 0.000 description 1
- ZONYXWQDUYMKFB-UHFFFAOYSA-N SJ000286395 Natural products O1C2=CC=CC=C2C(=O)CC1C1=CC=CC=C1 ZONYXWQDUYMKFB-UHFFFAOYSA-N 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- 244000191761 Sida cordifolia Species 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 101100231547 Sorghum bicolor UGT85B1 gene Proteins 0.000 description 1
- PJANXHGTPQOBST-VAWYXSNFSA-N Stilbene Natural products C=1C=CC=CC=1/C=C/C1=CC=CC=C1 PJANXHGTPQOBST-VAWYXSNFSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- GAMYVSCDDLXAQW-AOIWZFSPSA-N Thermopsosid Natural products O(C)c1c(O)ccc(C=2Oc3c(c(O)cc(O[C@H]4[C@H](O)[C@@H](O)[C@H](O)[C@H](CO)O4)c3)C(=O)C=2)c1 GAMYVSCDDLXAQW-AOIWZFSPSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- XYNPYHXGMWJBLV-VXPJTDKGSA-N Tomatidine Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CC[C@H](O)C[C@@H]4CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@@]11CC[C@H](C)CN1 XYNPYHXGMWJBLV-VXPJTDKGSA-N 0.000 description 1
- LUKBXSAWLPMMSZ-OWOJBTEDSA-N Trans-resveratrol Chemical compound C1=CC(O)=CC=C1\C=C\C1=CC(O)=CC(O)=C1 LUKBXSAWLPMMSZ-OWOJBTEDSA-N 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- XKKBFNPJFZLTMY-CWRNSKLLSA-N Trp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O XKKBFNPJFZLTMY-CWRNSKLLSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 150000003797 alkaloid derivatives Chemical class 0.000 description 1
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 1
- QUMXDOLUJCHOAY-UHFFFAOYSA-N alpha-methylbenzyl acetate Natural products CC(=O)OC(C)C1=CC=CC=C1 QUMXDOLUJCHOAY-UHFFFAOYSA-N 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 239000001166 ammonium sulphate Substances 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 229930014669 anthocyanidin Natural products 0.000 description 1
- 150000001452 anthocyanidin derivatives Chemical class 0.000 description 1
- 235000008758 anthocyanidins Nutrition 0.000 description 1
- XADJWCRESPGUTB-UHFFFAOYSA-N apigenin Natural products C1=CC(O)=CC=C1C1=CC(=O)C2=CC(O)=C(O)C=C2O1 XADJWCRESPGUTB-UHFFFAOYSA-N 0.000 description 1
- 235000008714 apigenin Nutrition 0.000 description 1
- KZNIFHPLKGYRTM-UHFFFAOYSA-N apigenin Chemical compound C1=CC(O)=CC=C1C1=CC(=O)C2=C(O)C=C(O)C=C2O1 KZNIFHPLKGYRTM-UHFFFAOYSA-N 0.000 description 1
- 229940117893 apigenin Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 235000019445 benzyl alcohol Nutrition 0.000 description 1
- 229960004217 benzyl alcohol Drugs 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 235000004883 caffeic acid Nutrition 0.000 description 1
- 229940074360 caffeic acid Drugs 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000005465 channeling Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- QAIPRVGONGVQAS-UHFFFAOYSA-N cis-caffeic acid Natural products OC(=O)C=CC1=CC=C(O)C(O)=C1 QAIPRVGONGVQAS-UHFFFAOYSA-N 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 235000007336 cyanidin Nutrition 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000003810 ethyl acetate extraction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 229930003949 flavanone Natural products 0.000 description 1
- 150000002208 flavanones Chemical class 0.000 description 1
- 235000011981 flavanones Nutrition 0.000 description 1
- 229930003944 flavone Natural products 0.000 description 1
- 150000002212 flavone derivatives Chemical class 0.000 description 1
- 235000011949 flavones Nutrition 0.000 description 1
- HVQAJTFOCKOKIN-UHFFFAOYSA-N flavonol Natural products O1C2=CC=CC=C2C(=O)C(O)=C1C1=CC=CC=C1 HVQAJTFOCKOKIN-UHFFFAOYSA-N 0.000 description 1
- 150000007946 flavonol Chemical class 0.000 description 1
- 235000011957 flavonols Nutrition 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 229960005219 gentisic acid Drugs 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 235000002532 grape seed extract Nutrition 0.000 description 1
- 244000038280 herbivores Species 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 239000012135 ice-cold extraction buffer Substances 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 108010046180 indole-3-acetate beta-glucosyltransferase Proteins 0.000 description 1
- 239000003617 indole-3-acetic acid Substances 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000013383 initial experiment Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- GOMNOOKGLZYEJT-UHFFFAOYSA-N isoflavone Chemical compound C=1OC2=CC=CC=C2C(=O)C=1C1=CC=CC=C1 GOMNOOKGLZYEJT-UHFFFAOYSA-N 0.000 description 1
- CJWQYWQDLBZGPD-UHFFFAOYSA-N isoflavone Natural products C1=C(OC)C(OC)=CC(OC)=C1C1=COC2=C(C=CC(C)(C)O3)C3=C(OC)C=C2C1=O CJWQYWQDLBZGPD-UHFFFAOYSA-N 0.000 description 1
- 235000008696 isoflavones Nutrition 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- MWDZOUNAPSSOEL-UHFFFAOYSA-N kaempferol Natural products OC1=C(C(=O)c2cc(O)cc(O)c2O1)c3ccc(O)cc3 MWDZOUNAPSSOEL-UHFFFAOYSA-N 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- WGEYAGZBLYNDFV-UHFFFAOYSA-N naringenin Natural products C1(=O)C2=C(O)C=C(O)C=C2OC(C1)C1=CC=C(CC1)O WGEYAGZBLYNDFV-UHFFFAOYSA-N 0.000 description 1
- 235000007625 naringenin Nutrition 0.000 description 1
- 229940117954 naringenin Drugs 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- PMOWTIHVNWZYFI-UHFFFAOYSA-N o-Coumaric acid Natural products OC(=O)C=CC1=CC=CC=C1O PMOWTIHVNWZYFI-UHFFFAOYSA-N 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- RSMOYVJMDHSTEV-UYYXNEDZSA-N p-Glucosyloxymandelonitrile Natural products O([C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1)c1ccc([C@H](O)C#N)cc1 RSMOYVJMDHSTEV-UYYXNEDZSA-N 0.000 description 1
- RSMOYVJMDHSTEV-MXNNCRBYSA-N p-Glucosyloxymandelonitrile Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=CC=C(C(O)C#N)C=C1 RSMOYVJMDHSTEV-MXNNCRBYSA-N 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 229940067107 phenylethyl alcohol Drugs 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 239000001253 polyvinylpolypyrrolidone Substances 0.000 description 1
- 235000013809 polyvinylpolypyrrolidone Nutrition 0.000 description 1
- 229920000523 polyvinylpolypyrrolidone Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 229940124272 protein stabilizer Drugs 0.000 description 1
- 235000005875 quercetin Nutrition 0.000 description 1
- 229960001285 quercetin Drugs 0.000 description 1
- 235000021067 refined food Nutrition 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 235000021283 resveratrol Nutrition 0.000 description 1
- 229940016667 resveratrol Drugs 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 1
- 108700004121 sarkosyl Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000019702 secondary metabolite biosynthetic process Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- PWRIIDWSQYQFQD-UHFFFAOYSA-N sisunine Natural products CC1CCC2(NC1)OC3CC4C5CCC6CC(CCC6(C)C5CCC4(C)C3C2C)OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OC(CO)C(O)C(O)C9OC%10OC(CO)C(O)C(O)C%10O)C8O)C(O)C7O PWRIIDWSQYQFQD-UHFFFAOYSA-N 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- PJANXHGTPQOBST-UHFFFAOYSA-N stilbene Chemical compound C=1C=CC=CC=1C=CC1=CC=CC=C1 PJANXHGTPQOBST-UHFFFAOYSA-N 0.000 description 1
- 235000021286 stilbenes Nutrition 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000012134 supernatant fraction Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 235000007586 terpenes Nutrition 0.000 description 1
- XYNPYHXGMWJBLV-OFMODGJOSA-N tomatidine Natural products O[C@@H]1C[C@H]2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]5[C@@H](C)[C@]6(O[C@H]5C4)NC[C@@H](C)CC6)CC3)CC2)CC1 XYNPYHXGMWJBLV-OFMODGJOSA-N 0.000 description 1
- YNJBWRMUSHSURL-UHFFFAOYSA-N trichloroacetic acid Chemical compound OC(=O)C(Cl)(Cl)Cl YNJBWRMUSHSURL-UHFFFAOYSA-N 0.000 description 1
- 229960004319 trichloroacetic acid Drugs 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- VHBFFQKBGNRLFZ-UHFFFAOYSA-N vitamin p Natural products O1C2=CC=CC=C2C(=O)C=C1C1=CC=CC=C1 VHBFFQKBGNRLFZ-UHFFFAOYSA-N 0.000 description 1
- 239000002676 xenobiotic agent Substances 0.000 description 1
- 230000002034 xenobiotic effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0077—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with a reduced iron-sulfur protein as one donor (1.14.15)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Botany (AREA)
- Nutrition Science (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
The present invention provides DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose. Transgenic expression of corresponding genes in plants can be used to influence the biosynthesis of the corresponding glucosides.
Description
UDP-Glucose:aglycon-glucosYltransferase The present invention provides DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose. Transgenic expression of corresponding genes in plants can be used to influence the biosynthesis of the corresponding glucosides.
The biosynthetic pathway of dhurrin has been studied in etiolated seedlings of Sorghum bicolor, and was found to involve two membrane-bound multi-functional cytochrome P450s.
The amino acid precursor L-tyrosine is hydroxylated twice by the enzyme (P450T,.R) forming (Z)-p-hydroxyphenylacetaldoxime (WO 95/16041 ), which subsequently is converted by the enzyme CYP71 E1 (P450 ox) to the cyanohydrine p-hydroxymandelonitrile (WO 98140470). Transgenic expression of said enzymes is used to modify, reconstitute, or newly establish the biosynthetic pathway of cyanogenic glucosides or to modify glucosinolate production in plants .
In dhurrin biosynthesis, the cyanohydrin p-hydroxymandelonitrile forms an equilibrium with p-hydroxybenzaldehyde and CN ~ at physiological pH and is conjugated to glucose by a UDP-glucose:aglycon-glucosyltransferase. Plants have a large capability to glucosylate a wide range of different chemical structures, but the number of glucosyltransferases present in plants and the range of substrate specificities are largely unknown.
Earlier studies indicate that both narrow and broad substrate specificities can be found.
Unfortunately, the difficulties encountered in isolating glucosyltransferases to homogeneity without a simultaneous loss of their biological activity confuse the picture. The difficulties encountered partly reflect that many glucosyltransferases have similar molecular mass, are labile and present in minute amounts. Whereas over one hundred different cDNAs encoding putative, secondary plant metabolism glucosyltransferases are described in publicly accessible databases, only a few of the proteins have been verified. There are no reports of the isolation of a cyanohydrin glucosyltransferase from a cyanogenic plant. The present invention demonstrates that expression of both the UDP-glucose:mandelonitrile-glucosyltransferase and the enzymes CYP79A1 and CYP71 E1 in transgenic plants enables these plants to catalyze the conversion of the amino acid tyrosine to the cyanogenic glucoside dhurrin. Thus, the combined expression of proteins catalyzing the reactions
The biosynthetic pathway of dhurrin has been studied in etiolated seedlings of Sorghum bicolor, and was found to involve two membrane-bound multi-functional cytochrome P450s.
The amino acid precursor L-tyrosine is hydroxylated twice by the enzyme (P450T,.R) forming (Z)-p-hydroxyphenylacetaldoxime (WO 95/16041 ), which subsequently is converted by the enzyme CYP71 E1 (P450 ox) to the cyanohydrine p-hydroxymandelonitrile (WO 98140470). Transgenic expression of said enzymes is used to modify, reconstitute, or newly establish the biosynthetic pathway of cyanogenic glucosides or to modify glucosinolate production in plants .
In dhurrin biosynthesis, the cyanohydrin p-hydroxymandelonitrile forms an equilibrium with p-hydroxybenzaldehyde and CN ~ at physiological pH and is conjugated to glucose by a UDP-glucose:aglycon-glucosyltransferase. Plants have a large capability to glucosylate a wide range of different chemical structures, but the number of glucosyltransferases present in plants and the range of substrate specificities are largely unknown.
Earlier studies indicate that both narrow and broad substrate specificities can be found.
Unfortunately, the difficulties encountered in isolating glucosyltransferases to homogeneity without a simultaneous loss of their biological activity confuse the picture. The difficulties encountered partly reflect that many glucosyltransferases have similar molecular mass, are labile and present in minute amounts. Whereas over one hundred different cDNAs encoding putative, secondary plant metabolism glucosyltransferases are described in publicly accessible databases, only a few of the proteins have been verified. There are no reports of the isolation of a cyanohydrin glucosyltransferase from a cyanogenic plant. The present invention demonstrates that expression of both the UDP-glucose:mandelonitrile-glucosyltransferase and the enzymes CYP79A1 and CYP71 E1 in transgenic plants enables these plants to catalyze the conversion of the amino acid tyrosine to the cyanogenic glucoside dhurrin. Thus, the combined expression of proteins catalyzing the reactions
-2-involved in the biosynthesis of cyanogenic glucosides in plants actually establishes the complete pathway for cyanogenic glucoside synthesis in these transgenic plants.
Gene refers to a coding sequence and associated regulatory sequences wherein the coding sequence is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, double stranded RNA, sense RNA or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 3' untranslated sequences and termination sequences. Further elements such as introns may be present as well.
Expression generally refers to the transcription and translation of an endogenous gene or transgene in plants. However, in connection with genes which do not encode a protein such as antisense constructs, the term expression refers to transcription only.
The following solutions are provided by the present invention:
~ A DNA molecule coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin (like mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrine or 2-hydroxy-2-methylbutyronitrile); a terpenoid (like geraniol, nerol or ~3-citronellol); a phenylderivative (like p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol, 2-hydroxy-3-methoxybenzylalcohol, vanillic acid or vanillin) or a hexanolderivative (like 1-hexanol, trans-2-hexen-1-ol, cis-3-hexen-1-ol, 3-methyl-3-hexen-1-of or 3-methyl-2-hexen-1-ol) to glucose as well as the encoded protein itself;
~ Said DNA molecule coding for glucosyltransferase having the formula R1-R2-R3, wherein -- R,, R2 and R3 are component sequences consisting of amino acid residues independently selected from the group of the amino acid residues Gly, Ala, Val, Leu, Ile, Phe, Pro, Ser, Thr, Cys, Met, Trp, Tyr, Asn, Gln, Asp, Glu, Lys, Arg and His, and -- R2 consists of 150 or more amino acid residues the sequence of which is at least 50%
identical to an aligned component sequence of SEQ ID NO: 1 as determined using the computer program blastp of the BLAST 2.0 set of similarity search programs, optional parameters set to the default values ~ Said DNA molecule, wherein R2 encodes 150-425 amino acid residues such as amino acids 21 to 445, 168 to 448, or 281 to 448 of SEQ ID NO: 1;
~ Said DNA molecule, wherein R, and R3 consist independently of 0 to 500 amino acid residues;
Gene refers to a coding sequence and associated regulatory sequences wherein the coding sequence is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, double stranded RNA, sense RNA or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 3' untranslated sequences and termination sequences. Further elements such as introns may be present as well.
Expression generally refers to the transcription and translation of an endogenous gene or transgene in plants. However, in connection with genes which do not encode a protein such as antisense constructs, the term expression refers to transcription only.
The following solutions are provided by the present invention:
~ A DNA molecule coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin (like mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrine or 2-hydroxy-2-methylbutyronitrile); a terpenoid (like geraniol, nerol or ~3-citronellol); a phenylderivative (like p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol, 2-hydroxy-3-methoxybenzylalcohol, vanillic acid or vanillin) or a hexanolderivative (like 1-hexanol, trans-2-hexen-1-ol, cis-3-hexen-1-ol, 3-methyl-3-hexen-1-of or 3-methyl-2-hexen-1-ol) to glucose as well as the encoded protein itself;
~ Said DNA molecule coding for glucosyltransferase having the formula R1-R2-R3, wherein -- R,, R2 and R3 are component sequences consisting of amino acid residues independently selected from the group of the amino acid residues Gly, Ala, Val, Leu, Ile, Phe, Pro, Ser, Thr, Cys, Met, Trp, Tyr, Asn, Gln, Asp, Glu, Lys, Arg and His, and -- R2 consists of 150 or more amino acid residues the sequence of which is at least 50%
identical to an aligned component sequence of SEQ ID NO: 1 as determined using the computer program blastp of the BLAST 2.0 set of similarity search programs, optional parameters set to the default values ~ Said DNA molecule, wherein R2 encodes 150-425 amino acid residues such as amino acids 21 to 445, 168 to 448, or 281 to 448 of SEQ ID NO: 1;
~ Said DNA molecule, wherein R, and R3 consist independently of 0 to 500 amino acid residues;
-3-~ Said DNA molecule, wherein R, or R3 encode one or more additional component sequences having a length of at least 30 amino acids and being at least 65%
identical to an aligned component sequence of SEQ ID NO: 1, such as amino acids 21 to 55, 142 to 174, or 303 to 343 of SEQ ID NO: 1;
~ Said DNA molecule coding for a protein of 300 to 600 amino acid residues length such as defined in SEQ ID NO: 2 or the protein defined in SEO ID NO: 1;
~ A method for the isolation of such cDNA molecules;
~ A method for producing purified recombinant UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose;
~ A method for obtaining a transgenic plant as well as the transgenic plant itself comprising stably integrated into its genome DNA coding for said protein or DNA encoding sense RNA, anti sense RNA, double stranded RNA or a ribozyme, the expression of which reduces expression of said protein .
The Arabidopsis thaliana genome is expected to contain approximately 120 genes encoding glucosyltransferases involved in natural product synthesis as deduced from the current state of the Arabidopsis genome sequencing programme. Other plants are also expected to contain a large number of genes encoding glucosyltransferases. In spite of the presence of numerous glucosyltransferases in S. bicolor, none of these except one exert high specificity towards mandelonitrile and p-hydroxymandelonitrile. The presence of several isoforms of this glucosyltransferase is likely considering the evolution and taxonomical background of sorghum and polyploidal forms. The lability of p-hydroxymandelonitrile and the absence of multiple peaks containing p-hydroxymandelonitrile glucosyltransferase activities in S. bicolor during column chromatography demonstrate that a specific glucosyltransferase (sbHMNGT) is involved in the biosynthesis of the cyanogenic glucoside dhurrin.
The biosynthesis of cyanogenic glucosides proceeds according to a general pathway, i.e involving the same type of intermediates in all plants. Accordingly, the enzymes catalyzing these processes in different plant species are expected to show significant similarity. This has already been clearly demonstrated for the part of the pathway involving conversion of amino acids to oximes. This part has in all plants tested been demonstrated to be catalyzed by one or more cytochrome P450 enzymes belonging to the CYP79 family. These
identical to an aligned component sequence of SEQ ID NO: 1, such as amino acids 21 to 55, 142 to 174, or 303 to 343 of SEQ ID NO: 1;
~ Said DNA molecule coding for a protein of 300 to 600 amino acid residues length such as defined in SEQ ID NO: 2 or the protein defined in SEO ID NO: 1;
~ A method for the isolation of such cDNA molecules;
~ A method for producing purified recombinant UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose;
~ A method for obtaining a transgenic plant as well as the transgenic plant itself comprising stably integrated into its genome DNA coding for said protein or DNA encoding sense RNA, anti sense RNA, double stranded RNA or a ribozyme, the expression of which reduces expression of said protein .
The Arabidopsis thaliana genome is expected to contain approximately 120 genes encoding glucosyltransferases involved in natural product synthesis as deduced from the current state of the Arabidopsis genome sequencing programme. Other plants are also expected to contain a large number of genes encoding glucosyltransferases. In spite of the presence of numerous glucosyltransferases in S. bicolor, none of these except one exert high specificity towards mandelonitrile and p-hydroxymandelonitrile. The presence of several isoforms of this glucosyltransferase is likely considering the evolution and taxonomical background of sorghum and polyploidal forms. The lability of p-hydroxymandelonitrile and the absence of multiple peaks containing p-hydroxymandelonitrile glucosyltransferase activities in S. bicolor during column chromatography demonstrate that a specific glucosyltransferase (sbHMNGT) is involved in the biosynthesis of the cyanogenic glucoside dhurrin.
The biosynthesis of cyanogenic glucosides proceeds according to a general pathway, i.e involving the same type of intermediates in all plants. Accordingly, the enzymes catalyzing these processes in different plant species are expected to show significant similarity. This has already been clearly demonstrated for the part of the pathway involving conversion of amino acids to oximes. This part has in all plants tested been demonstrated to be catalyzed by one or more cytochrome P450 enzymes belonging to the CYP79 family. These
-4-cytochromes P450 show more than 40% sequence identity at the amino acid level.
The initial conversion of the amino acids to oximes in glucosinolate synthesis is also catalyzed by a cytochrome P450 enzyme belonging to the CYP79 family . In line with these previous findings, it is expected that in plants synthesizing cyanogenic glucosides conjugation of glucose to cyanohydrins follows a conserved biochemical pathway involving structurally related glucosyltransferases. The aim of the present invention is to provide DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a number of cyanohydrins, a terpenoids, phenylderivatives, and hexanolderivatives ( p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol and/or geraniol )?
to glucose and to define their general structure in cyanogenic plants on the basis of the amino acid sequence of the S. bicolor UDP-glucose:hydroxymandelonitrile-O-glucosyltransferase and its corresponding gene sequence. Thus the present invention provides DNA
molecules coding for a UDP-glucose:aglycon-glucosyltransferase and conjugating a cyanohydrin (like mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrine or 2-hydroxy-2-methylbutyronitrile); a terpenoid (like geraniol, nerol or ~3-citronellol); a phenylderivative (like p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol, 2-hydroxy-3-methoxybenzylalcohol, vanillic acid or vanillin) or a hexanolderivative (like 1-hexanol, trans-2-hexes-1-ol, cis-3-hexes-1-ol, 3-methyl-3-hexes-1-of or 3-methyl-2-hexes-1-ol) to glucose having the formula R1-R2-R3, wherein -- R,, R2 and R3 are component sequences consisting of amino acid residues independently selected from the group of the amino acid residues Gly, Ala, Val, Leu, Ile, Phe, Pro, Ser, Thr, Cys, Met, Trp, Tyr, Asn, Gln, Asp, Glu, Lys, Arg, His and optionally any other amino acid residue which can result from posttranslational modification within a living cell, and -- Rz consists of 150, preferably 250 or more amino acid residues the sequence of which is at least 50%, preferably at least 55%, or even more prefered at least 70%
identical to an aligned component sequence of SEQ ID NO: 1.
Typical amino acid residues which can result from posttranslational modification within a living cell are Aad, bAad, bAla, Abu, 4Abu, Acp, Ahe, Aib, bAib, Apm, Dbu, Des, Dpm, Dpr, EtGly, EtAsn, Hyl, aHyl, 3Hyp, 4Hyp, Ide, alle, MeGly, Melle, MeLys, MeVal, Nva, Nle and Orn.
The initial conversion of the amino acids to oximes in glucosinolate synthesis is also catalyzed by a cytochrome P450 enzyme belonging to the CYP79 family . In line with these previous findings, it is expected that in plants synthesizing cyanogenic glucosides conjugation of glucose to cyanohydrins follows a conserved biochemical pathway involving structurally related glucosyltransferases. The aim of the present invention is to provide DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a number of cyanohydrins, a terpenoids, phenylderivatives, and hexanolderivatives ( p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol and/or geraniol )?
to glucose and to define their general structure in cyanogenic plants on the basis of the amino acid sequence of the S. bicolor UDP-glucose:hydroxymandelonitrile-O-glucosyltransferase and its corresponding gene sequence. Thus the present invention provides DNA
molecules coding for a UDP-glucose:aglycon-glucosyltransferase and conjugating a cyanohydrin (like mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrine or 2-hydroxy-2-methylbutyronitrile); a terpenoid (like geraniol, nerol or ~3-citronellol); a phenylderivative (like p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p -hydroxy-benzylalcohol, 2-hydroxy-3-methoxybenzylalcohol, vanillic acid or vanillin) or a hexanolderivative (like 1-hexanol, trans-2-hexes-1-ol, cis-3-hexes-1-ol, 3-methyl-3-hexes-1-of or 3-methyl-2-hexes-1-ol) to glucose having the formula R1-R2-R3, wherein -- R,, R2 and R3 are component sequences consisting of amino acid residues independently selected from the group of the amino acid residues Gly, Ala, Val, Leu, Ile, Phe, Pro, Ser, Thr, Cys, Met, Trp, Tyr, Asn, Gln, Asp, Glu, Lys, Arg, His and optionally any other amino acid residue which can result from posttranslational modification within a living cell, and -- Rz consists of 150, preferably 250 or more amino acid residues the sequence of which is at least 50%, preferably at least 55%, or even more prefered at least 70%
identical to an aligned component sequence of SEQ ID NO: 1.
Typical amino acid residues which can result from posttranslational modification within a living cell are Aad, bAad, bAla, Abu, 4Abu, Acp, Ahe, Aib, bAib, Apm, Dbu, Des, Dpm, Dpr, EtGly, EtAsn, Hyl, aHyl, 3Hyp, 4Hyp, Ide, alle, MeGly, Melle, MeLys, MeVal, Nva, Nle and Orn.
-5-Typically R2 consists of 150 to 425 amino acid residues, a length of 150 to 280 amino acid residues being preferred. Specific embodiments of R 2 are represented by amino acids 21 to 445, 168 to 448 or 281 to 448 of SEQ ID NO: 1.
R1 and R3 independently consist of 0 to 500, preferably 0 to 350 amino acid residues and may comprise one or more additional component sequences having a length of at least 30 amino acids and being at least 65% , but preferably at least 70% identical to an aligned component sequence of SEQ ID NO: 1. Examples of such additional component sequences are represented by amino acids 21 to 55, 142 to 174 or 303 to 343 of SEQ ID
NO: 1.
The glycosyltransferases encoded by said DNA molecules generally consist of 300 to 600 amino acid residues, the S. bicolor enzyme having a size of 492 amino acid residues as described in SEQ ID NO: 1 and as encoded by SEO ID NO: 2.
In general there exist two approaches towards sequence alignment. Dynamic programming algorithms as proposed by Needleman and Wunsch and by Sellers align the entire length of two sequences providing a global alingment of the sequences. The Smith-Waterman algorithm on the other hand yields local alignments. A local alignment aligns the pair of regions within the sequences that are most similiar given the choice of scoring matrix and gap penalties. This allows a database search to focus on the most highly conserved regions of the sequences. It also allows similiar domains within sequences to be identified. To speed up alignments using the Smith-Waterman algorithm programs such as BLAST
(Basic Local Alignment Search Tool) and FASTA place additional restrictions on the alignments.
Within the context of the present invention overall sequence alignments are conveniently performed using using the program PILEUP available from the Genetic Computer Group, Madison, WI.
Local alignments are performed conveniently using BLAST, a set of similarity search programs designed to explore all of the available sequence databases regardless of whether the query is protein or DNA. Version BLAST 2.0 (Gapped BLAST) of this search tool has been made publicly available on the Internet (currently http://www.ncbi.nlm.nih.gov/BLAST/). It uses a heuristic algorithm which seeks local as opposed to global alignments and is therefore able to detect relationships among sequences which share only isolated regions. The scores assigned in a BLAST
search have a well-defined statistical interpretation. Particularly useful within the scope of the present invention are the blastp program allowing for the introduction of gaps in the local sequence
R1 and R3 independently consist of 0 to 500, preferably 0 to 350 amino acid residues and may comprise one or more additional component sequences having a length of at least 30 amino acids and being at least 65% , but preferably at least 70% identical to an aligned component sequence of SEQ ID NO: 1. Examples of such additional component sequences are represented by amino acids 21 to 55, 142 to 174 or 303 to 343 of SEQ ID
NO: 1.
The glycosyltransferases encoded by said DNA molecules generally consist of 300 to 600 amino acid residues, the S. bicolor enzyme having a size of 492 amino acid residues as described in SEQ ID NO: 1 and as encoded by SEO ID NO: 2.
In general there exist two approaches towards sequence alignment. Dynamic programming algorithms as proposed by Needleman and Wunsch and by Sellers align the entire length of two sequences providing a global alingment of the sequences. The Smith-Waterman algorithm on the other hand yields local alignments. A local alignment aligns the pair of regions within the sequences that are most similiar given the choice of scoring matrix and gap penalties. This allows a database search to focus on the most highly conserved regions of the sequences. It also allows similiar domains within sequences to be identified. To speed up alignments using the Smith-Waterman algorithm programs such as BLAST
(Basic Local Alignment Search Tool) and FASTA place additional restrictions on the alignments.
Within the context of the present invention overall sequence alignments are conveniently performed using using the program PILEUP available from the Genetic Computer Group, Madison, WI.
Local alignments are performed conveniently using BLAST, a set of similarity search programs designed to explore all of the available sequence databases regardless of whether the query is protein or DNA. Version BLAST 2.0 (Gapped BLAST) of this search tool has been made publicly available on the Internet (currently http://www.ncbi.nlm.nih.gov/BLAST/). It uses a heuristic algorithm which seeks local as opposed to global alignments and is therefore able to detect relationships among sequences which share only isolated regions. The scores assigned in a BLAST
search have a well-defined statistical interpretation. Particularly useful within the scope of the present invention are the blastp program allowing for the introduction of gaps in the local sequence
-6-alignments and the PSI-BLAST program, both programs comparing an amino acid query sequence against a protein sequence database, as well as a blastp variant program allowing local alignment of two sequences only. Said programs are preferably run with optional parameters set to the default values.
Additionally, sequence alignments using BLAST can take into account whether the substitution of one amino acid for another is likely to conserve the physical and chemical properties necessary to maintain the structure and function of a protein or is more likely to disrupt essential structural and functional features. Such sequence similarity is quantified in terms of of a percentage of 'positive' amino acids, as compared to the percentage of identical amino acids and can help assigning a protein to the correct protein family in border-line cases.
Investigations into the quantitative and qualitative substrate specificity of sbHMNGT showed a strong preference for the cyanohydrin present in S, bicolor. Thus, in vivo cyanohydrin glucosyltransferases show strong preferences for a limited number of cyanohydrins, terpenoids, phenylderivatives and hexanolderivatives . Nevertheless enzymes catalyzing reactions at the end of biosynthetic pathways often have a broader substrate specificity than those catalyzing preceding reactions resulting in greater flexibility with respect to the evolution of novel secondary metabolite biosynthesis and xenobiotic catabolism. This is illustrated by the finding that whilst the first enzyme of the pathway (CYP79A1 ) is exclusive for tyrosine, CYP71 E1 and sbHMNGT also accept phenylalanine derived oximes and cyanohydrins, respectively. The presence of a nitrite group is also not necessarily required for substrate recognition by sbHMNGT, as demonstrated by the ability of sbHMNGT to glucosylate benzyl alcohol, benzoic acid , vanillic acid, vanillin and 2-hydroxy-3-methoxybenzylalcohol, geraniol, nerol and (3-citronellol. The results demonstrate that sbHMNGT accepts substrates which are structurally similar to the mandelonitrile or p-hydroxy-mandelonitrile. This group of substrate compounds also includes Green Note Flavours such as hexan-1-ol, trans-2-hexene-1-of and cis-3-hexene-1-of and other tyrosine or phenylalanine related aroma compounds like phenylacetic acid, phenylethylalcohol, and phenylethylacetate (Krings et al, Appl. Microbiol. Biotechnol. 49: 1-8, 1998).
The rates observed for glucosylation of benzyl alcohol, benzoic acid and geraniol are lower than those observed for the cyanohydrins. However, they are still high. To this date there are no reports on the isolation or cloning of a monoterpenoid glucosyltransferase nor of glucosyltransferases for hexanol or hexanol derived compounds , despite the obvious importance of these enzyme classes in defining taste and aroma of processed foods and vegetables.
In the process of glycosylation, unstable compounds (aglycons) are generally rendered less chemically reactive and more water soluble through the enzymatic addition of sugar groups.
This typically enables the plant to store increased amounts of these aglycons in the form of glycosides. Many of the secondary metabolites synthesised by plants are glycosylated. For instance over 1500 glycosides of flavonoids alone have been characterised.
Glycosylation generally occurs as a late or the last step in the biosynthesis of compounds otherwise unstable in the cellular environment, and can provide a pool of inactive and transportable precursor forms of compounds that can be obtained in an active form by hydrolysis with glucosidase enzymes. Conversion of free aglycons such as terpenoids and Green Note Flavours into corresponding glucosides through the introduction of a glucosyltransferase can be used to preserve aroma, flavour and colour components in fruits, vegetables and other plants. The aglycons can be liberated by the action of specific or unspecific b-glusidases during food preparation or consumption. Further optimization of the catalytic properties towards individual desired aroma, flavours or colour compounds may be achieved through directed evaluation or methods of genetic engineering such as gene shuffling or mutation .
For example in the grapevine the glucosylation of many secondary metabolites has recently become the focus of significant research efforts arising from the discovery that many of the aroma, flavour and colour components of wine are derived from grape compounds which occur in large part as glucosides. Among such target compounds are the terpenes, e.g.
geraniol which is found in both a free and a glucosylated form. In view of the present invention the glucoside pool of aroma and flavour precursors can be modulated through manipulation of glucosyltransferase activities and aroma and flavour can be released from stored pools of glucosides via acid or enzyme mediated hydrolysis. Thus, in the grape berry and other fruits, vegetables and plants, the introduction of specific glucosyltransferases such as the cloned sbHMGT or reduction of their expression through anti-sense techniques allows directed modification of secondary metabolite composition. This permits modulatation of important free and bound flavour pools of plants allowing the design of fruits, wines and other plant derived products with defined, organoleptic properties.
_g_ The ability of a glucosyltransferase to conjugate an aglycon to glucose can for example be determined in an assay comprising the following steps:
a) Incubation of a reaction mixture comprising '4C-UDP-glucose, aglycon and UDP
glucose:aglycon-glucosyltransferase at 30°C between 2 minutes and 2 hours b) terminating the reaction, and c) chemical identification and quantification of the glucoside produced.
Typically the reaction mixture has a volume of 5 to 2000 p1, but preferably 20 p1 and includes 10-200 mM Tris~HCl (pH 7.9); 1-5 pM'4C-UDP-glucose (about 11.0 GBq~mmol-'); 0-300 pM UDP-glucose; 0-20 mM aglycone; 25 mM 'y-gluconolactone; 0-2 pg/pl BSA
and 0-10 ng/pl UDP-glucose:aglycon-glucosyltransferase. ~3-glucosidase inhibitors other than 'y-gluconolactone and protein stabilizers other than BSA may be included as appropriate. One possibility to terminate the reaction is to acidify the reaction mixture for example by adding 1/10 volume of 10% acetic acid.
Chemical identification and quantification of the glucoside formed in the reaction mixture may be achieved using a variety of methodologies including NMR spectroscopy, TLC
analysis, HPLC analyses or GLC analysis in proper combinations with mass spectrometric analysis of the glucoside.
Reaction mixtures for analysis by NMR spectroscopy usually have a total volume of 0.5 -1 ml, are incubated for 2 hours and include 0-lOmM aglycon, e.g.2 mM p-hydroxy-mandelonitrile or 6.5 mM geraniol, 3 mM UDP-glucose, 2.5 p,g recombinant sbHMNGT, and 0.5 mg BSA.
Glucosides are extracted for example with ethyl acetate and lyophillized prior to NMR
analysis.
For TLC analysis the reaction mixtures are applied to Silica Gel 60 F254 plates (Merck), dried and eluted in a solvent such as ethyl acetate : acetone :
dichloromethane : methanol H20 (40:30:12:10:8, v/v). Plates are dried for one hour at room temperature and exposed to storage phosphorlmaging plates prior to scanning on a Phosphorlmager. Based on the specific radioactivity of the radiolabelled UDP-glucose, the amount of glucoside formed is quantified.
The radioactivity may also be determined by liquid scintillation counting ( LSC analysis). In some cases, where the glucoside formed is derived from a very hydrophobic aglycon, e.g.
mandelonitrile, the glucoside can be extracted into an ethyl acetate phase and thereby be _g_ separated from unincorporated '4C-UDP-glucose. 2 ml of scintillation cocktail are added to 250 p1 of each ethyl acetate extract and analyzed using a liquid scintillation counter. During column fractionation, those fractions containing sbHMNGT activity can be identified using mandelonitrile as the aglycon substrate and ethyl acetate extraction of the glucoside formed.
Knowledge of SEQ ID NO: 1 and SEQ ID NO: 2 can be used to accelerate the isolation and production of DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose which method comprises (a) preparing a cDNA library from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, (b) using at least one oligonucleotide designed on the basis of SEQ ID NO: 2 or SEO ID
NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA from the cDNA library, (c) optionally using one or more oligonucleotides designed on the basi s of SEQ ID NO: 2 or SEQ ID NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA
from the cDNA library in a nested PCR reaction, (d) using the DNA obtained in steps (b) or (c) as a probe to screen the DNA
library prepared from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, and (e) identifying and purifying vector DNA comprising an open reading frame encoding a protein characterized by an amino acid component sequence of at least 150 amino acid residues length having 50% or more sequence identity to an aligned component sequence of SEQ ID NO: 2, and (f) optionally further processing the purified DNA to achieve, for example, heterologous expression of the protein in a microorganism like Escherichia coli or Pichia pastoris for subsequent isolation of the glucosyltransferase, determination of its substrate specificity and generation of an antibody.
In process steps (b) and (c) the second oligonucleotide used for amplification is preferably an oligonucleotide complementary to a region within in the vector DNA used for preparing the cDNA library. However, a second oligonucleotide designed on the basis of the sequence of SEQ ID NO: 2 or SEA ID NO: 1 can also be used. A prefered embodiment of this method for the isolation of cDNA is described in Example 4. cDNA clones coding for UDP-glucose:aglycon-glucosyltransferase or fragments of this clone may also be used on DNA chips alone or in combination with the cDNA clones encoding proteins belonging to the CYP79 or CYP71 E1 family of proteins or fragments of these clones. This provides an easy way to monitor the induction or repression of cyanogenic glucoside synthesis in plants as a result of biotic and abiotic factors.
A further embodiment of the present invention are UDP-glucose:aglycon-glucosyl-transferases conjugating a cyanohydrin to glucose such as the S. bicolor enzyme conjugating p-hydroxymandelonitrile to glucose.
Purified recombinant UDP-glucose:aglycon-glucosyltransferases can be obtained by a method comprising dye chromatography and elution with UDP-glucose. An appropriate column material for dye chromatography is Reactive Yellow 3 preferably cross-linked on beaded agarose. Elution of the protein is conveniently achieved using 2 mM UDP-glucose.
The present invention also provides nucleic acid compounds comprising an open reading frame encoding the novel proteins according to the present invention. Said compounds are characterized by the formula RA-RB-R~, wherein -- RA, RB and R~ constitute component sequences consisting of nucleotide residues independently selected from the group of the nucleotide residues G, A, T and C
or the group of nucleotide residues G, A, U and C, -- RA and R~ consist independently of 0 to 1500, preferably 0 to 1050 nucleotide residues;
-- RB consists of 450-1260 and preferably 450-840 nucleotide residues; and -- the component sequence R B is at least 65% identical to an aligned component sequence of SEQ ID NO: 2.
Specific examples of the component sequence R a are represented by nucleotides 61 to 1335, 502 to 1344, or 841 to 1344 of SEQ ID NO: 2.
In a preferrred embodiment of the present invention at least one of the component sequences RA or R~ comprises one or more additional component sequences which have a length of at least 150 nucleotide residues and are at least 60% identical to an aligned component sequence of SEQ ID NO: 2. Specific examples of such additional component sequences are represented by nucleotides 61 to 165, 427 to 522, or 907 to 1029 of SEQ ID
NO: 2.
The pathway for dhurrin synthesis can be introduced into acyanogenic plants by expression of CYP79A1, CYP71 E1 and the sbHMNGT. These three gene products derived from the same plant species, i.e. sorghum, assemble as a macromolecular complex resulting in stronger channeling of the intermediates in the pathway and less free intermediates are released into the plant.
Expressed as transgenes the DNA molecules encoding glycosyltransferases according to the present invention are particularly useful to modify the biosynthesis of cyanogenic glucosides in plants. When the gene encoding a UDP-glucose:cyanohydrin glucosyltransferase is expressed in conjunction with genes encoding cytochrome enzymes belonging to the CYP79 family (catalyzing the conversion of an amino acid to the corresponding N-hydroxyamino acid and the oxime derived from this N-hydroxy amino acid or a cytochrome P450 monooxygenase) and CYP71 E family (catalyzing the conversion of an aldoxime to a nitrite and the con version of said nitrite to the corresponding cyano hydrin), acyanogenic wild-type plants are converted into cyanogenic plants. Proper selection of promoters to provide constitutive, inducible or tissue specific expression of the genes provides means to obtain transgenic cyanogenic plants with desired disease and herbivor responses. Likewise, the content of cyanogenic glucosides in cyanogenic plants may be modified or reduced using anti-sense , double stranded RNA (dsRNA) or ribozyme technology using the same genes. Cyanogenic glucosides belong to the group of phytoanticipins. In cyanogenic plants, blockage or reduction of UDP-glucose:cyanohydrin glucosyltransferase activity is expected to result in production and accumulation of the same products as normally produced by degradation of cyanogenic glucosides in damaged or infected plant cells. Thus using anti-sense or ribozyme technology, plants can be obtained that produce the degradation products of cyanogenic glucosides in the same tissues where cyanogenic glucosides are produced in the wild-type plant resulting in plants with an altered resistance to pathogens and herbivors. Thus, it is a further aspect of the present invention to provide transgenic plants comprising stably integrated into the genome DNA coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose or DNA encoding sense RNA, anti sense RNA, double stranded RNA or a ribozyme, the expression of which reduces expression of a UDP-glucose:aglycon-glucosyltransferase conjugating p-hydroxymandelonitrile to glucose. Such plants can be produced by a method comprising (a) introducing into a plant cell or tissue which can be regenerated to a complete plant, DNA comprising a gene expressible in that plant encoding a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose or DNA encoding sense RNA, anti sense RNA or a ribozyme, the expression of which reduces the expression of a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin to glucose ; and (b) selecting transgenic plants.
EXAMPLES
Example 1 - UDP-glucose:p-hydroxymandelonitrile-glucosyltransferase assay Generally a 20 p1 reaction mixture including 100 mM Tris~HCl (pH 7.9), 1-5 pM'4C-UDP-glucose (11.0 GBq~mmol-', Amersham LIFE SCIENCE), 0-300 pM UDP-glucose, 0-20 mM p-hydroxymandelonitrile (dissolved in water, freshly prepared), 25 mM y-gluconolactone, 0-1 mg BSA and 0.5-10 p1 of protein preparation, is incubated at 30°C between 2 minutes and 2 hours. Thereafter the reaction is terminated by the addition of 1/10 of the reaction volume of 10% acetic acid. The same assay conditions are used to determine the glucosylation of mandelonitrile, benzoic acid, benzylalcohol, geraniol and a number of other aglycons.
To determine the substrate specificity of recombinant sbHMNGT incubation lasts for 20 min at 30°C and the general protocol above is adapted to include 1.25 mM aglycone (dissolved in ethanol except for flavonoids which are dissolved in ethylene glycol monoether), 1.25 pM'4C-UDP-glucose, 12.5 pM UDP-glucose, 100 ng recombinant sbHMNGT, and 4 ~g BSA.
Quantitative determination of the activity of recombinant sbHMNGT is carried out using 4 minutes incubation at 30°C. Analyses are carried out as for the determination of substrate specificity except that the reaction mixtures are composed as folows:
1, 5 or 10 mM aglycone, pM'4C-UDP-glucose, 0.2 mM UDP-glucose, 200 ng recombinant sbHMNGT, and 24 pg BSA.
Reaction mixtures for analysis by NMR spectroscopy are incubated for 2 hours in a total volume of 0.5-1 ml including 2 mM p-hydroxymandelonitrile or 6.5 mM geraniol, 3 mM UDP-glucose, 2.5 ~g recombinant sbHMNGT, and 0.5 mg BSA.
Glucosides are extracted with ethyl acetate and lyophillized using speedy-vac prior to NMR
analysis.
For TLC anal~rsis the reaction mixture is applied to Silica Gel 60 F254 plates (Merck), dried and eluted in a solvent containing ethyl acetate:acetone:dichloromethane:methanol:H 20 (40:30:12:10:8, v/v). Plates are dried for one hour at room temperature and exposed to storage phosphorlmaging plates (Molecular Dynamics) prior to scanning on a Storm 860 Phosphorlmager (Molecular Dynamics).
For analysis by liquid scintillation counting (LSC) reaction mixtures are extracted with 400 p1 of ethyl acetate to separate glucosides from unincorporated '4C-UDP-glucose. 2 ml of Ecoscint A (National Diagnostics, New Jersey, USA) are added to 250 p,1 of each ethyl acetate extract and analyzed using a Win Spectral 1414 (Wallac) liquid scintillation counter.
Mandelonitrile is used as substrate to assay fractions generated by liquid chromatography.
Example 2 - Purification of UDP-glucose:~rhydroxymandelonitrile-glucosyltransferase Except where indicated all steps are carried out at 4°C. Although the endogenous substrate of sbHMNGT is p-hydroxymandelonitrile, mandelonitrile is employed as the substrate for the assay of sbHMNGT activity throughout purification, since it is an equally good substrate.
Furthermore, the absence of a hydroxyl group at the para-position of the benzene ring rules out the possibility of p-glucosyloxymandelonitrile synthesis, which would be indistinguishable from dhurrin using the LSC assay.
1 kg of S. bicolor seeds are soaked in water over night at room temperature and subsequently grown for 2 days at 30°C in darkness as described in ( Halkier et al, Plant Physiol. 90: 1552-1559, 1989 ). Seedling shoots are harvested and extracted in 2 volumes of ice-cold extraction buffer (250 mM sucrose; 100 mM Tris ~HCI (pH 7.5); 50 mM NaCI; 2 mM EDTA; 5% (w/v) of polyvinylpolypyrrolidone; 200 pM phenylmethylsulfonyl fluoride; 6 mM DTT) using mortar and pestle. The extract is filtered through a nylon mesh prior to centrifugation at 20,000 x g for 20 min. The supernatant fraction is subjected to differential ammonium sulphate fractionation (35-70%) with 1 hour precipitations and centrifugations at 20,000 x g for 20 min. The pellet is resuspended in buffer A (20 mM Tris-HCI
(pH 7.5); 5 mM
DTT) using a paint brush and desalted using a 100 ml Sephadex G-25 (Pharmacia) or Biogel P-6 (Bio-Rad) column (20 ml/min flow-rate) equilibrated in buffer A.
Whilst these purification steps do not result in a measurable increase of the specific activity of sbHMNGT, low molecular weight solutes (including cyanide-precursors) are effectively removed. The first UV-absorbing peak is collected and applied to a 20 ml Q-sepharose (Pharmacia) column (60-80 ml/hr flow-rate) equilibrated in buffer B (buffer A
+ 50 mM
NaCI). The column is washed with buffer B until the baseline has stabilised and proteins are eluted with a linear gradient from 50 to 400 mM NaCI in buffer A (800 ml total). 10 ml fractions are collected and 3-5 p1 assayed for mandelonitrile glucosyltransferase activity by LSC. All sbHMNGT activity bound to Q-sepharose is eluted between 150-200 mM
NaCI with a -7-fold purification. Combined active fractions are diluted five-fold in buffer B and concentrated 20-fold using an Amicon YM30 or YM10 membrane prior to storage at -80°C.
The remaining steps of the dye chromatography purification are carried out at room temperature or at 4°C. One quarter of combined concentrated ion-exchange fractions (~10-15 mg protein in 5 ml) is applied to a column (1 cm x 10 cm) containing Reactive Yellow 3 cross-linked on 4% beaded agarose (Lot 63H9502; Sigma) equilibrated in buffer B (10-15 ml/hr). The column is washed with buffer B until the baseline has stabilised.
Proteins are eluted with 10 ml of 2 mM UDP-glucose in buffer B. Active fractions containing essentially pure sbHMNGT are pooled and stored at -80°C with or without addition of 1 mg/ml BSA.
Results: Initial experiments indicated that a 2-day germination period was optimal with regards to total sbHMNGT activity, protein concentration and extract volume.
The use of a Waring blender resulted in less than 50% of the activity as compared to extraction with mortar and pestle. sbHMNGT activity was largely unaffected by freezing at -80°C and the addition of glycerol had no effect. The addition of elevated concentrations of DTT in buffer solutions (5 mM compared to 2 mM) resulted in a ten-fold greater activity after storage at 4°C for 2 days. This pronounced effect of DTT was primarily found in crude preparations, wheras partially purified ion-exchange preparations were less responsive to the concentration of reducing agents.
Several pseudoaffinity reagents were tested out in mini-column format including Cibachron blue 3G, Reactive Green 19, Reactive Yellow 3 and UDP-glucoronic acid cross-linked with 4% beaded agarose. Trials with elution using NaCI and UDP-glucose at varying salt concentrations identified Reactive Yellow 3 as the superior column material.
sbHMNGT
activity binds to the Reactive Yellow 3 at 50 mM NaCI and could be eluted after washing with a slight increase in NaCf concentration, without any measurable UV
absorbance in the eluate. sbHMNGT activity binds at either salt concentration and can be eluted after washing with a slight increase in NaCI concentration, without any measurable UV-absorbance in the eluate. sbHMNGT activity correlates with a polypeptide migrating around 50-55 kDa by SDS-PAGE, although there are several impurities present (data not shown).
Elution with 2 mM UDP-glucose instead of NaCI results in the elution of a similarly migrating polypeptide in apparent homogeneity. When the protocol is repeated it was found that a low column height in relation to total protein was crucial in order to obtain the same degree of purity. Assuming that all of the polypeptide which was visualised by SDS-PAGE
was active (and therefore that all inactive protein had been lost) and compensating for cold substrate dilution (UDP-glucose), sbHMNGT represented approximately 0.25% of total protein and was purified 420-fold with a yield of 22%.
Example 3 - Peptide Generation and Sequencing Approximately 5 pg of sbHMNGT is subjected to N-terminal sequencing using a protein sequencer (mode! G1000A, Hewlett-Packard). For peptide digestion, approximately 100 pg of sbHMNGT are precipitated with trichloracetic acid and resuspended in 50 p1 of 50 mM
TrisHCl (pH 8.0), 5 mM DTT and 6.4 M Urea. The preparation is incubated at 60°C for 50 min, cooled to room temperature, and diluted with 3 volumes of 30 mM Tris (pH
Additionally, sequence alignments using BLAST can take into account whether the substitution of one amino acid for another is likely to conserve the physical and chemical properties necessary to maintain the structure and function of a protein or is more likely to disrupt essential structural and functional features. Such sequence similarity is quantified in terms of of a percentage of 'positive' amino acids, as compared to the percentage of identical amino acids and can help assigning a protein to the correct protein family in border-line cases.
Investigations into the quantitative and qualitative substrate specificity of sbHMNGT showed a strong preference for the cyanohydrin present in S, bicolor. Thus, in vivo cyanohydrin glucosyltransferases show strong preferences for a limited number of cyanohydrins, terpenoids, phenylderivatives and hexanolderivatives . Nevertheless enzymes catalyzing reactions at the end of biosynthetic pathways often have a broader substrate specificity than those catalyzing preceding reactions resulting in greater flexibility with respect to the evolution of novel secondary metabolite biosynthesis and xenobiotic catabolism. This is illustrated by the finding that whilst the first enzyme of the pathway (CYP79A1 ) is exclusive for tyrosine, CYP71 E1 and sbHMNGT also accept phenylalanine derived oximes and cyanohydrins, respectively. The presence of a nitrite group is also not necessarily required for substrate recognition by sbHMNGT, as demonstrated by the ability of sbHMNGT to glucosylate benzyl alcohol, benzoic acid , vanillic acid, vanillin and 2-hydroxy-3-methoxybenzylalcohol, geraniol, nerol and (3-citronellol. The results demonstrate that sbHMNGT accepts substrates which are structurally similar to the mandelonitrile or p-hydroxy-mandelonitrile. This group of substrate compounds also includes Green Note Flavours such as hexan-1-ol, trans-2-hexene-1-of and cis-3-hexene-1-of and other tyrosine or phenylalanine related aroma compounds like phenylacetic acid, phenylethylalcohol, and phenylethylacetate (Krings et al, Appl. Microbiol. Biotechnol. 49: 1-8, 1998).
The rates observed for glucosylation of benzyl alcohol, benzoic acid and geraniol are lower than those observed for the cyanohydrins. However, they are still high. To this date there are no reports on the isolation or cloning of a monoterpenoid glucosyltransferase nor of glucosyltransferases for hexanol or hexanol derived compounds , despite the obvious importance of these enzyme classes in defining taste and aroma of processed foods and vegetables.
In the process of glycosylation, unstable compounds (aglycons) are generally rendered less chemically reactive and more water soluble through the enzymatic addition of sugar groups.
This typically enables the plant to store increased amounts of these aglycons in the form of glycosides. Many of the secondary metabolites synthesised by plants are glycosylated. For instance over 1500 glycosides of flavonoids alone have been characterised.
Glycosylation generally occurs as a late or the last step in the biosynthesis of compounds otherwise unstable in the cellular environment, and can provide a pool of inactive and transportable precursor forms of compounds that can be obtained in an active form by hydrolysis with glucosidase enzymes. Conversion of free aglycons such as terpenoids and Green Note Flavours into corresponding glucosides through the introduction of a glucosyltransferase can be used to preserve aroma, flavour and colour components in fruits, vegetables and other plants. The aglycons can be liberated by the action of specific or unspecific b-glusidases during food preparation or consumption. Further optimization of the catalytic properties towards individual desired aroma, flavours or colour compounds may be achieved through directed evaluation or methods of genetic engineering such as gene shuffling or mutation .
For example in the grapevine the glucosylation of many secondary metabolites has recently become the focus of significant research efforts arising from the discovery that many of the aroma, flavour and colour components of wine are derived from grape compounds which occur in large part as glucosides. Among such target compounds are the terpenes, e.g.
geraniol which is found in both a free and a glucosylated form. In view of the present invention the glucoside pool of aroma and flavour precursors can be modulated through manipulation of glucosyltransferase activities and aroma and flavour can be released from stored pools of glucosides via acid or enzyme mediated hydrolysis. Thus, in the grape berry and other fruits, vegetables and plants, the introduction of specific glucosyltransferases such as the cloned sbHMGT or reduction of their expression through anti-sense techniques allows directed modification of secondary metabolite composition. This permits modulatation of important free and bound flavour pools of plants allowing the design of fruits, wines and other plant derived products with defined, organoleptic properties.
_g_ The ability of a glucosyltransferase to conjugate an aglycon to glucose can for example be determined in an assay comprising the following steps:
a) Incubation of a reaction mixture comprising '4C-UDP-glucose, aglycon and UDP
glucose:aglycon-glucosyltransferase at 30°C between 2 minutes and 2 hours b) terminating the reaction, and c) chemical identification and quantification of the glucoside produced.
Typically the reaction mixture has a volume of 5 to 2000 p1, but preferably 20 p1 and includes 10-200 mM Tris~HCl (pH 7.9); 1-5 pM'4C-UDP-glucose (about 11.0 GBq~mmol-'); 0-300 pM UDP-glucose; 0-20 mM aglycone; 25 mM 'y-gluconolactone; 0-2 pg/pl BSA
and 0-10 ng/pl UDP-glucose:aglycon-glucosyltransferase. ~3-glucosidase inhibitors other than 'y-gluconolactone and protein stabilizers other than BSA may be included as appropriate. One possibility to terminate the reaction is to acidify the reaction mixture for example by adding 1/10 volume of 10% acetic acid.
Chemical identification and quantification of the glucoside formed in the reaction mixture may be achieved using a variety of methodologies including NMR spectroscopy, TLC
analysis, HPLC analyses or GLC analysis in proper combinations with mass spectrometric analysis of the glucoside.
Reaction mixtures for analysis by NMR spectroscopy usually have a total volume of 0.5 -1 ml, are incubated for 2 hours and include 0-lOmM aglycon, e.g.2 mM p-hydroxy-mandelonitrile or 6.5 mM geraniol, 3 mM UDP-glucose, 2.5 p,g recombinant sbHMNGT, and 0.5 mg BSA.
Glucosides are extracted for example with ethyl acetate and lyophillized prior to NMR
analysis.
For TLC analysis the reaction mixtures are applied to Silica Gel 60 F254 plates (Merck), dried and eluted in a solvent such as ethyl acetate : acetone :
dichloromethane : methanol H20 (40:30:12:10:8, v/v). Plates are dried for one hour at room temperature and exposed to storage phosphorlmaging plates prior to scanning on a Phosphorlmager. Based on the specific radioactivity of the radiolabelled UDP-glucose, the amount of glucoside formed is quantified.
The radioactivity may also be determined by liquid scintillation counting ( LSC analysis). In some cases, where the glucoside formed is derived from a very hydrophobic aglycon, e.g.
mandelonitrile, the glucoside can be extracted into an ethyl acetate phase and thereby be _g_ separated from unincorporated '4C-UDP-glucose. 2 ml of scintillation cocktail are added to 250 p1 of each ethyl acetate extract and analyzed using a liquid scintillation counter. During column fractionation, those fractions containing sbHMNGT activity can be identified using mandelonitrile as the aglycon substrate and ethyl acetate extraction of the glucoside formed.
Knowledge of SEQ ID NO: 1 and SEQ ID NO: 2 can be used to accelerate the isolation and production of DNA molecules coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose which method comprises (a) preparing a cDNA library from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, (b) using at least one oligonucleotide designed on the basis of SEQ ID NO: 2 or SEO ID
NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA from the cDNA library, (c) optionally using one or more oligonucleotides designed on the basi s of SEQ ID NO: 2 or SEQ ID NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA
from the cDNA library in a nested PCR reaction, (d) using the DNA obtained in steps (b) or (c) as a probe to screen the DNA
library prepared from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, and (e) identifying and purifying vector DNA comprising an open reading frame encoding a protein characterized by an amino acid component sequence of at least 150 amino acid residues length having 50% or more sequence identity to an aligned component sequence of SEQ ID NO: 2, and (f) optionally further processing the purified DNA to achieve, for example, heterologous expression of the protein in a microorganism like Escherichia coli or Pichia pastoris for subsequent isolation of the glucosyltransferase, determination of its substrate specificity and generation of an antibody.
In process steps (b) and (c) the second oligonucleotide used for amplification is preferably an oligonucleotide complementary to a region within in the vector DNA used for preparing the cDNA library. However, a second oligonucleotide designed on the basis of the sequence of SEQ ID NO: 2 or SEA ID NO: 1 can also be used. A prefered embodiment of this method for the isolation of cDNA is described in Example 4. cDNA clones coding for UDP-glucose:aglycon-glucosyltransferase or fragments of this clone may also be used on DNA chips alone or in combination with the cDNA clones encoding proteins belonging to the CYP79 or CYP71 E1 family of proteins or fragments of these clones. This provides an easy way to monitor the induction or repression of cyanogenic glucoside synthesis in plants as a result of biotic and abiotic factors.
A further embodiment of the present invention are UDP-glucose:aglycon-glucosyl-transferases conjugating a cyanohydrin to glucose such as the S. bicolor enzyme conjugating p-hydroxymandelonitrile to glucose.
Purified recombinant UDP-glucose:aglycon-glucosyltransferases can be obtained by a method comprising dye chromatography and elution with UDP-glucose. An appropriate column material for dye chromatography is Reactive Yellow 3 preferably cross-linked on beaded agarose. Elution of the protein is conveniently achieved using 2 mM UDP-glucose.
The present invention also provides nucleic acid compounds comprising an open reading frame encoding the novel proteins according to the present invention. Said compounds are characterized by the formula RA-RB-R~, wherein -- RA, RB and R~ constitute component sequences consisting of nucleotide residues independently selected from the group of the nucleotide residues G, A, T and C
or the group of nucleotide residues G, A, U and C, -- RA and R~ consist independently of 0 to 1500, preferably 0 to 1050 nucleotide residues;
-- RB consists of 450-1260 and preferably 450-840 nucleotide residues; and -- the component sequence R B is at least 65% identical to an aligned component sequence of SEQ ID NO: 2.
Specific examples of the component sequence R a are represented by nucleotides 61 to 1335, 502 to 1344, or 841 to 1344 of SEQ ID NO: 2.
In a preferrred embodiment of the present invention at least one of the component sequences RA or R~ comprises one or more additional component sequences which have a length of at least 150 nucleotide residues and are at least 60% identical to an aligned component sequence of SEQ ID NO: 2. Specific examples of such additional component sequences are represented by nucleotides 61 to 165, 427 to 522, or 907 to 1029 of SEQ ID
NO: 2.
The pathway for dhurrin synthesis can be introduced into acyanogenic plants by expression of CYP79A1, CYP71 E1 and the sbHMNGT. These three gene products derived from the same plant species, i.e. sorghum, assemble as a macromolecular complex resulting in stronger channeling of the intermediates in the pathway and less free intermediates are released into the plant.
Expressed as transgenes the DNA molecules encoding glycosyltransferases according to the present invention are particularly useful to modify the biosynthesis of cyanogenic glucosides in plants. When the gene encoding a UDP-glucose:cyanohydrin glucosyltransferase is expressed in conjunction with genes encoding cytochrome enzymes belonging to the CYP79 family (catalyzing the conversion of an amino acid to the corresponding N-hydroxyamino acid and the oxime derived from this N-hydroxy amino acid or a cytochrome P450 monooxygenase) and CYP71 E family (catalyzing the conversion of an aldoxime to a nitrite and the con version of said nitrite to the corresponding cyano hydrin), acyanogenic wild-type plants are converted into cyanogenic plants. Proper selection of promoters to provide constitutive, inducible or tissue specific expression of the genes provides means to obtain transgenic cyanogenic plants with desired disease and herbivor responses. Likewise, the content of cyanogenic glucosides in cyanogenic plants may be modified or reduced using anti-sense , double stranded RNA (dsRNA) or ribozyme technology using the same genes. Cyanogenic glucosides belong to the group of phytoanticipins. In cyanogenic plants, blockage or reduction of UDP-glucose:cyanohydrin glucosyltransferase activity is expected to result in production and accumulation of the same products as normally produced by degradation of cyanogenic glucosides in damaged or infected plant cells. Thus using anti-sense or ribozyme technology, plants can be obtained that produce the degradation products of cyanogenic glucosides in the same tissues where cyanogenic glucosides are produced in the wild-type plant resulting in plants with an altered resistance to pathogens and herbivors. Thus, it is a further aspect of the present invention to provide transgenic plants comprising stably integrated into the genome DNA coding for a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose or DNA encoding sense RNA, anti sense RNA, double stranded RNA or a ribozyme, the expression of which reduces expression of a UDP-glucose:aglycon-glucosyltransferase conjugating p-hydroxymandelonitrile to glucose. Such plants can be produced by a method comprising (a) introducing into a plant cell or tissue which can be regenerated to a complete plant, DNA comprising a gene expressible in that plant encoding a UDP-glucose:aglycon-glucosyltransferase conjugating cyanohydrins, terpenoids, phenylderivatives or hexanolderivatives to glucose or DNA encoding sense RNA, anti sense RNA or a ribozyme, the expression of which reduces the expression of a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin to glucose ; and (b) selecting transgenic plants.
EXAMPLES
Example 1 - UDP-glucose:p-hydroxymandelonitrile-glucosyltransferase assay Generally a 20 p1 reaction mixture including 100 mM Tris~HCl (pH 7.9), 1-5 pM'4C-UDP-glucose (11.0 GBq~mmol-', Amersham LIFE SCIENCE), 0-300 pM UDP-glucose, 0-20 mM p-hydroxymandelonitrile (dissolved in water, freshly prepared), 25 mM y-gluconolactone, 0-1 mg BSA and 0.5-10 p1 of protein preparation, is incubated at 30°C between 2 minutes and 2 hours. Thereafter the reaction is terminated by the addition of 1/10 of the reaction volume of 10% acetic acid. The same assay conditions are used to determine the glucosylation of mandelonitrile, benzoic acid, benzylalcohol, geraniol and a number of other aglycons.
To determine the substrate specificity of recombinant sbHMNGT incubation lasts for 20 min at 30°C and the general protocol above is adapted to include 1.25 mM aglycone (dissolved in ethanol except for flavonoids which are dissolved in ethylene glycol monoether), 1.25 pM'4C-UDP-glucose, 12.5 pM UDP-glucose, 100 ng recombinant sbHMNGT, and 4 ~g BSA.
Quantitative determination of the activity of recombinant sbHMNGT is carried out using 4 minutes incubation at 30°C. Analyses are carried out as for the determination of substrate specificity except that the reaction mixtures are composed as folows:
1, 5 or 10 mM aglycone, pM'4C-UDP-glucose, 0.2 mM UDP-glucose, 200 ng recombinant sbHMNGT, and 24 pg BSA.
Reaction mixtures for analysis by NMR spectroscopy are incubated for 2 hours in a total volume of 0.5-1 ml including 2 mM p-hydroxymandelonitrile or 6.5 mM geraniol, 3 mM UDP-glucose, 2.5 ~g recombinant sbHMNGT, and 0.5 mg BSA.
Glucosides are extracted with ethyl acetate and lyophillized using speedy-vac prior to NMR
analysis.
For TLC anal~rsis the reaction mixture is applied to Silica Gel 60 F254 plates (Merck), dried and eluted in a solvent containing ethyl acetate:acetone:dichloromethane:methanol:H 20 (40:30:12:10:8, v/v). Plates are dried for one hour at room temperature and exposed to storage phosphorlmaging plates (Molecular Dynamics) prior to scanning on a Storm 860 Phosphorlmager (Molecular Dynamics).
For analysis by liquid scintillation counting (LSC) reaction mixtures are extracted with 400 p1 of ethyl acetate to separate glucosides from unincorporated '4C-UDP-glucose. 2 ml of Ecoscint A (National Diagnostics, New Jersey, USA) are added to 250 p,1 of each ethyl acetate extract and analyzed using a Win Spectral 1414 (Wallac) liquid scintillation counter.
Mandelonitrile is used as substrate to assay fractions generated by liquid chromatography.
Example 2 - Purification of UDP-glucose:~rhydroxymandelonitrile-glucosyltransferase Except where indicated all steps are carried out at 4°C. Although the endogenous substrate of sbHMNGT is p-hydroxymandelonitrile, mandelonitrile is employed as the substrate for the assay of sbHMNGT activity throughout purification, since it is an equally good substrate.
Furthermore, the absence of a hydroxyl group at the para-position of the benzene ring rules out the possibility of p-glucosyloxymandelonitrile synthesis, which would be indistinguishable from dhurrin using the LSC assay.
1 kg of S. bicolor seeds are soaked in water over night at room temperature and subsequently grown for 2 days at 30°C in darkness as described in ( Halkier et al, Plant Physiol. 90: 1552-1559, 1989 ). Seedling shoots are harvested and extracted in 2 volumes of ice-cold extraction buffer (250 mM sucrose; 100 mM Tris ~HCI (pH 7.5); 50 mM NaCI; 2 mM EDTA; 5% (w/v) of polyvinylpolypyrrolidone; 200 pM phenylmethylsulfonyl fluoride; 6 mM DTT) using mortar and pestle. The extract is filtered through a nylon mesh prior to centrifugation at 20,000 x g for 20 min. The supernatant fraction is subjected to differential ammonium sulphate fractionation (35-70%) with 1 hour precipitations and centrifugations at 20,000 x g for 20 min. The pellet is resuspended in buffer A (20 mM Tris-HCI
(pH 7.5); 5 mM
DTT) using a paint brush and desalted using a 100 ml Sephadex G-25 (Pharmacia) or Biogel P-6 (Bio-Rad) column (20 ml/min flow-rate) equilibrated in buffer A.
Whilst these purification steps do not result in a measurable increase of the specific activity of sbHMNGT, low molecular weight solutes (including cyanide-precursors) are effectively removed. The first UV-absorbing peak is collected and applied to a 20 ml Q-sepharose (Pharmacia) column (60-80 ml/hr flow-rate) equilibrated in buffer B (buffer A
+ 50 mM
NaCI). The column is washed with buffer B until the baseline has stabilised and proteins are eluted with a linear gradient from 50 to 400 mM NaCI in buffer A (800 ml total). 10 ml fractions are collected and 3-5 p1 assayed for mandelonitrile glucosyltransferase activity by LSC. All sbHMNGT activity bound to Q-sepharose is eluted between 150-200 mM
NaCI with a -7-fold purification. Combined active fractions are diluted five-fold in buffer B and concentrated 20-fold using an Amicon YM30 or YM10 membrane prior to storage at -80°C.
The remaining steps of the dye chromatography purification are carried out at room temperature or at 4°C. One quarter of combined concentrated ion-exchange fractions (~10-15 mg protein in 5 ml) is applied to a column (1 cm x 10 cm) containing Reactive Yellow 3 cross-linked on 4% beaded agarose (Lot 63H9502; Sigma) equilibrated in buffer B (10-15 ml/hr). The column is washed with buffer B until the baseline has stabilised.
Proteins are eluted with 10 ml of 2 mM UDP-glucose in buffer B. Active fractions containing essentially pure sbHMNGT are pooled and stored at -80°C with or without addition of 1 mg/ml BSA.
Results: Initial experiments indicated that a 2-day germination period was optimal with regards to total sbHMNGT activity, protein concentration and extract volume.
The use of a Waring blender resulted in less than 50% of the activity as compared to extraction with mortar and pestle. sbHMNGT activity was largely unaffected by freezing at -80°C and the addition of glycerol had no effect. The addition of elevated concentrations of DTT in buffer solutions (5 mM compared to 2 mM) resulted in a ten-fold greater activity after storage at 4°C for 2 days. This pronounced effect of DTT was primarily found in crude preparations, wheras partially purified ion-exchange preparations were less responsive to the concentration of reducing agents.
Several pseudoaffinity reagents were tested out in mini-column format including Cibachron blue 3G, Reactive Green 19, Reactive Yellow 3 and UDP-glucoronic acid cross-linked with 4% beaded agarose. Trials with elution using NaCI and UDP-glucose at varying salt concentrations identified Reactive Yellow 3 as the superior column material.
sbHMNGT
activity binds to the Reactive Yellow 3 at 50 mM NaCI and could be eluted after washing with a slight increase in NaCf concentration, without any measurable UV
absorbance in the eluate. sbHMNGT activity binds at either salt concentration and can be eluted after washing with a slight increase in NaCI concentration, without any measurable UV-absorbance in the eluate. sbHMNGT activity correlates with a polypeptide migrating around 50-55 kDa by SDS-PAGE, although there are several impurities present (data not shown).
Elution with 2 mM UDP-glucose instead of NaCI results in the elution of a similarly migrating polypeptide in apparent homogeneity. When the protocol is repeated it was found that a low column height in relation to total protein was crucial in order to obtain the same degree of purity. Assuming that all of the polypeptide which was visualised by SDS-PAGE
was active (and therefore that all inactive protein had been lost) and compensating for cold substrate dilution (UDP-glucose), sbHMNGT represented approximately 0.25% of total protein and was purified 420-fold with a yield of 22%.
Example 3 - Peptide Generation and Sequencing Approximately 5 pg of sbHMNGT is subjected to N-terminal sequencing using a protein sequencer (mode! G1000A, Hewlett-Packard). For peptide digestion, approximately 100 pg of sbHMNGT are precipitated with trichloracetic acid and resuspended in 50 p1 of 50 mM
TrisHCl (pH 8.0), 5 mM DTT and 6.4 M Urea. The preparation is incubated at 60°C for 50 min, cooled to room temperature, and diluted with 3 volumes of 30 mM Tris (pH
7.7) and 1.25 mM EDTA. Endo Lys-C (Promega) is added at a 1:25 ratio (w/w) and the reaction mixture is allowed to incubate for 24 hours at 37°C. Peptides are purified by reverse-phase HPLC using a Vydac 208TP52 C8 column (250 mm x 21 mm) and Beckman System Gold HPLC equipment. Peptides are applied at a 0.2 ml/min flow-rate in buffer C
(0.1 trifluoroacetic acid) and eluted with a linear gradient from 0 to 80%
acetonitrile in buffer C.
Fractions are collected manually and sequenced as described above.
Example 4 - Cloning PCR amplification: 1 st round PCR amplification reactions are carried out using 2 units of Taq DNA polymerase (Pharmacia), 4 p1 of lOxTaq DNA polymerase buffer, 5% (v/v) dimethyl sulfoxide, 1 p,1 dNTPs (10 mM), 80 pmoles each of primers C2EF ( 5 ~ -TTYGTIws-ICAYTGYGGITGGAA-3 ~, SEQ ID NO: 3) and T7 (5 ~-AATACGACTCACTATAG-3 ~, SEQ ID
NO: 4) and about 10 ng of plasmid DNA template in a total volume of 40 p1. The plasmid DNA template is prepared from a unidirectional pcDNAlI (Invitrogen) plasmid library made from 1-2 cm high etiolated S. bicolor seedlings (Bak et al, Plant Mol. Biol.
36: 393-405, 1998). Thermal cycling parameters are 95°C, 5 min, 3 x (95°C for 5 sec, 42°C for 30 sec, 72°C for 30 sec), 32 x (95°C for 5 sec, 50°C for 30 sec, 72°C for 30 sec) and a final 72°C
for 5 min.
2nd round PCR amplifications are carried out as above, except for using primers C2DF
(5 ~-GARGCIACIGCIGCIGGICARCC-3 ~, SEQ ID NO: 5) and T7, and 1 q1 of 1st round reaction as DNA template. Thermal cycling parameters are 95°C, 5 min, 32 x (95°C for 5 sec, 55°C for 30 sec, 72°C for 30 sec) and a final 72°C
for 5 min. The PCR reaction mixtures are subjected to gel electrophoresis using a 1.5% agarose gel and an approximately 600 by band is excised and cleaned using a Qiaex II gel extraction kit (Qiagen). The cleaned PCR product is then ligated into the pGEM-T vector and used to transform the E. coli JM109 strain according to the manufacturers instructions (Promega).
Nucleic acid sequencing reveals the presence of two previously obtained peptide sequences in the translation product of PCR clone 15#44.
Cloning and Library Screening: The PCR clone 15#44 is used as a template for generating a 306 by digoxigenin-11-dUTP-labelled probe by PCR using primers 441F (5 ~-GAGGCGA-CGGCGGCGGGGCAG-3 ', SEQ ID NO: 6) and 4428 (5 ~-CATGTCACTGCTTGCCCCCGACCA-3 SEQ ID NO: 7) according to the manufacturer's instructions (Boehringer Mannheim). The labelled probe is cleaned using the Qiaex II gel extraction kit after gel electrophoresis with a 1.5% agarose gel and employed to screen approximately 50,000 colonies of the abovementioned plasmid library. Hybridizations are carried out over night at 65°C in 5x SSC, 0.1 % (w/v) N-lauroylsarcosine, 0.02% (w/v) SDS and 1 % blocking reagent (Boehringer Mannheim). Membranes are then washed in 0.5x SSC at 60°C, 3 x 15 min.
Seven hybridizing clones are isolated and one full-length clone, sbHMNGT1, is chosen for further characterization.
Example 5 - Identity and similarity between sbHMNGT and translation products of known or putative glucosyltransferase-encoding cDNAs Table 1 summarizes the overall identity respectively similarity between sbHMNGT and known or putatice glycosyltransferase amino acid sequences as well as the identities respectively similarities in the corresponding N-terminal regions, i.e. the region defined as the sequence N-terminal of the consensus sequence xCLxWL with the split-point being at amino acid residue 291/292 of sbHMNGT.
Table 2 summarizes the similarity respectively identity between the amino acid sequence of sbHMNGT region a, defined as residues 188-229 in HMNGT, and corresponding sequences in known or putative glycosyltransferase amino acid sequences.
The calculations of similarity and identity are based on a pairwise comparisons of cDNA
translation products using the GAP program (Genetic Computer Group, Madison, WI), wherein A/G, Y/F, SlT, V/I/L, R/K/H, and D/E/N/Q are considered to constitute similar residues. Abbreviated sequence names are stSGT ( Solanum tuberosum solanidine-_18_ glucosyltransferase: GenBankT"" accession number U82367); bnTHGT ( Brassica napes thiohydroximate-S-glucosyltransferase: SEQ ID NO: 28 of EP -771 878-A1 ), zmUFGT
(Maize flavonoid-glucosyltransferase: GenBankT"'' accession number X13502), wUFGT
(Vitis vinifera anthocyanidin-glucosyltransferase: GenBank T"" accession number AF000371 ), psGT (Pisum sativum UDP-glucuronosyltransferase: GenBank T"" accession number AF034743), meGT (Cassava UTP-glucose glucosyltransferase: GenBank T""
accession number X77464), and zmIAAGT (Maize Indole-3-acetate beta-glucosyltransferase:
GenBankT"'' accession number L34847).
Table 1:
sbHMNGT
Overall % N-terminal Identity SimilarityIdentity Similarity zmUFGT 36.7 41.5 32.6 37.1 wUFGT 30.0 38.7 23.8 33.3 psGT 41.6 51.5 32.9 46.3 meGT 31.3 41.6 25.3 36.8 zmIAAGT 34.9 41.3 27.8 35.0 snSGT 28.9 38.0 23.6 31.0 bnTHGT 30.7 38.0 24.7 33.3 Table 2: a region identities (italic) and similarities (bold face) sbHMNGT psGT zmUFGT wUFGT mhGT
sbHMNGT 45.2% 26.2% 19.1 % 20%
psGT 69.1 % ______ ______ ______ zmUFGT 35.7% ------ 47.6% 37.5%
wUFGT 35.7% ------ 59.5% ------mhGT 37.5% ------ 55.0% ------Example 6 - Heterologous expression Primers EXF1 (5' -AATAAAAGCATATGGGAAGCAACGCGCCGCCTCCG-3 ' , SEQ ID NO: 8) and EXR1 ( 5 ' -TTGGATCCTCACTGCTTGCCCCCGACCA-3 ' , SEQ ID NO: 9) are used to amplify a 1500 by full-length sbHMNGT insert by PCR, using the sbHMNGT1 plasmid as template.
The primers contain 5' recognition sites for restriction endonucleases Ndel (EXF1 ) and BamHl (EXR1 ). PCR reaction conditions are essentially as described in example 4, except for the thermal cycling parameters which are 95°C, 3 min, 30 x (95°C for 5 sec, 53°C for 30 sec, 72°C for 90 sec) and a final 72°C for 5 min. The PCR
product is gel purified, digested with Ndel and BamHl, gel purified once again and ligated into the plasmid expression vector pSP19g10L (Barnes, Methods in Enzymology 272: 3-14, 1996) which has also been digested with the restriction enzymes Ndel and BamHl and gel purified. The ligation reaction mixture is then used to transform E. coli JM109 cells according to the manufacturers instructions (Promega). After selection of successfully cloned cells, expression is initiated as described in (Ford et al, J. BioI.Chem. 273: 9224-9233, 1998 ). Briefly, 600 p,1 of a 37°C over night culture are added to 300 ml luria broth (LB) containing 100 pg/ml ampicillin. The culture is allowed to grow at 28°C under continuous shaking at 150 rpm for 5 hours and IPTG is then added to a final concentration of 0.4 mM. After induction the culture is allowed to continue growing over night and harvested by centrifugation at 2500 x g for 10 min. The pellet is resuspended in 9 ml of 200 mM Tris pH 7.9, 1 mM EDTA, 5 mM DTT and 0.1 mg/ml lysozyme. An equal volume of ice-cold water is added and the mixture allowed to incubate for 10 min at RT, followed by 20 min incubation on ice. After the addition of 18 gmoles of phenylmethylsulfonyl fluoride and 100 units of DNasel/ml (Sigma), the suspension is subjected to three freeze and thaw cycles at -20°C.
Phenylmethylsulfonyl fluoride is adjusted to 1.5 mM final concentration and the preparation centrifuged at 15,000 x g for 15 min. Negative controls, containing no insert in the plasmid vector, are prepared as above.
For purification of the recombinant protein two 300 ml cultures are lysed as above and further purified as for the native protein. Briefly, crude cell lysate is subjected to Q-sepahrose chromatography, desalting and Reactive Yellow 3 chromatography as described in example 2. The yield of recombinant protein is approximately 1 mg/100 ml LB
culture.
Example 7 - Substrate specificity of recombinant sbHMNGTcompared to desalted crude etiolated Sorghum seedling extract Glucosyltransferase activity was determined by TLC using '4C-UDP-glucose.
Filled boxes in Table 1 below (t) indicate that a radiolabelled product was visualised after incubation with the respective aglycone substrate. Empty boxes ( O) indicate that no radiolablled products could be detected under the experimental conditions employed. Figures in brackets indicate the relative Vmax for each aglycon with calculated standard deviations. The V max value for p-hydroxymandelonitrile was 1500 mol of product / mol of sbHMNGT /
sec.
Table 3:
SUBSTRATES ACTIVITY
cyanohydrins Crude Sorghum extract Recombinant sbHMNGT
1 ) mandelonitrile ~ ~ (77,g ~ g,6%) 2) p-hydroxymandelonitrile t t (100 ~ 7.2%) 3) acetone cyanohydrin 0 p benzyl derivatives 4) hydroquinone ~ p 5) benzyl alcohol t t (13.1 t 2.1 %) 6) p-hydroxybenzyl alcohol 7) benzoic acid ~ ~ (4.2 t 0.8%)
(0.1 trifluoroacetic acid) and eluted with a linear gradient from 0 to 80%
acetonitrile in buffer C.
Fractions are collected manually and sequenced as described above.
Example 4 - Cloning PCR amplification: 1 st round PCR amplification reactions are carried out using 2 units of Taq DNA polymerase (Pharmacia), 4 p1 of lOxTaq DNA polymerase buffer, 5% (v/v) dimethyl sulfoxide, 1 p,1 dNTPs (10 mM), 80 pmoles each of primers C2EF ( 5 ~ -TTYGTIws-ICAYTGYGGITGGAA-3 ~, SEQ ID NO: 3) and T7 (5 ~-AATACGACTCACTATAG-3 ~, SEQ ID
NO: 4) and about 10 ng of plasmid DNA template in a total volume of 40 p1. The plasmid DNA template is prepared from a unidirectional pcDNAlI (Invitrogen) plasmid library made from 1-2 cm high etiolated S. bicolor seedlings (Bak et al, Plant Mol. Biol.
36: 393-405, 1998). Thermal cycling parameters are 95°C, 5 min, 3 x (95°C for 5 sec, 42°C for 30 sec, 72°C for 30 sec), 32 x (95°C for 5 sec, 50°C for 30 sec, 72°C for 30 sec) and a final 72°C
for 5 min.
2nd round PCR amplifications are carried out as above, except for using primers C2DF
(5 ~-GARGCIACIGCIGCIGGICARCC-3 ~, SEQ ID NO: 5) and T7, and 1 q1 of 1st round reaction as DNA template. Thermal cycling parameters are 95°C, 5 min, 32 x (95°C for 5 sec, 55°C for 30 sec, 72°C for 30 sec) and a final 72°C
for 5 min. The PCR reaction mixtures are subjected to gel electrophoresis using a 1.5% agarose gel and an approximately 600 by band is excised and cleaned using a Qiaex II gel extraction kit (Qiagen). The cleaned PCR product is then ligated into the pGEM-T vector and used to transform the E. coli JM109 strain according to the manufacturers instructions (Promega).
Nucleic acid sequencing reveals the presence of two previously obtained peptide sequences in the translation product of PCR clone 15#44.
Cloning and Library Screening: The PCR clone 15#44 is used as a template for generating a 306 by digoxigenin-11-dUTP-labelled probe by PCR using primers 441F (5 ~-GAGGCGA-CGGCGGCGGGGCAG-3 ', SEQ ID NO: 6) and 4428 (5 ~-CATGTCACTGCTTGCCCCCGACCA-3 SEQ ID NO: 7) according to the manufacturer's instructions (Boehringer Mannheim). The labelled probe is cleaned using the Qiaex II gel extraction kit after gel electrophoresis with a 1.5% agarose gel and employed to screen approximately 50,000 colonies of the abovementioned plasmid library. Hybridizations are carried out over night at 65°C in 5x SSC, 0.1 % (w/v) N-lauroylsarcosine, 0.02% (w/v) SDS and 1 % blocking reagent (Boehringer Mannheim). Membranes are then washed in 0.5x SSC at 60°C, 3 x 15 min.
Seven hybridizing clones are isolated and one full-length clone, sbHMNGT1, is chosen for further characterization.
Example 5 - Identity and similarity between sbHMNGT and translation products of known or putative glucosyltransferase-encoding cDNAs Table 1 summarizes the overall identity respectively similarity between sbHMNGT and known or putatice glycosyltransferase amino acid sequences as well as the identities respectively similarities in the corresponding N-terminal regions, i.e. the region defined as the sequence N-terminal of the consensus sequence xCLxWL with the split-point being at amino acid residue 291/292 of sbHMNGT.
Table 2 summarizes the similarity respectively identity between the amino acid sequence of sbHMNGT region a, defined as residues 188-229 in HMNGT, and corresponding sequences in known or putative glycosyltransferase amino acid sequences.
The calculations of similarity and identity are based on a pairwise comparisons of cDNA
translation products using the GAP program (Genetic Computer Group, Madison, WI), wherein A/G, Y/F, SlT, V/I/L, R/K/H, and D/E/N/Q are considered to constitute similar residues. Abbreviated sequence names are stSGT ( Solanum tuberosum solanidine-_18_ glucosyltransferase: GenBankT"" accession number U82367); bnTHGT ( Brassica napes thiohydroximate-S-glucosyltransferase: SEQ ID NO: 28 of EP -771 878-A1 ), zmUFGT
(Maize flavonoid-glucosyltransferase: GenBankT"'' accession number X13502), wUFGT
(Vitis vinifera anthocyanidin-glucosyltransferase: GenBank T"" accession number AF000371 ), psGT (Pisum sativum UDP-glucuronosyltransferase: GenBank T"" accession number AF034743), meGT (Cassava UTP-glucose glucosyltransferase: GenBank T""
accession number X77464), and zmIAAGT (Maize Indole-3-acetate beta-glucosyltransferase:
GenBankT"'' accession number L34847).
Table 1:
sbHMNGT
Overall % N-terminal Identity SimilarityIdentity Similarity zmUFGT 36.7 41.5 32.6 37.1 wUFGT 30.0 38.7 23.8 33.3 psGT 41.6 51.5 32.9 46.3 meGT 31.3 41.6 25.3 36.8 zmIAAGT 34.9 41.3 27.8 35.0 snSGT 28.9 38.0 23.6 31.0 bnTHGT 30.7 38.0 24.7 33.3 Table 2: a region identities (italic) and similarities (bold face) sbHMNGT psGT zmUFGT wUFGT mhGT
sbHMNGT 45.2% 26.2% 19.1 % 20%
psGT 69.1 % ______ ______ ______ zmUFGT 35.7% ------ 47.6% 37.5%
wUFGT 35.7% ------ 59.5% ------mhGT 37.5% ------ 55.0% ------Example 6 - Heterologous expression Primers EXF1 (5' -AATAAAAGCATATGGGAAGCAACGCGCCGCCTCCG-3 ' , SEQ ID NO: 8) and EXR1 ( 5 ' -TTGGATCCTCACTGCTTGCCCCCGACCA-3 ' , SEQ ID NO: 9) are used to amplify a 1500 by full-length sbHMNGT insert by PCR, using the sbHMNGT1 plasmid as template.
The primers contain 5' recognition sites for restriction endonucleases Ndel (EXF1 ) and BamHl (EXR1 ). PCR reaction conditions are essentially as described in example 4, except for the thermal cycling parameters which are 95°C, 3 min, 30 x (95°C for 5 sec, 53°C for 30 sec, 72°C for 90 sec) and a final 72°C for 5 min. The PCR
product is gel purified, digested with Ndel and BamHl, gel purified once again and ligated into the plasmid expression vector pSP19g10L (Barnes, Methods in Enzymology 272: 3-14, 1996) which has also been digested with the restriction enzymes Ndel and BamHl and gel purified. The ligation reaction mixture is then used to transform E. coli JM109 cells according to the manufacturers instructions (Promega). After selection of successfully cloned cells, expression is initiated as described in (Ford et al, J. BioI.Chem. 273: 9224-9233, 1998 ). Briefly, 600 p,1 of a 37°C over night culture are added to 300 ml luria broth (LB) containing 100 pg/ml ampicillin. The culture is allowed to grow at 28°C under continuous shaking at 150 rpm for 5 hours and IPTG is then added to a final concentration of 0.4 mM. After induction the culture is allowed to continue growing over night and harvested by centrifugation at 2500 x g for 10 min. The pellet is resuspended in 9 ml of 200 mM Tris pH 7.9, 1 mM EDTA, 5 mM DTT and 0.1 mg/ml lysozyme. An equal volume of ice-cold water is added and the mixture allowed to incubate for 10 min at RT, followed by 20 min incubation on ice. After the addition of 18 gmoles of phenylmethylsulfonyl fluoride and 100 units of DNasel/ml (Sigma), the suspension is subjected to three freeze and thaw cycles at -20°C.
Phenylmethylsulfonyl fluoride is adjusted to 1.5 mM final concentration and the preparation centrifuged at 15,000 x g for 15 min. Negative controls, containing no insert in the plasmid vector, are prepared as above.
For purification of the recombinant protein two 300 ml cultures are lysed as above and further purified as for the native protein. Briefly, crude cell lysate is subjected to Q-sepahrose chromatography, desalting and Reactive Yellow 3 chromatography as described in example 2. The yield of recombinant protein is approximately 1 mg/100 ml LB
culture.
Example 7 - Substrate specificity of recombinant sbHMNGTcompared to desalted crude etiolated Sorghum seedling extract Glucosyltransferase activity was determined by TLC using '4C-UDP-glucose.
Filled boxes in Table 1 below (t) indicate that a radiolabelled product was visualised after incubation with the respective aglycone substrate. Empty boxes ( O) indicate that no radiolablled products could be detected under the experimental conditions employed. Figures in brackets indicate the relative Vmax for each aglycon with calculated standard deviations. The V max value for p-hydroxymandelonitrile was 1500 mol of product / mol of sbHMNGT /
sec.
Table 3:
SUBSTRATES ACTIVITY
cyanohydrins Crude Sorghum extract Recombinant sbHMNGT
1 ) mandelonitrile ~ ~ (77,g ~ g,6%) 2) p-hydroxymandelonitrile t t (100 ~ 7.2%) 3) acetone cyanohydrin 0 p benzyl derivatives 4) hydroquinone ~ p 5) benzyl alcohol t t (13.1 t 2.1 %) 6) p-hydroxybenzyl alcohol 7) benzoic acid ~ ~ (4.2 t 0.8%)
8) p-hydroxybenzoic acid t p
9) p-hydroxybenzaldehyde ~ p
10) gentisic acid 0 0
11 ) caffeic acid ~ p
12) 2-hydroxy cinnamic acid ~ p
13) resveratrol (stilbene) ~ p
14) salicylic acid ~ p
15) p-hydroxymandelic acid ~ p
16) vanillic acid
17) vanillin
18) 2-hydroxy-3-methoxybenzylalcoholt Table 3 continued:
SUBSTRATES ACTIVITY
cyanohydrins Crude Sorghum extract Recombinant sbHMNGT
flavonoids
SUBSTRATES ACTIVITY
cyanohydrins Crude Sorghum extract Recombinant sbHMNGT
flavonoids
19) quercetin (flavonol) ~ 0
20) cyanidin (anthocyanidin) ~ O
21 ) biochanin A (isoflavone) t O
22) naringenin (flavanone) ~ 0
23) apigenin (flavone) ~ 0 hexanol derivatives
24) 1-hexanol
25) trans-2-hexen-1-of ~ t
26) cis-3-hexen-1-of t
27) 3-methyl-3-hexen-1-of t t
28) 3-methyl-2-hexen-1-of t others
29) indole acetic acid (plant hormone) t p
30) geraniol (monoterpenoid) ~ t (11.0 t 0.5%)
31 ) tomatidine (alkaloid) ~ 0
32) nerol
33) p-citronellol ~ t SEQU~TCE LISTING
<110> LUMIIVIS PTY, LIMITED
Royal Veterinary & Agricultural University <120> Organic Compounds <130> S-31227/P1 <140>
<141>
<160> 9 <170> PatentIn Ver. 2.1 <210> 1 <211> 492 <212> PRT
<213> Sorghum bicolor <400> 1 Met Gly Ser Asn Ala Pro Pro Pro Pro Thr Pro His Val Val Leu Val Pro Phe Pro Gly Gln Gly His Val Ala Pro Leu Met Gln Leu Ala Arg Leu Leu His Ala Arg Gly Ala Arg Val Thr Phe Val Tyr Thr Gln Tyr Asn Tyr Arg Arg Leu Leu Arg Ala Lys Gly Glu Ala Ala Val Arg Pro Pro Ala Thr Ser Ser Ala Arg Phe Arg Ire Glu Val Ire Asp Asp Gly Leu Ser Leu Ser Val Pro Gln Asn Asp Val Gly Gly Leu Val Asp Ser Leu Arg Lys Asn Cys Leu His Pro Phe Arg Ala Leu Leu Arg Arg Leu Gly Gln Glu Val Glu Gly Gln Asp Ala Pro Pro Val Thr Cps Val Val Gly Asp Val Val Met Thr Phe Ala Ala Ala Ala Ala Arg Glu Ala Gly Ire Pro Glu Val Gln Phe Phe Thr Ala Ser Ala Cys Gly Leu Leu Gly Tyr Leu His Tyr Gly Glu Leu Val Glu Arg Gly Leu Val Pro Phe Arg Asp Ala Ser Leu Leu Ala Asp Asp Asp Tyr Leu Asp Thr Pro Leu Glu Trp Val Pro Gly Met Ser His Met Arg Leu Arg Asp Met Pro Thr Phe Cps Arg Thr Thr Asp Pro Asp Asp Val Met Val Ser Ala Thr Leu Gln Gln Met Glu Ser Ala Ala Gly Ser Lys Ala Leu Ile Leu Asn Thr Leu Tyr Glu Leu Glu Lys Asp Val Val Asp Ala Leu Ala Ala Phe Phe Pro Pro Ile '1'yr Thr Val Gly Pro Leu Ala Glu Val Ile Ala Ser Ser Asp Ser Ala Ser Ala Gly Leu Ala Ala Met Asp Ile Ser Ile Trp Gln Glu Asp '1'hr Arg Cps Leu Ser Trp Leu Asp Gly Lys Pro Ala Gly Ser Val Val Tyr Val Asn Phe Gly Ser Met Ala Val Met Thr Ala Ala Gln Ala Arg Glu Phe Ala Leu Gly Leu Ala Ser Cps Gly Ser Pro Phe Leu Trp Val Lys Arg Pro Asp Val Val Glu Gly Glu Glu Val Leu Leu Pro Glu Ala Leu Leu Asp Glu Val Ala Arg Gly Arg Gly Leu Val Val Pro Trp Cys Pro Gln Ala Ala Val Leu Lys His Ala Ala Val Gly Leu Phe Val Ser His Cps Gly Trp Asn Ser Leu Leu Glu Ala Thr Ala Ala Gly Gln Pro Val Leu Ala Trp Pro Cps His Gly Glu Gln Thr Thr Asn Cps Arg Gln Leu Cps Glu Val Trp Gly Asn Gly Ala Gln Leu Pro Arg Glu Val Glu Ser Gly Ala Val Ala Arg Leu Val Arg Glu Met Met Val Gly Asp Leu Gly Lys Glu Lys Arg Ala Lys Ala Ala Glu Trp Lys Ala Ala Ala Glu Ala Ala Ala Arg Lys Gly Gly Ala Ser Trp Arg Asn Val Glu Arg Val Val Asn Asp Leu Leu Leu Val Gly Gly Lys Gln <210> 2 <211> 1479 <212> I~
<213> SorghLUn bicolor <400> 2 atgggcagca acgcgccgcc tccgccgacg cctcacgtgg tgctggtccc gttcccgggg 60 cagggccacg tcgcgccgct gatgcagctg gcgcgcctcc tccacgcccg gggcgcgcgc 120 gtcaccttcg tctacaccca gtacaactac cgccgcctcc tgcgcgccaa gggcgaggcc 180 gccgtcaggc cccccgccac ctcctccgcg aggttccgca tcgaggtcat cgacgacggc 240 ctctccctct ccgtgccgca gaacgacgtc ggggggctcg tcgactccct gcgcaaaaac 300 tgcctccacc cgttccgcgc cctgctgcgc cgcctggggc aggaggtgga ggggcaagac 360 gcgccgcccg tcacctgcgt cgtcggcgac gtcgtcatga ccttcgccgc cgcagctgcc 420 agggaggccg gcatccccga ggtgcagttc ttcacggcct cagcatgcgg actcttgggc 480 tacttgcact acggcgagct cgtcgaacga ggcctcgtcc ctttcagaga cgccagcctc 540 ctcgccgacg acgattacct ggacacgccg ctggagtggg tgcccgggat gagccacatg 600 cggctcaggg acatgccgac gttctgccgc accacggacc ccgacgacgt catggtgtcc 660 gccacgctcc agcagatgga gagcgccgcc ggctccaagg ccctcatcct caacaccctg 720 tacgagctcg agaaggacgt ggtggacgcg ctcgccgcct tcttcccgcc gatctacacc 780 gtggggccgc tcgccgaggt catcgcgtcc tccgactccg cctccgccgg cctcgccgcc 840 atggacatca gcatctggca ggaggacacg cggtgcctgt cgtggctcga cgggaagccg 900 gccggctccg tggtgtacgt caacttcggc agcatggccg tcatgacggc cgcgcaggcg 960 cgggagttcg cgctgggcct ggcaagctgc ggctccccgt tcctgtgggt gaagcgcccc 1020 gacgtggtgg aaggcgagga ggtgctgctg ccggaggccc tgctggacga ggtggctcgc 1080 ggcaggggcc tcgtggtgcc atggtgcccg caggcagcag tgctcaagca cgccgccgtg 1140 ggactgttcg tctcgcactg cggatggaac tccctgctgg aggcgacggc ggcggggcag 1200 ccggtgctcg cctggccctg ccacggggaa cagaccacca actgcaggca gctgtgcgag 1260 gtctggggca acggcgcgca gctgcccaga gaagtggaga gcggcgcggt ggcccgtctg 1320 gtgagggaga tgatggtcgg ggacctgggc aaggagaagc gggcgaaggc ggcggagtgg 1380 aaggcggcgg cggaggccgc ggccaggaaa ggcggcgcgt cgtggcgtaa tgttgaacgc 1440 gtggtgaacg acctgctgct ggtcgggggc aagcagtga 1479 <210> 3 <211> 23 <212> L~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer C2EF' <220>
<221> modified_base <222> (6) <223> i <220>
<221> modified_base <222> (9) <223> i <220>
<221> modified base <222> (18) <223> i <400> 3 ttygtnwsnc aytgyggntg gaa 23 <210> 4 <211> 17 <212> I~1A
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: T7 primer <400> 4 aatacgactc actatag 17 <210> 5 <211> 23 <212> L~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer C2DF
<220>
<221> modified base <222> ( 6 ) <223> i <220>
<221> modified_base <222> ( 9 ) <223> i <220>
<221> modified base <222> (12) <223> i <220>
<221> modified base <222> (15) <223> i <220>
<221> modified base <222> (18) <223> i <400> 5 gargcnacng cngcnggnca rcc 23 <210> 6 <211> 21 <212> L7N~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer 441F
<400> 6 gaggcgacgg cggcggggca g 21 <210> 7 <211> 24 <212> I~1 <213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer 4428 <400> 7 catgtcactg cttgcccccg acca 24 <210> 8 <211> 35 <212> I~IA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer EXF'1 <400> 8 aataaaagca tatgggaagc aacgcgccgc ctccg 35 <210> 9 <211> 28 <212 > I7NF~
<213> Artificial Sequence <400> 9 ttggatcctc actgcttgcc cccgacca 28
<110> LUMIIVIS PTY, LIMITED
Royal Veterinary & Agricultural University <120> Organic Compounds <130> S-31227/P1 <140>
<141>
<160> 9 <170> PatentIn Ver. 2.1 <210> 1 <211> 492 <212> PRT
<213> Sorghum bicolor <400> 1 Met Gly Ser Asn Ala Pro Pro Pro Pro Thr Pro His Val Val Leu Val Pro Phe Pro Gly Gln Gly His Val Ala Pro Leu Met Gln Leu Ala Arg Leu Leu His Ala Arg Gly Ala Arg Val Thr Phe Val Tyr Thr Gln Tyr Asn Tyr Arg Arg Leu Leu Arg Ala Lys Gly Glu Ala Ala Val Arg Pro Pro Ala Thr Ser Ser Ala Arg Phe Arg Ire Glu Val Ire Asp Asp Gly Leu Ser Leu Ser Val Pro Gln Asn Asp Val Gly Gly Leu Val Asp Ser Leu Arg Lys Asn Cys Leu His Pro Phe Arg Ala Leu Leu Arg Arg Leu Gly Gln Glu Val Glu Gly Gln Asp Ala Pro Pro Val Thr Cps Val Val Gly Asp Val Val Met Thr Phe Ala Ala Ala Ala Ala Arg Glu Ala Gly Ire Pro Glu Val Gln Phe Phe Thr Ala Ser Ala Cys Gly Leu Leu Gly Tyr Leu His Tyr Gly Glu Leu Val Glu Arg Gly Leu Val Pro Phe Arg Asp Ala Ser Leu Leu Ala Asp Asp Asp Tyr Leu Asp Thr Pro Leu Glu Trp Val Pro Gly Met Ser His Met Arg Leu Arg Asp Met Pro Thr Phe Cps Arg Thr Thr Asp Pro Asp Asp Val Met Val Ser Ala Thr Leu Gln Gln Met Glu Ser Ala Ala Gly Ser Lys Ala Leu Ile Leu Asn Thr Leu Tyr Glu Leu Glu Lys Asp Val Val Asp Ala Leu Ala Ala Phe Phe Pro Pro Ile '1'yr Thr Val Gly Pro Leu Ala Glu Val Ile Ala Ser Ser Asp Ser Ala Ser Ala Gly Leu Ala Ala Met Asp Ile Ser Ile Trp Gln Glu Asp '1'hr Arg Cps Leu Ser Trp Leu Asp Gly Lys Pro Ala Gly Ser Val Val Tyr Val Asn Phe Gly Ser Met Ala Val Met Thr Ala Ala Gln Ala Arg Glu Phe Ala Leu Gly Leu Ala Ser Cps Gly Ser Pro Phe Leu Trp Val Lys Arg Pro Asp Val Val Glu Gly Glu Glu Val Leu Leu Pro Glu Ala Leu Leu Asp Glu Val Ala Arg Gly Arg Gly Leu Val Val Pro Trp Cys Pro Gln Ala Ala Val Leu Lys His Ala Ala Val Gly Leu Phe Val Ser His Cps Gly Trp Asn Ser Leu Leu Glu Ala Thr Ala Ala Gly Gln Pro Val Leu Ala Trp Pro Cps His Gly Glu Gln Thr Thr Asn Cps Arg Gln Leu Cps Glu Val Trp Gly Asn Gly Ala Gln Leu Pro Arg Glu Val Glu Ser Gly Ala Val Ala Arg Leu Val Arg Glu Met Met Val Gly Asp Leu Gly Lys Glu Lys Arg Ala Lys Ala Ala Glu Trp Lys Ala Ala Ala Glu Ala Ala Ala Arg Lys Gly Gly Ala Ser Trp Arg Asn Val Glu Arg Val Val Asn Asp Leu Leu Leu Val Gly Gly Lys Gln <210> 2 <211> 1479 <212> I~
<213> SorghLUn bicolor <400> 2 atgggcagca acgcgccgcc tccgccgacg cctcacgtgg tgctggtccc gttcccgggg 60 cagggccacg tcgcgccgct gatgcagctg gcgcgcctcc tccacgcccg gggcgcgcgc 120 gtcaccttcg tctacaccca gtacaactac cgccgcctcc tgcgcgccaa gggcgaggcc 180 gccgtcaggc cccccgccac ctcctccgcg aggttccgca tcgaggtcat cgacgacggc 240 ctctccctct ccgtgccgca gaacgacgtc ggggggctcg tcgactccct gcgcaaaaac 300 tgcctccacc cgttccgcgc cctgctgcgc cgcctggggc aggaggtgga ggggcaagac 360 gcgccgcccg tcacctgcgt cgtcggcgac gtcgtcatga ccttcgccgc cgcagctgcc 420 agggaggccg gcatccccga ggtgcagttc ttcacggcct cagcatgcgg actcttgggc 480 tacttgcact acggcgagct cgtcgaacga ggcctcgtcc ctttcagaga cgccagcctc 540 ctcgccgacg acgattacct ggacacgccg ctggagtggg tgcccgggat gagccacatg 600 cggctcaggg acatgccgac gttctgccgc accacggacc ccgacgacgt catggtgtcc 660 gccacgctcc agcagatgga gagcgccgcc ggctccaagg ccctcatcct caacaccctg 720 tacgagctcg agaaggacgt ggtggacgcg ctcgccgcct tcttcccgcc gatctacacc 780 gtggggccgc tcgccgaggt catcgcgtcc tccgactccg cctccgccgg cctcgccgcc 840 atggacatca gcatctggca ggaggacacg cggtgcctgt cgtggctcga cgggaagccg 900 gccggctccg tggtgtacgt caacttcggc agcatggccg tcatgacggc cgcgcaggcg 960 cgggagttcg cgctgggcct ggcaagctgc ggctccccgt tcctgtgggt gaagcgcccc 1020 gacgtggtgg aaggcgagga ggtgctgctg ccggaggccc tgctggacga ggtggctcgc 1080 ggcaggggcc tcgtggtgcc atggtgcccg caggcagcag tgctcaagca cgccgccgtg 1140 ggactgttcg tctcgcactg cggatggaac tccctgctgg aggcgacggc ggcggggcag 1200 ccggtgctcg cctggccctg ccacggggaa cagaccacca actgcaggca gctgtgcgag 1260 gtctggggca acggcgcgca gctgcccaga gaagtggaga gcggcgcggt ggcccgtctg 1320 gtgagggaga tgatggtcgg ggacctgggc aaggagaagc gggcgaaggc ggcggagtgg 1380 aaggcggcgg cggaggccgc ggccaggaaa ggcggcgcgt cgtggcgtaa tgttgaacgc 1440 gtggtgaacg acctgctgct ggtcgggggc aagcagtga 1479 <210> 3 <211> 23 <212> L~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer C2EF' <220>
<221> modified_base <222> (6) <223> i <220>
<221> modified_base <222> (9) <223> i <220>
<221> modified base <222> (18) <223> i <400> 3 ttygtnwsnc aytgyggntg gaa 23 <210> 4 <211> 17 <212> I~1A
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: T7 primer <400> 4 aatacgactc actatag 17 <210> 5 <211> 23 <212> L~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer C2DF
<220>
<221> modified base <222> ( 6 ) <223> i <220>
<221> modified_base <222> ( 9 ) <223> i <220>
<221> modified base <222> (12) <223> i <220>
<221> modified base <222> (15) <223> i <220>
<221> modified base <222> (18) <223> i <400> 5 gargcnacng cngcnggnca rcc 23 <210> 6 <211> 21 <212> L7N~
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer 441F
<400> 6 gaggcgacgg cggcggggca g 21 <210> 7 <211> 24 <212> I~1 <213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer 4428 <400> 7 catgtcactg cttgcccccg acca 24 <210> 8 <211> 35 <212> I~IA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: primer EXF'1 <400> 8 aataaaagca tatgggaagc aacgcgccgc ctccg 35 <210> 9 <211> 28 <212 > I7NF~
<213> Artificial Sequence <400> 9 ttggatcctc actgcttgcc cccgacca 28
Claims (15)
1. A DNA molecule coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose.
2. The DNA molecule of claim 1 coding for a UDP-glucose:aglycon-glucosyltransferase conjugating mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrine or 2-hydroxy-2-methylbutyronitrile; geraniol, nerol or .beta.-citronellol; p-hydroxybenzoic acid, benzoic acid, benzylalcohol, p-hydroxy-benzylalcohol, 2-hydroxy-3-methoxybenzylalcohol, vanillic acid or vanillin; 1-hexanol, trans-2-hexen-1-ol, cis-3-hexen-1-ol, 3-methyl-3-hexen-1-ol or 3-methyl-2-hexen-1-ol to glucose.
3. The DNA molecule of claim 1 coding for a UDP-glucose:aglycon-glucosyltransferase having the formula R1-R2-R3, wherein - R1, R2 and R3 are component sequences consisting of amino acid residues independently selected from the group of the amino acid residues Gly, Ala, Val, Leu, Ile, Phe, Pro, Ser, Thr, Cys, Met, Trp, Tyr, Asn, Gln, Asp, Glu, Lys, Arg and His, and - R2 consists of 150 or more amino acid residues the sequence of which is at least 50% identical to an aligned component sequence of SEQ ID NO: 1.
4. The DNA molecule of claim 1, wherein the amino acid sequence of R2 is represented by amino acids 21 to 445, 168 to 448, or 281 to 448 of SEQ ID NO: 1.
5. The DNA molecule of claim 1, wherein R1 or R3 comprise one or more additional component sequences having a length of at least 30 amino acids and being at least 65% identical to an aligned component sequence of SEQ ID NO: 1.
6. The DNA molecule of claim 1 coding for a UDP-glucose:aglycon-glucosyltransferase of 300 to 600 amino acid residues length.
7. The DNA molecule of claim 1 coding for a UDP-glucose:aglycon-glucosyltransferase having the amino acid sequence of SEQ ID NO: 1.
8. The DNA molecule of claim 1 having the nucleotide seq uence of SEQ ID NO:
2.
2.
9. A UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin , a terpenoid, a phenylderivative or a hexanolderivative to glucose as coded for by the DNA
molecule of any one of claims 1 to 8.
molecule of any one of claims 1 to 8.
A method for the isolation of a cDNA molecule coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose; comprising (a) preparing a cDNA library from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, (b) using at least one oligonucleotide designed on the basis of SEQ ID NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA from the cDNA
library, (c) optionally using a further oligonucleotide designed on the basis of SEO ID
NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA from the cDNA
library in a nested PCR reaction, (d) using the DNA obtained in steps (b) or (c) as a probe to screen a cDNA
library prepared from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, and (e) identifying and purifying vector DNA comprising an open reading frame encoding a protein characterized by an amino acid component sequence of at least 150 amino acid residues length having 50% or more sequence identity to an aligned component sequence of SEO ID NO: 2 or a sequence encoding part of SEQ ID
NO: 1 (f) optionally further processing the purified DNA.
library, (c) optionally using a further oligonucleotide designed on the basis of SEO ID
NO: 1 to amplify part of the UDP-glucose:aglycon-glucosyltransferase cDNA from the cDNA
library in a nested PCR reaction, (d) using the DNA obtained in steps (b) or (c) as a probe to screen a cDNA
library prepared from plant tissue expressing UDP-glucose:aglycon-glucosyltransferase, and (e) identifying and purifying vector DNA comprising an open reading frame encoding a protein characterized by an amino acid component sequence of at least 150 amino acid residues length having 50% or more sequence identity to an aligned component sequence of SEO ID NO: 2 or a sequence encoding part of SEQ ID
NO: 1 (f) optionally further processing the purified DNA.
11. A method for producing purified recombinant UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose; comprising (a) Q-Sepharose chromatography eluting with a linear salt gradient, and (b) dye chromatography eluting with UDP-glucose.
12. A transgenic plant comprising stably integrated into its genome DNA coding for a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose or DNA encoding sense RNA, anti sense RNA, double stranded RNA or a ribozyme, the expression of which reduces expression of a UDP-glucose:aglycon-glucosyltransferase conjugating p-hydroxymandelonitrile to glucose.
13. The transgenic plant of claim 12 additionally comprising stably integrated into its genome DNA coding for a cytochrome P-450 mono oxygenase catalyzing the conversion of an amino acid to the corresponding N-hydroxy amino acid and the oxime derived from this N-hydroxyamino acid or a cytochrome P450 mono oxygenase catalyzing the conversion of an aldoxime to a nitrile and the con version of said nitrile to the corresponding cyano hydrin.
14. A method for obtaining a transgenic plant according to claim 12 comprising (a) introducing into a plant cell or tissue which can be regenerated to a complete plant, DNA comprising a gene expressible in that plant encoding a a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose, and (b) selecting transgenic plants.
15. A method for obtaining a transgenic plant according to claim 12 comprising (a) introducing into a plant cell or tissue which can be regenerated to a complete plant, DNA encoding sense RNA, anti sense RNA or a ribozyme, the expression of which reduces the expression of a UDP-glucose:aglycon-glucosyltransferase conjugating a cyanohydrin, a terpenoid, a phenylderivative or a hexanolderivative to glucose, and (b) selecting transgenic plants.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99123838 | 1999-12-01 | ||
EP99123838.7 | 1999-12-01 | ||
PCT/EP2000/011982 WO2001040491A2 (en) | 1999-12-01 | 2000-11-29 | Udp-glucose:aglycon-glucosyltransferase |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2391404A1 true CA2391404A1 (en) | 2001-06-07 |
Family
ID=8239507
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002391404A Abandoned CA2391404A1 (en) | 1999-12-01 | 2000-11-29 | Udp-glucose:aglycon-glucosyltransferase |
Country Status (11)
Country | Link |
---|---|
US (1) | US7122657B2 (en) |
EP (1) | EP1234046B1 (en) |
JP (1) | JP2003515345A (en) |
CN (1) | CN1415017A (en) |
AR (1) | AR035387A1 (en) |
AT (1) | ATE414160T1 (en) |
AU (1) | AU772161B2 (en) |
CA (1) | CA2391404A1 (en) |
DE (1) | DE60040802D1 (en) |
DK (1) | DK1234046T3 (en) |
WO (1) | WO2001040491A2 (en) |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE414160T1 (en) | 1999-12-01 | 2008-11-15 | Adelaide Res & Innovation Pty | UDP-GLUCOSE:AGLYCON GLUCOSYLTRANSFERASE |
DK1649029T3 (en) | 2003-06-19 | 2015-03-09 | Evolva Sa | PROCEDURE FOR PREPARING A LOW MOLECULAR WEIGHT SECONDARY PLANT METABOLITE IN A YARCELL |
EP1510586A1 (en) * | 2003-08-26 | 2005-03-02 | Poalis A/S | A method of producing a low molecular weight organic compound in a cell |
WO2006087370A1 (en) * | 2005-02-18 | 2006-08-24 | Poalis A/S | Use of aroma glycosides as flavor or fragrance ingredient |
WO2007094521A1 (en) * | 2006-02-17 | 2007-08-23 | International Flower Developments Proprietary Limited | Flavonoid glycosyltransferase and utilization thereof |
KR100792251B1 (en) | 2006-06-19 | 2008-01-07 | 고려대학교 산학협력단 | GF-Glucosyltransferase Gene Specificly Expressed to Red Pepper Fruit |
WO2008062165A2 (en) * | 2006-11-22 | 2008-05-29 | The University Of York | Monoterpenoid modifying enzymes |
DE102008055606A1 (en) * | 2008-11-03 | 2010-05-06 | Bayer Technology Services Gmbh | A method for the high-throughput preparation of sequencing-competent DNA from individual plaques of peptides presenting phages |
WO2012126059A1 (en) * | 2011-03-22 | 2012-09-27 | Monash University | Novel stock feed crop plant |
EP2742126B1 (en) | 2011-08-08 | 2018-11-07 | International Flavors & Fragrances Inc. | Compositions and methods for the biosynthesis of vanillin or vanillin beta-d-glucoside |
AU2013218574A1 (en) * | 2012-02-06 | 2014-09-11 | Suntory Holdings Limited | Monoterpene glycosyltransferase originating from hop and method for using same |
AU2013339881B2 (en) | 2012-11-05 | 2017-08-24 | Danstar Ferment Ag | Vanillin synthase |
AU2014297224A1 (en) | 2013-08-02 | 2016-03-03 | Suntory Holdings Limited | Method for using hexenol glycosyl transferase |
TR201802540T4 (en) | 2013-12-18 | 2018-03-21 | Univ Copenhagen | Glycosyltransferase which glycosylates flavocermesic acid and / or cermesic acid. |
EP2960330A1 (en) * | 2014-06-27 | 2015-12-30 | Technische Universität München | Glycosyl transferases and their uses |
EP3353312A1 (en) | 2015-09-24 | 2018-08-01 | Evolva SA | Production of anthocyanin from simple sugars |
CN107058257B (en) * | 2017-05-05 | 2020-12-04 | 安徽农业大学 | A kind of biological enzyme for synthesizing alcohol-based aroma glycosides and preparation method thereof |
CN108342410A (en) * | 2018-02-05 | 2018-07-31 | 安徽农业大学 | A method of improving tobacco flowers and fruits perfume (or spice) fragrance glycosidic precursors |
CN110714036A (en) * | 2019-09-30 | 2020-01-21 | 西南大学 | Application of Phenyllactate Uridine Diphosphate Glucosyltransferase |
CN110982830B (en) * | 2019-12-06 | 2022-07-12 | 中国药科大学 | Glycosyl transferase gene RyUGT3A, and coding protein and application thereof |
CN112029741B (en) * | 2020-08-26 | 2022-04-22 | 中国中医科学院中药研究所 | Auxin N glycosyltransferase protein and its encoding gene and application in Ginkgo biloba |
CN112844854B (en) * | 2020-12-21 | 2024-06-04 | 矿冶科技集团有限公司 | Flotation collector for copper oxide ore and preparation method thereof |
CN113249352B (en) * | 2021-05-14 | 2022-03-22 | 山东大学 | N-glycosyltransferase mutant P1 and application thereof |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2176949A1 (en) | 1993-12-08 | 1995-06-15 | Birgit Maria Koch | Cytochrome p-450 monooxygenases |
WO1998040470A2 (en) * | 1997-03-07 | 1998-09-17 | Novartis Ag | Cytochrome p450 monooxygenases |
ATE414160T1 (en) | 1999-12-01 | 2008-11-15 | Adelaide Res & Innovation Pty | UDP-GLUCOSE:AGLYCON GLUCOSYLTRANSFERASE |
-
2000
- 2000-11-29 AT AT00987323T patent/ATE414160T1/en not_active IP Right Cessation
- 2000-11-29 AR ARP000106284A patent/AR035387A1/en unknown
- 2000-11-29 CA CA002391404A patent/CA2391404A1/en not_active Abandoned
- 2000-11-29 DK DK00987323T patent/DK1234046T3/en active
- 2000-11-29 DE DE60040802T patent/DE60040802D1/en not_active Expired - Lifetime
- 2000-11-29 EP EP00987323A patent/EP1234046B1/en not_active Expired - Lifetime
- 2000-11-29 JP JP2001542555A patent/JP2003515345A/en active Pending
- 2000-11-29 US US10/148,606 patent/US7122657B2/en not_active Expired - Lifetime
- 2000-11-29 AU AU23611/01A patent/AU772161B2/en not_active Ceased
- 2000-11-29 CN CN00817844.5A patent/CN1415017A/en active Pending
- 2000-11-29 WO PCT/EP2000/011982 patent/WO2001040491A2/en active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
US7122657B2 (en) | 2006-10-17 |
CN1415017A (en) | 2003-04-30 |
ATE414160T1 (en) | 2008-11-15 |
AU772161B2 (en) | 2004-04-08 |
AR035387A1 (en) | 2004-05-26 |
DK1234046T3 (en) | 2009-03-16 |
EP1234046A2 (en) | 2002-08-28 |
AU2361101A (en) | 2001-06-12 |
WO2001040491A2 (en) | 2001-06-07 |
JP2003515345A (en) | 2003-05-07 |
WO2001040491A3 (en) | 2001-11-29 |
DE60040802D1 (en) | 2008-12-24 |
EP1234046B1 (en) | 2008-11-12 |
US20050277766A1 (en) | 2005-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1234046B1 (en) | Udp-glucose:aglycon-glucosyltransferase | |
Jones et al. | The UDP-glucose: p-hydroxymandelonitrile-O-glucosyltransferase that catalyzes the last step in synthesis of the cyanogenic glucoside dhurrin in Sorghum bicolor: isolation, cloning, heterologous expression, and substrate specificity | |
Bak et al. | The presence of CYP79 homologues in glucosinolate-producing plants shows evolutionary conservation of the enzymes in the conversion of amino acid to aldoxime in the biosynthesis of cyanogenic glucosides and glucosinolates | |
EP1544300B1 (en) | Novel glycosyltransferase genes | |
Boss et al. | Molecular biology of sugar and anthocyanin accumulation in grape berries | |
US20100011462A1 (en) | Flavin monooxygenases and transcription factors involved in glucosinolate biosynthesis | |
EP2235051B1 (en) | Glycosyltransferases, polynucleotides encoding these and methods of use | |
US6734343B1 (en) | Rhamnosyl-transferase gene and uses thereof | |
Alexander et al. | A novel embryo-specific barley cDNA clone encodes a protein with homologies to bacterial glucose and ribitol dehydrogenase | |
Marillia et al. | Molecular cloning of a Brassica napus thiohydroximate S‐glucosyltransferase gene and its expression in Escherichia coli | |
US8273950B2 (en) | Polynucleotides encoding phenylpropanoid and flavonoid biosynthetic pathway enzymes in coffee | |
AU778179B2 (en) | Gibberellin 3 beta-hydroxylase genes of rice and uses thereof | |
WO2001005984A2 (en) | Flavonoid biosynthesis | |
Hu et al. | Cloning and characterization of the cDNA and promoter of UDP-glucose: flavonoid 3-O-glucosyltransferase gene from a purple-fleshed sweet potato | |
US20030166202A1 (en) | P450 Monooxygenases of the cyp79 family | |
US7439419B1 (en) | Solanum tuberosum β-solanine/β-chaconine rhamnosyl transferase sequences and uses thereof | |
US6501004B1 (en) | Transgenic reduction of sinapine in crucifera | |
EP2143793B1 (en) | Genes of enzymes participating in vitamin e biosynthesis in para rubber tree | |
US7105324B2 (en) | Nucleic acid isolated from Tetrahymena which codes for a triterpenoid cyclase, its production, and use | |
US20040010822A1 (en) | Hydroperoxyde lyases | |
Chen et al. | Cloning and sequence analysis of a mutation-type cinnamate 4-hydroxylase gene from Brassica oleracea L. var. acephala DC. | |
Dixon et al. | PLANT SECONDARY METABOLISM |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |