CN105985938B - Glycosyltransferase mutant protein and its application - Google Patents
Glycosyltransferase mutant protein and its application Download PDFInfo
- Publication number
- CN105985938B CN105985938B CN201510051992.5A CN201510051992A CN105985938B CN 105985938 B CN105985938 B CN 105985938B CN 201510051992 A CN201510051992 A CN 201510051992A CN 105985938 B CN105985938 B CN 105985938B
- Authority
- CN
- China
- Prior art keywords
- seq
- glycosyltransferase
- ugtpg1
- mutant protein
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 102000008300 Mutant Proteins Human genes 0.000 title claims abstract description 107
- 108010021466 Mutant Proteins Proteins 0.000 title claims abstract description 107
- 108700023372 Glycosyltransferases Proteins 0.000 title claims abstract description 81
- 102000051366 Glycosyltransferases Human genes 0.000 title claims abstract description 64
- 230000000694 effects Effects 0.000 claims abstract description 48
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 34
- 230000003197 catalytic effect Effects 0.000 claims abstract description 33
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 29
- BBEUDPAEKGPXDG-UHFFFAOYSA-N protopanaxatriol Natural products CC(CCC=C(C)C)C1CCC2(C)C1C(O)CC3C4(C)CCC(O)C(C)(C)C4C(O)CC23C BBEUDPAEKGPXDG-UHFFFAOYSA-N 0.000 claims description 58
- SHCBCKBYTHZQGZ-CJPZEJHVSA-N protopanaxatriol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2[C@@H](O)C[C@@]3(C)[C@]4(C)CC[C@H]([C@@](C)(O)CCC=C(C)C)[C@H]4[C@H](O)C[C@@H]3[C@]21C SHCBCKBYTHZQGZ-CJPZEJHVSA-N 0.000 claims description 55
- -1 triterpene compound Chemical class 0.000 claims description 42
- 238000000034 method Methods 0.000 claims description 38
- 150000001875 compounds Chemical class 0.000 claims description 32
- 238000006206 glycosylation reaction Methods 0.000 claims description 31
- 239000000348 glycosyl donor Substances 0.000 claims description 27
- 125000003147 glycosyl group Chemical group 0.000 claims description 25
- 108091033319 polynucleotide Proteins 0.000 claims description 25
- 102000040430 polynucleotide Human genes 0.000 claims description 25
- 239000002157 polynucleotide Substances 0.000 claims description 25
- BGHNZAWRRWLKPO-UHFFFAOYSA-N Ginsenoside F1 Natural products CC(=C)CCCC(C)(OC1OC(CO)C(O)C(O)C1O)C2CCC3(C)C2C(O)CC4C5(C)CCC(O)C(C)(C)C5C(O)CC34C BGHNZAWRRWLKPO-UHFFFAOYSA-N 0.000 claims description 24
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 21
- 238000006243 chemical reaction Methods 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 20
- RAQNTCRNSXYLAH-UHFFFAOYSA-N Ginsenoside Rh1 Natural products CC(C)=CCCC(C)(O)C1CCC(C2(C3)C)(C)C1C(O)CC2C1(C)CCC(O)C(C)(C)C1C3OC1OC(CO)C(O)C(O)C1O RAQNTCRNSXYLAH-UHFFFAOYSA-N 0.000 claims description 14
- XNGXWSFSJIQMNC-FIYORUNESA-N ginsenoside F1 Chemical compound O([C@@](C)(CCC=C(C)C)[C@@H]1[C@@H]2[C@@]([C@@]3(C[C@H](O)[C@H]4C(C)(C)[C@@H](O)CC[C@]4(C)[C@H]3C[C@H]2O)C)(C)CC1)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O XNGXWSFSJIQMNC-FIYORUNESA-N 0.000 claims description 13
- RAQNTCRNSXYLAH-RFCGZQMISA-N (20S)-ginsenoside Rh1 Chemical compound O([C@@H]1[C@H]2C(C)(C)[C@@H](O)CC[C@]2(C)[C@H]2C[C@@H](O)[C@H]3[C@@]([C@@]2(C1)C)(C)CC[C@@H]3[C@@](C)(O)CCC=C(C)C)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RAQNTCRNSXYLAH-RFCGZQMISA-N 0.000 claims description 8
- YURJSTAIMNSZAE-HHNZYBFYSA-N ginsenoside Rg1 Chemical compound O([C@@](C)(CCC=C(C)C)[C@@H]1[C@@H]2[C@@]([C@@]3(C[C@@H]([C@H]4C(C)(C)[C@@H](O)CC[C@]4(C)[C@H]3C[C@H]2O)O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)C)(C)CC1)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YURJSTAIMNSZAE-HHNZYBFYSA-N 0.000 claims description 7
- 238000002360 preparation method Methods 0.000 claims description 7
- YURJSTAIMNSZAE-UHFFFAOYSA-N UNPD89172 Natural products C1CC(C2(CC(C3C(C)(C)C(O)CCC3(C)C2CC2O)OC3C(C(O)C(O)C(CO)O3)O)C)(C)C2C1C(C)(CCC=C(C)C)OC1OC(CO)C(O)C(O)C1O YURJSTAIMNSZAE-UHFFFAOYSA-N 0.000 claims description 6
- CBEHEBUBNAGGKC-UHFFFAOYSA-N ginsenoside Rg1 Natural products CC(=CCCC(C)(OC1OC(CO)C(O)C(O)C1O)C2CCC3(C)C2C(O)CC4C5(C)CCC(O)C(C)(C)C5CC(OC6OC(CO)C(O)C(O)C6O)C34C)C CBEHEBUBNAGGKC-UHFFFAOYSA-N 0.000 claims description 6
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 238000012546 transfer Methods 0.000 claims description 4
- 238000006555 catalytic reaction Methods 0.000 claims description 3
- 230000001172 regenerating effect Effects 0.000 claims description 2
- 230000009261 transgenic effect Effects 0.000 claims description 2
- 125000003535 D-glucopyranosyl group Chemical group [H]OC([H])([H])[C@@]1([H])OC([H])(*)[C@]([H])(O[H])[C@@]([H])(O[H])[C@]1([H])O[H] 0.000 claims 2
- 150000003648 triterpenes Chemical class 0.000 claims 1
- 150000001413 amino acids Chemical class 0.000 abstract description 97
- 102000004190 Enzymes Human genes 0.000 abstract description 35
- 108090000790 Enzymes Proteins 0.000 abstract description 35
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 33
- 230000014509 gene expression Effects 0.000 abstract description 22
- 238000002741 site-directed mutagenesis Methods 0.000 abstract description 19
- 230000004048 modification Effects 0.000 abstract description 11
- 238000012986 modification Methods 0.000 abstract description 11
- 230000002255 enzymatic effect Effects 0.000 abstract description 2
- 238000002474 experimental method Methods 0.000 abstract description 2
- 101000644025 Panax ginseng UDP-glycosyltransferase 1 Proteins 0.000 description 60
- 210000004027 cell Anatomy 0.000 description 42
- 101000644012 Panax ginseng UDP-glycosyltransferase 100 Proteins 0.000 description 34
- 230000013595 glycosylation Effects 0.000 description 28
- 239000002773 nucleotide Substances 0.000 description 22
- 125000003729 nucleotide group Chemical group 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 21
- 229930182494 ginsenoside Natural products 0.000 description 20
- 241000196324 Embryophyta Species 0.000 description 18
- 101000644019 Panax ginseng UDP-glucosyltransferase 103 Proteins 0.000 description 18
- 101000644021 Panax ginseng UDP-glucosyltransferase 102 Proteins 0.000 description 16
- 240000005373 Panax quinquefolius Species 0.000 description 13
- 235000003140 Panax quinquefolius Nutrition 0.000 description 13
- 101000644022 Panax ginseng UDP-glycosyltransferase 101 Proteins 0.000 description 12
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 12
- 239000012634 fragment Substances 0.000 description 12
- 230000006870 function Effects 0.000 description 12
- 235000008434 ginseng Nutrition 0.000 description 12
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 12
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 229930182490 saponin Natural products 0.000 description 11
- 235000017709 saponins Nutrition 0.000 description 11
- 238000006467 substitution reaction Methods 0.000 description 11
- 125000000539 amino acid group Chemical group 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 230000035772 mutation Effects 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 238000012217 deletion Methods 0.000 description 9
- 230000037430 deletion Effects 0.000 description 9
- 229940089161 ginsenoside Drugs 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 150000007949 saponins Chemical class 0.000 description 8
- 235000000346 sugar Nutrition 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 230000001965 increasing effect Effects 0.000 description 7
- 150000003384 small molecules Chemical class 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 108020004414 DNA Proteins 0.000 description 6
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 6
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000007796 conventional method Methods 0.000 description 6
- 210000003527 eukaryotic cell Anatomy 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- INLFWQCRAJUDCR-IQVMEADQSA-N (1R,2S,4S,5'S,6R,7S,8R,9S,12S,13S)-5',7,9,13-tetramethylspiro[5-oxapentacyclo[10.8.0.02,9.04,8.013,18]icosane-6,2'-oxane] Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CCCCC4CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@]11CC[C@H](C)CO1 INLFWQCRAJUDCR-IQVMEADQSA-N 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 241000180649 Panax notoginseng Species 0.000 description 5
- 235000003143 Panax notoginseng Nutrition 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 108091081024 Start codon Proteins 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 4
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 239000001177 diphosphate Substances 0.000 description 4
- 235000011180 diphosphates Nutrition 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- 239000002777 nucleoside Substances 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 108020005091 Replication Origin Proteins 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- PLQMEXSCSAIXGB-SAXRGWBVSA-N (+)-artemisinic acid Chemical compound C1=C(C)CC[C@H]2[C@H](C)CC[C@@H](C(=C)C(O)=O)[C@H]21 PLQMEXSCSAIXGB-SAXRGWBVSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- WFPZSXYXPSUOPY-UHFFFAOYSA-N ADP-mannose Natural products C1=NC=2C(N)=NC=NC=2N1C(C(C1O)O)OC1COP(O)(=O)OP(O)(=O)OC1OC(CO)C(O)C(O)C1O WFPZSXYXPSUOPY-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- HDYANYHVCAPMJV-GXNRKQDOSA-N UDP-alpha-D-galacturonic acid Chemical compound C([C@@H]1[C@H]([C@H]([C@@H](O1)N1C(NC(=O)C=C1)=O)O)O)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](C(O)=O)[C@H](O)[C@H](O)[C@H]1O HDYANYHVCAPMJV-GXNRKQDOSA-N 0.000 description 2
- DRDCJEIZVLVWNC-SLBWPEPYSA-N UDP-beta-L-rhamnose Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 DRDCJEIZVLVWNC-SLBWPEPYSA-N 0.000 description 2
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 2
- USAZACJQJDHAJH-KDEXOMDGSA-N [[(2r,3s,4r,5s)-5-(2,4-dioxo-1h-pyrimidin-6-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2r,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](C=2NC(=O)NC(=O)C=2)O1 USAZACJQJDHAJH-KDEXOMDGSA-N 0.000 description 2
- JCPSMIOSLWKUPV-XQQPQPTDSA-N [[(3r,4r,5s)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-[(2s,3s,4r,5r)-5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl] phosphono hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@H](CO)OC1C(OP(O)(=O)OP(O)(O)=O)[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 JCPSMIOSLWKUPV-XQQPQPTDSA-N 0.000 description 2
- 230000002929 anti-fatigue Effects 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- JYGAZEJXUVDYHI-DGTMBMJNSA-N dihydroartemisinic acid Chemical compound C1CC(C)=C[C@@H]2[C@H]([C@@H](C)C(O)=O)CC[C@@H](C)[C@@H]21 JYGAZEJXUVDYHI-DGTMBMJNSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 239000000937 glycosyl acceptor Substances 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000037353 metabolic pathway Effects 0.000 description 2
- 238000009335 monocropping Methods 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 239000003471 mutagenic agent Substances 0.000 description 2
- 231100000707 mutagenic chemical Toxicity 0.000 description 2
- 230000003505 mutagenic effect Effects 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 239000000575 pesticide Substances 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- PYXFVCFISTUSOO-HKUCOEKDSA-N (20S)-protopanaxadiol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@H]([C@@](C)(O)CCC=C(C)C)[C@H]4[C@H](O)C[C@@H]3[C@]21C PYXFVCFISTUSOO-HKUCOEKDSA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- MVMSCBBUIHUTGJ-UHFFFAOYSA-N 10108-97-1 Natural products C1=2NC(N)=NC(=O)C=2N=CN1C(C(C1O)O)OC1COP(O)(=O)OP(O)(=O)OC1OC(CO)C(O)C(O)C1O MVMSCBBUIHUTGJ-UHFFFAOYSA-N 0.000 description 1
- ZUXNULGHCOXCFL-UHFFFAOYSA-N 2-(4-tert-butyl-2,6-dimethylphenyl)acetonitrile Chemical compound CC1=CC(C(C)(C)C)=CC(C)=C1CC#N ZUXNULGHCOXCFL-UHFFFAOYSA-N 0.000 description 1
- MIJYXULNPSFWEK-GTOFXWBISA-N 3beta-hydroxyolean-12-en-28-oic acid Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C MIJYXULNPSFWEK-GTOFXWBISA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- WFPZSXYXPSUOPY-ROYWQJLOSA-N ADP alpha-D-glucoside Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H]1O)O)N1C=2N=CN=C(C=2N=C1)N)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O WFPZSXYXPSUOPY-ROYWQJLOSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- CGPHZDRCVSLMCF-JZMIEXBBSA-N CDP-alpha-D-glucose Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)O1 CGPHZDRCVSLMCF-JZMIEXBBSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- JKLISIRFYWXLQG-UHFFFAOYSA-N Epioleonolsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4CCC3C21C JKLISIRFYWXLQG-UHFFFAOYSA-N 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- MVMSCBBUIHUTGJ-LRJDVEEWSA-N GDP-alpha-D-glucose Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H]1O)O)N1C=2N=C(NC(=O)C=2N=C1)N)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O MVMSCBBUIHUTGJ-LRJDVEEWSA-N 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 1
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- YBRJHZPWOMJYKQ-UHFFFAOYSA-N Oleanolic acid Natural products CC1(C)CC2C3=CCC4C5(C)CCC(O)C(C)(C)C5CCC4(C)C3(C)CCC2(C1)C(=O)O YBRJHZPWOMJYKQ-UHFFFAOYSA-N 0.000 description 1
- MIJYXULNPSFWEK-UHFFFAOYSA-N Oleanolinsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C MIJYXULNPSFWEK-UHFFFAOYSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 108010045306 T134 peptide Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000003712 anti-aging effect Effects 0.000 description 1
- 230000003266 anti-allergic effect Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000636 anti-proteolytic effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- LZMOBPWDHUQTKL-RWMBFGLXSA-N artemisinic acid Natural products CC1=C[C@@H]2[C@@H](CCC[C@H]2C(=C)C(=O)O)CC1 LZMOBPWDHUQTKL-RWMBFGLXSA-N 0.000 description 1
- 229930101531 artemisinin Natural products 0.000 description 1
- BLUAFEHZUWYNDE-NNWCWBAJSA-N artemisinin Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2OC(=O)[C@@H]4C BLUAFEHZUWYNDE-NNWCWBAJSA-N 0.000 description 1
- 229960004191 artemisinin Drugs 0.000 description 1
- PLQMEXSCSAIXGB-UHFFFAOYSA-N artemisininic acid Natural products C1=C(C)CCC2C(C)CCC(C(=C)C(O)=O)C21 PLQMEXSCSAIXGB-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- PYXFVCFISTUSOO-UHFFFAOYSA-N betulafolienetriol Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC(C(C)(O)CCC=C(C)C)C4C(O)CC3C21C PYXFVCFISTUSOO-UHFFFAOYSA-N 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 239000000039 congener Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- YSYKRGRSMLTJNL-URARBOGNSA-N dTDP-alpha-D-glucose Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@@H](O)C1 YSYKRGRSMLTJNL-URARBOGNSA-N 0.000 description 1
- ZOSQFDVXNQFKBY-GPNJKSCQSA-N dTDP-alpha-L-rhamnose Chemical compound C[C@@H]1O[C@@H](OP(O)(=O)OP(O)(=O)OC[C@H]2O[C@H](C[C@@H]2O)n2cc(C)c(=O)[nH]c2=O)[C@H](O)[C@H](O)[C@H]1O ZOSQFDVXNQFKBY-GPNJKSCQSA-N 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- JYGAZEJXUVDYHI-UHFFFAOYSA-N dihydroartemisininic acid Natural products C1CC(C)=CC2C(C(C)C(O)=O)CCC(C)C21 JYGAZEJXUVDYHI-UHFFFAOYSA-N 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- WQLVFSAGQJTQCK-UHFFFAOYSA-N diosgenin Natural products CC1C(C2(CCC3C4(C)CCC(O)CC4=CCC3C2C2)C)C2OC11CCC(C)CO1 WQLVFSAGQJTQCK-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 230000003832 immune regulation Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 229940042040 innovative drug Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 229940100243 oleanolic acid Drugs 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- HZLWUYJLOIAQFC-UHFFFAOYSA-N prosapogenin PS-A Natural products C12CC(C)(C)CCC2(C(O)=O)CCC(C2(CCC3C4(C)C)C)(C)C1=CCC2C3(C)CCC4OC1OCC(O)C(O)C1O HZLWUYJLOIAQFC-UHFFFAOYSA-N 0.000 description 1
- SWQINCWATANGKN-UHFFFAOYSA-N protopanaxadiol Natural products CC(CCC=C(C)C)C1CCC2(C)C1C(O)CC1C3(C)CCC(O)C(C)(C)C3CCC21C SWQINCWATANGKN-UHFFFAOYSA-N 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- NWMIYTWHUDFRPL-UHFFFAOYSA-N sapogenin Natural products COC(=O)C1(CO)C(O)CCC2(C)C1CCC3(C)C2CC=C4C5C(C)(O)C(C)CCC5(CCC34C)C(=O)O NWMIYTWHUDFRPL-UHFFFAOYSA-N 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 150000008130 triterpenoid saponins Chemical class 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
- C12P33/20—Preparation of steroids containing heterocyclic rings
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Cell Biology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及生物技术和植物生物学领域,具体地,本发明涉及一种糖基转移酶突变蛋白及其应用。The present invention relates to the fields of biotechnology and plant biology, and in particular to a glycosyltransferase mutant protein and application thereof.
背景技术Background Art
人参皂苷是从人参及其同属植物(如三七、西洋参等)中分离到的皂苷的总称,属于三萜类皂苷,是人参中的主要有效成份。目前,已经从人参中分离出了至少100种皂苷。从结构上来看,人参皂苷是皂苷元经过糖基化后形成的生物活性小分子。人参皂苷的皂苷元只有有限的几种,主要是达玛烷型的原人参二醇和原人参三醇,以及齐墩果烷酸。皂苷元通过糖基化后,可以提高水溶性,并产生出不同的生理活性。原人参三醇在C3,C6和C20位都有一个羟基,目前发现的原人参三醇型皂苷都是在C6和(或C20)的羟基上结合糖基化,在C3上结合糖基的原人参三醇型皂苷尚未见报道。糖基可以是葡萄糖、鼠李糖、木糖、和阿拉伯糖。Ginsenoside is a general term for saponins isolated from ginseng and its congener plants (such as Panax notoginseng, American ginseng, etc.). It belongs to the triterpenoid saponin and is the main active ingredient in ginseng. At present, at least 100 saponins have been isolated from ginseng. From a structural point of view, ginsenosides are bioactive small molecules formed by glycosylation of sapogenins. There are only a limited number of sapogenins in ginsenosides, mainly dammarane-type protopanaxadiol and protopanaxatriol, as well as oleanolic acid. After glycosylation, sapogenins can increase water solubility and produce different physiological activities. Protopanaxatriol has a hydroxyl group at C3, C6 and C20. The protopanaxatriol-type saponins discovered so far are all glycosylated on the hydroxyl groups of C6 and (or C20). Protopanaxatriol-type saponins with glycosyl groups at C3 have not been reported. The glycosyl group can be glucose, rhamnose, xylose, and arabinose.
一些原人参三醇型皂苷被证实具有广泛的生理功能和药用价值:包括抗过敏和抗炎症、免疫调节、抗疲劳、护心等功能。人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol)属于原人参三醇型皂苷,它在人参中的含量也非常低,也属于稀有人参皂苷。人参皂苷F1的结构与CK非常接近,也是在皂苷元的C-20位羟基上连有一个葡萄糖基。人参皂苷F1也具有独特的药用价值。它具有抗衰老和抗氧化的功能。人参皂苷Rh1(6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol)属于原人参三醇型皂苷,它在人参中的含量也非常低,也属于稀有人参皂苷。人参皂苷Rh1的结构与F1非常接近,但是它的糖基化位点是在C6位上的羟基。人参皂苷Rh1也具有特殊的生理功能,能够抗过敏和抗炎症。人参皂苷Rg1(6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxadiol)属于原人参三醇型皂苷,具有抗疲劳、护心等功能。Some protopanaxatriol saponins have been shown to have a wide range of physiological functions and medicinal values, including anti-allergic and anti-inflammatory, immune regulation, anti-fatigue, and heart protection. Ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol) belongs to the protopanaxatriol saponin, and its content in ginseng is also very low, and it is also a rare ginsenoside. The structure of ginsenoside F1 is very similar to CK, and there is also a glucose group attached to the C-20 hydroxyl group of the sapogenin. Ginsenoside F1 also has unique medicinal value. It has anti-aging and antioxidant functions. Ginsenoside Rh1 (6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol) belongs to the protopanaxatriol saponin, and its content in ginseng is also very low, and it is also a rare ginsenoside. The structure of ginsenoside Rh1 is very similar to that of F1, but its glycosylation site is the hydroxyl group at the C6 position. Ginsenoside Rh1 also has special physiological functions and can resist allergies and inflammation. Ginsenoside Rg1 (6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxadiol) belongs to the protopanaxatriol type saponin, which has anti-fatigue and heart protection functions.
稀有人参皂苷的糖基化修饰程度一般都比较低(只含有1-2个糖基),而丰富皂苷的糖基化修饰程度更高(由多个糖基组成的糖链)。稀有人参皂苷单体的传统生产方法,是以人参或者三七的总皂苷或者丰富皂苷为原料,依赖化学、酶和微生物发酵的水解方法。由于野生的人参资源已基本耗竭,人参皂苷资源目前来源于人参或三七的人工栽培,而其人工栽培的生长周期长(一般需要5-7年以上),并且受到地域的限制,还经常受到病虫害而需要施用大量的农药,所以,人参或三七的人工栽培有严重的连作障碍(人参或三七种植地需要休耕5-15年以上才能克服连作障碍),所以人参皂苷的产量、品质及安全性都面临挑战。The degree of glycosylation modification of rare ginsenosides is generally low (containing only 1-2 sugar groups), while the degree of glycosylation modification of rich saponins is higher (sugar chains composed of multiple sugar groups). The traditional production method of rare ginsenoside monomers is to use the total saponins or rich saponins of ginseng or Panax notoginseng as raw materials, relying on chemical, enzymatic and microbial fermentation hydrolysis methods. Since wild ginseng resources have been basically exhausted, ginsenoside resources currently come from the artificial cultivation of ginseng or Panax notoginseng, and the growth cycle of artificial cultivation is long (generally more than 5-7 years), and is subject to geographical restrictions. It is also often affected by pests and diseases and requires the application of large amounts of pesticides. Therefore, the artificial cultivation of ginseng or Panax notoginseng has serious continuous cropping obstacles (ginseng or Panax notoginseng planting areas need to be fallow for more than 5-15 years to overcome the continuous cropping obstacles), so the yield, quality and safety of ginsenosides are facing challenges.
合成生物学的发展为植物来源的天然产物的异源合成提供了新的机遇。以酵母为底盘,通过代谢途径的组装和优化,已经实现了用廉价的单糖来发酵合成青蒿酸或者双氢青蒿酸,继而再通过一步化学转化的方法生产青蒿素,这表明合成生物学在天然产物的药物合成方面具有的巨大潜力。利用酵母底盘细胞通过合成生物学方法异源合成稀有人参皂苷单体,原料为廉价的单糖,制备过程为安全性可调控的发酵过程,避免了任何外来污染(例如,原料植物人工种植时使用的农药),因此,通过合成生物学技术制备稀有人参皂苷单体,不仅具有成本优势,而且,可以保证成品的品质与安全性。利用合成生物学技术制备足够量的各种高纯度稀有人参皂苷单体,用于活性测定及临床实验,促进稀有人参皂苷的创新药物研发。The development of synthetic biology has provided new opportunities for the heterologous synthesis of natural products from plants. Using yeast as the chassis, through the assembly and optimization of metabolic pathways, it has been realized to use cheap monosaccharides to ferment and synthesize artemisinic acid or dihydroartemisinic acid, and then produce artemisinin through a one-step chemical transformation method, which shows the great potential of synthetic biology in the drug synthesis of natural products. Rare ginsenoside monomers are heterologously synthesized by synthetic biology methods using yeast chassis cells. The raw materials are cheap monosaccharides, and the preparation process is a safe and controllable fermentation process, avoiding any external pollution (for example, pesticides used when the raw material plants are artificially planted). Therefore, the preparation of rare ginsenoside monomers by synthetic biology technology not only has cost advantages, but also can ensure the quality and safety of the finished product. Use synthetic biology technology to prepare sufficient amounts of various high-purity rare ginsenoside monomers for activity determination and clinical trials, and promote the development of innovative drugs for rare ginsenosides.
利用合成生物学方法来人工合成这些具有药用活性的三醇型人参皂苷,不仅需要构建合成原人参三醇的代谢途径,还需要鉴定催化人参皂苷的糖基化的糖基转移酶。糖基转移酶的功能是将糖基供体(核苷二磷酸糖,例如UDP-葡萄糖)上的糖基转移到不同的糖基受体上。根据氨基酸序列的不同,目前糖基转移酶已有94个家族。目前已测序的植物基因组中,发现了上百种以上不同的糖基转移酶。这些糖基转移酶的糖基受体包括糖、脂、蛋白、核酸、抗生素和其它的小分子。The use of synthetic biology methods to artificially synthesize these triol-type ginsenosides with medicinal activity requires not only the construction of a metabolic pathway for the synthesis of protopanaxatriol, but also the identification of glycosyltransferases that catalyze the glycosylation of ginsenosides. The function of glycosyltransferases is to transfer the glycosyl on the glycosyl donor (nucleoside diphosphate sugar, such as UDP-glucose) to different glycosyl acceptors. Depending on the amino acid sequence, there are currently 94 families of glycosyltransferases. More than a hundred different glycosyltransferases have been found in the currently sequenced plant genomes. The glycosyl acceptors of these glycosyltransferases include sugars, lipids, proteins, nucleic acids, antibiotics and other small molecules.
因此,本领域需要对糖基转移酶进行更多的研究和开发,通过更多高产的糖基转移酶,以经济、高效的方式获取稀有人参皂苷。Therefore, more research and development of glycosyltransferases is needed in this field to obtain rare ginsenosides in an economical and efficient manner through more high-yield glycosyltransferases.
发明内容Summary of the invention
本发明提供了一种糖基转移酶的突变蛋白,通过采用所述的突变蛋白可以改善其酶活性或增加其表达量。The invention provides a mutant protein of glycosyltransferase, and the enzyme activity thereof can be improved or the expression amount thereof can be increased by adopting the mutant protein.
本发明的第一方面,提供了一种糖基转移酶的突变蛋白,所述的突变蛋白为非天然蛋白,且所述突变蛋白具有糖基转移酶催化活性,并且所述突变蛋白含有与酶催化活性相关的以下核心氨基酸:In a first aspect of the present invention, a mutant protein of a glycosyltransferase is provided, wherein the mutant protein is a non-natural protein, and the mutant protein has glycosyltransferase catalytic activity, and the mutant protein contains the following core amino acids related to the enzyme catalytic activity:
第142位氨基酸为A;The 142nd amino acid is A;
第144位氨基酸为H或F;和Amino acid at position 144 is H or F; and
第339位氨基酸为G;The 339th amino acid is G;
其中,所述氨基酸位置编号基于SEQ ID NO.:13所示的序列。Wherein, the amino acid position numbering is based on the sequence shown in SEQ ID NO.:13.
在另一优选例中,所述的突变蛋白除142、144、339位氨基酸外,其余氨基酸与SEQID NO.:13所示的序列相同或基本相同。In another preferred embodiment, except for amino acids 142, 144, and 339, the remaining amino acids of the mutant protein are identical or substantially identical to the sequence shown in SEQ ID NO.:13.
在另一优选例中,所述的基本相同是至多有50个(较佳地为1-20个,更佳地为1-10个)氨基酸不相同,其中,所述的不相同包括氨基酸的取代、缺失或添加,且所述的突变蛋白仍具有糖基转移酶活性。In another preferred embodiment, the basic identity means that there are at most 50 (preferably 1-20, more preferably 1-10) amino acid differences, wherein the differences include amino acid substitutions, deletions or additions, and the mutant protein still has glycosyltransferase activity.
在另一优选例中,与SEQ ID NO.:13所示序列的同源性至少为80%,较佳地至少为85%-90%,更佳地至少为95%,最佳地至少为98%。In another preferred embodiment, the homology with the sequence shown in SEQ ID NO.: 13 is at least 80%, preferably at least 85%-90%, more preferably at least 95%, and most preferably at least 98%.
在另一优选例中,所述的突变蛋白如SEQ ID NO.:13、或16所示。In another preferred embodiment, the mutant protein is shown in SEQ ID NO.: 13 or 16.
在另一优选例中,所述的突变蛋白不是SEQ ID NO.:1、2或4所示的序列。In another preferred embodiment, the mutant protein is not the sequence shown in SEQ ID NO.: 1, 2 or 4.
在另一优选例中,所述糖基转移酶催化活性包括催化以下一种或多种反应:In another preferred embodiment, the glycosyltransferase catalytic activity includes catalyzing one or more of the following reactions:
(a)将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟基;(a) transferring a glycosyl group from a glycosyl donor to the hydroxyl group at the C-20 position of a tetracyclic triterpenoid compound;
(b)将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基。(b) Transferring the glycosyl group from the glycosyl donor to the hydroxyl group at the C-6 position of the tetracyclic triterpenoid compound.
在另一优选例中,所述的突变蛋白如SEQ ID NO.:6、11-12所示,且所述的糖基转移酶催化活性包括将来自糖基供体的糖基转移到四环三萜类化合物的C-20位以及C-6位的羟基。In another preferred embodiment, the mutant protein is as shown in SEQ ID NO.: 6, 11-12, and the glycosyltransferase catalytic activity includes transferring the glycosyl from the glycosyl donor to the C-20 position and the C-6 hydroxyl group of the tetracyclic triterpenoid compound.
在另一优选例中,所述的突变蛋白如SEQ ID NO.:6、8、11-12、16所示,且所述的糖基转移酶活性包括将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基。In another preferred embodiment, the mutant protein is as shown in SEQ ID NO.: 6, 8, 11-12, 16, and the glycosyltransferase activity includes transferring the glycosyl from the glycosyl donor to the C-6 hydroxyl group of the tetracyclic triterpenoid compound.
在另一优选例中,所述的突变蛋白如SEQ ID NO.:6-7、9-13所示,且所述的糖基转移酶活性包括将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟基。In another preferred embodiment, the mutant protein is as shown in SEQ ID NO.: 6-7, 9-13, and the glycosyltransferase activity includes transferring the glycosyl from the glycosyl donor to the C-20 hydroxyl group of the tetracyclic triterpenoid compound.
在另一优选例中,所述的糖基转移酶催化活性包括催化以下一种或多种反应:In another preferred embodiment, the glycosyltransferase catalytic activity includes catalyzing one or more of the following reactions:
(A)(A)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),式(II)化合物为人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),所述的突变蛋白包括SEQID NO.:6-7、9-13;Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 6-7, 9-13;
(B)(B)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),式(III)化合物为人参皂苷Rh1(6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol);或Wherein, the compound of formula (I) is protopanaxatriol, and the compound of formula (III) is ginsenoside Rh1 (6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol); or
式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(IV)化合物是人参皂苷Rg1(6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol);The compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the compound of formula (IV) is ginsenoside Rg1 (6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol);
所述的突变蛋白包括SEQ ID NO.:6、8、11-12、16;The mutant proteins include SEQ ID NO.: 6, 8, 11-12, 16;
(C)(C)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),所述式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(IV)化合物为人参皂苷Rg1(6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol),所述的突变蛋白包括SEQ ID NO.:6、11-12;Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), the compound of formula (IV) is ginsenoside Rg1 (6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 6, 11-12;
(D)(D)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),所述式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(III)化合物为人参皂苷Rh1(6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),所述的突变蛋白包括SEQ ID NO.:11-12。Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), the compound of formula (III) is ginsenoside Rh1 (6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 11-12.
在另一优选例中,所述的糖基供体包括核苷二磷酸糖,优选地,所述的糖基供体选自下组:UDP-葡萄糖,ADP-葡萄糖,TDP-葡萄糖,CDP-葡萄糖,GDP-葡萄糖,UDP-半乳糖醛酸,ADP-半乳糖醛酸,TDP-半乳糖醛酸,CDP-半乳糖醛酸,GDP-半乳糖醛酸,UDP-半乳糖,ADP-半乳糖,TDP-半乳糖,CDP-半乳糖,GDP-半乳糖,UDP-阿拉伯糖,ADP-阿拉伯糖,TDP-阿拉伯糖,CDP-阿拉伯糖,GDP-阿拉伯糖,UDP-鼠李糖,ADP-鼠李糖,TDP-鼠李糖,CDP-鼠李糖,GDP-鼠李糖,或其他核苷二磷酸己糖或核苷二磷酸戊糖,或其组合。In another preferred embodiment, the glycosyl donor includes a nucleoside diphosphate sugar. Preferably, the glycosyl donor is selected from the following group: UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose, TDP-galactose, CDP-galactose, GDP-galactose, UDP-arabinose, ADP-arabinose, TDP-arabinose, CDP-arabinose, GDP-arabinose, UDP-rhamnose, ADP-rhamnose, TDP-rhamnose, CDP-rhamnose, GDP-rhamnose, or other nucleoside diphosphate hexoses or nucleoside diphosphate pentoses, or a combination thereof.
在另一优选例中,所述的糖基供体包括尿苷二磷酸(UDP)糖,优选地,所述的糖基供体选自下组:UDP-葡萄糖,UDP-半乳糖醛酸,UDP-半乳糖,UDP-阿拉伯糖,UDP-鼠李糖,或其他尿苷二磷酸己糖或尿苷二磷酸戊糖,或其组合。In another preferred embodiment, the glycosyl donor includes uridine diphosphate (UDP) sugar, preferably, the glycosyl donor is selected from the following group: UDP-glucose, UDP-galacturonic acid, UDP-galactose, UDP-arabinose, UDP-rhamnose, or other uridine diphosphate hexose or uridine diphosphate pentose, or a combination thereof.
在另一优选例中,反应体系的pH为:pH4.0-10.0,优选pH为6.0-8.5。In another preferred embodiment, the pH of the reaction system is: pH 4.0-10.0, preferably pH 6.0-8.5.
在另一优选例中,反应体系的温度为:10℃-105℃,优选25℃-35℃。In another preferred embodiment, the temperature of the reaction system is 10°C-105°C, preferably 25°C-35°C.
在另一优选例中,所述的突变蛋白选自:In another preferred embodiment, the mutant protein is selected from:
(a1)SEQ ID NO.:13所示的序列;或(a1) the sequence shown in SEQ ID NO.: 13; or
(b1)SEQ ID NO.:13所示的序列经过一个或多个(优选为1-50个,较佳地,为1-20个,更佳地,为1-10个,如1、2、3、4、5、6、7、8、9或10个)氨基酸的取代、添加和/或缺失后,并保留糖基转移酶催化活性的突变蛋白。(b1) A mutant protein having the glycosyltransferase catalytic activity after one or more (preferably 1-50, preferably 1-20, more preferably 1-10, such as 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acid substitutions, additions and/or deletions of the sequence shown in SEQ ID NO.: 13.
在另一优选例中,所述的取代为1-50个保守氨基酸的取代,为1-20个,更佳地,为1-10个,如1、2、3、4、5、6、7、8、9或10个。In another preferred embodiment, the substitution is 1-50 conservative amino acid substitutions, preferably 1-20, more preferably 1-10, such as 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10.
在另一优选例中,所述的缺失为C端1-5个氨基酸的缺失,较佳地为2-4个氨基酸的缺失。In another preferred embodiment, the deletion is a deletion of 1-5 amino acids at the C-terminus, preferably a deletion of 2-4 amino acids.
在另一优选例中,所述的突变蛋白序列如SEQ ID NO.:6-8所示。In another preferred embodiment, the mutant protein sequence is shown in SEQ ID NO.: 6-8.
在另一优选例中,当所述突变蛋白的C端缺失1-5个氨基酸后,其表达量比SEQ IDNO.:13所示的蛋白提高了至少50%,更佳地为70%-100%。In another preferred embodiment, when 1-5 amino acids are missing from the C-terminus of the mutant protein, its expression level is increased by at least 50%, more preferably 70%-100%, compared with the protein shown in SEQ ID NO.: 13.
在另一优选例中,所述的添加为添加标签序列、信号序列或分泌信号序列。In another preferred embodiment, the addition is the addition of a tag sequence, a signal sequence or a secretion signal sequence.
在另一优选例中,所述的突变蛋白还含有以下一个或多个氨基酸:In another preferred embodiment, the mutant protein further contains one or more of the following amino acids:
第10位氨基酸为V;The 10th amino acid is V;
第13位氨基酸为F;The 13th amino acid is F;
第82位氨基酸为C;和Amino acid at position 82 is C; and
第186位氨基酸为L。The 186th amino acid is L.
本发明第二方面,提供了一种多核苷酸,所述的多核苷酸编码本发明第一方面所述的突变蛋白。The second aspect of the present invention provides a polynucleotide, wherein the polynucleotide encodes the mutant protein described in the first aspect of the present invention.
在另一优选例中,所述的多核苷酸序列如SEQ ID NO.:19和21所示,分别编码SEQID NO.:13和16所示的序列。In another preferred embodiment, the polynucleotide sequences are as shown in SEQ ID NOs.: 19 and 21, encoding the sequences shown in SEQ ID NOs.: 13 and 16, respectively.
本发明第三方面,提供了一种载体,所述的载体含有本发明第二方面所述的多核苷酸。The third aspect of the present invention provides a vector, wherein the vector contains the polynucleotide described in the second aspect of the present invention.
在另一优选例中,所述载体包括表达载体、穿梭载体、整合载体。In another preferred embodiment, the vector includes an expression vector, a shuttle vector, and an integration vector.
本发明第四方面,提供了一种宿主细胞,所述的宿主细胞含有本发明第三方面所述的载体,或其基因组中整合有本发明第二方面所示的多核苷酸。The fourth aspect of the present invention provides a host cell, wherein the host cell contains the vector described in the third aspect of the present invention, or the polynucleotide described in the second aspect of the present invention is integrated into its genome.
在另一优选例中,所述的宿主细胞为真核细胞,如酵母细胞或植物细胞。In another preferred embodiment, the host cell is a eukaryotic cell, such as a yeast cell or a plant cell.
在另一优选例中,所述的宿主细胞为原核细胞,如大肠杆菌。In another preferred embodiment, the host cell is a prokaryotic cell, such as Escherichia coli.
在另一优选例中,所述的宿主细胞为人参细胞。In another preferred embodiment, the host cell is a ginseng cell.
本发明第五方面,提供了一种改造糖基转移酶的方法,包括步骤:In a fifth aspect, the present invention provides a method for modifying glycosyltransferase, comprising the steps of:
(i)将所述糖基转移酶与SEQ ID NO.:13所示的序列进行同源性比对;(i) performing homology comparison between the glycosyltransferase and the sequence shown in SEQ ID NO.: 13;
(ii)筛选出步骤(i)中与SEQ ID NO.:13所示序列同源性至少为80%的序列;较佳地至少为85%-90%,更佳地至少为95%,最佳地至少为98%;(ii) screening out the sequence in step (i) having at least 80% homology to the sequence shown in SEQ ID NO.: 13; preferably at least 85%-90%, more preferably at least 95%, and most preferably at least 98%;
(iii)对步骤(ii)中筛选出的序列进行改造,改造后的糖基转移酶含有以下核心氨基酸:(iii) modifying the sequence selected in step (ii), wherein the modified glycosyltransferase contains the following core amino acids:
第142位氨基酸为A;The 142nd amino acid is A;
第144位氨基酸为H或F;和Amino acid at position 144 is H or F; and
第339位氨基酸为G;The 339th amino acid is G;
其中,所述氨基酸位置编号基于SEQ ID NO.:13所示的序列,Wherein, the amino acid position numbering is based on the sequence shown in SEQ ID NO.: 13,
且所述的核心氨基酸至少有一个是经人为改造的。And at least one of the core amino acids is artificially modified.
在另一优选例中,所述改造后的糖基转移酶还含有以下核心氨基酸:In another preferred embodiment, the modified glycosyltransferase further contains the following core amino acids:
第10位氨基酸为V;The 10th amino acid is V;
第13位氨基酸为F;The 13th amino acid is F;
第82位氨基酸为C;和/或The amino acid at position 82 is C; and/or
第186位氨基酸为L;The 186th amino acid is L;
在另一优选例中,所述改造后的糖基转移酶C末端缺失了1-5个氨基酸。In another preferred embodiment, the C-terminus of the modified glycosyltransferase lacks 1-5 amino acids.
本发明第六方面,提供了本发明第一方面所述的突变蛋白的用途,所述的突变蛋白用于催化以下一种或多种反应,或被用于制备催化以下一种或多种反应的催化制剂:In a sixth aspect, the present invention provides the use of the mutant protein described in the first aspect of the present invention, wherein the mutant protein is used to catalyze one or more of the following reactions, or is used to prepare a catalytic preparation for catalyzing one or more of the following reactions:
(a)将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟基;(a) transferring a glycosyl group from a glycosyl donor to the hydroxyl group at the C-20 position of a tetracyclic triterpenoid compound;
(b)将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基。(b) Transferring the glycosyl group from the glycosyl donor to the hydroxyl group at the C-6 position of the tetracyclic triterpenoid compound.
在另一优选例中,所述的突变蛋白用于催化下述一种或多种反应或被用于制备催化下述一种或多种反应的催化制剂:In another preferred embodiment, the mutant protein is used to catalyze one or more of the following reactions or is used to prepare a catalytic preparation for catalyzing one or more of the following reactions:
本发明第七方面,提供了一种进行糖基催化反应的方法,包括步骤:在本发明第一方面所述突变蛋白的存在下,进行糖基化反应。The seventh aspect of the present invention provides a method for performing a glycosylation catalytic reaction, comprising the steps of: performing a glycosylation reaction in the presence of the mutant protein described in the first aspect of the present invention.
在另一优选例中,所述的糖基化反应包括在本发明第一方面所述突变蛋白的存在下,将糖基供体转移到四环三萜类化合物的以下位点上:In another preferred embodiment, the glycosylation reaction comprises transferring a glycosyl donor to the following site of the tetracyclic triterpenoid compound in the presence of the mutant protein according to the first aspect of the present invention:
C-20位和/或C-6位,从而形成糖基化的四环三萜类化合物。C-20 and/or C-6, thereby forming a glycosylated tetracyclic triterpenoid compound.
在另一优选例中,所述的糖基催化反应的底物包括达玛烷型四环三萜类化合物、羊毛脂烷型四环三萜类化合物、甘遂烷型四环三萜类化合物、环阿屯烷(环阿尔廷烷)型四环三萜类化合物、葫芦烷四环三萜类化合物、或楝烷型四环三萜类化合物。In another preferred embodiment, the substrates of the sugar-catalyzed reaction include dammarane-type tetracyclic triterpenoid compounds, lanolin-type tetracyclic triterpenoid compounds, gansuane-type tetracyclic triterpenoid compounds, cycloartanane (cycloartinane)-type tetracyclic triterpenoid compounds, cucurbitane-type tetracyclic triterpenoid compounds, or meline-type tetracyclic triterpenoid compounds.
本发明第八方面,提供了一种提高糖基转移酶表达量的方法,包括步骤:In an eighth aspect, the present invention provides a method for increasing the expression level of glycosyltransferase, comprising the steps of:
(i)将所述糖基转移酶与SEQ ID NO.:13所示的序列进行同源性比对;(i) performing homology comparison between the glycosyltransferase and the sequence shown in SEQ ID NO.: 13;
(ii)筛选出步骤(i)中与SEQ ID NO.:13所示序列同源性至少为80%的序列;较佳地至少为85%-90%,更佳地至少为95%,最佳地至少为98%;(ii) screening out the sequence in step (i) having at least 80% homology to the sequence shown in SEQ ID NO.: 13; preferably at least 85%-90%, more preferably at least 95%, and most preferably at least 98%;
(iii)对步骤(ii)中筛选出的序列进行C端1-5个氨基酸的缺失。(iii) deleting 1 to 5 amino acids at the C-terminus of the sequence selected in step (ii).
本发明第九方面,提供了本发明第四方面所述宿主细胞的用途,用于制备糖基转移酶,或作为催化细胞、或生产糖基化后的四环三萜类化合物。The ninth aspect of the present invention provides the use of the host cell described in the fourth aspect of the present invention for preparing glycosyltransferase, or as a catalytic cell, or for producing glycosylated tetracyclic triterpenoid compounds.
本发明第十方面,提供了一种产生转基因植物的方法,包括步骤:将本发明第四方面所述的宿主细胞再生为植物,其中,所述的宿主细胞为植物细胞。The tenth aspect of the present invention provides a method for producing transgenic plants, comprising the steps of: regenerating the host cell described in the fourth aspect of the present invention into a plant, wherein the host cell is a plant cell.
本发明第十一方面,提供了一种糖基转移酶的突变蛋白,所述的突变蛋白为非天然蛋白,且所述突变蛋白具有糖基转移酶催化活性,并且所述突变蛋白含有与酶催化活性相关的以下核心氨基酸:In the eleventh aspect of the present invention, a mutant protein of a glycosyltransferase is provided, wherein the mutant protein is a non-natural protein, and the mutant protein has glycosyltransferase catalytic activity, and the mutant protein contains the following core amino acids related to the enzyme catalytic activity:
第82位氨基酸为C;和/或The amino acid at position 82 is C; and/or
第144位氨基酸为FThe 144th amino acid is F
其中,所述氨基酸位置编号基于SEQ ID NO.:11所示的序列,且所述的糖基转移酶催化活性包括将来自糖基供体的糖基转移到四环三萜类化合物的C-20位以及C-6位的羟基。Wherein, the amino acid position numbering is based on the sequence shown in SEQ ID NO.: 11, and the glycosyltransferase catalytic activity includes transferring the glycosyl from the glycosyl donor to the C-20 position and the C-6 hydroxyl group of the tetracyclic triterpene compound.
在另一优选例中,本发明第十一方面所述的突变蛋白与SEQ ID NO.:11所示序列同源性至少为90%,较佳地至少为95%,更佳地至少为98%。In another preferred embodiment, the mutant protein described in the eleventh aspect of the present invention has a sequence homology of at least 90%, preferably at least 95%, and more preferably at least 98% with the sequence shown in SEQ ID NO.: 11.
在另一优选例中,本发明第十一方面所述的突变蛋白除82、和/或144位氨基酸外,其余氨基酸与SEQ ID NO.:11所示的序列相同或基本相同。In another preferred embodiment, except for amino acid 82 and/or amino acid 144, the remaining amino acids of the mutant protein described in the eleventh aspect of the present invention are identical or substantially identical to the sequence shown in SEQ ID NO.: 11.
在另一优选例中,所述的基本相同是至多有50个(较佳地为1-20个,更佳地为1-10个)氨基酸不相同,其中,所述的不相同包括氨基酸的取代、缺失或添加,且所述的突变蛋白具有将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟的糖基转移酶活性。In another preferred embodiment, the basic identity is that there are at most 50 (preferably 1-20, more preferably 1-10) amino acids that are different, wherein the difference includes amino acid substitution, deletion or addition, and the mutant protein has glycosyltransferase activity of transferring the glycosyl from the glycosyl donor to the C-20 hydroxyl of the tetracyclic triterpene compound.
本发明第十二方面,提供了一种糖基转移酶的突变蛋白,所述的突变蛋白为非天然蛋白,且所述突变蛋白具有糖基转移酶催化活性,并且所述突变蛋白含有与酶催化活性相关的以下核心氨基酸:In a twelfth aspect of the present invention, a mutant protein of a glycosyltransferase is provided, wherein the mutant protein is a non-natural protein, and the mutant protein has glycosyltransferase catalytic activity, and the mutant protein contains the following core amino acids related to the enzyme catalytic activity:
第142位氨基酸为A;The 142nd amino acid is A;
第186位氨基酸为L;和The amino acid at position 186 is L; and
第338位氨基酸为GThe 338th amino acid is G
其中,所述氨基酸位置编号基于SEQ ID NO.:16所示的序列,且所述的糖基转移酶催化活性包括将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基。Wherein, the amino acid position numbering is based on the sequence shown in SEQ ID NO.: 16, and the glycosyltransferase catalytic activity includes transferring the glycosyl from the glycosyl donor to the C-6 hydroxyl group of the tetracyclic triterpene compound.
在另一优选例中,本发明第十二方面所述的突变蛋白与SEQ ID NO.:16所示序列同源性至少为90%,较佳地至少为95%,更佳地至少为98%。In another preferred embodiment, the mutant protein described in the twelfth aspect of the present invention has a sequence homology of at least 90% with that of SEQ ID NO.: 16, preferably at least 95%, and more preferably at least 98%.
在另一优选例中,本发明第十二方面所述的突变蛋白除142、186、338位氨基酸外,其余氨基酸与SEQ ID NO.:16所示的序列相同或基本相同。In another preferred embodiment, except for amino acids 142, 186, and 338, the remaining amino acids of the mutant protein described in the twelfth aspect of the present invention are identical or substantially identical to the sequence shown in SEQ ID NO.:16.
在另一优选例中,所述的基本相同是至多有50个(较佳地为1-20个,更佳地为1-10个)氨基酸不相同,其中,所述的不相同包括氨基酸的取代、缺失或添加,且所述的突变蛋白具有将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟的糖基转移酶活性。In another preferred embodiment, the basic identity is that there are at most 50 (preferably 1-20, more preferably 1-10) amino acids that are different, wherein the difference includes amino acid substitution, deletion or addition, and the mutant protein has glycosyltransferase activity that transfers the glycosyl from the glycosyl donor to the C-6 hydroxyl of the tetracyclic triterpene compound.
应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described below (such as embodiments) can be combined with each other to form a new or preferred technical solution. Due to space limitations, they will not be described one by one here.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1.五个糖基转移酶的氨基酸序列比对。Figure 1. Amino acid sequence alignment of five glycosyltransferases.
图2.五个糖基转移酶体外催化的HPLC图。1,UGTPg1催化PPT;2,UGTPg100催化PPT;3,UGTPg101催化PPT;4,UGTPg102催化PPT;5,UGTPg103催化PPT;6,pET28a空载体作为control;7,UGTPg101催化F1;8,UGTPg101催化Rh1。Figure 2. HPLC images of five glycosyltransferases catalyzed in vitro. 1, UGTPg1 catalyzes PPT; 2, UGTPg100 catalyzes PPT; 3, UGTPg101 catalyzes PPT; 4, UGTPg102 catalyzes PPT; 5, UGTPg103 catalyzes PPT; 6, pET28a empty vector as control; 7, UGTPg101 catalyzes F1; 8, UGTPg101 catalyzes Rh1.
图3.决定糖基转移酶UGTPg1和UGTPg102催化C20-OH糖基化的关键氨基酸位点。HPLC图如下:1,UGTPg1催化PPT;2,UGTPg102催化PPT;3,UGTPg1-L38F催化PPT;4,UGTPg1-A40S催化PPT;5,UGTPg1-F85L催化PPT;6,UGTPg1-T134I催化PPT;7,UGTPg1-H144Y催化PPT;8,UGTPg1-Q388H催化PPT;9,UGTPg102-Y144H催化PPT。Figure 3. Key amino acid sites that determine the glycosylation of C20-OH catalyzed by glycosyltransferases UGTPg1 and UGTPg102. The HPLC graphs are as follows: 1, PPT catalyzed by UGTPg1; 2, PPT catalyzed by UGTPg102; 3, PPT catalyzed by UGTPg1-L38F; 4, PPT catalyzed by UGTPg1-A40S; 5, PPT catalyzed by UGTPg1-F85L; 6, PPT catalyzed by UGTPg1-T134I; 7, PPT catalyzed by UGTPg1-H144Y; 8, PPT catalyzed by UGTPg1-Q388H; 9, PPT catalyzed by UGTPg102-Y144H.
图4.决定糖基转移酶UGTPg100和UGTPg103催化C6-OH糖基化的关键氨基酸位点。HPLC图如下:1,UGTPg100催化PPT;2,UGTPg103催化PPT;3,UGTPg100-A142T催化PPT;4,UGTPg100-L186S催化PPT;5,UGTPg100-W205R催化PPT;6,UGTPg100-G338R催化PPT;7,UGTPg103-T142A催化PPT;8,UGTPg103-T142A/S186L催化PPT;9,UGTPg103-T142A/S186L/G338R催化PPT。Figure 4. Key amino acid sites that determine the glycosylation of C6-OH catalyzed by glycosyltransferases UGTPg100 and UGTPg103. The HPLC graphs are as follows: 1, PPT catalyzed by UGTPg100; 2, PPT catalyzed by UGTPg103; 3, PPT catalyzed by UGTPg100-A142T; 4, PPT catalyzed by UGTPg100-L186S; 5, PPT catalyzed by UGTPg100-W205R; 6, PPT catalyzed by UGTPg100-G338R; 7, PPT catalyzed by UGTPg103-T142A; 8, PPT catalyzed by UGTPg103-T142A/S186L; 9, PPT catalyzed by UGTPg103-T142A/S186L/G338R.
图5.改变UGTPg1底物区域特异性的关键氨基酸位点。HPLC图如下:1,UGTPg1催化PPT;2,UGTPg100催化PPT;3,UGTPg1-A10V催化PPT;4,UGTPg1-I13F催化PPT;5,UGTPg1-H82C催化PPT;6,UGTPg1-H144F催化PPT。Figure 5. Key amino acid sites that change the substrate regiospecificity of UGTPg1. The HPLC graphs are as follows: 1, UGTPg1 catalyzed PPT; 2, UGTPg100 catalyzed PPT; 3, UGTPg1-A10V catalyzed PPT; 4, UGTPg1-I13F catalyzed PPT; 5, UGTPg1-H82C catalyzed PPT; 6, UGTPg1-H144F catalyzed PPT.
图6.截短UGTPg1,UGTPg100和UGTPg101的C末端氨基酸增强其在大肠杆菌中的可溶表达量。A,野生型UGTs以及C末端截短后的UGTs在大肠杆菌中表达后细胞裂解液上清的Western blot。C2,C4,C5分别表示C末端截短2,4,5个氨基酸。B,野生型UGTs以及C末端截短后的UGTs催化PPT的TLC图。Figure 6. Truncated C-terminal amino acids of UGTPg1, UGTPg100, and UGTPg101 enhance their soluble expression in E. coli. A, Western blot of cell lysate supernatant after wild-type UGTs and C-terminally truncated UGTs were expressed in E. coli. C2, C4, and C5 indicate C-terminally truncated 2, 4, and 5 amino acids, respectively. B, TLC of PPT catalyzed by wild-type UGTs and C-terminally truncated UGTs.
具体实施方式DETAILED DESCRIPTION
通过广泛而深入的研究,本发明人通过同源比对、定点突变等方式确定了糖基转移酶催化C-6或C-20的羟基糖基化的关键氨基酸位点。本发明发现,对糖基转移酶中的关键位点进行改造后,可以改变原本无活性或活性较差的酶,或者赋予新的酶活性。此外,本发明人还发现,对本发明糖基转移酶进行末端截短后,其表达量具有明显的上升。在此基础上,完成了本发明。Through extensive and in-depth research, the inventors determined the key amino acid sites of glycosyltransferase catalyzing the glycosylation of C-6 or C-20 hydroxyl groups by homology comparison, site-directed mutagenesis, etc. The present invention found that after the key sites in the glycosyltransferase were modified, the originally inactive or poorly active enzymes could be changed, or new enzyme activities could be conferred. In addition, the inventors also found that after the terminal truncation of the glycosyltransferase of the present invention, its expression level had a significant increase. On this basis, the present invention was completed.
本发明突变蛋白及其编码核酸Mutant protein and its encoding nucleic acid of the present invention
如本文所用,术语“突变蛋白”、“本发明突变蛋白”、“本发明糖基转移酶”均指非天然存在的糖基转移酶突变蛋白,且所述突变蛋白为SEQ ID NO.:13(或SEQ ID NO.:16)所示蛋白,或基于SEQ ID NO.:13(或SEQ ID NO.:16)所示蛋白进行人工改造的蛋白,其中,所述的突变蛋白含有与酶催化活性相关的核心氨基酸,且所述核心氨基酸中至少有一个是经过人工改造的;并且本发明突变蛋白具有催化以下一种或多种反应的糖基转移酶活性:As used herein, the terms "mutant protein", "mutant protein of the present invention", and "glycosyltransferase of the present invention" all refer to non-naturally occurring glycosyltransferase mutant proteins, and the mutant protein is a protein shown in SEQ ID NO.: 13 (or SEQ ID NO.: 16), or a protein artificially modified based on the protein shown in SEQ ID NO.: 13 (or SEQ ID NO.: 16), wherein the mutant protein contains core amino acids related to enzyme catalytic activity, and at least one of the core amino acids is artificially modified; and the mutant protein of the present invention has glycosyltransferase activity that catalyzes one or more of the following reactions:
(a)将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟基;(a) transferring a glycosyl group from a glycosyl donor to the hydroxyl group at the C-20 position of a tetracyclic triterpenoid compound;
(b)将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基。(b) Transferring the glycosyl group from the glycosyl donor to the hydroxyl group at the C-6 position of the tetracyclic triterpenoid compound.
如非特别说明,本文所说的人参皂苷和皂苷元,是的C20位S构型的人参皂苷和皂苷元。Unless otherwise specified, the ginsenosides and sapogenins mentioned in this article are ginsenosides and sapogenins with S configuration at the C20 position.
术语“核心氨基酸”指的是基于SEQ ID NO.:13(或SEQ ID NO.:16),且与SEQ IDNO.:13(或SEQ ID NO.:16)同源性达至少80%,如84%、85%、90%、92%、95%、98%的序列中,相应位点是本文所述的特定氨基酸,如基于SEQ ID NO.:13所示的序列,核心氨基酸为:The term "core amino acid" refers to a sequence based on SEQ ID NO.: 13 (or SEQ ID NO.: 16) and having at least 80%, such as 84%, 85%, 90%, 92%, 95%, 98% homology to SEQ ID NO.: 13 (or SEQ ID NO.: 16), wherein the corresponding positions are specific amino acids described herein, such as based on the sequence shown in SEQ ID NO.: 13, the core amino acids are:
第142位氨基酸为A;The 142nd amino acid is A;
第144位氨基酸为H或F;和Amino acid at position 144 is H or F; and
第339位氨基酸为G,且含有所述核心氨基酸的突变蛋白具有将来自糖基供体的糖基转移到四环三萜类化合物的C-20位的羟基的催化活性。The amino acid at position 339 is G, and a mutant protein containing the core amino acid has a catalytic activity of transferring a glycosyl group from a glycosyl donor to a hydroxyl group at position C-20 of a tetracyclic triterpenoid compound.
又如基于SEQ ID NO.:16所示的序列,核心氨基酸为:For example, based on the sequence shown in SEQ ID NO.: 16, the core amino acids are:
第142位氨基酸为A;The 142nd amino acid is A;
第186位氨基酸为L;和The amino acid at position 186 is L; and
第338位氨基酸为G,且含有所述核心氨基酸的突变蛋白具有将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的羟基的催化活性。The amino acid at position 338 is G, and a mutant protein containing the core amino acid has a catalytic activity of transferring a glycosyl group from a glycosyl donor to a hydroxyl group at the C-6 position of a tetracyclic triterpenoid compound.
应理解,本发明突变蛋白中的氨基酸编号基于SEQ ID NO.:13(或SEQ ID NO.:16)作出,当某一具体突变蛋白与SEQ ID NO.:13(或SEQ ID NO.:16)所示序列的同源性达到80%或以上时,突变蛋白的氨基酸编号可能会有相对于SEQ ID NO.:13(或SEQ ID NO.:16)的氨基酸编号的错位,如向氨基酸的N末端或C末端错位1-5位,而采用本领域常规的序列比对技术,本领域技术人员通常可以理解这样的错位是在合理范围内的,且不应当由于氨基酸编号的错位而使同源性达80%(如90%、95%、98%)的、具有相同或相似糖基转移酶活性的突变蛋白不在本发明突变蛋白的范围内。It should be understood that the amino acid numbering in the mutant protein of the present invention is based on SEQ ID NO.: 13 (or SEQ ID NO.: 16). When the homology between a specific mutant protein and the sequence shown in SEQ ID NO.: 13 (or SEQ ID NO.: 16) reaches 80% or more, the amino acid numbering of the mutant protein may be misplaced relative to the amino acid numbering of SEQ ID NO.: 13 (or SEQ ID NO.: 16), such as misplacement of 1-5 positions to the N-terminus or C-terminus of the amino acid. Using conventional sequence alignment techniques in the art, those skilled in the art can generally understand that such misplacement is within a reasonable range, and mutant proteins with a homology of 80% (such as 90%, 95%, 98%) and the same or similar glycosyltransferase activity should not be excluded from the scope of the mutant protein of the present invention due to the misplacement of amino acid numbering.
本发明突变蛋白是合成蛋白或重组蛋白,即可以是化学合成的产物,或使用重组技术从原核或真核宿主(例如,细菌、酵母、植物)中产生。根据重组生产方案所用的宿主,本发明的突变蛋白可以是糖基化的,或可以是非糖基化的。本发明的突变蛋白还可包括或不包括起始的甲硫氨酸残基。Mutants of the present invention are synthetic proteins or recombinant proteins, i.e., they can be the product of chemical synthesis, or they can be produced from prokaryotic or eukaryotic hosts (e.g., bacteria, yeast, plants) using recombinant technology. Depending on the host used in the recombinant production scheme, the mutagen of the present invention can be glycosylated, or it can be non-glycosylated. The mutagen of the present invention may also include or not include an initial methionine residue.
本发明还包括所述突变蛋白的片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持所述突变蛋白相同的生物学功能或活性的蛋白。The present invention also includes fragments, derivatives and analogs of the mutant protein. As used herein, the terms "fragment", "derivative" and "analog" refer to proteins that substantially retain the same biological function or activity of the mutant protein.
本发明的突变蛋白片段、衍生物或类似物可以是(i)有一个或多个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的突变蛋白,而这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的,或(ii)在一个或多个氨基酸残基中具有取代基团的突变蛋白,或(iii)成熟突变蛋白与另一个化合物(比如延长突变蛋白半衰期的化合物,例如聚乙二醇)融合所形成的突变蛋白,或(iv)附加的氨基酸序列融合到此突变蛋白序列而形成的突变蛋白(如前导序列或分泌序列或用来纯化此突变蛋白的序列或蛋白原序列,或与抗原IgG片段的形成的融合蛋白)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。本发明中,保守性替换的氨基酸最好根据表I进行氨基酸替换而产生。Mutant protein fragments, derivatives or analogs of the present invention can be (i) substituted mutant proteins with one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues), and such substituted amino acid residues may or may not be encoded by the genetic code, or (ii) mutant proteins with substituent groups in one or more amino acid residues, or (iii) mutant proteins formed by fusion of mature mutant proteins with another compound (such as a compound that prolongs the half-life of the mutant protein, such as polyethylene glycol), or (iv) mutant proteins formed by fusion of additional amino acid sequences to the mutant protein sequence (such as a leader sequence or secretory sequence or a sequence or proprotein sequence used to purify the mutant protein, or a fusion protein formed with an antigen IgG fragment). According to the teachings of this article, these fragments, derivatives and analogs belong to the well-known scope of those skilled in the art. In the present invention, the conservatively substituted amino acids are preferably produced by amino acid substitution according to Table I.
表ITable I
在本发明的活性突变蛋白具有糖基转移酶活性,并且能够催化以下一种或多种反应:The active mutant protein of the present invention has glycosyltransferase activity and can catalyze one or more of the following reactions:
(A)(A)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),式(II)化合物为人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),所述的突变蛋白包括SEQID NO.:6-7、9-13;Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 6-7, 9-13;
(B)(B)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),式(III)化合物为人参皂苷Rh1(6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol);或Wherein, the compound of formula (I) is protopanaxatriol, and the compound of formula (III) is ginsenoside Rh1 (6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol); or
式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(IV)化合物是人参皂苷Rg1(6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol);The compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the compound of formula (IV) is ginsenoside Rg1 (6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol);
所述的突变蛋白包括SEQ ID NO.:6、8、11-12、16;The mutant proteins include SEQ ID NO.: 6, 8, 11-12, 16;
(C)(C)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),所述式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(IV)化合物为人参皂苷Rg1(6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol),所述的突变蛋白包括SEQ ID NO.:6、11-12;Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), the compound of formula (IV) is ginsenoside Rg1 (6-O-β-(D-glucopyranosyl)-20-O-β-(D-glucopyranosyl)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 6, 11-12;
(D)(D)
其中,所述式(I)化合物是原人参三醇(protopanaxatriol),所述式(II)化合物是人参皂苷F1(20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),式(III)化合物为人参皂苷Rh1(6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol),所述的突变蛋白包括SEQ ID NO.:11-12。Wherein, the compound of formula (I) is protopanaxatriol, the compound of formula (II) is ginsenoside F1 (20-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), the compound of formula (III) is ginsenoside Rh1 (6-O-β-D-glucopyranosyl-20(S)-protopanaxatriol), and the mutant protein includes SEQ ID NO.: 11-12.
通常,除了具有上述核心氨基酸以外,本发明突变蛋白还可以具有以下一个或多个氨基酸:Generally, in addition to the above core amino acids, the mutant protein of the present invention may also have one or more of the following amino acids:
第10位氨基酸为V;The 10th amino acid is V;
第13位氨基酸为F;The 13th amino acid is F;
第82位氨基酸为C;和Amino acid at position 82 is C; and
第186位氨基酸为L。The 186th amino acid is L.
优选地,所述的突变蛋白如SEQ ID NO.:9-13、16所示,且所述的突变蛋白不是SEQID NO.:1-3所示的序列。Preferably, the mutant protein is as shown in SEQ ID NO.: 9-13, 16, and the mutant protein is not the sequence shown in SEQ ID NO.: 1-3.
本发明突变蛋白还包括将具有糖基转移酶活性的突变蛋白经C末端缺失(截短)后的突变蛋白,例如SEQ ID NO.:6-8所示的序列。The mutant proteins of the present invention also include mutant proteins obtained by deleting (truncating) the C-terminus of the mutant protein having glycosyltransferase activity, such as the sequences shown in SEQ ID NO.: 6-8.
应理解,本发明突变蛋白与SEQ ID NO.:13所示的序列相比,通常具有较高的同源性(相同性),优选地,所述的突变蛋白与SEQ ID NO.:13所示序列的同源性至少为80%,较佳地至少为85%-90%,更佳地至少为95%,最佳地至少为98%。It should be understood that the mutant protein of the present invention generally has a higher homology (identity) than the sequence shown in SEQ ID NO.: 13. Preferably, the homology of the mutant protein to the sequence shown in SEQ ID NO.: 13 is at least 80%, more preferably at least 85%-90%, more preferably at least 95%, and most preferably at least 98%.
此外,还可以对本发明突变蛋白进行修饰。修饰(通常不改变一级结构)形式包括:体内或体外的突变蛋白的化学衍生形式如乙酰化或羧基化。修饰还包括糖基化,如那些在突变蛋白的合成和加工中或进一步加工步骤中进行糖基化修饰而产生的突变蛋白。这种修饰可以通过将突变蛋白暴露于进行糖基化的酶(如哺乳动物的糖基化酶或去糖基化酶)而完成。修饰形式还包括具有磷酸化氨基酸残基(如磷酸酪氨酸,磷酸丝氨酸,磷酸苏氨酸)的序列。还包括被修饰从而提高了其抗蛋白水解性能或优化了溶解性能的突变蛋白。In addition, the mutant protein of the present invention can also be modified. Modifications (usually without changing the primary structure) include: chemical derivatization forms of mutant proteins in vivo or in vitro, such as acetylation or carboxylation. Modifications also include glycosylation, such as those mutant proteins produced by glycosylation modification during the synthesis and processing of mutant proteins or in further processing steps. This modification can be accomplished by exposing the mutant protein to an enzyme that performs glycosylation (such as a mammalian glycosylase or deglycosylation enzyme). Modified forms also include sequences with phosphorylated amino acid residues (such as phosphotyrosine, phosphoserine, phosphothreonine). Also included are mutant proteins that have been modified to improve their anti-proteolytic properties or optimize their solubility properties.
术语“编码突变蛋白的多核苷酸”可以是包括编码本发明突变蛋白的多核苷酸,也可以是还包括附加编码和/或非编码序列的多核苷酸。The term "polynucleotide encoding a mutant protein" may include a polynucleotide encoding a mutant protein of the present invention, or may include a polynucleotide further including additional coding and/or non-coding sequences.
本发明还涉及上述多核苷酸的变异体,其编码与本发明有相同的氨基酸序列的多肽或突变蛋白的片段、类似物和衍生物。这些核苷酸变异体包括取代变异体、缺失变异体和插入变异体。如本领域所知的,等位变异体是一个多核苷酸的替换形式,它可能是一个或多个核苷酸的取代、缺失或插入,但不会从实质上改变其编码的突变蛋白的功能。The present invention also relates to variants of the above-mentioned polynucleotides, which encode fragments, analogs and derivatives of polypeptides or mutant proteins having the same amino acid sequence as the present invention. These nucleotide variants include substitution variants, deletion variants and insertion variants. As known in the art, an allelic variant is a replacement form of a polynucleotide, which may be a substitution, deletion or insertion of one or more nucleotides, but will not substantially change the function of the mutant protein encoded by it.
本发明还涉及与上述的序列杂交且两个序列之间具有至少50%,较佳地至少70%,更佳地至少80%相同性的多核苷酸。本发明特别涉及在严格条件(或严紧条件)下与本发明所述多核苷酸可杂交的多核苷酸。在本发明中,“严格条件”是指:(1)在较低离子强度和较高温度下的杂交和洗脱,如0.2×SSC,0.1%SDS,60℃;或(2)杂交时加有变性剂,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)仅在两条序列之间的相同性至少在90%以上,更好是95%以上时才发生杂交。The present invention also relates to polynucleotides that hybridize to the above-mentioned sequences and have at least 50%, preferably at least 70%, and more preferably at least 80% identity between the two sequences. The present invention particularly relates to polynucleotides that can hybridize to the polynucleotides of the present invention under stringent conditions (or stringent conditions). In the present invention, "stringent conditions" refer to: (1) hybridization and elution at lower ionic strength and higher temperature, such as 0.2×SSC, 0.1% SDS, 60°C; or (2) the addition of denaturing agents during hybridization, such as 50% (v/v) formamide, 0.1% calf serum/0.1% Ficoll, 42°C, etc.; or (3) hybridization occurs only when the identity between the two sequences is at least 90%, preferably 95%.
本发明的突变蛋白和多核苷酸优选以分离的形式提供,更佳地,被纯化至均质。The mutant proteins and polynucleotides of the present invention are preferably provided in an isolated form, and more preferably, purified to homogeneity.
本发明多核苷酸全长序列通常可以通过PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据本发明所公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。The full-length sequence of the polynucleotide of the present invention can usually be obtained by PCR amplification, recombination or artificial synthesis. For PCR amplification, primers can be designed based on the relevant nucleotide sequences disclosed in the present invention, especially the open reading frame sequences, and commercially available cDNA libraries or cDNA libraries prepared by conventional methods known to those skilled in the art are used as templates to amplify and obtain the relevant sequences. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then splice the fragments amplified in each time together in the correct order.
一旦获得了有关的序列,就可以用重组法来大批量地获得有关序列。这通常是将其克隆入载体,再转入细胞,然后通过常规方法从增殖后的宿主细胞中分离得到有关序列。Once the relevant sequence is obtained, it can be obtained in large quantities by recombinant methods. This is usually done by cloning it into a vector, then transferring it into cells, and then isolating the relevant sequence from the propagated host cells by conventional methods.
此外,还可用人工合成的方法来合成有关序列,尤其是片段长度较短时。通常,通过先合成多个小片段,然后再进行连接可获得序列很长的片段。In addition, artificial synthesis methods can also be used to synthesize related sequences, especially when the fragment length is shorter. Usually, a long fragment of sequence can be obtained by synthesizing multiple small fragments first and then connecting them.
目前,已经可以完全通过化学合成来得到编码本发明蛋白(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中已知的各种现有的DNA分子(或如载体)和细胞中。此外,还可通过化学合成将突变引入本发明蛋白序列中。At present, the DNA sequence encoding the protein of the present invention (or its fragment, or its derivative) can be obtained completely by chemical synthesis. The DNA sequence can then be introduced into various existing DNA molecules (or vectors) and cells known in the art. In addition, mutations can also be introduced into the protein sequence of the present invention by chemical synthesis.
应用PCR技术扩增DNA/RNA的方法被优选用于获得本发明的多核苷酸。特别是很难从文库中得到全长的cDNA时,可优选使用RACE法(RACE-cDNA末端快速扩增法),用于PCR的引物可根据本文所公开的本发明的序列信息适当地选择,并可用常规方法合成。可用常规方法如通过凝胶电泳分离和纯化扩增的DNA/RNA片段。The method of amplifying DNA/RNA using PCR technology is preferably used to obtain the polynucleotides of the present invention. In particular, when it is difficult to obtain full-length cDNA from a library, the RACE method (RACE-cDNA terminal rapid amplification method) can be preferably used. The primers used for PCR can be appropriately selected based on the sequence information of the present invention disclosed herein, and can be synthesized by conventional methods. The amplified DNA/RNA fragments can be separated and purified by conventional methods such as by gel electrophoresis.
野生型糖基转移酶Wild-type glycosyltransferase
在本发明中,自糖基转移酶中筛选出具有C20-OH、C6-OH糖基化活性的系列基因进行研究。其中:In the present invention, a series of genes with C20-OH and C6-OH glycosylation activities were screened out from glycosyltransferases for research. Among them:
UGTPg101基因具有序列表中SEQ ID NO:17的核苷酸序列。自SEQ ID NO:17的5’端第1-1428位核苷酸为UGTPg1的开放阅读框,自SEQ ID NO:17的5’端的第1-3位核苷酸为UGTPg1基因的起始密码子ATG,自SEQ ID NO:17的5’端的第1426-1428位核苷酸为UGTPg1基因的终止密码子TAA。糖基转移酶基因UGTPg1编码一个含有475个氨基酸的蛋白质UGTPg1,具有SEQ ID NO:1的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为53.4kDa,等电点pI为5.01。自SEQ ID NO:1的氨基端的第344位氨基酸至第387位氨基酸为植物小分子糖基化修饰的糖基转移酶的保守PSPG基序。The UGTPg101 gene has the nucleotide sequence of SEQ ID NO:17 in the sequence list. The nucleotides 1-1428 from the 5' end of SEQ ID NO:17 are the open reading frame of UGTPg1, the nucleotides 1-3 from the 5' end of SEQ ID NO:17 are the start codon ATG of the UGTPg1 gene, and the nucleotides 1426-1428 from the 5' end of SEQ ID NO:17 are the stop codon TAA of the UGTPg1 gene. The glycosyltransferase gene UGTPg1 encodes a protein UGTPg1 containing 475 amino acids, having the amino acid residue sequence of SEQ ID NO:1. The theoretical molecular weight of the protein predicted by the software is 53.4 kDa, and the isoelectric point pI is 5.01. The amino acids 344 to 387 from the amino terminal of SEQ ID NO:1 are the conserved PSPG motifs of the glycosyltransferase modified by plant small molecule glycosylation.
UGTPg1基因具有序列表中SEQ ID NO:18的核苷酸序列。自SEQ ID NO:18的5’端第1-1428位核苷酸为UGTPg1的开放阅读框(Open Reading Frame,ORF),自SEQ ID NO:18的5’端的第1-3位核苷酸为UGTPg1基因的起始密码子ATG,自SEQ ID NO:18的5’端的第1426-1428位核苷酸为UGTPg1基因的终止密码子TAA。糖基转移酶基因UGTPg1编码一个含有475个氨基酸的蛋白质UGTPg1,具有SEQ ID NO:的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为53.4kDa,等电点pI为5.14。自SEQ ID NO:2的氨基端的第344位氨基酸至第387位氨基酸为为植物小分子糖基化修饰的糖基转移酶的保守PSPG(Plant SecondaryProduct Glycosyltransferase)基序。The UGTPg1 gene has the nucleotide sequence of SEQ ID NO: 18 in the sequence table. The nucleotides 1-1428 from the 5' end of SEQ ID NO: 18 are the open reading frame (ORF) of UGTPg1, the nucleotides 1-3 from the 5' end of SEQ ID NO: 18 are the start codon ATG of the UGTPg1 gene, and the nucleotides 1426-1428 from the 5' end of SEQ ID NO: 18 are the stop codon TAA of the UGTPg1 gene. The glycosyltransferase gene UGTPg1 encodes a protein UGTPg1 containing 475 amino acids, having the amino acid residue sequence of SEQ ID NO:, and the theoretical molecular weight of the protein predicted by the software is 53.4 kDa, and the isoelectric point pI is 5.14. The amino acid from position 344 to position 387 of the amino terminal of SEQ ID NO: 2 is a conserved PSPG (Plant Secondary Product Glycosyltransferase) motif of a glycosyltransferase that modifies plant small molecule glycosylation.
UGTPg102基因具有序列表中SEQ ID NO:19的核苷酸序列。自SEQ ID NO:19的5’端第1-1428位核苷酸为UGTPg1的开放阅读框,自SEQ ID NO:19的5’端的第1-3位核苷酸为UGTPg1基因的起始密码子ATG,自SEQ ID NO:19的5’端的第1426-1428位核苷酸为UGTPg1基因的终止密码子TAA。糖基转移酶基因UGTPg1编码一个含有475个氨基酸的蛋白质UGTPg1,具有SEQ ID NO:3的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为53.6kDa,等电点pI为5.08。自SEQ ID NO:3的氨基端的第344位氨基酸至第387位氨基酸为为植物小分子糖基化修饰的糖基转移酶的保守PSPG基序。The UGTPg102 gene has the nucleotide sequence of SEQ ID NO:19 in the sequence list. The nucleotides 1-1428 from the 5' end of SEQ ID NO:19 are the open reading frame of UGTPg1, the nucleotides 1-3 from the 5' end of SEQ ID NO:19 are the start codon ATG of the UGTPg1 gene, and the nucleotides 1426-1428 from the 5' end of SEQ ID NO:19 are the stop codon TAA of the UGTPg1 gene. The glycosyltransferase gene UGTPg1 encodes a protein UGTPg1 containing 475 amino acids, having the amino acid residue sequence of SEQ ID NO:3. The theoretical molecular weight of the protein predicted by the software is 53.6 kDa, and the isoelectric point pI is 5.08. The amino acids 344 to 387 from the amino terminal of SEQ ID NO:3 are the conserved PSPG motifs of the glycosyltransferase modified by plant small molecule glycosylation.
UGTPg100基因具有序列表中SEQ ID NO:20的核苷酸序列。自SEQ ID NO:20的5’端第1-1419位核苷酸为UGTPg1的开放阅读框,自SEQ ID NO:20的5’端的第1-3位核苷酸为UGTPg1基因的起始密码子ATG,自SEQ ID NO:20的5’端的第1417-1419位核苷酸为UGTPg1基因的终止密码子TAA。糖基转移酶基因UGTPg1编码一个含有472个氨基酸的蛋白质UGTPg1,具有SEQ ID NO:4的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为53.1kDa,等电点pI为5.08。自SEQ ID NO.:4的氨基端的第343位氨基酸至第386位氨基酸为为植物小分子糖基化修饰的糖基转移酶的保守PSPG基序。The UGTPg100 gene has the nucleotide sequence of SEQ ID NO:20 in the sequence list. The nucleotides 1-1419 from the 5' end of SEQ ID NO:20 are the open reading frame of UGTPg1, the nucleotides 1-3 from the 5' end of SEQ ID NO:20 are the start codon ATG of the UGTPg1 gene, and the nucleotides 1417-1419 from the 5' end of SEQ ID NO:20 are the stop codon TAA of the UGTPg1 gene. The glycosyltransferase gene UGTPg1 encodes a protein UGTPg1 containing 472 amino acids, having the amino acid residue sequence of SEQ ID NO:4. The theoretical molecular weight of the protein predicted by the software is 53.1 kDa, and the isoelectric point pI is 5.08. The amino acids 343 to 386 from the amino terminal of SEQ ID NO.:4 are the conserved PSPG motifs of glycosyltransferases for plant small molecule glycosylation modification.
UGTPg103基因具有序列表中SEQ ID NO:21的核苷酸序列。自SEQ ID NO:21的5’端第1-1419位核苷酸为UGTPg1的开放阅读框,自SEQ ID NO:21的5’端的第1-3位核苷酸为UGTPg1基因的起始密码子ATG,自SEQ ID NO:21的5’端的第1417-1419位核苷酸为UGTPg1基因的终止密码子TAA。糖基转移酶基因UGTPg1编码一个含有472个氨基酸的蛋白质UGTPg1,具有SEQ ID NO:5的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为53.2kDa,等电点pI为5.23。自SEQ ID NO.:5的氨基端的第343位氨基酸至第386位氨基酸为为植物小分子糖基化修饰的糖基转移酶的保守PSPG基序。The UGTPg103 gene has the nucleotide sequence of SEQ ID NO:21 in the sequence list. The nucleotides 1-1419 from the 5' end of SEQ ID NO:21 are the open reading frame of UGTPg1, the nucleotides 1-3 from the 5' end of SEQ ID NO:21 are the start codon ATG of the UGTPg1 gene, and the nucleotides 1417-1419 from the 5' end of SEQ ID NO:21 are the stop codon TAA of the UGTPg1 gene. The glycosyltransferase gene UGTPg1 encodes a protein UGTPg1 containing 472 amino acids, having the amino acid residue sequence of SEQ ID NO:5. The theoretical molecular weight of the protein predicted by the software is 53.2 kDa, and the isoelectric point pI is 5.23. The amino acids 343 to 386 from the amino terminal of SEQ ID NO.:5 are the conserved PSPG motifs of glycosyltransferases for plant small molecule glycosylation modification.
上述涉及的野生蛋白、本发明突变蛋白及其衍生蛋白的序列信息如表2所示:The sequence information of the wild protein, mutant protein of the present invention and its derivative protein involved above is shown in Table 2:
表2Table 2
表达载体Expression vector
本发明也涉及包含本发明的多核苷酸的载体,以及用本发明的载体或本发明突变蛋白编码序列经基因工程产生的宿主细胞,以及经重组技术产生本发明所述多肽的方法。The present invention also relates to a vector comprising the polynucleotide of the present invention, a host cell produced by genetic engineering using the vector of the present invention or the mutant protein coding sequence of the present invention, and a method for producing the polypeptide of the present invention by recombinant technology.
通过常规的重组DNA技术,可利用本发明的多聚核苷酸序列可用来表达或生产重组的突变蛋白。一般来说有以下步骤:The polynucleotide sequences of the present invention can be used to express or produce recombinant mutant proteins by conventional recombinant DNA techniques. Generally, the following steps are involved:
(1).用本发明的编码本发明突变蛋白的多核苷酸(或变异体),或用含有该多核苷酸的重组表达载体转化或转导合适的宿主细胞;(1) Transforming or transducing a suitable host cell with a polynucleotide (or variant) encoding a mutant protein of the present invention, or a recombinant expression vector containing the polynucleotide;
(2).在合适的培养基中培养的宿主细胞;(2) Host cells cultured in a suitable culture medium;
(3).从培养基或细胞中分离、纯化蛋白质(3) Isolation and purification of proteins from culture medium or cells
本发明中,编码突变蛋白的多核苷酸序列可插入到重组表达载体中。术语“重组表达载体”指本领域熟知的细菌质粒、噬菌体、酵母质粒、植物细胞病毒、哺乳动物细胞病毒如腺病毒、逆转录病毒或其他载体。只要能在宿主体内复制和稳定,任何质粒和载体都可以用。表达载体的一个重要特征是通常含有复制起点、启动子、标记基因和翻译控制元件。In the present invention, the polynucleotide sequence encoding the mutant protein can be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to bacterial plasmids, bacteriophages, yeast plasmids, plant cell viruses, mammalian cell viruses such as adenoviruses, retroviruses or other vectors well known in the art. As long as they can replicate and be stable in the host, any plasmid and vector can be used. An important feature of an expression vector is that it usually contains a replication origin, a promoter, a marker gene and a translation control element.
本领域的技术人员熟知的方法能用于构建含本发明突变蛋白编码DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。这些启动子的代表性例子有:大肠杆菌的lac或trp启动子;λ噬菌体PL启动子;真核启动子包括CMV立即早期启动子、HSV胸苷激酶启动子、早期和晚期SV40启动子、反转录病毒的LTRs和其他一些已知的可控制基因在原核或真核细胞或其病毒中表达的启动子。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。Methods well known to those skilled in the art can be used to construct expression vectors containing the DNA sequence encoding the mutant protein of the present invention and appropriate transcription/translation control signals. These methods include in vitro recombinant DNA technology, DNA synthesis technology, in vivo recombination technology, etc. The DNA sequence can be effectively linked to an appropriate promoter in the expression vector to guide mRNA synthesis. Representative examples of these promoters include: lac or trp promoters of Escherichia coli; λ phage PL promoter; eukaryotic promoters include CMV immediate early promoter, HSV thymidine kinase promoter, early and late SV40 promoter, retroviral LTRs and other known promoters that can control gene expression in prokaryotic or eukaryotic cells or their viruses. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状,如真核细胞培养用的二氢叶酸还原酶、新霉素抗性以及绿色荧光蛋白(GFP),或用于大肠杆菌的四环素或氨苄青霉素抗性。In addition, the expression vector preferably contains one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells, such as dihydrofolate reductase, neomycin resistance and green fluorescent protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for Escherichia coli.
包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。The vector containing the above-mentioned appropriate DNA sequence and an appropriate promoter or control sequence can be used to transform appropriate host cells to enable them to express proteins.
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有:大肠杆菌,链霉菌属;鼠伤寒沙门氏菌的细菌细胞;真菌细胞如酵母、植物细胞(如人参细胞)。Host cells can be prokaryotic cells, such as bacterial cells; or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as mammalian cells. Representative examples include: Escherichia coli, Streptomyces; bacterial cells of Salmonella typhimurium; fungal cells such as yeast, plant cells (such as ginseng cells).
本发明的多核苷酸在高等真核细胞中表达时,如果在载体中插入增强子序列时将会使转录得到增强。增强子是DNA的顺式作用因子,通常大约有10到300个碱基对,作用于启动子以增强基因的转录。可举的例子包括在复制起始点晚期一侧的100到270个碱基对的SV40增强子、在复制起始点晚期一侧的多瘤增强子以及腺病毒增强子等。When the polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription will be enhanced if an enhancer sequence is inserted into the vector. Enhancers are cis-acting factors of DNA, usually about 10 to 300 base pairs, which act on the promoter to enhance gene transcription. Examples include the SV40 enhancer of 100 to 270 base pairs on the late side of the replication origin, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
本领域一般技术人员都清楚如何选择适当的载体、启动子、增强子和宿主细胞。Those skilled in the art will appreciate how to select appropriate vectors, promoters, enhancers and host cells.
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。当宿主为原核生物如大肠杆菌时,能吸收DNA的感受态细胞可在指数生长期后收获,用CaCl2法处理,所用的步骤在本领域众所周知。另一种方法是使用MgCl2。如果需要,转化也可用电穿孔的方法进行。当宿主是真核生物,可选用如下的DNA转染方法:磷酸钙共沉淀法,常规机械方法如显微注射、电穿孔、脂质体包装等。Transformation of host cells with recombinant DNA can be carried out using conventional techniques well known to those skilled in the art. When the host is a prokaryotic organism such as Escherichia coli, competent cells that can absorb DNA can be harvested after the exponential growth phase and treated with the CaCl2 method, the steps used are well known in the art. Another method is to use MgCl2. If necessary, transformation can also be carried out using electroporation. When the host is a eukaryotic organism, the following DNA transfection methods can be selected: calcium phosphate coprecipitation method, conventional mechanical methods such as microinjection, electroporation, liposome packaging, etc.
获得的转化子可以用常规方法培养,表达本发明的基因所编码的多肽。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。The obtained transformant can be cultured by conventional methods to express the polypeptide encoded by the gene of the present invention. Depending on the host cell used, the culture medium used in the culture can be selected from various conventional culture media. Culture is carried out under conditions suitable for the growth of the host cells. After the host cells grow to an appropriate cell density, the selected promoter is induced by a suitable method (such as temperature conversion or chemical induction), and the cells are cultured for a period of time.
在上面的方法中的重组多肽可在细胞内、或在细胞膜上表达、或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化重组的蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并不限于:常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。The recombinant polypeptide in the above method can be expressed in the cell, on the cell membrane, or secreted outside the cell. If necessary, the recombinant protein can be separated and purified by various separation methods using its physical, chemical and other properties. These methods are well known to those skilled in the art. Examples of these methods include but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (salting out method), centrifugation, osmotic sterilization, ultra-treatment, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography techniques and combinations of these methods.
本发明有益效果:Beneficial effects of the present invention:
经大量筛选和改造,本发明发现了糖基转移酶催化活性位点,改造了相关位点后,能够将原本没有催化活性的糖基转移酶改造为具有活性的新蛋白,而对高度同源性的蛋白进行关键位点改造后可以人为改变其底物专一性。此外,在进一步缺失了C端数个氨基酸后,还能够有效地提高突变蛋白的表达量。After extensive screening and modification, the present invention discovered the catalytic active site of glycosyltransferase. After modifying the relevant site, the glycosyltransferase that originally had no catalytic activity can be transformed into a new protein with activity, and the substrate specificity of highly homologous proteins can be artificially changed after modifying the key sites. In addition, after further deleting several amino acids at the C-terminus, the expression level of the mutant protein can be effectively increased.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring HarborLaboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。除非另外说明,否则百分比和份数是重量百分比和重量份数。The present invention will be further described below in conjunction with specific examples. It should be understood that these examples are intended to illustrate the present invention only and are not intended to limit the scope of the present invention. The experimental methods in the following examples where specific conditions are not specified are usually performed under conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or under conditions recommended by the manufacturer. Unless otherwise indicated, percentages and parts are weight percentages and weight parts.
实施例1.糖基转移酶UGTPg1、UGTPg101、UGTPg102、UGTPg100和UGTPg103在大肠杆菌BL21(DE3)中的表达与催化功能鉴定Example 1. Expression and catalytic function identification of glycosyltransferases UGTPg1, UGTPg101, UGTPg102, UGTPg100 and UGTPg103 in Escherichia coli BL21 (DE3)
分别合成如SEQ ID NO:22和SEQ ID NO:23核苷酸序列的两条引物。在合成的引物SEQ ID NO:22和SEQ ID NO:23两端分别设置BamHI和XhoI两个酶切位点,分别以pMDT-UGTPg1、pMDT-UGTPg101、pMDT-UGTPg102、pMDT-UGTPg100和pMDT-UGTPg103为模板进行PCR。Two primers with nucleotide sequences of SEQ ID NO: 22 and SEQ ID NO: 23 were synthesized respectively. Two restriction sites, BamHI and XhoI, were set at both ends of the synthesized primers SEQ ID NO: 22 and SEQ ID NO: 23, and PCR was performed using pMDT-UGTPg1, pMDT-UGTPg101, pMDT-UGTPg102, pMDT-UGTPg100 and pMDT-UGTPg103 as templates.
PCR扩增程序:94℃2min;94℃15s,58℃30s,68℃1.5min,共35个循环;68℃10min,降至10℃。PCR产物经琼脂糖凝胶电泳分离、回收后经BamHI和XhoI双酶切,利用NEB公司的T4DNA连接酶连入同样经BamHI和XhoI双酶切的pET28a载体(Novagen公司)中。PCR amplification program: 94°C for 2 min; 94°C for 15 s, 58°C for 30 s, 68°C for 1.5 min, for a total of 35 cycles; 68°C for 10 min, then reduced to 10°C. The PCR products were separated and recovered by agarose gel electrophoresis, then double-digested with BamHI and XhoI, and ligated into the pET28a vector (Novagen) that was also double-digested with BamHI and XhoI using T4 DNA ligase from NEB.
所获得的重组质粒命名为pET28a-UGTPg1、pET28a-UGTPg101、pET28a-UGTPg102、pET28a-UGTPg100和pET28a-UGTPg103。将这些重组质粒分别转化大肠杆菌BL21(DE3)(Novagen公司)中构建重组菌pET28a-UGTPg1-BL21、pET28a-UGTPg101-BL21、pET28a-UGTPg102-BL21、pET28a-UGTPg100-BL21和pET28a-UGTPg103-BL21。The obtained recombinant plasmids were named pET28a-UGTPg1, pET28a-UGTPg101, pET28a-UGTPg102, pET28a-UGTPg100 and pET28a-UGTPg103. These recombinant plasmids were transformed into Escherichia coli BL21 (DE3) (Novagen) to construct recombinant bacteria pET28a-UGTPg1-BL21, pET28a-UGTPg101-BL21, pET28a-UGTPg102-BL21, pET28a-UGTPg100-BL21 and pET28a-UGTPg103-BL21.
分别从平板挑取这5个重组菌株单克隆接种至含有50μg/mL卡那霉素的LB试管中37℃200rpm震荡培养过夜,按1%的比例接种至50mL LB培养基中37℃200rpm震荡培养至OD600为0.6-0.8,冰水浴冷却菌液,加入终浓度为0.1mM的IPTG,16℃下110rpm诱导18h。12000g,离心3min收集菌体,每克湿重菌体加入10mL PBS buffer(pH8.0)重悬后裂解菌体,12000g离心20min取上清作为粗酶液。同时诱导含有pET28a空载体的BL21菌株pET28a-BL21制备粗酶液作为后续实施例的对照。The five recombinant strains were picked from the plate and inoculated into LB test tubes containing 50 μg/mL kanamycin, shaken and cultured at 37°C 200rpm overnight, and inoculated into 50mL LB medium at a ratio of 1%, shaken and cultured at 37°C 200rpm until OD600 was 0.6-0.8, cooled in an ice-water bath, and IPTG was added at a final concentration of 0.1mM, and induced at 110rpm at 16°C for 18h. 12000g, centrifuged for 3min to collect the cells, added 10mL PBS buffer (pH8.0) per gram of wet weight of the cells, and then the cells were lysed after resuspending, and centrifuged at 12000g for 20min to obtain the supernatant as a crude enzyme solution. At the same time, the BL21 strain pET28a-BL21 containing the pET28a empty vector was induced to prepare a crude enzyme solution as a control for subsequent examples.
糖基转移酶催化人参皂苷苷元合成稀有人参皂苷的反应体系如下:200μL的反应液中包含PBS buffer(pH8.0),5mM UDP-葡萄糖,0.5mM原人参三醇(PPT),1%吐温-20和150μL粗酶液。40℃水浴条件下反应4h,加入等体积的正丁醇终止反应并进行抽提,取正丁醇相真空浓缩,产物溶解于甲醇中,HPLC结果如图2。The reaction system for synthesizing rare ginsenosides from ginsenoside aglycones catalyzed by glycosyltransferase is as follows: 200 μL of reaction solution contains PBS buffer (pH 8.0), 5 mM UDP-glucose, 0.5 mM protopanaxatriol (PPT), 1% Tween-20 and 150 μL crude enzyme solution. The reaction is carried out in a 40°C water bath for 4 h, and an equal volume of n-butanol is added to terminate the reaction and extract. The n-butanol phase is concentrated in vacuo, and the product is dissolved in methanol. The HPLC results are shown in Figure 2.
结果显示UGTPg1催化PPT生成F1,即在C20-OH加上一个糖基;UGTPg101催化PPT的C20-OH糖基化先生F1,再在F1的C6-OH糖基化生成少量Rg1;UGTPg100催化PPT生成Rh1,即在PPT的C6-OH加上一分子葡萄糖;而UGTPg102和UGTPg103没有检测到酶活性。The results showed that UGTPg1 catalyzed PPT to produce F1, i.e., adding a sugar group to C20-OH; UGTPg101 catalyzed the glycosylation of C20-OH of PPT to produce F1, and then glycosylated F1 at C6-OH to produce a small amount of Rg1; UGTPg100 catalyzed PPT to produce Rh1, i.e., adding a molecule of glucose to C6-OH of PPT; while no enzyme activity was detected for UGTPg102 and UGTPg103.
实施例2.决定糖基转移酶UGTPg1和UGTPg102催化C20-OH糖基化的关键氨基酸位点Example 2. Key amino acid sites determining glycosyltransferase UGTPg1 and UGTPg102 catalyzing C20-OH glycosylation
通过SWISS-MODEL对UGTPg1进行蛋白质同源建模,并用PyMOL软件(DeLanoScientific)进行三维结构的分析。H15和D117是UGTPg1保守的催化活性位点,在这两个位点附近找到了6个UGTPg1和UGTPg102不同的氨基酸位点,即UGTPg1的L38、A40、F85、T134、H144和Q388,分别对应于UGTPg102的F38、S40、L85、I134、Y144和H388。将UGTPg1的这6个氨基酸分别突变为UGTPg102对应的氨基酸,并在大肠杆菌BL21(DE3)中进行表达,测定酶活性。Protein homology modeling of UGTPg1 was performed by SWISS-MODEL, and the three-dimensional structure was analyzed by PyMOL software (DeLanoScientific). H15 and D117 are the conserved catalytic active sites of UGTPg1. Six different amino acid sites between UGTPg1 and UGTPg102 were found near these two sites, namely L38, A40, F85, T134, H144 and Q388 of UGTPg1, which correspond to F38, S40, L85, I134, Y144 and H388 of UGTPg102, respectively. These six amino acids of UGTPg1 were mutated to the corresponding amino acids of UGTPg102, and expressed in Escherichia coli BL21 (DE3), and the enzyme activity was measured.
合成分别具有序列表中SEQ ID NO:24和SEQ ID NO:25核苷酸序列的两条引物。以质粒pET28a-UGTPg1为模板,利用Stratagen公司的QuickChange II site-directedmutagenesis kit进行UGTPg1的L38位点进行定点突变。PCR扩增程序为:95℃30s;95℃30s,55℃1min,68℃7min,共16个循环;降至10℃。PCR产物用DpnI酶切,37℃水浴反应2h,取10μL转化大肠杆菌TOP 10感受态。Synthesize two primers with nucleotide sequences of SEQ ID NO:24 and SEQ ID NO:25 in the sequence table. Use plasmid pET28a-UGTPg1 as template and use Stratagen's QuickChange II site-directed mutagenesis kit to perform site-directed mutagenesis at the L38 site of UGTPg1. The PCR amplification program is: 95℃30s; 95℃30s, 55℃1min, 68℃7min, 16 cycles in total; reduce to 10℃. The PCR product is digested with DpnI, reacted in a water bath at 37℃ for 2h, and 10μL is taken to transform Escherichia coli TOP 10 competent cells.
挑取单克隆并抽提质粒,测序验证突变位点,所获得的质粒命名为pET28a-UGTPg1-L38F。将质粒pET28a-UGTPg1-L38F转化到大肠杆菌BL21(DE3),构建重组菌株pET28a-UGTPg1-L38F-BL21,诱导表达的步骤同实施例1。同样,对应构建UGTPg1突变后的重组质粒pET28a-UGTPg1-A40S,pET28a-UGTPg1-F85L,pET28a-UGTPg1-T143I,pET28a-UGTPg1-H144Y和pET28a-UGTPg1-Q388H,并构建了重组菌株pET28a-UGTPg1-A40S-BL21,pET28a-UGTPg1-F85L-BL21,pET28a-UGTPg1-T143I-BL21,pET28a-UGTPg1-H144Y-BL21和pET28a-UGTPg1-Q388H-BL21。定点突变的方法同上(点突变引物序列分别为SEQ ID NO:26-SEQ ID NO:35),诱导表达和酶活测定的方法同实施例1。Single clones were picked and plasmids were extracted. The mutation sites were verified by sequencing. The obtained plasmid was named pET28a-UGTPg1-L38F. The plasmid pET28a-UGTPg1-L38F was transformed into Escherichia coli BL21 (DE3) to construct the recombinant strain pET28a-UGTPg1-L38F-BL21. The steps of inducing expression were the same as those in Example 1. Similarly, corresponding recombinant plasmids pET28a-UGTPg1-A40S, pET28a-UGTPg1-F85L, pET28a-UGTPg1-T143I, pET28a-UGTPg1-H144Y and pET28a-UGTPg1-Q388H after UGTPg1 mutation were constructed, and recombinant strains pET28a-UGTPg1-A40S-BL21, pET28a-UGTPg1-F85L-BL21, pET28a-UGTPg1-T143I-BL21, pET28a-UGTPg1-H144Y-BL21 and pET28a-UGTPg1-Q388H-BL21 were constructed. The method of site-directed mutagenesis is the same as above (the sequences of the primers for point mutations are SEQ ID NO: 26-SEQ ID NO: 35, respectively), and the methods of inducing expression and enzyme activity determination are the same as those in Example 1.
结果显示:UGTPg1-L38F,UGTPg1-A40S,UGTPg1-F85L,UGTPg1-T143I和UGTPg1-Q388H仍具有催化C20-OH糖基化的酶活性,即催化PPT生成F1,而UGTPg1-H144Y没有检测到酶活性(图3)。结果表明,H144对于UGTPg1的催化功能非常重要。The results showed that UGTPg1-L38F, UGTPg1-A40S, UGTPg1-F85L, UGTPg1-T143I and UGTPg1-Q388H still had the enzyme activity to catalyze the glycosylation of C20-OH, that is, catalyze PPT to generate F1, while UGTPg1-H144Y had no enzyme activity detected (Figure 3). The results showed that H144 is very important for the catalytic function of UGTPg1.
合成分别具有序列表中SEQ ID NO:36和SEQ ID NO:37核苷酸序列的两条引物。以质粒pET28a-UGTPg102为模板,利用Stratagen公司的QuickChange II site-directedmutagenesis kit对UGTPg102的Y144位点进行定点突变,构建了UGTPg102突变后的重组质粒pET28a-UGTPg102-Y144H和重组菌株pET28a-UGTPg102-Y144H-BL21。定点突变的方法同上,诱导表达和酶活测定的方法同实施例2。UGTPg102-Y144H获得了催化C20-OH糖基化的功能,即能够催化PPT生成F1(图3)。Synthesize two primers having the nucleotide sequences of SEQ ID NO:36 and SEQ ID NO:37 in the sequence table, respectively. Using plasmid pET28a-UGTPg102 as a template, the Y144 site of UGTPg102 was subjected to site-directed mutagenesis using Stratagen's QuickChange II site-directed mutagenesis kit to construct the recombinant plasmid pET28a-UGTPg102-Y144H and recombinant strain pET28a-UGTPg102-Y144H-BL21 after the mutation of UGTPg102. The method of site-directed mutagenesis is the same as above, and the methods of induced expression and enzyme activity determination are the same as in Example 2. UGTPg102-Y144H has acquired the function of catalyzing C20-OH glycosylation, that is, it can catalyze PPT to generate F1 (Figure 3).
结论:UGTPg1和UGTPg102氨基酸序列的identity高达95.4%,而UGTPg1具有C20-OH糖基转移酶活性,UGTPg102没有酶活性,而本实施例证明,是关键氨基酸的替换导致UGTPg102了催化功能的丢失。Conclusion: The identity of the amino acid sequences of UGTPg1 and UGTPg102 is as high as 95.4%, and UGTPg1 has C20-OH glycosyltransferase activity, while UGTPg102 has no enzyme activity. This example proves that the replacement of key amino acids leads to the loss of catalytic function of UGTPg102.
实施例3.决定糖基转移酶UGTPg100和UGTPg103催化C6-OH糖基化的关键氨基酸位点Example 3. Key amino acid sites determining glycosyltransferases UGTPg100 and UGTPg103 catalyzing C6-OH glycosylation
UGTPg100和UGTPg103氨基酸序列的identity高达98.7%,即两者之间仅有6个氨基酸的差异。通过SWISS-MODEL对UGTPg100进行蛋白质同源建模,并用PyMOL软件(DeLanoScientific)进行三维结构的分析。H15和D117是UGTPg100保守的催化活性位点,在这两个位点附近找到了2个UGTPg100和UGTPg103不同的氨基酸位点,即UGTPg100的A142和L186,分别对应于UGTPg102的T142和S186。另外两个个氨基酸位点分别位于C端和N端结构域,在UGTPg100中为G338和W205,对应于UGTPg103的R338和R205。将UGTPg100的这4个氨基酸分别突变为UGTPg103对应的氨基酸,并在大肠杆菌BL21(DE3)中进行表达,测定酶活性。另外,有两个氨基酸没有仔细研究,因为他们氨基酸侧链的结构和性质非常相似,相互替换可能对酶活的影响很小,即UGTPg100中的V111和I349分别对应于UGTPg103中的I111和V349。The identity of the amino acid sequences of UGTPg100 and UGTPg103 is as high as 98.7%, that is, there are only 6 amino acid differences between the two. Protein homology modeling of UGTPg100 was performed using SWISS-MODEL, and the three-dimensional structure was analyzed using PyMOL software (DeLanoScientific). H15 and D117 are the conserved catalytic active sites of UGTPg100. Two different amino acid sites between UGTPg100 and UGTPg103 were found near these two sites, namely A142 and L186 of UGTPg100, corresponding to T142 and S186 of UGTPg102, respectively. The other two amino acid sites are located in the C-terminal and N-terminal domains, respectively, which are G338 and W205 in UGTPg100, corresponding to R338 and R205 of UGTPg103. The four amino acids of UGTPg100 were mutated to the corresponding amino acids of UGTPg103, and the enzyme activity was measured after expression in E. coli BL21 (DE3). In addition, two amino acids were not studied in detail because the structures and properties of their amino acid side chains are very similar, and mutual substitution may have little effect on enzyme activity, that is, V111 and I349 in UGTPg100 correspond to I111 and V349 in UGTPg103, respectively.
以质粒pET28a-UGTPg100为模板,利用Stratagen公司的QuickChange II site-directed mutagenesis kit对UGTPg100的A142,L186,W205和G338位点进行定点突变,构建了UGTPg100突变后的重组质粒pET28a-UGTPg100-A142T,pET28a-UGTPg100-L186S,pET28a-UGTPg100-W205R和pET28a-UGTPg100-G338R(点突变引物序列分别为SEQ ID NO:38-SEQ IDNO:45),以及重组菌株pET28a-UGTPg100-A142T-BL21,pET28a-UGTPg100-L186S-BL21,pET28a-UGTPg100-W205-BL21和pET28a-UGTPg100-G338R-BL21。定点突变的方法同上,诱导表达和酶活测定的方法同实施例1。Plasmid pET28a-UGTPg100 was used as a template, and the A142, L186, W205 and G338 sites of UGTPg100 were subjected to site-directed mutagenesis using Stratagen's QuickChange II site-directed mutagenesis kit to construct the mutated recombinant plasmids pET28a-UGTPg100-A142T, pET28a-UGTPg100-L186S, pET28a-UGTPg100-W205R and pET28a-UGTPg100-G338R (the sequences of the point mutation primers are SEQ ID NO: 38-SEQ ID NO: 38, respectively). IDNO: 45), and recombinant strains pET28a-UGTPg100-A142T-BL21, pET28a-UGTPg100-L186S-BL21, pET28a-UGTPg100-W205-BL21 and pET28a-UGTPg100-G338R-BL21. The method of site-directed mutagenesis is the same as above, and the method of induced expression and enzyme activity determination is the same as Example 1.
结果如图4所示,UGTPg100-A142T,UGTPg100-L186S和UGTPg100-G338R都失去了C6-OH糖基转移酶的活性;而UGTPg100-W205R仍然保留着酶活性。The results are shown in FIG4 . UGTPg100-A142T, UGTPg100-L186S and UGTPg100-G338R all lost the activity of C6-OH glycosyltransferase; whereas UGTPg100-W205R still retained the enzyme activity.
以质粒pET28a-UGTPg103为模板,利用Stratagen公司的QuickChange II site-directed mutagenesis kit对UGTPg100的T142,S186和R338位分别进行定点突变(点突变引物序列分别为SEQ ID NO:46-SEQ ID NO:47),构建了UGTPg103突变后的重组质粒pET28a-UGTPg103-T142A,pET28a-UGTPg103-S186L以及重组菌株pET28a-UGTPg103-T142A-BL21。定点突变的方法同上,诱导表达和酶活测定的方法同实施例2。如图4,UGTPg103-T142A并没有检测到酶活性。Using plasmid pET28a-UGTPg103 as a template, the QuickChange II site-directed mutagenesis kit of Stratagen was used to perform site-directed mutagenesis on T142, S186 and R338 of UGTPg100 (the sequences of the point mutation primers were SEQ ID NO:46-SEQ ID NO:47, respectively), and the recombinant plasmids pET28a-UGTPg103-T142A, pET28a-UGTPg103-S186L and the recombinant strain pET28a-UGTPg103-T142A-BL21 after mutating UGTPg103 were constructed. The method of site-directed mutagenesis was the same as above, and the methods of induced expression and enzyme activity determination were the same as in Example 2. As shown in Figure 4, no enzyme activity was detected in UGTPg103-T142A.
以质粒pET28a-UGTPg103-T142A为模板,核苷酸序列SEQ ID NO:48和SEQ ID NO:49为引物,利用Stratagen公司的QuickChange II site-directed mutagenesis kit对pET28a-UGTPg103-T142A的S186位点进行定点突变,构建了UGTPg103双突变后的重组质粒pET28a-UGTPg103-T142A-S 186L,以及重组菌株pET28a-UGTPg103-T142A-S186L-BL21。类似地,构建了UGTPg103三突变的重组质粒pET28a-UGTPg103-T142A-S186L-R338G,以及重组菌株pET28a-UGTPg103-T142A-S186L-R338G-BL21。定点突变的方法同上,诱导表达和酶活测定的方法同实施例1。Using plasmid pET28a-UGTPg103-T142A as template and nucleotide sequences SEQ ID NO:48 and SEQ ID NO:49 as primers, the S186 site of pET28a-UGTPg103-T142A was site-directed mutagenesis kit from Stratagen was used to construct the recombinant plasmid pET28a-UGTPg103-T142A-S186L with double mutation of UGTPg103 and the recombinant strain pET28a-UGTPg103-T142A-S186L-BL21. Similarly, the recombinant plasmid pET28a-UGTPg103-T142A-S186L-R338G and the recombinant strain pET28a-UGTPg103-T142A-S186L-R338G-BL21 with triple mutations of UGTPg103 were constructed. The site-directed mutagenesis method was the same as above, and the methods of induced expression and enzyme activity determination were the same as in Example 1.
结果如图4,UGTPg103-T142A-S186L也没有催化C6-OH糖基化的功能,而UGTPg103-T142A-S186L-R338G获得了催化C6-OH糖基化的功能,即能够催化PPT生成Rh1。The results are shown in Figure 4. UGTPg103-T142A-S186L also does not have the function of catalyzing C6-OH glycosylation, while UGTPg103-T142A-S186L-R338G has the function of catalyzing C6-OH glycosylation, that is, it can catalyze PPT to produce Rh1.
结论:UGTPg100和UGTPg103氨基酸序列的identity高达98.7%,即两者之间仅有6个氨基酸的差异,而UGTPg100具有C6-OH糖基转移酶活性,UGTPg103没有酶活性,说明关键氨基酸的替换导致UGTPg103了催化功能的丢失。Conclusion: The identity of the amino acid sequences of UGTPg100 and UGTPg103 is as high as 98.7%, that is, there are only 6 amino acids different between the two. UGTPg100 has C6-OH glycosyltransferase activity, while UGTPg103 has no enzyme activity, indicating that the replacement of key amino acids has led to the loss of catalytic function of UGTPg103.
实施例4.决定糖基转移酶UGTPg1和UGTPg100底物区域专一性的关键氨基酸位点和区域Example 4. Key amino acid positions and regions that determine the substrate regiospecificity of glycosyltransferases UGTPg1 and UGTPg100
UGTPg1和UGTPg100氨基酸序列的identity高达84.1%,而UGTPg1具有C20-OH糖基转移酶活性,UGTPg100具有C6-OH糖基转移酶活性,说明关键氨基酸的替换改变了其底物专一性。通过SWISS-MODEL对UGTPg1和UGTPg100进行蛋白质同源建模,并用PyMOL软件(DeLanoScientific)进行三维结构的分析,如图。H15和D117是UGTPg1和UGTPg100保守的催化活性位点,在这两个位点附近找到了4个UGTPg1和UGTPg100不同的氨基酸位点,即UGTPg1的A10、I13、H82和H144,分别对应于UGTPg100的V10、F13、C82和F144。将UGTPg1的这4个氨基酸分别突变为UGTPg100对应的氨基酸,并在大肠杆菌BL21(DE3)中进行表达,测定酶活性。The identity of the amino acid sequences of UGTPg1 and UGTPg100 is as high as 84.1%, and UGTPg1 has C20-OH glycosyltransferase activity, while UGTPg100 has C6-OH glycosyltransferase activity, indicating that the replacement of key amino acids has changed their substrate specificity. Protein homology modeling of UGTPg1 and UGTPg100 was performed by SWISS-MODEL, and the three-dimensional structure was analyzed by PyMOL software (DeLanoScientific), as shown in the figure. H15 and D117 are the conserved catalytic active sites of UGTPg1 and UGTPg100. Four different amino acid sites of UGTPg1 and UGTPg100 were found near these two sites, namely A10, I13, H82 and H144 of UGTPg1, corresponding to V10, F13, C82 and F144 of UGTPg100, respectively. The four amino acids of UGTPg1 were mutated into the corresponding amino acids of UGTPg100, respectively, and expressed in Escherichia coli BL21 (DE3), and the enzyme activity was measured.
合成分别具有序列表中SEQ ID NO:52和SEQ ID NO:53,SEQ ID NO:54和SEQ IDNO:55,SEQ ID NO:56和SEQ ID NO:57,SEQ ID NO:58和SEQ ID NO:59核苷酸序列作为引物。以质粒pET28a-UGTPg1为模板,利用Stratagen公司的QuickChange II site-directedmutagenesis kit对UGTPg1的A10、I13、H82和H144位点进行定点突变,构建了UGTPg1突变后的重组质粒pET28a-UGTPg1-A10V、pET28a-UGTPg1-I13F、pET28a-UGTPg1-H82C和pET28a-UGTPg1-H144F,以及重组菌株pET28a-UGTPg1-A10V-BL21、pET28a-UGTPg1-I13F-BL21、pET28a-UGTPg1-H82C-BL21和pET28a-UGTPg1-H144F-BL21。定点突变的方法同上,诱导表达和酶活测定的方法同实施例2。如图5,UGTPg1-A10V,UGTPg1-I13F和UGTPg1-H82C的C20-OH糖基转移酶的活性降低了;UGTPg1-H82C和UGTPg1-H144F获得了催化C6-OH糖基化的功能,但UGTPg1-H144F催化C6-OH糖基化的活性很弱。The nucleotide sequences of SEQ ID NO: 52 and SEQ ID NO: 53, SEQ ID NO: 54 and SEQ ID NO: 55, SEQ ID NO: 56 and SEQ ID NO: 57, SEQ ID NO: 58 and SEQ ID NO: 59 in the sequence listing were synthesized as primers. Using plasmid pET28a-UGTPg1 as a template, the A10, I13, H82 and H144 sites of UGTPg1 were subjected to site-directed mutagenesis using Stratagen's QuickChange II site-directed mutagenesis kit to construct recombinant plasmids pET28a-UGTPg1-A10V, pET28a-UGTPg1-I13F, pET28a-UGTPg1-H82C and pET28a-UGTPg1-H144F after UGTPg1 mutation, as well as recombinant strains pET28a-UGTPg1-A10V-BL21, pET28a-UGTPg1-I13F-BL21, pET28a-UGTPg1-H82C-BL21 and pET28a-UGTPg1-H144F-BL21. The site-directed mutagenesis method was the same as above, and the methods for inducing expression and enzyme activity determination were the same as in Example 2. As shown in Figure 5, the C20-OH glycosyltransferase activity of UGTPg1-A10V, UGTPg1-I13F and UGTPg1-H82C was reduced; UGTPg1-H82C and UGTPg1-H144F acquired the function of catalyzing C6-OH glycosylation, but the activity of UGTPg1-H144F in catalyzing C6-OH glycosylation was very weak.
实施例5.糖基转移酶UGTPg1,UGTPg101和UGTPg100的C末端截短促进可溶表达Example 5. C-terminal truncation of glycosyltransferases UGTPg1, UGTPg101 and UGTPg100 promotes soluble expression
糖基转移酶UGTPg1,UGTPg101和UGTPg100在大肠杆菌中表达主要以包涵体的形式存在,可溶表达量低。在N端截短5个氨基酸后,UGTPg1,UGTPg101和UGTPg100对于底物PPT都没有酶活性。在C端截短5个氨基酸后,UGTPg1,UGTPg101和UGTPg100对于底物PPT的酶活力降低,但可溶表达量却显著提高。在C端截短2个、4个氨基酸后,结果发现UGTPg1和UGTPg101的可溶表达量相对于野生型至少有50%提高(图6A),但酶活力没有明显改变(图6B)。虽然在UGTPg100的C末端截短2个、4个氨基酸后,其可溶表达量相对于野生型也至少提高了50%,但是其酶活力却显著降低(图6)。Glycosyltransferases UGTPg1, UGTPg101 and UGTPg100 expressed in E. coli mainly exist in the form of inclusion bodies, and the soluble expression level is low. After truncating 5 amino acids at the N-terminus, UGTPg1, UGTPg101 and UGTPg100 have no enzyme activity for the substrate PPT. After truncating 5 amino acids at the C-terminus, the enzyme activity of UGTPg1, UGTPg101 and UGTPg100 for the substrate PPT is reduced, but the soluble expression level is significantly increased. After truncating 2 and 4 amino acids at the C-terminus, it was found that the soluble expression level of UGTPg1 and UGTPg101 increased by at least 50% relative to the wild type (Figure 6A), but the enzyme activity did not change significantly (Figure 6B). Although the soluble expression level of UGTPg100 increased by at least 50% relative to the wild type after truncating 2 and 4 amino acids at the C-terminus, its enzyme activity was significantly reduced (Figure 6).
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。All documents mentioned in the present invention are cited as references in this application, just as each document is cited as reference individually. In addition, it should be understood that after reading the above teachings of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the claims attached to this application.
Claims (11)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510051992.5A CN105985938B (en) | 2015-01-30 | 2015-01-30 | Glycosyltransferase mutant protein and its application |
PCT/CN2016/073074 WO2016119756A1 (en) | 2015-01-30 | 2016-02-01 | Mutant protein of glycosyltransferase and uses thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510051992.5A CN105985938B (en) | 2015-01-30 | 2015-01-30 | Glycosyltransferase mutant protein and its application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105985938A CN105985938A (en) | 2016-10-05 |
CN105985938B true CN105985938B (en) | 2024-11-05 |
Family
ID=56542475
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510051992.5A Active CN105985938B (en) | 2015-01-30 | 2015-01-30 | Glycosyltransferase mutant protein and its application |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105985938B (en) |
WO (1) | WO2016119756A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108866020A (en) * | 2017-05-16 | 2018-11-23 | 中国科学院上海生命科学研究院 | Glycosyl transferase, mutant and its application |
CN109868265B (en) * | 2017-12-01 | 2023-04-25 | 中国科学院分子植物科学卓越创新中心 | Novel glycosyltransferase and application thereof |
CN111073864B (en) * | 2018-10-19 | 2022-06-28 | 中国科学院天津工业生物技术研究所 | Novel mutant protein for improving malic acid yield |
CN111909909A (en) * | 2019-05-09 | 2020-11-10 | 湖北工业大学 | Glycosyltransferase mutant and application thereof in synthesizing beta-D-glucoside |
CN112831481B (en) * | 2019-11-22 | 2024-01-19 | 生合万物(上海)生物科技有限公司 | Glycosyltransferase and method for catalyzing sugar chain extension |
CN115678952A (en) * | 2021-07-30 | 2023-02-03 | 生合万物(苏州)生物科技有限公司 | Rhamnose Highly Specific Glycosyltransferase and Its Application |
CN114836398B (en) * | 2022-06-06 | 2023-06-23 | 南京工业大学 | Application of glycosyltransferase mutant in directional synthesis of non-natural ginsenoside |
CN117187297B (en) * | 2023-09-07 | 2024-08-23 | 昆明理工大学 | Method for synthesizing ginsenoside CK in ginseng cells |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103849672A (en) * | 2012-12-06 | 2014-06-11 | 中国科学院上海生命科学研究院 | Group of glycosyl transferase and application thereof |
CN104232723A (en) * | 2013-06-07 | 2014-12-24 | 中国科学院上海生命科学研究院 | Glycosyl transferases and applications of glycosyl transferases |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102112150A (en) * | 2008-05-30 | 2011-06-29 | 格利科菲公司 | Yeast strain for the production of proteins with terminal alpha-1, 3-linked galactose |
-
2015
- 2015-01-30 CN CN201510051992.5A patent/CN105985938B/en active Active
-
2016
- 2016-02-01 WO PCT/CN2016/073074 patent/WO2016119756A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103849672A (en) * | 2012-12-06 | 2014-06-11 | 中国科学院上海生命科学研究院 | Group of glycosyl transferase and application thereof |
CN104232723A (en) * | 2013-06-07 | 2014-12-24 | 中国科学院上海生命科学研究院 | Glycosyl transferases and applications of glycosyl transferases |
Non-Patent Citations (1)
Title |
---|
Eucommia ulmoides glucosyltransferase(CAGT1)mRNA;Zhao,D.,et al;《Genbank: KJ413006》;20140427 * |
Also Published As
Publication number | Publication date |
---|---|
WO2016119756A1 (en) | 2016-08-04 |
CN105985938A (en) | 2016-10-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105985938B (en) | Glycosyltransferase mutant protein and its application | |
CN110225971B (en) | UDP-glycosyltransferase for catalyzing sugar chain extension and application thereof | |
CN105087739B (en) | A kind of new catalyst system and its application for preparing rare ginsenoside | |
WO2015188742A2 (en) | Group of glycosyltransferases and use thereof | |
JP7086107B2 (en) | Glycosyltransferases, mutants and their use | |
KR102310518B1 (en) | Cytochrome P450 mutant protein and its applications | |
CN114107240A (en) | Tartary buckwheat-derived emodin glycosyltransferase and coding gene and application thereof | |
CN112831481B (en) | Glycosyltransferase and method for catalyzing sugar chain extension | |
KR20240032944A (en) | Rhamnose highly specific glycosyltransferase and its applications | |
WO2022105729A1 (en) | Cytochrome p450 mutant protein and use thereof | |
CN112553175B (en) | Preparation and application of glycosyltransferase UGT76G1 mutant | |
CN108866142B (en) | Cytochrome P450, nicotinamide adenine dinucleotide-cytochrome P450 reductase and application thereof | |
CN100537754C (en) | A kind of S-adenosylmethionine is produced the structure and the large scale fermentation of bacterial strain | |
CN111926000B (en) | Gynostemma pentaphylla glycosyltransferase and application thereof | |
CN113444703B (en) | Glycosyltransferase mutant for catalyzing sugar chain extension and application thereof | |
CN119530188A (en) | Application of constructing efficient glycosyltransferase cascade catalytic system in synthesis of novel triterpene glycoside | |
CN118048329A (en) | A CYP98A20 protein involved in the biosynthesis of verbascoside and its encoding gene and application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200507 Address after: 200032 building 4, No. 300 Fenglin Road, Xuhui District, Shanghai Applicant after: Center for excellence and innovation in molecular plant science, Chinese Academy of Sciences Address before: 200031 Yueyang Road, Shanghai, No. 319, No. Applicant before: SHANGHAI INSTITUTES FOR BIOLOGICAL SCIENCES, CHINESE ACADEMY OF SCIENCES |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210730 Address after: Room 302, No. 120, Medical College Road, Xuhui District, Shanghai 200032 Applicant after: Zhou Zhihua Address before: No.4 building, No.300 Fenglin Road, Xuhui District, Shanghai 200032 Applicant before: Center for excellence and innovation in molecular plant science, Chinese Academy of Sciences |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20211020 Address after: No.4 building, No.300 Fenglin Road, Xuhui District, Shanghai 200032 Applicant after: Center for excellence and innovation in molecular plant science, Chinese Academy of Sciences Address before: Room 302, No. 120, Medical College Road, Xuhui District, Shanghai 200032 Applicant before: Zhou Zhihua |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20220623 Address after: 215127 unit e622, floor 5, Lecheng Plaza, phase II, biomedical industrial park, No. 218, Sangtian street, Suzhou Industrial Park, China (Jiangsu) pilot Free Trade Zone, Suzhou, Jiangsu Applicant after: Shenghe everything (Suzhou) Biotechnology Co.,Ltd. Address before: No.4 building, No.300 Fenglin Road, Xuhui District, Shanghai 200032 Applicant before: Center for excellence and innovation in molecular plant science, Chinese Academy of Sciences |
|
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Floor 1-2, Building No. 2, 500 Lane, Furong Hualu, Pudong New Area, Shanghai, 201321 Applicant after: Shenghe Everything (Shanghai) Biotechnology Co.,Ltd. Address before: 215127 unit e622, floor 5, Lecheng Plaza, phase II, biomedical industrial park, No. 218, Sangtian street, Suzhou Industrial Park, China (Jiangsu) pilot Free Trade Zone, Suzhou, Jiangsu Applicant before: Shenghe everything (Suzhou) Biotechnology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |