KR102335035B1 - Process for preparing crocin-4 using enzyme reaction
- Publication number
- KR102335035B1 KR1020190114070A KR20190114070A
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- crocin
- glu
- val
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 57
- 238000006911 enzymatic reaction Methods 0.000 title abstract description 12
- ATQIQIBBBWQWOT-UHFFFAOYSA-N Crocin 4 Natural products COC(=O)C(=CC=CC(=CC=CC=C(/C)C=CC=C(/C)C(=O)OC1OC(CO)C(O)C(O)C1O)C)C ATQIQIBBBWQWOT-UHFFFAOYSA-N 0.000 title description 3
- ATQIQIBBBWQWOT-QNFREAFUSA-N Crocin 4 Chemical compound COC(=O)C(\C)=C/C=C/C(/C)=C/C=C/C=C(/C)\C=C\C=C(/C)C(=O)OC1OC(CO)C(O)C(O)C1O ATQIQIBBBWQWOT-QNFREAFUSA-N 0.000 title description 2
- 102000004190 Enzymes Human genes 0.000 claims abstract description 80
- 108090000790 Enzymes Proteins 0.000 claims abstract description 80
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 72
- PANKHBYNKQNAHN-JTBLXSOISA-N Crocetin Natural products OC(=O)C(\C)=C/C=C/C(/C)=C\C=C\C=C(\C)/C=C/C=C(/C)C(O)=O PANKHBYNKQNAHN-JTBLXSOISA-N 0.000 claims abstract description 57
- PANKHBYNKQNAHN-JUMCEFIXSA-N carotenoid dicarboxylic acid Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C(=O)O)C=CC=C(/C)C(=O)O PANKHBYNKQNAHN-JUMCEFIXSA-N 0.000 claims abstract description 57
- PANKHBYNKQNAHN-MQQNZMFNSA-N crocetin Chemical compound OC(=O)C(/C)=C/C=C/C(/C)=C/C=C/C=C(\C)/C=C/C=C(\C)C(O)=O PANKHBYNKQNAHN-MQQNZMFNSA-N 0.000 claims abstract description 57
- 239000000203 mixture Substances 0.000 claims abstract description 45
- 238000000034 method Methods 0.000 claims abstract description 32
- 101000644026 Gardenia jasminoides Crocetin glucosyltransferase, chloroplastic Proteins 0.000 claims abstract description 28
- 240000001829 Catharanthus roseus Species 0.000 claims abstract description 17
- 235000018958 Gardenia augusta Nutrition 0.000 claims abstract description 15
- 244000061176 Nicotiana tabacum Species 0.000 claims abstract description 14
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims abstract description 14
- 240000001972 Gardenia jasminoides Species 0.000 claims abstract 3
- 238000006243 chemical reaction Methods 0.000 claims description 25
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 claims description 15
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims description 15
- 230000015572 biosynthetic process Effects 0.000 claims description 13
- 239000000047 product Substances 0.000 claims description 9
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- 101150055425 aldh gene Proteins 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 8
- 239000002773 nucleotide Substances 0.000 claims 8
- 125000003729 nucleotide group Chemical group 0.000 claims 8
- 239000007795 chemical reaction product Substances 0.000 claims 2
- 244000124209 Crocus sativus Species 0.000 abstract description 15
- 235000015655 Crocus sativus Nutrition 0.000 abstract description 14
- 239000004248 saffron Substances 0.000 abstract description 13
- 235000013974 saffron Nutrition 0.000 abstract description 13
- 150000001875 compounds Chemical class 0.000 abstract description 8
- 229940088598 enzyme Drugs 0.000 description 66
- 108020004414 DNA Proteins 0.000 description 36
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 18
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 17
- 239000002904 solvent Substances 0.000 description 16
- 108090000992 Transferases Proteins 0.000 description 14
- 102000004357 Transferases Human genes 0.000 description 14
- 235000000346 sugar Nutrition 0.000 description 13
- 244000111489 Gardenia augusta Species 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 11
- 244000005700 microbiome Species 0.000 description 11
- 239000013598 vector Substances 0.000 description 11
- 239000002609 medium Substances 0.000 description 10
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 239000008103 glucose Substances 0.000 description 8
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 229910052799 carbon Inorganic materials 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010053725 prolylvaline Proteins 0.000 description 6
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 4
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 4
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 4
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 4
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 4
- 101000946067 Fragaria ananassa Cinnamate beta-D-glucosyltransferase Proteins 0.000 description 4
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 4
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 4
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 4
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 4
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 4
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 4
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 235000019253 formic acid Nutrition 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 4
- 102000004169 proteins and genes Human genes 0.000 description 4
- SGAWOGXMMPSZPB-UHFFFAOYSA-N safranal Chemical compound CC1=C(C=O)C(C)(C)CC=C1 SGAWOGXMMPSZPB-UHFFFAOYSA-N 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 108010078580 tyrosylleucine Proteins 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 230000005526 G1 to G0 transition Effects 0.000 description 3
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 3
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 3
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 3
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 3
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 3
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 3
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 3
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 3
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 3
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 3
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 3
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 3
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 3
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 3
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 3
- UDP-glucose Chemical class 0.000 description 3
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 3
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 3
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 3
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 235000001014 amino acid Nutrition 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 239000003086 colorant Substances 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 3
- 230000014759 maintenance of location Effects 0.000 description 3
- 230000002503 metabolic effect Effects 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 229910052760 oxygen Inorganic materials 0.000 description 3
- PXDJXZJSCPSGGI-UHFFFAOYSA-N palmityl palmitate Chemical compound CCCCCCCCCCCCCCCCOC(=O)CCCCCCCCCCCCCCC PXDJXZJSCPSGGI-UHFFFAOYSA-N 0.000 description 3
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 239000000376 reactant Substances 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 3
- JKQXZKUSFCKOGQ-JLGXGRJMSA-N (3R,3'R)-beta,beta-carotene-3,3'-diol Chemical compound C([C@H](O)CC=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-JLGXGRJMSA-N 0.000 description 2
- HMUNWXXNJPVALC-UHFFFAOYSA-N 1-[4-[2-(2,3-dihydro-1H-inden-2-ylamino)pyrimidin-5-yl]piperazin-1-yl]-2-(2,4,6,7-tetrahydrotriazolo[4,5-c]pyridin-5-yl)ethanone Chemical compound C1C(CC2=CC=CC=C12)NC1=NC=C(C=N1)N1CCN(CC1)C(CN1CC2=C(CC1)NN=N2)=O HMUNWXXNJPVALC-UHFFFAOYSA-N 0.000 description 2
- LDXJRKWFNNFDSA-UHFFFAOYSA-N 2-(2,4,6,7-tetrahydrotriazolo[4,5-c]pyridin-5-yl)-1-[4-[2-[[3-(trifluoromethoxy)phenyl]methylamino]pyrimidin-5-yl]piperazin-1-yl]ethanone Chemical compound C1CN(CC2=NNN=C21)CC(=O)N3CCN(CC3)C4=CN=C(N=C4)NCC5=CC(=CC=C5)OC(F)(F)F LDXJRKWFNNFDSA-UHFFFAOYSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- YHCIKUXPWFLCFN-MTGLMCJBSA-N Crocetin dialdehyde Natural products CC(=C/C=C/C(=C/C=C/C=C(C)/C=C/C=C(C)/C=O)/C)C=O YHCIKUXPWFLCFN-MTGLMCJBSA-N 0.000 description 2
- SEBIKDIMAPSUBY-ARYZWOCPSA-N Crocin Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H](O)[C@@H]1O)O)OC(=O)C(C)=CC=CC(C)=C\C=C\C=C(/C)\C=C\C=C(C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)O1)O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O SEBIKDIMAPSUBY-ARYZWOCPSA-N 0.000 description 2
- JDHMXPSXWMPYQZ-AAEUAGOBSA-N Cys-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N JDHMXPSXWMPYQZ-AAEUAGOBSA-N 0.000 description 2
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 2
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108010072062 GEKG peptide Proteins 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 2
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 108700023372 Glycosyltransferases Proteins 0.000 description 2
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 2
- JUXONJROIXKHEV-GUBZILKMSA-N Met-Cys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCNC(N)=N JUXONJROIXKHEV-GUBZILKMSA-N 0.000 description 2
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 2
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 2
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 2
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 2
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 2
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 2
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- JKQXZKUSFCKOGQ-LQFQNGICSA-N Z-zeaxanthin Natural products C([C@H](O)CC=1C)C(C)(C)C=1C=CC(C)=CC=CC(C)=CC=CC=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-LQFQNGICSA-N 0.000 description 2
- QOPRSMDTRDMBNK-RNUUUQFGSA-N Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCC(O)C1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C QOPRSMDTRDMBNK-RNUUUQFGSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- JKQXZKUSFCKOGQ-LOFNIBRQSA-N all-trans-Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C JKQXZKUSFCKOGQ-LOFNIBRQSA-N 0.000 description 2
- 229910021529 ammonia Inorganic materials 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 238000010923 batch production Methods 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- YHCIKUXPWFLCFN-QHUUTLAPSA-N crocetin dialdehyde Chemical compound O=CC(/C)=C/C=C/C(/C)=C/C=C/C=C(\C)/C=C/C=C(\C)C=O YHCIKUXPWFLCFN-QHUUTLAPSA-N 0.000 description 2
- 239000007857 degradation product Substances 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 150000004665 fatty acids Chemical class 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- 235000019796 monopotassium phosphate Nutrition 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 239000011574 phosphorus Substances 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 235000017509 safranal Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 229910052708 sodium Inorganic materials 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 235000013599 spices Nutrition 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- KBPHJBAIARWVSC-XQIHNALSSA-N trans-lutein Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2C(=CC(O)CC2(C)C)C KBPHJBAIARWVSC-XQIHNALSSA-N 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 235000010930 zeaxanthin Nutrition 0.000 description 2
- 239000001775 zeaxanthin Substances 0.000 description 2
- 229940043269 zeaxanthin Drugs 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 1
- 108010068049 1-deoxy-D-xylulose 5-phosphate reductoisomerase Proteins 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- PAWQVTBBRAZDMG-UHFFFAOYSA-N 2-(3-bromo-2-fluorophenyl)acetic acid Chemical compound OC(=O)CC1=CC=CC(Br)=C1F PAWQVTBBRAZDMG-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- 108010068139 Ala-Leu-Pro-Met Proteins 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- HTOZUYZQPICRAP-BPUTZDHNSA-N Asp-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N HTOZUYZQPICRAP-BPUTZDHNSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- TYPQAKSJJYOJBL-UHFFFAOYSA-N Asp-Met-Phe-Cys Chemical compound OC(=O)CC(N)C(=O)NC(CCSC)C(=O)NC(C(=O)NC(CS)C(O)=O)CC1=CC=CC=C1 TYPQAKSJJYOJBL-UHFFFAOYSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 1
- 101100397224 Bacillus subtilis (strain 168) isp gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- SEBIKDIMAPSUBY-JAUCNNNOSA-N Crocin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C(=O)OC1OC(COC2OC(CO)C(O)C(O)C2O)C(O)C(O)C1O)C=CC=C(/C)C(=O)OC3OC(COC4OC(CO)C(O)C(O)C4O)C(O)C(O)C3O SEBIKDIMAPSUBY-JAUCNNNOSA-N 0.000 description 1
- CZSBHMFVVLYIQQ-RDVATZJHSA-N Crocin 2 Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C(=O)OC1OC(COC2OC(CO)C(O)C(O)C2O)C(O)C(O)C1O)C=CC=C(/C)C(=O)OC3OC(CO)C(O)C(O)C3O CZSBHMFVVLYIQQ-RDVATZJHSA-N 0.000 description 1
- 101000807296 Crocosmia x crocosmiiflora Myricetin 3-O-rhamnosyltransferase UGT77B2 Proteins 0.000 description 1
- 235000019750 Crude protein Nutrition 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 1
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- 101100246459 Escherichia coli (strain K12) puuC gene Proteins 0.000 description 1
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 1
- 101100208822 Gardenia jasminoides UGT75L6 gene Proteins 0.000 description 1
- GVVPGTZRZFNKDS-YFHOEESVSA-N Geranyl diphosphate Natural products CC(C)=CCC\C(C)=C/COP(O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-YFHOEESVSA-N 0.000 description 1
- 102100039291 Geranylgeranyl pyrophosphate synthase Human genes 0.000 description 1
- 108010026318 Geranyltranstransferase Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- AMHIFFIUJOJEKJ-SZMVWBNQSA-N Gln-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N AMHIFFIUJOJEKJ-SZMVWBNQSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- DSRVQBZAMPGEKU-AVGNSLFASA-N Gln-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DSRVQBZAMPGEKU-AVGNSLFASA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- FBUMPXILDTWCJW-UHFFFAOYSA-N Gly-Trp-Ala-Pro Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)CN)C(=O)NC(C)C(=O)N1CCCC1C(O)=O FBUMPXILDTWCJW-UHFFFAOYSA-N 0.000 description 1
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- JSHOVJTVPXJFTE-HOCLYGCPSA-N His-Gly-Trp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JSHOVJTVPXJFTE-HOCLYGCPSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- MVLDERGQICFFLL-ZQINRCPSSA-N Ile-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 MVLDERGQICFFLL-ZQINRCPSSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 108010065958 Isopentenyl-diphosphate Delta-isomerase Proteins 0.000 description 1
- 102100027665 Isopentenyl-diphosphate Delta-isomerase 1 Human genes 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 241001625930 Luria Species 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 1
- ZACMJPCWVSLCNS-JYJNAYRXSA-N Met-Phe-Met Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(O)=O)CC1=CC=CC=C1 ZACMJPCWVSLCNS-JYJNAYRXSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 241000208136 Nicotiana sylvestris Species 0.000 description 1
- 241000588912 Pantoea agglomerans Species 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- 241001055169 Populus fremontii x Populus angustifolia Species 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- JRBWMRUPXWPEID-JYJNAYRXSA-N Pro-Trp-Cys Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(=O)O)C(=O)[C@@H]1CCCN1 JRBWMRUPXWPEID-JYJNAYRXSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710204244 Processive diacylglycerol beta-glucosyltransferase Proteins 0.000 description 1
- 241000169446 Promethis Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- 101100052502 Shigella flexneri yciB gene Proteins 0.000 description 1
- 235000011564 Solanum pennellii Nutrition 0.000 description 1
- 241001136583 Solanum pennellii Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- 241000192589 Synechococcus elongatus PCC 7942 Species 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- OQMQBYOEAHVCGD-GQGQLFGLSA-N Trp-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OQMQBYOEAHVCGD-GQGQLFGLSA-N 0.000 description 1
- JZHJLBPBQKPTNX-UBHSHLNASA-N Trp-Cys-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 JZHJLBPBQKPTNX-UBHSHLNASA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- KPEVFMGKBCMTJF-SZMVWBNQSA-N Trp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N KPEVFMGKBCMTJF-SZMVWBNQSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 101710091289 UDP-glucosyltransferase 2 Proteins 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 238000000862 absorption spectrum Methods 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- IAJILQKETJEXLJ-QTBDOELSSA-N aldehydo-D-glucuronic acid Chemical compound O=C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-QTBDOELSSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 150000007514 bases Chemical class 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- CZSBHMFVVLYIQQ-DRVLGOCHSA-N beta-D-gentiobiosyl beta-D-glucosyl crocetin Chemical compound O([C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)O1)O)C(=O)C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)C(=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O CZSBHMFVVLYIQQ-DRVLGOCHSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- QBZWPZHDUZGTLS-IIDMIUPYSA-N bis(beta-D-glucosyl) crocetin Chemical compound O([C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)C(=O)C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)C(=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O QBZWPZHDUZGTLS-IIDMIUPYSA-N 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 239000004359 castor oil Substances 0.000 description 1
- 235000019438 castor oil Nutrition 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 238000000451 chemical ionisation Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000003240 coconut oil Substances 0.000 description 1
- 235000019864 coconut oil Nutrition 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 239000001209 crocus sativus l. Substances 0.000 description 1
- 239000001599 crocus sativus l. flower extract Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060155 deoxyxylulose-5-phosphate synthase Proteins 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 102000024323 dimethylallyltranstransferase activity proteins Human genes 0.000 description 1
- 108040001168 dimethylallyltranstransferase activity proteins Proteins 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 235000019797 dipotassium phosphate Nutrition 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 1
- 229940097043 glucuronic acid Drugs 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 239000003630 growth substance Substances 0.000 description 1
- IPCSVZSSVZVIGE-UHFFFAOYSA-M hexadecanoate Chemical compound CCCCCCCCCCCCCCCC([O-])=O IPCSVZSSVZVIGE-UHFFFAOYSA-M 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- FBAFATDZDUQKNH-UHFFFAOYSA-M iron chloride Chemical compound [Cl-].[Fe] FBAFATDZDUQKNH-UHFFFAOYSA-M 0.000 description 1
- 229910000358 iron sulfate Inorganic materials 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 101150064873 ispA gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 239000013028 medium composition Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 235000014593 oils and fats Nutrition 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 235000012830 plain croissants Nutrition 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 229920001522 polyglycol ester Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 229940076591 saffron extract Drugs 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000003549 soybean oil Substances 0.000 description 1
- 235000012424 soybean oil Nutrition 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/44—Preparation of O-glycosides, e.g. glucosides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/44—Polycarboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/01—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
- C12Y102/01003—Aldehyde dehydrogenase (NAD+) (1.2.1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/99—Miscellaneous (1.14.99)
- C12Y114/99036—Beta-carotene 15,15'-monooxygenase (1.14.99.36)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01017—Glucuronosyltransferase (2.4.1.17)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The present invention relates to: a composition for producing crocin-4 comprising a CaUGT3 enzyme derived from a Catharanthus roseus strain; a composition for producing crocin-4 comprising a transformant into which a gene encoding the CaUGT3 enzyme has been introduced; a method for producing crocin-4 from crocin-2 using these compositions; a composition for producing crocin-4 comprising an NtUGT enzyme derived from a Nicotiana tabacum strain or a GjUGT1 enzyme derived from a Gardenia jasminoides strain, together with a CaUGT3 enzyme derived from a Catharanthus roseus strain; a composition for producing crocin-4 comprising a transformant into which the genes encoding these enzymes have been introduced; and a method for producing crocin-4 from crocetin using these compositions. With the production method provided by the present invention, crocin-4 can be produced efficiently through a simple enzymatic reaction or bioconversion process, so the method can be widely used for the economical production of saffron compounds.
Description
The present invention relates to a method for producing crocin-4 using an enzymatic reaction. More specifically, the present invention relates to: a composition for producing crocin-4 comprising a CaUGT3 enzyme derived from a Catharanthus roseus strain; a composition for producing crocin-4 comprising a transformant into which a gene encoding the CaUGT3 enzyme has been introduced; a method for producing crocin-4 from crocin-2 using these compositions; a composition for producing crocin-4 comprising an NtUGT enzyme derived from a Nicotiana tabacum strain or a GjUGT1 enzyme derived from a Gardenia jasminoides strain, together with a CaUGT3 enzyme derived from a Catharanthus roseus strain; a composition for producing crocin-4 comprising a transformant into which the genes encoding these enzymes have been introduced; and a method for producing crocin-4 from crocetin using these compositions. With the production method provided by the present invention, crocin-4 can be produced efficiently through a simple enzymatic reaction or bioconversion process, so the method can be widely used for the economical production of saffron compounds.
Saffron is a dried spice prepared by extraction from the stigmas of Crocus sativus L. flowers and is thought to have been in use for more than 3,500 years. The spice has historically been used for many medicinal purposes, but today it is used mainly for its coloring properties. Crocetin, one of the main components of saffron, has antioxidant properties similar to those of related carotenoid-type molecules and is also a colorant. The main pigment of saffron is crocin, a mixture of glycosides that imparts a yellowish-red color. The main component of crocin is α-crocin, which is yellow. Safranal is thought to be a product of the drying process and has odorant properties that can be used in food preparation. Safranal is the aglycone form of picrocrocin, the bitter, colorless component of saffron extract. Saffron extract is therefore used for many purposes: as a colorant, as a flavorant, or for its odorant properties.
Saffron is cultivated commercially in many countries, including Italy, France, India, Spain, Greece, Morocco, Turkey, Switzerland, Israel, Pakistan, Azerbaijan, China, Egypt, the United Arab Emirates, Japan, Australia, and Iran. Iran produces about 80% of the world's annual saffron output, which is estimated at just over 200 tonnes. More than 150,000 flowers are reported to be needed to produce 1 kg. Breeding efforts to increase yield are complicated by the triploidy of the plant's genome, which makes the plants sterile. In addition, the plant flowers for only about 15 days, starting in mid to late October. Production also typically involves removing the stigmas from the flowers by hand, which is an inefficient process. Selling prices above $1,000 per kg of saffron are common.
As a way to avoid these high production costs, research into producing saffron components through bioconversion processes is being actively pursued. For example, U.S. Patent Application Publication No. 2017-0306376 discloses a method for producing compounds from saffron, such as crocetin, crocetin dialdehyde, crocin, or picrocrocin, using a recombinant microorganism engineered to express a zeaxanthin cleavage dioxygenase alone or in combination with a recombinant gene encoding a UDP-glycosyltransferase (UGT). However, the yield of saffron compounds obtained with the UDP-glycosyltransferases used there was low, so there was a need to improve it.
Against this background, the present inventors conducted intensive research to develop a method for increasing the production yield of saffron compounds. As a result, they confirmed that crocin-4 can be produced in high yield when UDP-glycosyltransferases derived from a Nicotiana tabacum strain, a Gardenia jasminoides strain, and a Catharanthus roseus strain are used, and thereby completed the present invention.
The main object of the present invention is to provide a composition for producing crocin-4 comprising a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
Another object of the present invention is to provide a method for producing crocin-4 from crocin-2 using a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
Another object of the present invention is to provide a composition for producing crocin-4 comprising a transformant into which a gene encoding a UDP-glycosyltransferase derived from a Catharanthus roseus strain has been introduced.
Another object of the present invention is to provide a method for producing crocin-4 from crocin-2 using a transformant into which a gene encoding a UDP-glycosyltransferase derived from a Catharanthus roseus strain has been introduced.
Another object of the present invention is to provide a composition for producing crocin-4 comprising a UDP-glycosyltransferase derived from a Nicotiana tabacum strain or a Gardenia jasminoides strain and a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
Another object of the present invention is to provide a method for producing crocin-4 from crocetin using a UDP-glycosyltransferase derived from a Nicotiana tabacum strain or a Gardenia jasminoides strain and a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
Another object of the present invention is to provide a composition for producing crocin-4 comprising a transformant into which a gene encoding a UDP-glycosyltransferase derived from a Nicotiana tabacum strain or a Gardenia jasminoides strain and a gene encoding a UDP-glycosyltransferase derived from a Catharanthus roseus strain have been introduced.
Another object of the present invention is to provide a method for producing crocin-4 from crocetin using the transformant.
To achieve the above objects, one embodiment of the present invention provides a composition for producing crocin-4 comprising a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
As used herein, the term "UDP-glycosyltransferase (UGT)" refers to an enzyme that catalyzes the transfer of the sugar moiety of a UDP-activated sugar, such as UDP-glucose or UDP-glucuronic acid, to a small hydrophobic molecule.
In the present invention, the UGT can be understood as an enzyme directly involved in the reaction that produces crocin-2 from crocetin or the reaction that produces crocin-4 from crocin-2.
As established in the present invention, not every UGT is directly involved in producing crocin-2 from crocetin or crocin-4 from crocin-2; only UGTs derived from particular strains can take part in these reactions.
As one example, the GjUGT1 enzyme (SEQ ID NO: 1), a UGT derived from a Gardenia jasminoides strain, or the NtUGT enzyme (SEQ ID NO: 9), a UGT derived from a Nicotiana tabacum strain, can be directly involved in the reaction that produces crocin-2 from crocetin.
As another example, the CaUGT3 enzyme (SEQ ID NO: 21), a UGT derived from a Catharanthus roseus strain, can be directly involved in the reaction that produces crocin-4 from crocin-2.
As described above, because the CaUGT3 enzyme (SEQ ID NO: 21) can be directly involved in the reaction that produces crocin-4 from crocin-2, a composition comprising the CaUGT3 enzyme (SEQ ID NO: 21) can be used as a composition for producing crocin-4.
In addition to the CaUGT3 enzyme (SEQ ID NO: 21), the composition for producing crocin-4 may further comprise the crocin-2 and UDP-glucose required for crocin-4 production.
Another embodiment of the present invention provides a method for producing crocin-4 from crocin-2 using a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
The enzymatic reaction may be carried out in a reaction mixture containing the substrates crocetin, crocin-2, and UDP-glucose.
The temperature for the enzymatic reaction is not particularly limited; it may be, as one example, 20 to 40°C, as another example, 25 to 35°C, and as yet another example, 30°C.
The duration of the enzymatic reaction is not particularly limited; it may be, as one example, 10 to 240 minutes, as another example, 60 to 180 minutes, and as yet another example, 120 minutes.
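The nested temperature and time ranges above can be restated as a small lookup for checking a planned reaction against them. The following Python sketch is illustrative only: the names `RANGES` and `classify` are hypothetical and do not come from the patent, and the tiers merely repeat the example values given in the text, which the patent describes as non-limiting.

```python
# Illustrative sketch: the example ranges from the text, widest to narrowest.
# RANGES and classify are hypothetical names; the ranges are not limiting.
RANGES = {
    "temperature_c": [(20, 40), (25, 35), (30, 30)],
    "time_min": [(10, 240), (60, 180), (120, 120)],
}


def classify(parameter: str, value: float) -> int:
    """Return how many of the nested example ranges contain value (0 to 3)."""
    return sum(low <= value <= high for low, high in RANGES[parameter])


if __name__ == "__main__":
    # Each pair is (temperature in degrees C, reaction time in minutes).
    for temp, minutes in [(30, 120), (37, 90), (45, 300)]:
        print(temp, minutes,
              classify("temperature_c", temp),
              classify("time_min", minutes))
```

A result of 3 means the value matches the single example value (30°C or 120 minutes), while 0 means it falls outside even the widest example range.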
Another embodiment of the present invention provides a composition for producing crocin-4 comprising a transformant into which a gene encoding a UDP-glycosyltransferase derived from a Catharanthus roseus strain has been introduced, or a culture product of the transformant.
As described above, crocin-4 can be produced from crocin-2 by using a UGT derived from a particular strain, so crocin-4 can also be produced from crocin-2 by a bioconversion process that uses a transformant into which the gene encoding that UGT has been introduced.
Here, the transformant may be one into which the CaUGT3 gene (SEQ ID NO: 22), whose product is directly involved in the reaction that produces crocin-4 from crocin-2, has been introduced.
The culture product of the transformant is not particularly limited as long as it can be used to produce crocin-4 from crocin-2 in high yield. As one example, it may be a culture of the transformant, a culture supernatant, a lysate, or a fraction of any of these. As another example, it may be a culture supernatant obtained by centrifuging a culture of the transformant, a lysate obtained by physically disrupting or sonicating the transformant, or a fraction obtained by subjecting the culture, culture supernatant, or lysate to a method such as centrifugation or chromatography.
Another embodiment of the present invention provides a method for producing crocin-4 from crocin-2 using a transformant into which a gene encoding a UDP-glycosyltransferase derived from a Catharanthus roseus strain has been introduced.
Specifically, the method for producing crocin-4 from crocin-2 comprises culturing the transformant into which the CaUGT3 gene (SEQ ID NO: 22) has been introduced in a medium containing crocin-2.
As described above, crocin-4 can be produced from crocin-2 by using the CaUGT3 enzyme (SEQ ID NO: 21), so crocin-4 can be produced from crocin-2 by a bioconversion process when a transformant carrying the gene encoding the CaUGT3 enzyme (SEQ ID NO: 22) is used.
The method may further comprise recovering crocin-4 from the culture obtained by the culturing.
As used herein, the term "culturing" refers to growing a microorganism under suitably, artificially controlled environmental conditions.
In the present invention, the transformant can be cultured by methods widely known in the art. Specifically, the culturing is not particularly limited as long as crocin-4 can be produced from the transformant; it may be carried out as a batch process, or continuously as a fed-batch or repeated fed-batch process.
The medium used for culturing should meet the requirements of the particular strain in an appropriate manner, with temperature, pH, and the like controlled under aerobic conditions in a conventional medium containing suitable carbon sources, nitrogen sources, amino acids, vitamins, and so on. Usable carbon sources include sugars and carbohydrates such as glucose, xylose, sucrose, lactose, fructose, maltose, starch, and cellulose; oils and fats such as soybean oil, sunflower oil, castor oil, and coconut oil; fatty acids such as palmitic acid, stearic acid, and linoleic acid; alcohols such as glycerol and ethanol; and organic acids such as acetic acid. These substances may be used individually or as a mixture. Usable nitrogen sources include inorganic nitrogen sources such as ammonia, ammonium sulfate, ammonium chloride, ammonium acetate, ammonium phosphate, ammonium carbonate, and ammonium nitrate; amino acids such as glutamic acid, methionine, and glutamine; and organic nitrogen sources such as peptone, NZ-amine, meat extract, yeast extract, malt extract, corn steep liquor, casein hydrolysate, fish or its degradation products, and defatted soybean cake or its degradation products. These nitrogen sources may be used alone or in combination. As a phosphorus source, the medium may contain potassium dihydrogen phosphate, dipotassium hydrogen phosphate, or the corresponding sodium-containing salts. Usable inorganic compounds include sodium chloride, calcium chloride, iron chloride, magnesium sulfate, iron sulfate, manganese sulfate, and calcium carbonate. Finally, essential growth substances such as amino acids and vitamins may be used in addition to the substances above.
Suitable precursors may also be used in the culture medium. The raw materials described above may be added to the culture during culturing in a batch, fed-batch, or continuous manner by any appropriate method, without particular limitation. The pH of the culture can be adjusted by appropriately using basic compounds such as sodium hydroxide, potassium hydroxide, or ammonia, or acidic compounds such as phosphoric acid or sulfuric acid.
Foaming can be suppressed with an antifoaming agent such as a fatty acid polyglycol ester. To maintain aerobic conditions, oxygen or an oxygen-containing gas (for example, air) is introduced into the culture. The culture temperature is usually 27°C to 37°C, preferably 30°C to 35°C. Culturing is continued until the maximum amount of the desired product is obtained, which is usually achieved within 10 to 100 hours.
아울러, 반응액으로부터 크로신-4를 회수하는 단계는 투석, 원심분리, 여과, 용매추출, 크로마토그래피, 결정화 등의 당업계에 공지된 방법에 의해 수행될 수 있다. 예를 들면, 상기 반응액을 원심분리하여 형질전환체를 제거하고 얻어진 상등액을, 용매추출법에 적용하여 목적하는 크로신-4를 회수하는 방법을 사용할 수 있고, 이외에도 상기 목적하는 크로신-4의 특성에 맞추어 공지된 실험방법을 조합하여 상기 크로신-4를 회수할 수 있는 방법이라면 특별히 제한되지 않고 사용될 수 있다.In addition, the step of recovering crosin-4 from the reaction solution may be performed by methods known in the art, such as dialysis, centrifugation, filtration, solvent extraction, chromatography, and crystallization. For example, a method of recovering the desired crosin-4 by centrifuging the reaction solution to remove the transformant and applying the obtained supernatant to a solvent extraction method may be used. If it is a method capable of recovering the crossin-4 by combining known experimental methods according to the characteristics, it may be used without particular limitation.
Another embodiment of the present invention provides a composition for producing crocin-4, comprising a UDP-glycosyltransferase derived from a Nicotiana tabacum strain or a Gardenia jasminoides strain and a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
As described above, the GjUGT1 enzyme (SEQ ID NO: 1), a UGT derived from a Gardenia jasminoides strain, or the NtUGT enzyme (SEQ ID NO: 9), a UGT derived from a Nicotiana tabacum strain, can be directly involved in the reaction that produces crocin-2 from crocetin, and the CaUGT3 enzyme (SEQ ID NO: 21), a UGT derived from a Catharanthus roseus strain, can be directly involved in the reaction that produces crocin-4 from crocin-2. Crocin-4 can therefore be produced from crocetin by using the sequential reactions of these enzymes.
Therefore, a composition comprising each of the above UGTs can be used as a composition for producing crocin-4.
In addition to the GjUGT1 enzyme (SEQ ID NO: 1) or NtUGT enzyme (SEQ ID NO: 9) and the CaUGT3 enzyme (SEQ ID NO: 21), the composition for producing crocin-4 may further comprise the crocetin and UDP-glucose required for crocin-4 production.
According to one example of the present invention, combinations of the GjUGT1 enzyme or NtUGT enzyme with the CaUGT3 enzyme, SpUGT enzyme, or NsUGT enzyme were tested in vitro, and it was confirmed that the CaUGT3 enzyme can convert the crocin-2 produced by the GjUGT1 enzyme or NtUGT enzyme into crocin-4 (FIG. 4).
Another embodiment of the present invention provides a method for producing crocin-4 from crocetin using a UDP-glycosyltransferase derived from a Nicotiana tabacum strain or a Gardenia jasminoides strain and a UDP-glycosyltransferase derived from a Catharanthus roseus strain.
Specifically, the method for producing crocin-4 from crocetin comprises (a) reacting crocetin with UDP-glucose in the presence of the NtUGT enzyme derived from a Nicotiana tabacum strain or the GjUGT1 enzyme derived from a Gardenia jasminoides strain to obtain a reaction product containing crocin-2; and (b) adding the CaUGT3 enzyme derived from a Catharanthus roseus strain to the reaction product and allowing it to react to obtain crocin-4.
The enzymatic reaction may be carried out in a reaction mixture containing crocetin as the substrate, the UGT, and UDP-glucose.
The temperature for the enzymatic reaction is not particularly limited, but may be, for example, 20 to 40°C, in another example 25 to 35°C, and in yet another example 30°C.
The duration of the enzymatic reaction is not particularly limited, but may be, for example, 10 to 240 minutes, in another example 60 to 180 minutes, and in yet another example 120 minutes.
Meanwhile, in carrying out the reaction of step (b), UDP-glucose may be added in addition to the CaUGT3 enzyme.
Another embodiment of the present invention provides a composition for producing crocin-4, comprising a transformant into which the genes encoding each of the above enzymes have been introduced, or a culture product of the transformant.
As described above, crocin-4 can be produced from crocetin or crocin-2 by using UGTs derived from specific strains. Therefore, a composition comprising a transformant into which each gene encoding the UGTs has been introduced, or a culture product of the transformant, can be used as a composition for producing crocin-4.
Here, the transformant must carry not only the GjUGT1 gene (SEQ ID NO: 2) or NtUGT gene (SEQ ID NO: 10), which is directly involved in the reaction that produces crocin-2 from crocetin, but also the CaUGT3 gene (SEQ ID NO: 22), which is directly involved in the reaction that produces crocin-4 from crocin-2.
The GjUGT1 gene (SEQ ID NO: 2) or NtUGT gene (SEQ ID NO: 10) and the CaUGT3 gene (SEQ ID NO: 22) may be introduced together into the transformant using a single vector, or may be introduced separately using individual vectors.
In one example of the present invention, it was confirmed that crocin-4 can be produced by a bioconversion process using a transformant into which the NtUGT gene (SEQ ID NO: 10) and the CaUGT3 gene (SEQ ID NO: 22) had been introduced (FIG. 6).
In addition, the transformant may be one into which crocetin biosynthesis genes have additionally been introduced. Such a transformant can produce crocin-4 by a bioconversion process without the substrate crocetin being supplied separately.
The crocetin biosynthesis genes used are not particularly limited, but may be the CsCCD2 gene and the aldH gene.
In addition to the transformant, the composition for producing crocin-4 may further comprise the crocetin and UDP-glucose required for crocin-4 production.
Another embodiment of the present invention provides a method for producing crocin-4 from crocetin using a transformant into which the genes encoding each of the above enzymes have been introduced.
Specifically, the method for producing crocin-4 from crocetin comprises culturing a transformant into which the genes encoding each of the above enzymes have been introduced.
Here, crocin-4 can be produced through the metabolism of the transformant during culturing even if crocetin and UDP-glucose are not separately added to the culture medium. However, to improve crocin-4 productivity, a medium additionally containing crocetin and UDP-glucose may be used.
The method may further comprise recovering crocin-4 from the culture obtained through the culturing.
According to one example of the present invention, it was confirmed that crocin-4 can be produced by a bioconversion process using a transformant into which the NtUGT gene (SEQ ID NO: 10) and the CaUGT3 gene (SEQ ID NO: 22) had been introduced (FIG. 6).
With the method for producing crocin-4 provided by the present invention, crocin-4 can be produced efficiently using a simple enzymatic reaction or bioconversion process, so the method can be widely used for the economical production of saffron compounds.
FIG. 1 is an electrophoresis image showing the result of expressing candidate genes for glycosyltransferases capable of converting crocetin to crocin-2.
FIG. 2 is a graph showing the result of converting crocetin to crocin-2 in vitro using the GjUGT1 enzyme, GT1-316 enzyme, or NtUGT enzyme.
FIG. 3 is an electrophoresis image showing the result of expressing candidate genes for glycosyltransferases capable of converting crocin-2 to crocin-4.
FIG. 4 is a graph showing the result of converting crocetin to crocin-4 in vitro using a combination of the GjUGT1 enzyme or NtUGT enzyme with the CaUGT3 enzyme, SpUGT enzyme, or NsUGT enzyme.
FIG. 5 is a schematic diagram showing the construction of the transformed microorganism and a graph showing its growth curve.
FIG. 6 is a graph showing the result of producing crocin-2 and crocin-4 using the transformed microorganism.
Hereinafter, the present invention will be described in more detail through examples. However, these examples are intended to illustrate the present invention, and the scope of the present invention is not limited to these examples.
Example 1: Identification of the primary glycosyltransferase
To identify a glycosyltransferase (UDP-glucosyltransferase-1, UGT-1) capable of converting crocetin into crocin-2 (crocetin diglucosyl ester), a BLAST search was performed using the gene of UGT75L6, a glycosyltransferase derived from a Gardenia jasminoides strain, as the query. This identified the GjUGT1 gene (SEQ ID NO: 2) derived from a Gardenia jasminoides strain, the GT1-316 gene (SEQ ID NO: 6) derived from a strain of the genus Populus (Populus fremontii × Populus angustifolia), the NtUGT gene (SEQ ID NO: 10) derived from a Nicotiana tabacum strain, the FaGT2 gene (SEQ ID NO: 14) derived from a Fragaria ananassa strain, and the StUGT gene (SEQ ID NO: 18) derived from a Solanum tuberosum strain. Their nucleotide sequences were codon-optimized and then synthesized.
The nucleotide sequences of the primers used were as follows:
GjUGT1_F: 5'-CGGAATTCATGGTTCAGCAGCGTCAC-3' (SEQ ID NO: 3)
GjUGT1_R: 5'-CCGCTCGAGGTTGCTCTCCGCTTGATAG-3' (SEQ ID NO: 4)
GT1-316_F: 5'-CGGGATCCATGGTCTCCGAGAGTCTAG-3' (SEQ ID NO: 7)
GT1-316_R: 5'-ACGCGTCGACTGCGGATGATCCGACTAAC-3' (SEQ ID NO: 8)
NtUGT_F: 5'-CGGGATCCATGGTTCAACCTCATGTCCT-3' (SEQ ID NO: 11)
NtUGT_R: 5'-CCCAAGCTTACAACCTTTTCCTACTTCTTGC-3' (SEQ ID NO: 12)
FaGT2_F: 5'-CGGGATCCATGGGTAGTGAGTCTT-3' (SEQ ID NO: 15)
FaGT2_R: 5'-CCCAAGCTTTGACTCTACTAACTCGACTT-3' (SEQ ID NO: 16)
StUGT_F: 5'-CCGGAATTCATGGTTCAACCTCATGTGCTT-3' (SEQ ID NO: 19)
StUGT_R: 5'-CCGCTCGAGACAAGATTTTCCCACTTCCTG-3' (SEQ ID NO: 20)
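The primers above carry short 5' extensions that look like restriction enzyme recognition sites. The following is a minimal Python sketch, not part of the described method, that scans a primer for five well-known recognition sequences. The choice of enzymes and the names RESTRICTION_SITES, PRIMERS, and find_sites are assumptions made for illustration; the text does not state which enzymes were used with these primers.

```python
# Recognition sequences of five common restriction enzymes.
# Assumption: the text does not name the enzymes used with these primers.
RESTRICTION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
}

# Primer sequences copied from Example 1 (written 5' to 3').
PRIMERS = {
    "GjUGT1_F": "CGGAATTCATGGTTCAGCAGCGTCAC",
    "GjUGT1_R": "CCGCTCGAGGTTGCTCTCCGCTTGATAG",
    "GT1-316_F": "CGGGATCCATGGTCTCCGAGAGTCTAG",
    "GT1-316_R": "ACGCGTCGACTGCGGATGATCCGACTAAC",
    "NtUGT_F": "CGGGATCCATGGTTCAACCTCATGTCCT",
    "NtUGT_R": "CCCAAGCTTACAACCTTTTCCTACTTCTTGC",
}


def find_sites(primer: str) -> list[str]:
    """Return the enzymes in RESTRICTION_SITES whose site occurs in the primer."""
    seq = primer.upper()
    return [name for name, site in RESTRICTION_SITES.items() if site in seq]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {', '.join(find_sites(seq)) or 'none'}")
```

A check of this kind only reports that a recognition sequence is present in the primer; whether that site was actually used for cloning would have to be confirmed against the vector map.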
The pET21α(+) vectors into which each synthesized gene had been cloned, together with the empty pET21α(+) vector, were each introduced into the BL21 strain, which was then cultured in 50 mL of LB (Luria-Bertani) medium under aerobic conditions at 250 rpm. Culturing was carried out at 37°C until the OD reached 0.6, at which point expression was induced with 1 mM IPTG, followed by culturing at 30°C for 4 hours. The cells were then recovered by centrifugation at 4°C and 3800 rpm for 20 minutes and washed with 50 mL of 50 mM Tris buffer (pH 7.0). The cells were then suspended in 10 mL of Tris buffer, treated with 0.2 mg/mL lysozyme and 0.1 mM PMSF, and disrupted by sonication at 25% amplitude with cycles of a 10-second pulse followed by a 10-second rest. The lysate was then centrifuged at 4°C and 13200 rpm for 10 minutes to separate the supernatant (soluble protein) from the precipitate (insoluble pellet), and the precipitate was suspended in a small amount of Tris buffer. Equal amounts of protein, quantified by the Bradford protein assay, were subjected to SDS-PAGE to confirm protein expression (FIG. 1).
FIG. 1 is an electrophoresis image showing the result of expressing candidate genes for glycosyltransferases capable of converting crocetin to crocin-2.
As shown in FIG. 1, various glycosyltransferases with sizes of 52030 to 61389 Da were expressed. The GjUGT1 gene (SEQ ID NO: 2), GT1-316 gene (SEQ ID NO: 6), and NtUGT gene (SEQ ID NO: 10) were expressed normally, whereas the FaGT2 gene (SEQ ID NO: 14) and StUGT gene (SEQ ID NO: 18) were not.
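As a rough plausibility check on the band sizes reported above, a protein's mass can be estimated from its residue count. The sketch below assumes the common rule of thumb of about 110 Da per residue, which is an approximation and not a value from this document. The residue count of 474 is the length given for SEQ ID NO: 1 (recombinant GjUGT1) in the sequence listing, and approx_mass_da is a hypothetical helper name.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein (assumption).
AVG_RESIDUE_MASS_DA = 110.0


def approx_mass_da(n_residues: int) -> float:
    """Estimate protein mass in daltons from its residue count."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return n_residues * AVG_RESIDUE_MASS_DA


if __name__ == "__main__":
    # 474 is the length given for SEQ ID NO: 1 in the sequence listing.
    mass = approx_mass_da(474)
    print(f"GjUGT1 estimate: {mass:.0f} Da ({mass / 1000:.1f} kDa)")
    # Size range reported for the expressed enzymes in FIG. 1.
    print("within reported range:", 52030 <= mass <= 61389)
```

The estimate ignores any vector-encoded tag and the actual amino acid composition, so it is suitable only for checking the order of magnitude.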
Next, whether the three normally expressed glycosyltransferases (GjUGT1, GT1-316, and NtUGT) could convert crocetin to crocin-2 was evaluated in vitro.
Briefly, the reaction volume for confirming glycosyltransferase activity was set to 100 μL, and the reaction was composed of 50 mM Tris-HCl buffer (pH 7.0), 5 mM UDP-glucose, 0.1 mM encapsulated crocetin, and 25 μg of cell lysate (crude protein extract).
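The final concentrations above can be turned into pipetting volumes with the dilution relation C1·V1 = C2·V2. The following Python sketch illustrates this for the 100 μL reaction. The stock concentrations (1 M Tris-HCl, 100 mM UDP-glucose, 10 mM crocetin) are hypothetical values chosen for illustration and do not appear in the text, and stock_volume_ul is an illustrative helper name.

```python
def stock_volume_ul(final_mm: float, stock_mm: float, total_ul: float) -> float:
    """Stock volume (uL) giving final_mm in total_ul, from C1*V1 = C2*V2."""
    if stock_mm <= 0 or final_mm > stock_mm:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_mm * total_ul / stock_mm


TOTAL_UL = 100.0  # reaction volume stated in Example 1

# name -> (final concentration in mM from Example 1, hypothetical stock in mM)
COMPONENTS = {
    "Tris-HCl pH 7.0": (50.0, 1000.0),
    "UDP-glucose": (5.0, 100.0),
    "encapsulated crocetin": (0.1, 10.0),
}

if __name__ == "__main__":
    used = 0.0
    for name, (final_mm, stock_mm) in COMPONENTS.items():
        vol = stock_volume_ul(final_mm, stock_mm, TOTAL_UL)
        used += vol
        print(f"{name}: {vol:.1f} uL")
    # The remaining volume is shared by the 25 ug of lysate and water.
    print(f"lysate + water: {TOTAL_UL - used:.1f} uL")
```

The volume of lysate that delivers 25 μg of protein depends on the measured protein concentration of each extract, so it is left as part of the remainder here.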
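Tris-HCl buffer (pH 7.0) at a final concentration of 50 mM and the substrates, 5 mM UDP-glucose and 0.1 mM encapsulated crocetin, were combined first, and 25 μg of cell lysate (GjUGT1 enzyme, GT1-316 enzyme, or NtUGT enzyme) was added last; the mixture was then reacted at 30°C for 2 hours. The reaction was terminated by adding 200 μL of cold methanol (chilled to -20°C). Each reaction sample was centrifuged at 4°C and 13200 rpm for 10 minutes, and the supernatant was analyzed by HPLC to determine how much crocetin had been converted to crocin-2 (FIG. 2). Crocetin and crocin-2 were identified by HPLC retention time and ultraviolet-visible (UV-Vis) spectrum. Briefly, the analysis used A: 100% MeOH (2% formic acid) and B: 100% DDW (2% formic acid) as the mobile phases. The gradient started at 50% solvent A, reached 80% A by 60 minutes and 100% A by 80 minutes, returned to 50% A by 90 minutes, and was held at 50% A until 100 minutes, for a total run time of 100 minutes. A Zorbax Eclipse XDB-C18 column (4.6 × 150 mm, 5 μm; Agilent Technology) was used as the stationary phase at a flow rate of 0.8 mL/min.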
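The gradient described above can be written as a table of breakpoints. The sketch below computes the percentage of solvent A at any time in the 100-minute run. It assumes linear ramps between breakpoints, which is the usual reading of such a program but is not stated explicitly in the text; GRADIENT and percent_a are illustrative names.

```python
from bisect import bisect_right

# (time in min, % solvent A) breakpoints taken from the gradient in Example 1.
GRADIENT = [(0, 50), (60, 80), (80, 100), (90, 50), (100, 50)]


def percent_a(t_min: float, program=GRADIENT) -> float:
    """Percent solvent A at t_min, assuming linear ramps between breakpoints."""
    times = [t for t, _ in program]
    if not times[0] <= t_min <= times[-1]:
        raise ValueError("time outside the gradient program")
    i = bisect_right(times, t_min) - 1
    if i == len(program) - 1:  # exactly at the final breakpoint
        return float(program[-1][1])
    (t0, a0), (t1, a1) = program[i], program[i + 1]
    return a0 + (a1 - a0) * (t_min - t0) / (t1 - t0)


if __name__ == "__main__":
    for t in (0, 30, 60, 70, 85, 95, 100):
        a = percent_a(t)
        print(f"{t:>3} min: A = {a:.1f}%, B = {100 - a:.1f}%")
```

Passing a different breakpoint list as the program argument would let the same function describe other gradient programs.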
FIG. 2 is a graph showing the result of converting crocetin to crocin-2 in vitro using the GjUGT1 enzyme, GT1-316 enzyme, or NtUGT enzyme.
As shown in FIG. 2, the GT1-316 enzyme did not convert crocetin to crocin-2; the GjUGT1 enzyme converted most of the crocetin to crocin-1 and only a small amount to crocin-2; and the NtUGT enzyme converted crocetin to comparable levels of crocin-1 and crocin-2.
Accordingly, it was confirmed that the enzymes capable of converting crocetin to crocin-2 are the NtUGT enzyme and the GjUGT1 enzyme.
Example 2: Identification of the secondary glycosyltransferase
To identify a glycosyltransferase (UDP-glucosyltransferase-2, UGT-2) capable of converting crocin-2 into crocin-4 (crocetin digentiobiosyl ester), the CaUGT3 gene (SEQ ID NO: 22) derived from a Catharanthus roseus strain, the NsUGT gene (SEQ ID NO: 26) derived from a Nicotiana sylvestris strain, and the SpUGT gene (SEQ ID NO: 30) derived from a Solanum pennellii strain were identified. Their nucleotide sequences were codon-optimized and then synthesized, and the genes were expressed by the method of Example 3 above (FIG. 3). The nucleotide sequences of the primers used in synthesizing each gene were as follows:
CaUGT3_F: 5'-CGGGATCCATGGCCACCGAACAGCA-3' (SEQ ID NO: 23)
CaUGT3_R: 5'-ACGCGTCGACTACGCACAATTGCTTCAGC-3' (SEQ ID NO: 24)
NsUGT_F: 5'-CGGGATCCATGGACACAGAACACAGCA-3' (SEQ ID NO: 27)
NsUGT_R: 5'-ACGCGTCGACCAACATCATGTTATTACTTTTGC-3' (SEQ ID NO: 28)
SpUGT_F: 5'-CGGGATCCATGGGAACTCAAGTTACAGAA-3' (SEQ ID NO: 31)
SpUGT_R: 5'-ACGCGTCGACATTGGTGTTGATCTTACAGAAC-3' (SEQ ID NO: 32)
FIG. 3 is an electrophoresis image showing the result of expressing candidate genes for glycosyltransferases capable of converting crocin-2 to crocin-4.
As shown in FIG. 3, it was confirmed that various glycosyltransferases with sizes of 50 to 52 kDa were expressed.
Next, whether each of the expressed glycosyltransferases could convert crocin-2 to crocin-4 was evaluated in vitro.
Briefly, 16 μg of GjUGT1 or NtUGT expressed in Example 3 above was added to a mixture of 50 mM Tris-HCl (pH 7.0), 5 mM UDP-glucose, and 0.1 mM encapsulated crocetin, and reacted at 30°C for 30 minutes. Then, 50 mM Tris-HCl (pH 7.0) and 7.5 mM UDP-glucose were added, 55 μg of CaUGT3, SpUGT, or NsUGT was added in combination, and the mixture was reacted at 30°C for 2 hours. Finally, the reaction was terminated by adding 200 μL of cold methanol (chilled to -20°C). Each reaction sample was centrifuged at 4°C and 13200 rpm for 10 minutes, and the supernatant was analyzed by HPLC to determine how much crocetin had been converted to crocin-4 (FIG. 4).
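The UDP-glucose demand of the two-step reaction follows from the structures named in the text: crocin-2 is the diglucosyl ester of crocetin and crocin-4 is the digentiobiosyl ester, and gentiobiose is a disaccharide of two glucose units. The sketch below computes the minimum UDP-glucose needed for complete conversion under the assumption that each glucose transferred consumes one UDP-glucose; GLUCOSE_UNITS and udp_glucose_needed_mm are illustrative names.

```python
# Glucose units carried per crocetin backbone.
# Crocin-2 is the diglucosyl ester and crocin-4 the digentiobiosyl ester;
# gentiobiose is a disaccharide of two glucose units.
GLUCOSE_UNITS = {"crocetin": 0, "crocin-2": 2, "crocin-4": 4}


def udp_glucose_needed_mm(substrate_mm: float, start: str, product: str) -> float:
    """Minimum UDP-glucose (mM) for complete conversion of start into product."""
    transfers = GLUCOSE_UNITS[product] - GLUCOSE_UNITS[start]
    if transfers < 0:
        raise ValueError("product must carry at least as many glucose units as start")
    return substrate_mm * transfers


if __name__ == "__main__":
    crocetin_mm = 0.1     # crocetin concentration from Example 2
    udp_glucose_mm = 5.0  # first-step UDP-glucose concentration from Example 2
    need = udp_glucose_needed_mm(crocetin_mm, "crocetin", "crocin-4")
    print(f"minimum UDP-glucose: {need:.1f} mM")
    print(f"supplied / required: {udp_glucose_mm / need:.1f}x")
```

Under these assumptions the first-step UDP-glucose alone already exceeds the stoichiometric requirement, so the additional UDP-glucose in the second step serves to keep the donor in excess rather than to meet a minimum.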
Unlike the analytical method used to confirm conversion to crocin-2, a different mobile phase and gradient were used here to prevent the retention times of the compounds from clustering at the beginning of the run. The analysis used A: 100% ACN and B: 100% DDW as the mobile phases. The gradient started at 20% solvent A, reached 40% A by 20 minutes, 60% A by 25 minutes, 80% A by 30 minutes, and 100% A by 35 minutes, was held at 100% A until 40 minutes, and returned to 20% A by 45 minutes, for a total run time of 45 minutes. A Zorbax Eclipse XDB-C18 column (4.6 × 150 mm, 5 μm; Agilent Technology) was used as the stationary phase at a flow rate of 1 mL/min.
FIG. 4 is a graph showing the result of converting crocetin to crocin-4 in vitro using a combination of the GjUGT1 enzyme or NtUGT enzyme with the CaUGT3 enzyme, SpUGT enzyme, or NsUGT enzyme.
As shown in FIG. 4, it was confirmed that only the combination of the GjUGT1 enzyme or NtUGT enzyme with the CaUGT3 enzyme could produce crocin-4 from crocetin.
Example 3: Production of crocin-4 using a transformed microorganism
An attempt was made to produce crocin-4 from crocetin using a transformed microorganism into which the genes of the enzymes identified in Examples 1 and 2 had been introduced.
Example 3-1: Enhancement and modularization of metabolic pathway genes
First, the E. coli-derived genes ispA (geranyl diphosphate/farnesyl diphosphate synthase), idi (isopentenyl-diphosphate Δ-isomerase), dxs (1-deoxy-D-xylulose-5-phosphate synthase), and dxr (1-deoxy-D-xylulose 5-phosphate reductoisomerase) were introduced into the chromosome of E. coli strain MG1655, and the strain was transformed so that the introduced genes are expressed from the lac promoter, producing an E. coli strain with an enhanced MEP pathway (primary mutant strain).
Next, the zeaxanthin synthesis genes CrtE, CrtB, CrtI, CrtY, and CrtZ derived from Pantoea agglomerans were introduced into the primary mutant strain, and the strain was transformed so that the introduced genes are expressed from the trc promoter, producing the ZEA-1 strain (secondary mutant strain), which has an enhanced MEP pathway and can produce zeaxanthin.
Example 3-2: Construction of an expression vector containing crocetin biosynthesis genes
First, the gene of CsCCD2, a carotenoid cleavage enzyme used in the biosynthesis of crocetin dialdehyde, was synthesized and amplified, and the amplified product was cloned into the EcoRI and HindIII sites of the pKK223-3 vector. It was then subcloned into the BglII and NotI sites of the pSTVM vector to construct the expression vector pSTVM_CsCCD2.
The nucleotide sequences of the primers used for cloning and subcloning the CsCCD2 gene were as follows:
CsCCD2_F: 5'-CGGAATTCATGGCGAACAAAGAAGAGG-3' (SEQ ID NO: 33)
CsCCD2_R: 5'-CCCAAGCTTTTAGGTCTCCGCTTGATGC-3' (SEQ ID NO: 34)
sub_CCD2_F: 5'-GGAAGATCTGCTGTGCAGGTCGTAAA-3' (SEQ ID NO: 37)
sub_CCD2_R: 5'-ATAAGAATGCGGCCGCGAAACGCAAAAAGGCCA-3' (SEQ ID NO: 38)
Next, the aldH gene used in the biosynthesis of crocetin was obtained and amplified from the Synechococcus elongatus PCC 7942 strain, and the amplified product was cloned into the XbaI and EcoRI sites of the pUCM vector. It was then subcloned into the pSTVM_CsCCD2 vector constructed above to produce pSTVM_CsCCD2_aldH7942.
The nucleotide sequences of the primers used for cloning and subcloning the aldH gene were as follows:
aldH_F: 5'-GCTCTAGAAGGAGGATTACAAAATGACTGCTGTCGTTCTCC-3' (SEQ ID NO: 35)
aldH_R: 5'-CGGAATTCCTAGAGCTTGCGGAAGAG-3' (SEQ ID NO: 36)
sub_aldH_F: 5'-AAGAGCGUCTAGAAGGAGGATTACAAAA-3' (SEQ ID NO: 39)
sub_aldH_R: 5'-AGCTGATUGTACTGAGAGTGCACCAT-3' (SEQ ID NO: 40)
Example 3-3: Construction of an expression vector containing glycosyltransferase genes
First, the NtUGT gene, which was confirmed in Example 1 to be suitable for producing crocin-2 from crocetin, was cloned into the pUCrop vector to construct the pUCrop_NtUGT vector. The nucleotide sequences of the primers used for this cloning were as follows:
pUCrop_NtUGT_F: 5'-GCTCTAGAAGAGGATTACAAAATGGTTCAACCTCATGTCC-3' (SEQ ID NO: 41)
pUCrop_NtUGT_R: 5'-TCCCCCCGGGTTAACAACCTTTTCCTACTTCTTG-3' (SEQ ID NO: 42)
Next, the CaUGT3 gene, which was confirmed in Example 2 to be suitable for producing crocin-4 from crocin-2, was cloned into the pUCrop_NtUGT vector constructed above to produce an expression vector containing the glycosyltransferase genes (pUCrop_NtUGT_CaUGT3). The nucleotide sequences of the primers used for this cloning were as follows:
pUCrop_NtUGT_CaUGT_F: 5'-TCCCCCCGGGAGGAGGACAGCTAAATGGCCACCGAACAGC-3' (SEQ ID NO: 43)
pUCrop_NtUGT_CaUGT_R: 5'-AAGGAAAAAAGCGGCCGCTTATACGCACAATTGCTTCAGC-3' (SEQ ID NO: 44)
Example 3-4: Construction of the transformed microorganism
The expression vector containing the crocetin biosynthesis genes constructed in Example 3-2 and the expression vector containing the glycosyltransferase genes constructed in Example 3-3 were introduced into the ZEA-1 strain constructed in Example 3-1. The cells were spread on plate medium supplemented with ampicillin and chloramphenicol and incubated statically at 37°C for 12 hours to obtain colonies. The colonies were inoculated into 150 mL of TB (Terrific Broth) medium in a 500 mL flask and cultured with shaking at 37°C and 250 rpm for 12 hours.
Batch fermentation was carried out using the culture obtained from the 12-hour shaking culture. The fermentation medium was 1.5 L of TB (Terrific Broth) medium containing 20 g/L glycerol as the carbon source, supplemented with both 50 μg/mL chloramphenicol and 100 μg/mL ampicillin. Culturing was carried out at 30°C until the OD600 reached 5 to 6, after which the temperature was shifted to 20°C and culturing was continued. During culturing, the oxygen concentration was kept at 30% and air was supplied at 1 vvm (FIG. 5).
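The concentrations and aeration rate above scale directly with the working volume. The sketch below converts them into absolute amounts for a given batch volume, using the definition of vvm as volume of gas per volume of liquid per minute. The function name batch_amounts and the dictionary keys are illustrative and do not appear in the text.

```python
def batch_amounts(volume_l: float) -> dict[str, float]:
    """Amounts for a batch of volume_l litres at the concentrations in Example 3-4."""
    if volume_l <= 0:
        raise ValueError("volume_l must be positive")
    volume_ml = volume_l * 1000.0
    return {
        "glycerol_g": 20.0 * volume_l,                    # 20 g/L
        "chloramphenicol_mg": 50.0 * volume_ml / 1000.0,  # 50 ug/mL
        "ampicillin_mg": 100.0 * volume_ml / 1000.0,      # 100 ug/mL
        "air_l_per_min": 1.0 * volume_l,                  # 1 vvm
    }


if __name__ == "__main__":
    # 1.5 L is the fermentation volume stated in Example 3-4.
    for key, value in batch_amounts(1.5).items():
        print(f"{key}: {value:g}")
```

For the 1.5 L batch this corresponds to 30 g of glycerol, 75 mg of chloramphenicol, 150 mg of ampicillin, and an air flow of 1.5 L/min.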
도 5는 형질전환 미생물의 구성을 나타내는 개략도 및 성장곡선을 나타내는 그래프이다.5 is a schematic diagram showing the composition of a transformed microorganism and a graph showing a growth curve.
상기 발효가 종료된 후, 배양액에 포함된 크로신 화합물을 HPLC분석을 통해 확인하였다(도 6).After the fermentation was completed, the crosine compound contained in the culture was confirmed through HPLC analysis (FIG. 6).
이때, HPLC는 A : 100% MeOH(25mM formic acid) 와 B : 100% DDW(25mM formic acid)를 이동상으로 하여 분석하였다. 농도구배 조건으로는 A용매를 50%로 50분까지 , A용매를 80%로 60분까지, A용매를 100%로 80분까지로 하였다. Zorbax eclipse XDB-C18 컬럼 (4.6× 150 mm 혹은 250 mm, 5 μm; Agilent Technology)을 고정상으로 0.8ml/min의 유속으로 분석하였다. 구조분석을 위해 HPLC retention time 과 흡수 spectrum, mass spectrum을 비교하여 분석하였다. Mass spectrum은 Varian 1200L LC/MS system을 이용하여 positive 모드와 negative 모드를 모두 모니터 하였고, 이온화에는 APCI (atmosphere pressure chemical ionization) 모듈을 사용하였다.At this time, HPLC was analyzed using A: 100% MeOH (25 mM formic acid) and B: 100% DDW (25 mM formic acid) as mobile phases. The concentration gradient conditions were: 50% of solvent A for 50 minutes, 80% of solvent A for 60 minutes, and 100% of solvent A for 80 minutes. A Zorbax eclipse XDB-C18 column (4.6×150 mm or 250 mm, 5 μm; Agilent Technology) was analyzed as a stationary phase at a flow rate of 0.8 ml/min. For structural analysis, HPLC retention time, absorption spectrum, and mass spectrum were compared and analyzed. For mass spectrum, both positive and negative modes were monitored using Varian 1200L LC/MS system, and APCI (atmosphere pressure chemical ionization) module was used for ionization.
FIG. 6 is a graph showing the production of crocin-2 and crocin-4 by the transformed microorganism.
As shown in FIG. 6, it was confirmed that crocin-2 and crocin-4 were produced.
As the above results show, the crocin-2 produced by the transformed microorganism was analyzed to have been converted from crocetin generated by the crocetin biosynthesis genes introduced into the microorganism, and crocin-4 was analyzed to have been produced from the crocin-2 thus formed.
From the above description, those skilled in the art to which the present invention pertains will understand that the invention may be embodied in other specific forms without changing its technical spirit or essential features. In this regard, the embodiments described above should be understood as illustrative in all respects and not restrictive. The scope of the present invention should be construed as defined by the claims set out below rather than by the above detailed description, and as including all changes or modified forms derived from the meaning and scope of the claims and their equivalents.
<110> AJOU UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION <120> Process for preparing crocin-4 using enzyme reaction <130> KPA190916-KR <160> 44 <170> KopatentIn 2.0 <210> 1 <211> 474 <212> PRT <213> Artificial Sequence <220> <223> recombinant GjUGT1 <400> 1 Met Val Gln Gln Arg His Val Leu Leu Ile Thr Tyr Pro Ala Gln Gly 1 5 10 15 His Ile Asn Pro Ala Leu Gln Phe Ala Gln Arg Leu Leu Arg Met Gly 20 25 30 Ile Gln Val Thr Leu Ala Thr Ser Val Tyr Ala Leu Ser Arg Met Lys 35 40 45 Lys Ser Ser Gly Ser Thr Pro Lys Gly Leu Thr Phe Ala Thr Phe Ser 50 55 60 Asp Gly Tyr Asp Asp Gly Phe Arg Pro Lys Gly Val Asp His Thr Glu 65 70 75 80 Tyr Met Ser Ser Leu Ala Lys Gln Gly Ser Asn Thr Leu Arg Asn Val 85 90 95 Ile Asn Thr Ser Ala Asp Gln Gly Cys Pro Val Thr Cys Leu Val Tyr 100 105 110 Thr Leu Leu Leu Pro Trp Ala Ala Thr Val Ala Arg Glu Cys His Ile 115 120 125 Pro Ser Ala Leu Leu Trp Ile Gln Pro Val Ala Val Met Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Arg Gly Tyr Glu Asp Asp Val Lys Asn Asn Ser Asn 145 150 155 160 Asp Pro Thr Trp Ser Ile Gln Phe Pro Gly Leu Pro Ser Met Lys Ala 165 170 175 Lys Asp Leu Pro Ser Phe Ile Leu Pro Ser Ser Asp Asn Ile Tyr Ser 180 185 190 Phe Ala Leu Pro Thr Phe Lys Lys Gln Leu Glu Thr Leu Asp Glu Glu 195 200 205 Glu Arg Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu Pro Gln 210 215 220 Ala Leu Lys Ala Ile Glu Ser Tyr Asn Leu Ile Ala Ile Gly Pro Leu 225 230 235 240 Thr Pro Ser Ala Phe Leu Asp Gly Lys Asp Pro Ser Glu Thr Ser Phe 245 250 255 Ser Gly Asp Leu Phe Gln Lys Ser Lys Asp Tyr Lys Glu Trp Leu Asn 260 265 270 Ser Arg Pro Ala Gly Ser Val Val Tyr Val Ser Phe Gly Ser Leu Leu 275 280 285 Thr Leu Pro Lys Gln Gln Met Glu Glu Ile Ala Arg Gly Leu Leu Lys 290 295 300 Ser Gly Arg Pro Phe Leu Trp Val Ile Arg Ala Lys Glu Asn Gly Glu 305 310 315 320 Glu Glu Lys Glu Glu Asp Arg Leu Ile Cys Met Glu Glu Leu Glu Glu 325 330 335 Gln Gly Met Ile Val Pro Trp Cys Ser Gln Ile Glu Val Leu Thr His 340 345 350 Pro Ser Leu Gly Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Leu 
355 360 365 Glu Thr Leu Val Cys Gly Val Pro Val Val Ala Phe Pro His Trp Thr 370 375 380 Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val Trp Glu Thr Gly 385 390 395 400 Val Arg Val Val Pro Asn Glu Asp Gly Thr Val Glu Ser Asp Glu Ile 405 410 415 Lys Arg Cys Ile Glu Thr Val Met Asp Asp Gly Glu Lys Gly Val Glu 420 425 430 Leu Lys Arg Asn Ala Lys Lys Trp Lys Glu Leu Ala Arg Glu Ala Met 435 440 445 Gln Glu Asp Gly Ser Ser Asp Lys Asn Leu Lys Ala Phe Val Glu Asp 450 455 460 Ala Gly Lys Gly Tyr Gln Ala Glu Ser Asn 465 470 <210> 2 <211> 1425 <212> DNA <213> Artificial Sequence <220> <223> recombinant GjUGT1 <400> 2 atggttcagc agcgtcacgt tttgttgatt acctatccag cacagggtca cattaaccca 60 gcgttgcagt tcgcacagag attgttgcgt atgggtattc aagttaccct ggcgaccagc 120 gtgtacgctc tgagccgtat gaaaaagagc agcggtagca ccccgaaagg tctgaccttt 180 gcaaccttca gcgatggtta cgatgacggt tttcgtccga aaggtgttga ccataccgaa 240 tatatgagca gcctggcgaa gcaaggtagc aataccctgc gtaatgtgat caacaccagc 300 gctgatcagg gttgcccggt tacctgtttg gtgtatacct tgttgctgcc atgggctgct 360 accgttgcac gtgaatgcca cattccgagc gcgctgctgt ggatccaacc ggttgctgtg 420 atggatatct actattacta tttccgtggt tatgaagatg acgttaagaa taacagcaat 480 gacccgacct ggagcatcca gtttccgggt ctgccgagca tgaaagctaa ggatctgccg 540 agcttcattc tgccgagcag cgacaacatc tacagctttg cactgccgac cttcaaaaag 600 caactggaaa ccctggatga agaagaacgt ccgaaagttc tggtgaatac ctttgacgcg 660 ctggaaccgc aggcactgaa ggcgattgaa agctataacc tgattgcaat tggtccactg 720 accccgagcg cgttcctgga tggtaaagac ccgagcgaaa ccagctttag cggtgacctg 780 tttcagaaaa gcaaggacta caaggaatgg ctgaatagcc gtccggctgg tagcgttgtg 840 tatgttagct ttggtagcct gctgaccctg ccgaaacaac agatggaaga aattgcacgt 900 ggtctgctga agagcggtcg tccgttcctg tgggttattc gtgcgaaaga aaacggtgaa 960 gaagaaaagg aagaagatcg tctgatttgc atggaagaac tggaagaaca gggtatgatt 1020 gttccgtggt gtagccagat cgaagtgctg acccatccga gcctgggttg ctttgttacc 1080 cactgtggtt ggaatagcac cctggaaacc ctggtttgcg gtgtgccggt tgtggctttc 1140 ccgcattgga ccgatcaagg taccaatgca aaactgatcg 
aagacgtttg ggaaaccggt 1200 gtgcgtgttg tgccgaacga agatggtacc gttgaaagcg acgaaattaa acgttgtatc 1260 gaaaccgtta tggatgacgg tgaaaaaggt gtggaactga agcgtaacgc gaaaaagtgg 1320 aaggaactgg ctcgtgaagc aatgcaggaa gatggtagca gcgacaaaaa cctgaaagcg 1380 ttcgtggagg atgcgggcaa gggctatcaa gcggagagca actaa 1425 <210> 3 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 3 cggaattcat ggttcagcag cgtcac 26 <210> 4 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 4 ccgctcgagg ttgctctccg cttgatag 28 <210> 5 <211> 502 <212> PRT <213> Artificial Sequence <220> <223> recombinant GT1-316 <400> 5 Met Val Ser Glu Ser Leu Gly His Leu Phe Leu Val Ser Phe Pro Gly 1 5 10 15 Gln Gly His Val Asn Pro Leu Leu Arg Leu Gly Lys Ile Leu Ala Ser 20 25 30 Lys Gly Phe Leu Val Thr Phe Ser Thr Thr Glu Thr Thr Gly Glu Gln 35 40 45 Met Arg Lys Ala Ser Asp Ile Ile Asp Lys Leu Thr Pro Phe Gly Asp 50 55 60 Gly Phe Ile Arg Phe Glu Phe Ile Ala Asp Gly Trp Glu Glu Asp Glu 65 70 75 80 Pro Arg Arg Gln Asp Leu Asp Gln Tyr Leu Leu Gln Leu Glu Leu Val 85 90 95 Gly Lys Gln Val Ile Pro Gln Met Ile Lys Lys Asn Ala Glu Gln Gly 100 105 110 Arg Pro Val Ser Cys Leu Ile Asn Asn Pro Phe Ile Pro Trp Val Thr 115 120 125 Asp Val Ala Thr Thr Leu Gly Leu Pro Ser Ala Met Leu Trp Val Gln 130 135 140 Ser Cys Ala Cys Phe Ala Ser Tyr Tyr His Tyr Tyr His Gly Thr Val 145 150 155 160 Pro Phe Pro Asp Glu Glu His Pro Glu Ile Asp Val Gln Leu Pro Trp 165 170 175 Met Pro Leu Leu Lys Tyr Asp Glu Val Pro Ser Tyr Leu Tyr Pro Thr 180 185 190 Thr Pro Tyr Pro Phe Leu Arg Arg Ala Ile Leu Gly Gln Tyr Lys Asn 195 200 205 Leu Asp Lys Pro Phe Cys Ile Leu Met Glu Thr Phe Glu Glu Leu Glu 210 215 220 Pro Glu Leu Ile Lys His Met Ser Glu Ile Phe Pro Ile Lys Ala Val 225 230 235 240 Gly Pro Leu Phe Arg Asn Thr Lys Ala Pro Lys Thr Thr Val His Gly 245 250 255 Asp Phe Leu Lys Ala Asp Asp Cys Ile Glu Trp Leu Asp Thr Lys Pro 260 265 270 Pro Ser Ser Val Val Tyr Val Ser Phe Gly Ser Val Val Gln Leu Lys 275 280 285 Gln Asp Gln Trp Asn 
Glu Ile Ala Tyr Gly Leu Leu Asn Ser Gly Val 290 295 300 Ser Phe Leu Leu Val Met Lys Pro Ala His Lys Asp Ala Gly His Asp 305 310 315 320 Leu Leu Val Leu Pro Asp Gly Phe Leu Glu Lys Ala Gly Asp Arg Gly 325 330 335 Lys Val Val Gln Trp Ser Pro Gln Glu Lys Val Leu Gly His Pro Ser 340 345 350 Val Ala Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Met Glu Ala 355 360 365 Leu Thr Ser Gly Met Pro Val Val Ala Phe Pro Gln Trp Gly Asp Gln 370 375 380 Val Thr Asn Ala Lys Tyr Leu Val Asp Ile Leu Lys Val Gly Val Arg 385 390 395 400 Met Cys Arg Gly Glu Ala Glu Asn Lys Leu Ile Thr Arg Asp Glu Ile 405 410 415 Glu Lys Cys Leu Leu Glu Ala Thr Val Gly Pro Lys Ala Val Glu Met 420 425 430 Lys Gln Asn Ala Met Lys Trp Lys Glu Ala Ala Glu Ala Ala Val Ala 435 440 445 Glu Gly Gly Ser Ser Asp Gln Asn Ile Arg Tyr Phe Thr Asp Asp Ile 450 455 460 Val Lys Ala Asn Glu Ser Glu Ile Ala Arg Lys Cys Ile Gly Ser Asn 465 470 475 480 Glu Phe Pro Val Ser Val Val Val Lys Ser Asn Glu Lys Val Asp Glu 485 490 495 Leu Val Gly Ser Ser Ala 500 <210> 6 <211> 1506 <212> DNA <213> Artificial Sequence <220> <223> recombinant GT1-316 <400> 6 atggtctccg agagtctagg tcacctattc ctagtatctt ttccggggca ggggcatgta 60 aaccctttat taagattagg aaaaatcctt gctagtaaag gtttcctagt taccttttcc 120 accaccgaga caacagggga gcagatgcgt aaggcaagtg acatcataga taaacttaca 180 ccattcggtg acggcttcat tcgttttgaa ttcattgccg acggttggga ggaggatgaa 240 ccccgtaggc aagaccttga tcaatatcta ctacaacttg aattagtggg caagcaggta 300 ataccgcaaa tgattaagaa gaatgccgaa cagggtagac ccgtgtcctg cctgatcaac 360 aaccccttca tcccatgggt cacagatgtc gctacgactc taggccttcc ttcagctatg 420 ctgtgggtac aaagttgtgc ttgttttgcg agctactatc actactacca tggcacagtc 480 ccttttcccg atgaggagca ccccgaaatt gatgtacagc taccgtggat gcctttgctt 540 aagtatgatg aggtccctag ttatctttac cctacgacgc cgtacccctt tttaaggcgt 600 gcaatattgg gtcagtataa aaacctggac aagccgtttt gcatcttaat ggagactttt 660 gaagagcttg aaccggaact tatcaagcat atgagtgaaa tttttccaat taaggcggtt 720 ggaccgctat tccgtaacac aaaagcgccc aaaacaactg tgcacggcga 
cttcttaaaa 780 gcagatgatt gtatagaatg gttggatacc aaacctccat cttctgtcgt ttacgtatca 840 ttcggctctg ttgtgcagct aaaacaggac caatggaacg agatagctta cggtttactt 900 aactctgggg tttccttttt attggtgatg aagcccgctc ataaagacgc tgggcatgac 960 cttctagtac ttcctgacgg cttcttagag aaagctggcg acagagggaa ggtggtccag 1020 tggtcacctc aagaaaaggt actaggtcac ccgagcgtag cctgctttgt aactcactgt 1080 ggatggaaca gcacgatgga agctctgacc tctggaatgc ccgtggtcgc ttttccccag 1140 tggggtgatc aggttacaaa tgcgaaatac ttggtagaca tactaaaggt aggagtcagg 1200 atgtgtaggg gagaggccga aaataagctt attacaagag atgaaattga aaaatgccta 1260 ttagaagcta cggtaggacc taaagctgtt gagatgaagc aaaacgccat gaaatggaag 1320 gaagctgctg aagccgcggt cgccgaaggt ggctcaagtg accaaaacat acgttacttt 1380 actgatgaca tcgtgaaagc aaatgagtcc gagattgcta gaaagtgtat cggctctaac 1440 gaattcccgg tgtcagtcgt tgttaaatct aatgagaaag tggatgagtt agtcggatca 1500 tccgca 1506 <210> 7 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 7 cgggatccat ggtctccgag agtctag 27 <210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 8 acgcgtcgac tgcggatgat ccgactaac 29 <210> 9 <211> 471 <212> PRT <213> Artificial Sequence <220> <223> recombinant NtUGT <400> 9 Met Val Gln Pro His Val Leu Leu Val Thr Phe Pro Ala Gln Gly His 1 5 10 15 Ile Asn Pro Cys Leu Gln Phe Ala Met Arg Leu Ile Arg Met Gly Ile 20 25 30 Glu Val Thr Phe Ala Thr Ser Val Phe Ala His Arg Arg Met Ala Lys 35 40 45 Thr Ala Thr Ser Thr Leu Pro Lys Gly Leu Asn Phe Ala Ala Phe Ser 50 55 60 Asp Gly Tyr Asp Asp Gly Phe Lys Ala Asp Glu His Asp Ser Gln His 65 70 75 80 Tyr Met Ser Glu Ile Lys Ser Arg Gly Ser Glu Thr Leu Lys Asp Ile 85 90 95 Ile Leu Lys Ser Ser Asp Glu Gly Arg Pro Val Thr Ser Leu Val Tyr 100 105 110 Ser Leu Leu Leu Pro Trp Ala Ala Asn Val Ala Arg Glu Phe His Ile 115 120 125 Pro Cys Ala Leu Leu Trp Ile Gln Pro Ala Thr Val Leu Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Asn Gly Ser Glu Asp Ala Ile Lys Gly Ser Thr Asn 145 150 155 160 Asp Pro Asn Trp Cys Ile Gln Leu Pro Asn Leu Pro Leu 
Leu Lys Ser 165 170 175 Gln Asp Leu Pro Ser Phe Leu Leu Ser Ser Asn Asn Asp Glu Lys Tyr 180 185 190 Ser Phe Ala Leu Pro Thr Phe Lys Glu Gln Leu Asp Thr Leu Asp Val 195 200 205 Glu Glu Asn Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu Gln 210 215 220 Glu Glu Leu Lys Ala Ile Glu Arg Tyr Asn Leu Ile Gly Ile Gly Pro 225 230 235 240 Leu Ile Pro Ser Ser Phe Leu Asp Gly Lys Asp Pro Leu Asp Ser Ser 245 250 255 Phe Gly Gly Asp Leu Phe Gln Lys Ser Asn Asp Tyr Ile Glu Trp Leu 260 265 270 Asn Ser Lys Asp Asn Ser Ser Val Ile Tyr Ile Ser Phe Gly Ser Leu 275 280 285 Leu Asn Leu Ser Lys Asn Gln Lys Glu Glu Ile Ala Lys Gly Leu Ile 290 295 300 Glu Ile Lys Arg Pro Phe Leu Trp Val Ile Arg Asp Gln Glu Asn Gly 305 310 315 320 Lys Gly Asp Glu Lys Glu Glu Glu Lys Leu Ser Cys Met Met Glu Leu 325 330 335 Glu Lys Gln Gly Lys Ile Val Pro Trp Cys Ser Gln Leu Glu Val Leu 340 345 350 Thr His Pro Ser Leu Gly Cys Phe Val Ser His Cys Gly Trp Asn Ser 355 360 365 Thr Leu Glu Ser Leu Ser Thr Gly Val Pro Val Val Ala Phe Pro His 370 375 380 Trp Thr Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val Trp Lys 385 390 395 400 Ile Gly Val Arg Leu Lys Lys Asn Glu Asp Gly Val Val Glu Ser Glu 405 410 415 Glu Ile Lys Arg Cys Ile Asp Met Val Met Asn Gly Gly Glu Lys Gly 420 425 430 Glu Glu Met Arg Arg Asn Ala Gln Lys Trp Lys Glu Leu Ala Arg Glu 435 440 445 Ala Val Lys Glu Gly Gly Ser Ser Tyr Met Asn Leu Lys Ala Phe Val 450 455 460 Gln Glu Val Gly Lys Gly Cys 465 470 <210> 10 <211> 1413 <212> DNA <213> Artificial Sequence <220> <223> recombinant NtUGT <400> 10 atggttcaac ctcatgtcct acttgttaca tttccagcac aaggacatat taacccctgc 60 ttgcagtttg ccatgaggct gatccgtatg ggaatcgaag taacgtttgc tacctcagtg 120 tttgctcacc gtagaatggc taagaccgca acaagcaccc tgcccaaggg cctaaacttt 180 gccgcgttca gcgacggata cgacgatggg tttaaggcag acgagcatga ttcccagcac 240 tacatgtctg aaatcaagtc tagggggagc gagacgctga aggatataat tctgaagagt 300 tctgatgaag ggagacccgt cacgtccctt gtgtatagcc tgcttttacc ctgggctgca 360 aatgttgcga gggaatttca cataccctgt gcgttgctat 
ggatacaacc ggcaactgtt 420 ttggatattt attactacta ctttaatggg agtgaggacg ccataaaagg atctaccaac 480 gatccgaatt ggtgcatcca gttgccaaat ctacctctat tgaaatctca agacttacct 540 tccttcctac tgtcctcaaa taacgatgag aaatacagtt ttgctttgcc aacgttcaaa 600 gaacagctgg acacattgga tgtggaagaa aaccccaagg ttttggtcaa tacattcgat 660 gcgctagagc aggaagagct gaaggcgatt gagagatata atttaatcgg gatcggacct 720 ttgataccga gcagtttctt agatggaaag gacccgctgg attcaagttt tggtggcgac 780 ttatttcaga agtccaacga ttatattgaa tggctaaact ccaaagataa cagtagcgta 840 atatatatct cattcggttc actgttgaac ctatccaaaa accagaagga ggaaatagcg 900 aaagggctaa tcgagataaa aaggccattc ttgtgggtca tcagagacca ggaaaatggc 960 aaaggcgatg aaaaagaaga agaaaagttg agttgtatga tggaactaga gaagcaaggt 1020 aaaatcgtac catggtgctc acaactagaa gtcttgactc acccctccct tggttgcttc 1080 gtctctcact gcggttggaa ttcaacactt gagagtttgt ctacaggcgt gccagtggtt 1140 gcgttccctc attggaccga tcagggaact aacgcgaaac ttatcgaaga cgtatggaag 1200 atcggtgtgc gtttgaagaa aaatgaagat ggagtagttg agagcgagga gataaaaagg 1260 tgtatagaca tggttatgaa cggaggtgaa aagggagaag agatgcgtcg taacgcacag 1320 aagtggaaag aactagctcg tgaagccgtt aaagagggag gaagttctta tatgaactta 1380 aaagccttcg tgcaagaagt aggaaaaggt tgt 1413 <210> 11 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 11 cgggatccat ggttcaacct catgtcct 28 <210> 12 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 12 cccaagctta caaccttttc ctacttcttg c 31 <210> 13 <211> 555 <212> PRT <213> Artificial Sequence <220> <223> recombinant FaGT2 <400> 13 Met Gly Ser Glu Ser Leu Val His Val Phe Leu Val Ser Phe Ile Gly 1 5 10 15 Gln Gly His Val Asn Pro Leu Leu Arg Leu Gly Lys Arg Leu Ala Ala 20 25 30 Lys Gly Leu Leu Val Thr Phe Cys Thr Ala Glu Cys Val Gly Lys Glu 35 40 45 Met Arg Lys Ser Asn Gly Ile Thr Asp Glu Pro Lys Pro Val Gly Asp 50 55 60 Gly Phe Ile Arg Phe Glu Phe Phe Lys Asp Arg Trp Ala Glu Asp Glu 65 70 75 80 Pro Met Arg Gln Asp Leu Asp Leu Tyr Leu Pro Gln Leu Glu Leu Val 85 90 95 Gly Lys Glu Val Ile Pro Glu Met 
Ile Lys Lys Asn Ala Glu Gln Gly 100 105 110 Arg Pro Val Ser Cys Leu Ile Asn Asn Pro Phe Ile Pro Trp Val Cys 115 120 125 Asp Val Ala Glu Ser Leu Gly Leu Pro Ser Ala Met Leu Trp Val Gln 130 135 140 Ser Ala Ala Cys Leu Ala Ala Tyr Tyr His Tyr Tyr His Gly Leu Val 145 150 155 160 Pro Phe Pro Ser Glu Ser Asp Met Phe Cys Asp Val Gln Ile Pro Ser 165 170 175 Met Pro Leu Leu Lys Tyr Asp Glu Val Pro Ser Phe Leu Tyr Pro Thr 180 185 190 Ser Pro Tyr Pro Phe Leu Arg Arg Ala Ile Leu Gly Gln Tyr Gly Asn 195 200 205 Leu Glu Lys Pro Phe Cys Ile Leu Met Asp Thr Phe Gln Glu Leu Glu 210 215 220 Ser Glu Ile Ile Glu Tyr Met Ala Arg Leu Cys Pro Ile Lys Ala Val 225 230 235 240 Gly Pro Leu Phe Lys Asn Pro Lys Ala Gln Asn Ala Val Arg Gly Asp 245 250 255 Phe Met Glu Ala Asp Asp Ser Ile Ile Gly Trp Leu Asp Thr Lys Pro 260 265 270 Lys Ser Ser Val Val Tyr Ile Ser Phe Gly Ser Val Val Tyr Leu Lys 275 280 285 Gln Glu Gln Val Asp Glu Ile Ala His Gly Leu Leu Ser Ser Gly Val 290 295 300 Ser Phe Ile Trp Val Met Lys Pro Pro His Pro Asp Ser Gly Phe Glu 305 310 315 320 Leu Leu Val Leu Pro Glu Gly Phe Leu Glu Lys Ala Gly Asp Arg Gly 325 330 335 Lys Val Val Gln Trp Ser Pro Gln Glu Lys Ile Leu Glu His Pro Ser 340 345 350 Thr Ala Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Met Glu Ser 355 360 365 Leu Thr Ser Gly Met Pro Val Val Ala Phe Pro Gln Trp Gly Asp Gln 370 375 380 Val Thr Asp Ala Lys Tyr Leu Val Asp Glu Phe Lys Val Gly Val Arg 385 390 395 400 Met Cys Arg Gly Glu Ala Glu Asp Arg Val Ile Pro Arg Asp Glu Val 405 410 415 Glu Lys Cys Leu Leu Glu Ala Thr Ser Gly Ser Lys Ala Ala Glu Met 420 425 430 Lys Gln Asn Ala Leu Lys Trp Lys Ala Ala Ala Glu Ala Ala Phe Ser 435 440 445 Glu Gly Gly Ser Ser Asp Arg Asn Leu Gln Ala Phe Val Asp Glu Val 450 455 460 Arg Arg Ile Ser Ala Ser Leu Asn Ser Lys Ser Ser Ala Val Gly Tyr 465 470 475 480 Val Lys Ser Lys Ile Asn Gly Val Val Glu Tyr Val Asp Ser Lys Leu 485 490 495 Asn Gly Lys Ala Ala Pro Val Glu Glu Ala Asn Thr Arg Thr Asn Gly 500 505 510 Ile Ala Lys Val Glu Gln Pro Lys Ala 
Ala Asn Gly Lys Val Glu Ile 515 520 525 Ala Glu Leu Thr Pro Ile Asn Gly Lys Val Glu Ile Ala Glu Leu Lys 530 535 540 Pro Ile Asn Gly Lys Val Glu Leu Val Glu Ser 545 550 555 <210> 14 <211> 1665 <212> DNA <213> Artificial Sequence <220> <223> recombinant FaGT2 <400> 14 atgggtagtg agtctttggt tcacgtcttt cttgtgtcct ttattggcca aggacacgtc 60 aacccgcttc tgaggctggg gaaaagactg gcggctaaag ggctgcttgt aaccttttgc 120 acggcggagt gtgtcgggaa agaaatgagg aaatctaatg ggataaccga tgagcctaaa 180 ccagtaggag acggatttat acgttttgaa ttcttcaaag ataggtgggc agaagacgaa 240 cccatgcgtc aggacctgga tctatattta ccccagttgg agctagttgg aaaagaagtg 300 atacctgaga tgataaagaa gaacgcagaa caagggagac cagtatcatg cctaatcaat 360 aatccgttta ttccgtgggt ctgtgacgtc gccgaatctc ttggtttacc atccgcaatg 420 ctgtgggttc aatctgccgc ctgtctagct gcttactatc actactacca tggactagtc 480 ccattcccat ctgagtctga catgttttgt gatgtccaaa tacccagcat gccattgtta 540 aaatatgacg aggttcccag ctttttgtat ccaacgagcc catatccatt tttgagacgt 600 gcgatcttgg gtcaatacgg gaacttagag aagccgtttt gtatccttat ggatacgttt 660 caagaactag agtctgaaat aattgaatat atggccagat tatgtcccat aaaggccgtc 720 gggccattgt tcaagaatcc caaggctcaa aacgcggtta gaggcgactt tatggaagcc 780 gatgactcaa tcatcggatg gttggacaca aaacccaaga gttcagttgt ttacattagc 840 ttcgggagcg ttgtatactt aaagcaggag caagtggatg aaattgctca cggactttta 900 tcatccggcg tgtcttttat atgggtgatg aaacctcctc atccggatag cggatttgaa 960 ttgttggtat taccagaagg tttcctagaa aaggcgggag accgtggaaa agttgtgcaa 1020 tggagtccac aagaaaagat attagaacac ccttcaactg catgttttgt gacacactgc 1080 gggtggaata gtacgatgga aagtttaacc agcgggatgc ccgtggtcgc gtttccacag 1140 tggggggatc aggtaactga cgcgaagtac ctagtggatg aatttaaggt aggagttagg 1200 atgtgcagag gcgaggccga ggacagggtg attcctagag acgaggttga aaagtgtctt 1260 ttggaagcta cgtctggttc aaaagctgca gagatgaaac aaaatgcgct taagtggaaa 1320 gctgctgcag aggctgcgtt ctccgaagga ggatcttcag ataggaatct gcaagccttc 1380 gtcgacgaag tcagacgtat tagcgccagc ttaaattcta agtcctctgc tgtcggctat 1440 gtcaagagta agatcaacgg agtggtggaa tatgtcgata 
gtaagttaaa cgggaaagcc 1500 gccccggtgg aggaggcgaa cactcgtaca aatggcatcg caaaggtaga acaacctaag 1560 gcagcaaacg gcaaggtgga aatagctgaa ttaactccga ttaatggcaa ggttgagatt 1620 gccgaactga agccaataaa cgggaaagtc gagttagtag agtca 1665 <210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 15 cgggatccat gggtagtgag tctt 24 <210> 16 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 16 cccaagcttt gactctacta actcgactt 29 <210> 17 <211> 473 <212> PRT <213> Artificial Sequence <220> <223> recombinant StUGT <400> 17 Met Val Gln Pro His Val Leu Leu Val Thr Phe Pro Ala Gln Gly His 1 5 10 15 Ile Asn Pro Ser Leu Gln Phe Ala Lys Arg Leu Ile Lys Met Gly Ile 20 25 30 Glu Val Thr Phe Thr Thr Ser Val Phe Ala His Arg Arg Met Ala Lys 35 40 45 Thr Ala Ala Ser Asn Ala Pro Lys Gly Leu Asn Leu Ala Ala Phe Ser 50 55 60 Asp Gly Phe Asp Asp Gly Phe Lys Ser Asn Val Asp Asp Ser Lys Arg 65 70 75 80 Tyr Met Ser Glu Ile Arg Ser Arg Gly Ser Gln Thr Leu Arg Asp Ile 85 90 95 Ile Leu Lys Ser Ser Asp Glu Gly Arg Pro Val Thr Ser Leu Val Tyr 100 105 110 Thr Leu Leu Leu Pro Trp Ala Ala Glu Val Ala Arg Glu Leu His Ile 115 120 125 Pro Ser Ala Leu Leu Trp Ile Gln Pro Ala Thr Val Leu Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Asn Gly Tyr Glu Asp Glu Met Lys Cys Ser Ser Asn 145 150 155 160 Asp Pro Asn Trp Ser Ile Gln Leu Pro Arg Leu Pro Leu Leu Lys Ser 165 170 175 Gln Asp Leu Pro Ser Phe Leu Val Ser Ser Ser Ser Lys Asp Asp Lys 180 185 190 Tyr Ser Phe Ala Leu Pro Thr Phe Lys Glu Gln Leu Asp Thr Leu Asp 195 200 205 Gly Glu Glu Asn Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu 210 215 220 Leu Glu Pro Leu Lys Ala Ile Glu Lys Tyr Asn Leu Ile Gly Ile Gly 225 230 235 240 Pro Leu Ile Pro Ser Ser Phe Leu Gly Gly Lys Asp Ser Leu Glu Ser 245 250 255 Ser Phe Gly Gly Asp Leu Phe Gln Lys Ser Asp Asp Asp Tyr Met Glu 260 265 270 Trp Leu Asn Thr Lys Pro Lys Ser Ser Ile Val Tyr Ile Ser Phe Gly 275 280 285 Ser Leu Leu Asn Leu Ser Arg Asn Gln Lys Glu Glu Ile Ala Lys Gly 290 295 300 Leu Ile Glu 
Ile Lys Arg Pro Phe Leu Trp Val Ile Arg Asp Gln Glu 305 310 315 320 Asn Ile Lys Glu Val Glu Lys Glu Glu Glu Lys Leu Ser Cys Met Met 325 330 335 Glu Leu Glu Lys Gln Gly Lys Ile Val Pro Trp Cys Ser Gln Leu Glu 340 345 350 Val Leu Thr His Pro Ser Leu Gly Cys Phe Val Ser His Cys Gly Trp 355 360 365 Asn Ser Thr Leu Glu Ser Leu Ser Ser Gly Val Pro Val Val Ala Phe 370 375 380 Pro His Trp Thr Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val 385 390 395 400 Trp Lys Thr Gly Val Arg Met Arg Val Ser Glu Asp Gly Val Val Glu 405 410 415 Ser Glu Glu Ile Lys Arg Cys Ile Glu Ile Val Met Asp Gly Gly Glu 420 425 430 Lys Gly Glu Glu Met Arg Lys Asn Ala Gln Lys Trp Lys Glu Leu Ala 435 440 445 Arg Glu Ala Val Lys Glu Gly Gly Ser Ser Glu Val Asn Leu Lys Ala 450 455 460 Phe Val Gln Glu Val Gly Lys Ser Cys 465 470 <210> 18 <211> 1419 <212> DNA <213> Artificial Sequence <220> <223> recombinant StUGT <400> 18 atggttcaac ctcatgtgct tcttgtaacg ttcccggcac aaggtcacat caacccctca 60 cttcagtttg ccaagagatt aattaagatg ggcatcgagg ttacctttac tacaagcgtc 120 ttcgcccaca gacgtatggc gaagacggct gcttccaacg ctcccaaggg ccttaatctg 180 gccgccttct cagatggttt tgatgatggc ttcaaatcta atgtcgatga ttcaaagcgt 240 tatatgtctg agattagatc taggggcagc cagaccctgc gtgacattat cttaaagagt 300 agcgacgaag gaaggcccgt aacaagtcta gtatataccc tactattgcc ttgggctgct 360 gaagtagcaa gggagctgca tattccttca gctttgttat ggattcagcc agcgacagtc 420 ctagacatat attattatta cttcaatggc tacgaagatg agatgaaatg tagctcaaac 480 gatcctaatt ggtcaataca attacctcgt ttgcccttac ttaagtccca agatttaccc 540 tctttccttg tgtcctctag ctccaaggat gataaataca gctttgctct accaacgttc 600 aaagaacaac ttgacactct tgacggagag gagaatccca aggtcttagt gaatacgttt 660 gacgccctag aattagagcc gttaaaagca atcgagaagt ataatcttat cgggattgga 720 cctctaattc cgtcttcatt cctgggcggg aaagattcac tagaaagtag cttcggaggc 780 gatttatttc agaaaagcga cgatgattac atggaatggc taaacactaa acccaaatct 840 tcaattgtgt acatttcctt cggttcacta cttaacctat ccagaaatca aaaagaggaa 900 atcgcaaaag ggttaattga aatcaagcgt ccttttctgt gggtcattag 
ggatcaggaa 960 aatataaagg aggtggagaa ggaggaagag aagttaagct gcatgatgga gctggaaaaa 1020 caaggtaaaa ttgttccttg gtgtagccaa ctagaggtat taactcatcc ctccctggga 1080 tgcttcgtaa gccattgtgg atggaacagt acacttgaaa gtttatcttc aggcgtgccc 1140 gtggtcgcct tccctcactg gacagaccaa ggtactaacg cgaaattaat agaagatgtt 1200 tggaagacag gtgtacgtat gcgtgttagc gaggatggcg tcgtcgaatc agaggagatt 1260 aaaagatgta tagagatagt gatggatggg ggcgagaaag gcgaggaaat gagaaaaaac 1320 gctcagaaat ggaaagaact agccagggaa gcggtgaagg agggggggag cagtgaggtc 1380 aacctaaagg cttttgtgca ggaagtggga aaatcttgt 1419 <210> 19 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 19 ccggaattca tggttcaacc tcatgtgctt 30 <210> 20 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 20 ccgctcgaga caagattttc ccacttcctg 30 <210> 21 <211> 454 <212> PRT <213> Artificial Sequence <220> <223> recombinant CaUGT3 <400> 21 Met Ala Thr Glu Gln Gln Gln Ala Ser Ile Ser Cys Lys Ile Leu Met 1 5 10 15 Phe Pro Trp Leu Ala Phe Gly His Ile Ser Ser Phe Leu Gln Leu Ala 20 25 30 Lys Lys Leu Ser Asp Arg Gly Phe Tyr Phe Tyr Ile Cys Ser Thr Pro 35 40 45 Ile Asn Leu Asp Ser Ile Lys Asn Lys Ile Asn Gln Asn Tyr Ser Ser 50 55 60 Ser Ile Gln Leu Val Asp Leu His Leu Pro Asn Ser Pro Gln Leu Pro 65 70 75 80 Pro Ser Leu His Thr Thr Asn Gly Leu Pro Pro His Leu Met Ser Thr 85 90 95 Leu Lys Asn Ala Leu Ile Asp Ala Asn Pro Asp Leu Cys Lys Ile Ile 100 105 110 Ala Ser Ile Lys Pro Asp Leu Ile Ile Tyr Asp Leu His Gln Pro Trp 115 120 125 Thr Glu Ala Leu Ala Ser Arg His Asn Ile Pro Ala Val Ser Phe Ser 130 135 140 Thr Met Asn Ala Val Ser Phe Ala Tyr Val Met His Met Phe Met Asn 145 150 155 160 Pro Gly Ile Glu Phe Pro Phe Lys Ala Ile His Leu Ser Asp Phe Glu 165 170 175 Gln Ala Arg Phe Leu Glu Gln Leu Glu Ser Ala Lys Asn Asp Ala Ser 180 185 190 Ala Lys Asp Pro Glu Leu Gln Gly Ser Lys Gly Phe Phe Asn Ser Thr 195 200 205 Phe Ile Val Arg Ser Ser Arg Glu Ile Glu Gly Lys Tyr Val Asp Tyr 210 215 220 Leu Ser Glu Ile Leu Lys Ser Lys Val Ile Pro Val Cys 
Pro Val Ile 225 230 235 240 Ser Leu Asn Asn Asn Asp Gln Gly Gln Gly Asn Lys Asp Glu Asp Glu 245 250 255 Ile Ile Gln Trp Leu Asp Lys Lys Ser His Arg Ser Ser Val Phe Val 260 265 270 Ser Phe Gly Ser Glu Tyr Phe Leu Asn Met Gln Glu Ile Glu Glu Ile 275 280 285 Ala Ile Gly Leu Glu Leu Ser Asn Val Asn Phe Ile Trp Val Leu Arg 290 295 300 Phe Pro Lys Gly Glu Asp Thr Lys Ile Glu Glu Val Leu Pro Glu Gly 305 310 315 320 Phe Leu Asp Arg Val Lys Thr Lys Gly Arg Ile Val His Gly Trp Ala 325 330 335 Pro Gln Ala Arg Ile Leu Gly His Pro Ser Ile Gly Gly Phe Val Ser 340 345 350 His Cys Gly Trp Asn Ser Val Met Glu Ser Ile Gln Ile Gly Val Pro 355 360 365 Ile Ile Ala Met Pro Met Asn Leu Asp Gln Pro Phe Asn Ala Arg Leu 370 375 380 Val Val Glu Ile Gly Val Gly Ile Glu Val Gly Arg Asp Glu Asn Gly 385 390 395 400 Lys Leu Lys Arg Glu Arg Ile Gly Glu Val Ile Lys Glu Val Ala Ile 405 410 415 Gly Lys Lys Gly Glu Lys Leu Arg Lys Thr Ala Lys Asp Leu Gly Gln 420 425 430 Lys Leu Arg Asp Arg Glu Lys Gln Asp Phe Asp Glu Leu Ala Ala Thr 435 440 445 Leu Lys Gln Leu Cys Val 450 <210> 22 <211> 1362 <212> DNA <213> Artificial Sequence <220> <223> recombinant CaUGT3 <400> 22 atggccaccg aacagcaaca ggcatccatc tcttgtaaaa ttttgatgtt cccttggcta 60 gctttcggac atatctccag cttcttgcaa ttagcgaaga agcttagcga cagggggttc 120 tatttttata tctgctccac ccctataaat cttgactcca taaaaaacaa aataaatcag 180 aattattcat cctccattca gctagtcgat ctgcacctac ccaatagccc tcagttgccc 240 ccgagtttac acacaaccaa cggactaccg ccgcatttga tgtcaactct gaagaatgcg 300 cttattgacg ctaatcccga tctgtgtaaa ataatagcaa gcattaagcc ggatctgatc 360 atatatgatc tacatcaacc gtggacagag gcgctagcct cacgtcacaa cattcctgcg 420 gtcagtttct ctaccatgaa tgctgtatcc tttgcctacg ttatgcatat gtttatgaat 480 ccgggcatag aatttccatt caaagctata catctgtctg atttcgagca agcgcgtttc 540 ttggagcagc tggagtcagc aaaaaacgat gcctcagcaa aagaccccga actacaggga 600 agtaaagggt tttttaactc cacctttata gttagatcca gtcgtgaaat cgaggggaaa 660 tacgtcgatt acttgagtga gatcttaaaa agtaaggtta tacctgtttg ccccgtcatt 720 agccttaaca 
ataatgatca gggacaaggg aacaaagacg aggacgaaat cattcaatgg 780 ttggacaaaa aaagtcatag aagctccgtg tttgtttcat tcggtagtga atacttcctt 840 aacatgcaag agattgaaga gatagccatt ggtcttgagc tgtctaatgt caacttcatc 900 tgggttttga ggtttcccaa aggtgaagat acgaaaatag aagaggtgct accggaagga 960 tttctagata gggtgaagac caaagggaga atagtgcatg gctgggcccc acaggctcgt 1020 attctgggtc atccatctat aggtgggttt gtatcccatt gtggctggaa cagtgtgatg 1080 gaaagtattc aaatcggtgt gcccatcatt gcgatgccga tgaatttaga ccaacctttc 1140 aatgcaaggc ttgtcgtgga aattggtgtg ggtattgagg tcggcaggga cgagaacggg 1200 aaacttaaaa gggaacgtat cggtgaggtc attaaggaag ttgcaatcgg aaagaagggg 1260 gagaagctta gaaaaaccgc gaaagacctg ggccagaagc taagggaccg tgagaaacag 1320 gattttgatg agcttgcagc gacgctgaag caattgtgcg ta 1362 <210> 23 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 23 cgggatccat ggccaccgaa cagca 25 <210> 24 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 24 acgcgtcgac tacgcacaat tgcttcagc 29 <210> 25 <211> 456 <212> PRT <213> Artificial Sequence <220> <223> recombinant NsUGT <400> 25 Met Asp Thr Glu His Ser Thr Leu Arg Val Leu Met Phe Pro Trp Leu 1 5 10 15 Ala His Gly His Ile Ser Pro Tyr Leu Thr Val Ala Lys Lys Leu Ala 20 25 30 Cys Arg Gly Phe Tyr Val Tyr Leu Cys Ser Thr Pro Val Asn Leu Asn 35 40 45 Phe Ile Lys Lys Lys Ile Pro Gln Lys Tyr Ser Ile Ser Ile Gln Leu 50 55 60 Val Glu Phe His Leu Pro Asp Leu Pro Glu Leu Pro Leu Ser Tyr His 65 70 75 80 Thr Thr Asn Gly Ile Pro Pro His Leu Val Ser Thr Leu Lys Lys Ala 85 90 95 Val Lys Met Ser Lys Pro Asn Phe Tyr Lys Ile Ile Glu Asn Leu Lys 100 105 110 Pro Asn Met Leu Ile Tyr Asp Ile Leu Gln Pro Trp Ala Lys Glu Val 115 120 125 Ala Asn Ser Tyr Asn Ile Pro Ala Val Met Leu Leu Thr Phe Cys Ala 130 135 140 Ala Met Leu Ser Tyr Arg Leu His Pro Val Lys Lys Pro Gly Thr Glu 145 150 155 160 Phe Pro Phe Pro Ala Leu Tyr Leu Arg Lys Ile Glu Arg Gln Gln Arg 165 170 175 Glu Glu Met Leu Glu Lys Ala Ala Lys Glu Lys Asp Pro Asp Asp Lys 180 185 190 Asp Pro Phe Ala Glu Glu 
Glu Thr Met Asn Lys Ile Ile Leu Met Ser 195 200 205 Thr Ser Arg Ala Thr Glu Ala Lys Tyr Ile Asp Tyr Phe Thr Glu Leu 210 215 220 Ile Gln Trp Lys Ile Ile Pro Val Gly Pro Pro Val Gln Glu Thr Thr 225 230 235 240 Asn Glu Tyr Asp Gly Asp Val Asp Asp Leu Ile Asp Trp Leu Gly Asn 245 250 255 Lys Tyr Glu Asn Ser Thr Val Phe Val Ser Phe Gly Ser Gln Tyr Phe 260 265 270 Leu Ser Lys Glu Asp Leu Glu Glu Ile Ala Leu Gly Leu Glu Leu Ser 275 280 285 Asn Val Asn Phe Ile Trp Val Val Arg Phe Pro Lys Gly Glu Glu Val 290 295 300 Arg Val Glu Glu Ala Leu Pro Glu Gly Phe Leu Glu Arg Ile Gly Asp 305 310 315 320 Arg Gly Arg Val Val Asp Gly Trp Ala Pro Gln Leu Arg Ile Leu Ser 325 330 335 His Pro Ser Thr Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Val 340 345 350 Met Glu Ser Ile Asp Phe Arg Val Pro Ile Ile Ala Leu Pro Met His 355 360 365 Leu Asp Gln Pro Ile Asn Ala Arg Leu Ile Val Glu Leu Gly Val Ala 370 375 380 Val Glu Ile Val Arg Asp Asp Glu Gly Lys Val His Arg Gly Glu Val 385 390 395 400 Ala Glu Ile Val Lys Ser Ile Ile Cys Glu Lys Thr Gly Glu Asn Leu 405 410 415 Arg Asn Lys Val Arg Glu Ile Ser Glu Asn Leu Lys Lys Glu Arg Glu 420 425 430 Glu Glu Met Asp Ala Ala Ile Gly Glu Leu Val Gln Leu Cys Lys Thr 435 440 445 Ser Lys Ser Asn Asn Met Met Leu 450 455 <210> 26 <211> 1368 <212> DNA <213> Artificial Sequence <220> <223> recombinant NsUGT <400> 26 atggacacag aacacagcac gctacgtgtg ctaatgttcc cgtggctagc gcacggacac 60 atatctccct atttaacagt tgcaaagaag ctagcatgta gaggatttta cgtctatctt 120 tgttccacgc cggtcaatct aaacttcatc aagaagaaga taccccagaa gtattcaatt 180 agtatacagt tggtcgaatt ccacctacct gacctacctg agcttccatt aagttaccat 240 acgacaaacg gcattcctcc ccacctagtg tcaacgttaa aaaaagccgt aaagatgagc 300 aaaccgaatt tttataaaat aattgagaat ttgaaaccaa atatgttgat atatgacatt 360 cttcagccat gggcgaagga agtcgccaac tcctacaata tccccgctgt tatgttattg 420 acgttttgcg ctgccatgct ttcttaccgt ttgcacccgg tcaaaaagcc gggcacagag 480 ttcccattcc cggcgctata tttgaggaaa attgaacgtc agcaaaggga ggagatgctt 540 gaaaaagccg ccaaggagaa 
ggacccggat gataaagacc cgtttgcgga ggaagaaact 600 atgaacaaga taatcctgat gtctacaagc agggcgacgg aggccaagta catcgactat 660 ttcactgaac taattcagtg gaagataatt cctgtgggcc cgcccgtcca ggaaacgact 720 aacgagtacg acggagatgt agatgattta attgattggt tgggaaacaa atatgaaaac 780 tccactgtat ttgtcagctt tgggagccaa tatttcttga gcaaagaaga cttagaggag 840 attgctctag gattggaact atccaatgtg aacttcatat gggtcgtgag attccccaag 900 ggggaggaag ttagagtaga agaagcgttg ccagaaggtt tccttgagag gattggggac 960 agagggaggg ttgttgacgg ttgggcacct caattgcgta ttctgagtca cccttcaacc 1020 ggaggatttg tcagtcactg tgggtggaac tctgtgatgg agtccatcga ttttcgtgtc 1080 cctatcatcg ccctgccaat gcacctggat cagccaatca atgcccgtct aattgtcgaa 1140 ttaggagttg ccgtcgaaat agttcgtgac gacgagggca aagttcacag gggggaggta 1200 gcagagatcg ttaaaagcat tatatgtgag aagacggggg aaaatttgcg taataaagtc 1260 cgtgaaatct ctgaaaatct taagaaagaa agggaagagg agatggacgc agcgatagga 1320 gaactggtgc aattatgcaa gactagcaaa agtaataaca tgatgttg 1368 <210> 27 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 27 cgggatccat ggacacagaa cacagca 27 <210> 28 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 28 acgcgtcgac caacatcatg ttattacttt tgc 33 <210> 29 <211> 453 <212> PRT <213> Artificial Sequence <220> <223> recombinant SpUGT <400> 29 Met Gly Thr Gln Val Thr Glu His Gly Thr Ser Asn Leu Arg Val Val 1 5 10 15 Met Phe Pro Trp Leu Ala Tyr Gly His Ile Ser Pro Phe Leu Tyr Val 20 25 30 Ala Lys Lys Leu Ala Asp Arg Gly Phe Leu Ile Tyr Leu Cys Ser Thr 35 40 45 Pro Ile Asn Leu Lys Ser Thr Ile Lys Lys Ile Pro Glu Lys Tyr Ala 50 55 60 Asp Ser Ile His Leu Ile Glu Leu His Ile Pro Glu Leu Pro Glu Leu 65 70 75 80 Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn His 85 90 95 Thr Leu Gln Lys Ala Leu Lys Met Ser Lys Pro Asn Leu Ser Lys Ile 100 105 110 Leu Lys Asn Leu Lys Pro Asp Leu Met Ile Tyr Asp Val Leu Gln Gln 115 120 125 Trp Ala Glu Arg Val Ala Asn Glu Gln Ser Ile Pro Ala Val Arg Leu 130 135 140 Leu Thr Phe Gly Ala Ala Val Phe Ser Tyr Phe 
Cys Asn Leu Val Lys 145 150 155 160 Lys Pro Gly Val Glu Phe Pro Phe Pro Asp Ile Tyr Leu Arg Lys Ile 165 170 175 Glu Gln Val Lys Leu Gly Glu Met Leu Glu Lys Ser Ala Lys Asp Gln 180 185 190 Asp Pro Asp Asp Glu Glu Arg Leu Val Asp Glu Tyr Lys Gln Ile Ala 195 200 205 Leu Ile Cys Thr Ser Arg Thr Ile Glu Ala Lys Tyr Ile Asp Phe Leu 210 215 220 Leu Glu Leu Ser Asn Leu Lys Val Val Pro Val Gly Pro Pro Val Gln 225 230 235 240 Asp Leu Ile Thr Asn Asp Ala Asp Asp Met Glu Leu Ile Asp Trp Leu 245 250 255 Gly Ser Lys Asp Glu Asn Ser Thr Val Phe Val Ser Phe Gly Ser Glu 260 265 270 Tyr Phe Leu Ser Lys Glu Asp Met Glu Glu Val Ala Leu Gly Leu Glu 275 280 285 Leu Ser Asn Val Asn Phe Val Trp Val Ala Arg Phe Pro Lys Gly Glu 290 295 300 Glu Gln Asn Leu Glu Asp Ala Leu Pro Lys Gly Phe Leu Glu Arg Ile 305 310 315 320 Gly Glu Arg Gly Arg Val Leu Asp Lys Phe Ala Pro Gln Leu Arg Ile 325 330 335 Leu Asn His Thr Ser Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn 340 345 350 Ser Val Met Glu Ser Ile His Phe Gly Val Pro Ile Val Ala Met Pro 355 360 365 Met His Leu Asp Gln Pro Met Asn Ala Arg Leu Ile Val Glu Leu Gly 370 375 380 Val Ala Val Glu Ile Val Arg Asp Asp Asp Gly Lys Ile His Arg Glu 385 390 395 400 Glu Ile Ala Lys Thr Leu Lys Asp Val Ile Thr Glu Arg Ile Gly Glu 405 410 415 Asn Leu Arg Ala Lys Met Arg Asp Ile Ser Met Asn Leu Asn Ser Ile 420 425 430 Ser Gly Glu Glu Met Asp Ala Ala Ala His Glu Leu Ile Gln Phe Cys 435 440 445 Lys Ile Asn Thr Asn 450 <210> 30 <211> 1359 <212> DNA <213> Artificial Sequence <220> <223> recombinant SpUGT <400> 30 atgggaactc aagttacaga acatggaaca tctaatctaa gggtagtcat gttcccttgg 60 ctggcatacg gtcacatctc accattcctt tatgtcgcaa aaaaactggc agacaggggt 120 ttcctgattt acttatgtag tactcccatt aatcttaagt ccacaataaa gaaaatacca 180 gaaaaatacg ccgactccat ccatttaatc gagcttcata ttccagaatt gccagagttg 240 cctcctcatt atcataccac gaacggcctt ccacctcatc taaatcacac actacagaag 300 gccttgaaga tgtcaaagcc caatctatcc aaaatactaa aaaaccttaa gccagattta 360 atgatctacg acgtactaca acagtgggct gagagggtag 
cgaacgaaca gtccattccg 420 gctgtaaggt tattaacctt cggtgccgca gttttttcat atttttgcaa tttggttaag 480 aaacccggag tcgagtttcc attccccgac atttatttga gaaaaattga gcaggtcaaa 540 ctaggtgaaa tgttggagaa atctgccaaa gaccaagatc ctgacgatga agaaaggtta 600 gtagatgagt acaaacaaat tgctttaatt tgtaccagta gaactattga agccaaatac 660 atcgacttcc tattggagct gagtaaccta aaggtcgttc cagtcggtcc tcccgtccag 720 gacttgatca ccaatgatgc cgatgatatg gaattaatcg actggcttgg ctcaaaggat 780 gaaaattcca ccgttttcgt aagtttcggg agtgagtatt ttttaagcaa agaagatatg 840 gaagaggtcg cgctaggcct tgaactgagc aacgttaact ttgtatgggt cgctaggttc 900 cccaagggag aagaacaaaa tctagaagat gccttaccga aggggttcct ggagagaata 960 ggggaacgtg ggagagtcct ggataagttt gcaccgcagc tgaggatatt aaatcacact 1020 tccactggtg gatttatatc acattgtggt tggaacagcg tgatggaatc tatacacttc 1080 ggagttccga ttgtggcgat gccaatgcat ttggaccagc caatgaacgc gaggcttatc 1140 gttgagcttg gagtagcggt ggagattgtt agggacgatg atggcaaaat tcacagggaa 1200 gaaattgcga agactcttaa ggacgtgata acagagcgta ttggagagaa tctaagggct 1260 aagatgcgtg atatttcaat gaacctaaac agtattagcg gagaggagat ggacgcagct 1320 gctcatgaac ttatacagtt ctgtaagatc aacaccaat 1359 <210> 31 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 31 cgggatccat gggaactcaa gttacagaa 29 <210> 32 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 32 acgcgtcgac attggtgttg atcttacaga ac 32 <210> 33 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 33 cggaattcat ggcgaacaaa gaagagg 27 <210> 34 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 34 cccaagcttt taggtctccg cttgatgc 28 <210> 35 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 35 gctctagaag gaggattaca aaatgactgc tgtcgttctc c 41 <210> 36 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 36 cggaattcct agagcttgcg gaagag 26 <210> 37 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 37 ggaagatctg ctgtgcaggt cgtaaa 26 <210> 38 <211> 33 <212> DNA <213> 
Artificial Sequence <220> <223> primer <400> 38 ataagaatgc ggccgcgaaa cgcaaaaagg cca 33 <210> 39 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 39 aagagcguct agaaggagga ttacaaaa 28 <210> 40 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 40 agctgatugt actgagagtg caccat 26 <210> 41 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 41 gctctagaag aggattacaa aatggttcaa cctcatgtcc 40 <210> 42 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 42 tccccccggg ttaacaacct tttcctactt cttg 34 <210> 43 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 43 tccccccggg aggaggacag ctaaatggcc accgaacagc 40 <210> 44 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 44 aaggaaaaaa gcggccgctt atacgcacaa ttgcttcagc 40 <110> AJOU UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION <120> Process for preparing crocin-4 using enzyme reaction <130> KPA190916-KR <160> 44 <170> KopatentIn 2.0 <210> 1 <211> 474 <212> PRT <213> Artificial Sequence <220> <223> recombinant GjUGT1 <400> 1 Met Val Gln Gln Arg His Val Leu Leu Ile Thr Tyr Pro Ala Gln Gly 1 5 10 15 His Ile Asn Pro Ala Leu Gln Phe Ala Gln Arg Leu Leu Arg Met Gly 20 25 30 Ile Gln Val Thr Leu Ala Thr Ser Val Tyr Ala Leu Ser Arg Met Lys 35 40 45 Lys Ser Ser Gly Ser Thr Pro Lys Gly Leu Thr Phe Ala Thr Phe Ser 50 55 60 Asp Gly Tyr Asp Asp Gly Phe Arg Pro Lys Gly Val Asp His Thr Glu 65 70 75 80 Tyr Met Ser Ser Leu Ala Lys Gln Gly Ser Asn Thr Leu Arg Asn Val 85 90 95 Ile Asn Thr Ser Ala Asp Gln Gly Cys Pro Val Thr Cys Leu Val Tyr 100 105 110 Thr Leu Leu Leu Pro Trp Ala Ala Thr Val Ala Arg Glu Cys His Ile 115 120 125 Pro Ser Ala Leu Leu Trp Ile Gln Pro Val Ala Val Met Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Arg Gly Tyr Glu Asp Asp Val Lys Asn Asn Ser Asn 145 150 155 160 Asp Pro Thr Trp Ser Ile Gln Phe Pro Gly Leu Pro Ser Met Lys Ala 165 170 175 Lys Asp Leu Pro Ser Phe Ile Leu Pro Ser Ser Asp Asn Ile Tyr Ser 180 185 190 Phe
Ala Leu Pro Thr Phe Lys Lys Gln Leu Glu Thr Leu Asp Glu Glu 195 200 205 Glu Arg Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu Pro Gln 210 215 220 Ala Leu Lys Ala Ile Glu Ser Tyr Asn Leu Ile Ala Ile Gly Pro Leu 225 230 235 240 Thr Pro Ser Ala Phe Leu Asp Gly Lys Asp Pro Ser Glu Thr Ser Phe 245 250 255 Ser Gly Asp Leu Phe Gln Lys Ser Lys Asp Tyr Lys Glu Trp Leu Asn 260 265 270 Ser Arg Pro Ala Gly Ser Val Val Tyr Val Ser Phe Gly Ser Leu Leu 275 280 285 Thr Leu Pro Lys Gln Gln Met Glu Glu Ile Ala Arg Gly Leu Leu Lys 290 295 300 Ser Gly Arg Pro Phe Leu Trp Val Ile Arg Ala Lys Glu Asn Gly Glu 305 310 315 320 Glu Glu Lys Glu Glu Asp Arg Leu Ile Cys Met Glu Glu Leu Glu Glu 325 330 335 Gln Gly Met Ile Val Pro Trp Cys Ser Gln Ile Glu Val Leu Thr His 340 345 350 Pro Ser Leu Gly Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Leu 355 360 365 Glu Thr Leu Val Cys Gly Val Pro Val Val Ala Phe Pro His Trp Thr 370 375 380 Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val Trp Glu Thr Gly 385 390 395 400 Val Arg Val Val Pro Asn Glu Asp Gly Thr Val Glu Ser Asp Glu Ile 405 410 415 Lys Arg Cys Ile Glu Thr Val Met Asp Asp Gly Glu Lys Gly Val Glu 420 425 430 Leu Lys Arg Asn Ala Lys Lys Trp Lys Glu Leu Ala Arg Glu Ala Met 435 440 445 Gln Glu Asp Gly Ser Ser Asp Lys Asn Leu Lys Ala Phe Val Glu Asp 450 455 460 Ala Gly Lys Gly Tyr Gln Ala Glu Ser Asn 465 470 <210> 2 <211> 1425 <212> DNA <213> Artificial Sequence <220> <223> recombinant GjUGT1 <400> 2 atggttcagc agcgtcacgt tttgttgatt acctatccag cacagggtca cattaaccca 60 gcgttgcagt tcgcacagag attgttgcgt atgggtattc aagttaccct ggcgaccagc 120 gtgtacgctc tgagccgtat gaaaaagagc agcggtagca ccccgaaagg tctgaccttt 180 gcaaccttca gcgatggtta cgatgacggt tttcgtccga aaggtgttga ccataccgaa 240 tatatgagca gcctggcgaa gcaaggtagc aataccctgc gtaatgtgat caacaccagc 300 gctgatcagg gttgcccggt tacctgtttg gtgtatacct tgttgctgcc atgggctgct 360 accgttgcac gtgaatgcca cattccgagc gcgctgctgt ggatccaacc ggttgctgtg 420 atggatatct actattacta tttccgtggt tatgaagatg acgttaagaa taacagcaat 
480 gacccgacct ggagcatcca gtttccgggt ctgccgagca tgaaagctaa ggatctgccg 540 agcttcattc tgccgagcag cgacaacatc tacagctttg cactgccgac cttcaaaaag 600 caactggaaa ccctggatga agaagaacgt ccgaaagttc tggtgaatac ctttgacgcg 660 ctggaaccgc aggcactgaa ggcgattgaa agctataacc tgattgcaat tggtccactg 720 accccgagcg cgttcctgga tggtaaagac ccgagcgaaa ccagctttag cggtgacctg 780 tttcagaaaa gcaaggacta caaggaatgg ctgaatagcc gtccggctgg tagcgttgtg 840 tatgttagct ttggtagcct gctgaccctg ccgaaacaac agatggaaga aattgcacgt 900 ggtctgctga agagcggtcg tccgttcctg tgggttattc gtgcgaaaga aaacggtgaa 960 gaagaaaagg aagaagatcg tctgatttgc atggaagaac tggaagaaca gggtatgatt 1020 gttccgtggt gtagccagat cgaagtgctg acccatccga gcctgggttg ctttgttacc 1080 cactgtggtt ggaatagcac cctggaaacc ctggtttgcg gtgtgccggt tgtggctttc 1140 ccgcattgga ccgatcaagg taccaatgca aaactgatcg aagacgtttg ggaaaccggt 1200 gtgcgtgttg tgccgaacga agatggtacc gttgaaagcg acgaaattaa acgttgtatc 1260 gaaaccgtta tggatgacgg tgaaaaaggt gtggaactga agcgtaacgc gaaaaagtgg 1320 aaggaactgg ctcgtgaagc aatgcaggaa gatggtagca gcgacaaaaa cctgaaagcg 1380 ttcgtggagg atgcgggcaa gggctatcaa gcggagagca actaa 1425 <210> 3 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 3 cggaattcat ggttcagcag cgtcac 26 <210> 4 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 4 ccgctcgagg ttgctctccg cttgatag 28 <210> 5 <211> 502 <212> PRT <213> Artificial Sequence <220> <223> recombinant GT1-316 <400> 5 Met Val Ser Glu Ser Leu Gly His Leu Phe Leu Val Ser Phe Pro Gly 1 5 10 15 Gln Gly His Val Asn Pro Leu Leu Arg Leu Gly Lys Ile Leu Ala Ser 20 25 30 Lys Gly Phe Leu Val Thr Phe Ser Thr Thr Glu Thr Thr Gly Glu Gln 35 40 45 Met Arg Lys Ala Ser Asp Ile Ile Asp Lys Leu Thr Pro Phe Gly Asp 50 55 60 Gly Phe Ile Arg Phe Glu Phe Ile Ala Asp Gly Trp Glu Glu Asp Glu 65 70 75 80 Pro Arg Arg Gln Asp Leu Asp Gln Tyr Leu Leu Gln Leu Glu Leu Val 85 90 95 Gly Lys Gln Val Ile Pro Gln Met Ile Lys Lys Asn Ala Glu Gln Gly 100 105 110 Arg Pro Val Ser Cys Leu Ile Asn Asn Pro Phe 
Ile Pro Trp Val Thr 115 120 125 Asp Val Ala Thr Thr Leu Gly Leu Pro Ser Ala Met Leu Trp Val Gln 130 135 140 Ser Cys Ala Cys Phe Ala Ser Tyr Tyr His Tyr Tyr His Gly Thr Val 145 150 155 160 Pro Phe Pro Asp Glu Glu His Pro Glu Ile Asp Val Gln Leu Pro Trp 165 170 175 Met Pro Leu Leu Lys Tyr Asp Glu Val Pro Ser Tyr Leu Tyr Pro Thr 180 185 190 Thr Pro Tyr Pro Phe Leu Arg Arg Ala Ile Leu Gly Gln Tyr Lys Asn 195 200 205 Leu Asp Lys Pro Phe Cys Ile Leu Met Glu Thr Phe Glu Glu Leu Glu 210 215 220 Pro Glu Leu Ile Lys His Met Ser Glu Ile Phe Pro Ile Lys Ala Val 225 230 235 240 Gly Pro Leu Phe Arg Asn Thr Lys Ala Pro Lys Thr Thr Val His Gly 245 250 255 Asp Phe Leu Lys Ala Asp Asp Cys Ile Glu Trp Leu Asp Thr Lys Pro 260 265 270 Pro Ser Ser Val Val Tyr Val Ser Phe Gly Ser Val Val Gln Leu Lys 275 280 285 Gln Asp Gln Trp Asn Glu Ile Ala Tyr Gly Leu Leu Asn Ser Gly Val 290 295 300 Ser Phe Leu Leu Val Met Lys Pro Ala His Lys Asp Ala Gly His Asp 305 310 315 320 Leu Leu Val Leu Pro Asp Gly Phe Leu Glu Lys Ala Gly Asp Arg Gly 325 330 335 Lys Val Val Gln Trp Ser Pro Gln Glu Lys Val Leu Gly His Pro Ser 340 345 350 Val Ala Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Met Glu Ala 355 360 365 Leu Thr Ser Gly Met Pro Val Val Ala Phe Pro Gln Trp Gly Asp Gln 370 375 380 Val Thr Asn Ala Lys Tyr Leu Val Asp Ile Leu Lys Val Gly Val Arg 385 390 395 400 Met Cys Arg Gly Glu Ala Glu Asn Lys Leu Ile Thr Arg Asp Glu Ile 405 410 415 Glu Lys Cys Leu Leu Glu Ala Thr Val Gly Pro Lys Ala Val Glu Met 420 425 430 Lys Gln Asn Ala Met Lys Trp Lys Glu Ala Ala Glu Ala Ala Val Ala 435 440 445 Glu Gly Gly Ser Ser Asp Gln Asn Ile Arg Tyr Phe Thr Asp Asp Ile 450 455 460 Val Lys Ala Asn Glu Ser Glu Ile Ala Arg Lys Cys Ile Gly Ser Asn 465 470 475 480 Glu Phe Pro Val Ser Val Val Val Lys Ser Asn Glu Lys Val Asp Glu 485 490 495 Leu Val Gly Ser Ser Ala 500 <210> 6 <211> 1506 <212> DNA <213> Artificial Sequence <220> <223> recombinant GT1-316 <400> 6 atggtctccg agagtctagg tcacctattc ctagtatctt ttccggggca ggggcatgta 60 
aaccctttat taagattagg aaaaatcctt gctagtaaag gtttcctagt taccttttcc 120 accaccgaga caacagggga gcagatgcgt aaggcaagtg acatcataga taaacttaca 180 ccattcggtg acggcttcat tcgttttgaa ttcattgccg acggttggga ggaggatgaa 240 ccccgtaggc aagaccttga tcaatatcta ctacaacttg aattagtggg caagcaggta 300 ataccgcaaa tgattaagaa gaatgccgaa cagggtagac ccgtgtcctg cctgatcaac 360 aaccccttca tcccatgggt cacagatgtc gctacgactc taggccttcc ttcagctatg 420 ctgtgggtac aaagttgtgc ttgttttgcg agctactatc actactacca tggcacagtc 480 ccttttcccg atgaggagca ccccgaaatt gatgtacagc taccgtggat gcctttgctt 540 aagtatgatg aggtccctag ttatctttac cctacgacgc cgtacccctt tttaaggcgt 600 gcaatattgg gtcagtataa aaacctggac aagccgtttt gcatcttaat ggagactttt 660 gaagagcttg aaccggaact tatcaagcat atgagtgaaa tttttccaat taaggcggtt 720 ggaccgctat tccgtaacac aaaagcgccc aaaacaactg tgcacggcga cttcttaaaa 780 gcagatgatt gtatagaatg gttggatacc aaacctccat cttctgtcgt ttacgtatca 840 ttcggctctg ttgtgcagct aaaacaggac caatggaacg agatagctta cggtttactt 900 aactctgggg tttccttttt attggtgatg aagcccgctc ataaagacgc tgggcatgac 960 cttctagtac ttcctgacgg cttcttagag aaagctggcg acagagggaa ggtggtccag 1020 tggtcacctc aagaaaaggt actaggtcac ccgagcgtag cctgctttgt aactcactgt 1080 ggatggaaca gcacgatgga agctctgacc tctggaatgc ccgtggtcgc ttttccccag 1140 tggggtgatc aggttacaaa tgcgaaatac ttggtagaca tactaaaggt aggagtcagg 1200 atgtgtaggg gagaggccga aaataagctt attacaagag atgaaattga aaaatgccta 1260 ttagaagcta cggtaggacc taaagctgtt gagatgaagc aaaacgccat gaaatggaag 1320 gaagctgctg aagccgcggt cgccgaaggt ggctcaagtg accaaaacat acgttacttt 1380 actgatgaca tcgtgaaagc aaatgagtcc gagattgcta gaaagtgtat cggctctaac 1440 gaattcccgg tgtcagtcgt tgttaaatct aatgagaaag tggatgagtt agtcggatca 1500 tccgca 1506 <210> 7 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 7 cgggatccat ggtctccgag agtctag 27 <210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 8 acgcgtcgac tgcggatgat ccgactaac 29 <210> 9 <211> 471 <212> PRT <213> Artificial Sequence <220> <223> 
recombinant NtUGT <400> 9 Met Val Gln Pro His Val Leu Leu Val Thr Phe Pro Ala Gln Gly His 1 5 10 15 Ile Asn Pro Cys Leu Gln Phe Ala Met Arg Leu Ile Arg Met Gly Ile 20 25 30 Glu Val Thr Phe Ala Thr Ser Val Phe Ala His Arg Arg Met Ala Lys 35 40 45 Thr Ala Thr Ser Thr Leu Pro Lys Gly Leu Asn Phe Ala Ala Phe Ser 50 55 60 Asp Gly Tyr Asp Asp Gly Phe Lys Ala Asp Glu His Asp Ser Gln His 65 70 75 80 Tyr Met Ser Glu Ile Lys Ser Arg Gly Ser Glu Thr Leu Lys Asp Ile 85 90 95 Ile Leu Lys Ser Ser Asp Glu Gly Arg Pro Val Thr Ser Leu Val Tyr 100 105 110 Ser Leu Leu Leu Pro Trp Ala Ala Asn Val Ala Arg Glu Phe His Ile 115 120 125 Pro Cys Ala Leu Leu Trp Ile Gln Pro Ala Thr Val Leu Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Asn Gly Ser Glu Asp Ala Ile Lys Gly Ser Thr Asn 145 150 155 160 Asp Pro Asn Trp Cys Ile Gln Leu Pro Asn Leu Pro Leu Leu Lys Ser 165 170 175 Gln Asp Leu Pro Ser Phe Leu Leu Ser Ser Asn Asn Asp Glu Lys Tyr 180 185 190 Ser Phe Ala Leu Pro Thr Phe Lys Glu Gln Leu Asp Thr Leu Asp Val 195 200 205 Glu Glu Asn Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu Gln 210 215 220 Glu Glu Leu Lys Ala Ile Glu Arg Tyr Asn Leu Ile Gly Ile Gly Pro 225 230 235 240 Leu Ile Pro Ser Ser Phe Leu Asp Gly Lys Asp Pro Leu Asp Ser Ser 245 250 255 Phe Gly Gly Asp Leu Phe Gln Lys Ser Asn Asp Tyr Ile Glu Trp Leu 260 265 270 Asn Ser Lys Asp Asn Ser Ser Val Ile Tyr Ile Ser Phe Gly Ser Leu 275 280 285 Leu Asn Leu Ser Lys Asn Gln Lys Glu Glu Ile Ala Lys Gly Leu Ile 290 295 300 Glu Ile Lys Arg Pro Phe Leu Trp Val Ile Arg Asp Gln Glu Asn Gly 305 310 315 320 Lys Gly Asp Glu Lys Glu Glu Glu Lys Leu Ser Cys Met Met Glu Leu 325 330 335 Glu Lys Gln Gly Lys Ile Val Pro Trp Cys Ser Gln Leu Glu Val Leu 340 345 350 Thr His Pro Ser Leu Gly Cys Phe Val Ser His Cys Gly Trp Asn Ser 355 360 365 Thr Leu Glu Ser Leu Ser Thr Gly Val Pro Val Val Ala Phe Pro His 370 375 380 Trp Thr Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val Trp Lys 385 390 395 400 Ile Gly Val Arg Leu Lys Lys Asn Glu Asp Gly Val Val Glu Ser Glu 405 410 
415 Glu Ile Lys Arg Cys Ile Asp Met Val Met Asn Gly Gly Glu Lys Gly 420 425 430 Glu Glu Met Arg Arg Asn Ala Gln Lys Trp Lys Glu Leu Ala Arg Glu 435 440 445 Ala Val Lys Glu Gly Gly Ser Ser Tyr Met Asn Leu Lys Ala Phe Val 450 455 460 Gln Glu Val Gly Lys Gly Cys 465 470 <210> 10 <211> 1413 <212> DNA <213> Artificial Sequence <220> <223> recombinant NtUGT <400> 10 atggttcaac ctcatgtcct acttgttaca tttccagcac aaggacatat taacccctgc 60 ttgcagtttg ccatgaggct gatccgtatg ggaatcgaag taacgtttgc tacctcagtg 120 tttgctcacc gtagaatggc taagaccgca acaagcaccc tgcccaaggg cctaaacttt 180 gccgcgttca gcgacggata cgacgatggg tttaaggcag acgagcatga ttcccagcac 240 tacatgtctg aaatcaagtc tagggggagc gagacgctga aggatataat tctgaagagt 300 tctgatgaag ggagacccgt cacgtccctt gtgtatagcc tgcttttacc ctgggctgca 360 aatgttgcga gggaatttca cataccctgt gcgttgctat ggatacaacc ggcaactgtt 420 ttggatattt attactacta ctttaatggg agtgaggacg ccataaaagg atctaccaac 480 gatccgaatt ggtgcatcca gttgccaaat ctacctctat tgaaatctca agacttacct 540 tccttcctac tgtcctcaaa taacgatgag aaatacagtt ttgctttgcc aacgttcaaa 600 gaacagctgg acacattgga tgtggaagaa aaccccaagg ttttggtcaa tacattcgat 660 gcgctagagc aggaagagct gaaggcgatt gagagatata atttaatcgg gatcggacct 720 ttgataccga gcagtttctt agatggaaag gacccgctgg attcaagttt tggtggcgac 780 ttattcaga agtccaacga ttatattgaa tggctaaact ccaaagataa cagtagcgta 840 atatatatct cattcggttc actgttgaac ctatccaaaa accagaagga ggaaatagcg 900 aaagggctaa tcgagataaa aaggccattc ttgtgggtca tcagagacca ggaaaatggc 960 aaaggcgatg aaaaagaaga agaaaagttg agttgtatga tggaactaga gaagcaaggt 1020 aaaatcgtac catggtgctc acaactagaa gtcttgactc acccctccct tggttgcttc 1080 gtctctcact gcggttggaa ttcaacactt gagagtttgt ctacaggcgt gccagtggtt 1140 gcgttccctc attggaccga tcagggaact aacgcgaaac ttatcgaaga cgtatggaag 1200 atcggtgtgc gtttgaagaa aaatgaagat ggagtagttg agagcgagga gataaaaagg 1260 tgtatagaca tggttatgaa cggaggtgaa aagggagaag agatgcgtcg taacgcacag 1320 aagtggaaag aactagctcg tgaagccgtt aaagagggag gaagttctta tatgaactta 1380 aaagccttcg tgcaagaagt 
aggaaaaggt tgt 1413 <210> 11 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 11 cgggatccat ggttcaacct catgtcct 28 <210> 12 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 12 cccaagctta caaccttttc ctacttcttg c 31 <210> 13 <211> 555 <212> PRT <213> Artificial Sequence <220> <223> recombinant FaGT2 <400> 13 Met Gly Ser Glu Ser Leu Val His Val Phe Leu Val Ser Phe Ile Gly 1 5 10 15 Gln Gly His Val Asn Pro Leu Leu Arg Leu Gly Lys Arg Leu Ala Ala 20 25 30 Lys Gly Leu Leu Val Thr Phe Cys Thr Ala Glu Cys Val Gly Lys Glu 35 40 45 Met Arg Lys Ser Asn Gly Ile Thr Asp Glu Pro Lys Pro Val Gly Asp 50 55 60 Gly Phe Ile Arg Phe Glu Phe Phe Lys Asp Arg Trp Ala Glu Asp Glu 65 70 75 80 Pro Met Arg Gln Asp Leu Asp Leu Tyr Leu Pro Gln Leu Glu Leu Val 85 90 95 Gly Lys Glu Val Ile Pro Glu Met Ile Lys Lys Asn Ala Glu Gln Gly 100 105 110 Arg Pro Val Ser Cys Leu Ile Asn Asn Pro Phe Ile Pro Trp Val Cys 115 120 125 Asp Val Ala Glu Ser Leu Gly Leu Pro Ser Ala Met Leu Trp Val Gln 130 135 140 Ser Ala Ala Cys Leu Ala Ala Tyr Tyr His Tyr Tyr His Gly Leu Val 145 150 155 160 Pro Phe Pro Ser Glu Ser Asp Met Phe Cys Asp Val Gln Ile Pro Ser 165 170 175 Met Pro Leu Leu Lys Tyr Asp Glu Val Pro Ser Phe Leu Tyr Pro Thr 180 185 190 Ser Pro Tyr Pro Phe Leu Arg Arg Ala Ile Leu Gly Gln Tyr Gly Asn 195 200 205 Leu Glu Lys Pro Phe Cys Ile Leu Met Asp Thr Phe Gln Glu Leu Glu 210 215 220 Ser Glu Ile Ile Glu Tyr Met Ala Arg Leu Cys Pro Ile Lys Ala Val 225 230 235 240 Gly Pro Leu Phe Lys Asn Pro Lys Ala Gln Asn Ala Val Arg Gly Asp 245 250 255 Phe Met Glu Ala Asp Asp Ser Ile Ile Gly Trp Leu Asp Thr Lys Pro 260 265 270 Lys Ser Ser Val Val Tyr Ile Ser Phe Gly Ser Val Val Tyr Leu Lys 275 280 285 Gln Glu Gln Val Asp Glu Ile Ala His Gly Leu Leu Ser Ser Gly Val 290 295 300 Ser Phe Ile Trp Val Met Lys Pro Pro His Pro Asp Ser Gly Phe Glu 305 310 315 320 Leu Leu Val Leu Pro Glu Gly Phe Leu Glu Lys Ala Gly Asp Arg Gly 325 330 335 Lys Val Val Gln Trp Ser Pro Gln Glu Lys Ile Leu Glu His Pro
Ser 340 345 350 Thr Ala Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Met Glu Ser 355 360 365 Leu Thr Ser Gly Met Pro Val Val Ala Phe Pro Gln Trp Gly Asp Gln 370 375 380 Val Thr Asp Ala Lys Tyr Leu Val Asp Glu Phe Lys Val Gly Val Arg 385 390 395 400 Met Cys Arg Gly Glu Ala Glu Asp Arg Val Ile Pro Arg Asp Glu Val 405 410 415 Glu Lys Cys Leu Leu Glu Ala Thr Ser Gly Ser Lys Ala Ala Glu Met 420 425 430 Lys Gln Asn Ala Leu Lys Trp Lys Ala Ala Ala Glu Ala Ala Phe Ser 435 440 445 Glu Gly Gly Ser Ser Asp Arg Asn Leu Gln Ala Phe Val Asp Glu Val 450 455 460 Arg Arg Ile Ser Ala Ser Leu Asn Ser Lys Ser Ser Ala Val Gly Tyr 465 470 475 480 Val Lys Ser Lys Ile Asn Gly Val Val Glu Tyr Val Asp Ser Lys Leu 485 490 495 Asn Gly Lys Ala Ala Pro Val Glu Glu Ala Asn Thr Arg Thr Asn Gly 500 505 510 Ile Ala Lys Val Glu Gln Pro Lys Ala Ala Asn Gly Lys Val Glu Ile 515 520 525 Ala Glu Leu Thr Pro Ile Asn Gly Lys Val Glu Ile Ala Glu Leu Lys 530 535 540 Pro Ile Asn Gly Lys Val Glu Leu Val Glu Ser 545 550 555 <210> 14 <211> 1665 <212> DNA <213> Artificial Sequence <220> <223> recombinant FaGT2 <400> 14 atgggtagtg agtctttggt tcacgtcttt cttgtgtcct ttattggcca aggacacgtc 60 aacccgcttc tgaggctggg gaaaagactg gcggctaaag ggctgcttgt aaccttttgc 120 acggcggagt gtgtcgggaa agaaatgagg aaatctaatg ggataaccga tgagcctaaa 180 ccagtaggag acggatttat acgttttgaa ttcttcaaag ataggtgggc agaagacgaa 240 cccatgcgtc aggacctgga tctatattta ccccagttgg agctagttgg aaaagaagtg 300 atacctgaga tgataaagaa gaacgcagaa caagggagac cagtatcatg cctaatcaat 360 aatccgttta ttccgtgggt ctgtgacgtc gccgaatctc ttggtttacc atccgcaatg 420 ctgtgggttc aatctgccgc ctgtctagct gcttactatc actactacca tggactagtc 480 ccattcccat ctgagtctga catgttttgt gatgtccaaa tacccagcat gccattgtta 540 aaatatgacg aggttcccag ctttttgtat ccaacgagcc catatccatt tttgagacgt 600 gcgatcttgg gtcaatacgg gaacttagag aagccgtttt gtatccttat ggatacgttt 660 caagaactag agtctgaaat aattgaatat atggccagat tatgtcccat aaaggccgtc 720 gggccattgt tcaagaatcc caaggctcaa aacgcggtta gaggcgactt tatggaagcc 780 
gatgactcaa tcatcggatg gttggacaca aaacccaaga gttcagttgt ttacattagc 840 ttcgggagcg ttgtatactt aaagcaggag caagtggatg aaattgctca cggactttta 900 tcatccggcg tgtcttttat atgggtgatg aaacctcctc atccggatag cggatttgaa 960 ttgttggtat taccagaagg tttcctagaa aaggcgggag accgtggaaa agttgtgcaa 1020 tggagtccac aagaaaagat attagaacac ccttcaactg catgttttgt gacacactgc 1080 gggtggaata gtacgatgga aagtttaacc agcgggatgc ccgtggtcgc gtttccacag 1140 tggggggatc aggtaactga cgcgaagtac ctagtggatg aatttaaggt aggagttagg 1200 atgtgcagag gcgaggccga ggacagggtg attcctagag acgaggttga aaagtgtctt 1260 ttggaagcta cgtctggttc aaaagctgca gagatgaaac aaaatgcgct taagtggaaa 1320 gctgctgcag aggctgcgtt ctccgaagga ggatcttcag ataggaatct gcaagccttc 1380 gtcgacgaag tcagacgtat tagcgccagc ttaaattcta agtcctctgc tgtcggctat 1440 gtcaagagta agatcaacgg agtggtggaa tatgtcgata gtaagttaaa cgggaaagcc 1500 gccccggtgg aggaggcgaa cactcgtaca aatggcatcg caaaggtaga acaacctaag 1560 gcagcaaacg gcaaggtgga aatagctgaa ttaactccga ttaatggcaa ggttgagatt 1620 gccgaactga agccaataaa cgggaaagtc gagttagtag agtca 1665 <210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 15 cgggatccat gggtagtgag tctt 24 <210> 16 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 16 cccaagcttt gactctacta actcgactt 29 <210> 17 <211> 473 <212> PRT <213> Artificial Sequence <220> <223> recombinant StUGT <400> 17 Met Val Gln Pro His Val Leu Leu Val Thr Phe Pro Ala Gln Gly His 1 5 10 15 Ile Asn Pro Ser Leu Gln Phe Ala Lys Arg Leu Ile Lys Met Gly Ile 20 25 30 Glu Val Thr Phe Thr Thr Ser Val Phe Ala His Arg Arg Met Ala Lys 35 40 45 Thr Ala Ala Ser Asn Ala Pro Lys Gly Leu Asn Leu Ala Ala Phe Ser 50 55 60 Asp Gly Phe Asp Asp Gly Phe Lys Ser Asn Val Asp Asp Ser Lys Arg 65 70 75 80 Tyr Met Ser Glu Ile Arg Ser Arg Gly Ser Gln Thr Leu Arg Asp Ile 85 90 95 Ile Leu Lys Ser Ser Asp Glu Gly Arg Pro Val Thr Ser Leu Val Tyr 100 105 110 Thr Leu Leu Leu Pro Trp Ala Ala Glu Val Ala Arg Glu Leu His Ile 115 120 125 Pro Ser Ala Leu Leu Trp Ile Gln Pro 
Ala Thr Val Leu Asp Ile Tyr 130 135 140 Tyr Tyr Tyr Phe Asn Gly Tyr Glu Asp Glu Met Lys Cys Ser Ser Asn 145 150 155 160 Asp Pro Asn Trp Ser Ile Gln Leu Pro Arg Leu Pro Leu Leu Lys Ser 165 170 175 Gln Asp Leu Pro Ser Phe Leu Val Ser Ser Ser Ser Lys Asp Asp Lys 180 185 190 Tyr Ser Phe Ala Leu Pro Thr Phe Lys Glu Gln Leu Asp Thr Leu Asp 195 200 205 Gly Glu Glu Asn Pro Lys Val Leu Val Asn Thr Phe Asp Ala Leu Glu 210 215 220 Leu Glu Pro Leu Lys Ala Ile Glu Lys Tyr Asn Leu Ile Gly Ile Gly 225 230 235 240 Pro Leu Ile Pro Ser Ser Phe Leu Gly Gly Lys Asp Ser Leu Glu Ser 245 250 255 Ser Phe Gly Gly Asp Leu Phe Gln Lys Ser Asp Asp Asp Tyr Met Glu 260 265 270 Trp Leu Asn Thr Lys Pro Lys Ser Ser Ile Val Tyr Ile Ser Phe Gly 275 280 285 Ser Leu Leu Asn Leu Ser Arg Asn Gln Lys Glu Glu Ile Ala Lys Gly 290 295 300 Leu Ile Glu Ile Lys Arg Pro Phe Leu Trp Val Ile Arg Asp Gln Glu 305 310 315 320 Asn Ile Lys Glu Val Glu Lys Glu Glu Glu Lys Leu Ser Cys Met Met 325 330 335 Glu Leu Glu Lys Gln Gly Lys Ile Val Pro Trp Cys Ser Gln Leu Glu 340 345 350 Val Leu Thr His Pro Ser Leu Gly Cys Phe Val Ser His Cys Gly Trp 355 360 365 Asn Ser Thr Leu Glu Ser Leu Ser Ser Gly Val Pro Val Val Ala Phe 370 375 380 Pro His Trp Thr Asp Gln Gly Thr Asn Ala Lys Leu Ile Glu Asp Val 385 390 395 400 Trp Lys Thr Gly Val Arg Met Arg Val Ser Glu Asp Gly Val Val Glu 405 410 415 Ser Glu Glu Ile Lys Arg Cys Ile Glu Ile Val Met Asp Gly Gly Glu 420 425 430 Lys Gly Glu Glu Met Arg Lys Asn Ala Gln Lys Trp Lys Glu Leu Ala 435 440 445 Arg Glu Ala Val Lys Glu Gly Gly Ser Ser Glu Val Asn Leu Lys Ala 450 455 460 Phe Val Gln Glu Val Gly Lys Ser Cys 465 470 <210> 18 <211> 1419 <212> DNA <213> Artificial Sequence <220> <223> recombinant StUGT <400> 18 atggttcaac ctcatgtgct tcttgtaacg ttcccggcac aaggtcacat caacccctca 60 cttcagtttg ccaagagatt aattaagatg ggcatcgagg ttacctttac tacaagcgtc 120 ttcgcccaca gacgtatggc gaagacggct gcttccaacg ctcccaaggg ccttaatctg 180 gccgccttct cagatggttt tgatgatggc ttcaaatcta atgtcgatga ttcaaagcgt 240
tatatgtctg agattagatc taggggcagc cagaccctgc gtgacattat cttaaagagt 300 agcgacgaag gaaggcccgt aacaagtcta gtatataccc tactattgcc ttgggctgct 360 gaagtagcaa gggagctgca tattccttca gctttgttat ggattcagcc agcgacagtc 420 ctagacatat attattatta cttcaatggc tacgaagatg agatgaaatg tagctcaaac 480 gatcctaatt ggtcaataca attacctcgt ttgcccttac ttaagtccca agatttaccc 540 tctttccttg tgtcctctag ctccaaggat gataaataca gctttgctct accaacgttc 600 aaagaacaac ttgacactct tgacggagag gagaatccca aggtcttagt gaatacgttt 660 gacgccctag aattagagcc gttaaaagca atcgagaagt ataatcttat cgggattgga 720 cctctaattc cgtcttcatt cctgggcggg aaagattcac tagaaagtag cttcggaggc 780 gatttatttc agaaaagcga cgatgattac atggaatggc taaacactaa acccaaatct 840 tcaattgtgt acatttcctt cggttcacta cttaacctat ccagaaatca aaaagaggaa 900 atcgcaaaag ggttaattga aatcaagcgt ccttttctgt gggtcattag ggatcaggaa 960 aatataaagg aggtggagaa ggaggaagag aagttaagct gcatgatgga gctggaaaaa 1020 caaggtaaaa ttgttccttg gtgtagccaa ctagaggtat taactcatcc ctccctggga 1080 tgcttcgtaa gccattgtgg atggaacagt acacttgaaa gtttatcttc aggcgtgccc 1140 gtggtcgcct tccctcactg gacagaccaa ggtactaacg cgaaattaat agaagatgtt 1200 tggaagacag gtgtacgtat gcgtgttagc gaggatggcg tcgtcgaatc agaggagatt 1260 aaaagatgta tagagatagt gatggatggg ggcgagaaag gcgaggaaat gagaaaaaac 1320 gctcagaaat ggaaagaact agccagggaa gcggtgaagg agggggggag cagtgaggtc 1380 aacctaaagg cttttgtgca ggaagtggga aaatcttgt 1419 <210> 19 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 19 ccggaattca tggttcaacc tcatgtgctt 30 <210> 20 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 20 ccgctcgaga caagattttc ccacttcctg 30 <210> 21 <211> 454 <212> PRT <213> Artificial Sequence <220> <223> recombinant CaUGT3 <400> 21 Met Ala Thr Glu Gln Gln Gln Ala Ser Ile Ser Cys Lys Ile Leu Met 1 5 10 15 Phe Pro Trp Leu Ala Phe Gly His Ile Ser Ser Phe Leu Gln Leu Ala 20 25 30 Lys Lys Leu Ser Asp Arg Gly Phe Tyr Phe Tyr Ile Cys Ser Thr Pro 35 40 45 Ile Asn Leu Asp Ser Ile Lys Asn Lys Ile Asn Gln Asn Tyr Ser Ser 50
55 60 Ser Ile Gln Leu Val Asp Leu His Leu Pro Asn Ser Pro Gln Leu Pro 65 70 75 80 Pro Ser Leu His Thr Thr Asn Gly Leu Pro Pro His Leu Met Ser Thr 85 90 95 Leu Lys Asn Ala Leu Ile Asp Ala Asn Pro Asp Leu Cys Lys Ile Ile 100 105 110 Ala Ser Ile Lys Pro Asp Leu Ile Ile Tyr Asp Leu His Gln Pro Trp 115 120 125 Thr Glu Ala Leu Ala Ser Arg His Asn Ile Pro Ala Val Ser Phe Ser 130 135 140 Thr Met Asn Ala Val Ser Phe Ala Tyr Val Met His Met Phe Met Asn 145 150 155 160 Pro Gly Ile Glu Phe Pro Phe Lys Ala Ile His Leu Ser Asp Phe Glu 165 170 175 Gln Ala Arg Phe Leu Glu Gln Leu Glu Ser Ala Lys Asn Asp Ala Ser 180 185 190 Ala Lys Asp Pro Glu Leu Gln Gly Ser Lys Gly Phe Phe Asn Ser Thr 195 200 205 Phe Ile Val Arg Ser Ser Arg Glu Ile Glu Gly Lys Tyr Val Asp Tyr 210 215 220 Leu Ser Glu Ile Leu Lys Ser Lys Val Ile Pro Val Cys Pro Val Ile 225 230 235 240 Ser Leu Asn Asn Asn Asp Gln Gly Gln Gly Asn Lys Asp Glu Asp Glu 245 250 255 Ile Ile Gln Trp Leu Asp Lys Lys Ser His Arg Ser Ser Val Phe Val 260 265 270 Ser Phe Gly Ser Glu Tyr Phe Leu Asn Met Gln Glu Ile Glu Glu Ile 275 280 285 Ala Ile Gly Leu Glu Leu Ser Asn Val Asn Phe Ile Trp Val Leu Arg 290 295 300 Phe Pro Lys Gly Glu Asp Thr Lys Ile Glu Glu Val Leu Pro Glu Gly 305 310 315 320 Phe Leu Asp Arg Val Lys Thr Lys Gly Arg Ile Val His Gly Trp Ala 325 330 335 Pro Gln Ala Arg Ile Leu Gly His Pro Ser Ile Gly Gly Phe Val Ser 340 345 350 His Cys Gly Trp Asn Ser Val Met Glu Ser Ile Gln Ile Gly Val Pro 355 360 365 Ile Ile Ala Met Pro Met Asn Leu Asp Gln Pro Phe Asn Ala Arg Leu 370 375 380 Val Val Glu Ile Gly Val Gly Ile Glu Val Gly Arg Asp Glu Asn Gly 385 390 395 400 Lys Leu Lys Arg Glu Arg Ile Gly Glu Val Ile Lys Glu Val Ala Ile 405 410 415 Gly Lys Lys Gly Glu Lys Leu Arg Lys Thr Ala Lys Asp Leu Gly Gln 420 425 430 Lys Leu Arg Asp Arg Glu Lys Gln Asp Phe Asp Glu Leu Ala Ala Thr 435 440 445 Leu Lys Gln Leu Cys Val 450 <210> 22 <211> 1362 <212> DNA <213> Artificial Sequence <220> <223> recombinant CaUGT3 <400> 22 atggccaccg aacagcaaca ggcatccatc
tcttgtaaaa ttttgatgtt cccttggcta 60 gctttcggac atatctccag cttcttgcaa ttagcgaaga agcttagcga cagggggttc 120 tatttttata tctgctccac ccctataaat cttgactcca taaaaaacaa aataaatcag 180 aattattcat cctccattca gctagtcgat ctgcacctac ccaatagccc tcagttgccc 240 ccgagtttac acacaaccaa cggactaccg ccgcatttga tgtcaactct gaagaatgcg 300 cttattgacg ctaatcccga tctgtgtaaa ataatagcaa gcattaagcc ggatctgatc 360 atatatgatc tacatcaacc gtggacagag gcgctagcct cacgtcacaa cattcctgcg 420 gtcagtttct ctaccatgaa tgctgtatcc tttgcctacg ttatgcatat gtttatgaat 480 ccgggcatag aatttccatt caaagctata catctgtctg atttcgagca agcgcgtttc 540 ttggagcagc tggagtcagc aaaaaacgat gcctcagcaa aagaccccga actacaggga 600 agtaaagggt tttttaactc cacctttata gttagatcca gtcgtgaaat cgaggggaaa 660 tacgtcgatt acttgagtga gatcttaaaa agtaaggtta tacctgtttg ccccgtcatt 720 agccttaaca ataatgatca gggacaaggg aacaaagacg aggacgaaat cattcaatgg 780 ttggacaaaa aaagtcatag aagctccgtg tttgtttcat tcggtagtga atacttcctt 840 aacatgcaag agattgaaga gatagccatt ggtcttgagc tgtctaatgt caacttcatc 900 tgggttttga ggtttcccaa aggtgaagat acgaaaatag aagaggtgct accggaagga 960 tttctagata gggtgaagac caaagggaga atagtgcatg gctgggcccc acaggctcgt 1020 attctgggtc atccatctat aggtgggttt gtatcccatt gtggctggaa cagtgtgatg 1080 gaaagtattc aaatcggtgt gcccatcatt gcgatgccga tgaatttaga ccaacctttc 1140 aatgcaaggc ttgtcgtgga aattggtgtg ggtattgagg tcggcaggga cgagaacggg 1200 aaacttaaaa gggaacgtat cggtgaggtc attaaggaag ttgcaatcgg aaagaagggg 1260 gagaagctta gaaaaaccgc gaaagacctg ggccagaagc taagggaccg tgagaaacag 1320 gattttgatg agcttgcagc gacgctgaag caattgtgcg ta 1362 <210> 23 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 23 cgggatccat ggccaccgaa cagca 25 <210> 24 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 24 acgcgtcgac tacgcacaat tgcttcagc 29 <210> 25 <211> 456 <212> PRT <213> Artificial Sequence <220> <223> recombinant NsUGT <400> 25 Met Asp Thr Glu His Ser Thr Leu Arg Val Leu Met Phe Pro Trp Leu 1 5 10 15 Ala His Gly His Ile Ser Pro Tyr Leu 
Thr Val Ala Lys Lys Leu Ala 20 25 30 Cys Arg Gly Phe Tyr Val Tyr Leu Cys Ser Thr Pro Val Asn Leu Asn 35 40 45 Phe Ile Lys Lys Lys Ile Pro Gln Lys Tyr Ser Ile Ser Ile Gln Leu 50 55 60 Val Glu Phe His Leu Pro Asp Leu Pro Glu Leu Pro Leu Ser Tyr His 65 70 75 80 Thr Thr Asn Gly Ile Pro Pro His Leu Val Ser Thr Leu Lys Lys Ala 85 90 95 Val Lys Met Ser Lys Pro Asn Phe Tyr Lys Ile Ile Glu Asn Leu Lys 100 105 110 Pro Asn Met Leu Ile Tyr Asp Ile Leu Gln Pro Trp Ala Lys Glu Val 115 120 125 Ala Asn Ser Tyr Asn Ile Pro Ala Val Met Leu Leu Thr Phe Cys Ala 130 135 140 Ala Met Leu Ser Tyr Arg Leu His Pro Val Lys Lys Pro Gly Thr Glu 145 150 155 160 Phe Pro Phe Pro Ala Leu Tyr Leu Arg Lys Ile Glu Arg Gln Gln Arg 165 170 175 Glu Glu Met Leu Glu Lys Ala Ala Lys Glu Lys Asp Pro Asp Asp Lys 180 185 190 Asp Pro Phe Ala Glu Glu Glu Thr Met Asn Lys Ile Ile Leu Met Ser 195 200 205 Thr Ser Arg Ala Thr Glu Ala Lys Tyr Ile Asp Tyr Phe Thr Glu Leu 210 215 220 Ile Gln Trp Lys Ile Ile Pro Val Gly Pro Pro Val Gln Glu Thr Thr 225 230 235 240 Asn Glu Tyr Asp Gly Asp Val Asp Asp Leu Ile Asp Trp Leu Gly Asn 245 250 255 Lys Tyr Glu Asn Ser Thr Val Phe Val Ser Phe Gly Ser Gln Tyr Phe 260 265 270 Leu Ser Lys Glu Asp Leu Glu Glu Ile Ala Leu Gly Leu Glu Leu Ser 275 280 285 Asn Val Asn Phe Ile Trp Val Val Arg Phe Pro Lys Gly Glu Glu Val 290 295 300 Arg Val Glu Glu Ala Leu Pro Glu Gly Phe Leu Glu Arg Ile Gly Asp 305 310 315 320 Arg Gly Arg Val Val Asp Gly Trp Ala Pro Gln Leu Arg Ile Leu Ser 325 330 335 His Pro Ser Thr Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Val 340 345 350 Met Glu Ser Ile Asp Phe Arg Val Pro Ile Ile Ala Leu Pro Met His 355 360 365 Leu Asp Gln Pro Ile Asn Ala Arg Leu Ile Val Glu Leu Gly Val Ala 370 375 380 Val Glu Ile Val Arg Asp Asp Glu Gly Lys Val His Arg Gly Glu Val 385 390 395 400 Ala Glu Ile Val Lys Ser Ile Ile Cys Glu Lys Thr Gly Glu Asn Leu 405 410 415 Arg Asn Lys Val Arg Glu Ile Ser Glu Asn Leu Lys Lys Glu Arg Glu 420 425 430 Glu Glu Met Asp Ala Ala Ile Gly Glu Leu Val Gln Leu Cys Lys
Thr 435 440 445 Ser Lys Ser Asn Asn Met Met Leu 450 455 <210> 26 <211> 1368 <212> DNA <213> Artificial Sequence <220> <223> recombinant NsUGT <400> 26 atggacacag aacacagcac gctacgtgtg ctaatgttcc cgtggctagc gcacggacac 60 atatctccct atttaacagt tgcaaagaag ctagcatgta gaggatttta cgtctatctt 120 tgttccacgc cggtcaatct aaacttcatc aagaagaaga taccccagaa gtattcaatt 180 agtatacagt tggtcgaatt ccacctacct gacctacctg agcttccatt aagttaccat 240 acgacaaacg gcattcctcc ccacctagtg tcaacgttaa aaaaagccgt aaagatgagc 300 aaaccgaatt tttataaaat aattgagaat ttgaaaccaa atatgttgat atatgacatt 360 cttcagccat gggcgaagga agtcgccaac tcctacaata tccccgctgt tatgttattg 420 acgttttgcg ctgccatgct ttcttaccgt ttgcacccgg tcaaaaagcc gggcacagag 480 ttcccattcc cggcgctata tttgaggaaa attgaacgtc agcaaaggga ggagatgctt 540 gaaaaagccg ccaaggagaa ggacccggat gataaagacc cgtttgcgga ggaagaaact 600 atgaacaaga taatcctgat gtctacaagc agggcgacgg aggccaagta catcgactat 660 ttcactgaac taattcagtg gaagataatt cctgtgggcc cgcccgtcca ggaaacgact 720 aacgagtacg acggagatgt agatgattta attgattggt tgggaaacaa atatgaaaac 780 tccactgtat ttgtcagctt tgggagccaa tatttcttga gcaaagaaga cttagaggag 840 attgctctag gattggaact atccaatgtg aacttcatat gggtcgtgag attccccaag 900 ggggaggaag ttagagtaga agaagcgttg ccagaaggtt tccttgagag gattggggac 960 agagggaggg ttgttgacgg ttgggcacct caattgcgta ttctgagtca cccttcaacc 1020 ggaggatttg tcagtcactg tgggtggaac tctgtgatgg agtccatcga ttttcgtgtc 1080 cctatcatcg ccctgccaat gcacctggat cagccaatca atgcccgtct aattgtcgaa 1140 ttaggagttg ccgtcgaaat agttcgtgac gacgagggca aagttcacag gggggaggta 1200 gcagagatcg ttaaaagcat tatatgtgag aagacggggg aaaatttgcg taataaagtc 1260 cgtgaaatct ctgaaaatct taagaaagaa agggaagagg agatggacgc agcgatagga 1320 gaactggtgc aattatgcaa gactagcaaa agtaataaca tgatgttg 1368 <210> 27 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 27 cgggatccat ggacacagaa cacagca 27 <210> 28 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 28 acgcgtcgac caacatcatg ttattacttt tgc 33 <210> 29 
<211> 453 <212> PRT <213> Artificial Sequence <220> <223> recombinant SpUGT <400> 29 Met Gly Thr Gln Val Thr Glu His Gly Thr Ser Asn Leu Arg Val Val 1 5 10 15 Met Phe Pro Trp Leu Ala Tyr Gly His Ile Ser Pro Phe Leu Tyr Val 20 25 30 Ala Lys Lys Leu Ala Asp Arg Gly Phe Leu Ile Tyr Leu Cys Ser Thr 35 40 45 Pro Ile Asn Leu Lys Ser Thr Ile Lys Lys Ile Pro Glu Lys Tyr Ala 50 55 60 Asp Ser Ile His Leu Ile Glu Leu His Ile Pro Glu Leu Pro Glu Leu 65 70 75 80 Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro His Leu Asn His 85 90 95 Thr Leu Gln Lys Ala Leu Lys Met Ser Lys Pro Asn Leu Ser Lys Ile 100 105 110 Leu Lys Asn Leu Lys Pro Asp Leu Met Ile Tyr Asp Val Leu Gln Gln 115 120 125 Trp Ala Glu Arg Val Ala Asn Glu Gln Ser Ile Pro Ala Val Arg Leu 130 135 140 Leu Thr Phe Gly Ala Ala Val Phe Ser Tyr Phe Cys Asn Leu Val Lys 145 150 155 160 Lys Pro Gly Val Glu Phe Pro Phe Pro Asp Ile Tyr Leu Arg Lys Ile 165 170 175 Glu Gln Val Lys Leu Gly Glu Met Leu Glu Lys Ser Ala Lys Asp Gln 180 185 190 Asp Pro Asp Asp Glu Glu Arg Leu Val Asp Glu Tyr Lys Gln Ile Ala 195 200 205 Leu Ile Cys Thr Ser Arg Thr Ile Glu Ala Lys Tyr Ile Asp Phe Leu 210 215 220 Leu Glu Leu Ser Asn Leu Lys Val Val Pro Val Gly Pro Pro Val Gln 225 230 235 240 Asp Leu Ile Thr Asn Asp Ala Asp Asp Met Glu Leu Ile Asp Trp Leu 245 250 255 Gly Ser Lys Asp Glu Asn Ser Thr Val Phe Val Ser Phe Gly Ser Glu 260 265 270 Tyr Phe Leu Ser Lys Glu Asp Met Glu Glu Val Ala Leu Gly Leu Glu 275 280 285 Leu Ser Asn Val Asn Phe Val Trp Val Ala Arg Phe Pro Lys Gly Glu 290 295 300 Glu Gln Asn Leu Glu Asp Ala Leu Pro Lys Gly Phe Leu Glu Arg Ile 305 310 315 320 Gly Glu Arg Gly Arg Val Leu Asp Lys Phe Ala Pro Gln Leu Arg Ile 325 330 335 Leu Asn His Thr Ser Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn 340 345 350 Ser Val Met Glu Ser Ile His Phe Gly Val Pro Ile Val Ala Met Pro 355 360 365 Met His Leu Asp Gln Pro Met Asn Ala Arg Leu Ile Val Glu Leu Gly 370 375 380 Val Ala Val Glu Ile Val Arg Asp Asp Asp Gly Lys Ile His Arg Glu 385 390 395 400 Glu Ile Ala Lys 
Thr Leu Lys Asp Val Ile Thr Glu Arg Ile Gly Glu 405 410 415 Asn Leu Arg Ala Lys Met Arg Asp Ile Ser Met Asn Leu Asn Ser Ile 420 425 430 Ser Gly Glu Glu Met Asp Ala Ala Ala His Glu Leu Ile Gln Phe Cys 435 440 445 Lys Ile Asn Thr Asn 450 <210> 30 <211> 1359 <212> DNA <213> Artificial Sequence <220> <223> recombinant SpUGT <400> 30 atgggaactc aagttacaga acatggaaca tctaatctaa gggtagtcat gttcccttgg 60 ctggcatacg gtcacatctc accattcctt tatgtcgcaa aaaaactggc agacaggggt 120 ttcctgattt acttatgtag tactcccatt aatcttaagt ccacaataaa gaaaatacca 180 gaaaaatacg ccgactccat ccatttaatc gagcttcata ttccagaatt gccagagttg 240 cctcctcatt atcataccac gaacggcctt ccacctcatc taaatcacac actacagaag 300 gccttgaaga tgtcaaagcc caatctatcc aaaatactaa aaaaccttaa gccagattta 360 atgatctacg acgtactaca acagtgggct gagagggtag cgaacgaaca gtccattccg 420 gctgtaaggt tattaacctt cggtgccgca gttttttcat atttttgcaa tttggttaag 480 aaacccggag tcgagtttcc attccccgac atttatttga gaaaaattga gcaggtcaaa 540 ctaggtgaaa tgttggagaa atctgccaaa gaccaagatc ctgacgatga agaaaggtta 600 gtagatgagt acaaacaaat tgctttaatt tgtaccagta gaactattga agccaaatac 660 atcgacttcc tattggagct gagtaaccta aaggtcgttc cagtcggtcc tcccgtccag 720 gacttgatca ccaatgatgc cgatgatatg gaattaatcg actggcttgg ctcaaaggat 780 gaaaattcca ccgttttcgt aagtttcggg agtgagtatt ttttaagcaa agaagatatg 840 gaagaggtcg cgctaggcct tgaactgagc aacgttaact ttgtatgggt cgctaggttc 900 cccaagggag aagaacaaaa tctagaagat gccttaccga aggggttcct ggagagaata 960 ggggaacgtg ggagagtcct ggataagttt gcaccgcagc tgaggatatt aaatcacact 1020 tccactggtg gatttatatc acattgtggt tggaacagcg tgatggaatc tatacacttc 1080 ggagttccga ttgtggcgat gccaatgcat ttggaccagc caatgaacgc gaggcttatc 1140 gttgagcttg gagtagcggt ggagattgtt agggacgatg atggcaaaat tcacagggaa 1200 gaaattgcga agactcttaa ggacgtgata acagagcgta ttggagagaa tctaagggct 1260 aagatgcgtg atatttcaat gaacctaaac agtattagcg gagaggagat ggacgcagct 1320 gctcatgaac ttatacagtt ctgtaagatc aacaccaat 1359 <210> 31 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> primer 
<400> 31 cgggatccat gggaactcaa gttacagaa 29 <210> 32 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 32 acgcgtcgac attggtgttg atcttacaga ac 32 <210> 33 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 33 cggaattcat ggcgaacaaa gaagagg 27 <210> 34 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 34 cccaagcttt taggtctccg cttgatgc 28 <210> 35 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 35 gctctagaag gaggattaca aaatgactgc tgtcgttctc c 41 <210> 36 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 36 cggaattcct agagcttgcg gaagag 26 <210> 37 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 37 ggaagatctg ctgtgcaggt cgtaaa 26 <210> 38 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 38 ataagaatgc ggccgcgaaa cgcaaaaagg cca 33 <210> 39 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 39 aagagcguct agaaggagga ttacaaaa 28 <210> 40 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 40 agctgatugt actgagagtg caccat 26 <210> 41 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 41 gctctagaag aggattacaa aatggttcaa cctcatgtcc 40 <210> 42 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 42 tccccccggg ttaacaacct tttcctactt cttg 34 <210> 43 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 43 tccccccggg aggaggacag ctaaatggcc accgaacagc 40 <210> 44 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> primer <400> 44 aaggaaaaaa gcggccgctt atacgcacaa ttgcttcagc 40
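The primers in the sequence listing above carry short 5' extensions that match well-known restriction enzyme recognition sites, and each coding sequence length is a multiple of three. The following is a minimal Python sketch for checking both properties. The primer strings and the 1359-nt length are copied from the listing (SEQ ID NOs: 27, 28, 31 to 34 and 30); the helper names `clean`, `find_sites`, and `codon_count` are hypothetical, and the site table holds only a few standard recognition sequences, so other sites in a primer would go unreported. That these sites were used for cloning is an assumption, not something stated in this section.

```python
# Standard recognition sequences for a few common restriction enzymes.
# This table is an illustrative subset, not an exhaustive list.
SITES = {
    "BamHI": "ggatcc",
    "SalI": "gtcgac",
    "EcoRI": "gaattc",
    "HindIII": "aagctt",
    "XbaI": "tctaga",
    "NotI": "gcggccgc",
}

# Primer sequences copied from the sequence listing, keyed by SEQ ID NO.
PRIMERS = {
    27: "cgggatccat ggacacagaa cacagca",
    28: "acgcgtcgac caacatcatg ttattacttt tgc",
    31: "cgggatccat gggaactcaa gttacagaa",
    32: "acgcgtcgac attggtgttg atcttacaga ac",
    33: "cggaattcat ggcgaacaaa gaagagg",
    34: "cccaagcttt taggtctccg cttgatgc",
}


def clean(seq: str) -> str:
    """Remove the block spacing used in sequence listings and lowercase."""
    return "".join(seq.split()).lower()


def find_sites(seq: str) -> list[str]:
    """Return the names of all tabulated sites found in the sequence."""
    s = clean(seq)
    return sorted(name for name, site in SITES.items() if site in s)


def codon_count(n_nucleotides: int) -> int:
    """Number of codons in a coding sequence of the given length."""
    if n_nucleotides % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return n_nucleotides // 3


if __name__ == "__main__":
    for seq_id, primer in PRIMERS.items():
        print(seq_id, len(clean(primer)), find_sites(primer))
    # SEQ ID NO: 30 (1359 nt) against SEQ ID NO: 29 (<211> 453 residues).
    print(codon_count(1359))
```

Under these assumptions, the forward primers 27 and 31 contain a BamHI site and the reverse primers 28 and 32 a SalI site, while 1359 nucleotides correspond to 453 codons, consistent with the `<211> 453` entry of SEQ ID NO: 29.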
Claims (21)
A composition for producing crocin-4, comprising a CaUGT3 enzyme derived from a Catharanthus roseus strain and consisting of the amino acid sequence of SEQ ID NO: 21.
The composition of claim 1, wherein the composition further comprises crocin-2 and UDP-glucose.
A method for producing crocin-4 from crocin-2, comprising reacting crocin-2 with UDP-glucose in the presence of a CaUGT3 enzyme consisting of the amino acid sequence of SEQ ID NO: 21 to obtain a reaction product containing crocin-4.
The method of claim 4, wherein the reaction is carried out at a temperature of 20 to 40 °C for 10 to 240 minutes.
A composition for producing crocin-4, comprising a transformant into which a CaUGT3 gene derived from a Catharanthus roseus strain and consisting of the nucleotide sequence of SEQ ID NO: 22 is introduced, or a culture product thereof.
A method for producing crocin-4 from crocin-2, comprising culturing a transformant into which the CaUGT3 gene consisting of the nucleotide sequence of SEQ ID NO: 22 is introduced, in a medium containing crocin-2 and UDP-glucose.
The method of claim 8, further comprising recovering crocin-4 from the culture obtained through the culturing.
A composition for producing crocin-4, comprising: (a) a GjUGT1 enzyme derived from a Gardenia jasminoides strain and consisting of the amino acid sequence of SEQ ID NO: 1, or an NtUGT enzyme derived from a Nicotiana tabacum strain and consisting of the amino acid sequence of SEQ ID NO: 9; and (b) a CaUGT3 enzyme derived from a Catharanthus roseus strain and consisting of the amino acid sequence of SEQ ID NO: 21.
The composition of claim 10, wherein the composition further comprises crocetin and UDP-glucose.
A method for producing crocin-4 from crocetin, comprising:
(a) reacting crocetin with UDP-glucose in the presence of a GjUGT1 enzyme consisting of the amino acid sequence of SEQ ID NO: 1 or an NtUGT enzyme consisting of the amino acid sequence of SEQ ID NO: 9 to obtain a reaction product containing crocin-2; and
(b) adding a CaUGT3 enzyme consisting of the amino acid sequence of SEQ ID NO: 21 to the reaction product and reacting to obtain crocin-4.
The method of claim 13, wherein the reactions of steps (a) and (b) are carried out at a temperature of 20 to 40 °C for 10 to 240 minutes.
The method of claim 13, further comprising adding UDP-glucose in step (b).
A composition for producing crocin-4, comprising a transformant into which the following genes are introduced, or a culture product thereof: (a) a GjUGT1 gene derived from a Gardenia jasminoides strain and consisting of the nucleotide sequence of SEQ ID NO: 2, or an NtUGT gene derived from a Nicotiana tabacum strain and consisting of the nucleotide sequence of SEQ ID NO: 10; and (b) a CaUGT3 gene derived from a Catharanthus roseus strain and consisting of the nucleotide sequence of SEQ ID NO: 22.
The composition of claim 16, wherein a crocetin biosynthesis gene is additionally introduced into the transformant.
The composition of claim 18, wherein the crocetin biosynthesis genes are the CsCCD2 gene and the aldH gene.
A method for producing crocin-4 from crocetin, comprising culturing, in a medium containing crocetin and UDP-glucose, a transformant into which the following genes are introduced: (a) the GjUGT1 gene consisting of the nucleotide sequence of SEQ ID NO: 2 or the NtUGT gene consisting of the nucleotide sequence of SEQ ID NO: 10; and (b) the CaUGT3 gene consisting of the nucleotide sequence of SEQ ID NO: 22.
The method of claim 20, further comprising recovering crocin-4 from the culture obtained through the culturing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020190114070A KR102335035B1 (en) | 2019-09-17 | 2019-09-17 | Process for preparing crocin-4 using enzyme reaction |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20210033100A KR20210033100A (en) | 2021-03-26 |
KR102335035B1 true KR102335035B1 (en) | 2021-12-06 |
Family
ID=75259455
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020190114070A Active KR102335035B1 (en) | 2019-09-17 | 2019-09-17 | Process for preparing crocin-4 using enzyme reaction |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR102335035B1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114032222B (en) * | 2021-05-17 | 2022-06-28 | 中国科学院天津工业生物技术研究所 | Sugar chain elongation glycosyltransferase mutants, their encoding genes, and genetically engineered bacteria and their applications |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170306376A1 (en) | 2011-08-08 | 2017-10-26 | Evolva Sa | Methods and Materials for Recombinant Production of Saffron Compounds |
Non-Patent Citations (6)
Title |
---|
FEBS Letters, Vol. 586, pp. 1055-1061 (2012.03.10.)* |
Microbial Cell Factories, Vol. 18, pp. 120(1-11) (2019.07.05.)* |
NCBI GenBank Accession No. BAH80312.1 (2009.08.18.)* |
NCBI Reference Sequence XP_016468459.1 (2016.05.03.)* |
Plant Cell Physiol., Vol. 50, pp. 1401-1415 (2009.) |
UniProtKB/Swiss-Prot F8WKW0.1 (2019.07.31.)* |
Also Published As
Publication number | Publication date |
---|---|
KR20210033100A (en) | 2021-03-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109402158B (en) | Recombinant expression plasmid vector for producing fucosyllactose, metabolic engineering bacteria and production method | |
CN113186142B (en) | Escherichia coli engineering strain for efficiently producing 2' -fucosyllactose | |
CN112280726B (en) | Construction method and application of high-yield tetrahydropyrimidine engineering strain | |
CN104059872A (en) | High-yield N-acetylglucosamine metabolic engineering bacterium, as well construction method and applications thereof | |
CN106350565A (en) | Production method of rare ginsenoside Rh2 | |
CN114350727B (en) | Method for synthesizing D-psicose by combining phosphorylation and ATP regeneration system | |
CN108060111A (en) | A kind of pseudomonas aeruginosa for improving rhamnolipid yield and its construction method | |
WO2024140379A1 (en) | Enzyme, strain for producing salidroside, and production method | |
KR101726288B1 (en) | Method for producing Phenylacetyl homoserine lactone derivatives | |
KR101632697B1 (en) | Method for producing salidroside with metabolically engineered Escherichia coli | |
KR102335035B1 (en) | Process for preparing crocin-4 using enzyme reaction | |
KR102125425B1 (en) | A microorganisms having crocin productivity and process for producing using the same | |
ES2286504T3 (en) | ZEAXANTINE PRODUCTION THROUGH PHAFFIA. | |
KR102343323B1 (en) | Process for preparing crocin-2 using enzyme reaction | |
CN110317765A (en) | A kind of E. coli expression strains of high yield geraniol glucoside and its application | |
CN114032222B (en) | Sugar chain elongation glycosyltransferase mutants, their encoding genes, and genetically engineered bacteria and their applications | |
CN112375725B (en) | Metabolic engineering strain for producing vitamin B6 and construction method and application thereof | |
KR101669044B1 (en) | Recombinant microoganisms for producing steviolbioside and method for steviolbioside using the same | |
CN116004489A (en) | A kind of recombinant escherichia coli producing NMN and its application | |
WO2004029261A2 (en) | Production of canthaxanthin by phaffia | |
CN115960941A (en) | Genetic engineering strain for producing DL-alanine and construction method and application thereof | |
KR101669057B1 (en) | Recombinant microoganisms for producing steviolmonoside and method for steviolmonoside using the same | |
CN116925993B (en) | Genetically engineered strains and methods for enzyme-catalyzed production of cytidine acids | |
KR102489243B1 (en) | Recombinant microorganism for producing S-methylmethionine, and method for preparing S-methylmethionine using the same | |
CN108949788A (en) | Lycopene synthesis related gene and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PA0109 | Patent application | Patent event code: PA01091R01D; Comment text: Patent Application; Patent event date: 20190917 |
PA0201 | Request for examination | |
E902 | Notification of reason for refusal | |
PE0902 | Notice of grounds for rejection | Comment text: Notification of reason for refusal; Patent event date: 20210209; Patent event code: PE09021S01D |
PG1501 | Laying open of application | |
E701 | Decision to grant or registration of patent right | |
PE0701 | Decision of registration | Patent event code: PE07011S01D; Comment text: Decision to Grant Registration; Patent event date: 20211123 |
GRNT | Written decision to grant | |
PR0701 | Registration of establishment | Comment text: Registration of Establishment; Patent event date: 20211130; Patent event code: PR07011E01D |
PR1002 | Payment of registration fee | Payment date: 20211201; End annual number: 3; Start annual number: 1 |
PG1601 | Publication of registration | ||
PR1001 | Payment of annual fee |