CN110885846B - Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof - Google Patents
Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof Download PDFInfo
- Publication number
- CN110885846B CN110885846B CN201811043657.0A CN201811043657A CN110885846B CN 110885846 B CN110885846 B CN 110885846B CN 201811043657 A CN201811043657 A CN 201811043657A CN 110885846 B CN110885846 B CN 110885846B
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- gly
- glu
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- FXNFHKRTJBSTCS-UHFFFAOYSA-N Baicalein Natural products C=1C(=O)C=2C(O)=C(O)C(O)=CC=2OC=1C1=CC=CC=C1 FXNFHKRTJBSTCS-UHFFFAOYSA-N 0.000 title claims abstract description 64
- UDFLTIRFTXWNJO-UHFFFAOYSA-N baicalein Chemical compound O1C2=CC(=O)C(O)=C(O)C2=C(O)C=C1C1=CC=CC=C1 UDFLTIRFTXWNJO-UHFFFAOYSA-N 0.000 title claims abstract description 63
- 229940015301 baicalein Drugs 0.000 title claims abstract description 63
- 244000005700 microbiome Species 0.000 title abstract description 9
- 238000002360 preparation method Methods 0.000 title abstract description 6
- 230000002194 synthesizing effect Effects 0.000 title abstract description 6
- DJSISFGPUUYILV-UHFFFAOYSA-N UNPD161792 Natural products O1C(C(O)=O)C(O)C(O)C(O)C1OC(C(=C1O)O)=CC2=C1C(=O)C=C(C=1C=CC(O)=CC=1)O2 DJSISFGPUUYILV-UHFFFAOYSA-N 0.000 title description 4
- DJSISFGPUUYILV-ZFORQUDYSA-N scutellarin Chemical compound O1[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1OC(C(=C1O)O)=CC2=C1C(=O)C=C(C=1C=CC(O)=CC=1)O2 DJSISFGPUUYILV-ZFORQUDYSA-N 0.000 title description 4
- NPLTVGMLNDMOQE-UHFFFAOYSA-N carthamidin Natural products C1=CC(O)=CC=C1C1OC2=CC(O)=C(O)C(O)=C2C(=O)C1 NPLTVGMLNDMOQE-UHFFFAOYSA-N 0.000 title 1
- 229930190376 scutellarin Natural products 0.000 title 1
- 238000000034 method Methods 0.000 claims abstract description 23
- 238000004519 manufacturing process Methods 0.000 claims abstract description 16
- 108090000623 proteins and genes Proteins 0.000 claims description 66
- RTIXKCRFFJGDFG-UHFFFAOYSA-N chrysin Chemical compound C=1C(O)=CC(O)=C(C(C=2)=O)C=1OC=2C1=CC=CC=C1 RTIXKCRFFJGDFG-UHFFFAOYSA-N 0.000 claims description 56
- 150000001413 amino acids Chemical class 0.000 claims description 53
- GAMYVSCDDLXAQW-AOIWZFSPSA-N Thermopsosid Natural products O(C)c1c(O)ccc(C=2Oc3c(c(O)cc(O[C@H]4[C@H](O)[C@@H](O)[C@H](O)[C@H](CO)O4)c3)C(=O)C=2)c1 GAMYVSCDDLXAQW-AOIWZFSPSA-N 0.000 claims description 43
- 229930003944 flavone Natural products 0.000 claims description 43
- 150000002212 flavone derivatives Chemical class 0.000 claims description 43
- 235000011949 flavones Nutrition 0.000 claims description 43
- VHBFFQKBGNRLFZ-UHFFFAOYSA-N vitamin p Natural products O1C2=CC=CC=C2C(=O)C=C1C1=CC=CC=C1 VHBFFQKBGNRLFZ-UHFFFAOYSA-N 0.000 claims description 43
- 101001112118 Homo sapiens NADPH-cytochrome P450 reductase Proteins 0.000 claims description 42
- 102100023897 NADPH-cytochrome P450 reductase Human genes 0.000 claims description 42
- NYCXYKOXLNBYID-UHFFFAOYSA-N 5,7-Dihydroxychromone Natural products O1C=CC(=O)C=2C1=CC(O)=CC=2O NYCXYKOXLNBYID-UHFFFAOYSA-N 0.000 claims description 28
- 235000015838 chrysin Nutrition 0.000 claims description 28
- 229940043370 chrysin Drugs 0.000 claims description 28
- 102000004190 Enzymes Human genes 0.000 claims description 25
- 108090000790 Enzymes Proteins 0.000 claims description 25
- 229920001184 polypeptide Polymers 0.000 claims description 18
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 18
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 18
- 241000588724 Escherichia coli Species 0.000 claims description 14
- 230000014509 gene expression Effects 0.000 claims description 13
- 108091033319 polynucleotide Proteins 0.000 claims description 11
- 102000040430 polynucleotide Human genes 0.000 claims description 11
- 239000002157 polynucleotide Substances 0.000 claims description 11
- 230000004927 fusion Effects 0.000 claims description 7
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 17
- JVXZRQGOGOXCEC-UHFFFAOYSA-N scutellarein Chemical compound C1=CC(O)=CC=C1C1=CC(=O)C2=C(O)C(O)=C(O)C=C2O1 JVXZRQGOGOXCEC-UHFFFAOYSA-N 0.000 abstract description 114
- 230000008569 process Effects 0.000 abstract description 5
- 230000037353 metabolic pathway Effects 0.000 abstract description 2
- 238000010353 genetic engineering Methods 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 35
- 102000004169 proteins and genes Human genes 0.000 description 35
- 239000013612 plasmid Substances 0.000 description 24
- 210000004027 cell Anatomy 0.000 description 23
- 239000000047 product Substances 0.000 description 22
- XADJWCRESPGUTB-UHFFFAOYSA-N apigenin Natural products C1=CC(O)=CC=C1C1=CC(=O)C2=CC(O)=C(O)C=C2O1 XADJWCRESPGUTB-UHFFFAOYSA-N 0.000 description 18
- 235000008714 apigenin Nutrition 0.000 description 18
- KZNIFHPLKGYRTM-UHFFFAOYSA-N apigenin Chemical compound C1=CC(O)=CC=C1C1=CC(=O)C2=C(O)C=C(O)C=C2O1 KZNIFHPLKGYRTM-UHFFFAOYSA-N 0.000 description 18
- 229940117893 apigenin Drugs 0.000 description 18
- 239000012634 fragment Substances 0.000 description 14
- 238000003752 polymerase chain reaction Methods 0.000 description 14
- 238000003786 synthesis reaction Methods 0.000 description 14
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 13
- 230000015572 biosynthetic process Effects 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 12
- 108010013835 arginine glutamate Proteins 0.000 description 11
- 239000007788 liquid Substances 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- 240000004534 Scutellaria baicalensis Species 0.000 description 10
- 238000010367 cloning Methods 0.000 description 10
- 235000017089 Scutellaria baicalensis Nutrition 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 108010009298 lysylglutamic acid Proteins 0.000 description 9
- 239000012071 phase Substances 0.000 description 9
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 8
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 238000004128 high performance liquid chromatography Methods 0.000 description 8
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 7
- 101150023103 CHI gene Proteins 0.000 description 7
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 108020001507 fusion proteins Proteins 0.000 description 7
- 102000037865 fusion proteins Human genes 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 7
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 7
- 108010016192 4-coumarate-CoA ligase Proteins 0.000 description 6
- 101710171922 Cheilanthifoline synthase Proteins 0.000 description 6
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- 102100033472 Lysosomal-trafficking regulator Human genes 0.000 description 6
- 101710178358 Peptidoglycan-associated lipoprotein Proteins 0.000 description 6
- 101710189920 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Proteins 0.000 description 6
- 241000208317 Petroselinum Species 0.000 description 6
- 102100029989 SHC SH2 domain-binding protein 1 Human genes 0.000 description 6
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 6
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 6
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 150000007523 nucleic acids Chemical group 0.000 description 6
- 235000011197 perejil Nutrition 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 241000219195 Arabidopsis thaliana Species 0.000 description 5
- 101000841399 Arabidopsis thaliana ERBB-3 BINDING PROTEIN 1 Proteins 0.000 description 5
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 5
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 5
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 240000007377 Petunia x hybrida Species 0.000 description 5
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 5
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 238000000855 fermentation Methods 0.000 description 5
- 230000004151 fermentation Effects 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 4
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 4
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 4
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 4
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 4
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 4
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 4
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 4
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 4
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 4
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 4
- 241001013934 Erigeron breviscapus Species 0.000 description 4
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 4
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 4
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 4
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 4
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 4
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 4
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 4
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 4
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 4
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 4
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 4
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 4
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 4
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 4
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 4
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 4
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 4
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 4
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 4
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 4
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 4
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 4
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 4
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 4
- QFGVDCBPDGLVTA-SZMVWBNQSA-N Lys-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 QFGVDCBPDGLVTA-SZMVWBNQSA-N 0.000 description 4
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 4
- WVTYEEPGEUSFGQ-LPEHRKFASA-N Met-Cys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WVTYEEPGEUSFGQ-LPEHRKFASA-N 0.000 description 4
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 4
- 108010021466 Mutant Proteins Proteins 0.000 description 4
- 102000008300 Mutant Proteins Human genes 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 101100391071 Petroselinum crispum FNSI gene Proteins 0.000 description 4
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 4
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 4
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 4
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 4
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 4
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 4
- 241000221523 Rhodotorula toruloides Species 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 4
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 4
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 4
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 4
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 4
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 4
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 4
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 4
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 4
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 4
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 4
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 4
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 3
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 3
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 3
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 3
- 101100478890 Caenorhabditis elegans smo-1 gene Proteins 0.000 description 3
- 108010004539 Chalcone isomerase Proteins 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 3
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 3
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 3
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 3
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 3
- 241000219823 Medicago Species 0.000 description 3
- 240000004658 Medicago sativa Species 0.000 description 3
- 235000010624 Medicago sativa Nutrition 0.000 description 3
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 3
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 3
- 108700023158 Phenylalanine ammonia-lyases Proteins 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 108010060641 flavanone synthetase Proteins 0.000 description 3
- 108090001036 flavone synthase I Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 238000001819 mass spectrum Methods 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 239000003960 organic solvent Substances 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 2
- NGSWKAQJJWESNS-UHFFFAOYSA-N 4-coumaric acid Chemical compound OC(=O)C=CC1=CC=C(O)C=C1 NGSWKAQJJWESNS-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YHOPXCAOTRUGLV-XAMCCFCMSA-N Ala-Ala-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YHOPXCAOTRUGLV-XAMCCFCMSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- XIWKVCDQMCNKOZ-UVBJJODRSA-N Ala-Met-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XIWKVCDQMCNKOZ-UVBJJODRSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 2
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 2
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 description 2
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 108010075254 C-Peptide Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 102000005870 Coenzyme A Ligases Human genes 0.000 description 2
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 2
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 2
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 2
- BSGXXYRIDXUEOM-IHRRRGAJSA-N Cys-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N BSGXXYRIDXUEOM-IHRRRGAJSA-N 0.000 description 2
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 2
- 108010020076 Cytochrome P-450 CYP2B1 Proteins 0.000 description 2
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 2
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 2
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 2
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 2
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 2
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 2
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 2
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 2
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- 108010011449 Long-chain-fatty-acid-CoA ligase Proteins 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 2
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- 239000007993 MOPS buffer Substances 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 2
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 2
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 2
- WRLYTJVPSUBYST-AVGNSLFASA-N Met-His-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N WRLYTJVPSUBYST-AVGNSLFASA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 2
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 2
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 2
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 2
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 2
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 2
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 2
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 2
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 2
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 2
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 2
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 229930003935 flavonoid Natural products 0.000 description 2
- 150000002215 flavonoids Chemical class 0.000 description 2
- 235000017173 flavonoids Nutrition 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- NGSWKAQJJWESNS-ZZXKWVIFSA-N trans-4-coumaric acid Chemical compound OC(=O)\C=C\C1=CC=C(O)C=C1 NGSWKAQJJWESNS-ZZXKWVIFSA-N 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- WBYWAXJHAXSJNI-VOTSOKGWSA-M .beta-Phenylacrylic acid Natural products [O-]C(=O)\C=C\C1=CC=CC=C1 WBYWAXJHAXSJNI-VOTSOKGWSA-M 0.000 description 1
- HNSDLXPSAYFUHK-UHFFFAOYSA-N 1,4-bis(2-ethylhexyl) sulfosuccinate Chemical compound CCCCC(CC)COC(=O)CC(S(O)(=O)=O)C(=O)OCC(CC)CCCC HNSDLXPSAYFUHK-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 229940093681 4-coumaric acid Drugs 0.000 description 1
- 101150029435 4CL gene Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 244000056139 Brassica cretica Species 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- 101150058917 CHS gene Proteins 0.000 description 1
- WBYWAXJHAXSJNI-SREVYHEPSA-N Cinnamic acid Chemical compound OC(=O)\C=C/C1=CC=CC=C1 WBYWAXJHAXSJNI-SREVYHEPSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- 206010019695 Hepatic neoplasm Diseases 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 101150017040 I gene Proteins 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 240000001549 Ipomoea eriocarpa Species 0.000 description 1
- 235000005146 Ipomoea eriocarpa Nutrition 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000207923 Lamiaceae Species 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- KVNLHIXLLZBAFQ-RWMBFGLXSA-N Lys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N KVNLHIXLLZBAFQ-RWMBFGLXSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 238000010806 PrimeScriptTM RT Reagent kit Methods 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- 239000008923 Qingkailing Substances 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 244000042430 Rhodiola rosea Species 0.000 description 1
- 235000003713 Rhodiola rosea Nutrition 0.000 description 1
- 241000915604 Scutellaria barbata Species 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 1
- 239000002551 biofuel Substances 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229930016911 cinnamic acid Natural products 0.000 description 1
- 235000013985 cinnamic acid Nutrition 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 108010081447 cytochrophin-4 Proteins 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000012847 fine chemical Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- HQVFCQRVQFYGRJ-UHFFFAOYSA-N formic acid;hydrate Chemical compound O.OC=O HQVFCQRVQFYGRJ-UHFFFAOYSA-N 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- WBYWAXJHAXSJNI-UHFFFAOYSA-N methyl p-hydroxycinnamate Natural products OC(=O)C=CC1=CC=CC=C1 WBYWAXJHAXSJNI-UHFFFAOYSA-N 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 229930195143 oxyphenol Natural products 0.000 description 1
- 101150077062 pal gene Proteins 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 230000008654 plant damage Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000002390 rotary evaporation Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 239000009254 shuang-huang-lian Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000000870 ultraviolet spectroscopy Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000009916 yin-huang Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/75—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0012—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
- C12N9/0036—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6)
- C12N9/0038—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
- C12N9/0042—NADPH-cytochrome P450 reductase (1.6.2.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0073—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen 1.14.13
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/06—Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/14—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen (1.14.14)
- C12Y114/14001—Unspecific monooxygenase (1.14.14.1)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/10—Fusion polypeptide containing a localisation/targetting motif containing a tag for extracellular membrane crossing, e.g. TAT or VP22
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/24—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a MBP (maltose binding protein)-tag
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y106/00—Oxidoreductases acting on NADH or NADPH (1.6)
- C12Y106/02—Oxidoreductases acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
- C12Y106/02004—NADPH-hemoprotein reductase (1.6.2.4), i.e. NADP-cytochrome P450-reductase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/14—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen (1.14.14)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Mycology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明涉及合成黄芩素和野黄芩素的微生物、其制备方法及其应用。本发明人通过基因工程的方法对宿主细胞的异源代谢途径进行改造,获得了高产黄芩素和野黄芩素的工程菌株。本发明还提供了利用所述工程菌株生产黄芩素和野黄芩素的工艺。The invention relates to a microorganism for synthesizing baicalein and scutellarein, a preparation method and application thereof. The present inventors modified the heterologous metabolic pathway of the host cell by means of genetic engineering, and obtained an engineering strain with high production of baicalein and scutellarein. The invention also provides a process for producing scutellarein and scutellarein by using the engineering strain.
Description
技术领域technical field
本发明涉及合成生物学及医药技术领域,具体地,本发明涉及合成黄芩素和野黄芩素的微生物、其制备方法及其应用。The invention relates to the technical fields of synthetic biology and medicine, in particular, the invention relates to microorganisms for synthesizing scutellarein and scutellarein, a preparation method and application thereof.
背景技术Background technique
黄芩(Scutcllaria Baicalensis Georgi)是中国著名的传统药物,为唇形科植物;中药黄芩是植物黄芩干燥的根,药用历史悠久,可用于风热湿热等多种疾病的治疗。灯盏花是菊科植物短葶(Erigeron breviscapus)的干燥全草,性寒味苦,具有消炎止痛、活血化瘀和祛风除湿等功效。黄芩和灯盏花提取物早已被广泛应用于中药制剂中,银黄片、双黄莲口服液和兰岑口服液等的主要原料都是黄芩提取物,中药清开灵的主要有效成分就是黄芩素,它具有消炎,防治腹泻、肝病和肿瘤的作用;常见的灯盏花剂型有灯盏花素片和灯盏花素口服液等,它们在生物体内的主要代谢吸收形式是其苷元野黄芩素。因此,黄芩素和野黄芩素均具有一定的新药开发价值。Scutellaria Baicalensis Georgi is a well-known traditional medicine in China. It is a plant of the family Labiatae. The traditional Chinese medicine Scutellaria Baicalensis is the dried root of the plant Scutellaria baicalensis. It has a long history of medicinal use and can be used for the treatment of wind-heat, damp-heat and other diseases. Erigeron breviscapus is the dried whole herb of the Compositae plant Erigeron breviscapus. It is cold in nature and bitter in taste. Scutellaria baicalensis and Erigeron breviscapus extracts have long been widely used in traditional Chinese medicine preparations. The main raw materials of Yinhuang Tablets, Shuanghuanglian Oral Liquid and Lancen Oral Liquid are all Scutellaria baicalensis extracts. The main active ingredient of traditional Chinese medicine Qingkailing is baicalein. It has the functions of anti-inflammation, prevention and treatment of diarrhea, liver disease and tumors; common breviscapine formulations include breviscapine tablets and breviscapine oral liquid, etc., and their main form of metabolism and absorption in the organism is its aglycone scutellarein. Therefore, both baicalein and scutellarein have certain new drug development value.
黄芩素和野黄芩素是二个结构相似且重要的黄酮类化合物。黄芩素的分子式为C15H10O5,分子量为270.24,而野黄芩素的分子量为C15H10O6,分子量为286.24。它们的结构如图1所示。Baicalein and scutellarein are two structurally similar and important flavonoids. The molecular formula of baicalein is C 15 H 10 O 5 and the molecular weight is 270.24, while the molecular weight of scutellarein is C 15 H 10 O 6 and the molecular weight is 286.24. Their structures are shown in Figure 1.
与大多数植物源天然产物一样,目前黄芩素和野黄芩素的主要通过化学合成和有机溶剂萃取两种方法制备。有机溶剂萃取主要是对黄芩、灯盏花、半枝莲等药用植物进行组织抽提,在此过程中需要大量的有机溶剂,还存在随后的分离工艺繁琐,工业化造价高等问题。最主要的是该方法受到植物生长缓慢、对药用资源的破坏等难题。尽管通过化学合成也可以大量获得黄芩素和野黄芩素,但是原料会涉及肉桂酸或其衍生物、氧代苯酚等化工物质,一定程度限制了其在药物、食品领域的应用。而且在合成过程中还涉及到有毒试剂及昂贵的化学催化剂的使用等问题。Like most natural products of plant origin, baicalein and scutellarein are mainly prepared by chemical synthesis and organic solvent extraction. Organic solvent extraction is mainly to extract the tissues of medicinal plants such as Scutellaria baicalensis, Erigeron breviscapus, and Scutellaria barbata. In this process, a large amount of organic solvents are required, and there are also problems such as cumbersome separation processes and high industrialization costs. The most important thing is that this method is subject to difficult problems such as slow plant growth and damage to medicinal resources. Although baicalein and scutellarein can also be obtained in large quantities through chemical synthesis, the raw materials involve chemical substances such as cinnamic acid or its derivatives, oxyphenol, etc., which limits its application in the fields of medicine and food to a certain extent. Moreover, problems such as the use of toxic reagents and expensive chemical catalysts are also involved in the synthesis process.
合成生物学是基于理性的设计,将标准化的生物元件进行集成与装配,从而构建性能优良的人造生命系统。合成生物学一经诞生,它的思想和设计就深刻地影响着工业微生物技术的发展,使微生物技术在药物、生物燃料、精细化学品的开发与生产过程中发挥出更加巨大的作用。Synthetic biology is based on rational design, integrating and assembling standardized biological components to construct artificial life systems with excellent performance. Once synthetic biology was born, its ideas and designs have profoundly influenced the development of industrial microbial technology, enabling microbial technology to play a greater role in the development and production of drugs, biofuels, and fine chemicals.
在本领域中,多种天然产物的合成元件经过组装后实现了在微生物中的异源合成。但是本发明中涉及到的黄芩素和野黄芩素这两种活性黄酮类化合物,还尚未见在微生物中成功地异源合成的报道。因此,迫切需要构建一株能够异源合成黄芩素和野黄芩素的微生物菌株。In the art, synthetic elements of various natural products have been assembled to enable heterologous synthesis in microorganisms. However, the two active flavonoids, baicalein and scutellarein involved in the present invention, have not yet been reported to be successfully heterologously synthesized in microorganisms. Therefore, it is urgent to construct a microbial strain capable of heterologously synthesizing baicalein and scutellarein.
发明内容Contents of the invention
本发明的目的在于提供合成黄芩素和野黄芩素的微生物、其制备方法及其应用。The object of the present invention is to provide microorganisms for synthesizing baicalein and scutellarein, their preparation method and application.
在本发明的第一方面,提供一种生产黄芩素和野黄芩素的方法,包括:(1)在宿主细胞中引入表达黄酮6-羟化酶(F6H)以及细胞色素P450氧化还原酶(CPR)的基因,以及白杨素或芹菜素合成基因;和(2)在含有苯丙氨酸和/或酪氨酸的培养体系中培养该宿主细胞,从而生产黄芩素或野黄芩素。In a first aspect of the present invention, a method for producing baicalein and scutellarein is provided, comprising: (1) introducing and expressing flavone 6-hydroxylase (F6H) and cytochrome P450 oxidoreductase (CPR) in host cells ) gene, and chrysin or apigenin synthesis gene; and (2) cultivating the host cell in a culture system containing phenylalanine and/or tyrosine, thereby producing baicalein or scutellarein.
在一个优选例中,所述的白杨素或芹菜素合成基因包括:表达苯丙氨酸解氨酶(phenylalanine ammonia-lyase,PAL)、4-香豆酸辅酶A连接酶(4-coumarate:CoA ligase,4CL)、查尔酮合成酶(chalcone synthase,CHS)、查尔酮异构酶(chalcone isomerase,CHI)和黄酮合成酶I(flavone synthase I,FNSI)的基因;较佳地,在被引入宿主细胞时,所述的表达苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I的基因存在于同一表达载体中。In a preferred example, the chrysin or apigenin synthesis gene includes: expression of phenylalanine ammonia-lyase (phenylalanine ammonia-lyase, PAL), 4-coumarate coenzyme A ligase (4-coumarate: CoA ligase, 4CL), chalcone synthase (chalcone synthase, CHS), chalcone isomerase (chalcone isomerase, CHI) and flavone synthase I (flavone synthase I, FNSI) genes; When introduced into host cells, the genes expressing phenylalanine ammonia lyase, 4-coumaric acid coenzyme A ligase, chalcone synthase, chalcone isomerase and flavone synthase I exist in the same expression in the carrier.
在另一优选例中,所述的黄酮6-羟化酶来源于黄芩(Scutcllaria baicalensis),也包括其同源物(来自其它物种的同源基因或多肽);所述的CPR来源于拟南芥(Arabidopsis thaliana),也包括其同源物。In another preferred example, the flavone 6-hydroxylase is derived from Scutellaria baicalensis, including its homologues (homologous genes or polypeptides from other species); the CPR is derived from Arabidopsis Mustard (Arabidopsis thaliana), also including its homologues.
在另一优选例中,所述的PAL来源于红景天(Rhodotorula toruloides),也包括其同源物;所述的4CL来源于欧芹(Petroselium crispum),也包括其同源物;所述的CHS来源于矮牵牛(Petunia X hybrida),也包括其同源物;所述的CHI来源于苜蓿(Medicagosativa),也包括其同源物;所述的FNS I来源于欧芹(Petroselium crispum),也包括其同源物。In another preferred example, the PAL is derived from Rhodotorula toruloides (Rhodotorula toruloides), also including its homologues; the 4CL is derived from parsley (Petroselium crispum), also including its homologues; the The CHS derived from petunia (Petunia X hybrida) also includes its homologue; the CHI derived from alfalfa (Medicagosativa) also includes its homologue; the FNS I derived from parsley (Petroselium crispum ), including its homologues.
在本发明的另一方面,提供一种生产黄芩素和野黄芩素的方法,包括:(1)在宿主细胞中引入表达黄酮6-羟化酶(F6H)以及细胞色素P450氧化还原酶(CPR)的基因,获得重组的宿主细胞;和(2)在含有白杨素或芹菜素的培养体系中培养该重组的宿主细胞,从而生产黄芩素或野黄芩素。In another aspect of the present invention, there is provided a method for producing baicalein and scutellarein, comprising: (1) introducing and expressing flavone 6-hydroxylase (F6H) and cytochrome P450 oxidoreductase (CPR) in host cells ) to obtain a recombinant host cell; and (2) cultivating the recombinant host cell in a culture system containing chrysin or apigenin, thereby producing baicalein or scutellarein.
在本发明的另一方面,提供一种将白杨素或芹菜素转化为黄芩素或野黄芩素的方法:以黄酮6-羟化酶以及细胞色素P450氧化还原酶催化白杨素或芹菜素,从而在白杨素或芹菜素的结构中加上一个羟基,形成黄芩素或野黄芩素。In another aspect of the present invention, there is provided a method for converting chrysin or apigenin into baicalein or scutellarein: catalyzing chrysin or apigenin with flavone 6-hydroxylase and cytochrome P450 oxidoreductase, thereby Add a hydroxyl group to the structure of chrysin or apigenin to form baicalein or scutellarein.
在一个优选例中,所述的黄酮6-羟化酶(F6H)是截去N端第(1-10)~(20-30)位氨基酸的突变型黄酮6-羟化酶;较佳地,是截去N端第(2-5)~(22-28)位氨基酸的突变型黄酮6-羟化酶。In a preferred example, the flavone 6-hydroxylase (F6H) is a mutant flavone 6-hydroxylase that truncates amino acids at positions (1-10) to (20-30) of the N-terminus; preferably , is a mutant flavone 6-hydroxylase that truncated amino acids at positions (2-5)-(22-28) of the N-terminus.
在另一优选例中,所述的黄酮6-羟化酶与多肽标签融合,所述的多肽标签选自:小牛血清17羟基化酶N端8氨基酸多肽(8RP),小分子泛素修饰相关蛋白(Sumo),麦芽糖结合蛋白(MBP),细胞色素P450 2B1家族可溶性蛋白(2B1),或其组合;较佳地为麦芽糖结合蛋白或细胞色素P450 2B1家族可溶性蛋白,或其组合;较佳地所述多肽标签位于N端。In another preferred example, the flavone 6-hydroxylase is fused with a polypeptide tag, and the polypeptide tag is selected from: calf serum 17 hydroxylase N-terminal 8 amino acid polypeptide (8RP), small molecule ubiquitin modified Related protein (Sumo), maltose binding protein (MBP), cytochrome P450 2B1 family soluble protein (2B1), or combination thereof; preferably maltose binding protein or cytochrome P450 2B1 family soluble protein, or combination thereof; preferably The polypeptide tag is located at the N-terminus.
在另一优选例中,所述的细胞色素P450氧化还原酶(CPR)的是截去N端第(1-20)~(60-85)位氨基酸的突变型细胞色素P450氧化还原酶;较佳地,是截去N端第(2-10)~(65-80)位氨基酸的突变型细胞色素P450氧化还原酶;更佳地,是截去N端第(2-5)~(70-75)位氨基酸的突变型细胞色素P450氧化还原酶。In another preferred example, the cytochrome P450 oxidoreductase (CPR) is a mutant cytochrome P450 oxidoreductase that truncated amino acids at positions (1-20)-(60-85) of the N-terminal; Preferably, it is a mutant cytochrome P450 oxidoreductase that truncates (2-10)-(65-80) amino acids at the N-terminus; more preferably, it is a truncated N-terminus (2-5)-(70 -75) amino acid mutant cytochrome P450 oxidoreductase.
在另一优选例中,所述的宿主细胞包括:原核细胞或真核细胞;较佳地,所述的原核细胞包括:大肠杆菌细胞,枯草芽孢杆菌细胞;所述的真核细胞包括:酵母细胞。In another preferred example, the host cells include: prokaryotic cells or eukaryotic cells; preferably, the prokaryotic cells include: Escherichia coli cells, Bacillus subtilis cells; the eukaryotic cells include: yeast cell.
在本发明的另一方面,提供一种重组宿主细胞,其中包括外源的表达黄酮6-羟化酶以及细胞色素P450氧化还原酶的基因。In another aspect of the present invention, a recombinant host cell is provided, which includes exogenous genes expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase.
在另一优选例中,所述的重组宿主细胞中还包括外源的白杨素或芹菜素合成基因。In another preferred example, the recombinant host cell further includes an exogenous chrysin or apigenin synthesis gene.
在另一优选例中,所述的多肽标签为单拷贝或2~10拷贝(如3、4、5、6、8拷贝)串联的序列结构。In another preferred example, the polypeptide tag is a single copy or a sequence structure of 2-10 copies (such as 3, 4, 5, 6, 8 copies) in series.
在本发明的另一方面,提供前面任一所述的重组宿主细胞的用途,用于生产黄芩素和野黄芩素。In another aspect of the present invention, the use of any one of the aforementioned recombinant host cells is provided for producing scutellarein and scutellarein.
在一个优选例中,对于细胞内不存在白杨素或芹菜素合成基因的菌株,用于以外源添加的白杨素或芹菜素为底物生产黄芩素和野黄芩素;对于细胞内存在白杨素或芹菜素合成基因的菌株,用于在外源添加苯丙氨酸和/或酪氨酸的培养条件下生产黄芩素和野黄芩素。In a preferred example, for the bacterial strain that does not have chrysin or apigenin synthesis gene in the cell, it is used to produce baicalein and scutellarein with chrysin or apigenin added by exogenous sources; for the presence of chrysin or apigenin in the cell A strain with an apigenin synthesis gene for producing baicalein and scutellarein under the culture conditions of exogenously added phenylalanine and/or tyrosine.
在本发明的另一方面,提供一种制备用于生产黄芩素和野黄芩素的宿主细胞的方法,包括:在宿主细胞中引入表达黄酮6-羟化酶以及细胞色素P450氧化还原酶的基因,获得重组菌株;较佳地,还包括引入白杨素或芹菜素合成基因。In another aspect of the present invention, there is provided a method for preparing host cells for producing baicalein and scutellarein, comprising: introducing genes expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase into the host cells , to obtain a recombinant strain; preferably, it also includes introducing a chrysin or apigenin synthesis gene.
在本发明的另一方面,提供一种用于生产黄芩素和野黄芩素的试剂盒所述试剂盒中包括前面任一所述的重组的宿主细胞。In another aspect of the present invention, there is provided a kit for producing scutellarein and scutellarein. The kit includes any of the aforementioned recombinant host cells.
在另一优选例中,所述的试剂盒中还包括:宿主细胞培养基,使用说明书等。In another preferred example, the kit further includes: host cell culture medium, instructions for use, and the like.
在本发明的另一方面,提供突变型黄酮6-羟化酶,其对应于野生型黄酮6-羟化酶(F6H),截去N端第(1-10)~(20-30)位氨基酸;较佳地,截去N端第(2-5)~(22-28)位氨基酸;较佳地,其具有SEQ ID NO:2所示的氨基酸序列。In another aspect of the present invention, mutant flavone 6-hydroxylase is provided, which corresponds to wild-type flavone 6-hydroxylase (F6H), truncating the N-terminal (1-10)-(20-30) Amino acid; preferably, amino acids (2-5)-(22-28) of the N-terminal are truncated; preferably, it has the amino acid sequence shown in SEQ ID NO:2.
在本发明的另一方面,提供突变型细胞色素P450氧化还原酶,其对应于野生型细胞色素P450氧化还原酶,截去N端第(1-20)~(60-85)位氨基酸;较佳地,截去N端第(2-10)~(65-80)位氨基酸;更佳地,截去N端第(2-5)~(70-75)位氨基酸;较佳地,其具有SEQ IDNO:8所示的氨基酸序列。In another aspect of the present invention, a mutant cytochrome P450 oxidoreductase is provided, which corresponds to the wild-type cytochrome P450 oxidoreductase, and amino acids at positions (1-20) to (60-85) of the N-terminal are truncated; Preferably, amino acids at positions (2-10) to (65-80) of the N-terminus are truncated; more preferably, amino acids at positions (2-5) to (70-75) of the N-terminus are truncated; preferably, the It has the amino acid sequence shown in SEQ ID NO:8.
在本发明的另一方面,提供融合多肽,其包括前面所述的任一突变型黄酮6-羟化酶,以及与之融合的多肽标签,所述的多肽标签选自:8RP,Sumo,MBP,2B1;较佳地为MBP或2B1。In another aspect of the present invention, a fusion polypeptide is provided, which includes any of the aforementioned mutant flavone 6-hydroxylases, and a polypeptide tag fused thereto, and the polypeptide tag is selected from: 8RP, Sumo, MBP , 2B1; preferably MBP or 2B1.
在一个优选例中,所述的融合多肽具有选自下组的氨基酸序列:SEQ ID NO:3、SEQID NO:4、SEQ ID NO:5或SEQ ID NO:6。In a preferred example, the fusion polypeptide has an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6.
在本发明的另一方面,提供多核苷酸,其编码:述的突变型黄酮6-羟化酶;或所述的突变型细胞色素P450氧化还原酶;或所述的融合多肽。In another aspect of the present invention, a polynucleotide is provided, which encodes: the mutant flavone 6-hydroxylase; or the mutant cytochrome P450 oxidoreductase; or the fusion polypeptide.
在本发明的另一方面,提供一种表达构建物,其包括:前面所述的任一多核苷酸;或编码前面所述的任一突变型黄酮6-羟化酶或前面所述的融合蛋白的多核苷酸,以及编码前面所述的突变型细胞色素P450氧化还原酶的多核苷酸。In another aspect of the present invention, an expression construct is provided, which includes: any polynucleotide described above; or encodes any mutant flavone 6-hydroxylase described above or the above-mentioned The polynucleotide of the fusion protein, and the polynucleotide encoding the aforementioned mutant cytochrome P450 oxidoreductase.
在另一优选例中,所述的表达构建物中,还包括与上述的多核苷酸操作性连接的启动子和终止子。In another preferred example, the expression construct further includes a promoter and a terminator operably linked to the above-mentioned polynucleotide.
在本发明的另一方面,提供所述的突变型黄酮6-羟化酶或所述的融合蛋白,以及突变型细胞色素P450氧化还原酶的用途,用于在白杨素或芹菜素的结构中加上一个羟基,形成黄芩素或野黄芩素。In another aspect of the present invention, there is provided the use of the mutant flavone 6-hydroxylase or the fusion protein, and the mutant cytochrome P450 oxidoreductase, which are used in the structure of chrysin or apigenin Add a hydroxyl group to form baicalein or scutellarein.
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。Other aspects of the invention will be apparent to those skilled in the art from the disclosure herein.
附图说明Description of drawings
图1、黄芩素和野黄芩素的结构式。Figure 1. Structural formulas of baicalein and scutellarein.
图2、黄芩素和野黄芩素的生物合成路径。Figure 2. Biosynthetic pathways of baicalein and scutellarein.
图3、质粒pYH66的构建示意图。Fig. 3. Schematic diagram of the construction of plasmid pYH66.
图4、质粒pYH57的构建示意图。Fig. 4. Schematic diagram of the construction of plasmid pYH57.
图5、工程菌株BL21(DE3)-pYH57-pYH66和标准品黄芩素的HPLC检测图谱。其中,i表示BL21(DE3)-pETDuet-1-pCDFDuet-1发酵液作为空白对照;ii表示BL21(DE3)-pYH57-pYH66添加苯丙氨酸的发酵液;iii表示黄芩素标准品。Fig. 5. HPLC detection patterns of engineering strain BL21(DE3)-pYH57-pYH66 and standard product baicalein. Among them, i represents BL21(DE3)-pETDuet-1-pCDFDuet-1 fermentation broth as a blank control; ii represents BL21(DE3)-pYH57-pYH66 fermentation broth added with phenylalanine; iii represents baicalein standard.
图6、工程菌株BL21(DE3)-pYH57-pYH66产黄芩素的质谱图谱。Fig. 6. The mass spectrogram of baicalein produced by engineering strain BL21(DE3)-pYH57-pYH66.
图7、工程菌株BL21(DE3)-pYH57-pYH66和标准品野黄芩素的HPLC检测图谱。其中,i表示BL21(DE3)-pETDuet-1-pCDFDuet-1发酵液作为空白对照;ii表示BL21(DE3)-pYH57-pYH66添加酪氨酸的发酵液;iii表示野黄芩素标准品。Fig. 7. HPLC detection pattern of engineering strain BL21(DE3)-pYH57-pYH66 and standard product scutellarein. Among them, i represents BL21(DE3)-pETDuet-1-pCDFDuet-1 fermentation liquid as a blank control; ii represents tyrosine-added fermentation liquid of BL21(DE3)-pYH57-pYH66; iii represents scutellarein standard.
图8、工程菌株BL21(DE3)-pYH57-pYH66产野黄芩素的质谱图谱。Fig. 8. The mass spectrum of scutellarein produced by engineering strain BL21(DE3)-pYH57-pYH66.
图9、SbF6H和AtCPR突变体催化白杨素生成黄芩素。Figure 9. SbF6H and AtCPR mutants catalyze the production of baicalein from chrysin.
A,所构建的质粒中关键元件的示意图;A, schematic diagram of the key elements in the constructed plasmid;
B,重组大肠杆菌中,催化白杨素生成黄芩素的转化率;B, in recombinant Escherichia coli, the conversion rate of catalyzing chrysin to generate baicalein;
C,HPLC分析重组大肠杆菌催化反应液。其中,Chr表示白杨素,Bai表示黄芩素。C, HPLC analysis of recombinant Escherichia coli catalytic reaction solution. Among them, Chr means chrysin, and Bai means baicalein.
具体实施方式Detailed ways
本发明人致力于利用微生物中异源合成黄芩素和野黄芩素,以及提高生物生产黄芩素和野黄芩素的产量,经过深入的研究,通过基因工程的方法对宿主细胞的异源代谢途径进行改造,获得了高产黄芩素和野黄芩素的工程菌株。The present inventor is committed to using heterologous synthesis of baicalein and scutellarein in microorganisms, and improving the yield of biological production of baicalein and scutellarein. Transformation, obtained high-yielding baicalein and wild scutellarein engineering strains.
如本文所用,所述的“N端第(1-10)~(20-30)位氨基酸”是指起始于N端起第1-10位中的任一位氨基酸,终止于N端起第20-30位中的任一位氨基酸。As used herein, the "N-terminal (1-10)-(20-30) amino acid" refers to any amino acid starting from the 1-10th position from the N-terminal and ending from the N-terminal Any amino acid in the 20th-30th position.
如本文所用,所述的“N端第(2-5)~(22-28)位”是指起始于N端起第2-5位中的任一位氨基酸,终止于N端起第22-28位中的任一位氨基酸。As used herein, the "N-terminal (2-5)-(22-28)" refers to any amino acid starting from the 2-5th position from the N-terminal and ending at the 2-5th amino acid from the N-terminal Any amino acid in positions 22-28.
如本文所用,所述的“N端第(1-20)~(60-85)位”是指起始于N端起第1-20位中的任一位氨基酸,终止于N端起第60-85位中的任一位氨基酸。As used herein, the "N-terminal (1-20) ~ (60-85) position" refers to any amino acid starting from the 1-20th position from the N-terminal and ending at the 1st-20th amino acid from the N-terminal Any amino acid in positions 60-85.
如本文所用,所述的“N端第(2-10)~(65-80)位”是指起始于N端起第2-10位中的任一位氨基酸,终止于N端起第65-80位中的任一位氨基酸。As used herein, the "N-terminal (2-10) ~ (65-80) position" refers to any amino acid starting from the 2-10th position from the N-terminal and ending at the 2-10th amino acid from the N-terminal Any amino acid in positions 65-80.
如本文所用,所述的“N端第(2-5)~(70-75)位”是指起始于N端起第2-5位中的任一位氨基酸,终止于N端起第70-75位中的任一位氨基酸。As used herein, the "N-terminal (2-5) ~ (70-75) position" refers to any amino acid starting from the 2-5th position from the N-terminal and ending at the 2-5th amino acid from the N-terminal Any amino acid in positions 70-75.
如本文所用,“外源的”或“异源的”是指来自不同来源的两条或多条核酸或蛋白质序列之间的关系。As used herein, "exogenous" or "heterologous" refers to the relationship between two or more nucleic acid or protein sequences from different sources.
如本文所用,所述的“可操作地连接(相连)”或“操作性连接(相连)”是指两个或多个核酸区域或核酸序列的功能性的空间排列。例如:启动子区被置于相对于目的基因核酸序列的特定位置,使得核酸序列的转录受到该启动子区域的引导,从而,启动子区域被“可操作地连接”到该核酸序列上。As used herein, the term "operably linked (linked)" or "operably linked (linked)" refers to the functional spatial arrangement of two or more nucleic acid regions or nucleic acid sequences. For example: the promoter region is placed at a specific position relative to the nucleic acid sequence of the target gene, so that the transcription of the nucleic acid sequence is guided by the promoter region, thus, the promoter region is "operably linked" to the nucleic acid sequence.
如本文所用,所述的“表达构建物”是指重组DNA分子,它包含预期的核酸编码序列,其可以包含一个或多个基因表达盒。所述的“构建物”通常被包含在表达载体中。As used herein, the "expression construct" refers to a recombinant DNA molecule comprising the desired nucleic acid coding sequence, which may comprise one or more gene expression cassettes. Said "construct" is usually contained in an expression vector.
如本文所用,所述的PAL、4CL、CHS、CHI和FNSI蛋白是在表达系统中形成白杨素或芹菜素合成途径的蛋白。As used herein, the PAL, 4CL, CHS, CHI and FNSI proteins are proteins that form the synthesis pathway of chrysin or apigenin in the expression system.
如本文所用,所述的F6H和CPR蛋白是在表达系统中转化白杨素或芹菜素、生成黄芩素或野黄芩素的蛋白。As used herein, the F6H and CPR proteins are proteins that convert chrysin or apigenin into baicalein or scutellarein in an expression system.
野生型的上述蛋白或基因均为本领域已经鉴定的,因此,可以从公众途径获得和制备。作为本发明的优选方式,PAL来源于红景天(Rhodotorula toruloides),其具有GenBank登录号AAA33883.1所示的序列;4CL来源于欧芹(Petroselium crispum),其具有GenBank登录号KF765780.1所示的序列;CHS来源于矮牵牛(Petunia X hybrida),其具有GenBank登录号KF765781.1所示的序列;CHI基因来源于苜蓿(Medicago sativa),其具有GenBank登录号KF765782.1所示的序列;FNS I来源于欧芹(Petroselium crispum),其具有Swiss-Prot登录号Q7XZQ8.1所示的序列。The above-mentioned wild-type proteins or genes have been identified in the art, and therefore, can be obtained and prepared from public sources. As a preferred mode of the present invention, PAL is derived from Rhodiola rosea (Rhodotorula toruloides), which has the sequence shown in GenBank accession number AAA33883.1; 4CL is derived from parsley (Petroselium crispum), which has the sequence shown in GenBank accession number KF765780.1. CHS is derived from petunia (Petunia X hybrida), which has the sequence shown in GenBank accession number KF765781.1; CHI gene is derived from alfalfa (Medicago sativa), which has the sequence shown in GenBank accession number KF765782.1 Sequence; FNS I is derived from parsley (Petroselium crispum), which has the sequence shown in Swiss-Prot accession number Q7XZQ8.1.
野生型的F6H和CPR也是本领域已经鉴定的。作为本发明的优选方式,F6H来源于黄岑(Scutellaria baicalensis),其具有GenBank登录号ASW21050.1所示的序列。作为本发明的优选方式,CPR来自于拟南芥(Arabidopsis thaliana),其具有GenBank登录号NP_849472.2所示的序列。Wild-type F6H and CPR have also been identified in the art. As a preferred embodiment of the present invention, F6H is derived from Scutellaria baicalensis, which has the sequence shown by GenBank accession number ASW21050.1. As a preferred mode of the present invention, CPR comes from Arabidopsis thaliana, which has the sequence shown in GenBank accession number NP_849472.2.
本发明人发现,利用宿主细胞生产黄芩素和野黄芩素的过程中,应用野生型的F6H,仅能产生微量的产物,无法实现规模化生产,因此对多个参与反应的蛋白进行了改造,经过大量筛选分析,获得了一些优选的改造方案,极为显著地提高了微生物,尤其是原核表达系统如大肠杆菌中黄芩素和野黄芩素的产量。The inventors found that in the process of using host cells to produce baicalein and scutellarein, only a small amount of products can be produced by using wild-type F6H, and large-scale production cannot be realized. Therefore, multiple proteins involved in the reaction were modified. After a large number of screening and analysis, some optimal transformation schemes were obtained, which greatly improved the production of microorganisms, especially prokaryotic expression systems such as scutellarein and scutellarein in Escherichia coli.
因此,作为本发明的优选方式,提供了一种突变型F6H,其对应于野生型F6H,截去N端第(1-10)~(20-30)位氨基酸;较佳地,截去N端第(2-5)~(22-28)位氨基酸;更佳地,截去N端第2-25位氨基酸。Therefore, as a preferred mode of the present invention, a mutant F6H is provided, which corresponds to the wild-type F6H, and amino acids at positions (1-10) to (20-30) of the N-terminal are truncated; preferably, the N-terminal truncated (2-5)-(22-28) amino acids at the terminal; more preferably, amino acids 2-25 at the N-terminus are truncated.
作为本发明的优选方式,还提供了一种含有所述F6H或突变型F6H的融合蛋白,其包括F6H或任一突变型F6H,以及与之融合的多肽标签,所述的多肽标签选自:8RP,Sumo,MBP,2B1,或它们的组合;较佳地为MBP或2B1。所述的多肽标签与所述F6H或突变型F6H之间,可以包含或不包含连接肽,所述的连接肽不影响两者的生物学活性。As a preferred mode of the present invention, there is also provided a fusion protein containing the F6H or mutant F6H, which includes F6H or any mutant F6H, and a polypeptide tag fused thereto, and the polypeptide tag is selected from: 8RP, Sumo, MBP, 2B1, or a combination thereof; preferably MBP or 2B1. The polypeptide tag and the F6H or mutant F6H may or may not contain a connecting peptide, and the connecting peptide does not affect the biological activity of the two.
作为本发明的优选方式,提供了一种突变型CPR,其对应于野生型CPR,截去N端第(1-20)~(60-85)位氨基酸;较佳地,截去N端第(2-10)~(65-80)位氨基酸;更佳地,截去N端第(2-5)~(70-75)位氨基酸。As a preferred mode of the present invention, a mutant CPR is provided, which corresponds to the wild-type CPR, and amino acids (1-20) to (60-85) of the N-terminus are truncated; (2-10)-(65-80) amino acids; more preferably, the N-terminal (2-5)-(70-75) amino acids are truncated.
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括它们保留生物活性的片段、衍生物和类似物。它们的蛋白片段、衍生物或类似物可以是若干个(通常为1-50个,更佳地1-20个,还更佳如1-10个、1-5个、1-3个、或1-2个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加或缺失一个或数个(例如100个以内、80个以内、50个以内、20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。例如,在本领域中,用性能相近或相似的氨基酸进行取代时,通常不会改变蛋白质的功能。又比如,在C末端和/或N末端添加或缺失一个或数个氨基酸通常也不会改变蛋白质的功能。但是,对于上述的突变型蛋白,在其进一步的变化形式中,均进行了如上所述的针对N端的截短。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes their fragments, derivatives and analogs that retain biological activity. Their protein fragments, derivatives or analogs can be several (usually 1-50, more preferably 1-20, even more preferably 1-10, 1-5, 1-3, or 1-2) deletion, insertion and/or substitution of amino acids, and addition or deletion of one or several (for example, within 100, within 80, within 50, within 20 at the C-terminal and/or N-terminal, relatively Preferably within 10, more preferably within 5) amino acids. For example, in the art, substitutions with amino acids with similar or similar properties generally do not change the function of the protein. As another example, adding or deleting one or several amino acids at the C-terminus and/or N-terminus usually does not change the function of the protein. However, for the above-mentioned mutant proteins, in their further variant forms, the above-mentioned N-terminal truncation is carried out.
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括它们的类似物。这些类似物与天然蛋白的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些蛋白包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的蛋白并不限于上述例举的代表性的蛋白。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes their analogs. The difference between these analogs and the natural protein may be the difference in amino acid sequence, or the difference in the modified form that does not affect the sequence, or both. These proteins include natural or induced genetic variants. Induced variants can be obtained by various techniques, such as random mutagenesis by radiation or exposure to mutagens, but also by site-directed mutagenesis or other techniques known in molecular biology. Analogs also include analogs with residues other than natural L-amino acids (eg, D-amino acids), and analogs with non-naturally occurring or synthetic amino acids (eg, β, γ-amino acids). It should be understood that the proteins of the present invention are not limited to the representative proteins exemplified above.
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括与所述的蛋白同源性高(比如与所列举的具体蛋白序列的同源性为70%或更高;优选地同源性为80%或更高;更优选地同源性为90%或更高,如同源性95%,98%或99%)的、且具有相应多肽相同功能的蛋白也包括在本发明内。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes proteins with high homology (such as 70% homology with the specific protein sequence listed) % or higher; preferably homology is 80% or higher; more preferably homology is 90% or higher, such as homology 95%, 98% or 99%), and has the same function as the corresponding polypeptide The proteins are also included in the present invention.
本发明中列举了来自特定物种的蛋白或基因。应理解,虽然本发明中优选研究了获自特定物种的蛋白或基因,但是获自其它物种的与所述蛋白或基因高度同源(如具有60%以上,如70%,80%,85%、90%、95%、甚至98%序列相同性)的其它蛋白或基因也在本发明考虑的范围之内。Proteins or genes from specific species are listed in the present invention. It should be understood that although proteins or genes obtained from specific species are preferably studied in the present invention, proteins or genes obtained from other species are highly homologous (such as having more than 60%, such as 70%, 80%, 85% , 90%, 95%, or even 98% sequence identity) other proteins or genes are also within the scope of the present invention.
发明还涉及本发明还提供了编码本发明的蛋白或其保守性变异蛋白的多核苷酸序列。本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码本发明的突变体成熟蛋白的多核苷酸包括:只编码成熟蛋白的编码序列;成熟蛋白的编码序列和各种附加编码序列;成熟蛋白的编码序列(和任选的附加编码序列)以及非编码序列。The invention also relates to the invention also provides a polynucleotide sequence encoding the protein of the invention or its conservative variant protein. A polynucleotide of the invention may be in the form of DNA or RNA. Forms of DNA include cDNA, genomic DNA or synthetic DNA. DNA can be single-stranded or double-stranded. DNA can be either the coding strand or the non-coding strand. The polynucleotide encoding the mutant mature protein of the present invention includes: the coding sequence encoding only the mature protein; the coding sequence of the mature protein and various additional coding sequences; the coding sequence of the mature protein (and optional additional coding sequences) and non- coding sequence.
本发明还包括针对所述基因的序列,进行密码子优化后形成的多核苷酸序列,例如,根据宿主细胞的偏好进行密码子优化。The present invention also includes the polynucleotide sequence formed by codon-optimizing the sequence of the gene, for example, codon-optimizing according to the preference of the host cell.
本发明中,还构建了一种高产黄芩素和野黄芩素的工程菌株,其中包括外源引入的表达F6H(特别是所述突变型F6H或融合蛋白)以及CPR(特别是所述突变型F6H或融合蛋白)的基因。培养该重组菌株,并在培养体系中加入白杨素或芹菜素,从而可生产黄芩素或野黄芩素。In the present invention, a high-yielding baicalein and scutellarein engineering strain is also constructed, which includes exogenously introduced expression F6H (especially the mutant F6H or fusion protein) and CPR (especially the mutant F6H or fusion protein) genes. The recombinant bacterial strain is cultivated, and chrysin or apigenin is added into the culture system, so as to produce baicalein or scutellarein.
本发明中,还构建了另一种高产黄芩素或野黄芩素的工程菌株,其中包括外源引入的表达F6H(特别是所述突变型F6H或融合蛋白)以及CPR(特别是所述突变型F6H或融合蛋白)的基因,以及白杨素或芹菜素合成基因。所述的白杨素或芹菜素合成基因包括:表达PAL、4CL、CHS、CHI和FNSI蛋白的基因。In the present invention, another kind of high-yielding baicalein or scutellarein engineering strain is also constructed, which includes the expression of F6H (especially the mutant F6H or fusion protein) and CPR (especially the mutant F6H or fusion protein) gene, and chrysin or apigenin synthesis gene. The chrysin or apigenin synthesis genes include: genes expressing PAL, 4CL, CHS, CHI and FNSI proteins.
本发明的菌株稳定性好,并可实现在生物反应器中规模性培养及生产黄芩素或野黄芩素。本发明优选的菌株的黄芩素或野黄芩素得率非常高。The bacterial strain of the invention has good stability, and can realize large-scale cultivation and production of baicalein or scutellarein in a bioreactor. The yield of baicalein or scutellarein in the preferred strain of the present invention is very high.
本发明中,通过大肠杆菌生产黄芩素或野黄芩素,实现黄芩素或野黄芩素更经济、更方便的制造。In the present invention, Escherichia coli produces baicalein or scutellarein to realize more economical and convenient manufacture of baicalein or scutellarein.
本发明还提供了用于生产黄芩素或野黄芩素工程菌株的试剂盒。此外,其中还可包括大肠杆菌培养基,黄芩素或野黄芩素分离或检测试剂,使用说明书等。The invention also provides a kit for producing baicalein or scutellarein engineering strains. In addition, it may also include Escherichia coli culture medium, baicalein or scutellarein separation or detection reagents, instructions for use, and the like.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如J.萨姆布鲁克等编著,分子克隆实验指南,第三版,科学出版社,2002中所述的条件,或按照制造厂商所建议的条件。Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods not indicating specific conditions in the following examples are usually according to the conditions described in J. Sambrook et al., edited by J. Sambrook et al., Molecular Cloning Experiment Guide, Third Edition, Science Press, 2002, or according to the conditions described in the manufacturer suggested conditions.
实验材料Experimental Materials
AxyPrep总RNA小量制备试剂盒,多聚酶链式反应(PCR)胶回收试剂盒,质粒抽提试剂盒均为美国Axygen产品;PrimeScript RT reagent Kit with gDNA Eraser(PerfectReal Time)聚合酶试剂盒,聚合酶链式反应(PCR)高保真酶PrimeSTAR Max DNAPolymerase为日本宝生物公司(TAKARA)产品;限制性内切酶均为NEB产品。AxyPrep Total RNA Miniprep Kit, Polymerase Chain Reaction (PCR) Gel Recovery Kit, and Plasmid Extraction Kit are all products of Axygen in the United States; PrimeScript RT reagent Kit with gDNA Eraser (PerfectReal Time) Polymerase Kit, Polymerase The chain reaction (PCR) high-fidelity enzyme PrimeSTAR Max DNAPolymerase is a product of TAKARA; the restriction endonucleases are all NEB products.
大肠杆菌DH10B用于基因克隆,大肠杆菌BL21(DE3)菌株用于蛋白表达和黄芩素、野黄芩素的生产。pET28a、pEDDuet-1、pCDFDuet-1载体用于代谢途径基因装配。Escherichia coli DH10B was used for gene cloning, and Escherichia coli BL21 (DE3) strain was used for protein expression and production of baicalein and wild scutellarein. The pET28a, pEDDuet-1, pCDFDuet-1 vectors were used for metabolic pathway gene assembly.
标准品化合物黄芩素和野黄芩素购自上海源叶生物科技有限公司。其他试剂为国产分析纯或色谱纯试剂,购自国药集团化学试剂有限公司。The standard compounds, baicalein and scutellarein, were purchased from Shanghai Yuanye Biotechnology Co., Ltd. Other reagents were domestic analytically pure or chromatographically pure reagents, which were purchased from Sinopharm Chemical Reagent Co., Ltd.
PCR使用Arktik Thermal Cycler(Thermo Fisher Scientific);恒温培养使用ZXGP-A2050恒温培养箱和ZWY-211G恒温培养振荡器;离心使用5418R高速冷冻式离心机和5418小型离心机(Eppendorf)。真空浓缩使用Concentrator plus浓缩仪(Eppendorf);OD600使用UV-1200紫外可见分光光度计检测(上海美谱达仪器有限公司)。旋转蒸发系统由IKARV 10digital旋转蒸发仪(IKA)和MZ 2C NT化学隔膜泵、CVC3000真空控制器(vacuubrand)组成。高效液相色谱使用Dionex UltiMate 3000液相色谱系统(Thermo FisherScientific)。Arktik Thermal Cycler (Thermo Fisher Scientific) was used for PCR; ZXGP-A2050 constant temperature incubator and ZWY-211G constant temperature culture shaker were used for constant temperature culture; 5418R high-speed refrigerated centrifuge and 5418 small centrifuge (Eppendorf) were used for centrifugation. Concentrator plus concentrator (Eppendorf) was used for vacuum concentration; OD 600 was detected by UV-1200 ultraviolet-visible spectrophotometer (Shanghai Meipuda Instrument Co., Ltd.). The rotary evaporation system consists of IKARV 10digital rotary evaporator (IKA), MZ 2C NT chemical diaphragm pump, and CVC3000 vacuum controller (vacuubrand). For high performance liquid chromatography, a Dionex UltiMate 3000 liquid chromatography system (Thermo Fisher Scientific) was used.
液相检测条件:A相:0.1%甲酸水,B相:乙腈;分离条件:0-20min 20%B相-55%B相,20-22min 55%B相-100%B相,22-27min 100%B相,27-35min100%B相-20%B相,35-40min,20%B相;检测波长:340nm,柱温:30℃。色谱柱:Thermo syncronis C18反相柱(250mm×4.6mm,5μm)。Liquid phase detection conditions: A phase: 0.1% formic acid water, B phase: acetonitrile; separation conditions: 0-20min 20% B phase-55% B phase, 20-22min 55% B phase-100% B phase, 22-27min 100% Phase B, 27-35min100% Phase B-20% Phase B, 35-40min, 20% Phase B; detection wavelength: 340nm, column temperature: 30°C. Chromatographic column: Thermo syncronis C18 reverse phase column (250mm×4.6mm, 5μm).
实施例1、多肽及其序列优化Embodiment 1, polypeptide and its sequence optimization
1、F6H多肽序列的优化改造1. Optimization and transformation of F6H polypeptide sequence
来自黄岑(Scutellaria baicalensis)的F6H(SbF6H)长度为517aa(Genbankaccess no.ASW21050.1),具体序列如下(SEQ ID NO:1):F6H (SbF6H) from Scutellaria baicalensis is 517aa in length (Genbankaccess no.ASW21050.1), and the specific sequence is as follows (SEQ ID NO: 1):
MELSSVIYGAIALLSLFYCYLHFSKPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*M ELSSVIYGAIALLSLFYCYLHFSK PKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGN LSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPY LQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
改造1:本发明人针对SEQ ID NO:1进行序列改造,去除其中第2-25位氨基酸,在N端加上MA两个氨基酸,获得改进的F6H突变体trF6H,具体序列如下(SEQ ID NO:2):Modification 1: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added two amino acids of MA to the N-terminal to obtain an improved F6H mutant trF6H. The specific sequence is as follows (SEQ ID NO :2):
MAMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MA MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSET TKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAF SEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
改造2:本发明人针对SEQ ID NO:1进行序列改造,去除其中第2-25位氨基酸,再在N端加上8RP的氨基酸序列,获得改进的F6H突变体8RPtrF6H,具体序列如下(SEQ ID NO:3):Modification 2: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of 8RP to the N-terminal to obtain an improved F6H mutant 8RPtrF6H, the specific sequence of which is as follows (SEQ ID NO:3):
MALLLAVFMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MALLLAVF MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYG GDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFA GVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
改造3:本发明人针对SEQ ID NO:1进行序列改造,去除其中第2-25位氨基酸,再在N端加上Sumo的氨基酸序列,获得改进的F6H突变体SumotrF6H,具体序列如下(SEQ ID NO:4):Modification 3: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of Sumo to the N-terminal to obtain an improved F6H mutant SumotrF6H, the specific sequence of which is as follows (SEQ ID NO:4):
MADSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYD GIRIQADQTPEDLDMEDNDIIEAHREQIGGMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MADSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYD GIRIQADQTPEDLDMEDNDIIEAHREQIGGMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTK RWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDC TVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
改造4:本发明人针对SEQ ID NO:1进行序列改造,去除其中第2-25位氨基酸,再在N端加上MBP的氨基酸序列,获得改进的F6H突变体MBPtrF6H,具体序列如下(SEQ ID NO:5):Modification 4: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of MBP to the N-terminus to obtain an improved F6H mutant MBPtrF6H. The specific sequence is as follows (SEQ ID NO:5):
MAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDR FGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK GKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFN KGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN KDKPLGAVALKSYEEELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDR FGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK GKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFN KGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN KDKPLGAVALKSYEEELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTK RWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDC TVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
改造5:本发明人针对SEQ ID NO:1进行序列改造,去除其中第2-25位氨基酸,再在N端加上2B1的氨基酸序列,获得改进的F6H突变体2B1trF6H,具体序列如下(SEQ ID NO:6):Modification 5: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of 2B1 to the N-terminus to obtain an improved F6H mutant 2B1trF6H. The specific sequence is as follows (SEQ ID NO:6):
MAKKTSSKGKLPPGPSMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MAKKTSSKGKLPPGPS MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVV GKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMR LYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*
2、CPR的优化改造2. Optimization and transformation of CPR
来自拟南芥(Arabidopsis thaliana)的CPR(AtCPR)序列长度712aa(Genebankaccess no.NP_849472.2),具体如下(SEQ ID NO:7):The length of the CPR (AtCPR) sequence from Arabidopsis thaliana (Arabidopsis thaliana) is 712aa (Genebankaccess no.NP_849472.2), specifically as follows (SEQ ID NO: 7):
MSSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIV MLVWRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW* MSSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIV MLVW *
本发明人针对SEQ ID NO:1进行序列改造,删除其中第2-72位氨基酸,获得改进的AtCPR突变体trAtCPR,具体如下(SEQ ID NO:8):The inventors performed sequence modification on SEQ ID NO: 1, deleted amino acids 2-72, and obtained an improved AtCPR mutant trAtCPR, specifically as follows (SEQ ID NO: 8):
MRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW*MRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDD FTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAG KVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEY VQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW*
实施例2、包含新型F6H突变体的重组质粒的构建Embodiment 2, the construction of the recombinant plasmid that comprises novel F6H mutant
以pETDuet-1为出发质粒,将AtCPR通过一步克隆方法连接到NdeI和XhoI位点获得质粒pYH45。Using pETDuet-1 as the starting plasmid, AtCPR was connected to NdeI and XhoI sites by a one-step cloning method to obtain plasmid pYH45.
以pETDuet-1为出发质粒,将trAtCPR通过一步克隆方法连接到NdeI和XhoI位点获得质粒pYH46。Using pETDuet-1 as the starting plasmid, trAtCPR was connected to NdeI and XhoI sites by a one-step cloning method to obtain plasmid pYH46.
进一步,通过金斯瑞合成密码子优化的F6H的编码基因序列并连接到pUC19载体中,获得pUC19-F6H。以F6H-F/R为引物,以pUC19-F6H为模板进行PCR扩增,体系50μL(PrimeSTAR Max Premix,25μL;双引物终浓度0.2~0.3μM;pUC19-F6H 0.2uL;剩余体积用灭菌蒸馏水补足)。PCR反应条件:98℃预变性2min,然后98℃变性10s,55℃退火15s,72℃延伸20s,25个循环,琼脂糖电泳检测,扩增得到约1.5kb的片段,纯化后用Nco I和BamH I双酶切消化。消化后片段连入相同酶消化过的pYH46,连接产物转化大肠杆菌DH10B感受态细胞,提取质粒经构建时引入酶切位点双酶切验证和基因测序,得到重组质粒pYH59。同样地,将消化后片段连入pYH45,得到重组质粒pYH59。Further, GenScript synthesized the codon-optimized F6H coding gene sequence and ligated it into the pUC19 vector to obtain pUC19-F6H. Use F6H-F/R as the primer and pUC19-F6H as the template for PCR amplification, the system is 50 μL (PrimeSTAR Max Premix, 25 μL; the final concentration of double primers is 0.2-0.3 μM; pUC19-F6H 0.2uL; the remaining volume is sterilized with distilled water make up). PCR reaction conditions: pre-denaturation at 98°C for 2min, then denaturation at 98°C for 10s, annealing at 55°C for 15s, extension at 72°C for 20s, 25 cycles, detected by agarose electrophoresis, amplified to obtain a fragment of about 1.5kb, purified with Nco I and BamH I double enzyme digestion. The digested fragment was ligated into pYH46 digested with the same enzyme, and the ligated product was transformed into Escherichia coli DH10B competent cells. The extracted plasmid was constructed by introducing restriction sites for double digestion verification and gene sequencing to obtain the recombinant plasmid pYH59. Similarly, the digested fragment was ligated into pYH45 to obtain recombinant plasmid pYH59.
以trF6H-F/F6H-R为引物,pUC19-trF6H为模板,通过PCR扩增获得的产物在通过一步克隆方法连接到pYH46中的NdeI和BamH I位点获得质粒pYH58。Using trF6H-F/F6H-R as primers and pUC19-trF6H as a template, the product obtained by PCR amplification was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain plasmid pYH58.
以pUC19-trF6H为模板,以8RP-trF6H-F/F6H-R为引物扩增后,用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中,获得pYH60。Using pUC19-trF6H as a template and 8RP-trF6H-F/F6H-R as primers to amplify, the product was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH60.
以pETSumo(购自Invitrogen公司)为模板,以Sumo-F/Sumo-trF6H-R为引物获得含有Sumo序列的DNA片段,以pUC19-trF6H为模板,Sumo-trF6H-F/F6H-R为引物扩增获得含有trF6HDNA片段。以上述2个DNA片段为模板,以Sumo-F/F6H-R为引物进行融合PCR后,用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中,获得pYH61。Using pETSumo (purchased from Invitrogen) as a template and Sumo-F/Sumo-trF6H-R as primers to obtain a DNA fragment containing the Sumo sequence, using pUC19-trF6H as a template and Sumo-trF6H-F/F6H-R as primers to amplify The HDNA fragment containing trF6 was obtained by increasing. Using the above two DNA fragments as templates and Sumo-F/F6H-R as primers for fusion PCR, the product was ligated into the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH61.
以pMAL-c5x(购自Invitrogen公司)为模板,以MBP-F/MBP-trF6H-R为引物PCR扩增获得含有MBP序列的DNA片段,以pUC19-trF6H为模板,MBP-trF6H-F/F6H-R为引物扩增获得含有trF6H的DNA片段,以上述获得的2个DNA片段为模板,MBP-F/F6H-R为引物进行融合PCR后,用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中,获得pYH62。Using pMAL-c5x (purchased from Invitrogen) as a template, using MBP-F/MBP-trF6H-R as a primer to amplify the DNA fragment containing the MBP sequence, using pUC19-trF6H as a template, MBP-trF6H-F/F6H -R is a primer to amplify the DNA fragment containing trF6H, use the two DNA fragments obtained above as a template, MBP-F/F6H-R as a primer for fusion PCR, and use a one-step cloning method to connect the product to NdeI and pYH46 In the BamHI site, pYH62 was obtained.
以pUC19-trF6H为模板,以2B1-F/F6H-R为引物扩增后,用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中,获得pYH63。Using pUC19-trF6H as a template and 2B1-F/F6H-R as a primer to amplify, the product was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH63.
以trF6H-F/F6H-R为引物,pUC19-trF6H为模板,通过PCR扩增获得的产物再通过一步克隆方法连接到pYH45的NdeI和BamH I位点中,得到重组质粒pYH64。Using trF6H-F/F6H-R as primers and pUC19-trF6H as template, the product amplified by PCR was then ligated into the NdeI and BamH I sites of pYH45 by a one-step cloning method to obtain recombinant plasmid pYH64.
以pMAL-c5x为模板,以MBP-F/MBP-trF6H-R为引物PCR扩增获得含有MBP序列的DNA片段,以pUC19-trF6H为模板,MBP-trF6H-F/F6H-R为引物扩增获得含有trF6H的DNA片段,以上述获得的2个DNA片段为模板,MBP-F/F6H-R为引物进行融合PCR后,用一步克隆方法将产物连接到pYH45的NdeI和BamH I位点中,获得pYH65。Using pMAL-c5x as a template and MBP-F/MBP-trF6H-R as primers to amplify the DNA fragment containing the MBP sequence, using pUC19-trF6H as a template and MBP-trF6H-F/F6H-R as primers to amplify To obtain a DNA fragment containing trF6H, use the two DNA fragments obtained above as templates and MBP-F/F6H-R as primers for fusion PCR, and then use a one-step cloning method to connect the product to the NdeI and BamH I sites of pYH45, pYH65 was obtained.
以pUC19-2B1trF6H为模板,以2B1-F/F6H-R为引物扩增后,用一步克隆方法将产物连接到pYH45的NdeI和BamH I位点中,获得pYH66。质粒pYH66的构建示意图如图3所示。Using pUC19-2B1trF6H as template and 2B1-F/F6H-R as primer to amplify, the product was connected to NdeI and BamH I sites of pYH45 by one-step cloning method to obtain pYH66. The schematic diagram of the construction of plasmid pYH66 is shown in FIG. 3 .
以上构建过程所用的引物如表1。所构建的质粒中关键元件的示意图如图9A。The primers used in the above construction process are listed in Table 1. A schematic diagram of the key elements in the constructed plasmid is shown in Figure 9A.
表1Table 1
实施例3、表达PAL、4CL、CHS、CHI和FNSI的重组质粒的构建Embodiment 3, the construction of the recombinant plasmid expressing PAL, 4CL, CHS, CHI and FNSI
通过金斯瑞合成来源于红景天(Rhodotorula toruloides)的PAL基因、(GenBank登录号AAA33883.1)、来源于欧芹(Petroselium crispum)的4CL基因(GenBank登录号KF765780.1)、来源于矮牵牛(Petunia X hybrida)的CHS基因(GenBank登录号KF765781.1)、来源于苜蓿(Medicago sativa)的CHI基因(GenBank登录号KF765782.1)、来源于欧芹(Petroselium crispum)的FNS I基因(Swiss-Prot登录号Q7XZQ8.1),并构建到pET28a载体上获得pET28-PAL,pET28-4CL,pET28-CHS,pET28a-CHI,pET28a-FNSI。GenScript synthesized the PAL gene from Rhodotorula toruloides (GenBank accession number AAA33883.1), the 4CL gene from parsley (Petroselium crispum) (GenBank accession number KF765780.1), and the CHS gene (GenBank accession number KF765781.1) from morning glory (Petunia X hybrida), CHI gene (GenBank accession number KF765782.1) from alfalfa (Medicago sativa), FNS I gene from parsley (Petroselium crispum) (Swiss-Prot accession number Q7XZQ8.1), and constructed on the pET28a vector to obtain pET28-PAL, pET28-4CL, pET28-CHS, pET28a-CHI, pET28a-FNSI.
合成如表2的引物。以pET28-4CL为模板,4CL-F-NcoI/4CL-R-BamHI为引物获得PCR产物,经与NcoI/BamHI双酶切的pCDFDuet-1连接获得pYH40。Synthesize the primers shown in Table 2. Using pET28-4CL as a template and 4CL-F-NcoI/4CL-R-BamHI as primers to obtain a PCR product, which was ligated with pCDFDuet-1 digested with NcoI/BamHI to obtain pYH40.
以pET28-CHS为模板,CHS-F-NdeI/CHS-R-XhoI为引物获得PCR产物,经与NdeI/XhoI双酶切的pYH40连接获得pYH50。Using pET28-CHS as a template and CHS-F-NdeI/CHS-R-XhoI as primers to obtain a PCR product, which was ligated with NdeI/XhoI double-digested pYH40 to obtain pYH50.
以pET28a-CHI为模板,T7CHI-F-XhoI/CHI-R-AvrII为引物获得PCR产物,连接到pYH50获得pYH51。Using pET28a-CHI as a template and T7CHI-F-XhoI/CHI-R-AvrII as primers to obtain a PCR product, which was connected to pYH50 to obtain pYH51.
以pET28-PAL为模板,T7PAL-F-BamH I/PAL-R-Hind III为引物获得PCR产物,经BamH I/Hind III双酶切后与同样双酶切的pYH51连接,获得质粒pYH55。Using pET28-PAL as a template and T7PAL-F-BamH I/PAL-R-Hind III as primers to obtain a PCR product, which was digested with BamH I/Hind III and ligated with the same double digested pYH51 to obtain plasmid pYH55.
以pET28a-FNSI为模板,FNSI-HindIII-F/FNSI-NotI-R)为引物获得PCR产物,经Hind III/Not I双酶切后与同样双酶切的pYH55连接,获得最终载体pYH57。质粒pYH57的构建示意图如图4所示。Using pET28a-FNSI as a template and FNSI-HindIII-F/FNSI-NotI-R) as primers to obtain a PCR product, which was digested with Hind III/Not I and ligated with the same double digested pYH55 to obtain the final vector pYH57. The schematic diagram of the construction of plasmid pYH57 is shown in FIG. 4 .
表2Table 2
实施例4、黄芩素和野黄芩素合成菌株的构建及功能验证Example 4, construction and functional verification of baicalein and scutellarein synthetic strains
黄芩素和野黄芩素的生物合成的示意图如图1。The schematic diagram of the biosynthesis of baicalein and scutellarein is shown in Fig. 1 .
将重组质粒pYH66和pYH57共转化到大肠杆菌BL21(DE3)的感受态细胞中以获得工程菌株BL21(DE3)-pYH57-pYH66。The recombinant plasmids pYH66 and pYH57 were co-transformed into competent cells of Escherichia coli BL21(DE3) to obtain engineering strain BL21(DE3)-pYH57-pYH66.
LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL),转接过夜培养的菌液到新的10mL MOPS液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度1mM,加入不同浓度经灭菌的苯丙氨酸或酪氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养48h。携带未连入外来基因的pETDuet-1和pCDFDuet-1空质粒的BL21(DE3)重组菌株作为空白对照,培养操作同上。LB solid medium (spectinomycin 80 μg/mL, ampicillin 100 μg/mL) was cultured overnight at 37°C. Pick a single clone into 2mL LB liquid medium (spectinomycin 80μg/mL, ampicillin 100μg/mL), transfer the overnight cultured bacterial liquid to a new 10mL MOPS liquid resistance medium 37°C, 250r/min culture To OD 600 =0.5-0.6, cool the water bath to about 16°C, then add the inducer IPTG to a final concentration of 1mM, add different concentrations of sterilized phenylalanine or tyrosine and transfer to 22°C for low-temperature induction culture, in The culture was continued for 48 hours under the condition of the shaker speed of 220r/min. The BL21(DE3) recombinant strains carrying pETDuet-1 and pCDFDuet-1 empty plasmids that were not linked to foreign genes were used as blank controls, and the cultivation operation was the same as above.
同时,将表2所列的各个重组质粒分别转化大肠杆菌,进行培养,检测各自的产物生成情况。At the same time, each recombinant plasmid listed in Table 2 was transformed into Escherichia coli, cultured, and the production of each product was detected.
培养后,检测各个转化了重组质粒的重组菌株表达化合物的状况,如表3所示。After culturing, the conditions of expression of the compound by each recombinant strain transformed with the recombinant plasmid were detected, as shown in Table 3.
表3table 3
经验证,本发明的各个重组菌株可以成功合成目标化合物。It has been verified that each recombinant strain of the present invention can successfully synthesize the target compound.
工程菌株BL21(DE3)-pYH57-pYH66和标准品黄芩素的HPLC检测图谱如图5。工程菌株BL21(DE3)-pYH57-pYH66产黄芩素的质谱图谱如图6。The HPLC detection patterns of the engineering strain BL21(DE3)-pYH57-pYH66 and the standard baicalein are shown in Figure 5. The mass spectrum of baicalein produced by the engineering strain BL21(DE3)-pYH57-pYH66 is shown in Figure 6.
工程菌株BL21(DE3)-pYH57-pYH66和标准品野黄芩素的HPLC检测图谱如图7。工程菌株BL21(DE3)-pYH57-pYH66产野黄芩素的质谱图谱如图8。The HPLC detection patterns of the engineering strain BL21(DE3)-pYH57-pYH66 and the standard product scutellarein are shown in Figure 7. The mass spectrum of scutellarein produced by the engineering strain BL21(DE3)-pYH57-pYH66 is shown in Figure 8.
实施例5、以白杨素为底物进行生产Embodiment 5, produce with chrysin as substrate
分别将pYH58~pYH66这6个重组质粒转化到大肠杆菌BL21(DE3)的感受态细胞中以获得工程菌株,并命名为BL21(DE3)-pYH58~BL21(DE3)-pYH66。The six recombinant plasmids pYH58-pYH66 were transformed into competent cells of Escherichia coli BL21(DE3) to obtain engineering strains, which were named BL21(DE3)-pYH58-BL21(DE3)-pYH66.
LB固体培养基(氨苄青霉素100μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(氨苄青霉素100μg/mL),转接过夜培养的菌液到新的20mL MOPS液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度1mM,在摇床转速220r/min,温度22℃条件下继续培养12h。将上述发酵液并在6000rpm,4℃,10min条件下离心,去掉上清,收集菌体。用反应缓冲液(50mM Tris-HCl,pH 7.4,0.1%Trixton)重悬至OD600=30。取1mL各重组菌的重悬液,向其中加入5μL白杨素(25mM)和2.5μLNADPH(100mM),继续在37℃反应8小时。待反应结束后,向反应液中加入10μL HCl(6M)和1mL乙酸乙酯进行萃取3次,浓缩有机相获得的残留物用200μL甲醇溶解后取10μL进行HPLC分析。LB solid medium (ampicillin 100 μg/mL) was cultured overnight at 37°C. Pick a single clone into 2mL LB liquid medium (ampicillin 100μg/mL), transfer the overnight cultured bacterial liquid to a new 20mL MOPS liquid resistance medium at 37°C, 250r/min and cultivate until OD 600 = 0.5-0.6 , the water bath was cooled to about 16°C, and then the inducer IPTG was added to a final concentration of 1mM, and the culture was continued for 12h under the conditions of a shaking table with a rotation speed of 220r/min and a temperature of 22°C. The above fermentation broth was centrifuged at 6000rpm, 4°C, 10min, the supernatant was removed, and the cells were collected. Resuspend to OD 600 =30 with reaction buffer (50 mM Tris-HCl, pH 7.4, 0.1% Trixton). 1 mL of the resuspension of each recombinant bacteria was taken, 5 μL of chrysin (25 mM) and 2.5 μL of NADPH (100 mM) were added thereto, and the reaction was continued at 37° C. for 8 hours. After the reaction was completed, 10 μL of HCl (6M) and 1 mL of ethyl acetate were added to the reaction solution for extraction three times, and the residue obtained by concentrating the organic phase was dissolved in 200 μL of methanol, and 10 μL was taken for HPLC analysis.
各个重组大肠杆菌中,催化白杨素生成黄芩素的转化率如图9B。The conversion rate of catalyzing chrysin to baicalein in each recombinant E. coli is shown in Figure 9B.
各个重组大肠杆菌催化反应液的HPLC分析结果如图9C。The HPLC analysis results of each recombinant Escherichia coli catalytic reaction solution are shown in Figure 9C.
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。All documents mentioned in this application are incorporated by reference in this application as if each were individually incorporated by reference. In addition, it should be understood that after reading the above teaching content of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.
序列表sequence listing
<110> 中国科学院上海生命科学研究院<110> Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences
<120> 合成黄芩素和野黄芩素的微生物、其制备方法及其应用<120> Microorganisms for Synthesizing Baicalein and Scutellarein, Their Preparation Methods and Applications
<130> 186662<130> 186662
<160> 29<160> 29
<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0
<210> 1<210> 1
<211> 517<211> 517
<212> PRT<212> PRT
<213> 黄岑(Scutellaria baicalensis)<213> Scutellaria baicalensis
<400> 1<400> 1
Met Glu Leu Ser Ser Val Ile Tyr Gly Ala Ile Ala Leu Leu Ser LeuMet Glu Leu Ser Ser Val Ile Tyr Gly Ala Ile Ala Leu Leu Ser Leu
1 5 10 151 5 10 15
Phe Tyr Cys Tyr Leu His Phe Ser Lys Pro Lys Lys Ser Ser Leu AsnPhe Tyr Cys Tyr Leu His Phe Ser Lys Pro Lys Lys Ser Ser Ser Leu Asn
20 25 30 20 25 30
Ala Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu HisAla Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu His
35 40 45 35 40 45
Leu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn LeuLeu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn Leu
50 55 60 50 55 60
Gly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu GlyGly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu Gly
65 70 75 8065 70 75 80
Val His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu IleVal His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu Ile
85 90 95 85 90 95
Phe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile AlaPhe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile Ala
100 105 110 100 105 110
Asp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro TyrAsp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr
115 120 125 115 120 125
Gly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu LeuGly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu Leu
130 135 140 130 135 140
Ser Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu IleSer Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu Ile
145 150 155 160145 150 155 160
Thr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys AspThr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp
165 170 175 165 170 175
Gly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn LeuGly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn Leu
180 185 190 180 185 190
Ser Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr GlySer Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr Gly
195 200 205 195 200 205
Gly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg GluGly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg Glu
210 215 220 210 215 220
Phe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro PhePhe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe
225 230 235 240225 230 235 240
Leu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp ThrLeu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr
245 250 255 245 250 255
Ala Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr ArgAla Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg
260 265 270 260 265 270
Lys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala LeuLys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala Leu
275 280 285 275 280 285
Met Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr AspMet Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr Asp
290 295 300 290 295 300
Ala Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala AlaAla Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala Ala
305 310 315 320305 310 315 320
Ser Asp Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu LeuSer Asp Thr Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu Leu
325 330 335 325 330 335
Asn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu GlnAsn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu Gln
340 345 350 340 345 350
Val Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu ProVal Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu Pro
355 360 365 355 360 365
Tyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro AlaTyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala
370 375 380 370 375 380
Pro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly GlyPro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly Gly
385 390 395 400385 390 395 400
Tyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys LeuTyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys Leu
405 410 415 405 410 415
His Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys ProHis Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro
420 425 430 420 425 430
Gln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe GluGln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe Glu
435 440 445 435 440 445
Leu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn LeuLeu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn Leu
450 455 460 450 455 460
Gly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala PheGly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala Phe
465 470 475 480465 470 475 480
Asp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly LeuAsp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly Leu
485 490 495 485 490 495
Thr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg LeuThr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu
500 505 510 500 505 510
Ser Pro Thr Leu TyrSer Pro Thr Leu Tyr
515 515
<210> 2<210> 2
<211> 495<211> 495
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(495)<222> (1)..(495)
<223> F6H突变体<223> F6H mutant
<400> 2<400> 2
Met Ala Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala GlyMet Ala Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly
1 5 10 151 5 10 15
Gly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg SerGly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser
20 25 30 20 25 30
Ala Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp GlnAla Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln
35 40 45 35 40 45
His Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val ValHis Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val
50 55 60 50 55 60
Val Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp ThrVal Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr
65 70 75 8065 70 75 80
Ala Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser TyrAla Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr
85 90 95 85 90 95
Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg GluAsp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu
100 105 110 100 105 110
Ile Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile GluIle Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu
115 120 125 115 120 125
Leu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly GluLeu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu
130 135 140 130 135 140
Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val LeuLeu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu
145 150 155 160145 150 155 160
Val Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val SerVal Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser
165 170 175 165 170 175
Arg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu ThrArg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr
180 185 190 180 185 190
Thr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile GlyThr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly
195 200 205 195 200 205
Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp LeuGln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu
210 215 220 210 215 220
Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp LysGly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys
225 230 235 240225 230 235 240
Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser GlyIle Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly
245 250 255 245 250 255
Asp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val GlnAsp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln
260 265 270 260 265 270
Ala Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile LysAla Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys
275 280 285 275 280 285
Ala Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr ValAla Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val
290 295 300 290 295 300
Ile Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val LeuIle Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu
305 310 315 320305 310 315 320
Lys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg ArgLys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg
325 330 335 325 330 335
Val Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val ValVal Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val
340 345 350 340 345 350
Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val ArgLys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg
355 360 365 355 360 365
Ala Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys GlyAla Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly
370 375 380 370 375 380
Thr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg ValThr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val
385 390 395 400385 390 395 400
Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp LysTrp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys
405 410 415 405 410 415
Lys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly GlyLys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly
420 425 430 420 425 430
Gly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val HisGly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His
435 440 445 435 440 445
Phe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly SerPhe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser
450 455 460 450 455 460
Thr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala ThrThr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr
465 470 475 480465 470 475 480
Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrPro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr
485 490 495 485 490 495
<210> 3<210> 3
<211> 501<211> 501
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(501)<222> (1)..(501)
<223> F6H突变体8RPtrF6H<223> F6H mutant 8RPtrF6H
<400> 3<400> 3
Met Ala Leu Leu Leu Ala Val Phe Met Pro Lys Lys Ser Ser Leu AsnMet Ala Leu Leu Leu Ala Val Phe Met Pro Lys Lys Ser Ser Leu Asn
1 5 10 151 5 10 15
Ala Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu HisAla Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu His
20 25 30 20 25 30
Leu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn LeuLeu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn Leu
35 40 45 35 40 45
Gly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu GlyGly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu Gly
50 55 60 50 55 60
Val His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu IleVal His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu Ile
65 70 75 8065 70 75 80
Phe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile AlaPhe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile Ala
85 90 95 85 90 95
Asp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro TyrAsp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr
100 105 110 100 105 110
Gly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu LeuGly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu Leu
115 120 125 115 120 125
Ser Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu IleSer Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu Ile
130 135 140 130 135 140
Thr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys AspThr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp
145 150 155 160145 150 155 160
Gly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn LeuGly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn Leu
165 170 175 165 170 175
Ser Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr GlySer Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr Gly
180 185 190 180 185 190
Gly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg GluGly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg Glu
195 200 205 195 200 205
Phe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro PhePhe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe
210 215 220 210 215 220
Leu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp ThrLeu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr
225 230 235 240225 230 235 240
Ala Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr ArgAla Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg
245 250 255 245 250 255
Lys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala LeuLys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala Leu
260 265 270 260 265 270
Met Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr AspMet Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr Asp
275 280 285 275 280 285
Ala Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala AlaAla Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala Ala
290 295 300 290 295 300
Ser Asp Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu LeuSer Asp Thr Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu Leu
305 310 315 320305 310 315 320
Asn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu GlnAsn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu Gln
325 330 335 325 330 335
Val Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu ProVal Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu Pro
340 345 350 340 345 350
Tyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro AlaTyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala
355 360 365 355 360 365
Pro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly GlyPro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly Gly
370 375 380 370 375 380
Tyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys LeuTyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys Leu
385 390 395 400385 390 395 400
His Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys ProHis Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro
405 410 415 405 410 415
Gln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe GluGln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe Glu
420 425 430 420 425 430
Leu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn LeuLeu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn Leu
435 440 445 435 440 445
Gly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala PheGly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala Phe
450 455 460 450 455 460
Asp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly LeuAsp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly Leu
465 470 475 480465 470 475 480
Thr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg LeuThr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu
485 490 495 485 490 495
Ser Pro Thr Leu TyrSer Pro Thr Leu Tyr
500 500
<210> 4<210> 4
<211> 591<211> 591
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(591)<222> (1)..(591)
<223> 的F6H突变体SumotrF6H<223> F6H mutant SumotrF6H
<400> 4<400> 4
Met Ala Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys ProMet Ala Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro
1 5 10 151 5 10 15
Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly SerGlu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser
20 25 30 20 25 30
Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg LeuSer Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu
35 40 45 35 40 45
Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu ArgMet Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg
50 55 60 50 55 60
Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu AspPhe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp
65 70 75 8065 70 75 80
Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln IleLeu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile
85 90 95 85 90 95
Gly Gly Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala GlyGly Gly Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly
100 105 110 100 105 110
Gly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg SerGly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser
115 120 125 115 120 125
Ala Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp GlnAla Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln
130 135 140 130 135 140
His Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val ValHis Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val
145 150 155 160145 150 155 160
Val Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp ThrVal Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr
165 170 175 165 170 175
Ala Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser TyrAla Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr
180 185 190 180 185 190
Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg GluAsp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu
195 200 205 195 200 205
Ile Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile GluIle Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu
210 215 220 210 215 220
Leu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly GluLeu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu
225 230 235 240225 230 235 240
Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val LeuLeu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu
245 250 255 245 250 255
Val Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val SerVal Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser
260 265 270 260 265 270
Arg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu ThrArg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr
275 280 285 275 280 285
Thr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile GlyThr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly
290 295 300 290 295 300
Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp LeuGln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu
305 310 315 320305 310 315 320
Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp LysGly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys
325 330 335 325 330 335
Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser GlyIle Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly
340 345 350 340 345 350
Asp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val GlnAsp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln
355 360 365 355 360 365
Ala Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile LysAla Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys
370 375 380 370 375 380
Ala Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr ValAla Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val
385 390 395 400385 390 395 400
Ile Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val LeuIle Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu
405 410 415 405 410 415
Lys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg ArgLys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg
420 425 430 420 425 430
Val Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val ValVal Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val
435 440 445 435 440 445
Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val ArgLys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg
450 455 460 450 455 460
Ala Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys GlyAla Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly
465 470 475 480465 470 475 480
Thr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg ValThr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val
485 490 495 485 490 495
Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp LysTrp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys
500 505 510 500 505 510
Lys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly GlyLys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly
515 520 525 515 520 525
Gly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val HisGly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His
530 535 540 530 535 540
Phe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly SerPhe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser
545 550 555 560545 550 555 560
Thr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala ThrThr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr
565 570 575 565 570 575
Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrPro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr
580 585 590 580 585 590
<210> 5<210> 5
<211> 861<211> 861
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(861)<222> (1)..(861)
<223> F6H突变体MBPtrF6H<223> F6H mutant MBPtrF6H
<400> 5<400> 5
Met Ala Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly AspMet Ala Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp
1 5 10 151 5 10 15
Lys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys AspLys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp
20 25 30 20 25 30
Thr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu LysThr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys
35 40 45 35 40 45
Phe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe TrpPhe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp
50 55 60 50 55 60
Ala His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala GluAla His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu
65 70 75 8065 70 75 80
Ile Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr TrpIle Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp
85 90 95 85 90 95
Asp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala ValAsp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val
100 105 110 100 105 110
Glu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro ProGlu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro
115 120 125 115 120 125
Lys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala LysLys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys
130 135 140 130 135 140
Gly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr TrpGly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp
145 150 155 160145 150 155 160
Pro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn GlyPro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly
165 170 175 165 170 175
Lys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys AlaLys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala
180 185 190 180 185 190
Gly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn AlaGly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala
195 200 205 195 200 205
Asp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu ThrAsp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr
210 215 220 210 215 220
Ala Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr SerAla Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser
225 230 235 240225 230 235 240
Lys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln ProLys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro
245 250 255 245 250 255
Ser Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala SerSer Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser
260 265 270 260 265 270
Pro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu ThrPro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr
275 280 285 275 280 285
Asp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala ValAsp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val
290 295 300 290 295 300
Ala Leu Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile AlaAla Leu Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile Ala
305 310 315 320305 310 315 320
Ala Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile ProAla Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro
325 330 335 325 330 335
Gln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn AlaGln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala
340 345 350 340 345 350
Ala Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln ThrAla Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr
355 360 365 355 360 365
Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly AlaMet Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly Ala
370 375 380 370 375 380
Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala SerArg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala Ser
385 390 395 400385 390 395 400
Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His GlyAsp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His Gly
405 410 415 405 410 415
Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val SerPro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val Ser
420 425 430 420 425 430
Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala ValSer Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala Val
435 440 445 435 440 445
Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp GlyMet Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp Gly
450 455 460 450 455 460
Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile ArgAla Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile Arg
465 470 475 480465 470 475 480
Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu GlnLys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu Gln
485 490 495 485 490 495
Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu TyrArg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu Tyr
500 505 510 500 505 510
Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val AspLys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val Asp
515 520 525 515 520 525
Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg MetMet Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg Met
530 535 540 530 535 540
Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr LysVal Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr Lys
545 550 555 560545 550 555 560
Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln PheArg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln Phe
565 570 575 565 570 575
Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly GlyIle Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly
580 585 590 580 585 590
Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile IlePhe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile Ile
595 600 605 595 600 605
Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp AspAla Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp Asp
610 615 620 610 615 620
Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala AsnLys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala Asn
625 630 635 640625 630 635 640
Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala ThrPro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala Thr
645 650 655 645 650 655
Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile LeuCys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile Leu
660 665 670 660 665 670
Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys LysIle Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys Lys
675 680 685 675 680 685
Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val GluVal Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val Glu
690 695 700 690 695 700
Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys GluGlu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys Glu
705 710 715 720705 710 715 720
Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala PheThr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala Phe
725 730 735 725 730 735
Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr PheSer Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr Phe
740 745 750 740 745 750
Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp SerLeu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp Ser
755 760 765 755 760 765
Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys ValAsp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys Val
770 775 780 770 775 780
Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly ArgGlu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly Arg
785 790 795 800785 790 795 800
Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe ValArg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe Val
805 810 815 805 810 815
Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr ValLeu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr Val
820 825 830 820 825 830
Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro LeuAsp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro Leu
835 840 845 835 840 845
Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrAsp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr
850 855 860 850 855 860
<210> 6<210> 6
<211> 509<211> 509
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(509)<222> (1)..(509)
<223> F6H突变体2B1trF6H<223> F6H mutant 2B1trF6H
<400> 6<400> 6
Met Ala Lys Lys Thr Ser Ser Lys Gly Lys Leu Pro Pro Gly Pro SerMet Ala Lys Lys Thr Ser Ser Lys Gly Lys Leu Pro Pro Gly Pro Ser
1 5 10 151 5 10 15
Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly AlaMet Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly Ala
20 25 30 20 25 30
Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala SerArg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala Ser
35 40 45 35 40 45
Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His GlyAsp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His Gly
50 55 60 50 55 60
Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val SerPro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val Ser
65 70 75 8065 70 75 80
Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala ValSer Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala Val
85 90 95 85 90 95
Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp GlyMet Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp Gly
100 105 110 100 105 110
Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile ArgAla Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile Arg
115 120 125 115 120 125
Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu GlnLys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu Gln
130 135 140 130 135 140
Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu TyrArg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu Tyr
145 150 155 160145 150 155 160
Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val AspLys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val Asp
165 170 175 165 170 175
Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg MetMet Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg Met
180 185 190 180 185 190
Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr LysVal Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr Lys
195 200 205 195 200 205
Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln PheArg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln Phe
210 215 220 210 215 220
Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly GlyIle Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly
225 230 235 240225 230 235 240
Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile IlePhe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile Ile
245 250 255 245 250 255
Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp AspAla Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp Asp
260 265 270 260 265 270
Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala AsnLys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala Asn
275 280 285 275 280 285
Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala ThrPro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala Thr
290 295 300 290 295 300
Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile LeuCys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile Leu
305 310 315 320305 310 315 320
Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys LysIle Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys Lys
325 330 335 325 330 335
Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val GluVal Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val Glu
340 345 350 340 345 350
Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys GluGlu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys Glu
355 360 365 355 360 365
Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala PheThr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala Phe
370 375 380 370 375 380
Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr PheSer Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr Phe
385 390 395 400385 390 395 400
Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp SerLeu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp Ser
405 410 415 405 410 415
Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys ValAsp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys Val
420 425 430 420 425 430
Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly ArgGlu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly Arg
435 440 445 435 440 445
Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe ValArg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe Val
450 455 460 450 455 460
Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr ValLeu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr Val
465 470 475 480465 470 475 480
Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro LeuAsp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro Leu
485 490 495 485 490 495
Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrAsp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr
500 505 500 505
<210> 7<210> 7
<211> 712<211> 712
<212> PRT<212> PRT
<213> 拟南芥(Arabidopsis thaliana)<213> Arabidopsis thaliana
<400> 7<400> 7
Met Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met AlaMet Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met Ala
1 5 10 151 5 10 15
Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn AlaAla Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn Ala
20 25 30 20 25 30
Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile GluSer Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile Glu
35 40 45 35 40 45
Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu IleAsn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu Ile
50 55 60 50 55 60
Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn SerGly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn Ser
65 70 75 8065 70 75 80
Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu GluLys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu Glu
85 90 95 85 90 95
Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr GlnGlu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr Gln
100 105 110 100 105 110
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala LysThr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala Lys
115 120 125 115 120 125
Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp TyrAla Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp Tyr
130 135 140 130 135 140
Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp ValAla Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp Val
145 150 155 160145 150 155 160
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp AsnAla Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175 165 170 175
Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly GluAla Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly Glu
180 185 190 180 185 190
Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg GlnTrp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205 195 200 205
Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu ValTyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu Val
210 215 220 210 215 220
Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp AspGlu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp Asp
225 230 235 240225 230 235 240
Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp ProGln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp Pro
245 250 255 245 250 255
Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala ThrGlu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala Thr
260 265 270 260 265 270
Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp SerPro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp Ser
275 280 285 275 280 285
Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly TyrGlu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly Tyr
290 295 300 290 295 300
Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val LysThr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val Lys
305 310 315 320305 310 315 320
Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu GluArg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu Glu
325 330 335 325 330 335
Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His ValPhe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His Val
340 345 350 340 345 350
Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu ArgGly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu Arg
355 360 365 355 360 365
Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu LysLeu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu Lys
370 375 380 370 375 380
Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro ProGlu Asp Gly Thr Pro Ile Ser Ser Ser Ser Leu Pro Pro Pro Phe Pro Pro
385 390 395 400385 390 395 400
Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser SerCys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser Ser
405 410 415 405 410 415
Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp ProPro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp Pro
420 425 430 420 425 430
Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys ValThr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys Val
435 440 445 435 440 445
Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu GluAsp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu
450 455 460 450 455 460
Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe PheVal Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe
465 470 475 480465 470 475 480
Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser SerAla Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser
485 490 495 485 490 495
Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu ValSer Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val
500 505 510 500 505 510
Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser ThrTyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr
515 520 525 515 520 525
Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser SerTrp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser
530 535 540 530 535 540
Ala Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp SerAla Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser
545 550 555 560545 550 555 560
Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro PheLys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe
565 570 575 565 570 575
Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val GluArg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu
580 585 590 580 585 590
Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met AspLeu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp
595 600 605 595 600 605
Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala LeuPhe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu
610 615 620 610 615 620
Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu TyrAla Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr
625 630 635 640625 630 635 640
Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met IleVal Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile
645 650 655 645 650 655
Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met AlaSer Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala
660 665 670 660 665 670
Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly SerArg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser
675 680 685 675 680 685
Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr SerMet Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser
690 695 700 690 695 700
Gly Arg Tyr Leu Arg Asp Val TrpGly Arg Tyr Leu Arg Asp Val Trp
705 710705 710
<210> 8<210> 8
<211> 641<211> 641
<212> PRT<212> PRT
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)
<220><220>
<221> VARIANT<221> VARIANT
<222> (1)..(641)<222> (1)..(641)
<223> AtCPR突变体trAtCPR<223> AtCPR mutant trAtCPR
<400> 8<400> 8
Met Arg Arg Ser Gly Ser Gly Asn Ser Lys Arg Val Glu Pro Leu LysMet Arg Arg Ser Gly Ser Gly Asn Ser Lys Arg Val Glu Pro Leu Lys
1 5 10 151 5 10 15
Pro Leu Val Ile Lys Pro Arg Glu Glu Glu Ile Asp Asp Gly Arg LysPro Leu Val Ile Lys Pro Arg Glu Glu Glu Ile Asp Asp Gly Arg Lys
20 25 30 20 25 30
Lys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly PheLys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly Phe
35 40 45 35 40 45
Ala Lys Ala Leu Gly Glu Glu Ala Lys Ala Arg Tyr Glu Lys Thr ArgAla Lys Ala Leu Gly Glu Glu Ala Lys Ala Arg Tyr Glu Lys Thr Arg
50 55 60 50 55 60
Phe Lys Ile Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp Glu TyrPhe Lys Ile Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp Glu Tyr
65 70 75 8065 70 75 80
Glu Glu Lys Leu Lys Lys Glu Asp Val Ala Phe Phe Phe Leu Ala ThrGlu Glu Lys Leu Lys Lys Glu Asp Val Ala Phe Phe Phe Leu Ala Thr
85 90 95 85 90 95
Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys TrpTyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys Trp
100 105 110 100 105 110
Phe Thr Glu Gly Asn Asp Arg Gly Glu Trp Leu Lys Asn Leu Lys TyrPhe Thr Glu Gly Asn Asp Arg Gly Glu Trp Leu Lys Asn Leu Lys Tyr
115 120 125 115 120 125
Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys ValGly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys Val
130 135 140 130 135 140
Ala Lys Val Val Asp Asp Ile Leu Val Glu Gln Gly Ala Gln Arg LeuAla Lys Val Val Asp Asp Ile Leu Val Glu Gln Gly Ala Gln Arg Leu
145 150 155 160145 150 155 160
Val Gln Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp PheVal Gln Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp Phe
165 170 175 165 170 175
Thr Ala Trp Arg Glu Ala Leu Trp Pro Glu Leu Asp Thr Ile Leu ArgThr Ala Trp Arg Glu Ala Leu Trp Pro Glu Leu Asp Thr Ile Leu Arg
180 185 190 180 185 190
Glu Glu Gly Asp Thr Ala Val Ala Thr Pro Tyr Thr Ala Ala Val LeuGlu Glu Gly Asp Thr Ala Val Ala Thr Pro Tyr Thr Ala Ala Val Leu
195 200 205 195 200 205
Glu Tyr Arg Val Ser Ile His Asp Ser Glu Asp Ala Lys Phe Asn AspGlu Tyr Arg Val Ser Ile His Asp Ser Glu Asp Ala Lys Phe Asn Asp
210 215 220 210 215 220
Ile Asn Met Ala Asn Gly Asn Gly Tyr Thr Val Phe Asp Ala Gln HisIle Asn Met Ala Asn Gly Asn Gly Tyr Thr Val Phe Asp Ala Gln His
225 230 235 240225 230 235 240
Pro Tyr Lys Ala Asn Val Ala Val Lys Arg Glu Leu His Thr Pro GluPro Tyr Lys Ala Asn Val Ala Val Lys Arg Glu Leu His Thr Pro Glu
245 250 255 245 250 255
Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ala Gly Ser GlySer Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ala Gly Ser Gly
260 265 270 260 265 270
Leu Thr Tyr Glu Thr Gly Asp His Val Gly Val Leu Cys Asp Asn LeuLeu Thr Tyr Glu Thr Gly Asp His Val Gly Val Leu Cys Asp Asn Leu
275 280 285 275 280 285
Ser Glu Thr Val Asp Glu Ala Leu Arg Leu Leu Asp Met Ser Pro AspSer Glu Thr Val Asp Glu Ala Leu Arg Leu Leu Asp Met Ser Pro Asp
290 295 300 290 295 300
Thr Tyr Phe Ser Leu His Ala Glu Lys Glu Asp Gly Thr Pro Ile SerThr Tyr Phe Ser Leu His Ala Glu Lys Glu Asp Gly Thr Pro Ile Ser
305 310 315 320305 310 315 320
Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Asn Leu Arg Thr Ala LeuSer Ser Leu Pro Pro Pro Phe Pro Pro Cys Asn Leu Arg Thr Ala Leu
325 330 335 325 330 335
Thr Arg Tyr Ala Cys Leu Leu Ser Ser Pro Lys Lys Ser Ala Leu ValThr Arg Tyr Ala Cys Leu Leu Ser Ser Pro Lys Lys Ser Ala Leu Val
340 345 350 340 345 350
Ala Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Glu Arg Leu LysAla Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Glu Arg Leu Lys
355 360 365 355 360 365
His Leu Ala Ser Pro Ala Gly Lys Val Asp Glu Tyr Ser Lys Trp ValHis Leu Ala Ser Pro Ala Gly Lys Val Asp Glu Tyr Ser Lys Trp Val
370 375 380 370 375 380
Val Glu Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro SerVal Glu Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro Ser
385 390 395 400385 390 395 400
Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val Ala Pro Arg LeuAla Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val Ala Pro Arg Leu
405 410 415 405 410 415
Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Pro Lys Ile Ala Glu ThrGln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Ser Pro Lys Ile Ala Glu Thr
420 425 430 420 425 430
Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Met Pro Thr GlyArg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Met Pro Thr Gly
435 440 445 435 440 445
Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val ProArg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val Pro
450 455 460 450 455 460
Tyr Glu Lys Ser Glu Asn Cys Ser Ser Ala Pro Ile Phe Val Arg GlnTyr Glu Lys Ser Glu Asn Cys Ser Ser Ala Pro Ile Phe Val Arg Gln
465 470 475 480465 470 475 480
Ser Asn Phe Lys Leu Pro Ser Asp Ser Lys Val Pro Ile Ile Met IleSer Asn Phe Lys Leu Pro Ser Asp Ser Lys Val Pro Ile Ile Met Ile
485 490 495 485 490 495
Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu ArgGly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu Arg
500 505 510 500 505 510
Leu Ala Leu Val Glu Ser Gly Val Glu Leu Gly Pro Ser Val Leu PheLeu Ala Leu Val Glu Ser Gly Val Glu Leu Gly Pro Ser Val Leu Phe
515 520 525 515 520 525
Phe Gly Cys Arg Asn Arg Arg Met Asp Phe Ile Tyr Glu Glu Glu LeuPhe Gly Cys Arg Asn Arg Arg Met Asp Phe Ile Tyr Glu Glu Glu Leu
530 535 540 530 535 540
Gln Arg Phe Val Glu Ser Gly Ala Leu Ala Glu Leu Ser Val Ala PheGln Arg Phe Val Glu Ser Gly Ala Leu Ala Glu Leu Ser Val Ala Phe
545 550 555 560545 550 555 560
Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met AspSer Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met Asp
565 570 575 565 570 575
Lys Ala Ser Asp Ile Trp Asn Met Ile Ser Gln Gly Ala Tyr Leu TyrLys Ala Ser Asp Ile Trp Asn Met Ile Ser Gln Gly Ala Tyr Leu Tyr
580 585 590 580 585 590
Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Ser LeuVal Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Ser Leu
595 600 605 595 600 605
His Thr Ile Ala Gln Glu Gln Gly Ser Met Asp Ser Thr Lys Ala GluHis Thr Ile Ala Gln Glu Gln Gly Ser Met Asp Ser Thr Lys Ala Glu
610 615 620 610 615 620
Gly Phe Val Lys Asn Leu Gln Thr Ser Gly Arg Tyr Leu Arg Asp ValGly Phe Val Lys Asn Leu Gln Thr Ser Gly Arg Tyr Leu Arg Asp Val
625 630 635 640625 630 635 640
TrpTrp
<210> 9<210> 9
<211> 25<211> 25
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 9<400> 9
tataccatgg aactgagcag tgtga 25tataccatgg aactgagcag tgtga 25
<210> 10<210> 10
<211> 36<211> 36
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 10<400> 10
ctcgaattcg gatccactag tttaatataa agtcgg 36ctcgaattcg gatccactag tttaatataa agtcgg 36
<210> 11<210> 11
<211> 43<211> 43
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 11<400> 11
ctttaagaag gagatatacc atggcgatgc cgaagaaaag ctc 43ctttaagaag gagatatacc atggcgatgc cgaagaaaag ctc 43
<210> 12<210> 12
<211> 63<211> 63
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 12<400> 12
ctttaagaag gagatatacc atggctctgt tattagcagt ttttatgccg aagaaaagct 60ctttaagaag gagatatacc atggctctgt tattagcagt ttttatgccg aagaaaagct 60
ctt 63ctt 63
<210> 13<210> 13
<211> 39<211> 39
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 13<400> 13
ctttaagaag gagatatacc atggctaaaa tcgaagaag 39ctttaagaag gagatatacc atggctaaaa tcgaagaag 39
<210> 14<210> 14
<211> 35<211> 35
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 14<400> 14
ctgaaagacg cgcagactat gccgaagaaa agctc 35ctgaaagacg cgcagactat gccgaagaaa agctc 35
<210> 15<210> 15
<211> 35<211> 35
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 15<400> 15
gagcttttct tcggcatagt ctgcgcgtct ttcag 35gagcttttct tcggcatagt ctgcgcgtct ttcag 35
<210> 16<210> 16
<211> 87<211> 87
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 16<400> 16
ctttaagaag gagatatacc atggctaaga aaacgagctc taaagggaag ctcccaccag 60ctttaagaag gagatatacc atggctaaga aaacgagctc taaagggaag ctcccaccag 60
gacctagcat gccgaagaaa agctctt 87gacctagcat gccgaagaaa agctctt 87
<210> 17<210> 17
<211> 44<211> 44
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 17<400> 17
ctttaagaag gagatatacc atggcggact cagaagtcaa tctt 44ctttaagaag gagatatacc atggcggact cagaagtcaa tctt 44
<210> 18<210> 18
<211> 36<211> 36
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 18<400> 18
gagaacagat tggtggtatg ccgaagaaaa gctctt 36gagaacagat tggtggtatg ccgaagaaaa gctctt 36
<210> 19<210> 19
<211> 36<211> 36
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 19<400> 19
aagagctttt cttcggcata ccaccaatct gttctc 36aagagctttt cttcggcata ccaccaatct gttctc 36
<210> 20<210> 20
<211> 27<211> 27
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 20<400> 20
tataccatgg gtgactgcgt tgccccg 27tataccatgg gtgactgcgt tgccccg 27
<210> 21<210> 21
<211> 30<211> 30
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 21<400> 21
cgggatcctt acttcggcag gtcgccgctc 30cgggatcctt acttcggcag gtcgccgctc 30
<210> 22<210> 22
<211> 29<211> 29
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 22<400> 22
cgggatccct tatgcgactc ctgcattag 29cgggatccct tatgcgactc ctgcattag 29
<210> 23<210> 23
<211> 26<211> 26
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 23<400> 23
gcccaagctt ttatgccagc atcttc 26gcccaagctt ttatgccagc atcttc 26
<210> 24<210> 24
<211> 31<211> 31
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 24<400> 24
agatatacat atggttacgg tggaagaata c 31agatatacat atggttacgg tggaagaata c 31
<210> 25<210> 25
<211> 30<211> 30
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 25<400> 25
ccgctcgagt taggtagcca cactatgcag 30ccgctcgagt taggtagcca cactatgcag 30
<210> 26<210> 26
<211> 30<211> 30
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 26<400> 26
ccgctcgagc tagaaataat tttgtttaac 30ccgctcgagc tagaaataat tttgtttaac 30
<210> 27<210> 27
<211> 28<211> 28
<212> DNA<212> DNA
<213> 引物(Primer)<213> Primer
<400> 27<400> 27
gagcctaggt tagttaccga ttttaaag 28gagcctaggt tagttaccga ttttaaag 28
<210> 28<210> 28
<211> 38<211> 38
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 28<400> 28
gaagatgctg gcataaaagc ttcgatcccg cgaaatta 38gaagatgctg gcataaaagc ttcgatcccg cgaaatta 38
<210> 29<210> 29
<211> 38<211> 38
<212> DNA<212>DNA
<213> 引物(Primer)<213> Primer
<400> 29<400> 29
cgacttaagc attatgcggc cgcctacgcc aggttttc 38cgacttaagc attatgcggc cgcctacgcc aggttttc 38
Claims (10)
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811043657.0A CN110885846B (en) | 2018-09-07 | 2018-09-07 | Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof |
US17/274,377 US20220033827A1 (en) | 2018-09-07 | 2019-09-06 | Baicalein- and Scutellarein- Synthesizing Microorganism, Preparation Method and Applications Thereof |
JP2021537458A JP2021535757A (en) | 2018-09-07 | 2019-09-06 | Microorganisms that synthesize baicalein and scutellarein, their production methods and their use |
EP19856619.2A EP3848462A4 (en) | 2018-09-07 | 2019-09-06 | BAICALEIN AND WILD BAICALEIN SYNTHETIC MICROORGANISM, PREPARATION METHOD THEREOF AND APPLICATIONS THEREOF |
PCT/CN2019/104658 WO2020048523A1 (en) | 2018-09-07 | 2019-09-06 | Baicalein- and wild baicalein-synthesizing microorganism, preparation method for same, and applications thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811043657.0A CN110885846B (en) | 2018-09-07 | 2018-09-07 | Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110885846A CN110885846A (en) | 2020-03-17 |
CN110885846B true CN110885846B (en) | 2023-08-25 |
Family
ID=69722264
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811043657.0A Active CN110885846B (en) | 2018-09-07 | 2018-09-07 | Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof |
Country Status (5)
Country | Link |
---|---|
US (1) | US20220033827A1 (en) |
EP (1) | EP3848462A4 (en) |
JP (1) | JP2021535757A (en) |
CN (1) | CN110885846B (en) |
WO (1) | WO2020048523A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113621530A (en) * | 2020-04-07 | 2021-11-09 | 华东理工大学 | Genetically engineered yeast for producing wogonin compounds, construction method and application thereof |
CN114717170B (en) * | 2021-01-05 | 2024-06-04 | 中国科学院分子植物科学卓越创新中心 | Host cells for heterologous synthesis of flavonoids and their applications |
CN116515876A (en) * | 2022-01-20 | 2023-08-01 | 中国科学院分子植物科学卓越创新中心 | Regulation and control method and application of heterologous synthetic flavonoid compound |
CN114774442B (en) * | 2022-04-29 | 2023-10-17 | 中国科学院天津工业生物技术研究所 | Recombinant yarrowia lipolytica for producing scutellarin and construction method and application thereof |
CN115992106B (en) * | 2022-09-06 | 2025-04-29 | 中国中医科学院中药研究所 | Apigenin 8-hydroxylase protein and its encoding gene and application |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW201522644A (en) * | 2013-12-11 | 2015-06-16 | Univ Chia Nan Pharm & Sciency | Method of manufacturing 6-hydroxyapigenin by biotransformation |
CN106318920A (en) * | 2016-09-14 | 2017-01-11 | 中国科学院天津工业生物技术研究所 | Flavones-6-hydroxylase and application thereof to scutellarin synthesis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007044688A1 (en) * | 2005-10-07 | 2007-04-19 | The Regents Of The University Of California | Nucleic acids encoding modified cytochrome p450 enzymes and methods of use thereof |
-
2018
- 2018-09-07 CN CN201811043657.0A patent/CN110885846B/en active Active
-
2019
- 2019-09-06 EP EP19856619.2A patent/EP3848462A4/en active Pending
- 2019-09-06 JP JP2021537458A patent/JP2021535757A/en active Pending
- 2019-09-06 WO PCT/CN2019/104658 patent/WO2020048523A1/en unknown
- 2019-09-06 US US17/274,377 patent/US20220033827A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW201522644A (en) * | 2013-12-11 | 2015-06-16 | Univ Chia Nan Pharm & Sciency | Method of manufacturing 6-hydroxyapigenin by biotransformation |
CN106318920A (en) * | 2016-09-14 | 2017-01-11 | 中国科学院天津工业生物技术研究所 | Flavones-6-hydroxylase and application thereof to scutellarin synthesis |
Non-Patent Citations (1)
Title |
---|
Two CYP82D Enzymes Function as Flavone Hydroxylases in the Biosynthesis of Root-Specific 4"-Deoxyflavones in Scutellaria baicalensis;Qing Zhao;《Mol Plant》;20170824;第11卷(第1期);表1a * |
Also Published As
Publication number | Publication date |
---|---|
CN110885846A (en) | 2020-03-17 |
US20220033827A1 (en) | 2022-02-03 |
WO2020048523A1 (en) | 2020-03-12 |
JP2021535757A (en) | 2021-12-23 |
EP3848462A1 (en) | 2021-07-14 |
EP3848462A4 (en) | 2022-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110885846B (en) | Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof | |
CN113322288B (en) | Novel flavone hydroxylase, microorganism for synthesizing flavone C-glycoside compounds and application thereof | |
CN113136373B (en) | Carbonoside glycosyltransferase and application thereof | |
CN113265433B (en) | Bifunctional glycosyltransferase and application thereof | |
CN113862290B (en) | Isoflavone 4' -O-methyltransferase from liquorice and application thereof | |
CN114032223B (en) | Esculin and ash bark glycoside glycosyltransferase protein, and coding gene and application thereof | |
CN116656641A (en) | Caffeic acid O-methyltransferase mutant and application thereof | |
WO2022148377A1 (en) | Host cell of heterologous synthetic flavonoid compound, and use thereof | |
CN106318920B (en) | Flavone-6-hydroxylase and its application in the synthesis of scutellarin | |
JP2022540791A (en) | Strains and methods for producing rosmarinic acid | |
CN111440734A (en) | Genetically engineered yeast for producing baicalein compounds, construction method and application thereof | |
CN108866142B (en) | Cytochrome P450, nicotinamide adenine dinucleotide-cytochrome P450 reductase and application thereof | |
CN112574968A (en) | Fusion protein for producing 5 alpha-hydroxy taxadiene and application thereof | |
CN113736762B (en) | An α-L-rhamnosidase mutant and its application in the preparation of plunin | |
CN113667655B (en) | Curculigo orchioides glycosyltransferase Co84A-471 gene and application thereof in preparation of orcinol glucoside | |
KR20240032944A (en) | Rhamnose highly specific glycosyltransferase and its applications | |
CN114657160B (en) | Glycosyltransferase mutant and application thereof | |
CN114277024B (en) | Novel triterpene synthase and application thereof | |
CN107880134A (en) | A kind of method of enzyme' s catalysis Kaempferol | |
CN108359652A (en) | Glycosyl transferase and its application | |
CN112553175A (en) | Preparation and application of glycosyltransferase UGT76G1 mutant | |
US20250092431A1 (en) | Method for Regulating and Controlling Heterologous Synthetic Flavonoid Compound and Use Thereof | |
CN113046333B (en) | Flavone-8-hydroxylase and its application in the synthesis of 8-hydroxyflavone compounds | |
CN117106742A (en) | Methyltransferase from stephania tetrandra and application thereof in preparation of isotetrandrine | |
CN115247156A (en) | Recombinant apple flavonol synthase and preparation method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200609 Address after: 200032 building 4, No. 300 Fenglin Road, Xuhui District, Shanghai Applicant after: Center for excellence and innovation in molecular plant science, Chinese Academy of Sciences Address before: 200031 Yueyang Road, Shanghai, No. 319, No. Applicant before: SHANGHAI INSTITUTES FOR BIOLOGICAL SCIENCES, CHINESE ACADEMY OF SCIENCES |
|
GR01 | Patent grant | ||
GR01 | Patent grant |