CN114854726B - Mutant of fatty acid light decarboxylase McFAP and application thereof
- Publication number
- CN114854726B CN202210708763.6A
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- fap
- leu
- fatty acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Concepts (machine-extracted)
- 235000014113 dietary fatty acids Nutrition 0.000 title claims description 50
- 229930195729 fatty acid Natural products 0.000 title claims description 50
- 239000000194 fatty acid Substances 0.000 title claims description 50
- 150000004665 fatty acids Chemical class 0.000 title claims description 47
- 238000006114 decarboxylation reaction Methods 0.000 claims abstract description 72
- 108030005760 Fatty acid photodecarboxylases Proteins 0.000 claims abstract description 20
- 125000004432 carbon atom Chemical group C* 0.000 claims abstract description 18
- 125000005480 straight-chain fatty acid group Chemical group 0.000 claims abstract description 7
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 3
- 238000000034 method Methods 0.000 claims description 25
- 108090000623 proteins and genes Proteins 0.000 claims description 22
- 239000002773 nucleotide Substances 0.000 claims description 6
- 125000003729 nucleotide group Chemical group 0.000 claims description 6
- 239000013604 expression vector Substances 0.000 claims description 5
- 238000003259 recombinant expression Methods 0.000 claims description 5
- 108090000489 Carboxy-Lyases Proteins 0.000 claims description 4
- 238000006555 catalytic reaction Methods 0.000 claims description 3
- 239000002253 acid Substances 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 34
- 239000000758 substrate Substances 0.000 abstract description 15
- 230000015572 biosynthetic process Effects 0.000 abstract description 5
- 239000000446 fuel Substances 0.000 abstract description 5
- 238000001228 spectrum Methods 0.000 abstract description 2
- WWZKQHOCKIZLMA-UHFFFAOYSA-N octanoic acid Chemical compound CCCCCCCC(O)=O WWZKQHOCKIZLMA-UHFFFAOYSA-N 0.000 description 51
- 238000006243 chemical reaction Methods 0.000 description 38
- 108090000790 Enzymes Proteins 0.000 description 35
- 102000004190 Enzymes Human genes 0.000 description 35
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 15
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 230000003197 catalytic effect Effects 0.000 description 11
- 150000001413 amino acids Chemical class 0.000 description 10
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 9
- 238000011160 research Methods 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 238000003860 storage Methods 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 150000001335 aliphatic alkanes Chemical class 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- KBPLFHHGFOOTCA-UHFFFAOYSA-N caprylic alcohol Natural products CCCCCCCCO KBPLFHHGFOOTCA-UHFFFAOYSA-N 0.000 description 6
- 238000004587 chromatography analysis Methods 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 235000021314 Palmitic acid Nutrition 0.000 description 5
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 238000010828 elution Methods 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 230000035484 reaction time Effects 0.000 description 5
- 235000003441 saturated fatty acids Nutrition 0.000 description 5
- 150000004671 saturated fatty acids Chemical class 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 150000001336 alkenes Chemical class 0.000 description 4
- 239000002551 biofuel Substances 0.000 description 4
- -1 diesel Substances 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 3
- 102000004031 Carboxy-Lyases Human genes 0.000 description 3
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 3
- IMNFDUFMRHMDMM-UHFFFAOYSA-N N-Heptane Chemical compound CCCCCCC IMNFDUFMRHMDMM-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000004817 gas chromatography Methods 0.000 description 3
- 239000003502 gasoline Substances 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- FUZZWVXGSFPDMH-UHFFFAOYSA-N hexanoic acid Chemical compound CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 3
- 229930195733 hydrocarbon Natural products 0.000 description 3
- 150000002430 hydrocarbons Chemical class 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- TVMXDCGIABBOFY-UHFFFAOYSA-N n-Octane Natural products CCCCCCCC 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 229920006395 saturated elastomer Polymers 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- UYRPHDGXHKBZHJ-CIUDSAMLSA-N Asn-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N UYRPHDGXHKBZHJ-CIUDSAMLSA-N 0.000 description 2
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 2
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- 101150027576 FAP gene Proteins 0.000 description 2
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 2
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 2
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 2
- MJXNDRCLGDSBBE-FHWLQOOXSA-N Val-His-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N MJXNDRCLGDSBBE-FHWLQOOXSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 208000012839 conversion disease Diseases 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 150000004667 medium chain fatty acids Chemical class 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- OAHKWDDSKCRNFE-UHFFFAOYSA-N phenylmethanesulfonyl chloride Chemical compound ClS(=O)(=O)CC1=CC=CC=C1 OAHKWDDSKCRNFE-UHFFFAOYSA-N 0.000 description 2
- 238000013032 photocatalytic reaction Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 150000004666 short chain fatty acids Chemical class 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- LZZOARFGFBWHAV-FAOVPRGRSA-N 2-hydroxyethyl(trimethyl)azanium;methanol;(2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC.C[N+](C)(C)CCO.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O LZZOARFGFBWHAV-FAOVPRGRSA-N 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- OARAZORWIMYUPO-FXQIFTODSA-N Ala-Met-Cys Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CS)C(O)=O OARAZORWIMYUPO-FXQIFTODSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 1
- SDXQKJAWASHMIZ-CIUDSAMLSA-N Cys-Glu-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SDXQKJAWASHMIZ-CIUDSAMLSA-N 0.000 description 1
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- LWYKPOCGGTYAIH-FXQIFTODSA-N Cys-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LWYKPOCGGTYAIH-FXQIFTODSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 1
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 1
- MSWBLPLBSLQVME-XIRDDKMYSA-N Cys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 MSWBLPLBSLQVME-XIRDDKMYSA-N 0.000 description 1
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- APWLZZSLCXLDCF-CIUDSAMLSA-N Gln-Cys-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O APWLZZSLCXLDCF-CIUDSAMLSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- VTMSUKSRIKCCAD-ULQDDVLXSA-N His-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N VTMSUKSRIKCCAD-ULQDDVLXSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 235000017858 Laurus nobilis Nutrition 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- XBYKTPZCWQQSGB-IHRRRGAJSA-N Met-Cys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBYKTPZCWQQSGB-IHRRRGAJSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 241000497267 Micractinium conductrix Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/01—Carboxy-lyases (4.1.1)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/30—Fuel from waste, e.g. synthetic alcohol or diesel
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention discloses a mutant of the fatty acid photodecarboxylase McFAP and its application. The amino acid sequence of the mutant is shown in SEQ ID NO. 4. The McFAP mutant of the invention adds to the known types of FAP and fills the gap left by existing FAPs, which cannot catalyze the decarboxylation of saturated straight-chain fatty acids with 6 to 12 carbon atoms. Its decarboxylation substrate spectrum is markedly wider and its decarboxylation performance better, which makes efficient and sustainable biosynthesis of fuels possible and gives it broad prospects for industrial application.
Description
Technical Field
The invention belongs to the technical field of enzyme engineering. Specifically, the invention relates to a mutant of the fatty acid photodecarboxylase McFAP and its application.
Background Art
Biotechnology is steadily moving from medicine, agriculture, and food into industrial fields such as chemicals, materials, and energy. Gasoline, diesel, plastics, rubber, fibers, and many bulk traditional petrochemical products are increasingly being replaced by industrial biomanufactured products made from renewable feedstocks. High-temperature, high-pressure, high-pollution chemical processes are likewise giving way to mild, clean, and environmentally friendly bioprocesses.
The exploration and practice of producing renewable biofuels and chemicals in an environmentally friendly manner has received great attention. Producing fuels from biomass with microorganisms or enzymes has very broad development prospects, and developing technologies for the directed conversion of renewable biomass resources into biogas and liquid fuels has become an important research task for scientists in related fields worldwide.
The enzymes discovered so far that can be used for fatty acid decarboxylation are mainly decarboxylases such as amino acid decarboxylases, haloperoxidases, and the fatty acid decarboxylase that synthesizes terminal olefins (OleTJE). However, most of these enzymes have low catalytic efficiency and require the addition of expensive cofactors and/or oxidants that readily destroy the unsaturated bonds of oils and fats, which is a significant drawback for applied research. Finding efficient new decarboxylases is therefore the key to breaking through the bottleneck in biofuel production research.
Fatty acid photodecarboxylase (FAP, EC 4.1.1.106) belongs to the glucose-methanol-choline (GMC) oxidoreductase family. It is a light-driven enzyme that converts fatty acids into alkanes (or alkenes) using only blue light, without the addition of expensive cofactors. Removing a single carboxyl group from a fatty acid yields the corresponding alkane or alkene, and these products closely match the components of gasoline and diesel obtained by refining crude oil. Photocatalysis consumes little energy, is clean, and allows the reaction to be switched on and off easily; enzymatic catalysis is highly specific and proceeds under mild conditions. The light-driven enzyme FAP, which combines the advantages of both and meets the expectations of green development, has therefore become an emerging research focus. With growing demand and preference for green energy, preparing alkanes (alkenes) by decarboxylation of fatty acids has become a key route for biofuel development and has broad application prospects in the green chemical manufacture of biofuels.
In comparison, FAP uses light energy directly, which is more energy-saving, environmentally friendly, and convenient, and it does not introduce a double bond at the end of the carbon chain, so the product is the desired alkane. As a light-driven enzyme that uses blue light to convert fatty acids into alkanes (alkenes), FAP catalyzes fatty acid decarboxylation under mild conditions with high conversion (which can exceed 90%). The combustion heat value of the products is higher than that of ester fuel molecules, the only by-product is carbon dioxide, and the process requires only light energy. This is of great significance for environmental protection and represents a new field of application.
However, the only photodecarboxylases reported so far are CvFAP, CrFAP, EsiFAP, GsuFAP, and NgaFAP. Of these, only CvFAP has been studied in relative depth; the others have only been expressed to verify decarboxylation activity. CvFAP and the other reported photodecarboxylases show a clear preference for saturated fatty acids with 16 to 22 carbon atoms, and their catalytic decarboxylation activity toward short- and medium-chain saturated fatty acids with fewer than 12 carbon atoms drops significantly. (After 14 h of reaction, the yield of CvFAP-catalyzed lauric acid decarboxylation was 11%, and decarboxylation of n-hexanoic acid using the decoy-molecule approach produced only 1.6 mM product in 12 h; see "An algal photoenzyme converts fatty acids to hydrocarbons" and "Hydrocarbon synthesis via photoenzymatic decarboxylation of carboxylic acids".) The main components of gasoline are aliphatic hydrocarbons and cycloalkanes with 5 to 12 carbon atoms. Identifying photodecarboxylases that can catalyze the decarboxylation of short- and medium-chain fatty acids therefore holds broad promise both for enriching the range of photodecarboxylases and for expanding routes to green energy.
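As an illustration of the chain-length argument above, the following sketch computes the alkane expected when a saturated straight-chain fatty acid loses CO2 (CnH2nO2 gives C(n-1)H2n plus CO2) and checks whether the product's carbon number falls in the C5 to C12 range given above for gasoline. The function names and the range constant are hypothetical, introduced only for this example; they do not come from the patent.

```python
# Illustrative sketch only; all names here are assumptions, not from the patent.

# Carbon numbers of the main gasoline hydrocarbons (C5..C12), as stated in the text.
GASOLINE_CARBON_RANGE = range(5, 13)


def alkane_from_fatty_acid(n_carbons: int) -> str:
    """Return the molecular formula of the alkane formed when a saturated
    straight-chain fatty acid with n_carbons carbon atoms loses CO2."""
    if n_carbons < 2:
        raise ValueError("a fatty acid needs at least 2 carbon atoms")
    # A saturated acid CnH2nO2 loses CO2, leaving C(n-1)H2n.
    return f"C{n_carbons - 1}H{2 * n_carbons}"


def in_gasoline_range(n_carbons: int) -> bool:
    """True if the decarboxylation product has 5 to 12 carbon atoms."""
    return (n_carbons - 1) in GASOLINE_CARBON_RANGE


if __name__ == "__main__":
    # Hexanoic, octanoic, lauric and palmitic acid.
    for n in (6, 8, 12, 16):
        print(n, alkane_from_fatty_acid(n), in_gasoline_range(n))
```

Under these assumptions, fatty acids with 6 to 12 carbon atoms give C5 to C11 alkanes, which lie inside the gasoline range, whereas the C16 to C22 substrates preferred by the reported enzymes give longer products.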
Summary of the Invention
Based on this, one object of the invention is to provide a mutant of the fatty acid photodecarboxylase McFAP that can catalyze the decarboxylation of fatty acids with 6 to 12 carbon atoms.
The specific technical solutions for achieving this object are as follows:
A mutant of the fatty acid photodecarboxylase McFAP, whose amino acid sequence is shown in SEQ ID NO. 4.
The invention also provides a gene encoding the above mutant of the fatty acid photodecarboxylase McFAP, whose nucleotide sequence is shown in SEQ ID NO. 3.
The invention also provides the use of the above mutant of the fatty acid photodecarboxylase McFAP, or of its encoding gene, in catalyzing the decarboxylation of fatty acids.
In some embodiments, the fatty acid has 6 to 12 carbon atoms.
In some embodiments, the fatty acid has 7 to 8 carbon atoms.
In some embodiments, the fatty acid is a saturated straight-chain fatty acid.
The invention also provides a recombinant expression vector into which the above encoding gene has been inserted.
The invention also provides a recombinant engineered strain transformed with the above recombinant expression vector.
The invention also provides a method for preparing the mutant of the fatty acid photodecarboxylase McFAP, in which the mutant is obtained by expression in the above recombinant engineered strain followed by purification.
The invention also provides the use of the above recombinant expression vector or recombinant engineered strain in catalyzing the decarboxylation of fatty acids.
The invention also provides a method for catalyzing the decarboxylation of fatty acids, in which the catalytic reaction is carried out using whole cells of the above mutant of the fatty acid photodecarboxylase McFAP.
Compared with the prior art, the invention has the following beneficial effects:
In the present invention, drawing on many years of experience, the inventors constructed a mutant of the fatty acid photodecarboxylase McFAP by deletion mutation and found that it has a good decarboxylation effect on straight-chain fatty acids with 6 to 18 carbon atoms, and an especially good effect on medium-chain saturated straight-chain fatty acids with 6 to 12 carbon atoms (decarboxylation of C8:0 can exceed 90% within 30 min). The McFAP mutant of the invention adds to the known types of FAP and fills the gap left by existing FAPs, which cannot catalyze the decarboxylation of saturated straight-chain fatty acids with 6 to 12 carbon atoms. Its decarboxylation substrate spectrum is markedly wider and its decarboxylation performance better, which makes efficient and sustainable biosynthesis of fuels possible and gives it broad prospects for industrial application.
Brief Description of the Drawings
Figure 1 shows the results of the photoenzymatic decarboxylation verification reaction of McFAP@E. coli in Example 1 of the invention.
Figure 2 shows the SDS-PAGE protein profile of McFAP-S in Example 2 of the invention, where M: protein marker; 1: McFAP-S total cells; 2: McFAP-S supernatant; 3: McFAP-S pellet; 4: McFAP-S crude enzyme; 5: McFAP-S flow-through; 6: pure McFAP-S enzyme eluted with 0.5 M imidazole.
Figure 3 shows the reaction time curve of McFAP-S-catalyzed decarboxylation of n-octanoic acid in Example 3 of the invention.
Figure 4 shows the effect of the amount of McFAP-S enzyme added on the efficiency of catalytic decarboxylation of n-octanoic acid in Example 3 of the invention.
Figure 5 shows the effect of the concentration of the substrate n-octanoic acid on the catalytic decarboxylation efficiency of McFAP-S in Example 3 of the invention.
Figure 6 shows the effect of reaction temperature on the efficiency of McFAP-S-catalyzed decarboxylation of n-octanoic acid in Example 3 of the invention.
Figure 7 shows the effect of reaction pH on the efficiency of McFAP-S-catalyzed decarboxylation of n-octanoic acid in Example 3 of the invention.
Figure 8 shows the storage stability of McFAP-S in the dark at 4°C in Example 3 of the invention.
Figure 9 shows the pH tolerance of McFAP-S in Example 3 of the invention.
Figure 10 shows the investigation of the photoinactivation factors of McFAP-S in Example 3 of the invention.
Figure 11 shows the substrate scope study of McFAP@E. coli and McFAP-S@E. coli catalyzing the decarboxylation of saturated fatty acids of different chain lengths in Example 4 of the invention.
Figure 12 shows the results of the comparative experiment on the decarboxylation efficiency of palmitic acid catalyzed by McFAP@E. coli and CvFAP@E. coli in Example 5 of the invention.
Detailed Description
To facilitate understanding of the invention, the invention is described more fully below. The invention may be embodied in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that the disclosure of the invention will be understood more thoroughly and completely.
Unless otherwise defined, all technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the technical field to which the invention belongs. The terms used in the description of the invention are for the purpose of describing specific embodiments only and are not intended to limit the invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
In the present invention, the gene sequence of the photodecarboxylase McFAP from Micractinium conductrix (nucleotide sequence SEQ ID NO.1) was identified by sequence analysis of a gene database and obtained by gene synthesis. The gene was expressed in Escherichia coli BL21(DE3) as the host to obtain whole cells of McFAP (denoted McFAP@E. coli, the same below), and the photodecarboxylase McFAP was successfully expressed and purified (amino acid sequence SEQ ID NO.2). Its basic enzymatic properties were also investigated: McFAP is a blue-light-driven photodecarboxylase with broad tolerance for fatty acid substrate chain length, showing a good decarboxylation effect on saturated straight-chain fatty acids with 6 to 18 carbon atoms.
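A quick consistency check on a coding sequence such as SEQ ID NO.1 can be sketched as follows: confirm that its length is a multiple of three, then translate it with the standard genetic code. This is a minimal sketch rather than the inventors' method. The helper names are hypothetical, the standard code (NCBI translation table 1) is an assumption suited to expression in E. coli, and only the first 24 nucleotides of SEQ ID NO.1 are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding DNA sequence into one-letter amino acids.

    Stop codons are rendered as '*'. The sequence length must be a
    multiple of three and contain only A, C, G and T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First 24 nucleotides of SEQ ID NO.1 as given in the text.
    prefix = "ATGGCTGAAATGGCAGGTGGTGGT"
    print(translate(prefix))
    # The stated gene length of 3438 bp corresponds to 1146 codons.
    print(3438 % 3, 3438 // 3)
```

Applied to the full sequence, the same function would yield a protein sequence that can be compared against SEQ ID NO.2.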
SEQ ID NO.1 (gene encoding the fatty acid photodecarboxylase McFAP, 3438 bp):
ATGGCTGAAATGGCAGGTGGTGGTGAAGGTGATGGTATGCTGATGGGCGGCGCGGGTAGCGCAAACACTACCGACGCGTGTTATAGCGATCCGTCTAATCCGGATTGCGCAGCGTTTGAGCGCTCCGACGATGATTGGGCGGCGGACATCGAACTGCTGTGCTCTGCGATGCCGTTCATGCCGGGCTGCACCCTGGCGGAACAGTGCATGAATGGCACCGCCGCCGGTGAATATTGCGAAATGTCCAGTCTGGCTGGTAACATCTGTCTGGATATGCCGGGCATGAAAGGCTGTGAGGCATGGAACGCACTGTGTGGCGCGGCCAGCGCCGTTGAACAGTGTTCCTCTCCGGGCCCGGTTGTGGCACTCCCGACCACCGCGCTGGCCAAAGAAGGCCTGGAATCTCTGTGCTCTACCCATTGTATGGACGGTTGCCCAGACTGTGAAATGGGTAAACTGTGGAACACCTGCACCGACCCGCTGAGTGTTCTGGCGTGGATGTGCTACGCAATGCCGGACATGCCGGAATGTCTGGCTGCTCCGCAGGGCTCCGGCATGGTGGTGGCTTGCGGTGACGCTGAGGTTGCAGCTACCTTCCCGCTGGTGTGCGCGCAACCGCCGACCCCGGCGGCTAACTTTCAGCACCGCCTTCGTACCTGCCGTACCGCCGGCGTTGCGGCATCCGCATCCGGTTCTCCGGCAGTCACTATGGCTGGCCTGTCAACTGTTCTGGCAGTACTGGCACTGCTGCCATCCCCGGTTGCTATGGCCATGACTCCGATGCCGACCCCGGCGCTCGCTCCGGGCCCGGCGATCGACGATATCGGCGGTAACTGCCCGCTGCTGGGTCGCGGTAACATGGAAGCTCCGTGTTATAGCGACCCGAGCGCGGCAGCATGCGTTTCCTTTGAACGCAGCGATGCTGGCTGGGCGGATGACCTGAGTCAGCTGTGTTCTGCGATGCCGTATGCTGTTGGCTGCTGGCTGTGGCACTTGTGTAAAACCGGCGCAGCAAGCGGGACTTACTGTGCGCTGCCGTCCCTGACCGCGAACGTATGTGTTGACGCACCGCTGGTGAACGCTACATCAGCGCCGGGCTGCGAAGCGTGGGCCGCACTGTGCGGCGCCCAGGGTAGCGTCGTTGCGCAGTGCTCTGCGCCAGGCCCGCTGCCGGACATCATCAACACCCTGACCACCCGTGACGGCATCAACTCCCTCTGCGGTATGCATTACATGGATGGGTGTAACGAATGTACCCCTCACGAAGGTCCGGCAGTTCACGACTTCGCGGCCTGTGCTGATCCGGGTCCACTGCCGACTCTGGCCCACCAGTGTTACGCGATGCCTGAAATGGGTGAATGTACCCAGACTGGTATTACCGCAATGTGCAGCGGCGCTGAAGCTCGTGCGACCTTTCCGACCGTTTGCGTGGATCCACCTAACCCGACGACACTGGCGCCGGCGCCTGCCGTTTCTGCCTGCGATGTTGCGGCGGGTGCTGGCGCGCCACCAGCGGCGTCTGCCCGTCCGGCGTCGCACAGCCGCGCGTCACTGGTTGCCTCCCGTAGCGGCTTCTGCGCGCCTTCCCCGGCGCTGCGCTCTCAGCGCACCTCTACTGTCGCGCCGGCGCGCCGCGCCGCGTCGGCGCCGCGTGCGAGCGCAGTTGACGATATTCAACGTGCTCTGAGCACCGCTGGAAGCCCGGTATCCGGTAAACAGTACGATTACATCCTGGTGGGTGGCGGCACCGCGGCATGCGTTTTGGCTAACCGTTTAACCGCGGACGGTAGCAAACGTGTACTGGTGCTGGAAGCGGGTGCGGACAACGTGAGCCGCGATGTTAAAGTCCCGGCTGCGATCACCCGTTTGTTCCGTTCACCGTTGGATTGGAACTTGTTCAGCGAATTGCAGGAACAGCTGGCTGCACGTCAGATCTATATGGCTCGCGGCCGCTTGCTGGGTGGGTCTAGCGCGACCAATGCTACTCTTTACCACCGTGG
CGCGGCGGCGGATTATGATGCGTGGGGCGTGCCGGGCTGGGGCGCAGCTGACGTGCTGCCATGGTTCGTTAAGGCCGAAACCAACGCGGAGTTTGCGGCGGGCAAATATCACGGCGCAGGTGGTAACATGCGCGTTGAGAATCCGCGCTACTCCAACCCGCAGCTGCACGGTGCTTTCTTTGCAGCTGCGCAGCAGATGGGTCTGCCGCAGAATACCGACTTCAACAATTGGGATCAGGATCATGCAGGCTTTGGCACTTTTCAGGTTATGCAGGAAAAAGGCACCCGCGCTGATATGTACCGCCAGTATCTTAAACCAGCTCTTGGTCGTCCGAACCTGCAGGTTCTGACCGGTGCGTCTGTGACCAAAGTTCATATCGATAAAGCTGGCGGTAAACCGCGTGCTCTGGGCGTAGAGTTTTCTCTGGATGGTCCGGCTGGTGAACGTATGGCAGCAGAGCTGGCGCCGGGCGGTGAAGTTCTCATGTGCGCTGGCGCCGTGCATAGCCCGCACATTCTGCAGCTGTCTGGCGTTGGTTCGGCGGCTACTCTGGCAGACCACGGCATCGCAGCAGTGGCAGATCTGCCAGGTGTTGGTGCGAACATGCAGGACCAGCCGGCCTGCCTGACAGCGGCTCCCCTGAAAGACAAATACGATGGCATTTCGCTGACCGATCATATCTATAATAGCAAAGGCCAGATTCGCAAACGCGCTATCGCGTCCTACCTGCTTCAGGGTAAAGGTGGTCTGACGTCAACTGGCTGCGACCGTGGCGCGTTTGTACGTACCGCAGGCCAGGCACTGCCGGACCTGCAGGTGCGTTTCGTGCCAGGCATGGCACTGGATGCAGATGGTGTGTCCACCTACGTCCGTTTCGCAAAATTTCAGTCTCAGGGCCTGAAATGGCCGTCTGGCATCACCGTACAGCTTATTGCGTGTCGCCCGCACAGCAAAGGTTCTGTTGGCCTGAAAAACGCGGACCCGTTCACCCCGCCGAAACTGCGTCCGGGCTACCTGACCGACAAAGCGGGTGCGGATCTGGCGACCCTGCGCTCTGGTGTTCATTGGGCCCGTGATCTGGCATCTAGCGGTCCGCTGAGCGAATTTCTTGAAGGCGAACTGTTTCCGGGTAGCCAAGTTGTTTCCGATGATGATATTGATTCTTACATTCGTCGTACCATTCACTCCAGCAACGCGATTGTGGGCACCTGTCGTATGGGCGCGGCGGGTGAAGCGGGTGTTGTTGTGGATAACCAGCTGCGCGTTCAGGGTGTTGATGGTCTGCGTGTTGTTGACGCGAGCGTAATGCCGCGTATCCCAGGTGGTCAGGTGGGTGCGCCGGTTGTGATGCTGGCCGAACGTGCAGCAGCGATGCTGACCGGTCAGGCAGCGCTGGCTGGTGCTAGCGCTGCAGCTCCGCCGACCCCGGTCGCGGCTATGGCTGAAATGGCAGGTGGTGGTGAAGGTGATGGTATGCTGATGGGCGGCGCGGGTAGCGCAAACACTACCGACGCGTGTTATAGCGATCCGTCTAATCCGGATTGCGCAGGTTTGAGCGCTCCGACGATGATTGGGCGGCGGACATCGAACTGCTGTGCTCTGCGATGCCGTTCATGCCGGGCTGCACCCTGGCGGAACAGTGCATGAATGGCACCGCCGCCGGTGAATATTGCGAAATGTCCAGTCT GGCTGGTAACATCTGTCTGGATATGCCGGGCATGAAAGGCTGTGAGGCATGGAACGCACTGTGTGGCGCGGCCAGCGCCGTTGAACAGTGTTCCTCTCCGGGCCCGGTTGTGGCACTCCCGACCACCGCGCTGGCCAAAGAAGGCCTGGAATCTCTGTGCTCTACCCATTGTATGGACGGTTGCCCAGACTGTGAAATGGGTAAACTGTGGAACACCTGCACCGACCCGCTGAGTGTTCTGGCGTGGATGTGCTA 
CGCAATGCCGGACATGCCGGAATGTCTGGCTGCTCCGCAGGGCTCCGGCATGGTGGTGGCTTGCGGTGACGCTGAGGTTGCAGCTACCTTCCCGCTGGTGTGCGGCCAACCGCCGACCCCGGCGGCTAACTTTCAGCACCGCCTTCGTACCTGCCGTACCGCCGGCGTTGCGGCATCCGCATCCGGTTTCTCCGGCAGTCACTATGGCTGGCCTGTCAACTGTTCTGGCAGTACTGGCACTGCTGCCATCCCCGGTT GCTATGGCCATGACTCCGATGCCGACCCCGGCGCTCGCTCCGGGCCCGGCGATCGACGATATCGGCGGTAACTGCCCGCTGCTGGGTCGCGGTAACATGGAAGCTCCGTGTTATAGCGACCCGAGCGCGGCAGCATGCGTTTCCTTTGAACGCAGCGATGCTGGCTGGGCGGATGACCTGAGTCAGCTGTGTTCTGCGATGCCGTATGCTGTTGGCTGCTGGCTGTGGCACTTGTGTAAAACCGGCGCAGCAA GCGGGACTTACTGTGCGCTGCCGTCCCTGACCGCGAACGTATGTGTTGACGCACCGCTGGTGAACGCTACATCAGCGCCGGGCTGCGAAGCGTGGGCCGCACTGTGCGGCGCCCAGGGTAGCGTCGTTGCGCAGTGCTCTGCGCCAGGCCCGCTGCCGGACATCATCAACACCCTGACCACCCGTGACGGCATCAACTCCCTCTGCGGTATGCATTACATGGATGGGTGTAACGAATGTACCCCTCACGAAGGTCCGGCAGTT CACGACTTCGCGGCCTGTGCTGATCCGGGTCCACTGCCGACTCTGGCCCACCAGTGTTACGCGATGCCTGAAATGGGTGAATGTACCCAGACTGGTATTACCGCAATGTGCAGCGGCGCTGAAGCTCGTGCGACCTTTCCGACCGTTTGCGTGGATCCACCTAACCCGACGACACTGGCGCCGGCGCCTGCCGTTTCTGCCTGCGATGTTGCGGCGGGTGCTGGCGCGCCACCAGCGGCGTCTGCCCGTCCGGCGTCGC ACAGCCGCGTCACTGGTTGCCTCCCGTAGCGGCTTCTGCGCGCCTTCCCCGGCGCTGCGCTCTCAGCGCACCTCTACTGTCGCGCCGGCGCGCCGCGCCGCGTCGGCGCCGCGTGCGAGCGCAGTTGACGATATTCAACGTGCTCTGAGCACCGCTGGAAGCCCGGTATCCGGTAAACAGTACGATTACATCCTGGTGGGTGGCGGCACCGCGGCATGCGTTTTGGCTAACCGTTTAACCGCGGACGGTAGCAAACG TGTACTGGTGCTGGAAGCGGGTGCGGACAACGTGAGCCGCGATGTTAAAGTCCCGGCTGCGATCACCCGTTTGTTCCGTTCACCGTTGGATTGGAACTTGTTCAGCGAATTGCAGGAACAGCTGGCTGCACGTCAGATCTATATGGCTCGCGGCCGCTTGCTGGGTGGGTCTAGCGCGACCAATGCTACTCTTTACCACCGTGGCGCGGCGGCGGATTATGATGCGTGGGGCGTGCCGGGCTGGGGCGCAGCTGA CGTGCTGCCATGGTTCGTTAAGGCCGAAACCAACGCGGAGTTTGCGGCGGGCAAATATCACGGCGCAGGTGGTAACATGCGCGTTGAGAATCCGCGCTACTCCAACCCGCAGCTGCACGGTGCTTTCTTTGCAGCTGCGCAGCAGATGGGTCTGCCGCAGAATACCGACTTCAACAATTGGGATCAGGATCATGCAGGCTTTGGCACTTTTCAGGTTATGCAGGAAAAAGGCACCCGCGCTGATATGTACCGCCAG 
TATCTTAAACCAGCTTCTTGGTCGTCCGAACCTGCAGGTTCTGACCGGTGCGTCTGTGACCAAAGTTCATATCGATAAAGCTGGCGGTAAACCGCGTGCTCTGGGCGTAGAGTTTTCTCTGGATGGTCCGGCTGGTGAACGTATGGCAGCAGAGCTGGCGCCGGGCGGTGAAGTTCTCATGTGCGCTGGCGCCGTGCATAGCCCGCACAATTCTGCAGCTGTCTGGCGTTGGTTCGGCGGCTACTCTGGCAGACCAC GGCATCGCAGCAGTGGCAGATCTGCCAGGTGTTGGTGCGAACATGCAGGACCAGCCGGCCTGCCTGACAGCGGCTCCCCTGAAAGACAAATACGATGGCATTTCGCTGACCGATCATATCTATAATAGCAAAGGCCAGATTCGCAAACGCGCTATCGCGTCCTACCTGCTTCAGGGTAAAGGTGGTCTGACGTCAACTGGCTGCGACCGTGGCGCGTTTGTACGTACCGCAGGCCAGGCACTGCCGGACCTGCAGGTG CGTTTCGTGCCAGGCATGGCACTGGATGCAGATGGTGTGTCCACCTACGTCCGTTTCGCAAAATTTCAGTCTCAGGGCCTGAAATGGCCGTCTGGCATCACCGTACAGCTTATTGCGTGTCGCCCGCACAGCAAAGGTTCTGTTGGCCTGAAAAACGCGGACCCGTTCACCCCGCCGAAACTGCGTCCGGGCTACCTGACCGACAAAGCGGGTGCGGATCTGGCGACCCTGCGCTCTGGTGTTCATTGGGCCCGTGATCTGG CATCTAGCGGTCCGCTGAGCGAATTTCTTGAAGGCGAACTGTTTCCGGGTAGCCAAGTTGTTTCCGATGATGATATTGATTCTTACATTCGTCGTACCATTCACTCCAGCAACGCGATTGTGGGCACCTGTCGTATGGGCGCGGCGGGTGAAGCGGGTTGTTGTGGATAACCAGCTGCGCGTTCAGGGTGTTGATGGTCTGCGTGTTGTTGACGCGAGCGTAATGCCGCGTATCCCAGGTGGTCAGGTGGGTGC GCCGGTTGTGATGCTGGCCGAACGTGCAGCAGCGATGCTGACCGGTCAGGCAGCGCTGGCTGGTGCTAGCGCTGCAGCTCCGCCGACCCCGGTCGCGGCT
SEQ ID NO.2 (amino acid sequence of the fatty acid photodecarboxylase McFAP):
MAEMAGGGEGDGMLMGGAGSANTTDACYSDPSNPDCAAFERSDDDWAADIELLCSAMPFMPGCTLAEQCMNGTAAGEYCEMSSLAGNICLDMPGMKGCEAWNALCGAASAVEQCSSPGPVVALPTTALAKEGLESLCSTHCMDGCPDCEMGKLWNTCTDPLSVLAWMCYAMPDMPECLAAPQGSGMVVACGDAEVAATFPLVCAQPPTPAANFQHRLRTCRTAGVAASASGSPAVTMAGLSTVLAVLALLPSPVAMAMTPMPTPALAPGPAIDDIGGNCPLLGRGNMEAPCYSDPSAAACVSFERSDAGWADDLSQLCSAMPYAVGCWLWHLCKTGAASGTYCALPSLTANVCVDAPLVNATSAPGCEAWAALCGAQGSVVAQCSAPGPLPDIINTLTTRDGINSLCGMHYMDGCNECTPHEGPAVHDFAACADPGPLPTLAHQCYAMPEMGECTQTGITAMCSGAEARATFPTVCVDPPNPTTLAPAPAVSACDVAAGAGAPPAASARPASHSRASLVASRSGFCAPSPALRSQRTSTVAPARRAASAPRASAVDDIQRALSTAGSPVSGKQYDYILVGGGTAACVLANRLTADGSKRVLVLEAGADNVSRDVKVPAAITRLFRSPLDWNLFSELQEQLAARQIYMARGRLLGGSSATNATLYHRGAAADYDAWGVPGWGAADVLPWFVKAETNAEFAAGKYHGAGGNMRVENPRYSNPQLHGAFFAAAQQMGLPQNTDFNNWDQDHAGFGTFQVMQEKGTRADMYRQYLKPALGRPNLQVLTGASVTKVHIDKAGGKPRALGVEFSLDGPAGERMAAELAPGGEVLMCAGAVHSPHILQLSGVGSAATLADHGIAAVADLPGVGANMQDQPACLTAAPLKDKYDGISLTDHIYNSKGQIRKRAIASYLLQGKGGLTSTGCDRGAFVRTAGQALPDLQVRFVPGMALDADGVSTYVRFAKFQSQGLKWPSGITVQLIACRPHSKGSVGLKNADPFTPPKLRPGYLTDKAGADLATLRSGVHWARDLASSGPLSEFLEGELFPGSQVVSDDDIDSYIRRTIHSSNAIVGTCRMGAAGEAGVVVDNQLRVQGVDGLRVVDASVMPRIPGGQVGAPVVMLAERAAAMLTGQAALAGASAAAPPTPVAAMAEMAGGGEGDGMLMGGAGSANTTDACYSDPSNPDCAAFERSDDDWAADIELLCSAMPFMPGCTLAEQCMNGTAAGEYCEMSSLAGNICLDMPGMKGCEAWNALCGAASAVEQCSSPGPVVALPTTALAKEGLESLCSTHCMDGCPDCEMGKLWNTCTDPLSVLAWMCYAMPDMPECLAAPQGSGMVVACGDAEVAATFPLVCAQPPTPAANFQHRLRTCRTAGVAASASGSPAVTMAG LSTVLAVLALLPSPVAMAMTPMPTPALAPGPAIDDIGGNCPLLGRGNMEAPCYSDPSAAACVSFERSDAGWADDLSQLCSAMPYAVGCWLWHLCKTGAASGTYCALPSLTANVCVDAPLVNATSAPGCEAWAALCGAQGSVVAQCSAPGPLPDIINTLTTRDGINSLCGMHYMDGCNECTPHEGPAVHDFAACADPGPLPTLAHQCYAMPEMGECTQTGITAMCSGAEARATFPTVCVDP PNPTTLAPAPAVSACDVAAGAGAPPAASARPASHSRASLVASRSGFCAPSPALRSQRTSTVAPARRAASAPRASAVDDIQRALSTAGSPVSGKQYDYILVGGGTAACVLANRLTADGSKRVLEAGADNVSRDVKVPAAITRLFRSPLDWNLFSELQEQLAARQIYMARGRLLGGSSATNATLYHRGAAADYDAWGVPGWGAADVLPWFVKAETNAEFAAGKYHGAGGNMR 
VENPRYSNPQLHGAFFAAAQQMGLPQNTDFNNWDQDHAGFGTFQVMQEKGTRADMYRQYLKPALGRPNLQVLTGASVTKVHIDKAGGKPRALGVEFSLDGPAGERMAAELAPGGEVLMCAGAVHSPHILQLSGVGSAATLADHGIAAVADLPGVGANMQDQPACLTAAPLKDKYDGISLTDHIYNSKGQIRKRAIASYLLQGKGGLTST GCDRGAFVRTAGQALPDLQVRFVPGMALDADGVSTYVRFAKFQSQGLKWPSGITVQLIACRPPHSKGSVGLKNADPFTPPKLRPGYLTDKAGADLATLRSGVHWARDLASSGPLSEFLEGELFPGSQVVSDDDDIDSYIRRTIHSSNAIVGTCRMGAAGEAGVVVDNQLRVQGVDGLRVVDASVMPRIPGGQVGAPVVMLAERAAAMLTGQAAL AGASAAAPPTPVAA
A mutant of the fatty acid photodecarboxylase McFAP (hereinafter McFAP-S) was obtained by a deletion mutation that truncates the N-terminus of McFAP, and the pure mutant enzyme was obtained by nickel-column purification (nucleotide sequence shown in SEQ ID NO.3, amino acid sequence shown in SEQ ID NO.4). The catalytic performance of the mutant is greatly improved, and it catalyzes the decarboxylation of medium-chain fatty acids with 6 to 12 carbon atoms. This enriches the range of known FAPs and adds to the FAP enzymes available for the photocatalytic decarboxylation of short-chain fatty acids.
SEQ ID NO.3 (gene encoding the mutant of the fatty acid photodecarboxylase McFAP):
CGTGCGAGCGCAGTTGACGATATTCAACGTGCTCTGAGCACCGCTGGAAGCCCGGTATCCGGTAAACAGTACGATTACATCCTGGTGGGTGGCGGCACCGCGGCATGCGTTTTGGCTAACCGTTTAACCGCGGACGGTAGCAAACGTGTACTGGTGCTGGAAGCGGGTGCGGACAACGTGAGCCGCGATGTTAAAGTCCCGGCTGCGATCACCCGTTTGTTCCGTTCACCGTTGGATTGGAACTTGTTCAGCGAATTGCAGGAACAGCTGGCTGCACGTCAGATCTATATGGCTCGCGGCCGCTTGCTGGGTGGGTCTAGCGCGACCAATGCTACTCTTTACCACCGTGGCGCGGCGGCGGATTATGATGCGTGGGGCGTGCCGGGCTGGGGCGCAGCTGACGTGCTGCCATGGTTCGTTAAGGCCGAAACCAACGCGGAGTTTGCGGCGGGCAAATATCACGGCGCAGGTGGTAACATGCGCGTTGAGAATCCGCGCTACTCCAACCCGCAGCTGCACGGTGCTTTCTTTGCAGCTGCGCAGCAGATGGGTCTGCCGCAGAATACCGACTTCAACAATTGGGATCAGGATCATGCAGGCTTTGGCACTTTTCAGGTTATGCAGGAAAAAGGCACCCGCGCTGATATGTACCGCCAGTATCTTAAACCAGCTCTTGGTCGTCCGAACCTGCAGGTTCTGACCGGTGCGTCTGTGACCAAAGTTCATATCGATAAAGCTGGCGGTAAACCGCGTGCTCTGGGCGTAGAGTTTTCTCTGGATGGTCCGGCTGGTGAACGTATGGCAGCAGAGCTGGCGCCGGGCGGTGAAGTTCTCATGTGCGCTGGCGCCGTGCATAGCCCGCACATTCTGCAGCTGTCTGGCGTTGGTTCGGCGGCTACTCTGGCAGACCACGGCATCGCAGCAGTGGCAGATCTGCCAGGTGTTGGTGCGAACATGCAGGACCAGCCGGCCTGCCTGACAGCGGCTCCCCTGAAAGACAAATACGATGGCATTTCGCTGACCGATCATATCTATAATAGCAAAGGCCAGATTCGCAAACGCGCTATCGCGTCCTACCTGCTTCAGGGTAAAGGTGGTCTGACGTCAACTGGCTGCGACCGTGGCGCGTTTGTACGTACCGCAGGCCAGGCACTGCCGGACCTGCAGGTGCGTTTCGTGCCAGGCATGGCACTGGATGCAGATGGTGTGTCCACCTACGTCCGTTTCGCAAAATTTCAGTCTCAGGGCCTGAAATGGCCGTCTGGCATCACCGTACAGCTTATTGCGTGTCGCCCGCACAGCAAAGGTTCTGTTGGCCTGAAAAACGCGGACCCGTTCACCCCGCCGAAACTGCGTCCGGGCTACCTGACCGACAAAGCGGGTGCGGATCTGGCGACCCTGCGCTCTGGTGTTCATTGGGCCCGTGATCTGGCATCTAGCGGTCCGCTGAGCGAATTTCTTGAAGGCGAACTGTTTCCGGGTAGCCAAGTTGTTTCCGATGATGATATTGATTCTTACATTCGTCGTACCATTCACTCCAGCAACGCGATTGTGGGCACCTGTCGTATGGGCGCGGCGGGTGAAGCGGGTGTTGTTGTGGATAACCAGCTGCGCGTTCAGGGTGTTGATGGTCTGCGTGTTGTTGACGCGAGCGTAATGCCGCGTATCCCAGGTGGTCAGGTGGGTGCGCCGGTTGTGATGCTGGCCGAACGTGCAGCAGCGATGCTGACCGGTCAGGCAGCGCTGGCTGGTGCTAGCGCTGCAGCTCCGCCGACCCCGGTCGCGGCTCGTGCGAGCGCAGTTGACGATATTCAACGTGCTCTGAGCACCGCTGGAAGCCCGGTATCCGGTAAACAGTACGATTACATCCTGGTGGGTGGCGGCACCGCGGCATGCGTTTTGGCTAACCGTTTAACCGCGGACGGTAGCAAACGTGTACTGGTGCTGGAAGCGGGTGCGGACAACGTGAGCCGCGATGTTAAAGTCCCGGCTGCGATCAC
CCGTTTGTTCCGTTCACCGTTGGATTGGAACTTGTTCAGCGAATT GCAGGAACAGCTGGCTGCACGTCAGATCTATATGGCTCGCGGCCGCTTGCTGGGTGGGTCTAGCGCGACCAATGCTACTCTTTACCACCGTGGCGCGGCGGCGGATTATGATGCGTGGGGCGTGCCGGGCTGGGGCGCAGCTGACGTGCTGCCATGGTTCGTTAAGGCCGAAACCAACGCGGAGTTTGCGGCGGGCAAATATCACGGCGCAGGTGGTAACATGCGCGTTGAGAATCCGCGCTACTCCAACCCGCAG CTGCACGGTGCTTTCTTTGCAGCTGCGCAGCAGATGGGTCTGCCGCAGAATACCGACTTCAACAATTGGGATCAGGATCATGCAGGCTTTGGCACTTTTCAGGTTATGCAGGAAAAAGGCACCCGCGCTGATATGTACCGCCAGTATCTTAAACCAGCTTCTTGGTCGTCCGAACCTGCAGGTTCTGACCGGTGCGTCTGTGACCAAAGTTCATATCGATAAAGCTGGCGGTAAACCGCGTGCTCTGGGCGTAGA GTTTTCTCTGGATGGTCCGGCTGGTGAACGTATGGCAGCAGAGCTGGCGCGGCGGTGAAGTTCTCATGTGCGCTGGCCGTGCATAGCCCGCACATTCTGCAGCTGTCTGGCGTTGGTTCGGCGGCTACTCTGGCAGACCACGGCATCGCAGCAGTGGCAGATCTGCCAGGTGTTGGTGCGAACATGCAGGACCAGCCGGCCTGCCTGACAGCGGCTCCCCTGAAAGACAAATACGATGGCATTTCGCTGACCGA TCATATCTATAATAGCAAAGGCCAGATTCGCAAACGCGCTATCGCGTCCTACCTGCTTCAGGGTTAAAGGTGGTCTGACGTCAACTGGCTGCGACCGTGGCGCGTTTGTACGTACCGCAGGCCAGGCACTGCCGGACCTGCAGGTGCGTTTCGTGCCAGGCATGGCACTGGATGCAGATGGTGTGTCCACCTACGTCCGTTTCGCAAAATTTCAGTCTCAGGGCCTGAAATGGCCGTCTTGGCATCACCGTACAGCTT ATTGCGTGTCGCCCGCACAGCAAAGGTTCTGTTGGCCTGAAAAACGCGGACCCGTTCACCCCGCCGAAACTGCGTCCGGGCTACCTGACCGACAAAGCGGGTGCGGATCTGGCGACCCTGCGCTCTGGTGTTCATTGGGCCCGTGATCTGGCATCTAGCGGTCCGCTGAGCGAATTTCTTGAAGGCGAACTGTTTCCGGGTAGCCAAGTTGTTTCCGATGATGATATTGATTCTTACATTCGTCGTACCATTCACTCCAGCAA CGCGATTGTGGGCACCTGTCGTATGGGCGCGGCGGGTGAAGCGGGTTGTTGTGGATAACCAGCTGCGCGTTCAGGGTGTTGATGGTCTGCGTGTTGTTGACGCGAGCGTAATGCCGCGTATCCCAGGTGGTCAGGTGGGTGCCGGTTGTGATGCTGGCCGAACGTGCAGCAGCGATGCTGACCGGTCAGGCAGCGCTGGCTGGTGCTAGCGCTGCAGCTCCGCCGACCCCGGTCGCGGCT
SEQ ID NO.4 (amino acid sequence of the mutant of the fatty acid photodecarboxylase McFAP):
RASAVDDIQRALSTAGSPVSGKQYDYILVGGGTAACVLANRLTADGSKRVLVLEAGADNVSRDVKVPAAITRLFRSPLDWNLFSELQEQLAARQIYMARGRLLGGSSATNATLYHRGAAADYDAWGVPGWGAADVLPWFVKAETNAEFAAGKYHGAGGNMRVENPRYSNPQLHGAFFAAAQQMGLPQNTDFNNWDQDHAGFGTFQVMQEKGTRADMYRQYLKPALGRPNLQVLTGASVTKVHIDKAGGKPRALGVEFSLDGPAGERMAAELAPGGEVLMCAGAVHSPHILQLSGVGSAATLADHGIAAVADLPGVGANMQDQPACLTAAPLKDKYDGISLTDHIYNSKGQIRKRAIASYLLQGKGGLTSTGCDRGAFVRTAGQALPDLQVRFVPGMALDADGVSTYVRFAKFQSQGLKWPSGITVQLIACRPHSKGSVGLKNADPFTPPKLRPGYLTDKAGADLATLRSGVHWARDLASSGPLSEFLEGELFPGSQVVSDDDIDSYIRRTIHSSNAIVGTCRMGAAGEAGVVVDNQLRVQGVDGLRVVDASVMPRIPGGQVGAPVVMLAERAAAMLTGQAALAGASAAAPPTPVAARASAVDDIQRALSTAGSPVSGKQYDYILVGGGTAACVLANRLTADGSKRVLVLEAGADNVSRDVKVPAAITRLFRSPLDWNLFSELQEQLAARQIYMARGRLLGGSSATNATLYHRGAAADYDAWGVPGWGAADVLPWFVKAETNAEFAAGKYHGAGGNMRVENPRYSNPQLHGAFFAAAQQMGLPQNTDFNNWDQDHAGFGTFQVMQEKGTRAD MYRQYLKPALGRPNLQVLTGASVTKVHIDKAGGKPRALGVEFSLDGPAGERMAAELAPGGEVLMCAGAVHSPHILQLSGVGSAATLADHGIAAVADLPGVGANMQDQPACLTAAPLKDKYDGISLTDHIYNSKGQIRKRAIASYLLQGKGGLTSTGCDRGAFVRTAGQALPDLQVRFVPGMALDADGVSTYVRFAKFQSQGLKWPSGI TVQLIACRPHSKGSVGLKNADPFTPPKLRPGYLTDKAGADLATLRSGVHWARDLASSGPLSEFLEGELFPGSQVVSDDDIDSYIRRTIHSSNAIVGTCRMGAAGEAGVVVDNQLRVQGVDGLRVVDASVMPRIPGGQVGAPVVMLAERAAAMLTGQAALAGASAAAPPTPVAA
In the following examples, fatty acid and alkane contents before and after the decarboxylation reaction were determined on an Agilent 7890B gas chromatography system (Agilent Technologies, Palo Alto, CA, USA) fitted with a KB-FFAP column (30 m × 0.25 mm, 0.25 μm). The method was: injection volume, 1 μL; injector temperature, 250 °C; split ratio, 30:1; detector temperature, 280 °C. Oven program: initial temperature 110 °C, held for 3.4 min; ramp at 25 °C/min to 190 °C, held for 2.1 min; ramp at 25 °C/min to 230 °C, held for 2 min; ramp at 30 °C/min to 250 °C, held for 12 min. Fatty acid and alkane standards were used to assign retention times. Standard solutions of these compounds at different concentrations, with n-octanol as internal standard, were analyzed by GC to build the calibration curves used for quantification.
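The oven program above fixes the length of each GC run. As a quick check, the short sketch below adds up the ramp and hold times from the stated setpoints; the function and variable names are illustrative and not part of the original method.

```python
# Oven program taken from the GC method above:
# (target temperature in °C, ramp rate in °C/min, hold time in min).
# A ramp rate of None marks the initial isothermal step.
OVEN_PROGRAM = [
    (110, None, 3.4),
    (190, 25.0, 2.1),
    (230, 25.0, 2.0),
    (250, 30.0, 12.0),
]


def total_run_time(program):
    """Return the summed ramp and hold times (minutes) of an oven program."""
    total = 0.0
    previous_temp = None
    for target, rate, hold in program:
        if rate is not None:
            # Time spent ramping from the previous setpoint to this one.
            total += (target - previous_temp) / rate
        total += hold
        previous_temp = target
    return total


run_time = total_run_time(OVEN_PROGRAM)
print(f"Total oven program time: {run_time:.2f} min")
```

With these setpoints the program runs just under 25 min per injection, not counting cool-down and re-equilibration.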
In the following examples, plasmid pET28a-McFAP was synthesized by Sangon Biotech (Shanghai) Co., Ltd.; the empty plasmid pET28a was from the applicant's laboratory stock; Escherichia coli BL21(DE3) competent cells were purchased from Weidi Biotechnology Co., Ltd.; the plasmid extraction kit was purchased from Sangon Biotech (Shanghai) Co., Ltd. All other chemicals were purchased from Sigma-Aldrich, TCI, or Aladdin at the highest purity available and were used without further purification.
The invention is described in detail below with reference to the drawings and specific examples.
Example 1 Preparation of McFAP@E. coli and verification of photoenzymatic fatty acid decarboxylation
The recombinant E. coli BL21(DE3) strain carrying plasmid pET28a-McFAP was grown at 37 °C in Terrific Broth (TB) containing 50 μg/mL kanamycin. When the OD600 reached 0.7 to 0.8, 0.5 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) was added and the cells were incubated at 17 °C for 20 h. Cells were harvested by centrifugation at 4000 rpm for 30 min at 4 °C, washed with Tris-HCl buffer (50 mM, pH 8, containing 100 mM NaCl), and centrifuged again (10000 rpm, 20 min, 4 °C). The cell pellet was resuspended 1:2 (w/v) in the same buffer, 1 mM phenylmethylsulfonyl fluoride (PMSF) and 5% (w/v) glycerol were added, and the suspension was frozen in liquid nitrogen and stored at -80 °C until use.
To verify the catalytic decarboxylation activity of McFAP, E. coli cells carrying the empty pET28a vector (denoted empty WC) were prepared by the same method.
500 μL of McFAP@E. coli (0.5 g/mL wet cell weight), 300 μL of a 170 mM fatty acid solution in DMSO, and 200 μL of Tris-HCl buffer (100 mM, pH 8.0) were added to a 5 mL transparent reaction vial, giving a total reaction volume of 1 mL. The vial was placed in a home-built photocatalytic reactor (a conventional reactor fitted with a blue-light source) and reacted for 12 h at 500 rpm and 30 °C under blue light (10 W, 220 V). After the reaction, the mixture was transferred to a 2 mL EP tube and extracted with 1 mL of ethyl acetate containing 25 mM n-octanol as internal standard (extraction volume ratio 1:1). The extract was centrifuged at 11000 rpm for 4 min, and the upper organic phase was transferred to a 2 mL GC vial for GC analysis. As a dark control, the same reaction system was run in the dark with all other conditions unchanged.
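The final concentrations in this 1 mL whole-cell reaction follow from the stated stock concentrations and volumes by simple dilution (C1·V1 = C2·V2). The sketch below works them out; the helper name is illustrative, and the calculation assumes the volumes are additive.

```python
def final_concentration(stock_conc, stock_volume_ul, total_volume_ul):
    """Dilution rule C1*V1 = C2*V2, returning C2 in the units of stock_conc."""
    return stock_conc * stock_volume_ul / total_volume_ul


TOTAL_UL = 1000.0  # 500 uL cells + 300 uL substrate stock + 200 uL buffer

# 300 uL of 170 mM fatty acid in DMSO
fatty_acid_mM = final_concentration(170.0, 300.0, TOTAL_UL)
# 500 uL of a 0.5 g/mL wet-cell suspension
cells_g_per_mL = final_concentration(0.5, 500.0, TOTAL_UL)
# DMSO enters only with the substrate stock
dmso_percent_v_v = 100.0 * 300.0 / TOTAL_UL

print(f"fatty acid: {fatty_acid_mM:.0f} mM")
print(f"wet cells:  {cells_g_per_mL:.2f} g/mL")
print(f"DMSO:       {dmso_percent_v_v:.0f}% (v/v)")
```

Under these assumptions the reaction contains 51 mM fatty acid, 0.25 g/mL wet cells, and 30% (v/v) DMSO.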
As a further control, McFAP@E. coli was replaced with empty WC and the fatty acid decarboxylation reaction was run with all other conditions unchanged.
The results are shown in Figure 1. Empty WC did not decarboxylate the fatty acid, and McFAP@E. coli did not decarboxylate it in the absence of blue light, whereas McFAP@E. coli did decarboxylate it under blue light. This confirms that McFAP@E. coli catalyzes the decarboxylation of fatty acid substrates under blue light.
Example 2 Construction and purification of the McFAP-S mutant
The McFAP gene is 3438 bp long and encodes 1146 amino acids. The constructed mutant McFAP-S consists of the 596 amino acids running from residue 551 (counted from the N-terminus) to the C-terminus. Its amino acid sequence is shown in SEQ ID NO.4 and its nucleotide sequence in SEQ ID NO.3.
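The lengths quoted for the truncation are mutually consistent, which the following sketch checks from the residue numbers alone. It assumes 1-based residue numbering and a coding sequence without a stop codon (3438 bp = 1146 × 3); the function name and the returned keys are illustrative.

```python
FULL_LENGTH_AA = 1146      # residues encoded by the 3438 bp McFAP gene
FIRST_KEPT_RESIDUE = 551   # first residue retained in McFAP-S (1-based)


def truncation_coordinates(full_length_aa, first_kept_residue):
    """Map an N-terminal truncation onto protein and nucleotide coordinates."""
    kept_aa = full_length_aa - first_kept_residue + 1
    first_nt = (first_kept_residue - 1) * 3 + 1  # first base of the first kept codon
    last_nt = full_length_aa * 3                 # last base of the last codon
    return {
        "kept_aa": kept_aa,
        "first_nt": first_nt,
        "last_nt": last_nt,
        "kept_bp": last_nt - first_nt + 1,
    }


coords = truncation_coordinates(FULL_LENGTH_AA, FIRST_KEPT_RESIDUE)
print(coords)
```

The retained region is 596 residues and 1788 bp, spanning nucleotides 1651 to 3438 of the full-length gene, in agreement with the target-gene size given for the PCR product.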
Primers were designed (Table 1) and, with the McFAP gene as template, PCR amplification (reaction system and program in Table 2) gave the target gene (1788 bp) and the pET28a vector (5362 bp).
Table 1 Primers used to construct McFAP-S
Table 2 PCR reaction system and program
The amplified target gene and vector were recovered following the Sangon SanPrep column PCR product gel recovery manual. After the DNA concentrations of the recovered target gene and vector were measured, seamless cloning was performed according to the requirements of the seamless cloning kit (reaction system and program in Table 2). The resulting plasmid was transformed and cultured, and samples were sent for sequencing; the clone with the correct sequence is the constructed deletion mutant McFAP-S.
McFAP-S was purified on a HisPrep™ FF 16/10 column, followed by a HiPrep™ 26/10 desalting column. The chromatography column was first equilibrated with loading buffer (50 mM Tris-HCl, 300 mM NaCl, 10 mM imidazole, 5% (v/v) glycerol, pH 9). The crude enzyme solution was then pumped onto the column, and after loading the column was washed with loading buffer until the baseline was stable. Bound protein was eluted with elution buffer (50 mM Tris-HCl, 300 mM NaCl, 500 mM imidazole, 5% (v/v) glycerol, pH 9), and the peak fractions were collected and checked by SDS-PAGE. Fractions from the peak containing the target protein were buffer-exchanged on the desalting column: the protein was applied to the column pre-equilibrated with desalting buffer (50 mM Tris-HCl, 150 mM NaCl, 5% (v/v) glycerol, pH 9) and eluted with the same buffer. The eluted protein was concentrated, aliquoted, snap-frozen in liquid nitrogen, and stored at -80 °C until use. The flow rate was 5 mL/min throughout.
The SDS-PAGE profile of McFAP-S after nickel-column purification and elution with 0.5 M imidazole is shown in Figure 2, which shows that McFAP-S was well expressed and purified.
Example 3 Characterization of the enzymatic properties of McFAP-S
Definition of FAP activity: one unit (1 U) is the amount of enzyme that converts 1 μmol of n-octanoic acid into 1 μmol of n-heptane in 1 min at 30 °C and 500 rpm under blue-light irradiation (10 W, 220 V).
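The unit definition above translates directly into a rate calculation. The sketch below shows one way to apply it; the product amount, reaction time, and enzyme mass are hypothetical placeholder values chosen only to illustrate the arithmetic, not measurements from this work.

```python
def enzyme_units(product_umol, reaction_time_min):
    """Activity in U: micromoles of n-heptane formed per minute."""
    return product_umol / reaction_time_min


def specific_activity(units, enzyme_mg):
    """Specific activity in U per mg of enzyme."""
    return units / enzyme_mg


# Hypothetical example: 0.5 umol n-heptane formed in 5 min by 2 mg of enzyme.
activity_U = enzyme_units(product_umol=0.5, reaction_time_min=5.0)
activity_U_per_mg = specific_activity(activity_U, enzyme_mg=2.0)

print(f"activity: {activity_U:.2f} U")
print(f"specific activity: {activity_U_per_mg:.3f} U/mg")
```

The definition holds only under the stated assay conditions (30 °C, 500 rpm, blue light), so activities measured under other conditions are not directly comparable.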
1. Optimization of reaction time
Twelve reaction times (5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, and 60 min) were tested for the McFAP-S-catalyzed decarboxylation of n-octanoic acid. Each reaction contained 150 μL of a 140 mM n-octanoic acid solution in DMSO (n-octanoic acid concentration in the reaction: 20 mM; DMSO content: 15%) and 40 μM enzyme, made up to a total volume of 1 mL with Tris-HCl buffer (100 mM, pH 9.0) in a 5 mL transparent reaction vial. The vial was placed in the home-built photocatalytic reactor and reacted for the chosen time at 500 rpm and 30 °C under blue light (10 W, 220 V). After the reaction, the mixture was transferred to a 2 mL EP tube and extracted with 1 mL of ethyl acetate containing 25 mM n-octanol as internal standard (extraction volume ratio 1:1). The extract was centrifuged at 11000 rpm for 4 min, and the upper organic phase was transferred to a 2 mL GC vial for GC analysis.
The results are shown in Figure 3. The McFAP-S-catalyzed decarboxylation of n-octanoic acid proceeds rapidly within the first 5 min, after which the conversion rises steadily and reaches 95% at 30 min.
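Conversion values such as these come from GC peak areas quantified against the n-octanol internal standard. The sketch below illustrates one common way to do this with a linear calibration of the area ratio; the slope, intercept, and peak areas are hypothetical, and the calculation assumes the analyte is extracted quantitatively into an equal volume of ethyl acetate, so that the extract concentration equals the reaction concentration.

```python
def concentration_from_ratio(analyte_area, istd_area, slope, intercept=0.0):
    """Invert a linear calibration: area ratio = slope * concentration + intercept."""
    ratio = analyte_area / istd_area
    return (ratio - intercept) / slope


def conversion_percent(product_mM, initial_substrate_mM):
    """Conversion as product formed relative to the substrate initially present."""
    return 100.0 * product_mM / initial_substrate_mM


# Hypothetical peak areas and calibration slope (area ratio per mM).
heptane_mM = concentration_from_ratio(
    analyte_area=1900.0, istd_area=2500.0, slope=0.04
)
conversion = conversion_percent(heptane_mM, initial_substrate_mM=20.0)

print(f"n-heptane: {heptane_mM:.1f} mM, conversion: {conversion:.0f}%")
```

In practice the slope and intercept would be taken from the standard curves prepared for each fatty acid and alkane.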
2. Optimization of enzyme loading
Four enzyme loadings (final concentrations of 6, 12, 24, and 36 μM) were tested for the McFAP-S-catalyzed decarboxylation of n-octanoic acid, with all other conditions and procedures as above.
The results are shown in Figure 4. The conversion increases as the enzyme loading in the reaction increases.
3. Optimization of substrate concentration
Five substrate concentrations (10, 20, 30, 40, and 50 mM) were tested for the McFAP-S-catalyzed decarboxylation of n-octanoic acid, with all other conditions and procedures as above.
The results are shown in Figure 5. As the substrate concentration in the reaction increases, the conversion decreases, but the actual rate of product formation is unchanged.
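A falling conversion alongside an unchanged formation rate is what one expects when the same absolute amount of product forms regardless of how much substrate is supplied. The sketch below illustrates that relationship with a hypothetical, constant amount of n-heptane formed; the 9 mM figure is a placeholder, not a value read from Figure 5.

```python
# Hypothetical amount of n-heptane formed in a fixed reaction time (mM).
PRODUCT_FORMED_MM = 9.0
# Substrate concentrations tested in this section (mM).
SUBSTRATE_LEVELS_MM = [10, 20, 30, 40, 50]


def conversion_percent(product_mM, substrate_mM):
    """Conversion as product formed relative to the substrate supplied."""
    return 100.0 * product_mM / substrate_mM


conversions = [
    conversion_percent(PRODUCT_FORMED_MM, s) for s in SUBSTRATE_LEVELS_MM
]

for substrate, conv in zip(SUBSTRATE_LEVELS_MM, conversions):
    print(f"{substrate:>2} mM substrate -> {conv:5.1f}% conversion")
```

This pattern is consistent with an enzyme that is already saturated with substrate at the lowest concentration tested, although the source does not state the cause.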
4. Optimization of reaction temperature
Five reaction temperatures (20, 30, 40, 45, and 50 °C) were tested for the McFAP-S-catalyzed decarboxylation of n-octanoic acid, with all other conditions and procedures as above.
The results are shown in Figure 6. The optimum temperature for the McFAP-S-catalyzed decarboxylation of n-octanoic acid is 40 °C. Relative activity is above 80% from 30 °C to 45 °C and drops rapidly above 45 °C.
5. Optimization of reaction pH
Five pH values (6, 7, 8, 9, and 10) were tested for the McFAP-S-catalyzed decarboxylation of n-octanoic acid, with all other conditions and procedures as above.
The results are shown in Figure 7. The optimum pH for the McFAP-S-catalyzed decarboxylation of n-octanoic acid is 8 to 9, but the conversion exceeds 80% across pH 6 to 10; that is, McFAP-S catalyzes the reaction well over the whole pH 6 to 10 range.
6. Storage stability
The storage stability of McFAP-S activity toward n-octanoic acid decarboxylation was examined at 4 °C in the dark. Residual activity was measured after twelve storage times (10 min, 30 min, 1 h, 3 h, 6 h, 12 h, 1 d, 2 d, 3 d, 5 d, 7 d, and 10 d), with all other conditions and procedures as above.
The results are shown in Figure 8. After 10 d of storage at 4 °C in the dark, McFAP-S retained more than 70% of its activity toward n-octanoic acid decarboxylation.
7. pH tolerance
The McFAP-S enzyme solution was incubated at five pH values (6, 7, 8, 9, and 10) at 4 °C, protected from light, with all other conditions and procedures as above.
The results are shown in Figure 9. After 5 d of incubation at 4 °C in the dark at pH 6 to 10, McFAP-S retained more than 50% of its activity toward n-octanoic acid decarboxylation. Within this range, pH had no significant effect on McFAP-S activity, although residual activity after incubation at pH 6 to 8 was slightly higher than at pH 9 to 10.
8. Factors in photoinactivation
The stability of McFAP-S activity toward n-octanoic acid decarboxylation was examined during storage at room temperature under different light conditions. The pure McFAP-S enzyme solution was incubated under five conditions (blue light, daylight, dark storage, red light, and red light with co-incubation in a system containing 5% DMSO and 10 mM n-octanoic acid), and residual activity was measured after five storage times (10 min, 30 min, 1 h, 2 h, and 3 h), with all other conditions and procedures as above.
The results are shown in Figure 10. At room temperature, blue light had the greatest effect on the activity of pure McFAP-S: residual activity fell below 10% after 10 min. After 3 h of daylight, residual activity was below 50%. In the dark or under red light, residual activity remained above 90% after 3 h. McFAP-S co-incubated with 10 mM n-octanoic acid substrate and 5% DMSO retained more than 90% of its activity after 3 h of daylight irradiation; that is, co-incubation with substrate markedly suppressed the photoinactivation of McFAP-S. For these reasons, the whole-cell catalyst is used in the following example for the substrate-scope study with saturated fatty acids of different chain lengths.
Example 4 Substrate scope of fatty acid decarboxylation catalyzed by McFAP@E. coli and McFAP-S@E. coli
Different fatty acid substrates (saturated straight-chain fatty acids with 6 to 18 carbon atoms) were subjected to photoenzymatic decarboxylation with a reaction time of 30 min. All other reaction conditions followed the photoenzymatic decarboxylation verification experiment of Example 1.
The results are shown in Figure 11. Both McFAP@E. coli and McFAP-S@E. coli catalyzed the decarboxylation of C6:0 to C18:0 fatty acids. At the same whole-cell loading, McFAP-S@E. coli decarboxylated C7:0 to C12:0 fatty acids significantly more efficiently than McFAP@E. coli.
Example 5 Comparison of the efficiency of McFAP@E. coli and CvFAP@E. coli in catalyzing palmitic acid decarboxylation
In this example, the decarboxylation of palmitic acid by McFAP@E. coli and CvFAP@E. coli (prepared in the same way as McFAP@E. coli) was compared at six reaction times (1, 2, 3, 4, 5, and 6 h). McFAP@E. coli and CvFAP@E. coli were each added at 0.25 g/mL. All other reaction conditions followed the photoenzymatic decarboxylation verification experiment of Example 1.
The results are shown in Figure 12. After 6 h of reaction, McFAP@E. coli converted more than 90% of the palmitic acid by decarboxylation. Under the same reaction conditions, CvFAP@E. coli produced 30 mM product after 6 h, a conversion of only about 60%. Thus, at the same whole-cell loading, McFAP@E. coli decarboxylates palmitic acid significantly more efficiently than CvFAP@E. coli.
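The conversion figures in Example 5 can be sanity-checked with simple arithmetic. The sketch below assumes, since this passage does not define it, that conversion is the product concentration divided by the initial substrate concentration. Under that assumption, 30 mM product at about 60% conversion implies roughly 50 mM palmitic acid at the start. The function names are hypothetical and used only for illustration.

```python
def conversion(product_mM: float, initial_substrate_mM: float) -> float:
    """Fractional conversion, assuming conversion = product / initial substrate."""
    return product_mM / initial_substrate_mM


def implied_initial_substrate(product_mM: float, conversion_fraction: float) -> float:
    """Initial substrate concentration implied by a product level and a conversion."""
    return product_mM / conversion_fraction


# CvFAP@E. coli after 6 h: 30 mM product at about 60% conversion (values from the text).
s0 = implied_initial_substrate(30.0, 0.60)
print(f"implied initial palmitic acid: {s0:.0f} mM")

# At that loading, conversion above 90% (McFAP@E. coli) would mean more than this much product.
print(f"product at 90% conversion: {0.90 * s0:.0f} mM")
```

The implied starting concentration is an inference from the two reported numbers, not a value stated in this passage; the substrate loading actually used is given by the reaction conditions of Example 1.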
The technical features of the embodiments described above may be combined in any manner. For brevity, not every possible combination of these technical features has been described; however, any combination that involves no contradiction should be considered within the scope of this specification.
The embodiments described above represent only several implementations of the present invention. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art may make a number of variations and improvements without departing from the concept of the present invention, and these all fall within its scope of protection. The scope of protection of this patent is therefore determined by the appended claims.
Sequence Listing
<110> South China University of Technology; Guangdong Youmei Biomanufacturing Research Institute Co., Ltd. (广东优酶生物制造研究院有限公司)
<120> Mutant of fatty acid photodecarboxylase McFAP and application thereof
<130> 1
<160> 8
<170> SIPOSequenceListing 1.0
<210> 1
<211> 3438
<212> DNA
<213> Artificial Sequence
<400> 1
atggctgaaa tggcaggtgg tggtgaaggt gatggtatgc tgatgggcgg cgcgggtagc 60
gcaaacacta ccgacgcgtg ttatagcgat ccgtctaatc cggattgcgc agcgtttgag 120
cgctccgacg atgattgggc ggcggacatc gaactgctgt gctctgcgat gccgttcatg 180
ccgggctgca ccctggcgga acagtgcatg aatggcaccg ccgccggtga atattgcgaa 240
atgtccagtc tggctggtaa catctgtctg gatatgccgg gcatgaaagg ctgtgaggca 300
tggaacgcac tgtgtggcgc ggccagcgcc gttgaacagt gttcctctcc gggcccggtt 360
gtggcactcc cgaccaccgc gctggccaaa gaaggcctgg aatctctgtg ctctacccat 420
tgtatggacg gttgcccaga ctgtgaaatg ggtaaactgt ggaacacctg caccgacccg 480
ctgagtgttc tggcgtggat gtgctacgca atgccggaca tgccggaatg tctggctgct 540
ccgcagggct ccggcatggt ggtggcttgc ggtgacgctg aggttgcagc taccttcccg 600
ctggtgtgcg cgcaaccgcc gaccccggcg gctaactttc agcaccgcct tcgtacctgc 660
cgtaccgccg gcgttgcggc atccgcatcc ggttctccgg cagtcactat ggctggcctg 720
tcaactgttc tggcagtact ggcactgctg ccatccccgg ttgctatggc catgactccg 780
atgccgaccc cggcgctcgc tccgggcccg gcgatcgacg atatcggcgg taactgcccg 840
ctgctgggtc gcggtaacat ggaagctccg tgttatagcg acccgagcgc ggcagcatgc 900
gtttcctttg aacgcagcga tgctggctgg gcggatgacc tgagtcagct gtgttctgcg 960
atgccgtatg ctgttggctg ctggctgtgg cacttgtgta aaaccggcgc agcaagcggg 1020
acttactgtg cgctgccgtc cctgaccgcg aacgtatgtg ttgacgcacc gctggtgaac 1080
gctacatcag cgccgggctg cgaagcgtgg gccgcactgt gcggcgccca gggtagcgtc 1140
gttgcgcagt gctctgcgcc aggcccgctg ccggacatca tcaacaccct gaccacccgt 1200
gacggcatca actccctctg cggtatgcat tacatggatg ggtgtaacga atgtacccct 1260
cacgaaggtc cggcagttca cgacttcgcg gcctgtgctg atccgggtcc actgccgact 1320
ctggcccacc agtgttacgc gatgcctgaa atgggtgaat gtacccagac tggtattacc 1380
gcaatgtgca gcggcgctga agctcgtgcg acctttccga ccgtttgcgt ggatccacct 1440
aacccgacga cactggcgcc ggcgcctgcc gtttctgcct gcgatgttgc ggcgggtgct 1500
ggcgcgccac cagcggcgtc tgcccgtccg gcgtcgcaca gccgcgcgtc actggttgcc 1560
tcccgtagcg gcttctgcgc gccttccccg gcgctgcgct ctcagcgcac ctctactgtc 1620
gcgccggcgc gccgcgccgc gtcggcgccg cgtgcgagcg cagttgacga tattcaacgt 1680
gctctgagca ccgctggaag cccggtatcc ggtaaacagt acgattacat cctggtgggt 1740
ggcggcaccg cggcatgcgt tttggctaac cgtttaaccg cggacggtag caaacgtgta 1800
ctggtgctgg aagcgggtgc ggacaacgtg agccgcgatg ttaaagtccc ggctgcgatc 1860
acccgtttgt tccgttcacc gttggattgg aacttgttca gcgaattgca ggaacagctg 1920
gctgcacgtc agatctatat ggctcgcggc cgcttgctgg gtgggtctag cgcgaccaat 1980
gctactcttt accaccgtgg cgcggcggcg gattatgatg cgtggggcgt gccgggctgg 2040
ggcgcagctg acgtgctgcc atggttcgtt aaggccgaaa ccaacgcgga gtttgcggcg 2100
ggcaaatatc acggcgcagg tggtaacatg cgcgttgaga atccgcgcta ctccaacccg 2160
cagctgcacg gtgctttctt tgcagctgcg cagcagatgg gtctgccgca gaataccgac 2220
ttcaacaatt gggatcagga tcatgcaggc tttggcactt ttcaggttat gcaggaaaaa 2280
ggcacccgcg ctgatatgta ccgccagtat cttaaaccag ctcttggtcg tccgaacctg 2340
caggttctga ccggtgcgtc tgtgaccaaa gttcatatcg ataaagctgg cggtaaaccg 2400
cgtgctctgg gcgtagagtt ttctctggat ggtccggctg gtgaacgtat ggcagcagag 2460
ctggcgccgg gcggtgaagt tctcatgtgc gctggcgccg tgcatagccc gcacattctg 2520
cagctgtctg gcgttggttc ggcggctact ctggcagacc acggcatcgc agcagtggca 2580
gatctgccag gtgttggtgc gaacatgcag gaccagccgg cctgcctgac agcggctccc 2640
ctgaaagaca aatacgatgg catttcgctg accgatcata tctataatag caaaggccag 2700
attcgcaaac gcgctatcgc gtcctacctg cttcagggta aaggtggtct gacgtcaact 2760
ggctgcgacc gtggcgcgtt tgtacgtacc gcaggccagg cactgccgga cctgcaggtg 2820
cgtttcgtgc caggcatggc actggatgca gatggtgtgt ccacctacgt ccgtttcgca 2880
aaatttcagt ctcagggcct gaaatggccg tctggcatca ccgtacagct tattgcgtgt 2940
cgcccgcaca gcaaaggttc tgttggcctg aaaaacgcgg acccgttcac cccgccgaaa 3000
ctgcgtccgg gctacctgac cgacaaagcg ggtgcggatc tggcgaccct gcgctctggt 3060
gttcattggg cccgtgatct ggcatctagc ggtccgctga gcgaatttct tgaaggcgaa 3120
ctgtttccgg gtagccaagt tgtttccgat gatgatattg attcttacat tcgtcgtacc 3180
attcactcca gcaacgcgat tgtgggcacc tgtcgtatgg gcgcggcggg tgaagcgggt 3240
gttgttgtgg ataaccagct gcgcgttcag ggtgttgatg gtctgcgtgt tgttgacgcg 3300
agcgtaatgc cgcgtatccc aggtggtcag gtgggtgcgc cggttgtgat gctggccgaa 3360
cgtgcagcag cgatgctgac cggtcaggca gcgctggctg gtgctagcgc tgcagctccg 3420
ccgaccccgg tcgcggct 3438
<210> 2
<211> 1146
<212> PRT
<213> Artificial Sequence
<400> 2
Met Ala Glu Met Ala Gly Gly Gly Glu Gly Asp Gly Met Leu Met Gly
1 5 10 15
Gly Ala Gly Ser Ala Asn Thr Thr Asp Ala Cys Tyr Ser Asp Pro Ser
20 25 30
Asn Pro Asp Cys Ala Ala Phe Glu Arg Ser Asp Asp Asp Trp Ala Ala
35 40 45
Asp Ile Glu Leu Leu Cys Ser Ala Met Pro Phe Met Pro Gly Cys Thr
50 55 60
Leu Ala Glu Gln Cys Met Asn Gly Thr Ala Ala Gly Glu Tyr Cys Glu
65 70 75 80
Met Ser Ser Leu Ala Gly Asn Ile Cys Leu Asp Met Pro Gly Met Lys
85 90 95
Gly Cys Glu Ala Trp Asn Ala Leu Cys Gly Ala Ala Ser Ala Val Glu
100 105 110
Gln Cys Ser Ser Pro Gly Pro Val Val Ala Leu Pro Thr Thr Ala Leu
115 120 125
Ala Lys Glu Gly Leu Glu Ser Leu Cys Ser Thr His Cys Met Asp Gly
130 135 140
Cys Pro Asp Cys Glu Met Gly Lys Leu Trp Asn Thr Cys Thr Asp Pro
145 150 155 160
Leu Ser Val Leu Ala Trp Met Cys Tyr Ala Met Pro Asp Met Pro Glu
165 170 175
Cys Leu Ala Ala Pro Gln Gly Ser Gly Met Val Val Ala Cys Gly Asp
180 185 190
Ala Glu Val Ala Ala Thr Phe Pro Leu Val Cys Ala Gln Pro Pro Thr
195 200 205
Pro Ala Ala Asn Phe Gln His Arg Leu Arg Thr Cys Arg Thr Ala Gly
210 215 220
Val Ala Ala Ser Ala Ser Gly Ser Pro Ala Val Thr Met Ala Gly Leu
225 230 235 240
Ser Thr Val Leu Ala Val Leu Ala Leu Leu Pro Ser Pro Val Ala Met
245 250 255
Ala Met Thr Pro Met Pro Thr Pro Ala Leu Ala Pro Gly Pro Ala Ile
260 265 270
Asp Asp Ile Gly Gly Asn Cys Pro Leu Leu Gly Arg Gly Asn Met Glu
275 280 285
Ala Pro Cys Tyr Ser Asp Pro Ser Ala Ala Ala Cys Val Ser Phe Glu
290 295 300
Arg Ser Asp Ala Gly Trp Ala Asp Asp Leu Ser Gln Leu Cys Ser Ala
305 310 315 320
Met Pro Tyr Ala Val Gly Cys Trp Leu Trp His Leu Cys Lys Thr Gly
325 330 335
Ala Ala Ser Gly Thr Tyr Cys Ala Leu Pro Ser Leu Thr Ala Asn Val
340 345 350
Cys Val Asp Ala Pro Leu Val Asn Ala Thr Ser Ala Pro Gly Cys Glu
355 360 365
Ala Trp Ala Ala Leu Cys Gly Ala Gln Gly Ser Val Val Ala Gln Cys
370 375 380
Ser Ala Pro Gly Pro Leu Pro Asp Ile Ile Asn Thr Leu Thr Thr Arg
385 390 395 400
Asp Gly Ile Asn Ser Leu Cys Gly Met His Tyr Met Asp Gly Cys Asn
405 410 415
Glu Cys Thr Pro His Glu Gly Pro Ala Val His Asp Phe Ala Ala Cys
420 425 430
Ala Asp Pro Gly Pro Leu Pro Thr Leu Ala His Gln Cys Tyr Ala Met
435 440 445
Pro Glu Met Gly Glu Cys Thr Gln Thr Gly Ile Thr Ala Met Cys Ser
450 455 460
Gly Ala Glu Ala Arg Ala Thr Phe Pro Thr Val Cys Val Asp Pro Pro
465 470 475 480
Asn Pro Thr Thr Leu Ala Pro Ala Pro Ala Val Ser Ala Cys Asp Val
485 490 495
Ala Ala Gly Ala Gly Ala Pro Pro Ala Ala Ser Ala Arg Pro Ala Ser
500 505 510
His Ser Arg Ala Ser Leu Val Ala Ser Arg Ser Gly Phe Cys Ala Pro
515 520 525
Ser Pro Ala Leu Arg Ser Gln Arg Thr Ser Thr Val Ala Pro Ala Arg
530 535 540
Arg Ala Ala Ser Ala Pro Arg Ala Ser Ala Val Asp Asp Ile Gln Arg
545 550 555 560
Ala Leu Ser Thr Ala Gly Ser Pro Val Ser Gly Lys Gln Tyr Asp Tyr
565 570 575
Ile Leu Val Gly Gly Gly Thr Ala Ala Cys Val Leu Ala Asn Arg Leu
580 585 590
Thr Ala Asp Gly Ser Lys Arg Val Leu Val Leu Glu Ala Gly Ala Asp
595 600 605
Asn Val Ser Arg Asp Val Lys Val Pro Ala Ala Ile Thr Arg Leu Phe
610 615 620
Arg Ser Pro Leu Asp Trp Asn Leu Phe Ser Glu Leu Gln Glu Gln Leu
625 630 635 640
Ala Ala Arg Gln Ile Tyr Met Ala Arg Gly Arg Leu Leu Gly Gly Ser
645 650 655
Ser Ala Thr Asn Ala Thr Leu Tyr His Arg Gly Ala Ala Ala Asp Tyr
660 665 670
Asp Ala Trp Gly Val Pro Gly Trp Gly Ala Ala Asp Val Leu Pro Trp
675 680 685
Phe Val Lys Ala Glu Thr Asn Ala Glu Phe Ala Ala Gly Lys Tyr His
690 695 700
Gly Ala Gly Gly Asn Met Arg Val Glu Asn Pro Arg Tyr Ser Asn Pro
705 710 715 720
Gln Leu His Gly Ala Phe Phe Ala Ala Ala Gln Gln Met Gly Leu Pro
725 730 735
Gln Asn Thr Asp Phe Asn Asn Trp Asp Gln Asp His Ala Gly Phe Gly
740 745 750
Thr Phe Gln Val Met Gln Glu Lys Gly Thr Arg Ala Asp Met Tyr Arg
755 760 765
Gln Tyr Leu Lys Pro Ala Leu Gly Arg Pro Asn Leu Gln Val Leu Thr
770 775 780
Gly Ala Ser Val Thr Lys Val His Ile Asp Lys Ala Gly Gly Lys Pro
785 790 795 800
Arg Ala Leu Gly Val Glu Phe Ser Leu Asp Gly Pro Ala Gly Glu Arg
805 810 815
Met Ala Ala Glu Leu Ala Pro Gly Gly Glu Val Leu Met Cys Ala Gly
820 825 830
Ala Val His Ser Pro His Ile Leu Gln Leu Ser Gly Val Gly Ser Ala
835 840 845
Ala Thr Leu Ala Asp His Gly Ile Ala Ala Val Ala Asp Leu Pro Gly
850 855 860
Val Gly Ala Asn Met Gln Asp Gln Pro Ala Cys Leu Thr Ala Ala Pro
865 870 875 880
Leu Lys Asp Lys Tyr Asp Gly Ile Ser Leu Thr Asp His Ile Tyr Asn
885 890 895
Ser Lys Gly Gln Ile Arg Lys Arg Ala Ile Ala Ser Tyr Leu Leu Gln
900 905 910
Gly Lys Gly Gly Leu Thr Ser Thr Gly Cys Asp Arg Gly Ala Phe Val
915 920 925
Arg Thr Ala Gly Gln Ala Leu Pro Asp Leu Gln Val Arg Phe Val Pro
930 935 940
Gly Met Ala Leu Asp Ala Asp Gly Val Ser Thr Tyr Val Arg Phe Ala
945 950 955 960
Lys Phe Gln Ser Gln Gly Leu Lys Trp Pro Ser Gly Ile Thr Val Gln
965 970 975
Leu Ile Ala Cys Arg Pro His Ser Lys Gly Ser Val Gly Leu Lys Asn
980 985 990
Ala Asp Pro Phe Thr Pro Pro Lys Leu Arg Pro Gly Tyr Leu Thr Asp
995 1000 1005
Lys Ala Gly Ala Asp Leu Ala Thr Leu Arg Ser Gly Val His Trp Ala
1010 1015 1020
Arg Asp Leu Ala Ser Ser Gly Pro Leu Ser Glu Phe Leu Glu Gly Glu
1025 1030 1035 1040
Leu Phe Pro Gly Ser Gln Val Val Ser Asp Asp Asp Ile Asp Ser Tyr
1045 1050 1055
Ile Arg Arg Thr Ile His Ser Ser Asn Ala Ile Val Gly Thr Cys Arg
1060 1065 1070
Met Gly Ala Ala Gly Glu Ala Gly Val Val Val Asp Asn Gln Leu Arg
1075 1080 1085
Val Gln Gly Val Asp Gly Leu Arg Val Val Asp Ala Ser Val Met Pro
1090 1095 1100
Arg Ile Pro Gly Gly Gln Val Gly Ala Pro Val Val Met Leu Ala Glu
1105 1110 1115 1120
Arg Ala Ala Ala Met Leu Thr Gly Gln Ala Ala Leu Ala Gly Ala Ser
1125 1130 1135
Ala Ala Ala Pro Pro Thr Pro Val Ala Ala
1140 1145
<210> 3
<211> 1788
<212> DNA
<213> Artificial Sequence
<400> 3
cgtgcgagcg cagttgacga tattcaacgt gctctgagca ccgctggaag cccggtatcc 60
ggtaaacagt acgattacat cctggtgggt ggcggcaccg cggcatgcgt tttggctaac 120
cgtttaaccg cggacggtag caaacgtgta ctggtgctgg aagcgggtgc ggacaacgtg 180
agccgcgatg ttaaagtccc ggctgcgatc acccgtttgt tccgttcacc gttggattgg 240
aacttgttca gcgaattgca ggaacagctg gctgcacgtc agatctatat ggctcgcggc 300
cgcttgctgg gtgggtctag cgcgaccaat gctactcttt accaccgtgg cgcggcggcg 360
gattatgatg cgtggggcgt gccgggctgg ggcgcagctg acgtgctgcc atggttcgtt 420
aaggccgaaa ccaacgcgga gtttgcggcg ggcaaatatc acggcgcagg tggtaacatg 480
cgcgttgaga atccgcgcta ctccaacccg cagctgcacg gtgctttctt tgcagctgcg 540
cagcagatgg gtctgccgca gaataccgac ttcaacaatt gggatcagga tcatgcaggc 600
tttggcactt ttcaggttat gcaggaaaaa ggcacccgcg ctgatatgta ccgccagtat 660
cttaaaccag ctcttggtcg tccgaacctg caggttctga ccggtgcgtc tgtgaccaaa 720
gttcatatcg ataaagctgg cggtaaaccg cgtgctctgg gcgtagagtt ttctctggat 780
ggtccggctg gtgaacgtat ggcagcagag ctggcgccgg gcggtgaagt tctcatgtgc 840
gctggcgccg tgcatagccc gcacattctg cagctgtctg gcgttggttc ggcggctact 900
ctggcagacc acggcatcgc agcagtggca gatctgccag gtgttggtgc gaacatgcag 960
gaccagccgg cctgcctgac agcggctccc ctgaaagaca aatacgatgg catttcgctg 1020
accgatcata tctataatag caaaggccag attcgcaaac gcgctatcgc gtcctacctg 1080
cttcagggta aaggtggtct gacgtcaact ggctgcgacc gtggcgcgtt tgtacgtacc 1140
gcaggccagg cactgccgga cctgcaggtg cgtttcgtgc caggcatggc actggatgca 1200
gatggtgtgt ccacctacgt ccgtttcgca aaatttcagt ctcagggcct gaaatggccg 1260
tctggcatca ccgtacagct tattgcgtgt cgcccgcaca gcaaaggttc tgttggcctg 1320
aaaaacgcgg acccgttcac cccgccgaaa ctgcgtccgg gctacctgac cgacaaagcg 1380
ggtgcggatc tggcgaccct gcgctctggt gttcattggg cccgtgatct ggcatctagc 1440
ggtccgctga gcgaatttct tgaaggcgaa ctgtttccgg gtagccaagt tgtttccgat 1500
gatgatattg attcttacat tcgtcgtacc attcactcca gcaacgcgat tgtgggcacc 1560
tgtcgtatgg gcgcggcggg tgaagcgggt gttgttgtgg ataaccagct gcgcgttcag 1620
ggtgttgatg gtctgcgtgt tgttgacgcg agcgtaatgc cgcgtatccc aggtggtcag 1680
gtgggtgcgc cggttgtgat gctggccgaa cgtgcagcag cgatgctgac cggtcaggca 1740
gcgctggctg gtgctagcgc tgcagctccg ccgaccccgg tcgcggct 1788
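The declared lengths in this listing are mutually consistent: 3438 nt (SEQ ID NO: 1) corresponds to 1146 codons (SEQ ID NO: 2), and 1788 nt (SEQ ID NO: 3) to 596 codons (SEQ ID NO: 4). As an illustrative check only, the sketch below translates the first 60 nt of SEQ ID NO: 3 with the standard genetic code, which is an assumption here because the listing does not name a translation table. The helper names are hypothetical. The result corresponds to the first 20 residues listed under SEQ ID NO: 4 (Arg Ala Ser Ala Val Asp Asp Ile Gln Arg ...).

```python
# Standard genetic code (NCBI translation table 1), bases ordered t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) in frame 1 to one-letter codes."""
    seq = "".join(dna.lower().split())
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 nt of SEQ ID NO: 3, copied from the listing.
seq3_start = "cgtgcgagcg cagttgacga tattcaacgt gctctgagca ccgctggaag cccggtatcc"
print(translate(seq3_start))

# Declared lengths: nucleotides versus residues.
print(3438 // 3, 1788 // 3)
```

The same function can be applied to a full sequence pasted from the listing, since whitespace between the 10-nt groups is ignored.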
<210> 4
<211> 596
<212> PRT
<213> Artificial Sequence
<400> 4
Arg Ala Ser Ala Val Asp Asp Ile Gln Arg Ala Leu Ser Thr Ala Gly
1 5 10 15
Ser Pro Val Ser Gly Lys Gln Tyr Asp Tyr Ile Leu Val Gly Gly Gly
20 25 30
Thr Ala Ala Cys Val Leu Ala Asn Arg Leu Thr Ala Asp Gly Ser Lys
35 40 45
Arg Val Leu Val Leu Glu Ala Gly Ala Asp Asn Val Ser Arg Asp Val
50 55 60
Lys Val Pro Ala Ala Ile Thr Arg Leu Phe Arg Ser Pro Leu Asp Trp
65 70 75 80
Asn Leu Phe Ser Glu Leu Gln Glu Gln Leu Ala Ala Arg Gln Ile Tyr
85 90 95
Met Ala Arg Gly Arg Leu Leu Gly Gly Ser Ser Ala Thr Asn Ala Thr
100 105 110
Leu Tyr His Arg Gly Ala Ala Ala Asp Tyr Asp Ala Trp Gly Val Pro
115 120 125
Gly Trp Gly Ala Ala Asp Val Leu Pro Trp Phe Val Lys Ala Glu Thr
130 135 140
Asn Ala Glu Phe Ala Ala Gly Lys Tyr His Gly Ala Gly Gly Asn Met
145 150 155 160
Arg Val Glu Asn Pro Arg Tyr Ser Asn Pro Gln Leu His Gly Ala Phe
165 170 175
Phe Ala Ala Ala Gln Gln Met Gly Leu Pro Gln Asn Thr Asp Phe Asn
180 185 190
Asn Trp Asp Gln Asp His Ala Gly Phe Gly Thr Phe Gln Val Met Gln
195 200 205
Glu Lys Gly Thr Arg Ala Asp Met Tyr Arg Gln Tyr Leu Lys Pro Ala
210 215 220
Leu Gly Arg Pro Asn Leu Gln Val Leu Thr Gly Ala Ser Val Thr Lys
225 230 235 240
Val His Ile Asp Lys Ala Gly Gly Lys Pro Arg Ala Leu Gly Val Glu
245 250 255
Phe Ser Leu Asp Gly Pro Ala Gly Glu Arg Met Ala Ala Glu Leu Ala
260 265 270
Pro Gly Gly Glu Val Leu Met Cys Ala Gly Ala Val His Ser Pro His
275 280 285
Ile Leu Gln Leu Ser Gly Val Gly Ser Ala Ala Thr Leu Ala Asp His
290 295 300
Gly Ile Ala Ala Val Ala Asp Leu Pro Gly Val Gly Ala Asn Met Gln
305 310 315 320
Asp Gln Pro Ala Cys Leu Thr Ala Ala Pro Leu Lys Asp Lys Tyr Asp
325 330 335
Gly Ile Ser Leu Thr Asp His Ile Tyr Asn Ser Lys Gly Gln Ile Arg
340 345 350
Lys Arg Ala Ile Ala Ser Tyr Leu Leu Gln Gly Lys Gly Gly Leu Thr
355 360 365
Ser Thr Gly Cys Asp Arg Gly Ala Phe Val Arg Thr Ala Gly Gln Ala
370 375 380
Leu Pro Asp Leu Gln Val Arg Phe Val Pro Gly Met Ala Leu Asp Ala
385 390 395 400
Asp Gly Val Ser Thr Tyr Val Arg Phe Ala Lys Phe Gln Ser Gln Gly
405 410 415
Leu Lys Trp Pro Ser Gly Ile Thr Val Gln Leu Ile Ala Cys Arg Pro
420 425 430
His Ser Lys Gly Ser Val Gly Leu Lys Asn Ala Asp Pro Phe Thr Pro
435 440 445
Pro Lys Leu Arg Pro Gly Tyr Leu Thr Asp Lys Ala Gly Ala Asp Leu
450 455 460
Ala Thr Leu Arg Ser Gly Val His Trp Ala Arg Asp Leu Ala Ser Ser
465 470 475 480
Gly Pro Leu Ser Glu Phe Leu Glu Gly Glu Leu Phe Pro Gly Ser Gln
485 490 495
Val Val Ser Asp Asp Asp Ile Asp Ser Tyr Ile Arg Arg Thr Ile His
500 505 510
Ser Ser Asn Ala Ile Val Gly Thr Cys Arg Met Gly Ala Ala Gly Glu
515 520 525
Ala Gly Val Val Val Asp Asn Gln Leu Arg Val Gln Gly Val Asp Gly
530 535 540
Leu Arg Val Val Asp Ala Ser Val Met Pro Arg Ile Pro Gly Gly GlnLeu Arg Val Val Asp Ala Ser Val Met Pro Arg Ile Pro Gly Gly Gln
545 550 555 560545 550 555 560
Val Gly Ala Pro Val Val Met Leu Ala Glu Arg Ala Ala Ala Met LeuVal Gly Ala Pro Val Val Met Leu Ala Glu Arg Ala Ala Ala Met Leu
565 570 575 565 570 575
Thr Gly Gln Ala Ala Leu Ala Gly Ala Ser Ala Ala Ala Pro Pro ThrThr Gly Gln Ala Ala Leu Ala Gly Ala Ser Ala Ala Ala Pro Pro Thr
580 585 590 580 585 590
Pro Val Ala AlaPro Val Ala Ala
595 595
<210> 5
<211> 42
<212> DNA
<213> Artificial Sequence
<400> 5
atgggtcgcg gatccgaatt ccgtgcgagc gcagttgacg at 42
<210> 6
<211> 39
<212> DNA
<213> Artificial Sequence
<400> 6
tgcggccgca agcttgtcga cagccgcgac cggggtcgg 39
<210> 7
<211> 27
<212> DNA
<213> Artificial Sequence
<400> 7
gaattcggat ccgcgaccca tttgctg 27
<210> 8
<211> 23
<212> DNA
<213> Artificial Sequence
<400> 8
gtcgacaagc ttgcggccgc act 23
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210708763.6A CN114854726B (en) | 2022-06-21 | 2022-06-21 | Mutant of fatty acid light decarboxylase McFAP and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114854726A | 2022-08-05 |
CN114854726B | 2023-09-19 |
Family ID: 82626110
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210708763.6A Active CN114854726B (en) | 2022-06-21 | 2022-06-21 | Mutant of fatty acid light decarboxylase McFAP and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114854726B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105925518A (en) * | 2015-02-26 | 2016-09-07 | Evonik Degussa GmbH | Olefin production |
CN108728470A (en) * | 2017-04-14 | 2018-11-02 | Institute of Microbiology, Chinese Academy of Sciences | Recombinant bacterium for producing beta-alanine, construction method and application thereof |
CN109477077A (en) * | 2016-05-20 | 2019-03-15 | Commissariat à l'énergie atomique et aux énergies alternatives | Novel fatty acid decarboxylase and its use |
CN112063608A (en) * | 2020-08-27 | 2020-12-11 | Zhejiang University of Technology | Fatty acid light decarboxylase mutant and application thereof in synthesis of L-glufosinate-ammonium |
CN112877347A (en) * | 2021-01-28 | 2021-06-01 | South China University of Technology | Multi-enzyme complex and construction method and application thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB201806483D0 (en) * | 2018-04-20 | 2018-06-06 | Univ Manchester | Hydrocarbon production |
US20210139879A1 (en) * | 2019-10-31 | 2021-05-13 | The Procter & Gamble Company | Consumer Product Compositions Comprising P450 Fatty Acid Decarboxylases |
Non-Patent Citations (2)
Title |
---|
An algal photoenzyme converts fatty acids to hydrocarbons; Damien Sorigué et al.; Science; Vol. 357 (No. 6354); pp. 903–907 * |
glucose-methanol-choline oxidoreductase [Micractinium conductrix], GenBank Accession No. PSC67760.1; Barney, B. et al.; GenBank; pp. 1-2 * |
Also Published As
Publication number | Publication date |
---|---|
CN114854726A (en) | 2022-08-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Amer et al. | Low carbon strategies for sustainable bio-alkane gas production and renewable energy | |
CN103261410B (en) | It is converted by the combination enzymatic of 3- hydroxyl alkane acids and produces alkene | |
AU2017260270B2 (en) | 3-methylcrotonic acid decarboxylase (MDC) variants | |
AU2019256747B2 (en) | Hydrocarbon production | |
CN112877347B (en) | Multi-enzyme complex and construction method and application thereof | |
JP2009136285A (en) | Cloning and sequencing of pyruvate decarboxylase (PDC) gene from bacteria and uses thereof | |
JP2023518197A (en) | Biomanufacturing systems and methods for producing organic products from recombinant microorganisms | |
CN102559718B (en) | Construction of thermophilic carboxylesterase gene engineering strain and application of carboxylesterase of strain | |
CN114854726B (en) | Mutant of fatty acid light decarboxylase McFAP and application thereof | |
CN103421734A (en) | High-level soluble expression method and application of recombined tyrosine decarboxylase | |
CN111926027B (en) | Phthalate ester hydrolase and preparation method and application thereof | |
CN116515805A (en) | Mutant of fatty acid decarboxylase OleTPM and its application | |
Wang et al. | The binding, synergistic and structural characteristics of BsEXLX1 for loosening the main components of lignocellulose: Lignin, xylan, and cellulose | |
CN114540319B (en) | Alkene reductase mutant, engineering bacterium and application of alkene reductase mutant and engineering bacterium in preparation of (2R, 5R) -dihydrocarvone | |
CN108715826A (en) | A method of not depending on the whole-cell catalytic synthesis aromatic compound of co-factor | |
CN110777126A (en) | High-temperature anaerobic long-chain alkanol alcohol dehydrogenase and application thereof | |
CN112226428B (en) | Oleate hydratase mutant and its application in the preparation of 10-hydroxystearic acid | |
CN107201356A (en) | Support the reduction chaperone combination of P450 decarboxylation of fatty acids enzymatic activitys and its apply | |
CN110004133B (en) | Oleic acid hydratase and its application in the synthesis of 10-hydroxystearic acid and 10-carbonylstearic acid | |
Hoeven et al. | Distributed biomanufacturing of liquefied petroleum gas | |
CN107674889A (en) | Method for synthesizing 1,2, 4-butanetriol through enzymatic reaction | |
CN103409397B (en) | High-temperature-resistant acid arabinosidase as well as coding gene and application thereof | |
CN108795833A (en) | Acetic acid CoA transferase defective escherichia coli engineering bacterias and its application | |
CN113943721B (en) | Lipase mutant and application thereof | |
CN114438057B (en) | A kind of heat-resistant and alkali-resistant xylanase and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||