CN114410659B - 三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 - Google Patents
三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 Download PDFInfo
- Publication number
- CN114410659B CN114410659B CN202210089004.6A CN202210089004A CN114410659B CN 114410659 B CN114410659 B CN 114410659B CN 202210089004 A CN202210089004 A CN 202210089004A CN 114410659 B CN114410659 B CN 114410659B
- Authority
- CN
- China
- Prior art keywords
- crtiso5
- gene
- leu
- fucoxanthin
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 80
- SJWWTRQNNRNTPU-ABBNZJFMSA-N fucoxanthin Chemical compound C[C@@]1(O)C[C@@H](OC(=O)C)CC(C)(C)C1=C=C\C(C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)C(=O)C[C@]1(C(C[C@H](O)C2)(C)C)[C@]2(C)O1 SJWWTRQNNRNTPU-ABBNZJFMSA-N 0.000 title claims abstract description 38
- AQLRNQCFQNNMJA-UHFFFAOYSA-N fucoxanthin Natural products CC(=O)OC1CC(C)(C)C(=C=CC(=CC=CC(=CC=CC=C(/C)C=CC=C(/C)C(=O)CC23OC2(C)CC(O)CC3(C)C)C)CO)C(C)(O)C1 AQLRNQCFQNNMJA-UHFFFAOYSA-N 0.000 title claims abstract description 38
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 37
- 241000206744 Phaeodactylum tricornutum Species 0.000 title claims abstract description 17
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 15
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 15
- 238000000338 in vitro Methods 0.000 claims abstract description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 7
- 239000002773 nucleotide Substances 0.000 claims description 6
- 230000000243 photosynthetic effect Effects 0.000 abstract description 12
- 241000195493 Cryptophyta Species 0.000 abstract description 6
- 241000894006 Bacteria Species 0.000 abstract description 5
- 239000002028 Biomass Substances 0.000 abstract description 3
- 238000009825 accumulation Methods 0.000 abstract description 3
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000003306 harvesting Methods 0.000 abstract description 2
- 238000000034 method Methods 0.000 description 14
- 239000013598 vector Substances 0.000 description 11
- 239000002299 complementary DNA Substances 0.000 description 9
- 235000021466 carotenoid Nutrition 0.000 description 8
- 150000001747 carotenoids Chemical class 0.000 description 8
- 108020004414 DNA Proteins 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 238000010362 genome editing Methods 0.000 description 6
- 241000206731 Phaeodactylum Species 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 108091027544 Subgenomic mRNA Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 230000029553 photosynthesis Effects 0.000 description 4
- 238000010672 photosynthesis Methods 0.000 description 4
- 239000002904 solvent Substances 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000009088 enzymatic function Effects 0.000 description 3
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 3
- 239000011714 flavin adenine dinucleotide Substances 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 239000000049 pigment Substances 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- 241000191472 Yamadazyma triangularis Species 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000003064 anti-oxidating effect Effects 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000004853 protein function Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- HGHOBRRUMWJWCU-FXQIFTODSA-N (4s)-4-[[(2s)-2-aminopropanoyl]amino]-5-[[(2s)-3-carboxy-1-(carboxymethylamino)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O HGHOBRRUMWJWCU-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- JEBFVOLFMLUKLF-IFPLVEIFSA-N Astaxanthin Natural products CC(=C/C=C/C(=C/C=C/C1=C(C)C(=O)C(O)CC1(C)C)/C)C=CC=C(/C)C=CC=C(/C)C=CC2=C(C)C(=O)C(O)CC2(C)C JEBFVOLFMLUKLF-IFPLVEIFSA-N 0.000 description 1
- 108010045123 Blasticidin-S deaminase Proteins 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 241000199914 Dinophyceae Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- MQZIGYBFDRPAKN-ZWAPEEGVSA-N astaxanthin Chemical compound C([C@H](O)C(=O)C=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C(=O)[C@@H](O)CC1(C)C MQZIGYBFDRPAKN-ZWAPEEGVSA-N 0.000 description 1
- 229940022405 astaxanthin Drugs 0.000 description 1
- 235000013793 astaxanthin Nutrition 0.000 description 1
- 239000001168 astaxanthin Substances 0.000 description 1
- AFYNADDZULBEJA-UHFFFAOYSA-N bicinchoninic acid Chemical compound C1=CC=CC2=NC(C=3C=C(C4=CC=CC=C4N=3)C(=O)O)=CC(C(O)=O)=C21 AFYNADDZULBEJA-UHFFFAOYSA-N 0.000 description 1
- 229930189065 blasticidin Natural products 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 229910001873 dinitrogen Inorganic materials 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 description 1
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000002705 metabolomic analysis Methods 0.000 description 1
- 230000001431 metabolomic effect Effects 0.000 description 1
- GBMDVOWEEQVZKZ-UHFFFAOYSA-N methanol;hydrate Chemical compound O.OC GBMDVOWEEQVZKZ-UHFFFAOYSA-N 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000012044 organic layer Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000013535 sea water Substances 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/405—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明公开了三角褐指藻CRTISO5基因、蛋白及在岩藻黄素合成中的应用,本发明首次公开了CRTISO5基因的一个新功能,其能够提高三角褐指藻中岩藻黄素含量;并且,用于在光合生物(植物、藻类、光合细菌)中能提高光合生物体内岩藻黄素含量,继而提高捕光效率,对光合作用效率和生物量积累有利,并且该基因所编码蛋白能够在生物体外用于岩藻黄素合成。
Description
技术领域
本发明属于生物技术领域,具体涉及三角褐指藻CRTISO5基因、蛋白及在岩藻黄素合成中的应用。
背景技术
类胡萝卜素广泛存在于光合生物中,具有抗氧化与捕捉光能两重特性。由于结构的多样性,不同的类胡萝卜素具有不同的应用价值。多种类胡萝卜素,例如岩藻黄素、虾青素等具有较高的市场价值。近年来,除了抗氧化,岩藻黄素还被报道具有抗癌症的功能。
此外,岩藻黄素在光合作用光能捕捉方面具有重要应用。蓝绿光在海水中穿透能力强,因而海洋藻类大多含有岩藻黄素,用于捕捉蓝绿波段可见光。岩藻黄素如被转移到其他光合生物中,可用于拓宽捕光光谱,提高光合作用效率。
目前,岩藻黄素的合成通路尚未被完全解析。更没有关于合成岩藻黄素的基因的相关报道。
发明内容
本发明的目的之一在于提供一种三角褐指藻(Phaeodactylum tricornutum)CRTISO5基因。
本发明的第二个目的是提供一种上述基因编码的蛋白。
本发明的最主要的一个目的在于提供该CRTISO5基因或蛋白的应用。
为了实现上述目的,本发明采用的技术方案概述如下:
一种CRTISO5基因,其核苷酸序列如SEQ ID NO.1所示,所述的核苷酸序列由1986个碱基组成,或在严格条件下与SEQ ID NO.1限定的DNA序列杂交的DNA分子。
上述基因编码的蛋白质(1),其氨基酸序列如SEQ ID No.2所示。所述的序列由661个氨基酸残基组成。
上述CRTISO5基因编码的蛋白还可以包括将SEQ ID NO.2氨基酸序列经过一个或多个((如1-30个;较佳地1-20个;更佳地1-10个;如5个,3个))氨基酸残基的取代、缺失或添加而形成的,且具有(1)蛋白功能的由(1)衍生的蛋白;或与(1)限定的蛋白序列有80%((较佳地90%以上,如95%,98%,99%或更高))以上同源性且具有(1)蛋白功能的由(1)衍生的蛋白。
含有CRTISO5基因的重组微生物,基因编辑载体均属于本发明的保护范围,其中所述重组微生物包括藻类、真菌或细菌。
本发明最重要的一个发明点在于公开了CRTISO5基因的一个新功能,本发明通过基因编辑获得crtiso5突变体,通过突变体和野生型的三角褐指藻相比,突变体的岩藻黄素含量比野生型低,但是积累了一种野生型中不存在的类胡萝卜素,根据其与已知化合物甲藻黄素的相似性,命名为7′,8′-双脱氢甲藻黄素。可见,CRTISO5基因或蛋白参与了岩藻黄素的合成,也就是说CRTISO5基因具有提高三角褐指藻中岩藻黄素含量的功能,可以用于提高微生物(藻类、真菌、细菌)生产岩藻黄素的产量;并且,用于在光合生物(植物、藻类、光合细菌)中能提高光合生物体内岩藻黄素含量,继而提高捕光效率,对光合作用效率和生物量积累有利,上述方式可以通过转基因的方式来实现。
另外,通过对CRTISO5蛋白的酶学功能的鉴定,结果发现,只有在CRTISO5蛋白存在的前提下,7′,8′-双脱氢甲藻黄素才会被转化为岩藻黄素,因此,CRTISO5基因所编码蛋白还可用于岩藻黄素的体外合成。
这里需要说明的是,本发明所保护的基因的功能,不仅包括上述CRTISO5基因,还包括与SEQ ID NO.1具有较高同源性(如同源性高于40%;较佳地高于50%;较佳地高于60%;更佳地高于70%;更佳地高于80%;更佳地高于90%;更佳地高于95%;更佳地高于98%)的同源基因。
本发明的优点:
在三角褐指藻中我们首次找到一个CRTISO5基因,该基因所编码蛋白负责岩藻黄素合成的最后一步,也就是说本发明首次公开了CRTISO5基因的一个新功能,其能够提高三角褐指藻中岩藻黄素含量。并且,用于在光合生物(植物、藻类、光合细菌)中能提高光合生物体内岩藻黄素含量,继而提高捕光效率,对光合作用效率和生物量积累有利,并且该基因所编码蛋白能够在生物体外用于岩藻黄素合成。
附图说明
图1是crtiso5突变体中CRTISO5基因被编辑的证据;
图2是crtiso5突变体岩藻黄素含量降低的证据;
图中,右图为细胞外观拍照,左图为高效液相色谱(HPLC)色素分析;
图3是crtiso5突变体中所积累色素,即CRTISO5蛋白底物结构的核磁共振鉴定结果,也就是7′,8′-双脱氢甲藻黄素的分子结构;
图4是用于在大肠杆菌中生产CRTISO5蛋白所用载体;
图5是实施例中采用CRTISO5蛋白体外合成岩藻黄素的结果。
具体实施方式
下面将通过具体实施例对本发明进行详细的描述。提供这些实施例是为了能够更透彻地理解本发明,并且能够将本发明的范围完整的传达给本领域的技术人员。
若未特别指明,实施例中所用技术手段为本领域技术人员所熟知的常规手段。下述实施例中的试验方法,如无特别说明,均为常规方法。如无特殊说明,所采用的试剂及材料,均可以通过商业途径获得。
除非另行定义,文中所使用的所有专业与科学用语与本领域熟练人员所熟悉的意义相同。此外,任何与所记载内容相似或均等的方法及材料皆可应用于本发明中。文中所述的较佳实施方法与材料仅作示范之用。
除非另有说明,本发明的实施将使用本领域技术人员显而易见的植物学常规技术、微生物、组织培养、分子生物学、化学、生物化学、DNA重组及生物信息学技术。这些技术均在已经公开的文献中进行了充分解释,另外,本发明所采用的DNA提取、系统发育树的构建、基因编辑方法、基因编辑载体的构建、基因编辑植物获得等方法,除了下述实施例采用的方法外,采用现有文献中已经公开的方法均能实现。
此处使用的“核酸”、“核酸序列”、“核苷酸”、“核酸分子”或“多聚核苷酸”术语意思是指包括分离的DNA分子(例如,cDNA或者基因组DNA),RNA分子(例如,信使RNA),自然类型,突变类型,合成的DNA或RNA分子,核苷酸类似物组成的DNA或RNA分子,单链或是双链结构。这些核酸或多聚核苷酸包括基因编码序列、反义序列及非编码区的调控序列,但不仅限于此。这些术语包括一个基因。“基因”或“基因序列”广泛用来指一有功能的DNA核酸序列。因此,基因可能包括基因组序列中的内含子和外显子,和/或包括cDNA中的编码序列,和/或包括cDNA及其调控序列。在特殊实施方案中,例如有关分离的核酸序列,优先默认其为cDNA。
实施例
一、CRTISO5基因的获取
利用转录组和代谢组等多种技术手段筛选得到一个三角褐指藻CRTISO5基因,该基因全长编码框核苷酸序列长度为1986bp,由661个氨基酸组成,其核苷酸序列如序列SEQID NO.1所示,其蛋白序列如SEQ ID NO.2所示。
二.crtiso5突变体的表型分析
此前,还没有关于三角褐指藻中CRTISO5基因功能报道,申请人进行了如下研究,首次发现了CRTISO5基因在三角褐指藻岩藻黄素合成中起着重要的作用。
1.CRTISO5基因编辑载体的构建
使用CRISPOR网站(http://crispor.tefor.net/)选取针对CRTISO5基因编辑的导向序列(sgRNA),根据所选sgRNA设计两条引物(表1),在退火后获得含粘性末端的双链DNA,克隆进含杀稻瘟素-S脱氨酶基因的目的载体,由三角褐指藻U6启动子负责sgRNA的表达。Cas9基因由γ微管蛋白启动子控制。
表1.CRTISO5 sgRNA引物
2.三角褐指藻接合转化与crtiso5突变体的获得
将含有上述载体与接合转化所需pTA-Mob载体的大肠杆菌与野生型三角褐指藻以1000:1的比例混合,浓缩后涂f/2培养基平板。两天后刮取细胞,涂于含5μg/mL杀稻瘟素的f/2培养基平板。
两周后挑取克隆,用液体培养基重悬,重新涂板获得亚克隆。挑取多个亚克隆,进行菌落PCR与测序。所获得的crtiso5突变体中的CRTISO5基因比野生型缺少15个碱基对(图1)。
3.crtiso5突变体的岩藻黄素表型分析与积累的类胡萝卜素结构鉴定
在恒定光(80μmol·m-2·s-1)条件下培养野生型与突变体。离心收集107细胞,加入250μL90%丙酮,在黑暗条件下超声混匀。离心后取上清,用高效液相色谱(HPLC)进行色素分析。
HPLC参数如下:
仪器:Thermo Ultimate 3000UHPLC;
温度:20℃;
流速:每分钟1mL;
进样体积:10μL;
梯度流动相:由溶剂A(甲醇∶水=90∶10)与溶剂B(乙酸乙酯)组成。0时:100%A,0%B;20-22分钟:0%A,100%B;23-28分钟:100%A;0%B。
HPLC结果表明crtiso5突变体的岩藻黄素含量比野生型低,可以看出CRTISO5基因或蛋白与岩藻黄素的合成有关,crtiso5突变体由于岩藻黄素含量低呈现绿色,但是积累了一种野生型中不存在的类胡萝卜素(图2)。
在HPLC过程中对所积累的类胡萝卜素进行收集,利用Bruker AVANCE NEO 600MHz核磁共振仪进行结构分析,解析出的分子结构见图3,根据其与已知化合物甲藻黄素的相似性,命名为7′,8′-双脱氢甲藻黄素,以下作为CRTISO5蛋白的底物在体外体系中对CRTISO5蛋白的酶学功能进行进一步的鉴定。
三.表达载体的构建及蛋白的获得
1.三角褐指藻CRTISO5基因cDNA(SEQ ID NO.1)的克隆
采用RNeasy Plant Minikit试剂盒提取野生型三角褐指藻RNA。采用SuperScriptTM III Reverse Transcriptase试剂盒进行反转录。以所获得总cDNA为模板PCR克隆CRTISO5基因cDNA。
PCR体系(表2):
表2. 20μL扩增体系
PCR循环:
1)94℃:5min;
2)94℃:30s;
3)55℃:30s;
4)72℃:2min;
步骤2)-4)循环35次;
6)72℃:5min。
PCR引物(表3)不仅包括与CRTISO5 cDNA序列重合序列(大写),也包括与目标载体同源序列(小写)。
表3.CRTISO5 cDNA扩增引物
通过Infusion同源重组法将PCR产物克隆进pMAL-c5x载体。该载体中CRTISO5基因在N端与MBP融合,C端具有多聚组氨酸标签(His-tag),外源基因表达受异丙基-β-D-硫代半乳糖苷(IPTG)诱导。将构建完成质粒转化进表达BL21(DE3)菌株,通过PCR筛选阳性克隆。所构建成功的载体见图4。
2.三角褐指藻CRTISO5基因在大肠杆菌中的表达
采用含有100mg/L氨苄青霉素的LB培养基在37℃培养上述菌株,至OD600为0.6-0.8。加入异丙基-β-D-硫代半乳糖苷(IPTG)至终浓度为0.4mM,在16℃继续培养12小时。
3.三角褐指藻CRTISO5蛋白的纯化
高压破碎后以13000g在4℃离心15分钟。所获得上清采用AKTA系统,采用Source Q离子交换柱进行蛋白纯化。采用30kDaMWCO超滤管进行蛋白浓缩。采用牛血清白蛋白(BSA)配置标准品,采用二辛可宁酸法测定蛋白浓度。分装至每管50μg后液氮速冻,保存于-80℃。
四、CRTISO5蛋白的酶学功能鉴定
在体外体系中加入岩藻黄素合成前体7′,8′-双脱氢甲藻黄素与纯化CRTISO5蛋白,验证岩藻黄素的产生。具体如下:
1.酶学研究体系的建立
将200μL缓冲液(0.1%TritonX-100,100mM Tris,10mM MgCl2,1mM DTT,pH 7.4)加入纯化干燥后的7′,8′-双脱氢甲藻黄素,通过超声混匀来溶剂该前体。
继而加入三种组分至括号内终浓度:
黄素腺嘌呤二核苷酸(FAD,100μM),氧化型;
Na2S2O4(1mM),以使FAD转化为还原型;
50μg纯化CRTISO5蛋白。
在对照反应中省略CRTISO5蛋白。两小时后,加入200μL丙酮与200μL乙酸乙酯,混匀以终止反应。
2.CRTISO5蛋白产物的分析
在反应终止后通过离心对类胡萝卜素产物进行萃取:吸取上清有机层,氮气吹干,用50μL甲醇水溶液溶解。
利用高效液相色谱(HPLC)分析溶解物,以分析岩藻黄素的产生与7′,8′-双脱氢甲藻黄素的残留。HPLC参数如下:
仪器:Waters Acquity UPLC;
温度:45℃;
流速:每分钟0.3mL;
进样体积:3μL;
梯度流动相:由溶剂A(乙腈∶甲醇∶甲基叔丁基醚=70∶20∶10)与溶剂B(10mM醋酸铵)组成。0时:60%A,40%B;4分钟:75%A,25%B;12分钟:100%A。
图5可以看出,通过和对照相比,加入CRTISO5蛋白后,7′,8′-双脱氢甲藻黄素被转化为岩藻黄素,也就是说,CRTISO5蛋白具有生物体外合成岩藻黄素的功能。
序列表
<110> 西湖大学
<120> 三角褐指藻CRTISO5基因、蛋白及在岩藻黄素合成中的应用
<130> 2021
<160> 2
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1986
<212> DNA
<213> Phaeodactylum tricornutum
<400> 1
atgctgcgtc ttgctgcttt attcgctgcc atcgctgctg tagacgtaac ggcgttcacg 60
cctgctacta aacccttttt gacggcatcg catccgtacg gtctacgttc gacgactaac 120
gagaatgtgg cccagacgga aaacacttca cgagaaaaag tcatgacctt ctcgtacgat 180
atgtcgcttg aaccaaagta cgagaaaccc acctatcctg gaactggaaa cggtttgagc 240
ggagattctg gtctttacga tgtaatcgtg attggatccg gtatgggcgg gctagcttgt 300
ggcgccctgt cagctaaata cggtgacaag gtcctcgtgc tagagtcgca cattaaatgc 360
ggaggatcgg ctcatacatt ctcccgtatg cacaacggtg aaaaatattc cttcgaagtg 420
ggtccttcaa tttttgaagg actcgaccgt ccaagcctga atccccttcg catgattttt 480
gatgtcctgg aagaagagat gcccgtaaaa acttacactg gtcttggata ctggactccc 540
acgggatatt ggcgtttccc tatcggtagt caaagcaaat tcgaagatct gcttatggaa 600
caagcggaag atggccccaa ggctgttgag gaatggaaca tgttacgcaa acgcctcaag 660
acacttggtg gttctacaac tgcagtttcg ttgttgaacc tacgtcaaga tcctggtttt 720
ttagcgacaa cagctggtag tttgcctttt gtggcaacgc atcctgatgt gtttctcgac 780
ttgtcgctta cgtttgattc tctccacaag acggttgata aaattgtgac ggtccctttc 840
ctccgaaact ttatcgatac catgtgcatt ttctgcggct tcccagccaa gggcgcgatg 900
acggcgcaca tgctttatat cttagagcgc ttctttgaag agtcagcttg ctattctgtt 960
ccgattggag gtacatgcga aatgggaaac acattggtac gcggcttgga aaagtttggt 1020
ggcaaaatcc agttgaatgc tcacgtagac gaaattttgg tcgaaaacgg acgtgccgtg 1080
ggtgttcgtc tcaagaacgg aaatgttgtt aaagcaaaca aagccgtggt gagcaatgcc 1140
acgccttttg ataccgtgaa gatgcttgga gaaaaacaag cacttccaga aggtgtcgcg 1200
aaatggaagg aagagcttgg gaaactccca cgtcacggag cgattatgca tttattttta 1260
gctattgatg cgaaggatct ggacctttcg cacattcaag accccgctca tttagtagtt 1320
caagactggg gacgttcttt acaagactcg cagaacttgt gtagcttctt cattcctagt 1380
ttacttgaca agacgttatg tccggaaggc aagcatgtca ttcatgtata ctcttctgga 1440
ggggaaccgt atgagccgtg ggaaaagctc aagccaggga cacaggagta cgacgattac 1500
aaaaacgaac gcgctaaagt tttgtgggaa gcagtcgaaa ggtgtattcc agatgttcgg 1560
gatcgcttgg aattttccat agtcggatcc cctcttgcac atgaagcctt tcttcgacgt 1620
gatcgaggta cgtatggaat ggcatgggct gctggtacat cagcgcccca ggccggcctt 1680
cttcagaata ttctcccttt cccattccca aaccttaaga caccagtcga tggtctctta 1740
cgatgcggcg actcctgctt tcccggtatc ggaactccaa gtgcggccgc ctcgggagcg 1800
attgcagcga acacaatgaa ccccgtcggc aagcatttag atttgctgaa agaagccagt 1860
caaagagatc ctatgtacaa gtttctggat cctggtgtgt ttggaagtat ttatcgacca 1920
ttcgtcgagt ctttgacgcc aagtaccgaa cttcaggttg aatctatcca aaacactgca 1980
gattag 1986
<210> 2
<211> 661
<212> PRT
<213> Phaeodactylum tricornutum
<400> 2
Met Leu Arg Leu Ala Ala Leu Phe Ala Ala Ile Ala Ala Val Asp Val
1 5 10 15
Thr Ala Phe Thr Pro Ala Thr Lys Pro Phe Leu Thr Ala Ser His Pro
20 25 30
Tyr Gly Leu Arg Ser Thr Thr Asn Glu Asn Val Ala Gln Thr Glu Asn
35 40 45
Thr Ser Arg Glu Lys Val Met Thr Phe Ser Tyr Asp Met Ser Leu Glu
50 55 60
Pro Lys Tyr Glu Lys Pro Thr Tyr Pro Gly Thr Gly Asn Gly Leu Ser
65 70 75 80
Gly Asp Ser Gly Leu Tyr Asp Val Ile Val Ile Gly Ser Gly Met Gly
85 90 95
Gly Leu Ala Cys Gly Ala Leu Ser Ala Lys Tyr Gly Asp Lys Val Leu
100 105 110
Val Leu Glu Ser His Ile Lys Cys Gly Gly Ser Ala His Thr Phe Ser
115 120 125
Arg Met His Asn Gly Glu Lys Tyr Ser Phe Glu Val Gly Pro Ser Ile
130 135 140
Phe Glu Gly Leu Asp Arg Pro Ser Leu Asn Pro Leu Arg Met Ile Phe
145 150 155 160
Asp Val Leu Glu Glu Glu Met Pro Val Lys Thr Tyr Thr Gly Leu Gly
165 170 175
Tyr Trp Thr Pro Thr Gly Tyr Trp Arg Phe Pro Ile Gly Ser Gln Ser
180 185 190
Lys Phe Glu Asp Leu Leu Met Glu Gln Ala Glu Asp Gly Pro Lys Ala
195 200 205
Val Glu Glu Trp Asn Met Leu Arg Lys Arg Leu Lys Thr Leu Gly Gly
210 215 220
Ser Thr Thr Ala Val Ser Leu Leu Asn Leu Arg Gln Asp Pro Gly Phe
225 230 235 240
Leu Ala Thr Thr Ala Gly Ser Leu Pro Phe Val Ala Thr His Pro Asp
245 250 255
Val Phe Leu Asp Leu Ser Leu Thr Phe Asp Ser Leu His Lys Thr Val
260 265 270
Asp Lys Ile Val Thr Val Pro Phe Leu Arg Asn Phe Ile Asp Thr Met
275 280 285
Cys Ile Phe Cys Gly Phe Pro Ala Lys Gly Ala Met Thr Ala His Met
290 295 300
Leu Tyr Ile Leu Glu Arg Phe Phe Glu Glu Ser Ala Cys Tyr Ser Val
305 310 315 320
Pro Ile Gly Gly Thr Cys Glu Met Gly Asn Thr Leu Val Arg Gly Leu
325 330 335
Glu Lys Phe Gly Gly Lys Ile Gln Leu Asn Ala His Val Asp Glu Ile
340 345 350
Leu Val Glu Asn Gly Arg Ala Val Gly Val Arg Leu Lys Asn Gly Asn
355 360 365
Val Val Lys Ala Asn Lys Ala Val Val Ser Asn Ala Thr Pro Phe Asp
370 375 380
Thr Val Lys Met Leu Gly Glu Lys Gln Ala Leu Pro Glu Gly Val Ala
385 390 395 400
Lys Trp Lys Glu Glu Leu Gly Lys Leu Pro Arg His Gly Ala Ile Met
405 410 415
His Leu Phe Leu Ala Ile Asp Ala Lys Asp Leu Asp Leu Ser His Ile
420 425 430
Gln Asp Pro Ala His Leu Val Val Gln Asp Trp Gly Arg Ser Leu Gln
435 440 445
Asp Ser Gln Asn Leu Cys Ser Phe Phe Ile Pro Ser Leu Leu Asp Lys
450 455 460
Thr Leu Cys Pro Glu Gly Lys His Val Ile His Val Tyr Ser Ser Gly
465 470 475 480
Gly Glu Pro Tyr Glu Pro Trp Glu Lys Leu Lys Pro Gly Thr Gln Glu
485 490 495
Tyr Asp Asp Tyr Lys Asn Glu Arg Ala Lys Val Leu Trp Glu Ala Val
500 505 510
Glu Arg Cys Ile Pro Asp Val Arg Asp Arg Leu Glu Phe Ser Ile Val
515 520 525
Gly Ser Pro Leu Ala His Glu Ala Phe Leu Arg Arg Asp Arg Gly Thr
530 535 540
Tyr Gly Met Ala Trp Ala Ala Gly Thr Ser Ala Pro Gln Ala Gly Leu
545 550 555 560
Leu Gln Asn Ile Leu Pro Phe Pro Phe Pro Asn Leu Lys Thr Pro Val
565 570 575
Asp Gly Leu Leu Arg Cys Gly Asp Ser Cys Phe Pro Gly Ile Gly Thr
580 585 590
Pro Ser Ala Ala Ala Ser Gly Ala Ile Ala Ala Asn Thr Met Asn Pro
595 600 605
Val Gly Lys His Leu Asp Leu Leu Lys Glu Ala Ser Gln Arg Asp Pro
610 615 620
Met Tyr Lys Phe Leu Asp Pro Gly Val Phe Gly Ser Ile Tyr Arg Pro
625 630 635 640
Phe Val Glu Ser Leu Thr Pro Ser Thr Glu Leu Gln Val Glu Ser Ile
645 650 655
Gln Asn Thr Ala Asp
660
Claims (1)
1.三角褐指藻CRTISO5基因编码的蛋白在生物体外用于岩藻黄素合成中的应用,其特征在于,所述三角褐指藻CRTISO5基因的核苷酸序列如SEQ ID NO.1 所示,所述生物为三角褐指藻。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210089004.6A CN114410659B (zh) | 2022-01-21 | 2022-01-21 | 三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210089004.6A CN114410659B (zh) | 2022-01-21 | 2022-01-21 | 三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114410659A CN114410659A (zh) | 2022-04-29 |
CN114410659B true CN114410659B (zh) | 2023-05-02 |
Family
ID=81276864
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210089004.6A Active CN114410659B (zh) | 2022-01-21 | 2022-01-21 | 三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114410659B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN119552890B (zh) * | 2025-01-24 | 2025-06-06 | 宁波大学 | 用于提高三角褐指藻中岩藻黄素含量的热激转录因子49557基因及其应用 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110042096A (zh) * | 2019-04-01 | 2019-07-23 | 宁波大学 | 一种用紫光提高三角褐指藻岩藻黄素含量的方法 |
CN113215118A (zh) * | 2021-05-06 | 2021-08-06 | 中国科学院青岛生物能源与过程研究所 | 一种新黄素合成酶及其编码基因和应用 |
-
2022
- 2022-01-21 CN CN202210089004.6A patent/CN114410659B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN114410659A (zh) | 2022-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111676204B (zh) | 制备烟酰胺单核苷酸的烟酰胺磷酸核糖转移酶、编码基因、载体及应用 | |
CN111484987B (zh) | 一种具有高扩增活性的耐热dna聚合酶突变体 | |
CN112795551B (zh) | 一种耐高温逆转录酶突变体及其应用 | |
CN108728421B (zh) | 一种羰基还原酶突变体及其用途 | |
CN114395571B (zh) | 三角褐指藻zep1基因、蛋白及应用 | |
CN109554414B (zh) | 金针菇基因Fvegt1、Fvegt2和Fvegt3在合成麦角硫因中的应用 | |
CN112795550B (zh) | 耐高温逆转录酶突变体 | |
CN117106819B (zh) | 三角褐指藻CHLC基因以及编码的蛋白在叶绿素c合成中的应用 | |
CN114410659B (zh) | 三角褐指藻crtiso5基因、蛋白及在岩藻黄素合成中的应用 | |
CN112795547B (zh) | 高逆转录效率的逆转录酶突变体 | |
CN111073868B (zh) | 一种植物黄酮甲基转移酶蛋白及其编码基因与应用 | |
CN113652408A (zh) | 羰基还原酶突变体及其在(r)-4-氯-3-羟基丁酸乙酯合成中的应用 | |
CN112852847A (zh) | 一种重组酿酒酵母菌株及其构建方法与应用 | |
CN109337884B9 (zh) | 一种丙酮酸激酶基因及其应用 | |
JP2018536400A (ja) | ドリメノールシンターゼiii | |
CN108277216B (zh) | S-氰醇裂解酶及其应用 | |
CN112410353A (zh) | 一种fkbS基因、含其的基因工程菌及其制备方法和用途 | |
CN112029782B (zh) | 一种β-胡萝卜素羟化酶及其基因与应用 | |
CN113583983B (zh) | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 | |
CN110872594B (zh) | 黑莓糖基转移酶基因及其应用 | |
CN112795549B (zh) | 一种逆转录酶突变体 | |
CN114134186A (zh) | 以葡萄糖为底物生物法合成5-羟基β-吲哚基丙氨酸的方法 | |
CN108707590B (zh) | 一种Pictet-Spengler酶及其编码基因与应用 | |
CN112359045A (zh) | 一种类胡萝卜素代谢途径相关基因及应用 | |
CN114480448B (zh) | 一种促进银杏黄酮醇苷合成的基因GbF3′H及其载体、蛋白、和应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |