CN110885846B

CN110885846B - Microorganism for synthesizing baicalein and scutellarin, preparation method and application thereof

Info

Publication number: CN110885846B
Application number: CN201811043657.0A
Authority: CN
Inventors: 王勇; 李建华; 田晨菲
Original assignee: Center for Excellence in Molecular Plant Sciences of CAS
Current assignee: Center for Excellence in Molecular Plant Sciences of CAS
Priority date: 2018-09-07
Filing date: 2018-09-07
Publication date: 2023-08-25
Anticipated expiration: 2038-09-07
Also published as: CN110885846A; US20220033827A1; WO2020048523A1; JP2021535757A; EP3848462A1; EP3848462A4

Abstract

本发明涉及合成黄芩素和野黄芩素的微生物、其制备方法及其应用。本发明人通过基因工程的方法对宿主细胞的异源代谢途径进行改造，获得了高产黄芩素和野黄芩素的工程菌株。本发明还提供了利用所述工程菌株生产黄芩素和野黄芩素的工艺。The invention relates to a microorganism for synthesizing baicalein and scutellarein, a preparation method and application thereof. The present inventors modified the heterologous metabolic pathway of the host cell by means of genetic engineering, and obtained an engineering strain with high production of baicalein and scutellarein. The invention also provides a process for producing scutellarein and scutellarein by using the engineering strain.

Description

Microorganism for synthesizing baicalein and scutellarein, its preparation method and application

技术领域technical field

本发明涉及合成生物学及医药技术领域，具体地，本发明涉及合成黄芩素和野黄芩素的微生物、其制备方法及其应用。The invention relates to the technical fields of synthetic biology and medicine, in particular, the invention relates to microorganisms for synthesizing scutellarein and scutellarein, a preparation method and application thereof.

背景技术Background technique

黄芩(Scutcllaria Baicalensis Georgi)是中国著名的传统药物，为唇形科植物；中药黄芩是植物黄芩干燥的根，药用历史悠久，可用于风热湿热等多种疾病的治疗。灯盏花是菊科植物短葶(Erigeron breviscapus)的干燥全草，性寒味苦，具有消炎止痛、活血化瘀和祛风除湿等功效。黄芩和灯盏花提取物早已被广泛应用于中药制剂中，银黄片、双黄莲口服液和兰岑口服液等的主要原料都是黄芩提取物，中药清开灵的主要有效成分就是黄芩素，它具有消炎，防治腹泻、肝病和肿瘤的作用；常见的灯盏花剂型有灯盏花素片和灯盏花素口服液等，它们在生物体内的主要代谢吸收形式是其苷元野黄芩素。因此，黄芩素和野黄芩素均具有一定的新药开发价值。Scutellaria Baicalensis Georgi is a well-known traditional medicine in China. It is a plant of the family Labiatae. The traditional Chinese medicine Scutellaria Baicalensis is the dried root of the plant Scutellaria baicalensis. It has a long history of medicinal use and can be used for the treatment of wind-heat, damp-heat and other diseases. Erigeron breviscapus is the dried whole herb of the Compositae plant Erigeron breviscapus. It is cold in nature and bitter in taste. Scutellaria baicalensis and Erigeron breviscapus extracts have long been widely used in traditional Chinese medicine preparations. The main raw materials of Yinhuang Tablets, Shuanghuanglian Oral Liquid and Lancen Oral Liquid are all Scutellaria baicalensis extracts. The main active ingredient of traditional Chinese medicine Qingkailing is baicalein. It has the functions of anti-inflammation, prevention and treatment of diarrhea, liver disease and tumors; common breviscapine formulations include breviscapine tablets and breviscapine oral liquid, etc., and their main form of metabolism and absorption in the organism is its aglycone scutellarein. Therefore, both baicalein and scutellarein have certain new drug development value.

黄芩素和野黄芩素是二个结构相似且重要的黄酮类化合物。黄芩素的分子式为C₁₅H₁₀O₅，分子量为270.24，而野黄芩素的分子量为C₁₅H₁₀O₆，分子量为286.24。它们的结构如图1所示。Baicalein and scutellarein are two structurally similar and important flavonoids. The molecular formula of baicalein is C ₁₅ H ₁₀ O ₅ and the molecular weight is 270.24, while the molecular weight of scutellarein is C ₁₅ H ₁₀ O ₆ and the molecular weight is 286.24. Their structures are shown in Figure 1.

与大多数植物源天然产物一样，目前黄芩素和野黄芩素的主要通过化学合成和有机溶剂萃取两种方法制备。有机溶剂萃取主要是对黄芩、灯盏花、半枝莲等药用植物进行组织抽提，在此过程中需要大量的有机溶剂，还存在随后的分离工艺繁琐，工业化造价高等问题。最主要的是该方法受到植物生长缓慢、对药用资源的破坏等难题。尽管通过化学合成也可以大量获得黄芩素和野黄芩素，但是原料会涉及肉桂酸或其衍生物、氧代苯酚等化工物质，一定程度限制了其在药物、食品领域的应用。而且在合成过程中还涉及到有毒试剂及昂贵的化学催化剂的使用等问题。Like most natural products of plant origin, baicalein and scutellarein are mainly prepared by chemical synthesis and organic solvent extraction. Organic solvent extraction is mainly to extract the tissues of medicinal plants such as Scutellaria baicalensis, Erigeron breviscapus, and Scutellaria barbata. In this process, a large amount of organic solvents are required, and there are also problems such as cumbersome separation processes and high industrialization costs. The most important thing is that this method is subject to difficult problems such as slow plant growth and damage to medicinal resources. Although baicalein and scutellarein can also be obtained in large quantities through chemical synthesis, the raw materials involve chemical substances such as cinnamic acid or its derivatives, oxyphenol, etc., which limits its application in the fields of medicine and food to a certain extent. Moreover, problems such as the use of toxic reagents and expensive chemical catalysts are also involved in the synthesis process.

合成生物学是基于理性的设计，将标准化的生物元件进行集成与装配，从而构建性能优良的人造生命系统。合成生物学一经诞生，它的思想和设计就深刻地影响着工业微生物技术的发展，使微生物技术在药物、生物燃料、精细化学品的开发与生产过程中发挥出更加巨大的作用。Synthetic biology is based on rational design, integrating and assembling standardized biological components to construct artificial life systems with excellent performance. Once synthetic biology was born, its ideas and designs have profoundly influenced the development of industrial microbial technology, enabling microbial technology to play a greater role in the development and production of drugs, biofuels, and fine chemicals.

在本领域中，多种天然产物的合成元件经过组装后实现了在微生物中的异源合成。但是本发明中涉及到的黄芩素和野黄芩素这两种活性黄酮类化合物，还尚未见在微生物中成功地异源合成的报道。因此，迫切需要构建一株能够异源合成黄芩素和野黄芩素的微生物菌株。In the art, synthetic elements of various natural products have been assembled to enable heterologous synthesis in microorganisms. However, the two active flavonoids, baicalein and scutellarein involved in the present invention, have not yet been reported to be successfully heterologously synthesized in microorganisms. Therefore, it is urgent to construct a microbial strain capable of heterologously synthesizing baicalein and scutellarein.

发明内容Contents of the invention

本发明的目的在于提供合成黄芩素和野黄芩素的微生物、其制备方法及其应用。The object of the present invention is to provide microorganisms for synthesizing baicalein and scutellarein, their preparation method and application.

在本发明的第一方面，提供一种生产黄芩素和野黄芩素的方法，包括：(1)在宿主细胞中引入表达黄酮6-羟化酶(F6H)以及细胞色素P450氧化还原酶(CPR)的基因，以及白杨素或芹菜素合成基因；和(2)在含有苯丙氨酸和/或酪氨酸的培养体系中培养该宿主细胞，从而生产黄芩素或野黄芩素。In a first aspect of the present invention, a method for producing baicalein and scutellarein is provided, comprising: (1) introducing and expressing flavone 6-hydroxylase (F6H) and cytochrome P450 oxidoreductase (CPR) in host cells ) gene, and chrysin or apigenin synthesis gene; and (2) cultivating the host cell in a culture system containing phenylalanine and/or tyrosine, thereby producing baicalein or scutellarein.

在一个优选例中，所述的白杨素或芹菜素合成基因包括：表达苯丙氨酸解氨酶(phenylalanine ammonia-lyase，PAL)、4-香豆酸辅酶A连接酶(4-coumarate：CoA ligase，4CL)、查尔酮合成酶(chalcone synthase，CHS)、查尔酮异构酶(chalcone isomerase，CHI)和黄酮合成酶I(flavone synthase I，FNSI)的基因；较佳地，在被引入宿主细胞时，所述的表达苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I的基因存在于同一表达载体中。In a preferred example, the chrysin or apigenin synthesis gene includes: expression of phenylalanine ammonia-lyase (phenylalanine ammonia-lyase, PAL), 4-coumarate coenzyme A ligase (4-coumarate: CoA ligase, 4CL), chalcone synthase (chalcone synthase, CHS), chalcone isomerase (chalcone isomerase, CHI) and flavone synthase I (flavone synthase I, FNSI) genes; When introduced into host cells, the genes expressing phenylalanine ammonia lyase, 4-coumaric acid coenzyme A ligase, chalcone synthase, chalcone isomerase and flavone synthase I exist in the same expression in the carrier.

在另一优选例中，所述的黄酮6-羟化酶来源于黄芩(Scutcllaria baicalensis)，也包括其同源物(来自其它物种的同源基因或多肽)；所述的CPR来源于拟南芥(Arabidopsis thaliana)，也包括其同源物。In another preferred example, the flavone 6-hydroxylase is derived from Scutellaria baicalensis, including its homologues (homologous genes or polypeptides from other species); the CPR is derived from Arabidopsis Mustard (Arabidopsis thaliana), also including its homologues.

在另一优选例中，所述的PAL来源于红景天(Rhodotorula toruloides)，也包括其同源物；所述的4CL来源于欧芹(Petroselium crispum)，也包括其同源物；所述的CHS来源于矮牵牛(Petunia X hybrida)，也包括其同源物；所述的CHI来源于苜蓿(Medicagosativa)，也包括其同源物；所述的FNS I来源于欧芹(Petroselium crispum)，也包括其同源物。In another preferred example, the PAL is derived from Rhodotorula toruloides (Rhodotorula toruloides), also including its homologues; the 4CL is derived from parsley (Petroselium crispum), also including its homologues; the The CHS derived from petunia (Petunia X hybrida) also includes its homologue; the CHI derived from alfalfa (Medicagosativa) also includes its homologue; the FNS I derived from parsley (Petroselium crispum ), including its homologues.

在本发明的另一方面，提供一种生产黄芩素和野黄芩素的方法，包括：(1)在宿主细胞中引入表达黄酮6-羟化酶(F6H)以及细胞色素P450氧化还原酶(CPR)的基因，获得重组的宿主细胞；和(2)在含有白杨素或芹菜素的培养体系中培养该重组的宿主细胞，从而生产黄芩素或野黄芩素。In another aspect of the present invention, there is provided a method for producing baicalein and scutellarein, comprising: (1) introducing and expressing flavone 6-hydroxylase (F6H) and cytochrome P450 oxidoreductase (CPR) in host cells ) to obtain a recombinant host cell; and (2) cultivating the recombinant host cell in a culture system containing chrysin or apigenin, thereby producing baicalein or scutellarein.

在本发明的另一方面，提供一种将白杨素或芹菜素转化为黄芩素或野黄芩素的方法：以黄酮6-羟化酶以及细胞色素P450氧化还原酶催化白杨素或芹菜素，从而在白杨素或芹菜素的结构中加上一个羟基，形成黄芩素或野黄芩素。In another aspect of the present invention, there is provided a method for converting chrysin or apigenin into baicalein or scutellarein: catalyzing chrysin or apigenin with flavone 6-hydroxylase and cytochrome P450 oxidoreductase, thereby Add a hydroxyl group to the structure of chrysin or apigenin to form baicalein or scutellarein.

在一个优选例中，所述的黄酮6-羟化酶(F6H)是截去N端第(1-10)～(20-30)位氨基酸的突变型黄酮6-羟化酶；较佳地，是截去N端第(2-5)～(22-28)位氨基酸的突变型黄酮6-羟化酶。In a preferred example, the flavone 6-hydroxylase (F6H) is a mutant flavone 6-hydroxylase that truncates amino acids at positions (1-10) to (20-30) of the N-terminus; preferably , is a mutant flavone 6-hydroxylase that truncated amino acids at positions (2-5)-(22-28) of the N-terminus.

在另一优选例中，所述的黄酮6-羟化酶与多肽标签融合，所述的多肽标签选自：小牛血清17羟基化酶N端8氨基酸多肽(8RP)，小分子泛素修饰相关蛋白(Sumo)，麦芽糖结合蛋白(MBP)，细胞色素P450 2B1家族可溶性蛋白(2B1)，或其组合；较佳地为麦芽糖结合蛋白或细胞色素P450 2B1家族可溶性蛋白，或其组合；较佳地所述多肽标签位于N端。In another preferred example, the flavone 6-hydroxylase is fused with a polypeptide tag, and the polypeptide tag is selected from: calf serum 17 hydroxylase N-terminal 8 amino acid polypeptide (8RP), small molecule ubiquitin modified Related protein (Sumo), maltose binding protein (MBP), cytochrome P450 2B1 family soluble protein (2B1), or combination thereof; preferably maltose binding protein or cytochrome P450 2B1 family soluble protein, or combination thereof; preferably The polypeptide tag is located at the N-terminus.

在另一优选例中，所述的细胞色素P450氧化还原酶(CPR)的是截去N端第(1-20)～(60-85)位氨基酸的突变型细胞色素P450氧化还原酶；较佳地，是截去N端第(2-10)～(65-80)位氨基酸的突变型细胞色素P450氧化还原酶；更佳地，是截去N端第(2-5)～(70-75)位氨基酸的突变型细胞色素P450氧化还原酶。In another preferred example, the cytochrome P450 oxidoreductase (CPR) is a mutant cytochrome P450 oxidoreductase that truncated amino acids at positions (1-20)-(60-85) of the N-terminal; Preferably, it is a mutant cytochrome P450 oxidoreductase that truncates (2-10)-(65-80) amino acids at the N-terminus; more preferably, it is a truncated N-terminus (2-5)-(70 -75) amino acid mutant cytochrome P450 oxidoreductase.

在另一优选例中，所述的宿主细胞包括：原核细胞或真核细胞；较佳地，所述的原核细胞包括：大肠杆菌细胞，枯草芽孢杆菌细胞；所述的真核细胞包括：酵母细胞。In another preferred example, the host cells include: prokaryotic cells or eukaryotic cells; preferably, the prokaryotic cells include: Escherichia coli cells, Bacillus subtilis cells; the eukaryotic cells include: yeast cell.

在本发明的另一方面，提供一种重组宿主细胞，其中包括外源的表达黄酮6-羟化酶以及细胞色素P450氧化还原酶的基因。In another aspect of the present invention, a recombinant host cell is provided, which includes exogenous genes expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase.

在另一优选例中，所述的重组宿主细胞中还包括外源的白杨素或芹菜素合成基因。In another preferred example, the recombinant host cell further includes an exogenous chrysin or apigenin synthesis gene.

在另一优选例中，所述的多肽标签为单拷贝或2～10拷贝(如3、4、5、6、8拷贝)串联的序列结构。In another preferred example, the polypeptide tag is a single copy or a sequence structure of 2-10 copies (such as 3, 4, 5, 6, 8 copies) in series.

在本发明的另一方面，提供前面任一所述的重组宿主细胞的用途，用于生产黄芩素和野黄芩素。In another aspect of the present invention, the use of any one of the aforementioned recombinant host cells is provided for producing scutellarein and scutellarein.

在一个优选例中，对于细胞内不存在白杨素或芹菜素合成基因的菌株，用于以外源添加的白杨素或芹菜素为底物生产黄芩素和野黄芩素；对于细胞内存在白杨素或芹菜素合成基因的菌株，用于在外源添加苯丙氨酸和/或酪氨酸的培养条件下生产黄芩素和野黄芩素。In a preferred example, for the bacterial strain that does not have chrysin or apigenin synthesis gene in the cell, it is used to produce baicalein and scutellarein with chrysin or apigenin added by exogenous sources; for the presence of chrysin or apigenin in the cell A strain with an apigenin synthesis gene for producing baicalein and scutellarein under the culture conditions of exogenously added phenylalanine and/or tyrosine.

在本发明的另一方面，提供一种制备用于生产黄芩素和野黄芩素的宿主细胞的方法，包括：在宿主细胞中引入表达黄酮6-羟化酶以及细胞色素P450氧化还原酶的基因，获得重组菌株；较佳地，还包括引入白杨素或芹菜素合成基因。In another aspect of the present invention, there is provided a method for preparing host cells for producing baicalein and scutellarein, comprising: introducing genes expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase into the host cells , to obtain a recombinant strain; preferably, it also includes introducing a chrysin or apigenin synthesis gene.

在本发明的另一方面，提供一种用于生产黄芩素和野黄芩素的试剂盒所述试剂盒中包括前面任一所述的重组的宿主细胞。In another aspect of the present invention, there is provided a kit for producing scutellarein and scutellarein. The kit includes any of the aforementioned recombinant host cells.

在另一优选例中，所述的试剂盒中还包括：宿主细胞培养基，使用说明书等。In another preferred example, the kit further includes: host cell culture medium, instructions for use, and the like.

在本发明的另一方面，提供突变型黄酮6-羟化酶，其对应于野生型黄酮6-羟化酶(F6H)，截去N端第(1-10)～(20-30)位氨基酸；较佳地，截去N端第(2-5)～(22-28)位氨基酸；较佳地，其具有SEQ ID NO:2所示的氨基酸序列。In another aspect of the present invention, mutant flavone 6-hydroxylase is provided, which corresponds to wild-type flavone 6-hydroxylase (F6H), truncating the N-terminal (1-10)-(20-30) Amino acid; preferably, amino acids (2-5)-(22-28) of the N-terminal are truncated; preferably, it has the amino acid sequence shown in SEQ ID NO:2.

在本发明的另一方面，提供突变型细胞色素P450氧化还原酶，其对应于野生型细胞色素P450氧化还原酶，截去N端第(1-20)～(60-85)位氨基酸；较佳地，截去N端第(2-10)～(65-80)位氨基酸；更佳地，截去N端第(2-5)～(70-75)位氨基酸；较佳地，其具有SEQ IDNO:8所示的氨基酸序列。In another aspect of the present invention, a mutant cytochrome P450 oxidoreductase is provided, which corresponds to the wild-type cytochrome P450 oxidoreductase, and amino acids at positions (1-20) to (60-85) of the N-terminal are truncated; Preferably, amino acids at positions (2-10) to (65-80) of the N-terminus are truncated; more preferably, amino acids at positions (2-5) to (70-75) of the N-terminus are truncated; preferably, the It has the amino acid sequence shown in SEQ ID NO:8.

在本发明的另一方面，提供融合多肽，其包括前面所述的任一突变型黄酮6-羟化酶，以及与之融合的多肽标签，所述的多肽标签选自：8RP，Sumo，MBP，2B1；较佳地为MBP或2B1。In another aspect of the present invention, a fusion polypeptide is provided, which includes any of the aforementioned mutant flavone 6-hydroxylases, and a polypeptide tag fused thereto, and the polypeptide tag is selected from: 8RP, Sumo, MBP , 2B1; preferably MBP or 2B1.

在一个优选例中，所述的融合多肽具有选自下组的氨基酸序列：SEQ ID NO:3、SEQID NO:4、SEQ ID NO:5或SEQ ID NO:6。In a preferred example, the fusion polypeptide has an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6.

在本发明的另一方面，提供多核苷酸，其编码：述的突变型黄酮6-羟化酶；或所述的突变型细胞色素P450氧化还原酶；或所述的融合多肽。In another aspect of the present invention, a polynucleotide is provided, which encodes: the mutant flavone 6-hydroxylase; or the mutant cytochrome P450 oxidoreductase; or the fusion polypeptide.

在本发明的另一方面，提供一种表达构建物，其包括：前面所述的任一多核苷酸；或编码前面所述的任一突变型黄酮6-羟化酶或前面所述的融合蛋白的多核苷酸，以及编码前面所述的突变型细胞色素P450氧化还原酶的多核苷酸。In another aspect of the present invention, an expression construct is provided, which includes: any polynucleotide described above; or encodes any mutant flavone 6-hydroxylase described above or the above-mentioned The polynucleotide of the fusion protein, and the polynucleotide encoding the aforementioned mutant cytochrome P450 oxidoreductase.

在另一优选例中，所述的表达构建物中，还包括与上述的多核苷酸操作性连接的启动子和终止子。In another preferred example, the expression construct further includes a promoter and a terminator operably linked to the above-mentioned polynucleotide.

在本发明的另一方面，提供所述的突变型黄酮6-羟化酶或所述的融合蛋白，以及突变型细胞色素P450氧化还原酶的用途，用于在白杨素或芹菜素的结构中加上一个羟基，形成黄芩素或野黄芩素。In another aspect of the present invention, there is provided the use of the mutant flavone 6-hydroxylase or the fusion protein, and the mutant cytochrome P450 oxidoreductase, which are used in the structure of chrysin or apigenin Add a hydroxyl group to form baicalein or scutellarein.

本发明的其它方面由于本文的公开内容，对本领域的技术人员而言是显而易见的。Other aspects of the invention will be apparent to those skilled in the art from the disclosure herein.

附图说明Description of drawings

图1、黄芩素和野黄芩素的结构式。Figure 1. Structural formulas of baicalein and scutellarein.

图2、黄芩素和野黄芩素的生物合成路径。Figure 2. Biosynthetic pathways of baicalein and scutellarein.

图3、质粒pYH66的构建示意图。Fig. 3. Schematic diagram of the construction of plasmid pYH66.

图4、质粒pYH57的构建示意图。Fig. 4. Schematic diagram of the construction of plasmid pYH57.

图5、工程菌株BL21(DE3)-pYH57-pYH66和标准品黄芩素的HPLC检测图谱。其中，i表示BL21(DE3)-pETDuet-1-pCDFDuet-1发酵液作为空白对照；ii表示BL21(DE3)-pYH57-pYH66添加苯丙氨酸的发酵液；iii表示黄芩素标准品。Fig. 5. HPLC detection patterns of engineering strain BL21(DE3)-pYH57-pYH66 and standard product baicalein. Among them, i represents BL21(DE3)-pETDuet-1-pCDFDuet-1 fermentation broth as a blank control; ii represents BL21(DE3)-pYH57-pYH66 fermentation broth added with phenylalanine; iii represents baicalein standard.

图6、工程菌株BL21(DE3)-pYH57-pYH66产黄芩素的质谱图谱。Fig. 6. The mass spectrogram of baicalein produced by engineering strain BL21(DE3)-pYH57-pYH66.

图7、工程菌株BL21(DE3)-pYH57-pYH66和标准品野黄芩素的HPLC检测图谱。其中，i表示BL21(DE3)-pETDuet-1-pCDFDuet-1发酵液作为空白对照；ii表示BL21(DE3)-pYH57-pYH66添加酪氨酸的发酵液；iii表示野黄芩素标准品。Fig. 7. HPLC detection pattern of engineering strain BL21(DE3)-pYH57-pYH66 and standard product scutellarein. Among them, i represents BL21(DE3)-pETDuet-1-pCDFDuet-1 fermentation liquid as a blank control; ii represents tyrosine-added fermentation liquid of BL21(DE3)-pYH57-pYH66; iii represents scutellarein standard.

图8、工程菌株BL21(DE3)-pYH57-pYH66产野黄芩素的质谱图谱。Fig. 8. The mass spectrum of scutellarein produced by engineering strain BL21(DE3)-pYH57-pYH66.

图9、SbF6H和AtCPR突变体催化白杨素生成黄芩素。Figure 9. SbF6H and AtCPR mutants catalyze the production of baicalein from chrysin.

A，所构建的质粒中关键元件的示意图；A, schematic diagram of the key elements in the constructed plasmid;

B，重组大肠杆菌中，催化白杨素生成黄芩素的转化率；B, in recombinant Escherichia coli, the conversion rate of catalyzing chrysin to generate baicalein;

C，HPLC分析重组大肠杆菌催化反应液。其中，Chr表示白杨素，Bai表示黄芩素。C, HPLC analysis of recombinant Escherichia coli catalytic reaction solution. Among them, Chr means chrysin, and Bai means baicalein.

具体实施方式Detailed ways

本发明人致力于利用微生物中异源合成黄芩素和野黄芩素，以及提高生物生产黄芩素和野黄芩素的产量，经过深入的研究，通过基因工程的方法对宿主细胞的异源代谢途径进行改造，获得了高产黄芩素和野黄芩素的工程菌株。The present inventor is committed to using heterologous synthesis of baicalein and scutellarein in microorganisms, and improving the yield of biological production of baicalein and scutellarein. Transformation, obtained high-yielding baicalein and wild scutellarein engineering strains.

如本文所用，所述的“N端第(1-10)～(20-30)位氨基酸”是指起始于N端起第1-10位中的任一位氨基酸，终止于N端起第20-30位中的任一位氨基酸。As used herein, the "N-terminal (1-10)-(20-30) amino acid" refers to any amino acid starting from the 1-10th position from the N-terminal and ending from the N-terminal Any amino acid in the 20th-30th position.

如本文所用，所述的“N端第(2-5)～(22-28)位”是指起始于N端起第2-5位中的任一位氨基酸，终止于N端起第22-28位中的任一位氨基酸。As used herein, the "N-terminal (2-5)-(22-28)" refers to any amino acid starting from the 2-5th position from the N-terminal and ending at the 2-5th amino acid from the N-terminal Any amino acid in positions 22-28.

如本文所用，所述的“N端第(1-20)～(60-85)位”是指起始于N端起第1-20位中的任一位氨基酸，终止于N端起第60-85位中的任一位氨基酸。As used herein, the "N-terminal (1-20) ~ (60-85) position" refers to any amino acid starting from the 1-20th position from the N-terminal and ending at the 1st-20th amino acid from the N-terminal Any amino acid in positions 60-85.

如本文所用，所述的“N端第(2-10)～(65-80)位”是指起始于N端起第2-10位中的任一位氨基酸，终止于N端起第65-80位中的任一位氨基酸。As used herein, the "N-terminal (2-10) ~ (65-80) position" refers to any amino acid starting from the 2-10th position from the N-terminal and ending at the 2-10th amino acid from the N-terminal Any amino acid in positions 65-80.

如本文所用，所述的“N端第(2-5)～(70-75)位”是指起始于N端起第2-5位中的任一位氨基酸，终止于N端起第70-75位中的任一位氨基酸。As used herein, the "N-terminal (2-5) ~ (70-75) position" refers to any amino acid starting from the 2-5th position from the N-terminal and ending at the 2-5th amino acid from the N-terminal Any amino acid in positions 70-75.

如本文所用，“外源的”或“异源的”是指来自不同来源的两条或多条核酸或蛋白质序列之间的关系。As used herein, "exogenous" or "heterologous" refers to the relationship between two or more nucleic acid or protein sequences from different sources.

如本文所用，所述的“可操作地连接(相连)”或“操作性连接(相连)”是指两个或多个核酸区域或核酸序列的功能性的空间排列。例如：启动子区被置于相对于目的基因核酸序列的特定位置，使得核酸序列的转录受到该启动子区域的引导，从而，启动子区域被“可操作地连接”到该核酸序列上。As used herein, the term "operably linked (linked)" or "operably linked (linked)" refers to the functional spatial arrangement of two or more nucleic acid regions or nucleic acid sequences. For example: the promoter region is placed at a specific position relative to the nucleic acid sequence of the target gene, so that the transcription of the nucleic acid sequence is guided by the promoter region, thus, the promoter region is "operably linked" to the nucleic acid sequence.

如本文所用，所述的“表达构建物”是指重组DNA分子，它包含预期的核酸编码序列，其可以包含一个或多个基因表达盒。所述的“构建物”通常被包含在表达载体中。As used herein, the "expression construct" refers to a recombinant DNA molecule comprising the desired nucleic acid coding sequence, which may comprise one or more gene expression cassettes. Said "construct" is usually contained in an expression vector.

如本文所用，所述的PAL、4CL、CHS、CHI和FNSI蛋白是在表达系统中形成白杨素或芹菜素合成途径的蛋白。As used herein, the PAL, 4CL, CHS, CHI and FNSI proteins are proteins that form the synthesis pathway of chrysin or apigenin in the expression system.

如本文所用，所述的F6H和CPR蛋白是在表达系统中转化白杨素或芹菜素、生成黄芩素或野黄芩素的蛋白。As used herein, the F6H and CPR proteins are proteins that convert chrysin or apigenin into baicalein or scutellarein in an expression system.

野生型的上述蛋白或基因均为本领域已经鉴定的，因此，可以从公众途径获得和制备。作为本发明的优选方式，PAL来源于红景天(Rhodotorula toruloides)，其具有GenBank登录号AAA33883.1所示的序列；4CL来源于欧芹(Petroselium crispum)，其具有GenBank登录号KF765780.1所示的序列；CHS来源于矮牵牛(Petunia X hybrida)，其具有GenBank登录号KF765781.1所示的序列；CHI基因来源于苜蓿(Medicago sativa)，其具有GenBank登录号KF765782.1所示的序列；FNS I来源于欧芹(Petroselium crispum)，其具有Swiss-Prot登录号Q7XZQ8.1所示的序列。The above-mentioned wild-type proteins or genes have been identified in the art, and therefore, can be obtained and prepared from public sources. As a preferred mode of the present invention, PAL is derived from Rhodiola rosea (Rhodotorula toruloides), which has the sequence shown in GenBank accession number AAA33883.1; 4CL is derived from parsley (Petroselium crispum), which has the sequence shown in GenBank accession number KF765780.1. CHS is derived from petunia (Petunia X hybrida), which has the sequence shown in GenBank accession number KF765781.1; CHI gene is derived from alfalfa (Medicago sativa), which has the sequence shown in GenBank accession number KF765782.1 Sequence; FNS I is derived from parsley (Petroselium crispum), which has the sequence shown in Swiss-Prot accession number Q7XZQ8.1.

野生型的F6H和CPR也是本领域已经鉴定的。作为本发明的优选方式，F6H来源于黄岑(Scutellaria baicalensis)，其具有GenBank登录号ASW21050.1所示的序列。作为本发明的优选方式，CPR来自于拟南芥(Arabidopsis thaliana)，其具有GenBank登录号NP_849472.2所示的序列。Wild-type F6H and CPR have also been identified in the art. As a preferred embodiment of the present invention, F6H is derived from Scutellaria baicalensis, which has the sequence shown by GenBank accession number ASW21050.1. As a preferred mode of the present invention, CPR comes from Arabidopsis thaliana, which has the sequence shown in GenBank accession number NP_849472.2.

本发明人发现，利用宿主细胞生产黄芩素和野黄芩素的过程中，应用野生型的F6H，仅能产生微量的产物，无法实现规模化生产，因此对多个参与反应的蛋白进行了改造，经过大量筛选分析，获得了一些优选的改造方案，极为显著地提高了微生物，尤其是原核表达系统如大肠杆菌中黄芩素和野黄芩素的产量。The inventors found that in the process of using host cells to produce baicalein and scutellarein, only a small amount of products can be produced by using wild-type F6H, and large-scale production cannot be realized. Therefore, multiple proteins involved in the reaction were modified. After a large number of screening and analysis, some optimal transformation schemes were obtained, which greatly improved the production of microorganisms, especially prokaryotic expression systems such as scutellarein and scutellarein in Escherichia coli.

因此，作为本发明的优选方式，提供了一种突变型F6H，其对应于野生型F6H，截去N端第(1-10)～(20-30)位氨基酸；较佳地，截去N端第(2-5)～(22-28)位氨基酸；更佳地，截去N端第2-25位氨基酸。Therefore, as a preferred mode of the present invention, a mutant F6H is provided, which corresponds to the wild-type F6H, and amino acids at positions (1-10) to (20-30) of the N-terminal are truncated; preferably, the N-terminal truncated (2-5)-(22-28) amino acids at the terminal; more preferably, amino acids 2-25 at the N-terminus are truncated.

作为本发明的优选方式，还提供了一种含有所述F6H或突变型F6H的融合蛋白，其包括F6H或任一突变型F6H，以及与之融合的多肽标签，所述的多肽标签选自：8RP，Sumo，MBP，2B1，或它们的组合；较佳地为MBP或2B1。所述的多肽标签与所述F6H或突变型F6H之间，可以包含或不包含连接肽，所述的连接肽不影响两者的生物学活性。As a preferred mode of the present invention, there is also provided a fusion protein containing the F6H or mutant F6H, which includes F6H or any mutant F6H, and a polypeptide tag fused thereto, and the polypeptide tag is selected from: 8RP, Sumo, MBP, 2B1, or a combination thereof; preferably MBP or 2B1. The polypeptide tag and the F6H or mutant F6H may or may not contain a connecting peptide, and the connecting peptide does not affect the biological activity of the two.

作为本发明的优选方式，提供了一种突变型CPR，其对应于野生型CPR，截去N端第(1-20)～(60-85)位氨基酸；较佳地，截去N端第(2-10)～(65-80)位氨基酸；更佳地，截去N端第(2-5)～(70-75)位氨基酸。As a preferred mode of the present invention, a mutant CPR is provided, which corresponds to the wild-type CPR, and amino acids (1-20) to (60-85) of the N-terminus are truncated; (2-10)-(65-80) amino acids; more preferably, the N-terminal (2-5)-(70-75) amino acids are truncated.

在上述优选的蛋白(包括上述野生型的蛋白，突变型的蛋白)的基础上，本发明还包括它们保留生物活性的片段、衍生物和类似物。它们的蛋白片段、衍生物或类似物可以是若干个(通常为1-50个，更佳地1-20个，还更佳如1-10个、1-5个、1-3个、或1-2个)氨基酸的缺失、插入和/或取代，以及在C末端和/或N末端添加或缺失一个或数个(例如100个以内、80个以内、50个以内、20个以内，较佳地为10个以内，更佳地为5个以内)氨基酸。例如，在本领域中，用性能相近或相似的氨基酸进行取代时，通常不会改变蛋白质的功能。又比如，在C末端和/或N末端添加或缺失一个或数个氨基酸通常也不会改变蛋白质的功能。但是，对于上述的突变型蛋白，在其进一步的变化形式中，均进行了如上所述的针对N端的截短。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes their fragments, derivatives and analogs that retain biological activity. Their protein fragments, derivatives or analogs can be several (usually 1-50, more preferably 1-20, even more preferably 1-10, 1-5, 1-3, or 1-2) deletion, insertion and/or substitution of amino acids, and addition or deletion of one or several (for example, within 100, within 80, within 50, within 20 at the C-terminal and/or N-terminal, relatively Preferably within 10, more preferably within 5) amino acids. For example, in the art, substitutions with amino acids with similar or similar properties generally do not change the function of the protein. As another example, adding or deleting one or several amino acids at the C-terminus and/or N-terminus usually does not change the function of the protein. However, for the above-mentioned mutant proteins, in their further variant forms, the above-mentioned N-terminal truncation is carried out.

在上述优选的蛋白(包括上述野生型的蛋白，突变型的蛋白)的基础上，本发明还包括它们的类似物。这些类似物与天然蛋白的差别可以是氨基酸序列上的差异，也可以是不影响序列的修饰形式上的差异，或者兼而有之。这些蛋白包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到，如通过辐射或暴露于诱变剂而产生随机诱变，还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物，以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解，本发明的蛋白并不限于上述例举的代表性的蛋白。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes their analogs. The difference between these analogs and the natural protein may be the difference in amino acid sequence, or the difference in the modified form that does not affect the sequence, or both. These proteins include natural or induced genetic variants. Induced variants can be obtained by various techniques, such as random mutagenesis by radiation or exposure to mutagens, but also by site-directed mutagenesis or other techniques known in molecular biology. Analogs also include analogs with residues other than natural L-amino acids (eg, D-amino acids), and analogs with non-naturally occurring or synthetic amino acids (eg, β, γ-amino acids). It should be understood that the proteins of the present invention are not limited to the representative proteins exemplified above.

在上述优选的蛋白(包括上述野生型的蛋白，突变型的蛋白)的基础上，本发明还包括与所述的蛋白同源性高(比如与所列举的具体蛋白序列的同源性为70％或更高；优选地同源性为80％或更高；更优选地同源性为90％或更高，如同源性95％，98％或99％)的、且具有相应多肽相同功能的蛋白也包括在本发明内。On the basis of the above-mentioned preferred proteins (including the above-mentioned wild-type proteins and mutant proteins), the present invention also includes proteins with high homology (such as 70% homology with the specific protein sequence listed) % or higher; preferably homology is 80% or higher; more preferably homology is 90% or higher, such as homology 95%, 98% or 99%), and has the same function as the corresponding polypeptide The proteins are also included in the present invention.

本发明中列举了来自特定物种的蛋白或基因。应理解，虽然本发明中优选研究了获自特定物种的蛋白或基因，但是获自其它物种的与所述蛋白或基因高度同源(如具有60％以上，如70％，80％，85％、90％、95％、甚至98％序列相同性)的其它蛋白或基因也在本发明考虑的范围之内。Proteins or genes from specific species are listed in the present invention. It should be understood that although proteins or genes obtained from specific species are preferably studied in the present invention, proteins or genes obtained from other species are highly homologous (such as having more than 60%, such as 70%, 80%, 85% , 90%, 95%, or even 98% sequence identity) other proteins or genes are also within the scope of the present invention.

发明还涉及本发明还提供了编码本发明的蛋白或其保守性变异蛋白的多核苷酸序列。本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码本发明的突变体成熟蛋白的多核苷酸包括：只编码成熟蛋白的编码序列；成熟蛋白的编码序列和各种附加编码序列；成熟蛋白的编码序列(和任选的附加编码序列)以及非编码序列。The invention also relates to the invention also provides a polynucleotide sequence encoding the protein of the invention or its conservative variant protein. A polynucleotide of the invention may be in the form of DNA or RNA. Forms of DNA include cDNA, genomic DNA or synthetic DNA. DNA can be single-stranded or double-stranded. DNA can be either the coding strand or the non-coding strand. The polynucleotide encoding the mutant mature protein of the present invention includes: the coding sequence encoding only the mature protein; the coding sequence of the mature protein and various additional coding sequences; the coding sequence of the mature protein (and optional additional coding sequences) and non- coding sequence.

本发明还包括针对所述基因的序列，进行密码子优化后形成的多核苷酸序列，例如，根据宿主细胞的偏好进行密码子优化。The present invention also includes the polynucleotide sequence formed by codon-optimizing the sequence of the gene, for example, codon-optimizing according to the preference of the host cell.

本发明中，还构建了一种高产黄芩素和野黄芩素的工程菌株，其中包括外源引入的表达F6H(特别是所述突变型F6H或融合蛋白)以及CPR(特别是所述突变型F6H或融合蛋白)的基因。培养该重组菌株，并在培养体系中加入白杨素或芹菜素，从而可生产黄芩素或野黄芩素。In the present invention, a high-yielding baicalein and scutellarein engineering strain is also constructed, which includes exogenously introduced expression F6H (especially the mutant F6H or fusion protein) and CPR (especially the mutant F6H or fusion protein) genes. The recombinant bacterial strain is cultivated, and chrysin or apigenin is added into the culture system, so as to produce baicalein or scutellarein.

本发明中，还构建了另一种高产黄芩素或野黄芩素的工程菌株，其中包括外源引入的表达F6H(特别是所述突变型F6H或融合蛋白)以及CPR(特别是所述突变型F6H或融合蛋白)的基因，以及白杨素或芹菜素合成基因。所述的白杨素或芹菜素合成基因包括：表达PAL、4CL、CHS、CHI和FNSI蛋白的基因。In the present invention, another kind of high-yielding baicalein or scutellarein engineering strain is also constructed, which includes the expression of F6H (especially the mutant F6H or fusion protein) and CPR (especially the mutant F6H or fusion protein) gene, and chrysin or apigenin synthesis gene. The chrysin or apigenin synthesis genes include: genes expressing PAL, 4CL, CHS, CHI and FNSI proteins.

本发明的菌株稳定性好，并可实现在生物反应器中规模性培养及生产黄芩素或野黄芩素。本发明优选的菌株的黄芩素或野黄芩素得率非常高。The bacterial strain of the invention has good stability, and can realize large-scale cultivation and production of baicalein or scutellarein in a bioreactor. The yield of baicalein or scutellarein in the preferred strain of the present invention is very high.

本发明中，通过大肠杆菌生产黄芩素或野黄芩素，实现黄芩素或野黄芩素更经济、更方便的制造。In the present invention, Escherichia coli produces baicalein or scutellarein to realize more economical and convenient manufacture of baicalein or scutellarein.

本发明还提供了用于生产黄芩素或野黄芩素工程菌株的试剂盒。此外，其中还可包括大肠杆菌培养基，黄芩素或野黄芩素分离或检测试剂，使用说明书等。The invention also provides a kit for producing baicalein or scutellarein engineering strains. In addition, it may also include Escherichia coli culture medium, baicalein or scutellarein separation or detection reagents, instructions for use, and the like.

下面结合具体实施例，进一步阐述本发明。应理解，这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法，通常按照常规条件如J.萨姆布鲁克等编著，分子克隆实验指南，第三版，科学出版社，2002中所述的条件，或按照制造厂商所建议的条件。Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods not indicating specific conditions in the following examples are usually according to the conditions described in J. Sambrook et al., edited by J. Sambrook et al., Molecular Cloning Experiment Guide, Third Edition, Science Press, 2002, or according to the conditions described in the manufacturer suggested conditions.

实验材料Experimental Materials

AxyPrep总RNA小量制备试剂盒，多聚酶链式反应(PCR)胶回收试剂盒，质粒抽提试剂盒均为美国Axygen产品；PrimeScript RT reagent Kit with gDNA Eraser(PerfectReal Time)聚合酶试剂盒，聚合酶链式反应(PCR)高保真酶PrimeSTAR Max DNAPolymerase为日本宝生物公司(TAKARA)产品；限制性内切酶均为NEB产品。AxyPrep Total RNA Miniprep Kit, Polymerase Chain Reaction (PCR) Gel Recovery Kit, and Plasmid Extraction Kit are all products of Axygen in the United States; PrimeScript RT reagent Kit with gDNA Eraser (PerfectReal Time) Polymerase Kit, Polymerase The chain reaction (PCR) high-fidelity enzyme PrimeSTAR Max DNAPolymerase is a product of TAKARA; the restriction endonucleases are all NEB products.

大肠杆菌DH10B用于基因克隆，大肠杆菌BL21(DE3)菌株用于蛋白表达和黄芩素、野黄芩素的生产。pET28a、pEDDuet-1、pCDFDuet-1载体用于代谢途径基因装配。Escherichia coli DH10B was used for gene cloning, and Escherichia coli BL21 (DE3) strain was used for protein expression and production of baicalein and wild scutellarein. The pET28a, pEDDuet-1, pCDFDuet-1 vectors were used for metabolic pathway gene assembly.

标准品化合物黄芩素和野黄芩素购自上海源叶生物科技有限公司。其他试剂为国产分析纯或色谱纯试剂，购自国药集团化学试剂有限公司。The standard compounds, baicalein and scutellarein, were purchased from Shanghai Yuanye Biotechnology Co., Ltd. Other reagents were domestic analytically pure or chromatographically pure reagents, which were purchased from Sinopharm Chemical Reagent Co., Ltd.

PCR使用Arktik Thermal Cycler(Thermo Fisher Scientific)；恒温培养使用ZXGP-A2050恒温培养箱和ZWY-211G恒温培养振荡器；离心使用5418R高速冷冻式离心机和5418小型离心机(Eppendorf)。真空浓缩使用Concentrator plus浓缩仪(Eppendorf)；OD₆₀₀使用UV-1200紫外可见分光光度计检测(上海美谱达仪器有限公司)。旋转蒸发系统由IKARV 10digital旋转蒸发仪(IKA)和MZ 2C NT化学隔膜泵、CVC3000真空控制器(vacuubrand)组成。高效液相色谱使用Dionex UltiMate 3000液相色谱系统(Thermo FisherScientific)。Arktik Thermal Cycler (Thermo Fisher Scientific) was used for PCR; ZXGP-A2050 constant temperature incubator and ZWY-211G constant temperature culture shaker were used for constant temperature culture; 5418R high-speed refrigerated centrifuge and 5418 small centrifuge (Eppendorf) were used for centrifugation. Concentrator plus concentrator (Eppendorf) was used for vacuum concentration; OD ₆₀₀ was detected by UV-1200 ultraviolet-visible spectrophotometer (Shanghai Meipuda Instrument Co., Ltd.). The rotary evaporation system consists of IKARV 10digital rotary evaporator (IKA), MZ 2C NT chemical diaphragm pump, and CVC3000 vacuum controller (vacuubrand). For high performance liquid chromatography, a Dionex UltiMate 3000 liquid chromatography system (Thermo Fisher Scientific) was used.

液相检测条件：A相：0.1％甲酸水，B相：乙腈；分离条件：0-20min 20％B相-55％B相，20-22min 55％B相-100％B相，22-27min 100％B相，27-35min100％B相-20％B相，35-40min，20％B相；检测波长：340nm，柱温：30℃。色谱柱：Thermo syncronis C18反相柱(250mm×4.6mm，5μm)。Liquid phase detection conditions: A phase: 0.1% formic acid water, B phase: acetonitrile; separation conditions: 0-20min 20% B phase-55% B phase, 20-22min 55% B phase-100% B phase, 22-27min 100% Phase B, 27-35min100% Phase B-20% Phase B, 35-40min, 20% Phase B; detection wavelength: 340nm, column temperature: 30°C. Chromatographic column: Thermo syncronis C18 reverse phase column (250mm×4.6mm, 5μm).

实施例1、多肽及其序列优化Embodiment 1, polypeptide and its sequence optimization

1、F6H多肽序列的优化改造1. Optimization and transformation of F6H polypeptide sequence

来自黄岑(Scutellaria baicalensis)的F6H(SbF6H)长度为517aa(Genbankaccess no.ASW21050.1)，具体序列如下(SEQ ID NO:1)：F6H (SbF6H) from Scutellaria baicalensis is 517aa in length (Genbankaccess no.ASW21050.1), and the specific sequence is as follows (SEQ ID NO: 1):

MELSSVIYGAIALLSLFYCYLHFSKPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*M ELSSVIYGAIALLSLFYCYLHFSK PKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGN LSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPY LQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

改造1：本发明人针对SEQ ID NO:1进行序列改造，去除其中第2-25位氨基酸，在N端加上MA两个氨基酸，获得改进的F6H突变体trF6H，具体序列如下(SEQ ID NO:2)：Modification 1: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added two amino acids of MA to the N-terminal to obtain an improved F6H mutant trF6H. The specific sequence is as follows (SEQ ID NO :2):

MAMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MA MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSET TKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAF SEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

改造2：本发明人针对SEQ ID NO:1进行序列改造，去除其中第2-25位氨基酸，再在N端加上8RP的氨基酸序列，获得改进的F6H突变体8RPtrF6H，具体序列如下(SEQ ID NO:3)：Modification 2: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of 8RP to the N-terminal to obtain an improved F6H mutant 8RPtrF6H, the specific sequence of which is as follows (SEQ ID NO:3):

MALLLAVFMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MALLLAVF MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYG GDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFA GVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

改造3：本发明人针对SEQ ID NO:1进行序列改造，去除其中第2-25位氨基酸，再在N端加上Sumo的氨基酸序列，获得改进的F6H突变体SumotrF6H，具体序列如下(SEQ ID NO:4)：Modification 3: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of Sumo to the N-terminal to obtain an improved F6H mutant SumotrF6H, the specific sequence of which is as follows (SEQ ID NO:4):

MADSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYD GIRIQADQTPEDLDMEDNDIIEAHREQIGGMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MADSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYD GIRIQADQTPEDLDMEDNDIIEAHREQIGGMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTK RWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDC TVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

改造4：本发明人针对SEQ ID NO:1进行序列改造，去除其中第2-25位氨基酸，再在N端加上MBP的氨基酸序列，获得改进的F6H突变体MBPtrF6H，具体序列如下(SEQ ID NO:5)：Modification 4: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of MBP to the N-terminus to obtain an improved F6H mutant MBPtrF6H. The specific sequence is as follows (SEQ ID NO:5):

MAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDR FGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK GKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFN KGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN KDKPLGAVALKSYEEELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDR FGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK GKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFN KGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN KDKPLGAVALKSYEEELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTK RWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDC TVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

改造5：本发明人针对SEQ ID NO:1进行序列改造，去除其中第2-25位氨基酸，再在N端加上2B1的氨基酸序列，获得改进的F6H突变体2B1trF6H，具体序列如下(SEQ ID NO:6)：Modification 5: The inventors modified the sequence of SEQ ID NO: 1, removed amino acids 2-25, and added the amino acid sequence of 2B1 to the N-terminus to obtain an improved F6H mutant 2B1trF6H. The specific sequence is as follows (SEQ ID NO:6):

MAKKTSSKGKLPPGPSMPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVVGKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSLVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMRLYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY* MAKKTSSKGKLPPGPS MPKKSSLNAPPEAGGARFITGHLHLMDGRSASDKLPHINLGLLADQHGPIFTIRLGVHRAVVVSSWELAKEIFTTHDTAVMARPRLIADDYLSYDGASLGFSPYGPYWREIRKLVTTELLSARRIELQRATRVREITQFTGELYKLWEEKKDGSGRVLVDMKQWLGNLSLNLVSRMVV GKRFYGGDDSETTKRWRGVMREFFQLIGQFIPGDGLPFLRWLDLGGFEKRTRDTAYELDKIIAMWLAEYRKREYSGDDKEQCFMALMLSVQANPTLQLHYDADTIIKATCQVLISAASDTTTVILIWVISLLLNNADVLKKVQEELDEQVGRERRVEESDISNLPYLQAVVKETMR LYPPAPFAGVRAFSEDCTVGGYHIQKGTFLIVNLWKLHRDPRVWSDDALEFKPQRFFDKKVEVKGQDFELMPFGGGRRMCPGSSNLGMHMVHFVLANILQAFDITTGSTVDMTESVGLTNMKATPLDAILTPRLSPTLY*

2、CPR的优化改造2. Optimization and transformation of CPR

来自拟南芥(Arabidopsis thaliana)的CPR(AtCPR)序列长度712aa(Genebankaccess no.NP_849472.2)，具体如下(SEQ ID NO:7)：The length of the CPR (AtCPR) sequence from Arabidopsis thaliana (Arabidopsis thaliana) is 712aa (Genebankaccess no.NP_849472.2), specifically as follows (SEQ ID NO: 7):

MSSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIV MLVWRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW* MSSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIV MLVW *

本发明人针对SEQ ID NO:1进行序列改造，删除其中第2-72位氨基酸，获得改进的AtCPR突变体trAtCPR，具体如下(SEQ ID NO:8)：The inventors performed sequence modification on SEQ ID NO: 1, deleted amino acids 2-72, and obtained an improved AtCPR mutant trAtCPR, specifically as follows (SEQ ID NO: 8):

MRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW*MRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDD FTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAG KVDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEY VQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW*

实施例2、包含新型F6H突变体的重组质粒的构建Embodiment 2, the construction of the recombinant plasmid that comprises novel F6H mutant

以pETDuet-1为出发质粒，将AtCPR通过一步克隆方法连接到NdeI和XhoI位点获得质粒pYH45。Using pETDuet-1 as the starting plasmid, AtCPR was connected to NdeI and XhoI sites by a one-step cloning method to obtain plasmid pYH45.

以pETDuet-1为出发质粒，将trAtCPR通过一步克隆方法连接到NdeI和XhoI位点获得质粒pYH46。Using pETDuet-1 as the starting plasmid, trAtCPR was connected to NdeI and XhoI sites by a one-step cloning method to obtain plasmid pYH46.

进一步，通过金斯瑞合成密码子优化的F6H的编码基因序列并连接到pUC19载体中，获得pUC19-F6H。以F6H-F/R为引物，以pUC19-F6H为模板进行PCR扩增，体系50μL(PrimeSTAR Max Premix，25μL；双引物终浓度0.2～0.3μM；pUC19-F6H 0.2uL；剩余体积用灭菌蒸馏水补足)。PCR反应条件：98℃预变性2min，然后98℃变性10s，55℃退火15s，72℃延伸20s，25个循环，琼脂糖电泳检测，扩增得到约1.5kb的片段，纯化后用Nco I和BamH I双酶切消化。消化后片段连入相同酶消化过的pYH46，连接产物转化大肠杆菌DH10B感受态细胞，提取质粒经构建时引入酶切位点双酶切验证和基因测序，得到重组质粒pYH59。同样地，将消化后片段连入pYH45，得到重组质粒pYH59。Further, GenScript synthesized the codon-optimized F6H coding gene sequence and ligated it into the pUC19 vector to obtain pUC19-F6H. Use F6H-F/R as the primer and pUC19-F6H as the template for PCR amplification, the system is 50 μL (PrimeSTAR Max Premix, 25 μL; the final concentration of double primers is 0.2-0.3 μM; pUC19-F6H 0.2uL; the remaining volume is sterilized with distilled water make up). PCR reaction conditions: pre-denaturation at 98°C for 2min, then denaturation at 98°C for 10s, annealing at 55°C for 15s, extension at 72°C for 20s, 25 cycles, detected by agarose electrophoresis, amplified to obtain a fragment of about 1.5kb, purified with Nco I and BamH I double enzyme digestion. The digested fragment was ligated into pYH46 digested with the same enzyme, and the ligated product was transformed into Escherichia coli DH10B competent cells. The extracted plasmid was constructed by introducing restriction sites for double digestion verification and gene sequencing to obtain the recombinant plasmid pYH59. Similarly, the digested fragment was ligated into pYH45 to obtain recombinant plasmid pYH59.

以trF6H-F/F6H-R为引物，pUC19-trF6H为模板，通过PCR扩增获得的产物在通过一步克隆方法连接到pYH46中的NdeI和BamH I位点获得质粒pYH58。Using trF6H-F/F6H-R as primers and pUC19-trF6H as a template, the product obtained by PCR amplification was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain plasmid pYH58.

以pUC19-trF6H为模板，以8RP-trF6H-F/F6H-R为引物扩增后，用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中，获得pYH60。Using pUC19-trF6H as a template and 8RP-trF6H-F/F6H-R as primers to amplify, the product was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH60.

以pETSumo(购自Invitrogen公司)为模板，以Sumo-F/Sumo-trF6H-R为引物获得含有Sumo序列的DNA片段，以pUC19-trF6H为模板，Sumo-trF6H-F/F6H-R为引物扩增获得含有trF6HDNA片段。以上述2个DNA片段为模板，以Sumo-F/F6H-R为引物进行融合PCR后，用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中，获得pYH61。Using pETSumo (purchased from Invitrogen) as a template and Sumo-F/Sumo-trF6H-R as primers to obtain a DNA fragment containing the Sumo sequence, using pUC19-trF6H as a template and Sumo-trF6H-F/F6H-R as primers to amplify The HDNA fragment containing trF6 was obtained by increasing. Using the above two DNA fragments as templates and Sumo-F/F6H-R as primers for fusion PCR, the product was ligated into the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH61.

以pMAL-c5x(购自Invitrogen公司)为模板，以MBP-F/MBP-trF6H-R为引物PCR扩增获得含有MBP序列的DNA片段，以pUC19-trF6H为模板，MBP-trF6H-F/F6H-R为引物扩增获得含有trF6H的DNA片段，以上述获得的2个DNA片段为模板，MBP-F/F6H-R为引物进行融合PCR后，用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中，获得pYH62。Using pMAL-c5x (purchased from Invitrogen) as a template, using MBP-F/MBP-trF6H-R as a primer to amplify the DNA fragment containing the MBP sequence, using pUC19-trF6H as a template, MBP-trF6H-F/F6H -R is a primer to amplify the DNA fragment containing trF6H, use the two DNA fragments obtained above as a template, MBP-F/F6H-R as a primer for fusion PCR, and use a one-step cloning method to connect the product to NdeI and pYH46 In the BamHI site, pYH62 was obtained.

以pUC19-trF6H为模板，以2B1-F/F6H-R为引物扩增后，用一步克隆方法将产物连接到pYH46的NdeI和BamH I位点中，获得pYH63。Using pUC19-trF6H as a template and 2B1-F/F6H-R as a primer to amplify, the product was connected to the NdeI and BamH I sites of pYH46 by a one-step cloning method to obtain pYH63.

以trF6H-F/F6H-R为引物，pUC19-trF6H为模板，通过PCR扩增获得的产物再通过一步克隆方法连接到pYH45的NdeI和BamH I位点中，得到重组质粒pYH64。Using trF6H-F/F6H-R as primers and pUC19-trF6H as template, the product amplified by PCR was then ligated into the NdeI and BamH I sites of pYH45 by a one-step cloning method to obtain recombinant plasmid pYH64.

以pMAL-c5x为模板，以MBP-F/MBP-trF6H-R为引物PCR扩增获得含有MBP序列的DNA片段，以pUC19-trF6H为模板，MBP-trF6H-F/F6H-R为引物扩增获得含有trF6H的DNA片段，以上述获得的2个DNA片段为模板，MBP-F/F6H-R为引物进行融合PCR后，用一步克隆方法将产物连接到pYH45的NdeI和BamH I位点中，获得pYH65。Using pMAL-c5x as a template and MBP-F/MBP-trF6H-R as primers to amplify the DNA fragment containing the MBP sequence, using pUC19-trF6H as a template and MBP-trF6H-F/F6H-R as primers to amplify To obtain a DNA fragment containing trF6H, use the two DNA fragments obtained above as templates and MBP-F/F6H-R as primers for fusion PCR, and then use a one-step cloning method to connect the product to the NdeI and BamH I sites of pYH45, pYH65 was obtained.

以pUC19-2B1trF6H为模板，以2B1-F/F6H-R为引物扩增后，用一步克隆方法将产物连接到pYH45的NdeI和BamH I位点中，获得pYH66。质粒pYH66的构建示意图如图3所示。Using pUC19-2B1trF6H as template and 2B1-F/F6H-R as primer to amplify, the product was connected to NdeI and BamH I sites of pYH45 by one-step cloning method to obtain pYH66. The schematic diagram of the construction of plasmid pYH66 is shown in FIG. 3 .

以上构建过程所用的引物如表1。所构建的质粒中关键元件的示意图如图9A。The primers used in the above construction process are listed in Table 1. A schematic diagram of the key elements in the constructed plasmid is shown in Figure 9A.

表1Table 1

实施例3、表达PAL、4CL、CHS、CHI和FNSI的重组质粒的构建Embodiment 3, the construction of the recombinant plasmid expressing PAL, 4CL, CHS, CHI and FNSI

通过金斯瑞合成来源于红景天(Rhodotorula toruloides)的PAL基因、(GenBank登录号AAA33883.1)、来源于欧芹(Petroselium crispum)的4CL基因(GenBank登录号KF765780.1)、来源于矮牵牛(Petunia X hybrida)的CHS基因(GenBank登录号KF765781.1)、来源于苜蓿(Medicago sativa)的CHI基因(GenBank登录号KF765782.1)、来源于欧芹(Petroselium crispum)的FNS I基因(Swiss-Prot登录号Q7XZQ8.1)，并构建到pET28a载体上获得pET28-PAL，pET28-4CL，pET28-CHS，pET28a-CHI，pET28a-FNSI。GenScript synthesized the PAL gene from Rhodotorula toruloides (GenBank accession number AAA33883.1), the 4CL gene from parsley (Petroselium crispum) (GenBank accession number KF765780.1), and the CHS gene (GenBank accession number KF765781.1) from morning glory (Petunia X hybrida), CHI gene (GenBank accession number KF765782.1) from alfalfa (Medicago sativa), FNS I gene from parsley (Petroselium crispum) (Swiss-Prot accession number Q7XZQ8.1), and constructed on the pET28a vector to obtain pET28-PAL, pET28-4CL, pET28-CHS, pET28a-CHI, pET28a-FNSI.

合成如表2的引物。以pET28-4CL为模板，4CL-F-NcoI/4CL-R-BamHI为引物获得PCR产物，经与NcoI/BamHI双酶切的pCDFDuet-1连接获得pYH40。Synthesize the primers shown in Table 2. Using pET28-4CL as a template and 4CL-F-NcoI/4CL-R-BamHI as primers to obtain a PCR product, which was ligated with pCDFDuet-1 digested with NcoI/BamHI to obtain pYH40.

以pET28-CHS为模板，CHS-F-NdeI/CHS-R-XhoI为引物获得PCR产物，经与NdeI/XhoI双酶切的pYH40连接获得pYH50。Using pET28-CHS as a template and CHS-F-NdeI/CHS-R-XhoI as primers to obtain a PCR product, which was ligated with NdeI/XhoI double-digested pYH40 to obtain pYH50.

以pET28a-CHI为模板，T7CHI-F-XhoI/CHI-R-AvrII为引物获得PCR产物，连接到pYH50获得pYH51。Using pET28a-CHI as a template and T7CHI-F-XhoI/CHI-R-AvrII as primers to obtain a PCR product, which was connected to pYH50 to obtain pYH51.

以pET28-PAL为模板，T7PAL-F-BamH I/PAL-R-Hind III为引物获得PCR产物，经BamH I/Hind III双酶切后与同样双酶切的pYH51连接，获得质粒pYH55。Using pET28-PAL as a template and T7PAL-F-BamH I/PAL-R-Hind III as primers to obtain a PCR product, which was digested with BamH I/Hind III and ligated with the same double digested pYH51 to obtain plasmid pYH55.

以pET28a-FNSI为模板，FNSI-HindIII-F/FNSI-NotI-R)为引物获得PCR产物，经Hind III/Not I双酶切后与同样双酶切的pYH55连接，获得最终载体pYH57。质粒pYH57的构建示意图如图4所示。Using pET28a-FNSI as a template and FNSI-HindIII-F/FNSI-NotI-R) as primers to obtain a PCR product, which was digested with Hind III/Not I and ligated with the same double digested pYH55 to obtain the final vector pYH57. The schematic diagram of the construction of plasmid pYH57 is shown in FIG. 4 .

表2Table 2

实施例4、黄芩素和野黄芩素合成菌株的构建及功能验证Example 4, construction and functional verification of baicalein and scutellarein synthetic strains

黄芩素和野黄芩素的生物合成的示意图如图1。The schematic diagram of the biosynthesis of baicalein and scutellarein is shown in Fig. 1 .

将重组质粒pYH66和pYH57共转化到大肠杆菌BL21(DE3)的感受态细胞中以获得工程菌株BL21(DE3)-pYH57-pYH66。The recombinant plasmids pYH66 and pYH57 were co-transformed into competent cells of Escherichia coli BL21(DE3) to obtain engineering strain BL21(DE3)-pYH57-pYH66.

LB固体培养基(壮观霉素80μg/mL，氨苄青霉素100μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL，氨苄青霉素100μg/mL)，转接过夜培养的菌液到新的10mL MOPS液体抗性培养基中37℃，250r/min培养至OD₆₀₀＝0.5-0.6，水浴降温至16℃左右，然后加入诱导剂IPTG至终浓度1mM，加入不同浓度经灭菌的苯丙氨酸或酪氨酸并转至22℃低温诱导培养，在摇床转速220r/min条件下继续培养48h。携带未连入外来基因的pETDuet-1和pCDFDuet-1空质粒的BL21(DE3)重组菌株作为空白对照，培养操作同上。LB solid medium (spectinomycin 80 μg/mL, ampicillin 100 μg/mL) was cultured overnight at 37°C. Pick a single clone into 2mL LB liquid medium (spectinomycin 80μg/mL, ampicillin 100μg/mL), transfer the overnight cultured bacterial liquid to a new 10mL MOPS liquid resistance medium 37°C, 250r/min culture To OD ₆₀₀ =0.5-0.6, cool the water bath to about 16°C, then add the inducer IPTG to a final concentration of 1mM, add different concentrations of sterilized phenylalanine or tyrosine and transfer to 22°C for low-temperature induction culture, in The culture was continued for 48 hours under the condition of the shaker speed of 220r/min. The BL21(DE3) recombinant strains carrying pETDuet-1 and pCDFDuet-1 empty plasmids that were not linked to foreign genes were used as blank controls, and the cultivation operation was the same as above.

同时，将表2所列的各个重组质粒分别转化大肠杆菌，进行培养，检测各自的产物生成情况。At the same time, each recombinant plasmid listed in Table 2 was transformed into Escherichia coli, cultured, and the production of each product was detected.

培养后，检测各个转化了重组质粒的重组菌株表达化合物的状况，如表3所示。After culturing, the conditions of expression of the compound by each recombinant strain transformed with the recombinant plasmid were detected, as shown in Table 3.

表3table 3

经验证，本发明的各个重组菌株可以成功合成目标化合物。It has been verified that each recombinant strain of the present invention can successfully synthesize the target compound.

工程菌株BL21(DE3)-pYH57-pYH66和标准品黄芩素的HPLC检测图谱如图5。工程菌株BL21(DE3)-pYH57-pYH66产黄芩素的质谱图谱如图6。The HPLC detection patterns of the engineering strain BL21(DE3)-pYH57-pYH66 and the standard baicalein are shown in Figure 5. The mass spectrum of baicalein produced by the engineering strain BL21(DE3)-pYH57-pYH66 is shown in Figure 6.

工程菌株BL21(DE3)-pYH57-pYH66和标准品野黄芩素的HPLC检测图谱如图7。工程菌株BL21(DE3)-pYH57-pYH66产野黄芩素的质谱图谱如图8。The HPLC detection patterns of the engineering strain BL21(DE3)-pYH57-pYH66 and the standard product scutellarein are shown in Figure 7. The mass spectrum of scutellarein produced by the engineering strain BL21(DE3)-pYH57-pYH66 is shown in Figure 8.

实施例5、以白杨素为底物进行生产Embodiment 5, produce with chrysin as substrate

分别将pYH58～pYH66这6个重组质粒转化到大肠杆菌BL21(DE3)的感受态细胞中以获得工程菌株，并命名为BL21(DE3)-pYH58～BL21(DE3)-pYH66。The six recombinant plasmids pYH58-pYH66 were transformed into competent cells of Escherichia coli BL21(DE3) to obtain engineering strains, which were named BL21(DE3)-pYH58-BL21(DE3)-pYH66.

LB固体培养基(氨苄青霉素100μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(氨苄青霉素100μg/mL)，转接过夜培养的菌液到新的20mL MOPS液体抗性培养基中37℃，250r/min培养至OD₆₀₀＝0.5-0.6，水浴降温至16℃左右，然后加入诱导剂IPTG至终浓度1mM，在摇床转速220r/min，温度22℃条件下继续培养12h。将上述发酵液并在6000rpm，4℃，10min条件下离心，去掉上清，收集菌体。用反应缓冲液(50mM Tris-HCl，pH 7.4，0.1％Trixton)重悬至OD₆₀₀＝30。取1mL各重组菌的重悬液，向其中加入5μL白杨素(25mM)和2.5μLNADPH(100mM)，继续在37℃反应8小时。待反应结束后，向反应液中加入10μL HCl(6M)和1mL乙酸乙酯进行萃取3次，浓缩有机相获得的残留物用200μL甲醇溶解后取10μL进行HPLC分析。LB solid medium (ampicillin 100 μg/mL) was cultured overnight at 37°C. Pick a single clone into 2mL LB liquid medium (ampicillin 100μg/mL), transfer the overnight cultured bacterial liquid to a new 20mL MOPS liquid resistance medium at 37°C, 250r/min and cultivate until OD ₆₀₀ = 0.5-0.6 , the water bath was cooled to about 16°C, and then the inducer IPTG was added to a final concentration of 1mM, and the culture was continued for 12h under the conditions of a shaking table with a rotation speed of 220r/min and a temperature of 22°C. The above fermentation broth was centrifuged at 6000rpm, 4°C, 10min, the supernatant was removed, and the cells were collected. Resuspend to OD ₆₀₀ =30 with reaction buffer (50 mM Tris-HCl, pH 7.4, 0.1% Trixton). 1 mL of the resuspension of each recombinant bacteria was taken, 5 μL of chrysin (25 mM) and 2.5 μL of NADPH (100 mM) were added thereto, and the reaction was continued at 37° C. for 8 hours. After the reaction was completed, 10 μL of HCl (6M) and 1 mL of ethyl acetate were added to the reaction solution for extraction three times, and the residue obtained by concentrating the organic phase was dissolved in 200 μL of methanol, and 10 μL was taken for HPLC analysis.

各个重组大肠杆菌中，催化白杨素生成黄芩素的转化率如图9B。The conversion rate of catalyzing chrysin to baicalein in each recombinant E. coli is shown in Figure 9B.

各个重组大肠杆菌催化反应液的HPLC分析结果如图9C。The HPLC analysis results of each recombinant Escherichia coli catalytic reaction solution are shown in Figure 9C.

在本发明提及的所有文献都在本申请中引用作为参考，就如同每一篇文献被单独引用作为参考那样。此外应理解，在阅读了本发明的上述讲授内容之后，本领域技术人员可以对本发明作各种改动或修改，这些等价形式同样落于本申请所附权利要求书所限定的范围。All documents mentioned in this application are incorporated by reference in this application as if each were individually incorporated by reference. In addition, it should be understood that after reading the above teaching content of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.

序列表sequence listing

<110> 中国科学院上海生命科学研究院<110> Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences

<120> 合成黄芩素和野黄芩素的微生物、其制备方法及其应用<120> Microorganisms for Synthesizing Baicalein and Scutellarein, Their Preparation Methods and Applications

<130> 186662<130> 186662

<160> 29<160> 29

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 517<211> 517

<212> PRT<212> PRT

<213> 黄岑(Scutellaria baicalensis)<213> Scutellaria baicalensis

<400> 1<400> 1

Met Glu Leu Ser Ser Val Ile Tyr Gly Ala Ile Ala Leu Leu Ser LeuMet Glu Leu Ser Ser Val Ile Tyr Gly Ala Ile Ala Leu Leu Ser Leu

1 5 10 151 5 10 15

Phe Tyr Cys Tyr Leu His Phe Ser Lys Pro Lys Lys Ser Ser Leu AsnPhe Tyr Cys Tyr Leu His Phe Ser Lys Pro Lys Lys Ser Ser Ser Leu Asn

20 25 30 20 25 30

Ala Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu HisAla Pro Pro Glu Ala Gly Gly Ala Arg Phe Ile Thr Gly His Leu His

35 40 45 35 40 45

Leu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn LeuLeu Met Asp Gly Arg Ser Ala Ser Asp Lys Leu Pro His Ile Asn Leu

50 55 60 50 55 60

Gly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu GlyGly Leu Leu Ala Asp Gln His Gly Pro Ile Phe Thr Ile Arg Leu Gly

65 70 75 8065 70 75 80

Val His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu IleVal His Arg Ala Val Val Val Ser Ser Trp Glu Leu Ala Lys Glu Ile

85 90 95 85 90 95

Phe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile AlaPhe Thr Thr His Asp Thr Ala Val Met Ala Arg Pro Arg Leu Ile Ala

100 105 110 100 105 110

Asp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro TyrAsp Asp Tyr Leu Ser Tyr Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr

115 120 125 115 120 125

Gly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu LeuGly Pro Tyr Trp Arg Glu Ile Arg Lys Leu Val Thr Thr Glu Leu Leu

130 135 140 130 135 140

Ser Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu IleSer Ala Arg Arg Ile Glu Leu Gln Arg Ala Thr Arg Val Arg Glu Ile

145 150 155 160145 150 155 160

Thr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys AspThr Gln Phe Thr Gly Glu Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp

165 170 175 165 170 175

Gly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn LeuGly Ser Gly Arg Val Leu Val Asp Met Lys Gln Trp Leu Gly Asn Leu

180 185 190 180 185 190

Ser Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr GlySer Leu Asn Leu Val Ser Arg Met Val Val Gly Lys Arg Phe Tyr Gly

195 200 205 195 200 205

Gly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg GluGly Asp Asp Ser Glu Thr Thr Lys Arg Trp Arg Gly Val Met Arg Glu

210 215 220 210 215 220

Phe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro PhePhe Phe Gln Leu Ile Gly Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe

225 230 235 240225 230 235 240

Leu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp ThrLeu Arg Trp Leu Asp Leu Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr

245 250 255 245 250 255

Ala Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr ArgAla Tyr Glu Leu Asp Lys Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg

260 265 270 260 265 270

Lys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala LeuLys Arg Glu Tyr Ser Gly Asp Asp Lys Glu Gln Cys Phe Met Ala Leu

275 280 285 275 280 285

Met Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr AspMet Leu Ser Leu Val Gln Ala Asn Pro Thr Leu Gln Leu His Tyr Asp

290 295 300 290 295 300

Ala Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala AlaAla Asp Thr Ile Ile Lys Ala Thr Cys Gln Val Leu Ile Ser Ala Ala

305 310 315 320305 310 315 320

Ser Asp Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu LeuSer Asp Thr Thr Thr Thr Val Ile Leu Ile Trp Val Ile Ser Leu Leu Leu

325 330 335 325 330 335

Asn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu GlnAsn Asn Ala Asp Val Leu Lys Lys Val Gln Glu Glu Leu Asp Glu Gln

340 345 350 340 345 350

Val Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu ProVal Gly Arg Glu Arg Arg Val Glu Glu Ser Asp Ile Ser Asn Leu Pro

355 360 365 355 360 365

Tyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro AlaTyr Leu Gln Ala Val Val Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala

370 375 380 370 375 380

Pro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly GlyPro Phe Ala Gly Val Arg Ala Phe Ser Glu Asp Cys Thr Val Gly Gly

385 390 395 400385 390 395 400

Tyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys LeuTyr His Ile Gln Lys Gly Thr Phe Leu Ile Val Asn Leu Trp Lys Leu

405 410 415 405 410 415

His Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys ProHis Arg Asp Pro Arg Val Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro

420 425 430 420 425 430

Gln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe GluGln Arg Phe Phe Asp Lys Lys Val Glu Val Lys Gly Gln Asp Phe Glu

435 440 445 435 440 445

Leu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn LeuLeu Met Pro Phe Gly Gly Gly Arg Arg Met Cys Pro Gly Ser Asn Leu

450 455 460 450 455 460

Gly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala PheGly Met His Met Val His Phe Val Leu Ala Asn Ile Leu Gln Ala Phe

465 470 475 480465 470 475 480

Asp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly LeuAsp Ile Thr Thr Gly Ser Thr Val Asp Met Thr Glu Ser Val Gly Leu

485 490 495 485 490 495

Thr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg LeuThr Asn Met Lys Ala Thr Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu

500 505 510 500 505 510

Ser Pro Thr Leu TyrSer Pro Thr Leu Tyr

515 515

<210> 2<210> 2

<211> 495<211> 495

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(495)<222> (1)..(495)

<223> F6H突变体<223> F6H mutant

<400> 2<400> 2

Met Ala Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala GlyMet Ala Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly

1 5 10 151 5 10 15

Gly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg SerGly Ala Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser

20 25 30 20 25 30

Ala Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp GlnAla Ser Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln

35 40 45 35 40 45

His Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val ValHis Gly Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val

50 55 60 50 55 60

Val Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp ThrVal Ser Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr

65 70 75 8065 70 75 80

Ala Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser TyrAla Val Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr

85 90 95 85 90 95

Asp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg GluAsp Gly Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu

100 105 110 100 105 110

Ile Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile GluIle Arg Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu

115 120 125 115 120 125

Leu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly GluLeu Gln Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu

130 135 140 130 135 140

Leu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val LeuLeu Tyr Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu

145 150 155 160145 150 155 160

Val Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val SerVal Asp Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser

165 170 175 165 170 175

Arg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu ThrArg Met Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr

180 185 190 180 185 190

Thr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile GlyThr Lys Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly

195 200 205 195 200 205

Gln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp LeuGln Phe Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu

210 215 220 210 215 220

Gly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp LysGly Gly Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys

225 230 235 240225 230 235 240

Ile Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser GlyIle Ile Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly

245 250 255 245 250 255

Asp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val GlnAsp Asp Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln

260 265 270 260 265 270

Ala Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile LysAla Asn Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys

275 280 285 275 280 285

Ala Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr ValAla Thr Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val

290 295 300 290 295 300

Ile Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val LeuIle Leu Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu

305 310 315 320305 310 315 320

Lys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg ArgLys Lys Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg

325 330 335 325 330 335

Val Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val ValVal Glu Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val

340 345 350 340 345 350

Lys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val ArgLys Glu Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg

355 360 365 355 360 365

Ala Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys GlyAla Phe Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly

370 375 380 370 375 380

Thr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg ValThr Phe Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val

385 390 395 400385 390 395 400

Trp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp LysTrp Ser Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys

405 410 415 405 410 415

Lys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly GlyLys Val Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly

420 425 430 420 425 430

Gly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val HisGly Arg Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His

435 440 445 435 440 445

Phe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly SerPhe Val Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser

450 455 460 450 455 460

Thr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala ThrThr Val Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr

465 470 475 480465 470 475 480

Pro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrPro Leu Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr

485 490 495 485 490 495

<210> 3<210> 3

<211> 501<211> 501

<212> PRT<212> PRT

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(501)<222> (1)..(501)

<223> F6H突变体8RPtrF6H<223> F6H mutant 8RPtrF6H

<400> 3<400> 3

Met Ala Leu Leu Leu Ala Val Phe Met Pro Lys Lys Ser Ser Leu AsnMet Ala Leu Leu Leu Ala Val Phe Met Pro Lys Lys Ser Ser Leu Asn

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

Ser Pro Thr Leu TyrSer Pro Thr Leu Tyr

500 500

<210> 4<210> 4

<211> 591<211> 591

<212> PRT<212> PRT

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(591)<222> (1)..(591)

<223> 的F6H突变体SumotrF6H<223> F6H mutant SumotrF6H

<400> 4<400> 4

Met Ala Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys ProMet Ala Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro

1 5 10 151 5 10 15

Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly SerGlu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser

20 25 30 20 25 30

Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg LeuSer Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu

35 40 45 35 40 45

Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu ArgMet Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg

50 55 60 50 55 60

Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu AspPhe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp

65 70 75 8065 70 75 80

Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln IleLeu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile

85 90 95 85 90 95

Gly Gly Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala GlyGly Gly Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

<210> 5<210> 5

<211> 861<211> 861

<212> PRT<212> PRT

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(861)<222> (1)..(861)

<223> F6H突变体MBPtrF6H<223> F6H mutant MBPtrF6H

<400> 5<400> 5

Met Ala Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly AspMet Ala Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp

1 5 10 151 5 10 15

Lys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys AspLys Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp

20 25 30 20 25 30

Thr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu LysThr Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys

35 40 45 35 40 45

Phe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe TrpPhe Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp

50 55 60 50 55 60

Ala His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala GluAla His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu

65 70 75 8065 70 75 80

Ile Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr TrpIle Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp

85 90 95 85 90 95

Asp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala ValAsp Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val

100 105 110 100 105 110

Glu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro ProGlu Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro

115 120 125 115 120 125

Lys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala LysLys Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys

130 135 140 130 135 140

Gly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr TrpGly Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp

145 150 155 160145 150 155 160

Pro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn GlyPro Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly

165 170 175 165 170 175

Lys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys AlaLys Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala

180 185 190 180 185 190

Gly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn AlaGly Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala

195 200 205 195 200 205

Asp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu ThrAsp Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr

210 215 220 210 215 220

Ala Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr SerAla Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser

225 230 235 240225 230 235 240

Lys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln ProLys Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro

245 250 255 245 250 255

Ser Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala SerSer Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser

260 265 270 260 265 270

Pro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu ThrPro Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr

275 280 285 275 280 285

Asp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala ValAsp Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val

290 295 300 290 295 300

Ala Leu Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile AlaAla Leu Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile Ala

305 310 315 320305 310 315 320

Ala Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile ProAla Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro

325 330 335 325 330 335

Gln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn AlaGln Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala

340 345 350 340 345 350

Ala Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln ThrAla Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr

355 360 365 355 360 365

Met Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly AlaMet Pro Lys Lys Ser Ser Leu Asn Ala Pro Pro Glu Ala Gly Gly Ala

370 375 380 370 375 380

Arg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala SerArg Phe Ile Thr Gly His Leu His Leu Met Asp Gly Arg Ser Ala Ser

385 390 395 400385 390 395 400

Asp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His GlyAsp Lys Leu Pro His Ile Asn Leu Gly Leu Leu Ala Asp Gln His Gly

405 410 415 405 410 415

Pro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val SerPro Ile Phe Thr Ile Arg Leu Gly Val His Arg Ala Val Val Val Ser

420 425 430 420 425 430

Ser Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala ValSer Trp Glu Leu Ala Lys Glu Ile Phe Thr Thr His Asp Thr Ala Val

435 440 445 435 440 445

Met Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp GlyMet Ala Arg Pro Arg Leu Ile Ala Asp Asp Tyr Leu Ser Tyr Asp Gly

450 455 460 450 455 460

Ala Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile ArgAla Ser Leu Gly Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Glu Ile Arg

465 470 475 480465 470 475 480

Lys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu GlnLys Leu Val Thr Thr Glu Leu Leu Ser Ala Arg Arg Ile Glu Leu Gln

485 490 495 485 490 495

Arg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu TyrArg Ala Thr Arg Val Arg Glu Ile Thr Gln Phe Thr Gly Glu Leu Tyr

500 505 510 500 505 510

Lys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val AspLys Leu Trp Glu Glu Lys Lys Asp Gly Ser Gly Arg Val Leu Val Asp

515 520 525 515 520 525

Met Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg MetMet Lys Gln Trp Leu Gly Asn Leu Ser Leu Asn Leu Val Ser Arg Met

530 535 540 530 535 540

Val Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr LysVal Val Gly Lys Arg Phe Tyr Gly Gly Asp Asp Ser Glu Thr Thr Lys

545 550 555 560545 550 555 560

Arg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln PheArg Trp Arg Gly Val Met Arg Glu Phe Phe Gln Leu Ile Gly Gln Phe

565 570 575 565 570 575

Ile Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly GlyIle Pro Gly Asp Gly Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly

580 585 590 580 585 590

Phe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile IlePhe Glu Lys Arg Thr Arg Asp Thr Ala Tyr Glu Leu Asp Lys Ile Ile

595 600 605 595 600 605

Ala Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp AspAla Met Trp Leu Ala Glu Tyr Arg Lys Arg Glu Tyr Ser Gly Asp Asp

610 615 620 610 615 620

Lys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala AsnLys Glu Gln Cys Phe Met Ala Leu Met Leu Ser Leu Val Gln Ala Asn

625 630 635 640625 630 635 640

Pro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala ThrPro Thr Leu Gln Leu His Tyr Asp Ala Asp Thr Ile Ile Lys Ala Thr

645 650 655 645 650 655

Cys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile LeuCys Gln Val Leu Ile Ser Ala Ala Ser Asp Thr Thr Thr Val Ile Leu

660 665 670 660 665 670

Ile Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys LysIle Trp Val Ile Ser Leu Leu Leu Asn Asn Ala Asp Val Leu Lys Lys

675 680 685 675 680 685

Val Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val GluVal Gln Glu Glu Leu Asp Glu Gln Val Gly Arg Glu Arg Arg Val Glu

690 695 700 690 695 700

Glu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys GluGlu Ser Asp Ile Ser Asn Leu Pro Tyr Leu Gln Ala Val Val Lys Glu

705 710 715 720705 710 715 720

Thr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala PheThr Met Arg Leu Tyr Pro Pro Ala Pro Phe Ala Gly Val Arg Ala Phe

725 730 735 725 730 735

Ser Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr PheSer Glu Asp Cys Thr Val Gly Gly Tyr His Ile Gln Lys Gly Thr Phe

740 745 750 740 745 750

Leu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp SerLeu Ile Val Asn Leu Trp Lys Leu His Arg Asp Pro Arg Val Trp Ser

755 760 765 755 760 765

Asp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys ValAsp Asp Ala Leu Glu Phe Lys Pro Gln Arg Phe Phe Asp Lys Lys Val

770 775 780 770 775 780

Glu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly ArgGlu Val Lys Gly Gln Asp Phe Glu Leu Met Pro Phe Gly Gly Gly Arg

785 790 795 800785 790 795 800

Arg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe ValArg Met Cys Pro Gly Ser Asn Leu Gly Met His Met Val His Phe Val

805 810 815 805 810 815

Leu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr ValLeu Ala Asn Ile Leu Gln Ala Phe Asp Ile Thr Thr Gly Ser Thr Val

820 825 830 820 825 830

Asp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro LeuAsp Met Thr Glu Ser Val Gly Leu Thr Asn Met Lys Ala Thr Pro Leu

835 840 845 835 840 845

Asp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu TyrAsp Ala Ile Leu Thr Pro Arg Leu Ser Pro Thr Leu Tyr

850 855 860 850 855 860

<210> 6<210> 6

<211> 509<211> 509

<212> PRT<212> PRT

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(509)<222> (1)..(509)

<223> F6H突变体2B1trF6H<223> F6H mutant 2B1trF6H

<400> 6<400> 6

Met Ala Lys Lys Thr Ser Ser Lys Gly Lys Leu Pro Pro Gly Pro SerMet Ala Lys Lys Thr Ser Ser Lys Gly Lys Leu Pro Pro Gly Pro Ser

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 500 505

<210> 7<210> 7

<211> 712<211> 712

<212> PRT<212> PRT

<213> 拟南芥(Arabidopsis thaliana)<213> Arabidopsis thaliana

<400> 7<400> 7

Met Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met AlaMet Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met Ala

1 5 10 151 5 10 15

Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn AlaAla Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn Ala

20 25 30 20 25 30

Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile GluSer Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile Glu

35 40 45 35 40 45

Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu IleAsn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu Ile

50 55 60 50 55 60

Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn SerGly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn Ser

65 70 75 8065 70 75 80

Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu GluLys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu Glu

85 90 95 85 90 95

Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr GlnGlu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr Gln

100 105 110 100 105 110

Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala LysThr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala Lys

115 120 125 115 120 125

Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp TyrAla Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp Tyr

130 135 140 130 135 140

Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp ValAla Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp Val

145 150 155 160145 150 155 160

Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp AsnAla Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn

165 170 175 165 170 175

Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly GluAla Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly Glu

180 185 190 180 185 190

Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg GlnTrp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln

195 200 205 195 200 205

Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu ValTyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu Val

210 215 220 210 215 220

Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp AspGlu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp Asp

225 230 235 240225 230 235 240

Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp ProGln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp Pro

245 250 255 245 250 255

Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala ThrGlu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala Thr

260 265 270 260 265 270

Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp SerPro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp Ser

275 280 285 275 280 285

Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly TyrGlu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly Tyr

290 295 300 290 295 300

Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val LysThr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val Lys

305 310 315 320305 310 315 320

Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu GluArg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu Glu

325 330 335 325 330 335

Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His ValPhe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His Val

340 345 350 340 345 350

Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu ArgGly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu Arg

355 360 365 355 360 365

Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu LysLeu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu Lys

370 375 380 370 375 380

Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro ProGlu Asp Gly Thr Pro Ile Ser Ser Ser Ser Leu Pro Pro Pro Phe Pro Pro

385 390 395 400385 390 395 400

Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser SerCys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser Ser

405 410 415 405 410 415

Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp ProPro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp Pro

420 425 430 420 425 430

Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys ValThr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys Val

435 440 445 435 440 445

Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu GluAsp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu

450 455 460 450 455 460

Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe PheVal Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe

465 470 475 480465 470 475 480

Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser SerAla Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser

485 490 495 485 490 495

Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu ValSer Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val

500 505 510 500 505 510

Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser ThrTyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr

515 520 525 515 520 525

Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser SerTrp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser

530 535 540 530 535 540

Ala Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp SerAla Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser

545 550 555 560545 550 555 560

Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro PheLys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe

565 570 575 565 570 575

Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val GluArg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu

580 585 590 580 585 590

Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met AspLeu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp

595 600 605 595 600 605

Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala LeuPhe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu

610 615 620 610 615 620

Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu TyrAla Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr

625 630 635 640625 630 635 640

Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met IleVal Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile

645 650 655 645 650 655

Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met AlaSer Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala

660 665 670 660 665 670

Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly SerArg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser

675 680 685 675 680 685

Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr SerMet Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser

690 695 700 690 695 700

Gly Arg Tyr Leu Arg Asp Val TrpGly Arg Tyr Leu Arg Asp Val Trp

705 710705 710

<210> 8<210> 8

<211> 641<211> 641

<212> PRT<212> PRT

<220><220>

<221> VARIANT<221> VARIANT

<222> (1)..(641)<222> (1)..(641)

<223> AtCPR突变体trAtCPR<223> AtCPR mutant trAtCPR

<400> 8<400> 8

Met Arg Arg Ser Gly Ser Gly Asn Ser Lys Arg Val Glu Pro Leu LysMet Arg Arg Ser Gly Ser Gly Asn Ser Lys Arg Val Glu Pro Leu Lys

1 5 10 151 5 10 15

Pro Leu Val Ile Lys Pro Arg Glu Glu Glu Ile Asp Asp Gly Arg LysPro Leu Val Ile Lys Pro Arg Glu Glu Glu Ile Asp Asp Gly Arg Lys

20 25 30 20 25 30

Lys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly PheLys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly Phe

35 40 45 35 40 45

Ala Lys Ala Leu Gly Glu Glu Ala Lys Ala Arg Tyr Glu Lys Thr ArgAla Lys Ala Leu Gly Glu Glu Ala Lys Ala Arg Tyr Glu Lys Thr Arg

50 55 60 50 55 60

Phe Lys Ile Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp Glu TyrPhe Lys Ile Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp Glu Tyr

65 70 75 8065 70 75 80

Glu Glu Lys Leu Lys Lys Glu Asp Val Ala Phe Phe Phe Leu Ala ThrGlu Glu Lys Leu Lys Lys Glu Asp Val Ala Phe Phe Phe Leu Ala Thr

85 90 95 85 90 95

Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys TrpTyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys Trp

100 105 110 100 105 110

Phe Thr Glu Gly Asn Asp Arg Gly Glu Trp Leu Lys Asn Leu Lys TyrPhe Thr Glu Gly Asn Asp Arg Gly Glu Trp Leu Lys Asn Leu Lys Tyr

115 120 125 115 120 125

Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys ValGly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys Val

130 135 140 130 135 140

Ala Lys Val Val Asp Asp Ile Leu Val Glu Gln Gly Ala Gln Arg LeuAla Lys Val Val Asp Asp Ile Leu Val Glu Gln Gly Ala Gln Arg Leu

145 150 155 160145 150 155 160

Val Gln Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp PheVal Gln Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp Phe

165 170 175 165 170 175

Thr Ala Trp Arg Glu Ala Leu Trp Pro Glu Leu Asp Thr Ile Leu ArgThr Ala Trp Arg Glu Ala Leu Trp Pro Glu Leu Asp Thr Ile Leu Arg

180 185 190 180 185 190

Glu Glu Gly Asp Thr Ala Val Ala Thr Pro Tyr Thr Ala Ala Val LeuGlu Glu Gly Asp Thr Ala Val Ala Thr Pro Tyr Thr Ala Ala Val Leu

195 200 205 195 200 205

Glu Tyr Arg Val Ser Ile His Asp Ser Glu Asp Ala Lys Phe Asn AspGlu Tyr Arg Val Ser Ile His Asp Ser Glu Asp Ala Lys Phe Asn Asp

210 215 220 210 215 220

Ile Asn Met Ala Asn Gly Asn Gly Tyr Thr Val Phe Asp Ala Gln HisIle Asn Met Ala Asn Gly Asn Gly Tyr Thr Val Phe Asp Ala Gln His

225 230 235 240225 230 235 240

Pro Tyr Lys Ala Asn Val Ala Val Lys Arg Glu Leu His Thr Pro GluPro Tyr Lys Ala Asn Val Ala Val Lys Arg Glu Leu His Thr Pro Glu

245 250 255 245 250 255

Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ala Gly Ser GlySer Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ala Gly Ser Gly

260 265 270 260 265 270

Leu Thr Tyr Glu Thr Gly Asp His Val Gly Val Leu Cys Asp Asn LeuLeu Thr Tyr Glu Thr Gly Asp His Val Gly Val Leu Cys Asp Asn Leu

275 280 285 275 280 285

Ser Glu Thr Val Asp Glu Ala Leu Arg Leu Leu Asp Met Ser Pro AspSer Glu Thr Val Asp Glu Ala Leu Arg Leu Leu Asp Met Ser Pro Asp

290 295 300 290 295 300

Thr Tyr Phe Ser Leu His Ala Glu Lys Glu Asp Gly Thr Pro Ile SerThr Tyr Phe Ser Leu His Ala Glu Lys Glu Asp Gly Thr Pro Ile Ser

305 310 315 320305 310 315 320

Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Asn Leu Arg Thr Ala LeuSer Ser Leu Pro Pro Pro Phe Pro Pro Cys Asn Leu Arg Thr Ala Leu

325 330 335 325 330 335

Thr Arg Tyr Ala Cys Leu Leu Ser Ser Pro Lys Lys Ser Ala Leu ValThr Arg Tyr Ala Cys Leu Leu Ser Ser Pro Lys Lys Ser Ala Leu Val

340 345 350 340 345 350

Ala Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Glu Arg Leu LysAla Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Glu Arg Leu Lys

355 360 365 355 360 365

His Leu Ala Ser Pro Ala Gly Lys Val Asp Glu Tyr Ser Lys Trp ValHis Leu Ala Ser Pro Ala Gly Lys Val Asp Glu Tyr Ser Lys Trp Val

370 375 380 370 375 380

Val Glu Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro SerVal Glu Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro Ser

385 390 395 400385 390 395 400

Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val Ala Pro Arg LeuAla Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val Ala Pro Arg Leu

405 410 415 405 410 415

Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Pro Lys Ile Ala Glu ThrGln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Ser Pro Lys Ile Ala Glu Thr

420 425 430 420 425 430

Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Met Pro Thr GlyArg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Met Pro Thr Gly

435 440 445 435 440 445

Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val ProArg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val Pro

450 455 460 450 455 460

Tyr Glu Lys Ser Glu Asn Cys Ser Ser Ala Pro Ile Phe Val Arg GlnTyr Glu Lys Ser Glu Asn Cys Ser Ser Ala Pro Ile Phe Val Arg Gln

465 470 475 480465 470 475 480

Ser Asn Phe Lys Leu Pro Ser Asp Ser Lys Val Pro Ile Ile Met IleSer Asn Phe Lys Leu Pro Ser Asp Ser Lys Val Pro Ile Ile Met Ile

485 490 495 485 490 495

Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu ArgGly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu Arg

500 505 510 500 505 510

Leu Ala Leu Val Glu Ser Gly Val Glu Leu Gly Pro Ser Val Leu PheLeu Ala Leu Val Glu Ser Gly Val Glu Leu Gly Pro Ser Val Leu Phe

515 520 525 515 520 525

Phe Gly Cys Arg Asn Arg Arg Met Asp Phe Ile Tyr Glu Glu Glu LeuPhe Gly Cys Arg Asn Arg Arg Met Asp Phe Ile Tyr Glu Glu Glu Leu

530 535 540 530 535 540

Gln Arg Phe Val Glu Ser Gly Ala Leu Ala Glu Leu Ser Val Ala PheGln Arg Phe Val Glu Ser Gly Ala Leu Ala Glu Leu Ser Val Ala Phe

545 550 555 560545 550 555 560

Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met AspSer Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met Asp

565 570 575 565 570 575

Lys Ala Ser Asp Ile Trp Asn Met Ile Ser Gln Gly Ala Tyr Leu TyrLys Ala Ser Asp Ile Trp Asn Met Ile Ser Gln Gly Ala Tyr Leu Tyr

580 585 590 580 585 590

Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Ser LeuVal Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Ser Leu

595 600 605 595 600 605

His Thr Ile Ala Gln Glu Gln Gly Ser Met Asp Ser Thr Lys Ala GluHis Thr Ile Ala Gln Glu Gln Gly Ser Met Asp Ser Thr Lys Ala Glu

610 615 620 610 615 620

Gly Phe Val Lys Asn Leu Gln Thr Ser Gly Arg Tyr Leu Arg Asp ValGly Phe Val Lys Asn Leu Gln Thr Ser Gly Arg Tyr Leu Arg Asp Val

625 630 635 640625 630 635 640

TrpTrp

<210> 9<210> 9

<211> 25<211> 25

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 9<400> 9

tataccatgg aactgagcag tgtga 25tataccatgg aactgagcag tgtga 25

<210> 10<210> 10

<211> 36<211> 36

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 10<400> 10

ctcgaattcg gatccactag tttaatataa agtcgg 36ctcgaattcg gatccactag tttaatataa agtcgg 36

<210> 11<210> 11

<211> 43<211> 43

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 11<400> 11

ctttaagaag gagatatacc atggcgatgc cgaagaaaag ctc 43ctttaagaag gagatatacc atggcgatgc cgaagaaaag ctc 43

<210> 12<210> 12

<211> 63<211> 63

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 12<400> 12

ctttaagaag gagatatacc atggctctgt tattagcagt ttttatgccg aagaaaagct 60ctttaagaag gagatatacc atggctctgt tattagcagt ttttatgccg aagaaaagct 60

ctt 63ctt 63

<210> 13<210> 13

<211> 39<211> 39

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 13<400> 13

ctttaagaag gagatatacc atggctaaaa tcgaagaag 39ctttaagaag gagatatacc atggctaaaa tcgaagaag 39

<210> 14<210> 14

<211> 35<211> 35

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 14<400> 14

ctgaaagacg cgcagactat gccgaagaaa agctc 35ctgaaagacg cgcagactat gccgaagaaa agctc 35

<210> 15<210> 15

<211> 35<211> 35

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 15<400> 15

gagcttttct tcggcatagt ctgcgcgtct ttcag 35gagcttttct tcggcatagt ctgcgcgtct ttcag 35

<210> 16<210> 16

<211> 87<211> 87

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 16<400> 16

ctttaagaag gagatatacc atggctaaga aaacgagctc taaagggaag ctcccaccag 60ctttaagaag gagatatacc atggctaaga aaacgagctc taaagggaag ctcccaccag 60

gacctagcat gccgaagaaa agctctt 87gacctagcat gccgaagaaa agctctt 87

<210> 17<210> 17

<211> 44<211> 44

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 17<400> 17

ctttaagaag gagatatacc atggcggact cagaagtcaa tctt 44ctttaagaag gagatatacc atggcggact cagaagtcaa tctt 44

<210> 18<210> 18

<211> 36<211> 36

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 18<400> 18

gagaacagat tggtggtatg ccgaagaaaa gctctt 36gagaacagat tggtggtatg ccgaagaaaa gctctt 36

<210> 19<210> 19

<211> 36<211> 36

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 19<400> 19

aagagctttt cttcggcata ccaccaatct gttctc 36aagagctttt cttcggcata ccaccaatct gttctc 36

<210> 20<210> 20

<211> 27<211> 27

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 20<400> 20

tataccatgg gtgactgcgt tgccccg 27tataccatgg gtgactgcgt tgccccg 27

<210> 21<210> 21

<211> 30<211> 30

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 21<400> 21

cgggatcctt acttcggcag gtcgccgctc 30cgggatcctt acttcggcag gtcgccgctc 30

<210> 22<210> 22

<211> 29<211> 29

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 22<400> 22

cgggatccct tatgcgactc ctgcattag 29cgggatccct tatgcgactc ctgcattag 29

<210> 23<210> 23

<211> 26<211> 26

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 23<400> 23

gcccaagctt ttatgccagc atcttc 26gcccaagctt ttatgccagc atcttc 26

<210> 24<210> 24

<211> 31<211> 31

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 24<400> 24

agatatacat atggttacgg tggaagaata c 31agatatacat atggttacgg tggaagaata c 31

<210> 25<210> 25

<211> 30<211> 30

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 25<400> 25

ccgctcgagt taggtagcca cactatgcag 30ccgctcgagt taggtagcca cactatgcag 30

<210> 26<210> 26

<211> 30<211> 30

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 26<400> 26

ccgctcgagc tagaaataat tttgtttaac 30ccgctcgagc tagaaataat tttgtttaac 30

<210> 27<210> 27

<211> 28<211> 28

<212> DNA<212> DNA

<213> 引物(Primer)<213> Primer

<400> 27<400> 27

gagcctaggt tagttaccga ttttaaag 28gagcctaggt tagttaccga ttttaaag 28

<210> 28<210> 28

<211> 38<211> 38

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 28<400> 28

gaagatgctg gcataaaagc ttcgatcccg cgaaatta 38gaagatgctg gcataaaagc ttcgatcccg cgaaatta 38

<210> 29<210> 29

<211> 38<211> 38

<212> DNA<212>DNA

<213> 引物(Primer)<213> Primer

<400> 29<400> 29

cgacttaagc attatgcggc cgcctacgcc aggttttc 38cgacttaagc attatgcggc cgcctacgcc aggttttc 38

Claims

1. A method for producing baicalein, comprising:

introducing into a host cell a gene expressing a flavone 6-hydroxylase and a cytochrome P450 oxidoreductase; the flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type enzyme or an engineered mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid sequence of the mutant enzyme is shown as SEQ ID NO. 8; the host cell is an E.coli cell;

culturing the recombinant host cell in a culture system containing chrysin, thereby producing baicalein.

2. A method for converting chrysin into baicalein is characterized in that flavone 6-hydroxylase and cytochrome P450 oxidoreductase are used for catalyzing chrysin, so that a hydroxyl group is added into the structure of chrysin to generate baicalein; the flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type enzyme or an engineered mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid sequence of the mutant enzyme is shown as SEQ ID NO. 8.

3. Use of a recombinant host cell comprising exogenous genes expressing a flavone 6-hydroxylase and a cytochrome P450 oxidoreductase in the production of baicalein; the flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type enzyme or an engineered mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid sequence of the mutant enzyme is shown as SEQ ID NO. 8; the host cell is an E.coli cell.

4. A method of preparing a host cell for the production of baicalein comprising: introducing a gene expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase into a host cell to obtain a recombinant strain; the flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type enzyme or an engineered mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid sequence of the mutant enzyme is shown as SEQ ID NO. 8; the host cell is an E.coli cell.

5. Use of a kit comprising recombinant host cells in the production of baicalein; the recombinant host cell comprises exogenous genes for expressing flavone 6-hydroxylase and cytochrome P450 oxidoreductase; the flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type enzyme or an engineered mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid sequence of the mutant enzyme is shown as SEQ ID NO. 8; the host cell is an E.coli cell.

6. The application of mutant flavone 6-hydroxylase and cytochrome P450 oxidoreductase in the production of baicalein is that the mutant flavone 6-hydroxylase is modified mutant flavone 6-hydroxylase, and the amino acid sequence of the mutant flavone 6-hydroxylase is shown as SEQ ID NO. 6; the cytochrome P450 oxidoreductase is a wild type or mutant enzyme, the amino acid sequence of the wild type enzyme is shown as SEQ ID NO. 7, and the amino acid of the mutant enzyme is shown as SEQ ID NO. 8.

7. The use according to claim 6, wherein the mutant flavone 6-hydroxylase and cytochrome P450 oxidoreductase form a fusion polypeptide.

8. The use according to claim 6, wherein the polynucleotide encoding the mutant flavone 6-hydroxylase and cytochrome P450 oxidoreductase is introduced into a host cell to produce baicalein; the host cell is an E.coli cell.

9. The use according to claim 8, wherein an expression construct is prepared comprising a polynucleotide encoding the mutant flavone 6-hydroxylase and cytochrome P450 oxidoreductase, introduced into a host cell.

10. The use of claim 6, wherein said producing baicalein is by adding a hydroxyl group to the structure of chrysin to form baicalein.