CN108623690A

CN108623690A - A kind of fusion protein of thrombopoietin and its preparation method and application

Info

Publication number: CN108623690A
Application number: CN201710158090.0A
Authority: CN
Inventors: 李红玉; 张丽芸; 王梅竹; 支德娟; 李洋; 赵晶
Original assignee: Lanzhou University
Current assignee: Lanzhou University
Priority date: 2017-03-16
Filing date: 2017-03-16
Publication date: 2018-10-09

Abstract

The invention belongs to genetic engineering pharmaceutical fields, it is related to a kind of fusion protein of thrombopoietin and its preparation method and application, more particularly to thrombopoietin simulating peptide (the Thrombopoietin Mimetic Peptide of primary yeast preferred codons coding, TMP) the fusion protein and its preparation method and application of dyad and human serum albumins molecule (Human serum Albumin, HSA).The fusion egg includes the TMP dyads that a HSA molecule and a yeast preferred codons encode, wherein HSA molecules are located at the ends N of fusion protein, and the TMP dyads of yeast preferred codons coding are located at the ends C of fusion protein.The fusion protein high-caliber stabilization can express in yeast, can be applied to industrial production；Meanwhile the fusion protein has longer half-life period with thrombopoietic characteristic is remarkably promoted in human body, can be used for preparing the drug for treating the diseases such as a variety of primary or secondary thrombocytopenia.

Description

A fusion protein of thrombopoietin and its preparation method and application

技术领域technical field

本发明属于基因工程制药领域，涉及一种促血小板生成素的融合蛋白及其制备方法和应用，具体涉及一种酵母偏好密码子编码的促血小板生成素模拟肽(ThrombopoietinMimetic Peptide，TMP)二联体与人血清白蛋白分子(Human serum Albumin，HSA)的融合蛋白。The invention belongs to the field of genetic engineering pharmaceuticals, and relates to a fusion protein of thrombopoietin and its preparation method and application, in particular to a thrombopoietin mimetic peptide (Thrombopoietin Mimetic Peptide, TMP) duplex encoded by a yeast preferred codon Fusion protein with human serum albumin molecule (Human serum Albumin, HSA).

背景技术Background technique

在临床上常会遇到由各种原因引起的原发性和继发性血小板减少症，如原发性血小板减少性紫癜、再生障碍性贫血以及肿瘤化/放疗造成的血小板减少等。对于这类疾病，比较适合应用长效升血小板药物进行救治，一方面，可以减少用药次数，减少病人的针痛之苦；另一方面，可以减少药物用量，降低治疗费用。In clinical practice, primary and secondary thrombocytopenia caused by various reasons are often encountered, such as primary thrombocytopenic purpura, aplastic anemia, and thrombocytopenia caused by tumor chemotherapy/radiotherapy. For such diseases, it is more suitable to use long-acting platelet-increasing drugs for treatment. On the one hand, it can reduce the number of medications and reduce the pain of acupuncture for patients; on the other hand, it can reduce the dosage of drugs and reduce treatment costs.

血小板由巨核细胞产生，促进巨核细胞的增殖、分化是提高血小板水平的主要手段。促血小板生成素(Thrombopoietin，TMP)，又称巨核细胞生长发育因子(MGDF)，是骨髓巨核细胞增殖、分化和血小板生成的最重要调控因子。1994年TMP基因缺少注释被首次成功克隆后不久，人们即研制出一种聚乙二醇(PEG)化的重组功能型TMP基因工程产品-PEG-rHuMGDF。动物实验结果表明，PEG-rHuMGDF不仅具有突出的促血小板生成活性，而且在体内具有较长的半衰期，单次应用即可达到显著升高血小板的效果。然而，在临床试验过程中人们发现，一些健康受试者在接受该药物治疗后所产生的抗体与内源性TMP存在交差中和反应，导致受试者出现血小板减少，从而使得PEG-rHuMGDF乃至重组人TMP的研制在1998年时即被美国FDA叫停。受其影响，在美、欧、日本等国家至今重组人TMP都未被批准进入临床。Platelets are produced by megakaryocytes, and promoting the proliferation and differentiation of megakaryocytes is the main means to increase the level of platelets. Thrombopoietin (TMP), also known as megakaryocyte growth and development factor (MGDF), is the most important regulator of bone marrow megakaryocyte proliferation, differentiation and platelet production. Shortly after the first successful cloning of TMP gene lacking annotation in 1994, people developed a polyethylene glycol (PEG) recombinant functional TMP genetic engineering product - PEG-rHuMGDF. The results of animal experiments show that PEG-rHuMGDF not only has outstanding platelet-promoting activity, but also has a long half-life in vivo, and a single application can significantly increase platelets. However, in the course of clinical trials, it was found that the antibodies produced by some healthy subjects after receiving the drug treatment had a cross-neutralization reaction with endogenous TMP, resulting in thrombocytopenia in the subjects, so that PEG-rHuMGDF and even The development of recombinant human TMP was stopped by the US FDA in 1998. Affected by it, recombinant human TMP has not been approved for clinical use in the United States, Europe, Japan and other countries.

人血清白蛋白分子(Albumin Human，HSA)是血浆中的主要蛋白成分，约占血浆总蛋白的50-60％。HSA分子是一种非糖基化单链蛋白，共由585个氨基酸残基组成，其分子量为66.5kDa(A.Dugaiczyk et al.，PNAS，1982，79：71-75)，血浆半衰期长达两周以上。HSA分子在宿主细胞中是以一种原肽形式合成的，其中含24个氨基酸残基组成的信号肽和前肽，在转运和分泌过程中信号肽和前肽被切除。HSA分子是一个稳定的惰性蛋白，可以作为药物载体。HSA分子以下同已成功地在多宿主中表达(EP330451和EP361991)，尤其在酵母细胞中可实现HSA分子较高水平的稳定表达。当目的多肽与HSA分子融合表达时不仅可以提高多肽药物的稳定性，还可以增加多肽药物在体内的半衰期。专利号CN201310084212.8的二聚体化融合蛋白的制备及应用中公开了二聚体化促血小板生成素(TPO)模拟肽TMP二联体-人血清白蛋白分子融合蛋白的制备及应用，但是其存在融合蛋白表达效果不理想的问题，为了提高表达产量、增强融合蛋白的生物活性，本发明采用了酵母偏好密码子编码促血小板生成素模拟肽TMP二联体的策略。基于上述研究现状，本发明现提供一种促血小板生成素的融合蛋白，所述的促血小板生成素模拟肽TMP二联体是由酵母偏好密码子编码而成的，所述融合蛋白能在酵母体内高水平的稳定表达，可应用于工业生产；同时，该融合蛋白具有显著促进血小板生成的特性，在人体内具有较长的半衰期。本发明同时提供该融合蛋白的制备方法和应用。Human serum albumin molecule (Albumin Human, HSA) is the main protein component in plasma, accounting for about 50-60% of total plasma protein. HSA molecule is a non-glycosylated single-chain protein consisting of 585 amino acid residues, its molecular weight is 66.5kDa (A.Dugaiczyk et al., PNAS, 1982, 79:71-75), and its plasma half-life is as long as More than two weeks. HSA molecules are synthesized in the form of a propeptide in host cells, which contains a signal peptide and a propeptide consisting of 24 amino acid residues, which are excised during transport and secretion. HSA molecule is a stable inert protein that can be used as a drug carrier. The following HSA molecules have been successfully expressed in multiple hosts (EP330451 and EP361991), especially in yeast cells, which can achieve a higher level of stable expression of HSA molecules. When the target polypeptide is fused with HSA molecule, it can not only improve the stability of the polypeptide drug, but also increase the half-life of the polypeptide drug in vivo. The preparation and application of the dimerization fusion protein of patent number CN201310084212.8 discloses the preparation and application of the dimerization thrombopoietin (TPO) mimetic peptide TMP duplex-human serum albumin molecular fusion protein, but It has the problem that the expression effect of the fusion protein is not ideal. In order to improve the expression yield and enhance the biological activity of the fusion protein, the present invention adopts the strategy of coding the TMP duplex of the thrombopoietin mimetic peptide with yeast preferred codons. Based on the above research status, the present invention now provides a fusion protein of thrombopoietin, the TMP duplex of the thrombopoietin mimetic peptide is encoded by yeast preferred codons, and the fusion protein can be expressed in yeast High-level stable expression in vivo can be applied to industrial production; at the same time, the fusion protein has the property of significantly promoting platelet production and has a longer half-life in the human body. The invention also provides the preparation method and application of the fusion protein.

重组蛋白的异源表达受到基因自身序列特征如密码子偏好性、GC含量、mRNA二级结构、mRNA稳定性等因素的影响。研究表明同义密码子的更换能够影响蛋白质翻译延伸的速率[1-2]，说明密码子的优化可能会对重组蛋白的表达起到适得其反的影响。另一方面，重组蛋白的表达还受到宿主、培养条件、分泌途径、启动子等因素的影响，现行的基因优化理论和设计方法还不成熟，存在着各种各样的局限性，基因优化策略对提高重组蛋白的表达量是必要条件，而不是充分条件。综上所述，通过酵母偏好密码子改造从而改良基因工程菌株表达量的策略在实际生产应用中存在着极大的不确定性，针对不同的目的基因进行改造，最终的结果是无法预知的。The heterologous expression of recombinant proteins is affected by the sequence characteristics of the gene itself, such as codon preference, GC content, mRNA secondary structure, mRNA stability and other factors. Studies have shown that the replacement of synonymous codons can affect the rate of protein translation elongation [1-2], indicating that codon optimization may have a counterproductive effect on the expression of recombinant proteins. On the other hand, the expression of recombinant proteins is also affected by factors such as hosts, culture conditions, secretion pathways, and promoters. The current gene optimization theory and design methods are still immature, and there are various limitations. Gene optimization strategies It is a necessary condition, but not a sufficient condition, to increase the expression of recombinant protein. To sum up, the strategy of improving the expression level of genetically engineered strains through modification of yeast preferred codons has great uncertainty in actual production and application, and the final result of modification for different target genes is unpredictable.

参考文献：references:

[1]Komar AA,Lesnik T,Reiss C.Synonymous codon substitutions affectribosome traffic and protein folding during in vitro translation.FEBS Lett,1999,462(3):387-391.[1] Komar AA, Lesnik T, Reiss C. Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett, 1999, 462(3): 387-391.

[2]Trinh R,Gurbaxani B,Morrison SL,et al.Optimization of codon pairuse within the(GGGGS)3linker sequence results in enhanced proteinexpression.Mol Immunol,2004,40(10):717-722.[2] Trinh R, Gurbaxani B, Morrison SL, et al. Optimization of codon pairuse within the (GGGGS)3linker sequence results in enhanced protein expression. Mol Immunol, 2004, 40(10): 717-722.

发明内容Contents of the invention

本发明的目的在于提供一种促血小板生成素的融合蛋白，该融合蛋白半衰期长，能在宿主体内高水平稳定表达。The purpose of the present invention is to provide a fusion protein of thrombopoietin, which has a long half-life and can be stably expressed at a high level in a host.

本发明的另一个目的是提供上述融合蛋白的制备方法。Another object of the present invention is to provide a method for preparing the above fusion protein.

本发明的另一个目的是提供上述融合蛋白的应用。Another object of the present invention is to provide the application of the above fusion protein.

本发明所述的上述目的是通过以下技术方案实现的：The above-mentioned purpose described in the present invention is achieved through the following technical solutions:

本发明公开的一种促血小板生成素的融合蛋白，包括人血清白蛋白HSA分子、促血小板生成素模拟肽TMP二联体，所述的人血清白蛋白HSA分子的氨基酸序列如SEQ ID NO：4所示，或该氨基酸序列经突变后具有同样功能的序列，编码所述HSA分子的氨基酸序列的DNA序列如SEQ ID NO：3所示，所述的促血小板生成素模拟肽TMP二联体的氨基酸序列如SEQID NO：2所示，或该氨基酸序列经突变后具有同样功能的序列，促血小板生成素模拟肽TMP二联体是由一个酵母偏好密码子编码而成的，编码所述促血小板生成素模拟肽TMP二联体的氨基酸序列的DNA序列如SEQ ID NO：1所示，该融合蛋白还包括可形成二聚体化结构域的肽段。A fusion protein of thrombopoietin disclosed by the present invention comprises human serum albumin HSA molecule and thrombopoietin mimetic peptide TMP duplex, and the amino acid sequence of said human serum albumin HSA molecule is shown as SEQ ID NO: 4, or a sequence having the same function after the amino acid sequence is mutated, the DNA sequence encoding the amino acid sequence of the HSA molecule is shown in SEQ ID NO: 3, and the thrombopoietin mimetic peptide TMP duplex The amino acid sequence of the amino acid sequence is shown in SEQID NO: 2, or a sequence having the same function after the amino acid sequence is mutated. The thrombopoietin mimetic peptide TMP duplex is encoded by a yeast-preferred codon, encoding the promoting The DNA sequence of the amino acid sequence of the thrombopoietin mimetic peptide TMP duplex is shown in SEQ ID NO: 1, and the fusion protein also includes a peptide segment that can form a dimerization domain.

本发明公开的一种促血小板生成素的融合蛋白，可形成二聚体化结构域的肽段中含有至少两个半胱氨酸。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form the dimerization domain contains at least two cysteines.

本发明公开的一种促血小板生成素的融合蛋白，可形成二聚体化结构域的肽段中含有两个半胱氨酸。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form a dimerization domain contains two cysteines.

本发明公开的一种促血小板生成素的融合蛋白，可形成二聚体化结构域的肽段为H肽段，其氨基酸序列如Seq ID No:8所示。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide that can form the dimerization domain is H peptide, and its amino acid sequence is shown in Seq ID No:8.

本发明公开的一种促血小板生成素的融合蛋白，可形成二聚体化结构域的肽段为H肽段和选自fip结构域的肽段，H肽段的氨基酸序列如Seq ID No:8所示，选自fip结构域的肽段的氨基酸序列如Seq ID No:10所示。A fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form a dimerization domain is an H peptide segment and a peptide segment selected from the fip domain, and the amino acid sequence of the H peptide segment is as Seq ID No: 8, the amino acid sequence of the peptide selected from the fip domain is shown in Seq ID No:10.

本发明公开的一种促血小板生成素的融合蛋白，其中HSA分子连接于融合蛋白的C-末端，结构式表示为HSA-L3-H-L4-TMP-L1-TMP，L1、L3、L4表示肽接头，L1的氨基酸序列为GGPSG，L3的氨基酸序列为GGGGSGL，L4的氨基酸序列为EFGGGGS，H表示可形成二聚体化结构域的H肽段。A fusion protein of thrombopoietin disclosed by the present invention, wherein the HSA molecule is connected to the C-terminus of the fusion protein, the structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, and L4 represent peptides Linker, the amino acid sequence of L1 is GGPPSG, the amino acid sequence of L3 is GGGGSGL, the amino acid sequence of L4 is EFGGGGS, and H represents the H peptide that can form the dimerization domain.

本发明公开的一种促血小板生成素的融合蛋白，其中HSA分子位于融合蛋白的N-末端，结构式表示为TMP-L1-TMP-L2-H-fip-HSA，L1和L2表示肽接头，L1的氨基酸序列为GGPSG，L2的氨基酸序列为GGGGSRS，H表示可形成二聚体化结构域的H肽段，fip表示选自fip结构域的肽段；所述融合蛋白的氨基酸序列如SEQ ID NO：6所示，编码所述融合蛋白的核苷酸序列的DNA序列如SEQ ID NO：5所示。A fusion protein of thrombopoietin disclosed by the present invention, wherein the HSA molecule is located at the N-terminal of the fusion protein, the structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 represent peptide linkers, L1 The amino acid sequence of L2 is GGPPG, the amino acid sequence of L2 is GGGGSRS, H represents the H peptide segment that can form a dimerization domain, and fip represents a peptide segment selected from the fip domain; the amino acid sequence of the fusion protein is as shown in SEQ ID NO : shown in 6, the DNA sequence encoding the nucleotide sequence of the fusion protein is shown in SEQ ID NO: 5.

本发明公开了一种促血小板生成素的融合蛋白形成的二聚体化融合蛋白。The invention discloses a dimerized fusion protein formed by fusion protein of thrombopoietin.

本发明公开一种促血小板生成素的融合蛋白，包括人血清白蛋白HSA分子、促血小板生成素模拟肽TMP二联体，所述的人血清白蛋白HSA分子的氨基酸序列如SEQ ID NO：4所示，或该氨基酸序列经突变后具有同样功能的序列，编码所述HSA分子的氨基酸序列的DNA序列如SEQ ID NO：3所示，所述的促血小板生成素模拟肽TMP二联体的氨基酸序列如SEQ IDNO：2所示，或该氨基酸序列经突变后具有同样功能的序列，所述的促血小板生成素模拟肽TMP二联体是由一个酵母偏好密码子编码而成的，编码所述的促血小板生成素模拟肽TMP二联体的氨基酸序列的DNA序列如SEQ ID NO：1所示，该融合蛋白还包括二聚体化结构域。The invention discloses a fusion protein of thrombopoietin, which comprises human serum albumin HSA molecule and thrombopoietin mimetic peptide TMP duplex, and the amino acid sequence of said human serum albumin HSA molecule is as shown in SEQ ID NO: 4 shown, or the sequence having the same function after the amino acid sequence is mutated, the DNA sequence encoding the amino acid sequence of the HSA molecule is shown in SEQ ID NO: 3, and the thrombopoietin mimic peptide TMP duplex The amino acid sequence is shown in SEQ ID NO: 2, or a sequence having the same function after the amino acid sequence is mutated. The TMP duplex of the thrombopoietin mimetic peptide is encoded by a yeast preferred codon, encoding the The DNA sequence of the amino acid sequence of the thrombopoietin mimetic peptide TMP duplex is shown in SEQ ID NO: 1, and the fusion protein also includes a dimerization domain.

本发明公开的一种促血小板生成素的融合蛋白，所述的二聚体化结构域是通过二硫键的共价作用使促血小板生成素的融合蛋白二聚体化。The invention discloses a fusion protein of thrombopoietin, wherein the dimerization domain dimers the fusion protein of thrombopoietin through the covalent action of disulfide bonds.

本发明公开的一种促血小板生成素的融合蛋白，所述的二聚体化结构域中含有至少两个由半胱氨酸形成的二硫键。In the fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain contains at least two disulfide bonds formed by cysteine.

本发明公开的一种促血小板生成素的融合蛋白，所述的二聚体化结构域中含有两个由半胱氨酸形成的二硫键。In the fusion protein of thrombopoietin disclosed by the invention, the dimerization domain contains two disulfide bonds formed by cysteine.

本发明公开的一种促血小板生成素的融合蛋白，所述的二聚体化结构域包含H肽段，H肽段的氨基酸序列如Seq ID No:8所示。In the fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain includes an H peptide segment, and the amino acid sequence of the H peptide segment is shown in Seq ID No:8.

本发明公开的一种促血小板生成素的融合蛋白，二聚体化结构域包含H肽段和选自fip结构域的肽段，H肽段的氨基酸序列如Seq ID No:8所示，选自fip结构域的肽段的氨基酸序列如Seq ID No:10所示。In a fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain includes an H peptide segment and a peptide segment selected from the fip domain, and the amino acid sequence of the H peptide segment is shown in Seq ID No: 8, selected from The amino acid sequence of the peptide from the fip domain is shown in Seq ID No:10.

本发明公开的促血小板生成素的融合蛋白，所述HSA分子连接于融合蛋白的C-末端，结构式表示为HSA-L3-H-L4-TMP-L1-TMP，L1、L3、L4表示肽接头，L1的氨基酸序列为GGPSG，L3的氨基酸序列为GGGGSGL，L4的氨基酸序列为EFGGGGS，H表示可形成二聚体化结构域的H肽段。In the fusion protein of thrombopoietin disclosed by the present invention, the HSA molecule is connected to the C-terminus of the fusion protein, and the structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, and L4 represent peptide linkers , the amino acid sequence of L1 is GGPPSG, the amino acid sequence of L3 is GGGGSGL, the amino acid sequence of L4 is EFGGGGS, and H represents the H peptide that can form the dimerization domain.

本发明公开的一种促血小板生成素的融合蛋白，所述HSA分子位于融合蛋白的N-末端，结构式表示为TMP-L1-TMP-L2-H-fip-HSA，L1和L2表示肽接头，L1的氨基酸序列为GGPSG，L2的氨基酸序列为GGGGSRS，H表示可形成二聚体化结构域的H肽段，fip表示选自fip结构域的肽段；所述融合蛋白的氨基酸序列如SEQ ID NO：6所示，编码所述融合蛋白的核苷酸序列的DNA序列如SEQ ID NO：5所示。A fusion protein of thrombopoietin disclosed by the present invention, the HSA molecule is located at the N-terminal of the fusion protein, the structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 represent peptide linkers, The amino acid sequence of L1 is GGPPG, the amino acid sequence of L2 is GGGGSRS, H represents the H peptide segment that can form a dimerization domain, and fip represents a peptide segment selected from the fip domain; the amino acid sequence of the fusion protein is as shown in SEQ ID As shown in NO:6, the DNA sequence encoding the nucleotide sequence of the fusion protein is shown in SEQ ID NO:5.

本发明公开的一种促血小板生成素的融合蛋白形成的二聚体化融合蛋白。The invention discloses a dimerization fusion protein formed by fusion protein of thrombopoietin.

本发明公开的一种促血小板生成素的融合蛋白，所述融合蛋白采用酵母细胞表达制备。The invention discloses a fusion protein of thrombopoietin, which is expressed and prepared by yeast cells.

本发明公开的一种促血小板生成素的融合蛋白，所述的酵母为嗜甲醇毕赤酵母。The invention discloses a fusion protein of thrombopoietin, and the yeast is Pichia methanolophilus.

本发明公开的一种促血小板生成素的融合蛋白在制备治疗原发性或继发性血小板减少症的药物中的应用。The application of a fusion protein of thrombopoietin disclosed by the invention in the preparation of medicine for treating primary or secondary thrombocytopenia.

本发明公开的一种促血小板生成素的融合蛋白的制备方法，所述步骤如下：A preparation method of a fusion protein of thrombopoietin disclosed by the present invention, the steps are as follows:

1)全基因合成酵母偏好密码子编码的促血小板生成素模拟肽TMP二联体序列；1) Whole-gene synthesis of the TMP duplex sequence encoded by yeast preferred codons;

2)通过PCR扩增获取HSA分子序列；2) Obtain the HSA molecular sequence by PCR amplification;

3)通过限制性内切酶酶切、连接并转化大肠杆菌，获得含编码所述酵母偏好密码子编码的促血小板生成素模拟肽二联体与人血清白蛋白分子的融合蛋白的DNA序列的重组表达载体；3) Digesting with restriction endonucleases, ligating and transforming Escherichia coli to obtain the DNA sequence encoding the fusion protein of the thrombopoietin mimetic peptide duplex encoded by the yeast preferred codon and the human serum albumin molecule Recombinant expression vector;

4)将步骤3)所述的重组表达载体转化到感受态大肠杆菌TOP10，再转化到宿主表达系统进行表达，即得所述融合蛋白。4) Transform the recombinant expression vector described in step 3) into competent Escherichia coli TOP10, and then transform into a host expression system for expression to obtain the fusion protein.

一种含有上述促血小板生成素的融合蛋白编码基因的重组表达载体。可用于携带编码本发明融合蛋白的基因的表达载体包括但不局限于原核、真核表达系统常用的质粒。A recombinant expression vector containing the gene encoding the fusion protein of the above-mentioned thrombopoietin. The expression vectors that can be used to carry the gene encoding the fusion protein of the present invention include, but are not limited to, plasmids commonly used in prokaryotic and eukaryotic expression systems.

一种含有上述重组表达载体的宿主表达系统。宿主可以是细菌、酵母及哺乳动物细胞等，其中优选的是酵母，更优选的是嗜甲醇毕赤酵母；对于重组表达的本发明融合蛋白可以通过多种方法从相应的细胞培养物中进行提取、纯化，这些方法包括离心、超滤及液相层析等技术，其中液相层析又包括了离子交换、疏水和分子筛等层析技术。A host expression system containing the above-mentioned recombinant expression vector. The host can be bacteria, yeast, mammalian cells, etc., wherein yeast is preferred, and Pichia methanolophilus is more preferred; the fusion protein of the present invention expressed recombinantly can be extracted from the corresponding cell culture by various methods , Purification, these methods include techniques such as centrifugation, ultrafiltration and liquid chromatography, among which liquid chromatography includes chromatography techniques such as ion exchange, hydrophobicity and molecular sieves.

本发明所述融合蛋白可以与药物载体一起组成药物制剂使用，这些药物载体包括水、盐水、糖类、醇类及氨基酸等。由本发明融合蛋白制成的药物制剂优选的是含水量低于3％或不含水的冻干制剂，给药方式包括静脉输注、注射(包括皮下和肌肉注射)、鼻内、呼吸道等，其中优选的是皮下或肌肉注射。The fusion protein of the present invention can be used together with pharmaceutical carriers to form pharmaceutical preparations, and these pharmaceutical carriers include water, saline, sugars, alcohols, amino acids and the like. The pharmaceutical preparation made of the fusion protein of the present invention is preferably a lyophilized preparation with a water content of less than 3% or no water, and the administration methods include intravenous infusion, injection (including subcutaneous and intramuscular injection), intranasal, respiratory tract, etc., wherein Subcutaneous or intramuscular injection is preferred.

所述融合蛋白能在酵母体内稳定高效的表达，可应用于工业生产；同时，该融合蛋白具有显著促进血小板生成的特性，在人体内具有较长的半衰期，可用于制备治疗多种原发性或继发性血小板减少症等疾病的药物。The fusion protein can be stably and efficiently expressed in yeast, and can be applied to industrial production; at the same time, the fusion protein has the characteristic of significantly promoting platelet production, has a long half-life in the human body, and can be used to prepare and treat various primary diseases. Or drugs for diseases such as secondary thrombocytopenia.

以下提供具体实施例以实现本发明所述的一种促血小板生成素的融合蛋白及其制备方法和应用。但不限于这些实施例。Specific examples are provided below to realize a thrombopoietin fusion protein of the present invention and its preparation method and application. But not limited to these examples.

附图说明：Description of drawings:

图1：插入有HSA分子基因的pcDNA3.1-HSA载体，作为构建促血小板生成素的融合蛋白表达载体的模板。Figure 1: The pcDNA3.1-HSA vector inserted with the HSA molecular gene is used as a template for constructing a fusion protein expression vector of thrombopoietin.

图2：促血小板生成素的融合蛋白表达载体图谱Figure 2: The fusion protein expression vector map of thrombopoietin

具体实施方式Detailed ways

主要实验仪器：Main experimental instruments:

移液枪、超净工作台(安泰)、磁力搅拌器、微波炉、高温蒸汽灭菌锅、-80℃低温冰箱(Forma)、超纯水仪(Millipore)、制冰机、离心机(Hitachi)、HDB-PLUS型恒温金属浴、HZQ-F16OA型恒温振荡培养箱(上海一恒)、PCR仪(Applied Biosystems)、台式冷冻离心机(Thermo)、DYY-8B型电泳仪(伯乐)、Image Quant 300型凝胶成像仪(GE)等。Pipette gun, ultra-clean bench (Antai), magnetic stirrer, microwave oven, high-temperature steam sterilizer, -80°C low-temperature refrigerator (Forma), ultra-pure water instrument (Millipore), ice maker, centrifuge (Hitachi) , HDB-PLUS constant temperature metal bath, HZQ-F16OA constant temperature shaking incubator (Shanghai Yiheng), PCR instrument (Applied Biosystems), desktop refrigerated centrifuge (Thermo), DYY-8B electrophoresis instrument (Bole), Image Quant 300 gel imager (GE), etc.

主要实验材料：Main experimental materials:

1.限制性核酸内切酶KpnI、EcoRI、SacI、AflII(NEB公司产品，美国)1. Restriction endonucleases KpnI, EcoRI, SacI, AflII (products of NEB Company, USA)

2.小提质粒试剂盒、PCR纯化试剂盒、DNA胶回收试剂盒(生工公司，中国)2. Small extraction plasmid kit, PCR purification kit, DNA gel recovery kit (Shenggong Company, China)

3.T4DNA连接酶试剂盒(Takara公司产品，中国大连)3. T4 DNA Ligase Kit (product of Takara Company, Dalian, China)

4.载体pPinkα-HC、嗜甲醇毕赤酵母菌株(Invitrogen公司产品，美国)4. Vector pPinkα-HC, Pichia methanolophilus strain (product of Invitrogen, USA)

5.大肠杆菌TOP10(天根生化科技(北京)有限公司)5. Escherichia coli TOP10 (Tiangen Biochemical Technology (Beijing) Co., Ltd.)

6.酵母提取物、蛋白胨(Oxford公司产品，美国)6. Yeast extract, peptone (product of Oxford, USA)

7.LB培养基7. LB medium

酵母提取物5g，蛋白胨10g，NaCl 10g，溶于1000ml去离子水中，并用1mol/L的NaOH调节pH值至7.0，高压蒸气灭菌。5 g of yeast extract, 10 g of peptone, and 10 g of NaCl were dissolved in 1000 ml of deionized water, adjusted to pH 7.0 with 1 mol/L NaOH, and sterilized by autoclaving.

8.YPD培养基8. YPD medium

酵母提取物10g，胰蛋白胨20g，Agar 20g，溶于900ml去离子水中，高压灭菌，冷却后加入100ml经滤器除菌后的20％的右旋糖。10 g of yeast extract, 20 g of tryptone, and 20 g of Agar were dissolved in 900 ml of deionized water, sterilized by high pressure, and after cooling, 100 ml of 20% dextrose sterilized by a filter were added.

9.YPDS培养基9. YPDS medium

酵母提取物10g，蛋白胨20g，山梨糖醇182.2g，溶于900ml去离子水中，高压灭菌，冷却后加入100ml经滤器除菌后的20％的右旋糖。10 g of yeast extract, 20 g of peptone, and 182.2 g of sorbitol were dissolved in 900 ml of deionized water, sterilized by high pressure, and 100 ml of 20% dextrose sterilized by a filter were added after cooling.

10.BMGY液体培养基10. BMGY liquid medium

酵母提取物10g，蛋白胨20g，无氨基酸酵母氮源13.4g，甘油10g，磷酸钾26.631g，溶于1000ml双蒸水中高压灭菌，冷至室温，调节pH至6.0，4℃保存备用。Yeast extract 10g, peptone 20g, amino acid-free yeast nitrogen source 13.4g, glycerol 10g, potassium phosphate 26.631g, dissolve in 1000ml double distilled water, autoclave, cool to room temperature, adjust pH to 6.0, store at 4°C for later use.

11.1％琼脂糖凝胶的配置11.1% Agarose gel configuration

根据用量，每100ml的TAE缓冲液，加入1g琼脂糖，使用微波炉加热煮沸，使琼脂糖完全融解，室温冷却至不烫手时滴加少量溴化乙锭(EB)，混匀后将其倒入事先摆放好梳子的胶槽中，待到室温冷却至完全凝固后拔去梳子即可使用。According to the dosage, for every 100ml of TAE buffer, add 1g of agarose, heat and boil in a microwave oven to completely melt the agarose, drop a small amount of ethidium bromide (EB) when it is cooled at room temperature until it is not hot, mix well and pour it into Place the comb in the glue tank in advance, wait until it cools at room temperature until it is completely solidified, then pull out the comb and use it.

实施例1HSA-TMP融合蛋白酵母表达载体的构建与表达Construction and expression of embodiment 1 HSA-TMP fusion protein yeast expression vector

一、p29-simple-TMP序列的获得1. Acquisition of p29-simple-TMP sequence

1.首先根据嗜甲醇毕赤酵母偏爱密码子对编码(TMP)₂的基因序列进行优化。1. First, optimize the gene sequence encoding (TMP) ₂ according to the preferred codons of Pichia methanolophilus.

2.委托大连Takara公司合成优化后的(TMP)₂基因，(TMP)₂的DNA序列如SEQ ID NO：1所示，并将其装载到p29-simple(p29-simple质粒载体大连Takara公司提供)，获得载体p29-simple-(TMP)₂。2. Entrust Dalian Takara Company to synthesize the optimized (TMP) ₂ gene, the DNA sequence of (TMP) ₂ is shown in SEQ ID NO: 1, and load it into p29-simple (p29-simple plasmid vector provided by Dalian Takara Company ) to obtain the vector p29-simple-(TMP) ₂ .

3.其中p29-simple-TMP中已包含L，即连接肽，LDNA序列为GGCGGCGGCGGTTCCGGACTGGAGCCCAAGAGCTGCGACAAGACCCACACCTGCCCTCCCTGCGAATTCGGTGGTGGCGGCAGC，氨基酸序列为GGGGSGLEPKSCDKTHTCPPCEF GGGGS。3. The p29-simple-TMP already contains L, namely the connecting peptide, the LDNA sequence is GGCGGCGGCGGTTCCGGACTGGAGCCCAAGAGCTGCGACAAGACCCACACCTGCCCTCCCTGCGAATTCGGTGGTGGCGGCAGC, and the amino acid sequence is GGGGSGLEPKSCDKTHTCPPCEF GGGGS.

二、HSA cDNA的片段的克隆：2. Cloning of fragments of HSA cDNA:

从质粒pcDNA3.1-fip-HSA克隆获得(HSA全基因序列由宝生物公司全基因合成，pcDNA3.1质粒购自Invitrogen)Obtained from plasmid pcDNA3.1-fip-HSA clone (HSA complete gene sequence was synthesized by Bao Biological Company, pcDNA3.1 plasmid was purchased from Invitrogen)

三、HSA分子序列的获得3. Obtaining the molecular sequence of HSA

1.设计合成PCR引物：1. Design and synthesize PCR primers:

P1:GGTACCTCATAAGCCTAAGGCAGCTTGP1: GGTACCTCATAAGCCTAAGGCAGCTTG

P2:TCCGGAGATGCACACAAGAGTGAGP2:TCCGGAGATGCACACAAGAGTGAG

2.PCR扩增：以载体pcDNA3.1-HSA的DNA为模版，以P1和P2分别作为上、下游引物，进行PCR扩增。反应条件如下：①变性：94℃，5min；②变性：94℃，1min；③复性：55℃，30S；④延伸：72℃，2min；⑤返回步骤“②”，35循环；⑥延伸：72℃，5min，总循环次数为30次。将PCR产物进行1％琼脂糖凝胶电泳，结果显示扩增出约1.8kb大小的HSA分子的DNA条带。2. PCR amplification: The DNA of the carrier pcDNA3.1-HSA was used as a template, and P1 and P2 were used as upstream and downstream primers respectively to carry out PCR amplification. The reaction conditions are as follows: ① Denaturation: 94°C, 5min; ② Denaturation: 94°C, 1min; ③ Renaturation: 55°C, 30S; ④ Extension: 72°C, 2min; ⑤ Return to step "②", 35 cycles; ⑥ Extend: 72°C, 5min, the total number of cycles is 30 times. The PCR product was subjected to 1% agarose gel electrophoresis, and the result showed that a DNA band of about 1.8 kb in size was amplified.

四、pPinkα-HC-HSA-(TMP)₂融合蛋白酵母表达载体的构建4. Construction of pPinkα-HC-HSA-(TMP) ₂ fusion protein yeast expression vector

1.提取p29-simple-(TMP)₂载体，KpnI和EcoRI双酶切质粒，胶回收对应的TMP(KpnI/EcoRI)DNA片段,DNA序列如SEQ ID NO：1所示，氨基酸序列如SEQ ID NO：2所示；1. Extract p29-simple-(TMP) ₂ vector, KpnI and EcoRI double digestion plasmid, gel recovery corresponding TMP (KpnI/EcoRI) DNA fragment, the DNA sequence is shown in SEQ ID NO: 1, and the amino acid sequence is shown in SEQ ID NO: as shown in 2;

2.将HSA分子经PCR扩增后的产物EcoRI和SacI双酶切，胶回收对应的HSA(EcoRI/SacI)DNA片段,DNA序列如SEQ ID NO：3所示，氨基酸序列如SEQ ID NO：4所示；2. Digest the HSA molecule amplified by PCR with EcoRI and SacI, and recover the corresponding HSA (EcoRI/SacI) DNA fragment from the gel. The DNA sequence is shown in SEQ ID NO: 3, and the amino acid sequence is shown in SEQ ID NO: 4 shown;

3.同时，SacI和KpnI双酶切载体pPinkα-HC(Invitrogen公司产品)的DNA，胶回收pPinkα-HC(SacI/KpnI)载体片段；3. At the same time, SacI and KpnI double-enzyme-cut the DNA of the carrier pPinkα-HC (product of Invitrogen Company), and gel recovered the pPinkα-HC (SacI/KpnI) carrier fragment;

4.T4DNA酶连接TMP(KpnI/EcoRI)DNA片段、HSA(EcoRI/SacI)DNA片段和pPinkα-HC(SacI/KpnI)载体片段，转化感受态大肠杆菌TOP10，涂布于氨苄抗性LB板37℃培养过夜，筛选阳性克隆。所得克隆送Invitrogen公司测序，序列正确的克隆命名为pPINKα-HC-HSA-(TMP)2。4. T4 DNA enzyme ligated TMP (KpnI/EcoRI) DNA fragments, HSA (EcoRI/SacI) DNA fragments and pPinkα-HC (SacI/KpnI) vector fragments, transformed into competent Escherichia coli TOP10, and coated on ampicillin-resistant LB plate 37 Cultivate overnight at ℃, and select positive clones. The obtained clone was sent to Invitrogen Company for sequencing, and the clone with the correct sequence was named pPINKα-HC-HSA-(TMP)2.

五、HSA-TMP融合蛋白在酵母中的表达5. Expression of HSA-TMP fusion protein in yeast

将测序正确的载体pPinkα-HC-HSA-(TMP)₂的DNA用AflII酶切回收后得到pPinkα-HC-HSA-(TMP)₂，转化酵母感受态细胞。然后将转化菌液接种于PAD平板，30℃培养3-4天，挑取阳性克隆。将得到阳性克隆分别接种BMGY液体培养基，30℃培养48小时，然后转接至BMMY培养基中诱导表达，持续96小时后，1500rpm低温离心15分钟，取上清，SDS-PAGE电泳检测蛋白表达情况，分子量约70kD蛋白条带即为HSA-TMP融合蛋白，所述融合蛋白的氨基酸序列如SEQ ID NO：6，编码所述融合蛋白的氨基酸序列的DNA序列如SEQ ID NO：5所示。选择表达水平最高的菌株作为工程菌，冻存于-80℃保种DNA of vector pPinkα-HC-HSA-(TMP) ₂ sequenced correctly was digested with AflII and recovered to obtain pPinkα-HC-HSA-(TMP) ₂ , which was transformed into competent yeast cells. Then the transformed bacteria liquid was inoculated on the PAD plate, cultured at 30°C for 3-4 days, and positive clones were picked. The positive clones were inoculated into BMGY liquid medium, cultured at 30°C for 48 hours, and then transferred to BMMY medium to induce expression. After 96 hours, centrifuged at 1500rpm for 15 minutes at low temperature, took the supernatant, and detected protein expression by SDS-PAGE electrophoresis In this case, the protein band with a molecular weight of about 70kD is the HSA-TMP fusion protein, the amino acid sequence of the fusion protein is shown in SEQ ID NO: 6, and the DNA sequence encoding the amino acid sequence of the fusion protein is shown in SEQ ID NO: 5. Select the strain with the highest expression level as the engineering strain, and freeze it at -80°C for preservation

实施例2HSA-(TMP)₂融合蛋白的生物活性检测The biological activity detection of embodiment 2HSA-(TMP) ₂ fusion protein

1、实验过程1. Experimental process

细胞铺板，c-mpl、c-fos、pAdVAntage、Renilla，四质粒共转染。转染后48h，分别加入阳性对照、空白对照和受试样品，分别检测萤火虫荧光素酶活力和海肾荧光素酶活力，计算荧光比率。Cell plating, c-mpl, c-fos, pAdVAntage, Renilla, four plasmid co-transfection. 48 hours after transfection, the positive control, blank control and test samples were added to detect firefly luciferase activity and Renilla luciferase activity, respectively, and calculate the fluorescence ratio.

阳性对照：特比澳特比澳(重组人促血小板生成素注射液)购买自沈阳三生制药有限责任公司Positive control: TPIAO TPIAO (recombinant human thrombopoietin injection) purchased from Shenyang Sansheng Pharmaceutical Co., Ltd.

空白对照：血清Blank control: serum

荧光比率＝萤火虫荧光值/海肾荧光值。Fluorescence ratio = firefly fluorescence value/renilla fluorescence value.

具体步骤详见中国专利(CN201310084212)。For specific steps, see Chinese patent (CN201310084212).

2.实验结果2. Experimental results

表1HSA-(TMP)₂融合蛋白的生物活性检测结果The biological activity detection result of table 1HSA-(TMP) ₂ fusion protein

实验组test group 相对荧光比率relative fluorescence ratio 空白对照组Blank control group 1.00±0.021.00±0.02 阳性对照组positive control group 1.44±0.121.44±0.12 1-1号酵母菌株Yeast strain No. 1-1 1.37±0.051.37±0.05 1-3号酵母菌株Yeast strains 1-3 1.68±0.011.68±0.01 1-4号酵母菌株Yeast strains 1-4 1.70±0.041.70±0.04 2-1号酵母菌株Yeast strain No. 2-1 1.80±0.161.80±0.16 2-2号酵母菌株Yeast strain No. 2-2 1.69±0.121.69±0.12 2-4号酵母菌株Yeast strains 2-4 2.01±0.162.01±0.16 3-2号酵母菌株Yeast strain No. 3-2 1.11±0.031.11±0.03 3-3号酵母菌株Yeast strain No. 3-3 1.27±0.091.27±0.09 3-4号酵母菌株Yeast strain No. 3-4 1.37±0.111.37±0.11 4-1号酵母菌株Yeast strain No. 4-1 2.00±0.052.00±0.05 4-2号酵母菌株Yeast strain No. 4-2 2.98±0.092.98±0.09 4-3号酵母菌株Yeast strain No. 4-3 1.97±0.041.97±0.04 4-4号酵母菌株Yeast strain No. 4-4 1.79±0.061.79±0.06

结果表明，所有转染不含c-mpl受体的pcDNA3.1-HSA空载体各处理组，相对荧光比率均为1。所有转染c-mpl-pcDNA3.1载体的各处理组数据见上表1。由表1可知所检测13株酵母菌表达样品中的HSA-(TMP)₂融合蛋白均具有显著的c-mpl受体依赖的生物活性。其中4-2号酵母菌株所表达的融合蛋白活性显著高于阳性对照。特比奥阳性药物及通过对TMP二联体核苷酸序列进行偏好密码子改造后的13株HSA-(TMP)₂融合蛋白酵母表达菌株均具有受体依赖的生物活性，其中2-1号、2-4号、4-1号、4-2号、4-3号、4-4号酵母表达的融合蛋白的生物活性得到了显著性提高。The results showed that the relative fluorescence ratio of all treatment groups transfected with pcDNA3.1-HSA empty vector without c-mpl receptor was 1. The data of all treatment groups transfected with the c-mpl-pcDNA3.1 vector are shown in Table 1 above. It can be seen from Table 1 that the HSA-(TMP) ₂ fusion proteins in the 13 tested yeast expression samples all have significant c-mpl receptor-dependent biological activities. Among them, the activity of the fusion protein expressed by No. 4-2 yeast strain was significantly higher than that of the positive control. Tebio-positive drugs and 13 HSA-(TMP) ₂ fusion protein yeast expression strains after modifying the TMP duplex nucleotide sequence with preferred codons all have receptor-dependent biological activities, and No. 2-1 , No. 2-4, No. 4-1, No. 4-2, No. 4-3, and No. 4-4, the biological activity of the fusion protein expressed by the yeast has been significantly improved.

专利号CN201310084212.8的二聚体化融合蛋白的制备及应用中公开了二聚体化促血小板生成素(TPO)模拟肽TMP二联体-人血清白蛋白融合蛋白的制备及应用，该专利中酵母表达的二聚体化促血小板生成素模拟肽TMP二联体-人血清白蛋白融合蛋白c-mpl受体依赖的细胞水平生物活性平均值为1.635，本发明公开的促血小板生成素的融合蛋白生物活性检测结果平均值为1.749；The preparation and application of dimerization fusion protein of patent number CN201310084212.8 discloses the preparation and application of dimerization thrombopoietin (TPO) mimetic peptide TMP duplex-human serum albumin fusion protein, the patent The dimerized thrombopoietin mimetic peptide TMP duplex-human serum albumin fusion protein c-mpl receptor-dependent average biological activity of the dimerized thrombopoietin expressed in yeast is 1.635, and the thrombopoietin disclosed in the present invention has The average value of the bioactivity test results of the fusion protein was 1.749;

因此，本发明通过对TMP二联体实施偏好密码子改造，提高了促血小板生成素的融合蛋白生物活性的均值，高于未对促血小板生成素模拟肽TMP二联体基因改造产生的融合蛋白的活性均值。Therefore, the present invention improves the mean value of the biological activity of the fusion protein of thrombopoietin by implementing preferred codon modification on the TMP duplex, which is higher than the fusion protein produced without genetic modification of the TMP duplex of the thrombopoietin mimetic peptide activity mean.

此外，CN201310084212.8专利中公开的融合蛋白活性最高的是2号酵母株表达的二聚体融合蛋白，其生物活性为1.71±0.11，本发明公开的2-1号、2-4号、4-1号、4-2号、4-3号、4-4号的生物活性均高于上述2号酵母株表达的二聚体融合蛋白。In addition, the fusion protein disclosed in the CN201310084212.8 patent has the highest activity of the dimeric fusion protein expressed by No. 2 yeast strain, and its biological activity is 1.71±0.11. -1, 4-2, 4-3, and 4-4 had biological activities higher than the dimer fusion protein expressed by the yeast strain No. 2 above.

通过对TMP二联体实施偏好密码子改造，不仅提高了促血小板生成素的融合蛋白生物活性的均值，本发明公开的2-1号、2-4号、4-1号、4-2号、4-3号、4-4号，所述酵母产生的融合蛋白的活性也得到了显著提高。By implementing preferred codon modification to the TMP duplex, not only the mean value of the biological activity of the fusion protein of thrombopoietin is improved, but No. 2-1, No. 2-4, No. 4-1 and No. 4-2 disclosed by the present invention , No. 4-3, and No. 4-4, the activity of the fusion protein produced by the yeast has also been significantly improved.

序列表(SEQUENCE LISTING)SEQUENCE LISTING

<110> 兰州大学<110> Lanzhou University

<120> 一种促血小板生成素的融合蛋白<120> A fusion protein of thrombopoietin

<160> 10<160> 10

<170> PatentIn version 3.5<170> PatentIn version 3.5

<210> 1<210> 1

<211> 99<211> 99

<212> DNA<212>DNA

<213> TMP二联体的基因序列<213> Gene sequence of TMP duplex

<400> 1<400> 1

attgagggtc caactttgag acaatggttg gctgctagag ccggtggtcc atctggaatc 60attgaggtc caactttgag acaatggttg gctgctagag ccggtggtcc atctggaatc 60

gagggaccta ccctgagaca gtggcttgct gctagagct 99gagggaccta ccctgagaca gtggcttgct gctagagct 99

<210> 2<210> 2

<211> 33<211> 33

<212> PRT<212> PRT

<213> TMP二联体的氨基酸序列Amino acid sequence of <213> TMP duplex

<400> 2<400> 2

1 Ile Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile1 Ile Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile

21 Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala21 Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala

<210> 3<210> 3

<211> 1755<211> 1755

<212> DNA<212>DNA

<213> HSA分子的基因序列<213> Gene sequence of HSA molecule

<400> 3<400> 3

gatgcacaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 60gatgcacaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 60

gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 120gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 120

aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 180aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 180

aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 240aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 240

cgtgaaacct atggtgaaat ggctgactgt tgtgcaaaac aagaacctga gagaaatgaa 300cgtgaaacct atggtgaaat ggctgactgt tgtgcaaaac aagaacctga gagaaatgaa 300

tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 360tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 360

gatgtgatgt gcactgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 420gatgtgatgt gcactgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 420

gaaattgcca gaagacatcc ttacttttat gccccggaac tccttttctt tgctaaaagg 480gaaattgcca gaagacatcc ttacttttat gccccggaac tccttttctt tgctaaaagg 480

tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 540tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 540

aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 600aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 600

gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 660gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 660

cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 720cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 720

gtccacacgg aatgctgcca tggagatctg cttgaatgtg ctgatgacag ggcggacctt 780gtccaacacgg aatgctgcca tggagatctg cttgaatgtg ctgatgacag ggcggacctt 780

gccaagtata tctgtgaaaa tcaagattcg atctccagta aactgaagga atgctgtgaa 840gccaagtata tctgtgaaaa tcaagattcg atctccagta aactgaagga atgctgtgaa 840

aaacctctgt tggaaaaatc ccactgcatt gccgaagtgg aaaatgatga gatgcctgct 900aaacctctgt tggaaaaatc ccactgcatt gccgaagtgg aaaatgatga gatgcctgct 900

gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 960gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 960

gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 1020gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 1020

tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 1080tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 1080

tgtgccgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 1140tgtgccgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 1140

gtggaagagc ctcagaattt aatcaaacaa aattgtgagc tttttgagca gcttggagag 1200gtggaagagc ctcagaattt aatcaaacaa aattgtgagc tttttgagca gcttggagag 1200

tacaaattcc agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtcaact 1260tacaaattcc agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtcaact 1260

ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 1320ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 1320

cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 1380cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 1380

tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 1440tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 1440

ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 1500ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 1500

gagtttaatg ctgaaacgtt caccttccat gcagatatat gcacactttc tgagaaggag 1560gagtttaatg ctgaaacgtt caccttccat gcagatatat gcacactttc tgagaaggag 1560

agacaaatca agaaacaaac tgcacttgtt gagcttgtga aacacaagcc caaggcaaca 1620agacaaatca agaaacaaac tgcacttgtt gagcttgtga aacacaagcc caaggcaaca 1620

aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 1680aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 1680

gctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 1740gctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 1740

gctgccttag gctta 1755gctgccttag gctta 1755

<210> 4<210> 4

<211> 585<211> 585

<212> PRT<212> PRT

<213> HSA分子的氨基酸序列<213> Amino acid sequence of the HSA molecule

<400> 4<400> 4

1 Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys1 Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys

21 Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val21 Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val

41 Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu41 Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu

61 Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu61 Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu

81 Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu81 Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu

101 Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val101 Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val

121 Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr121 Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr

141 Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg141 Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg

161 Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro161 Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro

181 Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys181 Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys

201 Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser201 Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser

221 Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys221 Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys

241 Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu241 Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu

261 Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu261 Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu

281 Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu MET Pro Ala281 Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu MET Pro Ala

301 Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala301 Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala

321 Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp321 Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp

341 Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys341 Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys

361 Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu361 Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu

381 Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu381 Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu

401 Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr401 Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr

421 Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His421 Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His

441 Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu441 Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu

461 Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser461 Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser

481 Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys481 Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys

501 Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu501 Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu

521 Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr521 Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr

541 Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys541 Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys

561 Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln561 Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Lys Leu Val Ala Ala Ser Gln

581 Ala Ala Leu Gly Leu581 Ala Ala Leu Gly Leu

<210> 5<210> 5

<211> 2010<211> 2010

<212> DNA<212>DNA

<213> ss-TMP-L1-TMP-L2 -H-fip-HSA融合蛋白的基因序列<213> Gene sequence of ss-TMP-L1-TMP-L2-H-fip-HSA fusion protein

<400> 5<400> 5

aagtgggtaa cctttatttc ccttcttttt ctctttagct cggcttattc caggggtgtg 60aagtgggtaa cctttatttc ccttcttttt ctctttagct cggcttattc caggggtgtg 60

tttcgtcgag atgcacacaa gagtgaggtt gctcatcggt ttaaagattt gggagaagaa 120tttcgtcgag atgcacacaa gagtgaggtt gctcatcggt ttaaagattt gggagaagaa 120

aatttcaaag ccttggtgtt gattgccttt gctcagtatc ttcagcagtg tccatttgaa 180aatttcaaag ccttggtgtt gattgccttt gctcagtatc ttcagcagtg tccatttgaa 180

gatcatgtaa aattagtgaa tgaagtaact gaatttgcaa aaacatgtgt tgctgatgag 240gatcatgtaa aattagtgaa tgaagtaact gaatttgcaa aaacatgtgttgctgatgag 240

tcagctgaaa attgtgacaa atcacttcat accctttttg gagacaaatt atgcacagtt 300tcagctgaaa attgtgacaa atcacttcat accctttttg gagacaaatt atgcacagtt 300

gcaactcttc gtgaaaccta tggtgaaatg gctgactgtt gtgcaaaaca agaacctgag 360gcaactcttc gtgaaaccta tggtgaaatg gctgactgtt gtgcaaaaca agaacctgag 360

agaaatgaat gcttcttgca acacaaagat gacaacccaa acctcccccg attggtgaga 420agaaatgaat gcttcttgca acacaaagat gacaacccaa acctcccccg attggtgaga 420

ccagaggttg atgtgatgtg cactgctttt catgacaatg aagagacatt tttgaaaaaa 480ccagaggttg atgtgatgtg cactgctttt catgacaatg aagagacatt tttgaaaaaa 480

tacttatatg aaattgccag aagacatcct tacttttatg ccccggaact ccttttcttt 540tacttatatg aaattgccag aagacatcct tacttttatg ccccggaact ccttttcttt 540

gctaaaaggt ataaagctgc ttttacagaa tgttgccaag ctgctgataa agctgcctgc 600gctaaaaggt ataaagctgc ttttacagaa tgttgccaag ctgctgataa agctgcctgc 600

ctgttgccaa agctcgatga acttcgggat gaagggaagg cttcgtctgc caaacagaga 660ctgttgccaa agctcgatga acttcgggat gaagggaagg cttcgtctgc caaacagaga 660

ctcaagtgtg ccagtctcca aaaatttgga gaaagagctt tcaaagcatg ggcagtagct 720ctcaagtgtg ccagtctcca aaaatttgga gaaagagctt tcaaagcatg ggcagtagct 720

cgcctgagcc agagatttcc caaagctgag tttgcagaag tttccaagtt agtgacagat 780cgcctgagcc agagatttcc caaagctgag tttgcagaag tttccaagtt agtgacagat 780

cttaccaaag tccacacgga atgctgccat ggagatctgc ttgaatgtgc tgatgacagg 840cttaccaaag tccacacgga atgctgccat ggagatctgc ttgaatgtgc tgatgacagg 840

gcggaccttg ccaagtatat ctgtgaaaat caagattcga tctccagtaa actgaaggaa 900gcggaccttg ccaagtatat ctgtgaaaat caagattcga tctccagtaa actgaaggaa 900

tgctgtgaaa aacctctgtt ggaaaaatcc cactgcattg ccgaagtgga aaatgatgag 960tgctgtgaaa aacctctgtt ggaaaaatcc cactgcattg ccgaagtgga aaatgatgag 960

atgcctgctg acttgccttc attagctgct gattttgttg aaagtaagga tgtttgcaaa 1020atgcctgctg acttgccttc attagctgct gattttgttg aaagtaagga tgtttgcaaa 1020

aactatgctg aggcaaagga tgtcttcctg ggcatgtttt tgtatgaata tgcaagaagg 1080aactatgctg aggcaaagga tgtcttcctg ggcatgtttt tgtatgaata tgcaagaagg 1080

catcctgatt actctgtcgt gctgctgctg agacttgcca agacatatga aaccactcta 1140catcctgatt actctgtcgt gctgctgctg agacttgcca agacatatga aaccactcta 1140

gagaagtgct gtgccgctgc agatcctcat gaatgctatg ccaaagtgtt cgatgaattt 1200gagaagtgct gtgccgctgc agatcctcat gaatgctatg ccaaagtgtt cgatgaattt 1200

aaacctcttg tggaagagcc tcagaattta atcaaacaaa attgtgagct ttttgagcag 1260aaacctcttg tggaagagcc tcagaattta atcaaacaaa attgtgagct ttttgagcag 1260

cttggagagt acaaattcca gaatgcgcta ttagttcgtt acaccaagaa agtaccccaa 1320cttggagagt acaaattcca gaatgcgcta ttagttcgtt acaccaagaa agtaccccaa 1320

gtgtcaactc caactcttgt agaggtctca agaaacctag gaaaagtggg cagcaaatgt 1380gtgtcaactc caactcttgt agaggtctca agaaacctag gaaaagtggg cagcaaatgt 1380

tgtaaacatc ctgaagcaaa aagaatgccc tgtgcagaag actatctatc cgtggtcctg 1440tgtaaacatc ctgaagcaaa aagaatgccc tgtgcagaag actatctatc cgtggtcctg 1440

aaccagttat gtgtgttgca tgagaaaacg ccagtaagtg acagagtcac caaatgctgc 1500aaccagttat gtgtgttgca tgagaaaacg ccagtaagtg acagagtcac caaatgctgc 1500

acagaatcct tggtgaacag gcgaccatgc ttttcagctc tggaagtcga tgaaacatac 1560acagaatcct tggtgaacag gcgaccatgc ttttcagctc tggaagtcga tgaaacatac 1560

gttcccaaag agtttaatgc tgaaacgttc accttccatg cagatatatg cacactttct 1620gttcccaaag agtttaatgc tgaaacgttc accttccatg cagatatatg cacactttct 1620

gagaaggaga gacaaatcaa gaaacaaact gcacttgttg agcttgtgaa acacaagccc 1680gagaaggaga gacaaatcaa gaaacaaact gcacttgttg agcttgtgaa acacaagccc 1680

aaggcaacaa aagagcaact gaaagctgtt atggatgatt tcgcagcttt tgtagagaag 1740aaggcaacaa aagagcaact gaaagctgtt atggatgatt tcgcagcttt tgtagagaag 1740

tgctgcaagg ctgacgataa ggagacctgc tttgccgagg agggtaaaaa acttgttgct 1800tgctgcaagg ctgacgataa ggagacctgc tttgccgagg agggtaaaaa acttgttgct 1800

gcaagtcaag ctgccttagg cttaggcggc ggcggttccg gactggagcc caagagctgc 1860gcaagtcaag ctgccttagg ctaggcggc ggcggttccg gactggagcc caagagctgc 1860

gacaagaccc acacctgccc tccctgcgaa ttcggtggtg gtggttctat tgagggtcca 1920gacaagaccc acacctgccc tccctgcgaa ttcggtggtg gtggttctat tgagggtcca 1920

actttgagac aatggttggc tgctagagcc ggtggtccat ctggaatcga gggacctacc 1980actttgagac aatggttggc tgctagagcc ggtggtccat ctggaatcga gggacctacc 1980

ctgagacagt ggcttgctgc tagagcttaa 2010ctgagacagt ggcttgctgc tagagcttaa 2010

<210> 6<210> 6

<211> 670<211> 670

<212> PRT<212> PRT

<213> ss-TMP-L1-TMP-L2 -H-fip-HSA融合蛋白的氨基酸序列<213> Amino acid sequence of ss-TMP-L1-TMP-L2-H-fip-HSA fusion protein

<400> 6<400> 6

1 Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser Arg Gly Val1 Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser Arg Gly Val

21 Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu21 Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu

41 Asn Phe Lys Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu41 Asn Phe Lys Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu

61 Asp His Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu61 Asp His Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu

81 Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val81 Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val

101 Ala Thr Leu Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu101 Ala Thr Leu Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu

121 Arg Asn Glu Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg121 Arg Asn Glu Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg

141 Pro Glu Val Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys141 Pro Glu Val Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys

161 Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe161 Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe

181 Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys181 Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys

201 Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg201 Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg

221 Leu Lys Cys Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala221 Leu Lys Cys Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala

241 Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp241 Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp

261 Leu Thr Lys Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg261 Leu Thr Lys Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg

281 Ala Asp Leu Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu281 Ala Asp Leu Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu

301 Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu301 Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu

321 MET Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys321 MET Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys

341 Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg341 Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg

361 His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu361 His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu

381 Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe381 Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe

401 Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln401 Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln

421 Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln421 Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln

441 Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys441 Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys

461 Cys Lys His Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu461 Cys Lys His Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu

481 Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys481 Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys

501 Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr501 Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr

521 Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser521 Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser

541 Glu Lys Glu Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro541 Glu Lys Glu Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro

561 Lys Ala Thr Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys561 Lys Ala Thr Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys

581 Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala581 Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala

601 Ala Ser Gln Ala Ala Leu Gly Leu Gly Gly Gly Gly Ser Gly Leu Glu Pro Lys Ser Cys601 Ala Ser Gln Ala Ala Leu Gly Leu Gly Gly Gly Gly Ser Gly Leu Glu Pro Lys Ser Cys

621 Asp Lys Thr His Thr Cys Pro Pro Cys Glu Phe Gly Gly Gly Gly Ser Ile Glu Gly Pro621 Asp Lys Thr His Thr Cys Pro Pro Cys Glu Phe Gly Gly Gly Gly Ser Ile Glu Gly Pro

641 Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile Glu Gly Pro Thr641 Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile Glu Gly Pro Thr

661 Leu Arg Gln Trp Leu Ala Ala Arg Ala ***661 Leu Arg Gln Trp Leu Ala Ala Arg Ala ***

<210> 7<210> 7

<211> 42<211> 42

<212> DNA<212>DNA

<213> H 肽段的基因序列<213> Gene sequence of H peptide

<400> 7<400> 7

gagcccaaga gctgcgacaa gacccacacc tgccctccct gc 42gagcccaaga gctgcgacaa gacccacacc tgccctccct gc 42

<210> 8<210> 8

<211> 14<211> 14

<212> PRT<212> PRT

<213> H 肽段的氨基酸序列Amino acid sequence of <213> H peptide

<400> 8<400> 8

1 Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys1 Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys

<210> 9<210> 9

<211> 114<211> 114

<212> DNA<212>DNA

<213> 选自fip结构域的肽段的基因序列<213> Gene sequence selected from peptides of fip domain

<400> 9<400> 9

gtgagcaggg acgagctgat ggaggccatc cagaagcagg aggagatcaa cttcaggctg 60gtgagcaggg acgagctgat ggaggccatc cagaagcagg aggagatcaa cttcaggctg 60

caggactaca tcgacaggat catcgtggcc atcatggaga ccaaccccag catc 114caggactaca tcgacaggat catcgtggcc atcatggaga ccaacccccag catc 114

<210> 10<210> 10

<211> 38<211> 38

<212> PRT<212> PRT

<213> 选自fip结构域的肽段的氨基酸序列<213> Amino acid sequence of a peptide selected from the fip domain

<400> 10<400> 10

1 Val Ser Arg Asp Glu Leu MET Glu Ala Ile Gln Lys Gln Glu Glu Ile Asn Phe Arg Leu1 Val Ser Arg Asp Glu Leu MET Glu Ala Ile Gln Lys Gln Glu Glu Ile Asn Phe Arg Leu

21 Gln Asp Tyr Ile Asp Arg Ile Ile Val Ala Ile MET Glu Thr Asn Pro Ser Ile21 Gln Asp Tyr Ile Asp Arg Ile Ile Val Ala Ile MET Glu Thr Asn Pro Ser Ile

Claims

1. a kind of fusion protein of thrombopoietin, including the simulation of human serum albumin HSA molecule, thrombopoietin Peptide TMP dyads, the amino acid sequence such as SEQ ID NO of the HSA molecules：Shown in 4 or after the amino acid sequence is mutated Sequence with said function encodes the DNA sequence dna such as SEQ ID NO of the amino acid sequence of the HSA molecules：It is described shown in 3 Thrombopoietin simulating peptide TMP dyads amino acid sequence such as SEQ ID NO：Shown in 2 or the amino acid sequence passes through Sequence with said function after mutation, which is characterized in that the thrombopoietin simulating peptide TMP dyads are by one Made of a yeast preferred codons coding, the amino acid sequence of the thrombopoietin simulating peptide TMP dyads is encoded DNA sequence dna such as SEQ ID NO：Shown in 1, which further includes the peptide fragment that can form Dimerized structural domain.

2. a kind of fusion protein of thrombopoietin according to claim 1, which is characterized in that described forms Contain at least two cysteines in the peptide fragment of Dimerized structural domain.

3. a kind of fusion protein of thrombopoietin according to claim 1, which is characterized in that described forms Containing there are two cysteines in the peptide fragment of Dimerized structural domain.

4. a kind of fusion protein of thrombopoietin according to claim 3, which is characterized in that described forms The peptide fragment of Dimerized structural domain is H peptide fragments, amino acid sequence such as Seq ID No:Shown in 8.

5. a kind of fusion protein of thrombopoietin according to claim 3, which is characterized in that dimer can be formed The peptide fragment for changing structural domain is H peptide fragments and the peptide fragment selected from fip structural domains, the amino acid sequence such as Seq ID No of H peptide fragments:8 institutes Show, the amino acid sequence such as Seq ID No of the peptide fragment selected from fip structural domains:Shown in 10.

6. a kind of fusion protein of thrombopoietin according to claim 4, which is characterized in that the HSA molecules The ends C- of fusion protein are connected to, structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, L4 indicate peptide linker, The amino acid sequence of L1 is GGPSG, and the amino acid sequence of L3 is GGGGSGL, and the amino acid sequence of L4 is EFGGGGS, and H expressions can Form the H peptide fragments of Dimerized structural domain.

7. a kind of fusion protein of thrombopoietin according to claim 5, which is characterized in that the HAS molecules Positioned at the ends N- of fusion protein, structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 and indicates peptide linker, L1 Amino acid sequence be GGPSG, the amino acid sequence of L2 is GGGGSRS, and H indicates that the H peptide fragments of Dimerized structural domain can be formed, Fip indicates the peptide fragment selected from fip structural domains；The amino acid sequence of the fusion protein such as SEQ ID NO：Shown in 6, described in coding The DNA sequence dna of the nucleotide sequence of fusion protein such as SEQ ID NO：Shown in 5.

8. the Dimerized fusion protein that a kind of fusion protein of thrombopoietin described in claim 6 is formed.

9. the Dimerized fusion protein that a kind of fusion protein of thrombopoietin described in claim 7 is formed.

10. a kind of fusion protein of thrombopoietin, including human serum albumins HAS molecules, thrombopoietin mould Peptidomimetic TMP dyads, the amino acid sequence such as SEQ ID NO of the human serum albumins HAS molecules：Shown in 4 or the amino The sequence with said function, encodes the DNA sequence dna such as SEQ ID of the amino acid sequence of the HAS molecules after acid sequence is mutated NO：Shown in 3, the amino acid sequence such as SEQ ID NO of the thrombopoietin simulating peptide TMP dyads：Shown in 2, or Sequence with said function after the amino acid sequence is mutated, which is characterized in that the thrombopoietin simulating peptide TMP dyads are made of being encoded by a yeast preferred codons, to encode the thrombopoietin simulating peptide TMP bis- The DNA sequence dna of conjuncted amino acid sequence such as SEQ ID NO：Shown in 1, which further includes Dimerized structural domain.

11. a kind of fusion protein of thrombopoietin according to claim 10, which is characterized in that the dimerization Body structural domain is to keep the fusion protein of thrombopoietin Dimerized by the covalent effect of disulfide bond.

12. a kind of fusion protein of thrombopoietin according to claim 11, which is characterized in that the dimerization The disulfide bond formed by cysteine containing at least two in body structural domain.

13. a kind of fusion protein of thrombopoietin according to claim 12, which is characterized in that the dimerization Containing there are two the disulfide bond formed by cysteine in body structural domain.

14. a kind of fusion protein of thrombopoietin according to claim 12, which is characterized in that the dimerization Body structural domain includes H peptide fragments, the amino acid sequence such as Seq ID No of H peptide fragments:Shown in 8.

15. a kind of fusion protein of thrombopoietin according to claim 12, which is characterized in that the dimerization Body structural domain includes H peptide fragments and the peptide fragment selected from fip structural domains, the amino acid sequence such as Seq ID No of H peptide fragments:Shown in 8, The amino acid sequence of peptide fragment selected from fip structural domains such as Seq ID No:Shown in 10.

16. the fusion protein of thrombopoietin according to claim 14, which is characterized in that the HSA molecules connect The ends C- of fusion protein are connected to, structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, L4 indicate peptide linker, L1 Amino acid sequence be GGPSG, the amino acid sequence of L3 is GGGGSGL, and the amino acid sequence of L4 is EFGGGGS, and H expressions can shape At the H peptide fragments of Dimerized structural domain.

17. a kind of fusion protein of thrombopoietin according to claim 15, which is characterized in that the HSA points Son is located at the ends N- of fusion protein, and structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 and indicates peptide linker, The amino acid sequence of L1 is GGPSG, and the amino acid sequence of L2 is GGGGSRS, and H indicates that the H peptides of Dimerized structural domain can be formed Section, fip indicate the peptide fragment selected from fip structural domains；The amino acid sequence of the fusion protein such as SEQ ID NO：Shown in 6, coding The DNA sequence dna of the nucleotide sequence of the fusion protein such as SEQ ID NO：Shown in 5.

18. the Dimerized fusion protein that a kind of fusion protein of thrombopoietin described in claim 16 is formed.

19. the Dimerized fusion protein that a kind of fusion protein of thrombopoietin described in claim 17 is formed.

20. a kind of fusion protein of thrombopoietin according to claim 1 or 10, it is characterised in that the fusion Albumen is prepared using yeast cell to express.

21. a kind of fusion protein of thrombopoietin according to claim 20, it is characterised in that the yeast For thermophilic pichia methanolica.

22. a kind of fusion protein of thrombopoietin described in claim 1 to 19 any one is primary in preparation treatment Application in the drug of property or secondary thrombocytopenia.

23. a kind of preparation method of the fusion protein of thrombopoietin as described in claim 1 or 10, feature exist In described steps are as follows：

1) the thrombopoietin simulating peptide TMP dyad sequences of full genome synthetic yeast preferred codons coding；

2) HSA molecular sequences are obtained by PCR amplification；

3) by digestion with restriction enzyme, connection and Escherichia coli are converted, obtains and compiles containing encoding the yeast preferred codons The recombinant expression of the thrombopoietin simulating peptide dyad of code and the DNA sequence dna of the fusion protein of human serum albumins molecule Carrier；

4) recombinant expression carrier described in step 3) is transformed into competent E.coli TOP10, then is transformed into host expresses system System is expressed to get the fusion protein.