CN108623690A - A kind of fusion protein of thrombopoietin and its preparation method and application - Google Patents
A kind of fusion protein of thrombopoietin and its preparation method and application Download PDFInfo
- Publication number
- CN108623690A CN108623690A CN201710158090.0A CN201710158090A CN108623690A CN 108623690 A CN108623690 A CN 108623690A CN 201710158090 A CN201710158090 A CN 201710158090A CN 108623690 A CN108623690 A CN 108623690A
- Authority
- CN
- China
- Prior art keywords
- fusion protein
- thrombopoietin
- amino acid
- acid sequence
- tmp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 130
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 130
- 108010041111 Thrombopoietin Proteins 0.000 title claims abstract description 120
- 102000036693 Thrombopoietin Human genes 0.000 title claims abstract description 116
- 238000002360 preparation method Methods 0.000 title claims abstract description 15
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 47
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 45
- 108020004705 Codon Proteins 0.000 claims abstract description 23
- 239000003814 drug Substances 0.000 claims abstract description 12
- 108091006905 Human Serum Albumin Proteins 0.000 claims abstract description 11
- 102000008100 Human Serum Albumin Human genes 0.000 claims abstract description 11
- 229940079593 drug Drugs 0.000 claims abstract description 10
- 206010043554 thrombocytopenia Diseases 0.000 claims abstract description 10
- 201000004208 acquired thrombocytopenia Diseases 0.000 claims abstract description 5
- 210000002966 serum Anatomy 0.000 claims abstract description 5
- 102000009027 Albumins Human genes 0.000 claims abstract 4
- 108010088751 Albumins Proteins 0.000 claims abstract 4
- 230000004927 fusion Effects 0.000 claims abstract 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 70
- 238000006471 dimerization reaction Methods 0.000 claims description 25
- 230000014509 gene expression Effects 0.000 claims description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 18
- 230000000694 effects Effects 0.000 claims description 10
- 235000018417 cysteine Nutrition 0.000 claims description 8
- 241000588724 Escherichia coli Species 0.000 claims description 6
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 6
- 238000003259 recombinant expression Methods 0.000 claims description 6
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 238000012408 PCR amplification Methods 0.000 claims description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 4
- 150000001945 cysteines Chemical class 0.000 claims description 4
- 210000005253 yeast cell Anatomy 0.000 claims description 4
- 239000000539 dimer Substances 0.000 claims description 3
- 108091008146 restriction endonucleases Proteins 0.000 claims description 3
- 230000029087 digestion Effects 0.000 claims description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 2
- 102000007079 Peptide Fragments Human genes 0.000 claims 21
- 108010033276 Peptide Fragments Proteins 0.000 claims 21
- 241001452677 Ogataea methanolica Species 0.000 claims 1
- 239000002253 acid Substances 0.000 claims 1
- 230000035772 mutation Effects 0.000 claims 1
- 239000000816 peptidomimetic Substances 0.000 claims 1
- 238000004088 simulation Methods 0.000 claims 1
- 108010059728 thrombopoietin mimetic peptide Proteins 0.000 abstract description 15
- 201000010099 disease Diseases 0.000 abstract description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 3
- 238000010353 genetic engineering Methods 0.000 abstract description 3
- 238000009776 industrial production Methods 0.000 abstract description 3
- 201000003725 primary thrombocytopenia Diseases 0.000 abstract description 3
- 244000285963 Kluyveromyces fragilis Species 0.000 abstract 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 abstract 1
- 230000006641 stabilisation Effects 0.000 abstract 1
- 238000011105 stabilization Methods 0.000 abstract 1
- 230000001361 thrombopoietic effect Effects 0.000 abstract 1
- 108090000623 proteins and genes Proteins 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 15
- 230000004071 biological effect Effects 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 9
- 238000000034 method Methods 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010004073 cysteinylcysteine Proteins 0.000 description 6
- 229940041514 candida albicans extract Drugs 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 239000012138 yeast extract Substances 0.000 description 5
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- 241000235648 Pichia Species 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 239000013641 positive control Substances 0.000 description 4
- 230000001737 promoting effect Effects 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- 102000007562 Serum Albumin Human genes 0.000 description 3
- 108010071390 Serum Albumin Proteins 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 239000008367 deionised water Substances 0.000 description 3
- 229910021641 deionized water Inorganic materials 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 210000003593 megakaryocyte Anatomy 0.000 description 3
- 108010002543 polyethylene glycol-recombinant human megakaryocyte growth and development factor Proteins 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 2
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 2
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 2
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 2
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 2
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 2
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 2
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 2
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 2
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 2
- MXPBQDFWIMBACQ-ACZMJKKPSA-N Glu-Cys-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O MXPBQDFWIMBACQ-ACZMJKKPSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 2
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 2
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 2
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 2
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 2
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 2
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- 241000242739 Renilla Species 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 2
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 2
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 2
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000008121 dextrose Substances 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000010255 intramuscular injection Methods 0.000 description 2
- 239000007927 intramuscular injection Substances 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000000825 pharmaceutical preparation Substances 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 235000018102 proteins Nutrition 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 235000002639 sodium chloride Nutrition 0.000 description 2
- 230000010473 stable expression Effects 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- 208000032467 Aplastic anaemia Diseases 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 108010075254 C-Peptide Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 1
- LWTTURISBKEVAC-CIUDSAMLSA-N Cys-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N LWTTURISBKEVAC-CIUDSAMLSA-N 0.000 description 1
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 1
- VOBMMKMWSIVIOA-SRVKXCTJSA-N Cys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N VOBMMKMWSIVIOA-SRVKXCTJSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000331 Firefly luciferases Proteins 0.000 description 1
- -1 GC content Proteins 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 1
- 101000799461 Homo sapiens Thrombopoietin Proteins 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000254158 Lampyridae Species 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- CFVQPNSCQMKDPB-CIUDSAMLSA-N Lys-Cys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N CFVQPNSCQMKDPB-CIUDSAMLSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000007568 Proto-Oncogene Proteins c-fos Human genes 0.000 description 1
- 108010071563 Proto-Oncogene Proteins c-fos Proteins 0.000 description 1
- 108010052090 Renilla Luciferases Proteins 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- 239000008049 TAE buffer Substances 0.000 description 1
- 101150029803 TMP gene Proteins 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000001467 acupuncture Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010034445 albutensin A Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 108010036999 aspartyl-alanyl-histidyl-lysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 235000019441 ethanol Nutrition 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 238000013326 plasmid cotransfection Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 208000037920 primary disease Diseases 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000009962 secretion pathway Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/52—Cytokines; Lymphokines; Interferons
- C07K14/524—Thrombopoietin, i.e. C-MPL ligand
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/31—Fusion polypeptide fusions, other than Fc, for prolonged plasma life, e.g. albumin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
技术领域technical field
本发明属于基因工程制药领域,涉及一种促血小板生成素的融合蛋白及其制备方法和应用,具体涉及一种酵母偏好密码子编码的促血小板生成素模拟肽(ThrombopoietinMimetic Peptide,TMP)二联体与人血清白蛋白分子(Human serum Albumin,HSA)的融合蛋白。The invention belongs to the field of genetic engineering pharmaceuticals, and relates to a fusion protein of thrombopoietin and its preparation method and application, in particular to a thrombopoietin mimetic peptide (Thrombopoietin Mimetic Peptide, TMP) duplex encoded by a yeast preferred codon Fusion protein with human serum albumin molecule (Human serum Albumin, HSA).
背景技术Background technique
在临床上常会遇到由各种原因引起的原发性和继发性血小板减少症,如原发性血小板减少性紫癜、再生障碍性贫血以及肿瘤化/放疗造成的血小板减少等。对于这类疾病,比较适合应用长效升血小板药物进行救治,一方面,可以减少用药次数,减少病人的针痛之苦;另一方面,可以减少药物用量,降低治疗费用。In clinical practice, primary and secondary thrombocytopenia caused by various reasons are often encountered, such as primary thrombocytopenic purpura, aplastic anemia, and thrombocytopenia caused by tumor chemotherapy/radiotherapy. For such diseases, it is more suitable to use long-acting platelet-increasing drugs for treatment. On the one hand, it can reduce the number of medications and reduce the pain of acupuncture for patients; on the other hand, it can reduce the dosage of drugs and reduce treatment costs.
血小板由巨核细胞产生,促进巨核细胞的增殖、分化是提高血小板水平的主要手段。促血小板生成素(Thrombopoietin,TMP),又称巨核细胞生长发育因子(MGDF),是骨髓巨核细胞增殖、分化和血小板生成的最重要调控因子。1994年TMP基因缺少注释被首次成功克隆后不久,人们即研制出一种聚乙二醇(PEG)化的重组功能型TMP基因工程产品-PEG-rHuMGDF。动物实验结果表明,PEG-rHuMGDF不仅具有突出的促血小板生成活性,而且在体内具有较长的半衰期,单次应用即可达到显著升高血小板的效果。然而,在临床试验过程中人们发现,一些健康受试者在接受该药物治疗后所产生的抗体与内源性TMP存在交差中和反应,导致受试者出现血小板减少,从而使得PEG-rHuMGDF乃至重组人TMP的研制在1998年时即被美国FDA叫停。受其影响,在美、欧、日本等国家至今重组人TMP都未被批准进入临床。Platelets are produced by megakaryocytes, and promoting the proliferation and differentiation of megakaryocytes is the main means to increase the level of platelets. Thrombopoietin (TMP), also known as megakaryocyte growth and development factor (MGDF), is the most important regulator of bone marrow megakaryocyte proliferation, differentiation and platelet production. Shortly after the first successful cloning of TMP gene lacking annotation in 1994, people developed a polyethylene glycol (PEG) recombinant functional TMP genetic engineering product - PEG-rHuMGDF. The results of animal experiments show that PEG-rHuMGDF not only has outstanding platelet-promoting activity, but also has a long half-life in vivo, and a single application can significantly increase platelets. However, in the course of clinical trials, it was found that the antibodies produced by some healthy subjects after receiving the drug treatment had a cross-neutralization reaction with endogenous TMP, resulting in thrombocytopenia in the subjects, so that PEG-rHuMGDF and even The development of recombinant human TMP was stopped by the US FDA in 1998. Affected by it, recombinant human TMP has not been approved for clinical use in the United States, Europe, Japan and other countries.
人血清白蛋白分子(Albumin Human,HSA)是血浆中的主要蛋白成分,约占血浆总蛋白的50-60%。HSA分子是一种非糖基化单链蛋白,共由585个氨基酸残基组成,其分子量为66.5kDa(A.Dugaiczyk et al.,PNAS,1982,79:71-75),血浆半衰期长达两周以上。HSA分子在宿主细胞中是以一种原肽形式合成的,其中含24个氨基酸残基组成的信号肽和前肽,在转运和分泌过程中信号肽和前肽被切除。HSA分子是一个稳定的惰性蛋白,可以作为药物载体。HSA分子以下同已成功地在多宿主中表达(EP330451和EP361991),尤其在酵母细胞中可实现HSA分子较高水平的稳定表达。当目的多肽与HSA分子融合表达时不仅可以提高多肽药物的稳定性,还可以增加多肽药物在体内的半衰期。专利号CN201310084212.8的二聚体化融合蛋白的制备及应用中公开了二聚体化促血小板生成素(TPO)模拟肽TMP二联体-人血清白蛋白分子融合蛋白的制备及应用,但是其存在融合蛋白表达效果不理想的问题,为了提高表达产量、增强融合蛋白的生物活性,本发明采用了酵母偏好密码子编码促血小板生成素模拟肽TMP二联体的策略。基于上述研究现状,本发明现提供一种促血小板生成素的融合蛋白,所述的促血小板生成素模拟肽TMP二联体是由酵母偏好密码子编码而成的,所述融合蛋白能在酵母体内高水平的稳定表达,可应用于工业生产;同时,该融合蛋白具有显著促进血小板生成的特性,在人体内具有较长的半衰期。本发明同时提供该融合蛋白的制备方法和应用。Human serum albumin molecule (Albumin Human, HSA) is the main protein component in plasma, accounting for about 50-60% of total plasma protein. HSA molecule is a non-glycosylated single-chain protein consisting of 585 amino acid residues, its molecular weight is 66.5kDa (A.Dugaiczyk et al., PNAS, 1982, 79:71-75), and its plasma half-life is as long as More than two weeks. HSA molecules are synthesized in the form of a propeptide in host cells, which contains a signal peptide and a propeptide consisting of 24 amino acid residues, which are excised during transport and secretion. HSA molecule is a stable inert protein that can be used as a drug carrier. The following HSA molecules have been successfully expressed in multiple hosts (EP330451 and EP361991), especially in yeast cells, which can achieve a higher level of stable expression of HSA molecules. When the target polypeptide is fused with HSA molecule, it can not only improve the stability of the polypeptide drug, but also increase the half-life of the polypeptide drug in vivo. The preparation and application of the dimerization fusion protein of patent number CN201310084212.8 discloses the preparation and application of the dimerization thrombopoietin (TPO) mimetic peptide TMP duplex-human serum albumin molecular fusion protein, but It has the problem that the expression effect of the fusion protein is not ideal. In order to improve the expression yield and enhance the biological activity of the fusion protein, the present invention adopts the strategy of coding the TMP duplex of the thrombopoietin mimetic peptide with yeast preferred codons. Based on the above research status, the present invention now provides a fusion protein of thrombopoietin, the TMP duplex of the thrombopoietin mimetic peptide is encoded by yeast preferred codons, and the fusion protein can be expressed in yeast High-level stable expression in vivo can be applied to industrial production; at the same time, the fusion protein has the property of significantly promoting platelet production and has a longer half-life in the human body. The invention also provides the preparation method and application of the fusion protein.
重组蛋白的异源表达受到基因自身序列特征如密码子偏好性、GC含量、mRNA二级结构、mRNA稳定性等因素的影响。研究表明同义密码子的更换能够影响蛋白质翻译延伸的速率[1-2],说明密码子的优化可能会对重组蛋白的表达起到适得其反的影响。另一方面,重组蛋白的表达还受到宿主、培养条件、分泌途径、启动子等因素的影响,现行的基因优化理论和设计方法还不成熟,存在着各种各样的局限性,基因优化策略对提高重组蛋白的表达量是必要条件,而不是充分条件。综上所述,通过酵母偏好密码子改造从而改良基因工程菌株表达量的策略在实际生产应用中存在着极大的不确定性,针对不同的目的基因进行改造,最终的结果是无法预知的。The heterologous expression of recombinant proteins is affected by the sequence characteristics of the gene itself, such as codon preference, GC content, mRNA secondary structure, mRNA stability and other factors. Studies have shown that the replacement of synonymous codons can affect the rate of protein translation elongation [1-2], indicating that codon optimization may have a counterproductive effect on the expression of recombinant proteins. On the other hand, the expression of recombinant proteins is also affected by factors such as hosts, culture conditions, secretion pathways, and promoters. The current gene optimization theory and design methods are still immature, and there are various limitations. Gene optimization strategies It is a necessary condition, but not a sufficient condition, to increase the expression of recombinant protein. To sum up, the strategy of improving the expression level of genetically engineered strains through modification of yeast preferred codons has great uncertainty in actual production and application, and the final result of modification for different target genes is unpredictable.
参考文献:references:
[1]Komar AA,Lesnik T,Reiss C.Synonymous codon substitutions affectribosome traffic and protein folding during in vitro translation.FEBS Lett,1999,462(3):387-391.[1] Komar AA, Lesnik T, Reiss C. Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett, 1999, 462(3): 387-391.
[2]Trinh R,Gurbaxani B,Morrison SL,et al.Optimization of codon pairuse within the(GGGGS)3linker sequence results in enhanced proteinexpression.Mol Immunol,2004,40(10):717-722.[2] Trinh R, Gurbaxani B, Morrison SL, et al. Optimization of codon pairuse within the (GGGGS)3linker sequence results in enhanced protein expression. Mol Immunol, 2004, 40(10): 717-722.
发明内容Contents of the invention
本发明的目的在于提供一种促血小板生成素的融合蛋白,该融合蛋白半衰期长,能在宿主体内高水平稳定表达。The purpose of the present invention is to provide a fusion protein of thrombopoietin, which has a long half-life and can be stably expressed at a high level in a host.
本发明的另一个目的是提供上述融合蛋白的制备方法。Another object of the present invention is to provide a method for preparing the above fusion protein.
本发明的另一个目的是提供上述融合蛋白的应用。Another object of the present invention is to provide the application of the above fusion protein.
本发明所述的上述目的是通过以下技术方案实现的:The above-mentioned purpose described in the present invention is achieved through the following technical solutions:
本发明公开的一种促血小板生成素的融合蛋白,包括人血清白蛋白HSA分子、促血小板生成素模拟肽TMP二联体,所述的人血清白蛋白HSA分子的氨基酸序列如SEQ ID NO:4所示,或该氨基酸序列经突变后具有同样功能的序列,编码所述HSA分子的氨基酸序列的DNA序列如SEQ ID NO:3所示,所述的促血小板生成素模拟肽TMP二联体的氨基酸序列如SEQID NO:2所示,或该氨基酸序列经突变后具有同样功能的序列,促血小板生成素模拟肽TMP二联体是由一个酵母偏好密码子编码而成的,编码所述促血小板生成素模拟肽TMP二联体的氨基酸序列的DNA序列如SEQ ID NO:1所示,该融合蛋白还包括可形成二聚体化结构域的肽段。A fusion protein of thrombopoietin disclosed by the present invention comprises human serum albumin HSA molecule and thrombopoietin mimetic peptide TMP duplex, and the amino acid sequence of said human serum albumin HSA molecule is shown as SEQ ID NO: 4, or a sequence having the same function after the amino acid sequence is mutated, the DNA sequence encoding the amino acid sequence of the HSA molecule is shown in SEQ ID NO: 3, and the thrombopoietin mimetic peptide TMP duplex The amino acid sequence of the amino acid sequence is shown in SEQID NO: 2, or a sequence having the same function after the amino acid sequence is mutated. The thrombopoietin mimetic peptide TMP duplex is encoded by a yeast-preferred codon, encoding the promoting The DNA sequence of the amino acid sequence of the thrombopoietin mimetic peptide TMP duplex is shown in SEQ ID NO: 1, and the fusion protein also includes a peptide segment that can form a dimerization domain.
本发明公开的一种促血小板生成素的融合蛋白,可形成二聚体化结构域的肽段中含有至少两个半胱氨酸。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form the dimerization domain contains at least two cysteines.
本发明公开的一种促血小板生成素的融合蛋白,可形成二聚体化结构域的肽段中含有两个半胱氨酸。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form a dimerization domain contains two cysteines.
本发明公开的一种促血小板生成素的融合蛋白,可形成二聚体化结构域的肽段为H肽段,其氨基酸序列如Seq ID No:8所示。In the fusion protein of thrombopoietin disclosed by the present invention, the peptide that can form the dimerization domain is H peptide, and its amino acid sequence is shown in Seq ID No:8.
本发明公开的一种促血小板生成素的融合蛋白,可形成二聚体化结构域的肽段为H肽段和选自fip结构域的肽段,H肽段的氨基酸序列如Seq ID No:8所示,选自fip结构域的肽段的氨基酸序列如Seq ID No:10所示。A fusion protein of thrombopoietin disclosed by the present invention, the peptide segment that can form a dimerization domain is an H peptide segment and a peptide segment selected from the fip domain, and the amino acid sequence of the H peptide segment is as Seq ID No: 8, the amino acid sequence of the peptide selected from the fip domain is shown in Seq ID No:10.
本发明公开的一种促血小板生成素的融合蛋白,其中HSA分子连接于融合蛋白的C-末端,结构式表示为HSA-L3-H-L4-TMP-L1-TMP,L1、L3、L4表示肽接头,L1的氨基酸序列为GGPSG,L3的氨基酸序列为GGGGSGL,L4的氨基酸序列为EFGGGGS,H表示可形成二聚体化结构域的H肽段。A fusion protein of thrombopoietin disclosed by the present invention, wherein the HSA molecule is connected to the C-terminus of the fusion protein, the structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, and L4 represent peptides Linker, the amino acid sequence of L1 is GGPPSG, the amino acid sequence of L3 is GGGGSGL, the amino acid sequence of L4 is EFGGGGS, and H represents the H peptide that can form the dimerization domain.
本发明公开的一种促血小板生成素的融合蛋白,其中HSA分子位于融合蛋白的N-末端,结构式表示为TMP-L1-TMP-L2-H-fip-HSA,L1和L2表示肽接头,L1的氨基酸序列为GGPSG,L2的氨基酸序列为GGGGSRS,H表示可形成二聚体化结构域的H肽段,fip表示选自fip结构域的肽段;所述融合蛋白的氨基酸序列如SEQ ID NO:6所示,编码所述融合蛋白的核苷酸序列的DNA序列如SEQ ID NO:5所示。A fusion protein of thrombopoietin disclosed by the present invention, wherein the HSA molecule is located at the N-terminal of the fusion protein, the structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 represent peptide linkers, L1 The amino acid sequence of L2 is GGPPG, the amino acid sequence of L2 is GGGGSRS, H represents the H peptide segment that can form a dimerization domain, and fip represents a peptide segment selected from the fip domain; the amino acid sequence of the fusion protein is as shown in SEQ ID NO : shown in 6, the DNA sequence encoding the nucleotide sequence of the fusion protein is shown in SEQ ID NO: 5.
本发明公开了一种促血小板生成素的融合蛋白形成的二聚体化融合蛋白。The invention discloses a dimerized fusion protein formed by fusion protein of thrombopoietin.
本发明公开一种促血小板生成素的融合蛋白,包括人血清白蛋白HSA分子、促血小板生成素模拟肽TMP二联体,所述的人血清白蛋白HSA分子的氨基酸序列如SEQ ID NO:4所示,或该氨基酸序列经突变后具有同样功能的序列,编码所述HSA分子的氨基酸序列的DNA序列如SEQ ID NO:3所示,所述的促血小板生成素模拟肽TMP二联体的氨基酸序列如SEQ IDNO:2所示,或该氨基酸序列经突变后具有同样功能的序列,所述的促血小板生成素模拟肽TMP二联体是由一个酵母偏好密码子编码而成的,编码所述的促血小板生成素模拟肽TMP二联体的氨基酸序列的DNA序列如SEQ ID NO:1所示,该融合蛋白还包括二聚体化结构域。The invention discloses a fusion protein of thrombopoietin, which comprises human serum albumin HSA molecule and thrombopoietin mimetic peptide TMP duplex, and the amino acid sequence of said human serum albumin HSA molecule is as shown in SEQ ID NO: 4 shown, or the sequence having the same function after the amino acid sequence is mutated, the DNA sequence encoding the amino acid sequence of the HSA molecule is shown in SEQ ID NO: 3, and the thrombopoietin mimic peptide TMP duplex The amino acid sequence is shown in SEQ ID NO: 2, or a sequence having the same function after the amino acid sequence is mutated. The TMP duplex of the thrombopoietin mimetic peptide is encoded by a yeast preferred codon, encoding the The DNA sequence of the amino acid sequence of the thrombopoietin mimetic peptide TMP duplex is shown in SEQ ID NO: 1, and the fusion protein also includes a dimerization domain.
本发明公开的一种促血小板生成素的融合蛋白,所述的二聚体化结构域是通过二硫键的共价作用使促血小板生成素的融合蛋白二聚体化。The invention discloses a fusion protein of thrombopoietin, wherein the dimerization domain dimers the fusion protein of thrombopoietin through the covalent action of disulfide bonds.
本发明公开的一种促血小板生成素的融合蛋白,所述的二聚体化结构域中含有至少两个由半胱氨酸形成的二硫键。In the fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain contains at least two disulfide bonds formed by cysteine.
本发明公开的一种促血小板生成素的融合蛋白,所述的二聚体化结构域中含有两个由半胱氨酸形成的二硫键。In the fusion protein of thrombopoietin disclosed by the invention, the dimerization domain contains two disulfide bonds formed by cysteine.
本发明公开的一种促血小板生成素的融合蛋白,所述的二聚体化结构域包含H肽段,H肽段的氨基酸序列如Seq ID No:8所示。In the fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain includes an H peptide segment, and the amino acid sequence of the H peptide segment is shown in Seq ID No:8.
本发明公开的一种促血小板生成素的融合蛋白,二聚体化结构域包含H肽段和选自fip结构域的肽段,H肽段的氨基酸序列如Seq ID No:8所示,选自fip结构域的肽段的氨基酸序列如Seq ID No:10所示。In a fusion protein of thrombopoietin disclosed by the present invention, the dimerization domain includes an H peptide segment and a peptide segment selected from the fip domain, and the amino acid sequence of the H peptide segment is shown in Seq ID No: 8, selected from The amino acid sequence of the peptide from the fip domain is shown in Seq ID No:10.
本发明公开的促血小板生成素的融合蛋白,所述HSA分子连接于融合蛋白的C-末端,结构式表示为HSA-L3-H-L4-TMP-L1-TMP,L1、L3、L4表示肽接头,L1的氨基酸序列为GGPSG,L3的氨基酸序列为GGGGSGL,L4的氨基酸序列为EFGGGGS,H表示可形成二聚体化结构域的H肽段。In the fusion protein of thrombopoietin disclosed by the present invention, the HSA molecule is connected to the C-terminus of the fusion protein, and the structural formula is expressed as HSA-L3-H-L4-TMP-L1-TMP, and L1, L3, and L4 represent peptide linkers , the amino acid sequence of L1 is GGPPSG, the amino acid sequence of L3 is GGGGSGL, the amino acid sequence of L4 is EFGGGGS, and H represents the H peptide that can form the dimerization domain.
本发明公开的一种促血小板生成素的融合蛋白,所述HSA分子位于融合蛋白的N-末端,结构式表示为TMP-L1-TMP-L2-H-fip-HSA,L1和L2表示肽接头,L1的氨基酸序列为GGPSG,L2的氨基酸序列为GGGGSRS,H表示可形成二聚体化结构域的H肽段,fip表示选自fip结构域的肽段;所述融合蛋白的氨基酸序列如SEQ ID NO:6所示,编码所述融合蛋白的核苷酸序列的DNA序列如SEQ ID NO:5所示。A fusion protein of thrombopoietin disclosed by the present invention, the HSA molecule is located at the N-terminal of the fusion protein, the structural formula is expressed as TMP-L1-TMP-L2-H-fip-HSA, L1 and L2 represent peptide linkers, The amino acid sequence of L1 is GGPPG, the amino acid sequence of L2 is GGGGSRS, H represents the H peptide segment that can form a dimerization domain, and fip represents a peptide segment selected from the fip domain; the amino acid sequence of the fusion protein is as shown in SEQ ID As shown in NO:6, the DNA sequence encoding the nucleotide sequence of the fusion protein is shown in SEQ ID NO:5.
本发明公开的一种促血小板生成素的融合蛋白形成的二聚体化融合蛋白。The invention discloses a dimerization fusion protein formed by fusion protein of thrombopoietin.
本发明公开的一种促血小板生成素的融合蛋白,所述融合蛋白采用酵母细胞表达制备。The invention discloses a fusion protein of thrombopoietin, which is expressed and prepared by yeast cells.
本发明公开的一种促血小板生成素的融合蛋白,所述的酵母为嗜甲醇毕赤酵母。The invention discloses a fusion protein of thrombopoietin, and the yeast is Pichia methanolophilus.
本发明公开的一种促血小板生成素的融合蛋白在制备治疗原发性或继发性血小板减少症的药物中的应用。The application of a fusion protein of thrombopoietin disclosed by the invention in the preparation of medicine for treating primary or secondary thrombocytopenia.
本发明公开的一种促血小板生成素的融合蛋白的制备方法,所述步骤如下:A preparation method of a fusion protein of thrombopoietin disclosed by the present invention, the steps are as follows:
1)全基因合成酵母偏好密码子编码的促血小板生成素模拟肽TMP二联体序列;1) Whole-gene synthesis of the TMP duplex sequence encoded by yeast preferred codons;
2)通过PCR扩增获取HSA分子序列;2) Obtain the HSA molecular sequence by PCR amplification;
3)通过限制性内切酶酶切、连接并转化大肠杆菌,获得含编码所述酵母偏好密码子编码的促血小板生成素模拟肽二联体与人血清白蛋白分子的融合蛋白的DNA序列的重组表达载体;3) Digesting with restriction endonucleases, ligating and transforming Escherichia coli to obtain the DNA sequence encoding the fusion protein of the thrombopoietin mimetic peptide duplex encoded by the yeast preferred codon and the human serum albumin molecule Recombinant expression vector;
4)将步骤3)所述的重组表达载体转化到感受态大肠杆菌TOP10,再转化到宿主表达系统进行表达,即得所述融合蛋白。4) Transform the recombinant expression vector described in step 3) into competent Escherichia coli TOP10, and then transform into a host expression system for expression to obtain the fusion protein.
一种含有上述促血小板生成素的融合蛋白编码基因的重组表达载体。可用于携带编码本发明融合蛋白的基因的表达载体包括但不局限于原核、真核表达系统常用的质粒。A recombinant expression vector containing the gene encoding the fusion protein of the above-mentioned thrombopoietin. The expression vectors that can be used to carry the gene encoding the fusion protein of the present invention include, but are not limited to, plasmids commonly used in prokaryotic and eukaryotic expression systems.
一种含有上述重组表达载体的宿主表达系统。宿主可以是细菌、酵母及哺乳动物细胞等,其中优选的是酵母,更优选的是嗜甲醇毕赤酵母;对于重组表达的本发明融合蛋白可以通过多种方法从相应的细胞培养物中进行提取、纯化,这些方法包括离心、超滤及液相层析等技术,其中液相层析又包括了离子交换、疏水和分子筛等层析技术。A host expression system containing the above-mentioned recombinant expression vector. The host can be bacteria, yeast, mammalian cells, etc., wherein yeast is preferred, and Pichia methanolophilus is more preferred; the fusion protein of the present invention expressed recombinantly can be extracted from the corresponding cell culture by various methods , Purification, these methods include techniques such as centrifugation, ultrafiltration and liquid chromatography, among which liquid chromatography includes chromatography techniques such as ion exchange, hydrophobicity and molecular sieves.
本发明所述融合蛋白可以与药物载体一起组成药物制剂使用,这些药物载体包括水、盐水、糖类、醇类及氨基酸等。由本发明融合蛋白制成的药物制剂优选的是含水量低于3%或不含水的冻干制剂,给药方式包括静脉输注、注射(包括皮下和肌肉注射)、鼻内、呼吸道等,其中优选的是皮下或肌肉注射。The fusion protein of the present invention can be used together with pharmaceutical carriers to form pharmaceutical preparations, and these pharmaceutical carriers include water, saline, sugars, alcohols, amino acids and the like. The pharmaceutical preparation made of the fusion protein of the present invention is preferably a lyophilized preparation with a water content of less than 3% or no water, and the administration methods include intravenous infusion, injection (including subcutaneous and intramuscular injection), intranasal, respiratory tract, etc., wherein Subcutaneous or intramuscular injection is preferred.
所述融合蛋白能在酵母体内稳定高效的表达,可应用于工业生产;同时,该融合蛋白具有显著促进血小板生成的特性,在人体内具有较长的半衰期,可用于制备治疗多种原发性或继发性血小板减少症等疾病的药物。The fusion protein can be stably and efficiently expressed in yeast, and can be applied to industrial production; at the same time, the fusion protein has the characteristic of significantly promoting platelet production, has a long half-life in the human body, and can be used to prepare and treat various primary diseases. Or drugs for diseases such as secondary thrombocytopenia.
以下提供具体实施例以实现本发明所述的一种促血小板生成素的融合蛋白及其制备方法和应用。但不限于这些实施例。Specific examples are provided below to realize a thrombopoietin fusion protein of the present invention and its preparation method and application. But not limited to these examples.
附图说明:Description of drawings:
图1:插入有HSA分子基因的pcDNA3.1-HSA载体,作为构建促血小板生成素的融合蛋白表达载体的模板。Figure 1: The pcDNA3.1-HSA vector inserted with the HSA molecular gene is used as a template for constructing a fusion protein expression vector of thrombopoietin.
图2:促血小板生成素的融合蛋白表达载体图谱Figure 2: The fusion protein expression vector map of thrombopoietin
具体实施方式Detailed ways
主要实验仪器:Main experimental instruments:
移液枪、超净工作台(安泰)、磁力搅拌器、微波炉、高温蒸汽灭菌锅、-80℃低温冰箱(Forma)、超纯水仪(Millipore)、制冰机、离心机(Hitachi)、HDB-PLUS型恒温金属浴、HZQ-F16OA型恒温振荡培养箱(上海一恒)、PCR仪(Applied Biosystems)、台式冷冻离心机(Thermo)、DYY-8B型电泳仪(伯乐)、Image Quant 300型凝胶成像仪(GE)等。Pipette gun, ultra-clean bench (Antai), magnetic stirrer, microwave oven, high-temperature steam sterilizer, -80°C low-temperature refrigerator (Forma), ultra-pure water instrument (Millipore), ice maker, centrifuge (Hitachi) , HDB-PLUS constant temperature metal bath, HZQ-F16OA constant temperature shaking incubator (Shanghai Yiheng), PCR instrument (Applied Biosystems), desktop refrigerated centrifuge (Thermo), DYY-8B electrophoresis instrument (Bole), Image Quant 300 gel imager (GE), etc.
主要实验材料:Main experimental materials:
1.限制性核酸内切酶KpnI、EcoRI、SacI、AflII(NEB公司产品,美国)1. Restriction endonucleases KpnI, EcoRI, SacI, AflII (products of NEB Company, USA)
2.小提质粒试剂盒、PCR纯化试剂盒、DNA胶回收试剂盒(生工公司,中国)2. Small extraction plasmid kit, PCR purification kit, DNA gel recovery kit (Shenggong Company, China)
3.T4DNA连接酶试剂盒(Takara公司产品,中国大连)3. T4 DNA Ligase Kit (product of Takara Company, Dalian, China)
4.载体pPinkα-HC、嗜甲醇毕赤酵母菌株(Invitrogen公司产品,美国)4. Vector pPinkα-HC, Pichia methanolophilus strain (product of Invitrogen, USA)
5.大肠杆菌TOP10(天根生化科技(北京)有限公司)5. Escherichia coli TOP10 (Tiangen Biochemical Technology (Beijing) Co., Ltd.)
6.酵母提取物、蛋白胨(Oxford公司产品,美国)6. Yeast extract, peptone (product of Oxford, USA)
7.LB培养基7. LB medium
酵母提取物5g,蛋白胨10g,NaCl 10g,溶于1000ml去离子水中,并用1mol/L的NaOH调节pH值至7.0,高压蒸气灭菌。5 g of yeast extract, 10 g of peptone, and 10 g of NaCl were dissolved in 1000 ml of deionized water, adjusted to pH 7.0 with 1 mol/L NaOH, and sterilized by autoclaving.
8.YPD培养基8. YPD medium
酵母提取物10g,胰蛋白胨20g,Agar 20g,溶于900ml去离子水中,高压灭菌,冷却后加入100ml经滤器除菌后的20%的右旋糖。10 g of yeast extract, 20 g of tryptone, and 20 g of Agar were dissolved in 900 ml of deionized water, sterilized by high pressure, and after cooling, 100 ml of 20% dextrose sterilized by a filter were added.
9.YPDS培养基9. YPDS medium
酵母提取物10g,蛋白胨20g,山梨糖醇182.2g,溶于900ml去离子水中,高压灭菌,冷却后加入100ml经滤器除菌后的20%的右旋糖。10 g of yeast extract, 20 g of peptone, and 182.2 g of sorbitol were dissolved in 900 ml of deionized water, sterilized by high pressure, and 100 ml of 20% dextrose sterilized by a filter were added after cooling.
10.BMGY液体培养基10. BMGY liquid medium
酵母提取物10g,蛋白胨20g,无氨基酸酵母氮源13.4g,甘油10g,磷酸钾26.631g,溶于1000ml双蒸水中高压灭菌,冷至室温,调节pH至6.0,4℃保存备用。Yeast extract 10g, peptone 20g, amino acid-free yeast nitrogen source 13.4g, glycerol 10g, potassium phosphate 26.631g, dissolve in 1000ml double distilled water, autoclave, cool to room temperature, adjust pH to 6.0, store at 4°C for later use.
11.1%琼脂糖凝胶的配置11.1% Agarose gel configuration
根据用量,每100ml的TAE缓冲液,加入1g琼脂糖,使用微波炉加热煮沸,使琼脂糖完全融解,室温冷却至不烫手时滴加少量溴化乙锭(EB),混匀后将其倒入事先摆放好梳子的胶槽中,待到室温冷却至完全凝固后拔去梳子即可使用。According to the dosage, for every 100ml of TAE buffer, add 1g of agarose, heat and boil in a microwave oven to completely melt the agarose, drop a small amount of ethidium bromide (EB) when it is cooled at room temperature until it is not hot, mix well and pour it into Place the comb in the glue tank in advance, wait until it cools at room temperature until it is completely solidified, then pull out the comb and use it.
实施例1HSA-TMP融合蛋白酵母表达载体的构建与表达Construction and expression of embodiment 1 HSA-TMP fusion protein yeast expression vector
一、p29-simple-TMP序列的获得1. Acquisition of p29-simple-TMP sequence
1.首先根据嗜甲醇毕赤酵母偏爱密码子对编码(TMP)2的基因序列进行优化。1. First, optimize the gene sequence encoding (TMP) 2 according to the preferred codons of Pichia methanolophilus.
2.委托大连Takara公司合成优化后的(TMP)2基因,(TMP)2的DNA序列如SEQ ID NO:1所示,并将其装载到p29-simple(p29-simple质粒载体大连Takara公司提供),获得载体p29-simple-(TMP)2。2. Entrust Dalian Takara Company to synthesize the optimized (TMP) 2 gene, the DNA sequence of (TMP) 2 is shown in SEQ ID NO: 1, and load it into p29-simple (p29-simple plasmid vector provided by Dalian Takara Company ) to obtain the vector p29-simple-(TMP) 2 .
3.其中p29-simple-TMP中已包含L,即连接肽,LDNA序列为GGCGGCGGCGGTTCCGGACTGGAGCCCAAGAGCTGCGACAAGACCCACACCTGCCCTCCCTGCGAATTCGGTGGTGGCGGCAGC,氨基酸序列为GGGGSGLEPKSCDKTHTCPPCEF GGGGS。3. The p29-simple-TMP already contains L, namely the connecting peptide, the LDNA sequence is GGCGGCGGCGGTTCCGGACTGGAGCCCAAGAGCTGCGACAAGACCCACACCTGCCCTCCCTGCGAATTCGGTGGTGGCGGCAGC, and the amino acid sequence is GGGGSGLEPKSCDKTHTCPPCEF GGGGS.
二、HSA cDNA的片段的克隆:2. Cloning of fragments of HSA cDNA:
从质粒pcDNA3.1-fip-HSA克隆获得(HSA全基因序列由宝生物公司全基因合成,pcDNA3.1质粒购自Invitrogen)Obtained from plasmid pcDNA3.1-fip-HSA clone (HSA complete gene sequence was synthesized by Bao Biological Company, pcDNA3.1 plasmid was purchased from Invitrogen)
三、HSA分子序列的获得3. Obtaining the molecular sequence of HSA
1.设计合成PCR引物:1. Design and synthesize PCR primers:
P1:GGTACCTCATAAGCCTAAGGCAGCTTGP1: GGTACCTCATAAGCCTAAGGCAGCTTG
P2:TCCGGAGATGCACACAAGAGTGAGP2:TCCGGAGATGCACACAAGAGTGAG
2.PCR扩增:以载体pcDNA3.1-HSA的DNA为模版,以P1和P2分别作为上、下游引物,进行PCR扩增。反应条件如下:①变性:94℃,5min;②变性:94℃,1min;③复性:55℃,30S;④延伸:72℃,2min;⑤返回步骤“②”,35循环;⑥延伸:72℃,5min,总循环次数为30次。将PCR产物进行1%琼脂糖凝胶电泳,结果显示扩增出约1.8kb大小的HSA分子的DNA条带。2. PCR amplification: The DNA of the carrier pcDNA3.1-HSA was used as a template, and P1 and P2 were used as upstream and downstream primers respectively to carry out PCR amplification. The reaction conditions are as follows: ① Denaturation: 94°C, 5min; ② Denaturation: 94°C, 1min; ③ Renaturation: 55°C, 30S; ④ Extension: 72°C, 2min; ⑤ Return to step "②", 35 cycles; ⑥ Extend: 72°C, 5min, the total number of cycles is 30 times. The PCR product was subjected to 1% agarose gel electrophoresis, and the result showed that a DNA band of about 1.8 kb in size was amplified.
四、pPinkα-HC-HSA-(TMP)2融合蛋白酵母表达载体的构建4. Construction of pPinkα-HC-HSA-(TMP) 2 fusion protein yeast expression vector
1.提取p29-simple-(TMP)2载体,KpnI和EcoRI双酶切质粒,胶回收对应的TMP(KpnI/EcoRI)DNA片段,DNA序列如SEQ ID NO:1所示,氨基酸序列如SEQ ID NO:2所示;1. Extract p29-simple-(TMP) 2 vector, KpnI and EcoRI double digestion plasmid, gel recovery corresponding TMP (KpnI/EcoRI) DNA fragment, the DNA sequence is shown in SEQ ID NO: 1, and the amino acid sequence is shown in SEQ ID NO: as shown in 2;
2.将HSA分子经PCR扩增后的产物EcoRI和SacI双酶切,胶回收对应的HSA(EcoRI/SacI)DNA片段,DNA序列如SEQ ID NO:3所示,氨基酸序列如SEQ ID NO:4所示;2. Digest the HSA molecule amplified by PCR with EcoRI and SacI, and recover the corresponding HSA (EcoRI/SacI) DNA fragment from the gel. The DNA sequence is shown in SEQ ID NO: 3, and the amino acid sequence is shown in SEQ ID NO: 4 shown;
3.同时,SacI和KpnI双酶切载体pPinkα-HC(Invitrogen公司产品)的DNA,胶回收pPinkα-HC(SacI/KpnI)载体片段;3. At the same time, SacI and KpnI double-enzyme-cut the DNA of the carrier pPinkα-HC (product of Invitrogen Company), and gel recovered the pPinkα-HC (SacI/KpnI) carrier fragment;
4.T4DNA酶连接TMP(KpnI/EcoRI)DNA片段、HSA(EcoRI/SacI)DNA片段和pPinkα-HC(SacI/KpnI)载体片段,转化感受态大肠杆菌TOP10,涂布于氨苄抗性LB板37℃培养过夜,筛选阳性克隆。所得克隆送Invitrogen公司测序,序列正确的克隆命名为pPINKα-HC-HSA-(TMP)2。4. T4 DNA enzyme ligated TMP (KpnI/EcoRI) DNA fragments, HSA (EcoRI/SacI) DNA fragments and pPinkα-HC (SacI/KpnI) vector fragments, transformed into competent Escherichia coli TOP10, and coated on ampicillin-resistant LB plate 37 Cultivate overnight at ℃, and select positive clones. The obtained clone was sent to Invitrogen Company for sequencing, and the clone with the correct sequence was named pPINKα-HC-HSA-(TMP)2.
五、HSA-TMP融合蛋白在酵母中的表达5. Expression of HSA-TMP fusion protein in yeast
将测序正确的载体pPinkα-HC-HSA-(TMP)2的DNA用AflII酶切回收后得到pPinkα-HC-HSA-(TMP)2,转化酵母感受态细胞。然后将转化菌液接种于PAD平板,30℃培养3-4天,挑取阳性克隆。将得到阳性克隆分别接种BMGY液体培养基,30℃培养48小时,然后转接至BMMY培养基中诱导表达,持续96小时后,1500rpm低温离心15分钟,取上清,SDS-PAGE电泳检测蛋白表达情况,分子量约70kD蛋白条带即为HSA-TMP融合蛋白,所述融合蛋白的氨基酸序列如SEQ ID NO:6,编码所述融合蛋白的氨基酸序列的DNA序列如SEQ ID NO:5所示。选择表达水平最高的菌株作为工程菌,冻存于-80℃保种DNA of vector pPinkα-HC-HSA-(TMP) 2 sequenced correctly was digested with AflII and recovered to obtain pPinkα-HC-HSA-(TMP) 2 , which was transformed into competent yeast cells. Then the transformed bacteria liquid was inoculated on the PAD plate, cultured at 30°C for 3-4 days, and positive clones were picked. The positive clones were inoculated into BMGY liquid medium, cultured at 30°C for 48 hours, and then transferred to BMMY medium to induce expression. After 96 hours, centrifuged at 1500rpm for 15 minutes at low temperature, took the supernatant, and detected protein expression by SDS-PAGE electrophoresis In this case, the protein band with a molecular weight of about 70kD is the HSA-TMP fusion protein, the amino acid sequence of the fusion protein is shown in SEQ ID NO: 6, and the DNA sequence encoding the amino acid sequence of the fusion protein is shown in SEQ ID NO: 5. Select the strain with the highest expression level as the engineering strain, and freeze it at -80°C for preservation
实施例2HSA-(TMP)2融合蛋白的生物活性检测The biological activity detection of embodiment 2HSA-(TMP) 2 fusion protein
1、实验过程1. Experimental process
细胞铺板,c-mpl、c-fos、pAdVAntage、Renilla,四质粒共转染。转染后48h,分别加入阳性对照、空白对照和受试样品,分别检测萤火虫荧光素酶活力和海肾荧光素酶活力,计算荧光比率。Cell plating, c-mpl, c-fos, pAdVAntage, Renilla, four plasmid co-transfection. 48 hours after transfection, the positive control, blank control and test samples were added to detect firefly luciferase activity and Renilla luciferase activity, respectively, and calculate the fluorescence ratio.
阳性对照:特比澳特比澳(重组人促血小板生成素注射液)购买自沈阳三生制药有限责任公司Positive control: TPIAO TPIAO (recombinant human thrombopoietin injection) purchased from Shenyang Sansheng Pharmaceutical Co., Ltd.
空白对照:血清Blank control: serum
荧光比率=萤火虫荧光值/海肾荧光值。Fluorescence ratio = firefly fluorescence value/renilla fluorescence value.
具体步骤详见中国专利(CN201310084212)。For specific steps, see Chinese patent (CN201310084212).
2.实验结果2. Experimental results
表1HSA-(TMP)2融合蛋白的生物活性检测结果The biological activity detection result of table 1HSA-(TMP) 2 fusion protein
结果表明,所有转染不含c-mpl受体的pcDNA3.1-HSA空载体各处理组,相对荧光比率均为1。所有转染c-mpl-pcDNA3.1载体的各处理组数据见上表1。由表1可知所检测13株酵母菌表达样品中的HSA-(TMP)2融合蛋白均具有显著的c-mpl受体依赖的生物活性。其中4-2号酵母菌株所表达的融合蛋白活性显著高于阳性对照。特比奥阳性药物及通过对TMP二联体核苷酸序列进行偏好密码子改造后的13株HSA-(TMP)2融合蛋白酵母表达菌株均具有受体依赖的生物活性,其中2-1号、2-4号、4-1号、4-2号、4-3号、4-4号酵母表达的融合蛋白的生物活性得到了显著性提高。The results showed that the relative fluorescence ratio of all treatment groups transfected with pcDNA3.1-HSA empty vector without c-mpl receptor was 1. The data of all treatment groups transfected with the c-mpl-pcDNA3.1 vector are shown in Table 1 above. It can be seen from Table 1 that the HSA-(TMP) 2 fusion proteins in the 13 tested yeast expression samples all have significant c-mpl receptor-dependent biological activities. Among them, the activity of the fusion protein expressed by No. 4-2 yeast strain was significantly higher than that of the positive control. Tebio-positive drugs and 13 HSA-(TMP) 2 fusion protein yeast expression strains after modifying the TMP duplex nucleotide sequence with preferred codons all have receptor-dependent biological activities, and No. 2-1 , No. 2-4, No. 4-1, No. 4-2, No. 4-3, and No. 4-4, the biological activity of the fusion protein expressed by the yeast has been significantly improved.
专利号CN201310084212.8的二聚体化融合蛋白的制备及应用中公开了二聚体化促血小板生成素(TPO)模拟肽TMP二联体-人血清白蛋白融合蛋白的制备及应用,该专利中酵母表达的二聚体化促血小板生成素模拟肽TMP二联体-人血清白蛋白融合蛋白c-mpl受体依赖的细胞水平生物活性平均值为1.635,本发明公开的促血小板生成素的融合蛋白生物活性检测结果平均值为1.749;The preparation and application of dimerization fusion protein of patent number CN201310084212.8 discloses the preparation and application of dimerization thrombopoietin (TPO) mimetic peptide TMP duplex-human serum albumin fusion protein, the patent The dimerized thrombopoietin mimetic peptide TMP duplex-human serum albumin fusion protein c-mpl receptor-dependent average biological activity of the dimerized thrombopoietin expressed in yeast is 1.635, and the thrombopoietin disclosed in the present invention has The average value of the bioactivity test results of the fusion protein was 1.749;
因此,本发明通过对TMP二联体实施偏好密码子改造,提高了促血小板生成素的融合蛋白生物活性的均值,高于未对促血小板生成素模拟肽TMP二联体基因改造产生的融合蛋白的活性均值。Therefore, the present invention improves the mean value of the biological activity of the fusion protein of thrombopoietin by implementing preferred codon modification on the TMP duplex, which is higher than the fusion protein produced without genetic modification of the TMP duplex of the thrombopoietin mimetic peptide activity mean.
此外,CN201310084212.8专利中公开的融合蛋白活性最高的是2号酵母株表达的二聚体融合蛋白,其生物活性为1.71±0.11,本发明公开的2-1号、2-4号、4-1号、4-2号、4-3号、4-4号的生物活性均高于上述2号酵母株表达的二聚体融合蛋白。In addition, the fusion protein disclosed in the CN201310084212.8 patent has the highest activity of the dimeric fusion protein expressed by No. 2 yeast strain, and its biological activity is 1.71±0.11. -1, 4-2, 4-3, and 4-4 had biological activities higher than the dimer fusion protein expressed by the yeast strain No. 2 above.
通过对TMP二联体实施偏好密码子改造,不仅提高了促血小板生成素的融合蛋白生物活性的均值,本发明公开的2-1号、2-4号、4-1号、4-2号、4-3号、4-4号,所述酵母产生的融合蛋白的活性也得到了显著提高。By implementing preferred codon modification to the TMP duplex, not only the mean value of the biological activity of the fusion protein of thrombopoietin is improved, but No. 2-1, No. 2-4, No. 4-1 and No. 4-2 disclosed by the present invention , No. 4-3, and No. 4-4, the activity of the fusion protein produced by the yeast has also been significantly improved.
序列表(SEQUENCE LISTING)SEQUENCE LISTING
<110> 兰州大学<110> Lanzhou University
<120> 一种促血小板生成素的融合蛋白<120> A fusion protein of thrombopoietin
<160> 10<160> 10
<170> PatentIn version 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 99<211> 99
<212> DNA<212>DNA
<213> TMP二联体的基因序列<213> Gene sequence of TMP duplex
<400> 1<400> 1
attgagggtc caactttgag acaatggttg gctgctagag ccggtggtcc atctggaatc 60attgaggtc caactttgag acaatggttg gctgctagag ccggtggtcc atctggaatc 60
gagggaccta ccctgagaca gtggcttgct gctagagct 99gagggaccta ccctgagaca gtggcttgct gctagagct 99
<210> 2<210> 2
<211> 33<211> 33
<212> PRT<212> PRT
<213> TMP二联体的氨基酸序列Amino acid sequence of <213> TMP duplex
<400> 2<400> 2
1 Ile Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile1 Ile Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile
21 Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala21 Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala
<210> 3<210> 3
<211> 1755<211> 1755
<212> DNA<212>DNA
<213> HSA分子的基因序列<213> Gene sequence of HSA molecule
<400> 3<400> 3
gatgcacaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 60gatgcacaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 60
gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 120gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 120
aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 180aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 180
aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 240aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 240
cgtgaaacct atggtgaaat ggctgactgt tgtgcaaaac aagaacctga gagaaatgaa 300cgtgaaacct atggtgaaat ggctgactgt tgtgcaaaac aagaacctga gagaaatgaa 300
tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 360tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 360
gatgtgatgt gcactgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 420gatgtgatgt gcactgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 420
gaaattgcca gaagacatcc ttacttttat gccccggaac tccttttctt tgctaaaagg 480gaaattgcca gaagacatcc ttacttttat gccccggaac tccttttctt tgctaaaagg 480
tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 540tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 540
aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 600aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 600
gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 660gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 660
cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 720cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 720
gtccacacgg aatgctgcca tggagatctg cttgaatgtg ctgatgacag ggcggacctt 780gtccaacacgg aatgctgcca tggagatctg cttgaatgtg ctgatgacag ggcggacctt 780
gccaagtata tctgtgaaaa tcaagattcg atctccagta aactgaagga atgctgtgaa 840gccaagtata tctgtgaaaa tcaagattcg atctccagta aactgaagga atgctgtgaa 840
aaacctctgt tggaaaaatc ccactgcatt gccgaagtgg aaaatgatga gatgcctgct 900aaacctctgt tggaaaaatc ccactgcatt gccgaagtgg aaaatgatga gatgcctgct 900
gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 960gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 960
gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 1020gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 1020
tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 1080tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 1080
tgtgccgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 1140tgtgccgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 1140
gtggaagagc ctcagaattt aatcaaacaa aattgtgagc tttttgagca gcttggagag 1200gtggaagagc ctcagaattt aatcaaacaa aattgtgagc tttttgagca gcttggagag 1200
tacaaattcc agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtcaact 1260tacaaattcc agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtcaact 1260
ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 1320ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 1320
cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 1380cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 1380
tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 1440tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 1440
ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 1500ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 1500
gagtttaatg ctgaaacgtt caccttccat gcagatatat gcacactttc tgagaaggag 1560gagtttaatg ctgaaacgtt caccttccat gcagatatat gcacactttc tgagaaggag 1560
agacaaatca agaaacaaac tgcacttgtt gagcttgtga aacacaagcc caaggcaaca 1620agacaaatca agaaacaaac tgcacttgtt gagcttgtga aacacaagcc caaggcaaca 1620
aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 1680aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 1680
gctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 1740gctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 1740
gctgccttag gctta 1755gctgccttag gctta 1755
<210> 4<210> 4
<211> 585<211> 585
<212> PRT<212> PRT
<213> HSA分子的氨基酸序列<213> Amino acid sequence of the HSA molecule
<400> 4<400> 4
1 Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys1 Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys
21 Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val21 Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val
41 Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu41 Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu
61 Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu61 Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu
81 Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu81 Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu
101 Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val101 Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val
121 Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr121 Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr
141 Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg141 Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg
161 Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro161 Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro
181 Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys181 Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys
201 Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser201 Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser
221 Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys221 Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys
241 Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu241 Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu
261 Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu261 Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu
281 Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu MET Pro Ala281 Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu MET Pro Ala
301 Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala301 Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala
321 Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp321 Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp
341 Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys341 Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys
361 Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu361 Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu
381 Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu381 Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu
401 Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr401 Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr
421 Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His421 Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His
441 Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu441 Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu
461 Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser461 Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser
481 Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys481 Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys
501 Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu501 Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu
521 Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr521 Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr
541 Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys541 Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys
561 Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln561 Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Lys Leu Val Ala Ala Ser Gln
581 Ala Ala Leu Gly Leu581 Ala Ala Leu Gly Leu
<210> 5<210> 5
<211> 2010<211> 2010
<212> DNA<212>DNA
<213> ss-TMP-L1-TMP-L2 -H-fip-HSA融合蛋白的基因序列<213> Gene sequence of ss-TMP-L1-TMP-L2-H-fip-HSA fusion protein
<400> 5<400> 5
aagtgggtaa cctttatttc ccttcttttt ctctttagct cggcttattc caggggtgtg 60aagtgggtaa cctttatttc ccttcttttt ctctttagct cggcttattc caggggtgtg 60
tttcgtcgag atgcacacaa gagtgaggtt gctcatcggt ttaaagattt gggagaagaa 120tttcgtcgag atgcacacaa gagtgaggtt gctcatcggt ttaaagattt gggagaagaa 120
aatttcaaag ccttggtgtt gattgccttt gctcagtatc ttcagcagtg tccatttgaa 180aatttcaaag ccttggtgtt gattgccttt gctcagtatc ttcagcagtg tccatttgaa 180
gatcatgtaa aattagtgaa tgaagtaact gaatttgcaa aaacatgtgt tgctgatgag 240gatcatgtaa aattagtgaa tgaagtaact gaatttgcaa aaacatgtgttgctgatgag 240
tcagctgaaa attgtgacaa atcacttcat accctttttg gagacaaatt atgcacagtt 300tcagctgaaa attgtgacaa atcacttcat accctttttg gagacaaatt atgcacagtt 300
gcaactcttc gtgaaaccta tggtgaaatg gctgactgtt gtgcaaaaca agaacctgag 360gcaactcttc gtgaaaccta tggtgaaatg gctgactgtt gtgcaaaaca agaacctgag 360
agaaatgaat gcttcttgca acacaaagat gacaacccaa acctcccccg attggtgaga 420agaaatgaat gcttcttgca acacaaagat gacaacccaa acctcccccg attggtgaga 420
ccagaggttg atgtgatgtg cactgctttt catgacaatg aagagacatt tttgaaaaaa 480ccagaggttg atgtgatgtg cactgctttt catgacaatg aagagacatt tttgaaaaaa 480
tacttatatg aaattgccag aagacatcct tacttttatg ccccggaact ccttttcttt 540tacttatatg aaattgccag aagacatcct tacttttatg ccccggaact ccttttcttt 540
gctaaaaggt ataaagctgc ttttacagaa tgttgccaag ctgctgataa agctgcctgc 600gctaaaaggt ataaagctgc ttttacagaa tgttgccaag ctgctgataa agctgcctgc 600
ctgttgccaa agctcgatga acttcgggat gaagggaagg cttcgtctgc caaacagaga 660ctgttgccaa agctcgatga acttcgggat gaagggaagg cttcgtctgc caaacagaga 660
ctcaagtgtg ccagtctcca aaaatttgga gaaagagctt tcaaagcatg ggcagtagct 720ctcaagtgtg ccagtctcca aaaatttgga gaaagagctt tcaaagcatg ggcagtagct 720
cgcctgagcc agagatttcc caaagctgag tttgcagaag tttccaagtt agtgacagat 780cgcctgagcc agagatttcc caaagctgag tttgcagaag tttccaagtt agtgacagat 780
cttaccaaag tccacacgga atgctgccat ggagatctgc ttgaatgtgc tgatgacagg 840cttaccaaag tccacacgga atgctgccat ggagatctgc ttgaatgtgc tgatgacagg 840
gcggaccttg ccaagtatat ctgtgaaaat caagattcga tctccagtaa actgaaggaa 900gcggaccttg ccaagtatat ctgtgaaaat caagattcga tctccagtaa actgaaggaa 900
tgctgtgaaa aacctctgtt ggaaaaatcc cactgcattg ccgaagtgga aaatgatgag 960tgctgtgaaa aacctctgtt ggaaaaatcc cactgcattg ccgaagtgga aaatgatgag 960
atgcctgctg acttgccttc attagctgct gattttgttg aaagtaagga tgtttgcaaa 1020atgcctgctg acttgccttc attagctgct gattttgttg aaagtaagga tgtttgcaaa 1020
aactatgctg aggcaaagga tgtcttcctg ggcatgtttt tgtatgaata tgcaagaagg 1080aactatgctg aggcaaagga tgtcttcctg ggcatgtttt tgtatgaata tgcaagaagg 1080
catcctgatt actctgtcgt gctgctgctg agacttgcca agacatatga aaccactcta 1140catcctgatt actctgtcgt gctgctgctg agacttgcca agacatatga aaccactcta 1140
gagaagtgct gtgccgctgc agatcctcat gaatgctatg ccaaagtgtt cgatgaattt 1200gagaagtgct gtgccgctgc agatcctcat gaatgctatg ccaaagtgtt cgatgaattt 1200
aaacctcttg tggaagagcc tcagaattta atcaaacaaa attgtgagct ttttgagcag 1260aaacctcttg tggaagagcc tcagaattta atcaaacaaa attgtgagct ttttgagcag 1260
cttggagagt acaaattcca gaatgcgcta ttagttcgtt acaccaagaa agtaccccaa 1320cttggagagt acaaattcca gaatgcgcta ttagttcgtt acaccaagaa agtaccccaa 1320
gtgtcaactc caactcttgt agaggtctca agaaacctag gaaaagtggg cagcaaatgt 1380gtgtcaactc caactcttgt agaggtctca agaaacctag gaaaagtggg cagcaaatgt 1380
tgtaaacatc ctgaagcaaa aagaatgccc tgtgcagaag actatctatc cgtggtcctg 1440tgtaaacatc ctgaagcaaa aagaatgccc tgtgcagaag actatctatc cgtggtcctg 1440
aaccagttat gtgtgttgca tgagaaaacg ccagtaagtg acagagtcac caaatgctgc 1500aaccagttat gtgtgttgca tgagaaaacg ccagtaagtg acagagtcac caaatgctgc 1500
acagaatcct tggtgaacag gcgaccatgc ttttcagctc tggaagtcga tgaaacatac 1560acagaatcct tggtgaacag gcgaccatgc ttttcagctc tggaagtcga tgaaacatac 1560
gttcccaaag agtttaatgc tgaaacgttc accttccatg cagatatatg cacactttct 1620gttcccaaag agtttaatgc tgaaacgttc accttccatg cagatatatg cacactttct 1620
gagaaggaga gacaaatcaa gaaacaaact gcacttgttg agcttgtgaa acacaagccc 1680gagaaggaga gacaaatcaa gaaacaaact gcacttgttg agcttgtgaa acacaagccc 1680
aaggcaacaa aagagcaact gaaagctgtt atggatgatt tcgcagcttt tgtagagaag 1740aaggcaacaa aagagcaact gaaagctgtt atggatgatt tcgcagcttt tgtagagaag 1740
tgctgcaagg ctgacgataa ggagacctgc tttgccgagg agggtaaaaa acttgttgct 1800tgctgcaagg ctgacgataa ggagacctgc tttgccgagg agggtaaaaa acttgttgct 1800
gcaagtcaag ctgccttagg cttaggcggc ggcggttccg gactggagcc caagagctgc 1860gcaagtcaag ctgccttagg ctaggcggc ggcggttccg gactggagcc caagagctgc 1860
gacaagaccc acacctgccc tccctgcgaa ttcggtggtg gtggttctat tgagggtcca 1920gacaagaccc acacctgccc tccctgcgaa ttcggtggtg gtggttctat tgagggtcca 1920
actttgagac aatggttggc tgctagagcc ggtggtccat ctggaatcga gggacctacc 1980actttgagac aatggttggc tgctagagcc ggtggtccat ctggaatcga gggacctacc 1980
ctgagacagt ggcttgctgc tagagcttaa 2010ctgagacagt ggcttgctgc tagagcttaa 2010
<210> 6<210> 6
<211> 670<211> 670
<212> PRT<212> PRT
<213> ss-TMP-L1-TMP-L2 -H-fip-HSA融合蛋白的氨基酸序列<213> Amino acid sequence of ss-TMP-L1-TMP-L2-H-fip-HSA fusion protein
<400> 6<400> 6
1 Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser Arg Gly Val1 Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser Arg Gly Val
21 Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu21 Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu
41 Asn Phe Lys Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu41 Asn Phe Lys Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu
61 Asp His Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu61 Asp His Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu
81 Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val81 Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val
101 Ala Thr Leu Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu101 Ala Thr Leu Arg Glu Thr Tyr Gly Glu MET Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu
121 Arg Asn Glu Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg121 Arg Asn Glu Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg
141 Pro Glu Val Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys141 Pro Glu Val Asp Val MET Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys
161 Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe161 Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe
181 Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys181 Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys
201 Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg201 Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg
221 Leu Lys Cys Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala221 Leu Lys Cys Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala
241 Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp241 Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp
261 Leu Thr Lys Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg261 Leu Thr Lys Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg
281 Ala Asp Leu Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu281 Ala Asp Leu Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu
301 Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu301 Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu
321 MET Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys321 MET Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys
341 Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg341 Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly MET Phe Leu Tyr Glu Tyr Ala Arg Arg
361 His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu361 His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu
381 Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe381 Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe
401 Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln401 Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln
421 Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln421 Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln
441 Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys441 Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys
461 Cys Lys His Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu461 Cys Lys His Pro Glu Ala Lys Arg MET Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu
481 Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys481 Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys
501 Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr501 Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr
521 Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser521 Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser
541 Glu Lys Glu Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro541 Glu Lys Glu Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro
561 Lys Ala Thr Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys561 Lys Ala Thr Lys Glu Gln Leu Lys Ala Val MET Asp Asp Phe Ala Ala Phe Val Glu Lys
581 Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala581 Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala
601 Ala Ser Gln Ala Ala Leu Gly Leu Gly Gly Gly Gly Ser Gly Leu Glu Pro Lys Ser Cys601 Ala Ser Gln Ala Ala Leu Gly Leu Gly Gly Gly Gly Ser Gly Leu Glu Pro Lys Ser Cys
621 Asp Lys Thr His Thr Cys Pro Pro Cys Glu Phe Gly Gly Gly Gly Ser Ile Glu Gly Pro621 Asp Lys Thr His Thr Cys Pro Pro Cys Glu Phe Gly Gly Gly Gly Ser Ile Glu Gly Pro
641 Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile Glu Gly Pro Thr641 Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala Gly Gly Pro Ser Gly Ile Glu Gly Pro Thr
661 Leu Arg Gln Trp Leu Ala Ala Arg Ala ***661 Leu Arg Gln Trp Leu Ala Ala Arg Ala ***
<210> 7<210> 7
<211> 42<211> 42
<212> DNA<212>DNA
<213> H 肽段的基因序列<213> Gene sequence of H peptide
<400> 7<400> 7
gagcccaaga gctgcgacaa gacccacacc tgccctccct gc 42gagcccaaga gctgcgacaa gacccacacc tgccctccct gc 42
<210> 8<210> 8
<211> 14<211> 14
<212> PRT<212> PRT
<213> H 肽段的氨基酸序列Amino acid sequence of <213> H peptide
<400> 8<400> 8
1 Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys1 Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys
<210> 9<210> 9
<211> 114<211> 114
<212> DNA<212>DNA
<213> 选自fip结构域的肽段的基因序列<213> Gene sequence selected from peptides of fip domain
<400> 9<400> 9
gtgagcaggg acgagctgat ggaggccatc cagaagcagg aggagatcaa cttcaggctg 60gtgagcaggg acgagctgat ggaggccatc cagaagcagg aggagatcaa cttcaggctg 60
caggactaca tcgacaggat catcgtggcc atcatggaga ccaaccccag catc 114caggactaca tcgacaggat catcgtggcc atcatggaga ccaacccccag catc 114
<210> 10<210> 10
<211> 38<211> 38
<212> PRT<212> PRT
<213> 选自fip结构域的肽段的氨基酸序列<213> Amino acid sequence of a peptide selected from the fip domain
<400> 10<400> 10
1 Val Ser Arg Asp Glu Leu MET Glu Ala Ile Gln Lys Gln Glu Glu Ile Asn Phe Arg Leu1 Val Ser Arg Asp Glu Leu MET Glu Ala Ile Gln Lys Gln Glu Glu Ile Asn Phe Arg Leu
21 Gln Asp Tyr Ile Asp Arg Ile Ile Val Ala Ile MET Glu Thr Asn Pro Ser Ile21 Gln Asp Tyr Ile Asp Arg Ile Ile Val Ala Ile MET Glu Thr Asn Pro Ser Ile
Claims (23)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710158090.0A CN108623690A (en) | 2017-03-16 | 2017-03-16 | A kind of fusion protein of thrombopoietin and its preparation method and application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710158090.0A CN108623690A (en) | 2017-03-16 | 2017-03-16 | A kind of fusion protein of thrombopoietin and its preparation method and application |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108623690A true CN108623690A (en) | 2018-10-09 |
Family
ID=63687645
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710158090.0A Pending CN108623690A (en) | 2017-03-16 | 2017-03-16 | A kind of fusion protein of thrombopoietin and its preparation method and application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108623690A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109336974A (en) * | 2018-11-12 | 2019-02-15 | 兰州大学 | A kind of preparation method of anti-TMP rabbit polyclonal antibody |
CN113527508A (en) * | 2020-04-17 | 2021-10-22 | 上海多米瑞生物技术有限公司 | A kind of preparation method of thrombopoietin peptidomimetic-Fc fusion protein |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030195154A1 (en) * | 2002-04-04 | 2003-10-16 | Kenneth Walker | Use of transthyretin peptide/protein fusions to increase the serum half-life of pharmacologically active peptides/proteins |
CN101362797A (en) * | 2008-09-28 | 2009-02-11 | 华东师范大学 | A kind of thrombopoietin mimetic peptide, its preparation and application |
CN101863982A (en) * | 2009-04-17 | 2010-10-20 | 哈药集团生物工程有限公司 | Fusion protein for increasing blood platelets and preparation method thereof |
CN104045715A (en) * | 2013-03-15 | 2014-09-17 | 兰州大学 | Preparation and application of dimerized fusion protein |
-
2017
- 2017-03-16 CN CN201710158090.0A patent/CN108623690A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030195154A1 (en) * | 2002-04-04 | 2003-10-16 | Kenneth Walker | Use of transthyretin peptide/protein fusions to increase the serum half-life of pharmacologically active peptides/proteins |
CN101362797A (en) * | 2008-09-28 | 2009-02-11 | 华东师范大学 | A kind of thrombopoietin mimetic peptide, its preparation and application |
CN101863982A (en) * | 2009-04-17 | 2010-10-20 | 哈药集团生物工程有限公司 | Fusion protein for increasing blood platelets and preparation method thereof |
CN104045715A (en) * | 2013-03-15 | 2014-09-17 | 兰州大学 | Preparation and application of dimerized fusion protein |
Non-Patent Citations (5)
Title |
---|
HU ZHIYI等: "Secretion Expression and Activity Assay of a Novel Fusion Protein of Thrombopoietin and Interleukin-6 in Pichia pastoris", 《JOURNAL OF BIOCHEMISTRY》 * |
MEIZHU WANG等: "Functional expression of human serum albumin-tandem thrombopoietin mimetic peptide fusion protein as a novel thrombopoietin analog in Pichia pastoris", 《BIOTECHNOLOGY LETTERS》 * |
刘朔: "纳豆激酶基因密码子优化设计与合成及在毕赤酵母中的高效表达", 《中国博士学位论文全文数据库》 * |
杨洋等: "重组双体化促血小板生成素模拟肽人血清白蛋白融合蛋白(HSA-TMP-TMP)体内生物活性与药效学研究", 《药物生物技术》 * |
王海晴等: "促血小板生成素模拟肽二联体连接肽的筛选", 《药物生物技术》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109336974A (en) * | 2018-11-12 | 2019-02-15 | 兰州大学 | A kind of preparation method of anti-TMP rabbit polyclonal antibody |
CN113527508A (en) * | 2020-04-17 | 2021-10-22 | 上海多米瑞生物技术有限公司 | A kind of preparation method of thrombopoietin peptidomimetic-Fc fusion protein |
CN113527508B (en) * | 2020-04-17 | 2024-01-05 | 上海多米瑞生物技术有限公司 | Preparation method of thrombopoietin peptidomimetic-Fc fusion protein |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110845603B (en) | Human collagen 17-type polypeptide, production method and use thereof | |
US11034731B2 (en) | Stabilized soluble pre-fusion RSV F polypeptides | |
EP3453401A1 (en) | Interleukin combination and use thereof | |
KR20160002938A (en) | Stabilized soluble prefusion rsv f polypeptides | |
CN110577592B (en) | Recombinant human fibronectin peptide | |
KR20160021196A (en) | Stabilized soluble pre-fusion rsv f polypeptides | |
JP2004024276A (en) | Stable protein product with improved disinfection / permeability and pharmaceutical composition containing same | |
CN102250254A (en) | Tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) fusion protein, its preparation and applications | |
CN101413002A (en) | Recombinant Kluyveromyces sp. expressing antibody or antibody analogue, and construction method and use thereof | |
CN104045715B (en) | The preparation and application of Dimerized fusion protein | |
CN116554309A (en) | Recombinant human III type collagen and preparation method and application thereof | |
CN110835366B (en) | Tag polypeptide for promoting soluble expression of protein and application thereof | |
CN108623690A (en) | A kind of fusion protein of thrombopoietin and its preparation method and application | |
CN109384849B (en) | A fusion protein of thrombopoietin and its preparation method and application | |
AU2020243073B2 (en) | Recombinant CCN domain proteins and fusion proteins | |
CN112390875B (en) | A high-affinity T-cell receptor that recognizes AFP | |
KR20130141001A (en) | A novel vector system for isolation and purification of target proteins | |
KR20180003677A (en) | Pharmaceutical compositions comprising mutant proteins of human growth hormone or transferrin fusion proteins thereof | |
CN106674352B (en) | Derivative of antineoplastic protein endostatin and application thereof | |
CN109748969B (en) | A kind of dimerization fusion protein and its preparation method and application | |
WO2022007885A1 (en) | Fusion polypeptide and polypeptide dimer, and use thereof | |
CN113912735B (en) | Polypeptide dimer and its use | |
CN100404683C (en) | A kind of human FL polynucleotide and its use in combination with human GM-CSF gene therapy for malignant tumors | |
CN112480262B (en) | Fusion protein and preparation and application thereof | |
CN116621989A (en) | Multimerization structural monomer and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20181009 |