WO2024119434A1 - Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein - Google Patents
Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein Download PDFInfo
- Publication number
- WO2024119434A1 WO2024119434A1 PCT/CN2022/137539 CN2022137539W WO2024119434A1 WO 2024119434 A1 WO2024119434 A1 WO 2024119434A1 CN 2022137539 W CN2022137539 W CN 2022137539W WO 2024119434 A1 WO2024119434 A1 WO 2024119434A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protein
- peptide
- fusion protein
- amino acid
- tag
- Prior art date
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 156
- 230000014509 gene expression Effects 0.000 title claims description 89
- 230000002378 acidificating effect Effects 0.000 title claims description 28
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 title description 34
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 title description 34
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 111
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 111
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 53
- 229920001184 polypeptide Polymers 0.000 claims abstract description 50
- 150000001413 amino acids Chemical class 0.000 claims abstract description 45
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 25
- 239000002157 polynucleotide Substances 0.000 claims abstract description 25
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 25
- 238000004519 manufacturing process Methods 0.000 claims abstract description 11
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 12
- 108090000623 proteins and genes Proteins 0.000 claims description 176
- 102000004169 proteins and genes Human genes 0.000 claims description 170
- 210000004027 cell Anatomy 0.000 claims description 56
- 238000000746 purification Methods 0.000 claims description 45
- 241000894006 Bacteria Species 0.000 claims description 37
- 238000000034 method Methods 0.000 claims description 31
- 238000003776 cleavage reaction Methods 0.000 claims description 25
- 230000004927 fusion Effects 0.000 claims description 25
- 230000007017 scission Effects 0.000 claims description 22
- 241000588724 Escherichia coli Species 0.000 claims description 17
- 125000006850 spacer group Chemical group 0.000 claims description 16
- 238000011144 upstream manufacturing Methods 0.000 claims description 12
- 239000002773 nucleotide Substances 0.000 claims description 11
- 125000003729 nucleotide group Chemical group 0.000 claims description 11
- 241000588722 Escherichia Species 0.000 claims description 10
- 125000000539 amino acid group Chemical group 0.000 claims description 10
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 7
- 230000007935 neutral effect Effects 0.000 claims description 7
- 244000063299 Bacillus subtilis Species 0.000 claims description 6
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 6
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 5
- 241000194107 Bacillus megaterium Species 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 5
- 241000589516 Pseudomonas Species 0.000 claims description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 5
- 241000607142 Salmonella Species 0.000 claims description 5
- 241000187747 Streptomyces Species 0.000 claims description 5
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 5
- 210000004962 mammalian cell Anatomy 0.000 claims description 5
- 239000000126 substance Substances 0.000 claims description 4
- 238000000926 separation method Methods 0.000 claims description 3
- 230000002255 enzymatic effect Effects 0.000 claims 2
- 235000018102 proteins Nutrition 0.000 description 87
- 230000003381 solubilizing effect Effects 0.000 description 45
- 239000006228 supernatant Substances 0.000 description 26
- 230000000694 effects Effects 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 19
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 19
- 238000001962 electrophoresis Methods 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 17
- 239000013604 expression vector Substances 0.000 description 17
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 16
- 108010028230 Trp-Ser- His-Pro-Gln-Phe-Glu-Lys Proteins 0.000 description 15
- 230000001580 bacterial effect Effects 0.000 description 15
- 239000012160 loading buffer Substances 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 14
- 108090000790 Enzymes Proteins 0.000 description 14
- 239000002244 precipitate Substances 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 229960005091 chloramphenicol Drugs 0.000 description 12
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 12
- 229930027917 kanamycin Natural products 0.000 description 12
- 229960000318 kanamycin Drugs 0.000 description 12
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 12
- 229930182823 kanamycin A Natural products 0.000 description 12
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 11
- 239000000523 sample Substances 0.000 description 10
- 238000001042 affinity chromatography Methods 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 238000001262 western blot Methods 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 238000013461 design Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000009465 prokaryotic expression Effects 0.000 description 7
- 230000003698 anagen phase Effects 0.000 description 6
- 238000005520 cutting process Methods 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- 238000005421 electrostatic potential Methods 0.000 description 6
- 238000001976 enzyme digestion Methods 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 6
- 102000005720 Glutathione transferase Human genes 0.000 description 5
- 108010070675 Glutathione transferase Proteins 0.000 description 5
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 5
- 230000002776 aggregation Effects 0.000 description 5
- 238000004220 aggregation Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 230000017730 intein-mediated protein splicing Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 4
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 4
- 108010088160 Staphylococcal Protein A Proteins 0.000 description 4
- 241000191940 Staphylococcus Species 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000007796 conventional method Methods 0.000 description 4
- 230000009089 cytolysis Effects 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 210000003000 inclusion body Anatomy 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 230000001131 transforming effect Effects 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 229920004890 Triton X-100 Polymers 0.000 description 3
- 239000013504 Triton X-100 Substances 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 238000009835 boiling Methods 0.000 description 3
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 3
- 239000012141 concentrate Substances 0.000 description 3
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 3
- 238000004042 decolorization Methods 0.000 description 3
- 230000007547 defect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 238000009630 liquid culture Methods 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 3
- 235000019624 protein content Nutrition 0.000 description 3
- 238000001742 protein purification Methods 0.000 description 3
- 230000012743 protein tagging Effects 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 108010077805 Bacterial Proteins Proteins 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108010074124 Escherichia coli Proteins Proteins 0.000 description 2
- 108010088406 Glucagon-Like Peptides Proteins 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- CPELXLSAUQHCOX-UHFFFAOYSA-N Hydrogen bromide Chemical compound Br CPELXLSAUQHCOX-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 239000012880 LB liquid culture medium Substances 0.000 description 2
- 101710098556 Lipase A Proteins 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 101710099648 Lysosomal acid lipase/cholesteryl ester hydrolase Proteins 0.000 description 2
- 102100026001 Lysosomal acid lipase/cholesteryl ester hydrolase Human genes 0.000 description 2
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 2
- 108010076818 TEV protease Proteins 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000002902 bimodal effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 239000008367 deionised water Substances 0.000 description 2
- 229910021641 deionized water Inorganic materials 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000005714 functional activity Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- 230000002934 lysing effect Effects 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 150000007523 nucleic acids Chemical group 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 108020001580 protein domains Proteins 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 101800000263 Acidic protein Proteins 0.000 description 1
- 102000014133 Antimicrobial Cationic Peptides Human genes 0.000 description 1
- 108010050820 Antimicrobial Cationic Peptides Proteins 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 102000006573 Chemokine CXCL12 Human genes 0.000 description 1
- 108010008951 Chemokine CXCL12 Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 1
- 102000012745 Immunoglobulin Subunits Human genes 0.000 description 1
- 108010079585 Immunoglobulin Subunits Proteins 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108700011201 Streptococcus IgG Fc-binding Proteins 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 229940091771 aspergillus fumigatus Drugs 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 229910000042 hydrogen bromide Inorganic materials 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- LTWJSHCKHBKNNY-ATHVYWIXSA-N nrc-03 Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CNC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)[C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(C)=O)C1=CN=CN1 LTWJSHCKHBKNNY-ATHVYWIXSA-N 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 238000011403 purification operation Methods 0.000 description 1
- 239000012521 purified sample Substances 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- WGWPRVFKDLAUQJ-MITYVQBRSA-N sermorelin Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(N)=O)C1=CC=C(O)C=C1 WGWPRVFKDLAUQJ-MITYVQBRSA-N 0.000 description 1
- 229960002758 sermorelin Drugs 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 239000013595 supernatant sample Substances 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000012932 thermodynamic analysis Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000014723 transformation of host cell by virus Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
Definitions
- the invention belongs to the field of protein purification, and in particular relates to an acidic surface solubilizing short peptide label designed by computer-aided calculation.
- E. coli Escherichia coli
- solubilizing proteins such as green fluorescent protein (GFP), maltose binding protein (MBP), glutathione S-transferase (GST), etc.
- GFP green fluorescent protein
- MBP maltose binding protein
- GST glutathione S-transferase
- the present invention uses computer-aided calculation to perform surface design of charge distribution on short stable protein domains, so that the surface contains multiple negative charges, thereby increasing the polarity of the domains themselves and making them more soluble in water, and being able to repel proteins with acidic isoelectric points, thereby reducing aggregation during protein purification, which not only plays an excellent solubilizing role, but also reduces interference with the folding process of the recombinant protein itself.
- the acidic surface solubilizing peptide tag (Sacid) of the present invention is based on the B domain of Staphylococcus protein A [ N et al., A synthetic IgG-binding domain based on staphylococcal protein A, Protein Eng, 1(2): 107-113, 1987].
- This domain is short and stable, containing only 57 amino acid residues and a molecular weight of only 8kD, which is suitable for charge distribution modification and addition to recombinant proteins as fusion tags. It can effectively solubilize recombinant proteins, stabilize the conformation of recombinant proteins during purification, improve the expression efficiency of recombinant proteins, and maintain their biological functional activity.
- the present invention grafts this modified short polypeptide with a surface rich in negative charge onto the target recombinant protein, which is more advantageous than the positively charged solubilizing tags published by other researchers in the early stage in terms of improving the expression efficiency, solubility and biological activity of recombinant proteins.
- the present invention provides an isolated peptide comprising the amino acid sequence VDNKFNKEQQX1AFYEILHLPNLNEEQRNAFIQX2LKDDPX3X4SX5X6X7LX8EAX9X10LNDAQPK ( SEQ ID NO: 9 ), wherein: X1 - X10 are all negatively charged amino acids, preferably glutamic acid (E) or aspartic acid ( D ) .
- the present invention provides an isolated polynucleotide encoding the peptide of the first aspect.
- the present invention provides an isolated fusion protein comprising a first peptide and a second peptide, wherein the first peptide is the peptide of the first aspect and the second peptide is a polypeptide of interest.
- the present invention provides an isolated polynucleotide encoding the fusion protein of the third aspect.
- the present invention provides a construct comprising the polynucleotide of the fourth aspect.
- the present invention provides a host cell comprising the polynucleotide of the fourth aspect or the construct of the fifth aspect, wherein the host cell is capable of expressing the fusion protein.
- the present invention provides a method for producing a fusion protein, comprising: (a) culturing the host cell of the sixth aspect under conditions suitable for expression of the fusion protein; and (b) recovering the fusion protein, optionally, (c) cleaving the fusion protein to release a polypeptide of interest and (d) recovering the polypeptide of interest.
- the present invention provides use of the peptide of the first aspect or the polynucleotide of the second aspect for producing a target protein.
- the present invention provides a method for producing a target protein, comprising: (a) expressing a fusion protein formed by fusion of the target protein and the peptide of the first aspect in a host cell; (b) cutting the fusion protein to release the target protein; and (c) optionally, isolating and/or purifying the target protein.
- Figure 1 Sequence alignment results of amino acid sequences of different short peptide tags.
- Wt corresponds to SEQ ID NO: 4
- Zbasic corresponds to SEQ ID NO: 7
- Sacid corresponds to SEQ ID NO: 1
- Sacid1 corresponds to SEQ ID NO: 2
- Sacid2 corresponds to SEQ ID NO: 3.
- Figure 2 Electrostatic potential model of different short peptide tags at physiological pH
- Figure A is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of the B domain of Staphylococcal protein A at physiological pH
- Figure B is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Zbasic at physiological pH
- Figure C is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid at physiological pH
- Figure D is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid 1 at physiological pH
- Figure E is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid 2 at physiological pH.
- Figure 3 Solubility and isoelectric point predicted by computer-aided prediction (URL: https://protein-sol.manchester.ac.uk/). It predicts protein solubility based on sequence. Based on the observation of a bimodal distribution of solubility of cell-free expressed Escherichia coli proteins [Niwa T et al., Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins, Proceedings of the National Academy of Sciences of the United States of America, 2009, 106(11): 4201-4206], these measurements report the ratio of the soluble fraction of a protein (in the supernatant after centrifugation) to the total amount of the protein, rather than a thermodynamic property.
- a linear model combining 35 features gives a preliminary fit to the solubility data. The return value is between 0-1. If it is higher than 0.45 (the average value of the E. coli expressed protein data set Niwa et al (2009)), it means that the solubility of the protein may be higher than the average solubility of the E. coli expressed protein.
- Figures A and B show the solubility of different solubility-enhancing tags.
- the self-solubility of the acidic short peptide solubility tag (Sacid) of the present invention and the wild-type (WT) tag before modification is not much different, and is even slightly lower than the SUMO tag ( Figure A)
- its charged groups are concentrated on the surface, after fusion with the recombinant protein, it can interact with the positively charged protons in the water more effectively than other tags to form a hydration film, thereby helping the recombinant protein to dissolve. Therefore, after the Sacid tag is fused with the target protein, the solubility of the target protein can be significantly improved (Figure B).
- the acidic short peptide solubility tag (Sacid) of the present invention has a higher self-solubility than the known published positively charged tag (Zbasic) ( Figure A). Both charges are distributed on the surface, but because the negative charge on the acidic surface can repel each other with more negative charges on the surface of most weakly acidic recombinant proteins, it is not easy to aggregate, and the solubility effect is better than that of the alkaline solubility peptide tag. Therefore, after fusion with the target protein, the solubility effect is significantly better than the positively charged tag (Zbasic) ( Figure B). Similarly, compared with other commonly known solubilizing tags, the acidic short peptide solubilizing tag (Sacid) of the present invention also has a significantly better solubilizing effect.
- Figures C and D show the isoelectric points of different solubilizing tags. It can be seen from the figure that the acidic short peptide solubilizing tag of the present invention exhibits better acidity. After fusion with the target protein, the pI of the fusion protein still exhibits a strong acidity.
- FIG4 is a schematic diagram of the protein expression and purification technology used in the present invention.
- FIG. 5 Electrophoretic identification of target proteins purified using different purification tags.
- Figure A shows the situation when NsiI protein is expressed alone.
- M protein molecular weight standard
- F Strep-tag II purification flow-through
- W1, W2 Strep-tag II purification wash solution
- E Strep-tag II purification eluate.
- the arrow points to the target protein. It can be seen from the figure that when NsiI protein is expressed alone, its expression level is not high because it is prone to aggregation and precipitation before the protein is correctly folded. After purification, the target protein is significantly reduced or even completely lost.
- Figure B shows the soluble expression of NeonGreen-NsiI fusion protein using NeonGreen as a solubilizing peptide to express NsiI.
- M protein molecular weight standard
- F Strep-tag II purification flow-through
- W1, W2 Strep-tag II purification wash solution
- E Strep-tag II purification eluate.
- the arrow points to the fusion protein, indicating that although the use of NeonGreen as a solubilizing protein fusion expression method can also achieve the purification and enrichment of the fusion protein, due to its large molecular weight, it still has defects such as long folding time, limited solubilization effect and easy interference with the activity of the recombinant protein. It can be seen from the figure that most of the protein exists in insoluble inclusion bodies and the fusion protein is easily degraded.
- Figure C shows the electrophoresis identification of the fusion protein purified by Strep-tag II affinity column chromatography using a positively charged solubilizing tag (Zbasic) published by other researchers as a purification tag.
- M protein molecular weight standard
- F Strep-tag II purification flow-through
- W1, W2 Strep-tag II purification wash solution
- E Strep-tag II purification eluate.
- the arrow points to the fusion protein.
- the published positively charged solubilizing tag (Zbasic) has a lower yield; this indirectly shows that the Sacid tag of the present invention can efficiently solubilize the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, and improve the expression efficiency of the recombinant protein.
- Figure D shows the electrophoresis identification of the fusion protein purified by Strep-tag II affinity column chromatography using Sacid as the purification tag using the technology of the present invention.
- M protein molecular weight standard
- F Strep-tag II purification flow-through
- W1 Strep-tag II purification wash solution
- E Strep-tag II purification eluate.
- the arrow points to the fusion protein.
- the Sacid tag significantly improves the solubility of the fusion protein, and the target protein is obtained after Strep-tag II purification.
- Figure A is the SDS PAGE of TEVP protein expressed using Sacid and NeonGreen as solubilizing peptides.
- M protein molecular weight standard
- ST Sacid-TEVP fusion protein
- NT NeonGreen-TEVP fusion protein.
- the arrow points to the fusion protein. From the figure, only a thick band at 39KD of Sacid-TEVP can be observed, while the target band cannot be clearly seen when NeonGreen is used as a solubilizing protein fusion expression method.
- Figure B is a Western blot of TEVP expressed using Sacid and NeonGreen as solubilizing peptides.
- M protein molecular weight standard
- ST Sacid-TEVP fusion protein
- NT NeonGreen-TEVP fusion protein.
- the arrow points to the fusion protein. It can be seen from the figure that when Sacid is used as a solubilizing peptide, the Western blot results show a clear band at 39KD, but when NeonGreen is used as a solubilizing peptide, no target band is observed in the Western blot, which is basically consistent with the SDS PAGE result of Figure A. It shows that after the fusion expression of the present invention, the expression amount of the target protein has been greatly improved; compared with other tags, the Sacid tag of the present invention significantly improves the solubility of the fusion protein.
- Figure 7 Activity test of purified protein, using a plasmid containing a single NsiI restriction site and a single HindIII restriction site as a substrate, after double restriction digestion reaction, 3392bp and 1987bp target bands were obtained.
- the purified NsiI enzyme was diluted in a gradient (10 1 to 10 8 times) and then subjected to double restriction digestion reaction (37°C, 1 hour).
- Figure A shows the double restriction digestion identification of the NsiI protein expressed alone and the NeonGreen-NsiI protein
- Figure B shows the double restriction digestion identification of the NsiI protein expressed by the positively charged solubilizing tag (Zbasic) and the Sacid-NsiI protein of the present invention.
- the NsiI protein expressed alone is almost inactive; the activities of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the positively charged solubility tag (Zbasic) are basically the same; and by using the technology of the present invention with Sacid as the purification tag, the activity of the purified protein Sacid-NsiI is greatly improved, which is higher than the activity of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the positively charged solubility tag (Zbasic).
- Figure 8 Specific activities of enzymes purified in different ways (one unit is defined as the amount of enzyme required to digest 1 ⁇ g of plasmid DNA containing a single NsiI restriction site in 50 ⁇ l of reaction buffer at 37°C for 1 hour).
- nucleic acid sequences are referred to herein in a 5' to 3' direction from left to right; amino acid sequences are referred to herein in a 5' to 3' direction from left (upstream) to right (downstream).
- the present invention is based on the amino acid sequence shown in the B domain of staphylococcal protein A, uses computer-assisted calculations to transform the C-terminal domain of the B structure of natural staphylococcal protein A, introduces negatively charged amino acids, concentrates the negatively charged amino acids on one side, and designs an acidic surface solubilizing short peptide tag (Sacid).
- the present invention uses computer-assisted calculations to perform surface design of charge distribution on short stable protein domains, so that the surface contains multiple negative charges, and designs several acidic short peptide tags, namely Sacid (SEQ ID NO: 1), Sacid1 (SEQ ID NO: 2), and Sacid2 (SEQ ID NO: 3). It is found that the charge of the Sacid tag is relatively concentrated, more acidic than other designed tags, and can repel proteins with low isoelectric points, which can reduce aggregation during protein purification and play an excellent solubilizing role.
- Sacid SEQ ID NO: 1
- Sacid1 SEQ ID NO: 2
- Sacid2 SEQ ID NO: 3
- This tag is fused with the target protein, and it is found that the tag can efficiently solubilize the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, improve the expression efficiency of the recombinant protein, and maintain its biological functional activity.
- the method fuses the acidic surface solubilizing short peptide tag to be studied with the target protein, and quickly purifies the target protein through affinity chromatography.
- the present invention provides an isolated peptide comprising the amino acid sequence VDNKFNKEQQX1AFYEILHLPNLNEEQRNAFIQX2LKDDPX3X4SX5X6X7LX8EAX9X10LNDAQPK ( SEQ ID NO: 9 ), wherein: X1 - X10 are all negatively charged amino acids, preferably glutamic acid (E) or aspartic acid ( D ) .
- X1 is D. In one embodiment, X2 is E. In one embodiment, X3 is E. In one embodiment, X4 is E. In one embodiment, X5 is D. In one embodiment, X6 is E. In one embodiment, X7 is E. In one embodiment, X8 is E. In one embodiment, X9 is D. In one embodiment, X10 is D.
- the isolated peptide provided by the present invention is an acidic surface solubilizing short peptide capable of improving the expression efficiency of recombinant proteins. It is based on the amino acid sequence of the B domain of Staphylococcus protein A (SEQ ID NO: 4), introduces negatively charged amino acids and concentrates the negatively charged amino acids on the surface.
- the protein solubility is predicted based on the sequence by computer-assisted prediction (see https://protein-sol.manchester.ac.uk/).
- the linear model combining the 35 features gives a preliminary fit to the solubility data.
- the predicted return value is between 0 and 1. If it is higher than 0.45 (the average value of the E. coli expressed protein dataset Niwa et al (2009)), it means that the solubility of the protein may be higher than the average solubility of the E. coli expressed protein.
- the predicted solubility of the peptide shown in SEQ ID NO: 9 is greater than 0.45, as predicted based on https://protein-sol.manchester.ac.uk/. In one embodiment, the predicted solubility of the peptide shown in SEQ ID NO: 9 is equal to or greater than 0.70, as predicted based on https://protein-sol.manchester.ac.uk/.
- the isoelectric point pI of the soluble peptide of SEQ ID NO: 9 is equal to or less than 5.0.
- the peptide comprises or consists of the amino acid sequence shown in SEQ ID NO: 1.
- peptide As used herein, the terms “peptide,” “polypeptide,” and “protein” are used interchangeably and are defined as biological molecules composed of amino acid residues linked by peptide bonds.
- the present invention relates to a polynucleotide encoding the peptide of the first aspect.
- nucleotide sequence As used herein, the terms “nucleotide sequence”, “polynucleotide”, “nucleic acid” and “nucleic acid sequence” are used interchangeably and refer to a macromolecule composed of multiple nucleotides connected by 3'-5'-phosphodiester bonds, wherein the nucleotides include ribonucleotides and deoxyribonucleotides.
- the sequence of the polynucleotide of the present invention can be codon-optimized for different host cells (such as Escherichia coli) to improve the expression of the fusion protein. Methods for codon optimization are known in the art.
- the present invention relates to an isolated fusion protein comprising a first peptide and a second peptide, wherein the first peptide is a peptide according to the first aspect of the present invention and the second peptide is a polypeptide of interest.
- target polypeptide refers to any polypeptide or protein that can be produced and purified by the method of the present invention, non-limiting examples of which include enzymes, hormones, immunoglobulin chains, therapeutic polypeptides such as anti-cancer polypeptides, diagnostic polypeptides, or polypeptides that can be used for immunization purposes or biologically active fragments thereof, etc.
- the target polypeptide can be derived from any source, including microbial-derived polypeptides, mammalian-derived polypeptides, and artificial proteins (e.g., fusion proteins or mutant proteins), etc.
- the target polypeptide can be a polypeptide or protein of any length.
- the target polypeptide that can be produced and purified by the method of the present invention can have a length of 20-500 amino acid residues, for example, about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450, 500 amino acid residues.
- the polypeptide of interest described herein has an acidic, neutral or weakly basic isoelectric point lower than, equal to or slightly greater than 7.0, such as equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0.
- the first peptide can be located upstream (N-terminal) or downstream (C-terminal) of the second peptide, preferably located upstream (N-terminal). In some embodiments, the first peptide is located upstream of the second peptide. In some embodiments, the first peptide is located downstream of the second peptide.
- a first peptide being located "upstream" of a second peptide means that the C-terminal residue of the first peptide is located before the N-terminal residue of the second peptide; a first peptide being located "downstream" of a second peptide means that the N-terminal residue of the first peptide is located after the C-terminal residue of the second peptide.
- the production and purification of the target polypeptide of the present invention can be achieved under milder pH conditions (e.g., pH 7-11). There are no special restrictions on the type and properties of the target polypeptide. It can be used for the expression and purification of a variety of different polypeptides, and the final output and yield of the target polypeptide are high.
- target polypeptides that can be produced and purified by the method of the present invention include, but are not limited to, Bacillus subtilis lipase A (LipA), green fluorescent protein (GFP) and Aspergillus fumigatus type II ketoamine oxidase (AMA), glucagon-like peptide (GLP-1), stromal cell-derived factor (SDF-1 ⁇ ), sermorelin, pleurocidin-like cationic antimicrobial peptide NRC-03 (PNRC03) and Hinnavin II-Melanocyte (HM) or their biologically active fragments.
- LipA Bacillus subtilis lipase A
- GFP green fluorescent protein
- AMA Aspergillus fumigatus type II ketoamine oxidase
- GLP-1 glucagon-like peptide
- SDF-1 ⁇ stromal cell-derived factor
- sermorelin sermorelin
- pleurocidin-like cationic antimicrobial peptide NRC-03
- the target polypeptide is NsiI protein, for example comprising the amino acid sequence shown in SEQ ID NO: 5.
- the target polypeptide is a TEVP protein, for example comprising the amino acid sequence shown in SEQ ID NO: 8.
- the first peptide and the second peptide in the fusion protein of the present invention are connected by a spacer.
- spacer refers to a peptide of a certain length composed of amino acids with low hydrophobicity and low charge effect, which, when used in a fusion protein, can allow the connected parts to fully unfold and fully fold into their respective natural conformations without interfering with each other.
- Commonly used spacers in the art include, for example, flexible GS-type linkers rich in glycine (G) and serine (S); and rigid PT-type linkers rich in proline (P) and threonine (T). Since GS-type linkers have a more suitable amino acid length, are hydrophobic and ductile, and can make the functional protein have better stability and biological activity, GS-type linkers are preferably used in the present invention.
- the polypeptide produced by recombinant production needs to have a consistent sequence with the target polypeptide, i.e., no additional amino acid residues at both ends.
- the spacer in the fusion protein of the present invention also includes a cleavage site. The cleavage of the cleavage site can release the target polypeptide from the fusion protein.
- Suitable cleavage sites include cleavage sites that can be chemically cleaved, enzymatically cleaved or self-cleaved, or any other cleavage sites known to those skilled in the art.
- Preferred cleavage sites in the present invention can be self-cleaved, for example, they contain the amino acid sequence of a self-cleavable intein. This is because the cleavage method based on intein does not require the addition of enzymes or the use of harmful substances such as hydrogen bromide used in chemical methods, but only requires changing the buffer environment in which the aggregate is located to simply induce cleavage.
- a variety of self-cleaving inteins are known in the art, such as a series of inteins with different self-cleavage properties from NEB.
- the spacer sequence may contain a specific sequence that can be specifically chemically cleaved or biologically enzymatically hydrolyzed, for example, a Met (methionine) residue can be chemically cleaved by CNBr, a Lys (lysine) or Arg (arginine) residue can be cleaved by trypsin, LysArg or ArgArg can be cleaved by a two-base protease such as Kex2, GluAsnLeuTyrPheGln can be recognized and cleaved by tobacco etch virus protease (TEV protease, TEVP), IleGluGlyArg can be recognized and cleaved by factor Xa (Xa protease), as well as other suitable protease cleavage sites or inteins.
- a Met (methionine) residue can be chemically cleaved by CNBr
- the fusion protein may further comprise a portion connected to the second peptide (target polypeptide) for separating and purifying the second peptide, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc.
- a portion connected to the second peptide for separating and purifying the second peptide, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc.
- the fusion protein comprises the peptide described in the first aspect of the present invention, a spacer, a polypeptide of interest, and a portion for separating and purifying a second peptide.
- the present invention relates to an isolated polynucleotide comprising a nucleotide sequence encoding said fusion protein, and in a fifth aspect, the present invention relates to a construct, in particular an expression construct, comprising the polynucleotide of the fourth aspect.
- the sequence of the polynucleotide encoding the fusion protein is operably linked to an expression control sequence to carry out the desired transcription and ultimately produce the fusion protein in a host cell.
- Suitable expression control sequences include, but are not limited to, promoters, enhancers, ribosome action sites such as ribosome binding sites, polyadenylation sites, transcriptional splicing sequences, transcriptional termination sequences, and sequences that stabilize mRNA, etc.
- Vectors used to construct the expression constructs of the present invention include those that replicate autonomously in host cells, such as plasmid vectors; and also include vectors that can be integrated into host cell DNA and replicated with the host cell DNA. Many vectors suitable for the present invention are commercially available.
- the plasmid is a plasmid suitable for a prokaryotic or eukaryotic expression system.
- the plasmid is or is derived from pET28a.
- the present invention relates to a host cell capable of expressing the fusion protein of the third aspect of the present invention.
- the host cell contains a polynucleotide of the fourth aspect of the present invention or a construct of the fifth aspect of the present invention, such as an expression construct, wherein the host cell is capable of expressing the fusion protein.
- Host cells for expressing the fusion protein of the present invention include prokaryotes, yeast and higher eukaryotic cells.
- Exemplary prokaryotic hosts include bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces.
- the host cell is an Escherichia cell, a mammalian cell or an insect cell.
- the host cell is Escherichia coli, Bacillus subtilis or Bacillus megaterium.
- the host cell used is an Escherichia coli Rosetta (DE3) strain cell.
- a polynucleotide or construct of the invention can be introduced into a host cell to express the encoded amino acid sequence by one of many well-known techniques, including but not limited to heat shock transformation, electroporation, DEAE-dextran transfection, microinjection, liposome-mediated transfection, calcium phosphate precipitation, protoplast fusion, microprojectile bombardment, viral transformation and the like.
- the polynucleotide encoding the fusion protein can be integrated into the genome of a host cell, which can express the encoded fusion protein under appropriate conditions or constitutively express the encoded fusion protein.
- the polynucleotide encoding the fusion protein is present in the host cell in an extrachromosomal form (eg, a plasmid or a construct such as an expression vector).
- an extrachromosomal form eg, a plasmid or a construct such as an expression vector.
- the present invention also relates to a method for preparing the fusion protein of the third aspect of the present invention, comprising: culturing the host cell of the sixth aspect of the present invention under conditions suitable for expression of the fusion protein; and recovering the fusion protein, optionally, by cutting the fusion protein to release the target polypeptide and recovering the target polypeptide.
- the method comprises: transforming a host cell with a construct, in particular an expression construct, of the fifth aspect of the invention, culturing the transformed host cell under conditions suitable for expression of the fusion protein and recovering the fusion protein, optionally, by cutting the fusion protein to release the target polypeptide and recovering the target polypeptide.
- the conditions for expressing fusion proteins are known in the art, such as temperature, pH, culture medium, etc. Any suitable method for recovering fusion proteins is known in the art, including but not limited to, for example, by chromatography, centrifugation, dialysis, etc.
- the recovery of the fusion protein can be carried out by any suitable method known in the art. For example, after the fusion protein is expressed, the host cells are collected, the cells are lysed, the supernatant is collected (e.g., by centrifugation to remove cell debris), and the fusion protein is optionally separated (e.g., by a specific tag or a specific binding molecule such as an antibody).
- the method comprises: transforming host cells with the construct of the fifth aspect of the invention, in particular the expression construct, culturing the transformed host cells under conditions suitable for expression of the fusion protein, collecting and lysing the cells, collecting the supernatant, and optionally isolating the fusion protein.
- a spacer sequence is included between the short peptide tag of the fusion protein and the target protein, and the spacer sequence contains a cleavage site (such as the cleavage site described herein) that can be cut, thereby removing the short peptide tag by cutting the cleavage site, thereby releasing the target protein.
- a cleavage site such as the cleavage site described herein
- the present invention provides use of the peptide of the first aspect or the polynucleotide of the second aspect for preparing a target protein in a host cell.
- the peptide of the first aspect of the present invention can form a fusion protein with the target protein, so that after being expressed in a host cell, the solubility of the fusion protein can be increased, the expression efficiency of the fusion protein can be improved, and the biological activity of the target protein can be maintained.
- the polynucleotide of the second aspect can be linked to a polynucleotide encoding a target protein in frame and placed under the control of a suitable expression regulatory element (eg, a promoter) to express the desired fusion protein in a suitable expression system.
- a suitable expression regulatory element eg, a promoter
- the present invention provides a method for producing a target protein, comprising expressing in a host cell a fusion protein formed by fusion of the peptide described in the first aspect of the present invention and the target protein, such as the fusion protein described in the third aspect of the present invention, recovering the fusion protein, cutting the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
- the method comprises: constructing an expression construct (e.g., a plasmid or a viral vector) comprising a nucleotide sequence encoding a fusion protein formed by fusion of the peptide described in the first aspect of the present invention and a target protein, transforming a host cell with the expression construct, culturing the transformed host cell under conditions suitable for expression of the fusion protein, recovering the fusion protein, cleaving the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
- an expression construct e.g., a plasmid or a viral vector
- the recovery of the fusion protein can be carried out by any suitable method known in the art. For example, after the fusion protein is expressed, the host cells are collected, the cells are lysed, the supernatant is collected (e.g., by centrifugation to remove cell debris), and the fusion protein is optionally separated (e.g., by a specific tag or a specific binding molecule such as an antibody).
- the method comprises: transforming a host cell with the construct of the fifth aspect of the invention, in particular an expression construct, culturing the transformed host cell under conditions suitable for expression of the fusion protein, collecting and lysing the cells, collecting the supernatant, cutting the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
- cleavage of the fusion protein can be performed by any suitable means known in the art, such as including a spacer sequence between the short peptide tag and the target protein, wherein the spacer sequence comprises a cleavage site that can be cleaved (such as the cleavage site described herein), thereby removing the short peptide tag by cleaving the fusion protein, thereby releasing the target protein.
- a spacer sequence between the short peptide tag and the target protein wherein the spacer sequence comprises a cleavage site that can be cleaved (such as the cleavage site described herein), thereby removing the short peptide tag by cleaving the fusion protein, thereby releasing the target protein.
- the fusion protein may further comprise a portion connected to the target protein for separation and purification purposes, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc.
- a portion connected to the target protein for separation and purification purposes such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc.
- the method comprises constructing an expression construct comprising a nucleotide sequence encoding the peptide described in the first aspect of the invention, a nucleotide sequence encoding a spacer sequence, a nucleotide sequence encoding a target protein, and a nucleotide sequence encoding a portion for isolating and purifying a second peptide.
- the host cell is selected from prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3).
- prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3).
- the target protein has an acidic, neutral or weakly alkaline isoelectric point lower than, equal to or slightly greater than 7.0, for example, equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0, and more preferably is an NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
- the present invention has the following beneficial effects:
- the present invention can overcome the disadvantage that the target protein is easy to form inclusion bodies in the prokaryotic expression system, can effectively dissolve the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, improve the expression efficiency of the recombinant protein, maintain its biological function activity, and can be widely used in the fermentation production field of insoluble proteins;
- the acidic short peptide solubility tag of the present invention can replace the solubility tags such as GST, MBP, and NeonGreen. It is shorter and smaller than the commonly known solubility tags, and has a stable structure. It contains only 57 amino acid residues and a molecular weight of only 8kD. It has less potential impact on the recombinant protein when fused with the recombinant protein;
- the acidic solubilizing tag (Sacid) of the present invention is mutually repelled by the acidic proteins widely present in the body, and is less likely to cause aggregation effects, thereby being more conducive to the dissolution and folding of recombinant proteins, and having more advantages in improving the solubility, expression efficiency and biological activity of recombinant proteins;
- the tag of the present invention can also obtain high-purity target protein in affinity purification applications, indicating that the modification has no effect on affinity purification.
- step means that the step exists or does not exist.
- the term “about” refers to a numerical range that includes a specific value that a person skilled in the art would reasonably consider to be similar to the specific value. In some embodiments, the term “about” refers to within the standard error of measurement using commonly accepted measurements in the art. In some embodiments, approximately refers to +/-10% of a specific value.
- an acidic surface solubilizing peptide tag (Sacid, SEQ ID NO: 1) was designed using computer-aided calculations (URL: https://www.expasy.org/resources/swiss-model), which introduced negatively charged amino acids to concentrate them on the surface. This tag is only a form of predicted acidic tag.
- Swiss-Model is a tool for predicting protein structure models. It uses homology modeling to predict the tertiary structure of an unknown sequence.
- the target protein encoding gene sequence was fused with the acidic surface solubilizing short peptide tag (Sacid) encoding gene sequence to obtain the target protein expression unit (wherein the Sacid amino acid sequence is located upstream of the target protein amino acid sequence), and the target protein expression unit was inserted into the expression plasmid pET-28a (Novagen, USA) to obtain the target protein expression vector pET-28a/Sacid-NsiI by artificial synthesis.
- Sacid acidic surface solubilizing short peptide tag
- the NsiI nucleotide sequence (encoding amino acid sequence SEQ ID NO: 5) was inserted into the prokaryotic expression vector pET28a to obtain the expression vector pET28a/NsiI. After the vector construction was verified to be correct by sequencing, the plasmid was extracted by alkaline lysis and transformed into the expression host bacteria Rosetta (DE3) (Tiangen Biochemical Technology (Beijing) Co., Ltd.).
- a single colony of the expression host bacteria containing the recombinant plasmid pET28a/NsiI was inoculated in LB liquid culture containing kanamycin (50 ⁇ g/mL) and chloramphenicol (34 ⁇ g/mL) and cultured at 37°C, 220rpm for 16 hours as seed bacteria.
- the amino acid sequence of NeonGreen used in this experiment is shown in SEQ ID NO: 6.
- NeonGreen and NsiI SEQ ID NO: 5
- the prokaryotic expression vector pET28a was connected together through artificial synthesis (where the NeonGreen amino acid sequence was located upstream of the NsiI protein amino acid sequence) to obtain the target protein expression vector pET-28a/NeonGreen-NsiI.
- the recombinant plasmid pET-28a/NeonGreen-NsiI was transformed into the expression host bacteria Rosetta (DE3), and a single colony of the expression host bacteria was picked and inoculated into LB liquid culture containing kanamycin (50 ⁇ g/mL) and chloramphenicol (34 ⁇ g/mL), and cultured at 37°C and 220 rpm for 12 hours as seed bacteria.
- NeonGreen is used as a solubilizing peptide, and the expression of the NeonGreen-NsiI fusion protein has a molecular weight of about 70KD.
- the arrow points to the fusion protein.
- NeonGreen as a solubilizing protein fusion expression method can also achieve the purification and enrichment of the fusion protein, due to its large molecular weight, it still has defects such as long folding time, limited solubilizing effect and easy interference with the activity of the recombinant protein. It can be seen from the figure that most of the protein exists in the form of insoluble inclusion bodies and the fusion protein is easily degraded.
- the amino acid sequence of the published positively charged solubilizing tag Zbasic [Hedhammar M, Hober S, Z(basic)--a novel purification tag for efficient protein recovery, Journal of Chromatography A, 2007, 1161(1-2):22-28] is shown as SEQ ID NO: 7.
- the Zbasic coding sequence and the NsiI coding sequence were connected to obtain a fusion protein expression vector pET-28a/Zbasic-NsiI, wherein the Zbasic amino acid sequence is located upstream of the NsiI amino acid sequence.
- the plasmid was transformed into the expression host bacteria Rosetta (DE3) to obtain the target engineered strain.
- the supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C.
- the target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C as the target protein sample.
- the above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
- SDS-PAGE gel was prepared according to the conventional method.
- the prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE.
- the electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
- Figure 5 C shows the NsiI protein expressed using a published positively charged solubilizing tag (Zbasic), with a molecular weight of about 45 kDa, and the arrow indicates the target protein band.
- Zbasic published positively charged solubilizing tag
- the plasmid was transformed into the expression host bacteria Rosetta (DE3) (purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.) to obtain the target engineered strain.
- Rosetta purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.
- the supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C.
- the target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C as the target protein sample.
- the above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
- SDS-PAGE gel was prepared according to the conventional method.
- the prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE.
- the electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
- Figure 5D shows the fusion expression of Sacid-NsiI using the method of the present invention, with a molecular weight of about 45 kDa.
- Example 7 Using NeonGreen as a solubilizing peptide and fusion expression with TEVP
- the amino acid sequence of NeonGreen used in this experiment is shown in SEQ ID NO: 6.
- NeonGreen and TEVP SEQ ID NO: 8
- the prokaryotic expression vector pET28a were connected together through artificial synthesis (where the NeonGreen amino acid sequence was located upstream of the TEVP protein amino acid sequence) to obtain the target protein expression vector pET-28a/NeonGreen-TEVP.
- the recombinant plasmid pET-28a/NeonGreen-TEVP was transformed into the expression host bacteria Rosetta (DE3), and a single colony of the expression host bacteria was picked and inoculated into LB liquid culture containing kanamycin (50 ⁇ g/mL) and chloramphenicol (34 ⁇ g/mL), and cultured at 37°C and 220 rpm for 12 hours as seed bacteria.
- NeonGreen was used as the solubilizing peptide to express the NeonGreen-TEVP fusion protein in the supernatant.
- the NT molecular weight was about 70KD, and the arrow pointed to the fusion protein.
- the target bands could not be clearly distinguished from the SDS PAGE in Figure A.
- the proteins were subjected to Western blotting.
- the results are shown in Figure 6B, NT. From the results of Western blotting, it can be seen that the NeonGreen-TEVP fusion protein was not expressed.
- Example 8 Fusion expression of Sacid and target protein TEVP
- the genes of Sacid SEQ ID NO: 1
- TEVP SEQ ID NO: 8
- the prokaryotic expression vector pET28a were connected together through artificial synthesis (where the Sacid amino acid sequence was located upstream of the TEVP protein amino acid sequence) to obtain the recombinant expression plasmid pET-28a/Sacid-TEVP.
- the plasmid was transformed into the expression host bacteria Rosetta (DE3) (purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.) to obtain the target engineered strain.
- Rosetta purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.
- the supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C.
- the target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10 ⁇ L of 5 ⁇ SDS loading buffer, boiled in water bath for 10min, and stored at -20°C as the target protein sample.
- the above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
- SDS-PAGE gel was prepared according to the conventional method.
- the prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE.
- the electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
- the proteins on the gel were wet transferred to a PVDF membrane, blocked with 5% skim milk powder for 2 h, incubated with primary antibody at 4°C overnight, incubated with HRP-labeled secondary antibody at room temperature for 1 h, and exposed using a chemiluminescence kit and GelView 6000Plus equipment.
- the target band is shown in ST in Figure 6B.
- Lane ST in Figures 6A and B is Sacid-TEVP fusion expressed by the method of the present invention, with a molecular weight of about 39 kDa.
- the arrow indicates the position where there is an obvious target protein band in the supernatant sample and the Western blotting results are consistent with it.
- the expression level of the target protein is greatly improved after the fusion expression of the present invention; compared with other tags, the Sacid tag of the present invention significantly improves the solubility of the fusion protein.
- the obtained protein eluate was subjected to ultrafiltration to replace the buffer, and the protein was stored in Tris HCl buffer (20mM Tris HCl, 20mM NaCl, 50% glycerol), and the protein activity was verified by enzyme digestion of the plasmid.
- the plasmid containing a single NsiI restriction site and a single HindIII restriction site was used as a substrate for double restriction digestion.
- the purified NsiI enzyme was diluted in a gradient (10 1 to 10 8 times) and then subjected to double restriction digestion reaction. Each dilution was used as a sample.
- the enzymes were digested for 1 hour at 37°C and then subjected to agarose gel electrophoresis.
- the enzyme and plasmid dosages were the same and comparable.
- Figure 7 shows the results of Figure 7. Using a plasmid containing a single NsiI restriction site and a single HindIII restriction site as a substrate, after double enzyme digestion reaction, target bands of 3392bp and 1987bp were obtained.
- Figure A shows the double enzyme digestion identification of the NsiI protein expressed alone and the NeonGreen-NsiI protein;
- Figure B shows the double enzyme digestion identification of the NsiI protein expressed by the published positive charge solubility tag (Zbasic) and the Sacid-NsiI protein of the present invention.
- the NsiI protein expressed alone is almost inactive; the activities of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the published positive charge solubility tag (Zbasic) are basically the same; and the activity of the purified protein Sacid-NsiI is greatly improved by using the technology of the present invention with Sacid as the purification tag, which is higher than the activity of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the published positive charge solubility tag (Zbasic).
- the technology of the present invention uses Sacid as a purification tag, and the enzyme activity of the purified protein Sacid-NsiI is 496KU/mg, the enzyme activity of the NsiI protein expressed by the known positively charged solubilizing tag (Zbasic) is 5.5KU/mg, and the enzyme activity of the NeonGreen-NsiI fusion protein is 4.9KU/mg.
- the NsiI protein expressed alone is almost inactive and has the problem of contamination by foreign proteins.
- the technology of the present invention uses Sacid as a purification tag, and the enzyme activity of the purified protein Sacid-NsiI is about 100 times that of the enzymes purified by the other two known solubilizing tags.
- SEQ ID NO: 4 (B domain of Staphylococcus protein A, Wt)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Provided is an isolated peptide having an amino acid sequence of VDNKFNKEQQX1AFYEILHLPNLNEEQRNAFIQX2LKDDPX3X4SX5X6X7LX8EAX9X 10LNDAQPK (SEQ ID NO: 9), wherein X1-X10 are all negatively charged amino acids, preferably E or D. Further provided are a fusion protein comprising the peptide, a polynucleotide encoding the peptide or fusion protein, a construct and host cell comprising the polynucleotide, and a method for producing the fusion protein or the polypeptide of interest.
Description
本发明属于蛋白纯化领域,具体涉及了一种利用计算机辅助计算设计的酸性表面助溶短肽标签。The invention belongs to the field of protein purification, and in particular relates to an acidic surface solubilizing short peptide label designed by computer-aided calculation.
近年来随着生物技术的不断发展,重组蛋白表达和纯化技术一直受到人们的重视和广泛应用,其中原核表达成本最低,最易于操作和纯化效率最高。大肠杆菌(E.coli)是重组蛋白生产最常用的宿主生物之一。In recent years, with the continuous development of biotechnology, recombinant protein expression and purification technology has been valued and widely used. Among them, prokaryotic expression has the lowest cost, is the easiest to operate and has the highest purification efficiency. Escherichia coli (E. coli) is one of the most commonly used host organisms for recombinant protein production.
很多富含疏水氨基酸残基的重组蛋白,尤其一些来自非原核生物的重组蛋白,由于原核细胞内部缺乏相应的蛋白折叠和保护机制,常常面临难溶解和大量进入包涵体的问题。这些在表达过程中发生沉淀的重组蛋白即便在重新溶解后也存在大量失活的现象。Many recombinant proteins rich in hydrophobic amino acid residues, especially those from non-prokaryotes, often face the problem of being difficult to dissolve and entering inclusion bodies in large quantities due to the lack of corresponding protein folding and protection mechanisms in prokaryotic cells. These recombinant proteins that precipitate during expression are largely inactivated even after re-dissolving.
为达到增加表达量,提高产物的正确构象比例及可溶性,方便下游纯化操作,人们采用助溶蛋白融合表达的方式,应用如绿色荧光蛋白(GFP)、麦芽糖结合蛋白(MBP)、谷胱甘肽硫转移酶(GST)等。然而,由于这些助溶蛋白自身分子量较大,仍然存在折叠时间较长,助溶效果有限和易干扰重组蛋白活性等缺陷。因此,本领域需要设计和开发新的助溶肽分子助力重组蛋白的生产纯化。In order to increase the expression amount, improve the correct conformation ratio and solubility of the product, and facilitate downstream purification operations, people use the method of fusion expression of solubilizing proteins, such as green fluorescent protein (GFP), maltose binding protein (MBP), glutathione S-transferase (GST), etc. However, due to the large molecular weight of these solubilizing proteins themselves, there are still defects such as long folding time, limited solubilizing effect and easy interference with the activity of recombinant proteins. Therefore, the art needs to design and develop new solubilizing peptide molecules to assist the production and purification of recombinant proteins.
发明内容Summary of the invention
由于大多数蛋白质具有偏酸性或中性的等电点[Link A J et al.,Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K-12,ELECTROPHORESIS,1997,18(8):1259-1313],因此本发明通过计算机辅助计算,对短小的稳定蛋白结构域进行电荷分布的表面化设计,使之表面包含多个负电荷,使之本身极性提高更容易溶于水,并能够与具有偏酸性等电点的蛋白质发生互斥作用,可使蛋白纯化过程中减少聚集,不仅起到优越的助溶作用,也减少了对重组蛋白自身折叠过程的干扰。细菌的一些受体蛋白的溶剂暴露表面,特别是那些含有螺旋束结构的受体表面,被证明是非常稳定的[Alexander P et al.,Thermodynamic analysis of the folding of the streptococcal protein G IgG-binding domains B1 and B2:why small proteins tend to have high denaturation temperatures,Biochemistry,1992.31(14):3597-3603]。可以利用这些稳定的溶剂暴露表面区域的区域来设计稳定的助溶肽。Since most proteins have acidic or neutral isoelectric points [Link A J et al., Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K-12, ELECTROPHORESIS, 1997, 18(8):1259-1313], the present invention uses computer-aided calculation to perform surface design of charge distribution on short stable protein domains, so that the surface contains multiple negative charges, thereby increasing the polarity of the domains themselves and making them more soluble in water, and being able to repel proteins with acidic isoelectric points, thereby reducing aggregation during protein purification, which not only plays an excellent solubilizing role, but also reduces interference with the folding process of the recombinant protein itself. The solvent-exposed surfaces of some bacterial receptor proteins, especially those containing helical bundle structures, have been shown to be very stable [Alexander P et al., Thermodynamic analysis of the folding of the streptococcal protein G IgG-binding domains B1 and B2: why small proteins tend to have high denaturation temperatures, Biochemistry, 1992. 31(14): 3597-3603]. These stable solvent-exposed surface regions can be used to design stable solubilizing peptides.
本发明的酸性表面助溶短肽标签(Sacid)基于葡萄球菌蛋白A的B结构域[
N et al.,A synthetic IgG-binding domain based on staphylococcal protein A,Protein Eng,1(2):107-113,1987]改造而来。该结构域短小稳定,仅含57个氨基酸残基,分子量仅8kD,适合进行电荷分布改造和作为融合标签添加到重组蛋白上,其可高效助溶重组蛋白,在提纯过程中稳定重组蛋白构象,提高重组蛋白表 达效率,维持其生物学功能活性。本发明将这种改造后的表面富含负电荷的短小多肽嫁接到目标重组蛋白,与早期其他研究者已发表的正电荷助溶标签相比,对提高重组蛋白的表达效率、溶解性和生物活性方面更具优势。
The acidic surface solubilizing peptide tag (Sacid) of the present invention is based on the B domain of Staphylococcus protein A [ N et al., A synthetic IgG-binding domain based on staphylococcal protein A, Protein Eng, 1(2): 107-113, 1987]. This domain is short and stable, containing only 57 amino acid residues and a molecular weight of only 8kD, which is suitable for charge distribution modification and addition to recombinant proteins as fusion tags. It can effectively solubilize recombinant proteins, stabilize the conformation of recombinant proteins during purification, improve the expression efficiency of recombinant proteins, and maintain their biological functional activity. The present invention grafts this modified short polypeptide with a surface rich in negative charge onto the target recombinant protein, which is more advantageous than the positively charged solubilizing tags published by other researchers in the early stage in terms of improving the expression efficiency, solubility and biological activity of recombinant proteins.
在第一方面,本发明提供了一种分离的肽,其包含氨基酸序列VDNKFNKEQQX
1AFYEILHLPNLNEEQRNAFIQX
2LKDDPX
3X
4SX
5X
6X
7LX
8EAX
9X
10LNDAQPK(SEQ ID NO:9),其中:X
1-X
10均是带负电荷氨基酸,优选为谷氨酸(E)或天冬氨酸(D)。
In a first aspect, the present invention provides an isolated peptide comprising the amino acid sequence VDNKFNKEQQX1AFYEILHLPNLNEEQRNAFIQX2LKDDPX3X4SX5X6X7LX8EAX9X10LNDAQPK ( SEQ ID NO: 9 ), wherein: X1 - X10 are all negatively charged amino acids, preferably glutamic acid (E) or aspartic acid ( D ) .
在第二方面,本发明提供了一种分离的多核苷酸,其编码第一方面的肽。In a second aspect, the present invention provides an isolated polynucleotide encoding the peptide of the first aspect.
在第三方面,本发明提供了一种分离的融合蛋白,其包含第一肽和第二肽,其中所述第一肽是第一方面的肽,所述第二肽是目的多肽。In a third aspect, the present invention provides an isolated fusion protein comprising a first peptide and a second peptide, wherein the first peptide is the peptide of the first aspect and the second peptide is a polypeptide of interest.
在第四方面,本发明提供了一种分离的多核苷酸,其编码第三方面的融合蛋白。In a fourth aspect, the present invention provides an isolated polynucleotide encoding the fusion protein of the third aspect.
在第五方面,本发明提供了一种构建体,其包含第四方面的多核苷酸。In a fifth aspect, the present invention provides a construct comprising the polynucleotide of the fourth aspect.
在第六方面,本发明提供了一种宿主细胞,其包含第四方面的多核苷酸或第五方面的构建体,其中所述宿主细胞能够表达所述融合蛋白。In a sixth aspect, the present invention provides a host cell comprising the polynucleotide of the fourth aspect or the construct of the fifth aspect, wherein the host cell is capable of expressing the fusion protein.
在第七方面,本发明提供了一种生产融合蛋白的方法,包括:(a)在适合融合蛋白表达的条件下培养第六方面的宿主细胞;和(b)回收融合蛋白,任选地,(c)切割融合蛋白以释放目的多肽和(d)回收所述目的多肽。In a seventh aspect, the present invention provides a method for producing a fusion protein, comprising: (a) culturing the host cell of the sixth aspect under conditions suitable for expression of the fusion protein; and (b) recovering the fusion protein, optionally, (c) cleaving the fusion protein to release a polypeptide of interest and (d) recovering the polypeptide of interest.
在第八方面,本发明提供了第一方面的肽或第二方面的多核苷酸用于生产目的蛋白的用途。In an eighth aspect, the present invention provides use of the peptide of the first aspect or the polynucleotide of the second aspect for producing a target protein.
在第九方面,本发明提供了一种生产目的蛋白的方法,包括:(a)在宿主细胞中表达目的蛋白与第一方面的肽融合形成的融合蛋白;(b)切割融合蛋白,释放目的蛋白;和(c)任选地,分离和/或纯化所述目的蛋白。In the ninth aspect, the present invention provides a method for producing a target protein, comprising: (a) expressing a fusion protein formed by fusion of the target protein and the peptide of the first aspect in a host cell; (b) cutting the fusion protein to release the target protein; and (c) optionally, isolating and/or purifying the target protein.
图1:不同短肽标签的氨基酸序列的序列比对结果。其中,Wt相应于SEQ ID NO:4;Zbasic相应于SEQ ID NO:7;Sacid相应于SEQ ID NO:1;Sacid1相应于SEQ ID NO:2;以及Sacid2相应于SEQ ID NO:3。Figure 1: Sequence alignment results of amino acid sequences of different short peptide tags. Among them, Wt corresponds to SEQ ID NO: 4; Zbasic corresponds to SEQ ID NO: 7; Sacid corresponds to SEQ ID NO: 1; Sacid1 corresponds to SEQ ID NO: 2; and Sacid2 corresponds to SEQ ID NO: 3.
图2:不同短肽标签在生理pH的静电势模型;A图为葡萄球菌蛋白A的B结构域在生理pH下N端(左)和C端(右)的静电势模型;B图为Zbasic在生理pH下N端(左)和C端(右)的静电势模型;C图为Sacid在生理pH下N端(左)和C端(右)的静电势模型;D图为Sacid 1在生理pH下N端(左)和C端(右)的静电势模型;E图为Sacid 2在生理pH下N端(左)和C端(右)的静电势模型。Figure 2: Electrostatic potential model of different short peptide tags at physiological pH; Figure A is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of the B domain of Staphylococcal protein A at physiological pH; Figure B is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Zbasic at physiological pH; Figure C is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid at physiological pH; Figure D is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid 1 at physiological pH; Figure E is the electrostatic potential model of the N-terminus (left) and C-terminus (right) of Sacid 2 at physiological pH.
图3:通过计算机辅助预测的溶解度及等电点(网址https://protein-sol.manchester.ac.uk/)。其基于序列预测蛋白质可溶性。基于对无细胞表达的大肠杆菌蛋白质溶解度双峰分布的观察[Niwa T et al.,Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins,Proceedings of the National Academy of Sciences of the United States of America,2009,106(11):4201-4206],这些测量报告了一种蛋白质的可溶性部分(在离心后的上清液中)与该蛋白质的总量之比,而不是一种热力学属性。该工具计算35种特征,包括:20种氨基酸组成;7种氨基酸组合:K-R、D-E、K+R、D+E、K+R-D-E、K+R+D+E、F+W+Y;8种预测的特征:长度、等电点、疏水性、pH=7.0时蛋白的净电荷、折叠倾向、无序区、序列熵、和β链倾向。结合35个特征的线性模型给出了对溶解度数据的初步拟合。返回值介于0-1之间,如果高于0.45(大肠杆菌表达的蛋白数据集Niwa et al(2009)的平均值),意味着蛋白的溶解性可能比大肠杆菌表达的蛋白平均的溶解性要高。Figure 3: Solubility and isoelectric point predicted by computer-aided prediction (URL: https://protein-sol.manchester.ac.uk/). It predicts protein solubility based on sequence. Based on the observation of a bimodal distribution of solubility of cell-free expressed Escherichia coli proteins [Niwa T et al., Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins, Proceedings of the National Academy of Sciences of the United States of America, 2009, 106(11): 4201-4206], these measurements report the ratio of the soluble fraction of a protein (in the supernatant after centrifugation) to the total amount of the protein, rather than a thermodynamic property. The tool calculates 35 features, including: 20 amino acid compositions; 7 amino acid combinations: K-R, D-E, K+R, D+E, K+R-D-E, K+R+D+E, F+W+Y; 8 predicted features: length, isoelectric point, hydrophobicity, net charge of protein at pH=7.0, folding tendency, disordered region, sequence entropy, and beta chain tendency. A linear model combining 35 features gives a preliminary fit to the solubility data. The return value is between 0-1. If it is higher than 0.45 (the average value of the E. coli expressed protein data set Niwa et al (2009)), it means that the solubility of the protein may be higher than the average solubility of the E. coli expressed protein.
A图和B图为不同助溶标签的溶解度。从图中可知,本发明的酸性短肽助溶标签(Sacid)和改造前的野生型(WT)标签的自身溶解性虽然相差不大,甚至略低于SUMO标签(A图),但是由于其带电基团集中在表面,因而在和重组蛋白融合后,能比其他标签更有效与水中带正电的质子发生相互作用形成水化膜,从而帮助重组蛋白溶解。因而Sacid标签在与目的蛋白融合后,可明显提高目的蛋白的溶解度(B图)。本发明的酸性短肽助溶标签(Sacid)比已知发表的正电荷标签(Zbasic)的自身溶解度要高(A图),二者电荷均分布在表面,但是由于酸性表面的负电荷可以和大多偏弱酸性的重组蛋白表面更多的负电荷相互排斥,因而不容易发生聚集,比碱性助溶肽标签的助溶效果更好。因而,在与目的蛋白融合后,助溶效果明显优于正电荷标签(Zbasic)(B图)。类似的,和其他常见的已知助溶标签相比,本发明的酸性短肽助溶标签(Sacid)自身的助溶效果也明显更优。Figures A and B show the solubility of different solubility-enhancing tags. As can be seen from the figure, although the self-solubility of the acidic short peptide solubility tag (Sacid) of the present invention and the wild-type (WT) tag before modification is not much different, and is even slightly lower than the SUMO tag (Figure A), because its charged groups are concentrated on the surface, after fusion with the recombinant protein, it can interact with the positively charged protons in the water more effectively than other tags to form a hydration film, thereby helping the recombinant protein to dissolve. Therefore, after the Sacid tag is fused with the target protein, the solubility of the target protein can be significantly improved (Figure B). The acidic short peptide solubility tag (Sacid) of the present invention has a higher self-solubility than the known published positively charged tag (Zbasic) (Figure A). Both charges are distributed on the surface, but because the negative charge on the acidic surface can repel each other with more negative charges on the surface of most weakly acidic recombinant proteins, it is not easy to aggregate, and the solubility effect is better than that of the alkaline solubility peptide tag. Therefore, after fusion with the target protein, the solubility effect is significantly better than the positively charged tag (Zbasic) (Figure B). Similarly, compared with other commonly known solubilizing tags, the acidic short peptide solubilizing tag (Sacid) of the present invention also has a significantly better solubilizing effect.
C图和D图为不同助溶标签的等电点。从图中可知本发明的酸性短肽助溶标签体现出更好的偏酸性。在与目的蛋白融合后,融合蛋白的pI仍体现较强偏酸性。Figures C and D show the isoelectric points of different solubilizing tags. It can be seen from the figure that the acidic short peptide solubilizing tag of the present invention exhibits better acidity. After fusion with the target protein, the pI of the fusion protein still exhibits a strong acidity.
E图为本发明的酸性短肽助溶标签(Sacid)预测的特征(pH=7.0时蛋白的净电荷、折叠倾向等)。Figure E shows the predicted characteristics of the acidic short peptide solubility tag (Sacid) of the present invention (net charge of the protein at pH = 7.0, folding tendency, etc.).
图4:本发明所采用的蛋白表达纯化技术示意图。FIG4 is a schematic diagram of the protein expression and purification technology used in the present invention.
图5:不同纯化标签纯化目的蛋白的电泳鉴定。Figure 5: Electrophoretic identification of target proteins purified using different purification tags.
A图为单独表达NsiI蛋白时的情况。M:蛋白质分子量标准;F:Strep-tag II纯化流穿液;W1、W2:Strep-tag II纯化洗杂液;E:Strep tag II纯化洗脱液。箭头指向目的蛋白,从图中可见,单独表达NsiI蛋白,由于该蛋白正确折叠前易发生聚集沉淀,其表达量不高,纯化后,目的蛋白明显减少甚至完全损失。Figure A shows the situation when NsiI protein is expressed alone. M: protein molecular weight standard; F: Strep-tag II purification flow-through; W1, W2: Strep-tag II purification wash solution; E: Strep-tag II purification eluate. The arrow points to the target protein. It can be seen from the figure that when NsiI protein is expressed alone, its expression level is not high because it is prone to aggregation and precipitation before the protein is correctly folded. After purification, the target protein is significantly reduced or even completely lost.
B图为采用NeonGreen作为助溶肽表达NsiI,NeonGreen-NsiI融合蛋白的可溶性表达情况。M:蛋白质分子量标准;F:Strep-tag II纯化流穿液;W1、W2:Strep-tag II纯化洗杂液;E:Strep tag II纯化洗脱液。箭头指向融合蛋白,说明采用NeonGreen作为助溶蛋白融合表达的方式,虽然也能够实现融合蛋白的纯化和富集,但由于自身分子量较大,仍然存在折叠时间较长,助溶效果有限和易干扰重组蛋白活性等缺陷,从图中可看出大部分蛋白以不溶性包涵体存在且融合蛋 白易降解。Figure B shows the soluble expression of NeonGreen-NsiI fusion protein using NeonGreen as a solubilizing peptide to express NsiI. M: protein molecular weight standard; F: Strep-tag II purification flow-through; W1, W2: Strep-tag II purification wash solution; E: Strep-tag II purification eluate. The arrow points to the fusion protein, indicating that although the use of NeonGreen as a solubilizing protein fusion expression method can also achieve the purification and enrichment of the fusion protein, due to its large molecular weight, it still has defects such as long folding time, limited solubilization effect and easy interference with the activity of the recombinant protein. It can be seen from the figure that most of the protein exists in insoluble inclusion bodies and the fusion protein is easily degraded.
C图为采用其他研究者已发表的正电荷助溶标签(Zbasic)作为纯化标签,采用Strep-tag II亲和柱层析纯化融合蛋白的电泳鉴定。M:蛋白质分子量标准;F:Strep-tag II纯化流穿液;W1、W2:Strep-tag II纯化洗杂液;E:Strep tag II纯化洗脱液。箭头指向融合蛋白,与图5D相比,已发表的正电荷助溶标签(Zbasic)产量偏低;侧面说明本发明Sacid标签可高效助溶重组蛋白,在提纯过程中稳定重组蛋白构象,提高重组蛋白表达效率。Figure C shows the electrophoresis identification of the fusion protein purified by Strep-tag II affinity column chromatography using a positively charged solubilizing tag (Zbasic) published by other researchers as a purification tag. M: protein molecular weight standard; F: Strep-tag II purification flow-through; W1, W2: Strep-tag II purification wash solution; E: Strep-tag II purification eluate. The arrow points to the fusion protein. Compared with Figure 5D, the published positively charged solubilizing tag (Zbasic) has a lower yield; this indirectly shows that the Sacid tag of the present invention can efficiently solubilize the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, and improve the expression efficiency of the recombinant protein.
D图为采用本发明技术以Sacid作为纯化标签,采用Strep-tag II亲和柱层析纯化融合蛋白的电泳鉴定。M:蛋白质分子量标准;F:Strep-tag II纯化流穿液;W1:Strep-tag II纯化洗杂液;E:Strep tag II纯化洗脱液。箭头指向融合蛋白,与图5C相比,Sacid标签明显提高了融合蛋白的可溶性,且经过Strep-tag II纯化后得到目的蛋白。Figure D shows the electrophoresis identification of the fusion protein purified by Strep-tag II affinity column chromatography using Sacid as the purification tag using the technology of the present invention. M: protein molecular weight standard; F: Strep-tag II purification flow-through; W1: Strep-tag II purification wash solution; E: Strep-tag II purification eluate. The arrow points to the fusion protein. Compared with Figure 5C, the Sacid tag significantly improves the solubility of the fusion protein, and the target protein is obtained after Strep-tag II purification.
图6:Sacid与NeonGreen纯化标签纯化目的蛋白的比较Figure 6: Comparison of purification of target protein using Sacid and NeonGreen purification tags
A图为采用Sacid和NeonGreen作为助溶肽表达TEVP蛋白的SDS PAGE图。M:蛋白质分子量标准;ST:Sacid-TEVP融合蛋白;NT:NeonGreen-TEVP融合蛋白。箭头指向融合蛋白,从图中仅能观察到Sacid-TEVP在39KD处有一条较浓的条带,而采用NeonGreen作为助溶蛋白融合表达的方式却不能很好的看出目的条带。Figure A is the SDS PAGE of TEVP protein expressed using Sacid and NeonGreen as solubilizing peptides. M: protein molecular weight standard; ST: Sacid-TEVP fusion protein; NT: NeonGreen-TEVP fusion protein. The arrow points to the fusion protein. From the figure, only a thick band at 39KD of Sacid-TEVP can be observed, while the target band cannot be clearly seen when NeonGreen is used as a solubilizing protein fusion expression method.
B图为采用Sacid和NeonGreen作为助溶肽表达TEVP的Western印迹图。M:蛋白质分子量标准;ST:Sacid-TEVP融合蛋白;NT:NeonGreen-TEVP融合蛋白。箭头指向融合蛋白,从图中可以看出,采用Sacid作为助溶肽,Western印迹结果显示在39KD出有明显的条带,但采用NeonGreen作为助溶肽,Western印迹却未观察到目的条带,与A图的SDS PAGE结果基本一致。说明采用本发明融合表达后,目的蛋白表达量得到了大幅度提高;与其他标签相比,本发明Sacid标签明显提高了融合蛋白的可溶性。Figure B is a Western blot of TEVP expressed using Sacid and NeonGreen as solubilizing peptides. M: protein molecular weight standard; ST: Sacid-TEVP fusion protein; NT: NeonGreen-TEVP fusion protein. The arrow points to the fusion protein. It can be seen from the figure that when Sacid is used as a solubilizing peptide, the Western blot results show a clear band at 39KD, but when NeonGreen is used as a solubilizing peptide, no target band is observed in the Western blot, which is basically consistent with the SDS PAGE result of Figure A. It shows that after the fusion expression of the present invention, the expression amount of the target protein has been greatly improved; compared with other tags, the Sacid tag of the present invention significantly improves the solubility of the fusion protein.
图7:纯化蛋白的活性测试,使用含单个NsiI酶切位点和单个HindIII酶切位点的质粒做底物,进行双酶切反应后,得到3392bp和1987bp的目标条带。对提纯的NsiI酶进行梯度(10
1~10
8倍)稀释后进行双酶切反应(37℃,1小时)。A图为单独表达的NsiI蛋白和NeonGreen-NsiI蛋白的双酶切鉴定;B图为正电荷助溶标签(Zbasic)表达的NsiI蛋白和本发明Sacid-NsiI蛋白的双酶切鉴定。从图中可以看出,单独表达的NsiI蛋白几乎无活性;NeonGreen-NsiI融合蛋白及正电荷助溶标签(Zbasic)表达的NsiI蛋白活性基本一致;而采用本发明技术以Sacid作为纯化标签,纯化的蛋白Sacid-NsiI的活性得到大幅度提高,比NeonGreen-NsiI融合蛋白及正电荷助溶标签(Zbasic)表达的NsiI蛋白活性均较高。
Figure 7: Activity test of purified protein, using a plasmid containing a single NsiI restriction site and a single HindIII restriction site as a substrate, after double restriction digestion reaction, 3392bp and 1987bp target bands were obtained. The purified NsiI enzyme was diluted in a gradient (10 1 to 10 8 times) and then subjected to double restriction digestion reaction (37°C, 1 hour). Figure A shows the double restriction digestion identification of the NsiI protein expressed alone and the NeonGreen-NsiI protein; Figure B shows the double restriction digestion identification of the NsiI protein expressed by the positively charged solubilizing tag (Zbasic) and the Sacid-NsiI protein of the present invention. As can be seen from the figure, the NsiI protein expressed alone is almost inactive; the activities of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the positively charged solubility tag (Zbasic) are basically the same; and by using the technology of the present invention with Sacid as the purification tag, the activity of the purified protein Sacid-NsiI is greatly improved, which is higher than the activity of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the positively charged solubility tag (Zbasic).
图8:不同方式纯化的酶的比活力(一个单位定义为在50μl反应缓冲液中在37℃、1小时消化1μg含单个NsiI酶切位点的质粒DNA所需的酶量)。Figure 8: Specific activities of enzymes purified in different ways (one unit is defined as the amount of enzyme required to digest 1 μg of plasmid DNA containing a single NsiI restriction site in 50 μl of reaction buffer at 37°C for 1 hour).
除非另有说明或者上下文明显,本文所用的缩写具有其在化学和生物学领域内的常规含义,本发明中使用的所有技术和科学术语与本技术领域技术人员通常理解的意义相同。本发明中未作具体说明的实验方法,均根据《分子克隆实验指南》(第四版)J.萨姆布鲁克一书中具体方法进行,或者按照相关产品说明书进行。本发明中所用生物试剂,无特殊说明,均可以从商业途径获得。本领域技术人员在不偏离本发明精神的范围内可以进行多种变化、改变和替代。Unless otherwise specified or obvious from the context, the abbreviations used herein have their conventional meanings in the fields of chemistry and biology, and all technical and scientific terms used in the present invention have the same meanings as those generally understood by those skilled in the art. Experimental methods not specifically described in the present invention are carried out according to the specific methods in the book "Molecular Cloning Laboratory Guide" (4th edition) by J. Sambrook, or according to the instructions of relevant products. The biological reagents used in the present invention can be obtained from commercial channels unless otherwise specified. Those skilled in the art can make various changes, modifications and substitutions without departing from the spirit of the present invention.
除非另有说明,本文中提及核酸序列时从左至右为5′至3′方向;提及氨基酸序列时从左(上游)至右(下游)为氨基(N)至羧基(C)方向。Unless otherwise indicated, nucleic acid sequences are referred to herein in a 5' to 3' direction from left to right; amino acid sequences are referred to herein in a 5' to 3' direction from left (upstream) to right (downstream).
本发明以葡萄球菌蛋白A的B结构域所示的氨基酸序列为基础,利用计算机辅助计算,对天然葡萄球菌蛋白A的B结构的C端结构域进行改造,引入带负电荷的氨基酸,让带负电的氨基酸集中在一面,设计了酸性表面助溶短肽标签(Sacid)。The present invention is based on the amino acid sequence shown in the B domain of staphylococcal protein A, uses computer-assisted calculations to transform the C-terminal domain of the B structure of natural staphylococcal protein A, introduces negatively charged amino acids, concentrates the negatively charged amino acids on one side, and designs an acidic surface solubilizing short peptide tag (Sacid).
本发明通过计算机辅助计算,对短小的稳定蛋白结构域进行电荷分布的表面化设计,使之表面包含多个负电,设计了几种酸性短肽标签,即Sacid(SEQ ID NO:1)、Sacid1(SEQ ID NO:2)、Sacid2(SEQ ID NO:3),发现Sacid标签的电荷比较集中,较其他设计的标签更加偏酸性,且能够与具有低等电点的蛋白质发生互斥作用,可使蛋白纯化过程中减少聚集,起到优越的助溶作用。将此标签与目的蛋白融合,发现该标签可高效助溶重组蛋白,在提纯过程中稳定重组蛋白构象,提高重组蛋白表达效率,维持其生物学功能活性。所述方法将所要研究的酸性表面助溶短肽标签与目的蛋白融合表达,通过亲和层析快速实现目的蛋白的纯化。The present invention uses computer-assisted calculations to perform surface design of charge distribution on short stable protein domains, so that the surface contains multiple negative charges, and designs several acidic short peptide tags, namely Sacid (SEQ ID NO: 1), Sacid1 (SEQ ID NO: 2), and Sacid2 (SEQ ID NO: 3). It is found that the charge of the Sacid tag is relatively concentrated, more acidic than other designed tags, and can repel proteins with low isoelectric points, which can reduce aggregation during protein purification and play an excellent solubilizing role. This tag is fused with the target protein, and it is found that the tag can efficiently solubilize the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, improve the expression efficiency of the recombinant protein, and maintain its biological functional activity. The method fuses the acidic surface solubilizing short peptide tag to be studied with the target protein, and quickly purifies the target protein through affinity chromatography.
在第一方面,本发明提供了一种分离的肽,其包含氨基酸序列VDNKFNKEQQX
1AFYEILHLPNLNEEQRNAFIQX
2LKDDPX
3X
4SX
5X
6X
7LX
8EAX
9X
10LNDAQPK(SEQ ID NO:9),其中:X
1-X
10均是带负电荷氨基酸,优选为谷氨酸(E)或天冬氨酸(D)。
In a first aspect, the present invention provides an isolated peptide comprising the amino acid sequence VDNKFNKEQQX1AFYEILHLPNLNEEQRNAFIQX2LKDDPX3X4SX5X6X7LX8EAX9X10LNDAQPK ( SEQ ID NO: 9 ), wherein: X1 - X10 are all negatively charged amino acids, preferably glutamic acid (E) or aspartic acid ( D ) .
在一个实施方案中,X
1为D。在一个实施方案中,X
2为E。在一个实施方案中,X
3为E。在一个实施方案中,X
4为E。在一个实施方案中,X
5为D。在一个实施方案中,X
6为E。在一个实施方案中,X
7为E。在一个实施方案中,X
8为E。在一个实施方案中,X
9为D。在一个实施方案中,X
10为D。
In one embodiment, X1 is D. In one embodiment, X2 is E. In one embodiment, X3 is E. In one embodiment, X4 is E. In one embodiment, X5 is D. In one embodiment, X6 is E. In one embodiment, X7 is E. In one embodiment, X8 is E. In one embodiment, X9 is D. In one embodiment, X10 is D.
本发明提供的分离的肽是一种能够提高重组蛋白表达效率的酸性表面助溶短肽,其基于葡萄球菌蛋白A的B结构域的氨基酸序列(SEQ ID NO:4),引入带负电荷的氨基酸并让带负电的氨基酸集中在表面。通过计算机辅助预测(参见https://protein-sol.manchester.ac.uk/),基于序列预测蛋白质可溶性。该工具计算35种特征,包括:20种氨基酸组成;7种氨基酸组合:K-R、D-E、K+R、D+E、K+R-D-E、K+R+D+E、F+W+Y;8种预测的特征:长度、等电点、疏水性、pH=7.0时蛋白的净电荷、折叠倾向、无序区、序列熵、和β链倾向。结合35个特征的线性模型给出了对溶解度数据的初步拟合。预测的返回值介于0-1之间,如果高 于0.45(大肠杆菌表达的蛋白数据集Niwa et al(2009)的平均值),意味着蛋白的溶解性可能比大肠杆菌表达的蛋白平均的溶解性要高。The isolated peptide provided by the present invention is an acidic surface solubilizing short peptide capable of improving the expression efficiency of recombinant proteins. It is based on the amino acid sequence of the B domain of Staphylococcus protein A (SEQ ID NO: 4), introduces negatively charged amino acids and concentrates the negatively charged amino acids on the surface. The protein solubility is predicted based on the sequence by computer-assisted prediction (see https://protein-sol.manchester.ac.uk/). The tool calculates 35 features, including: 20 amino acid compositions; 7 amino acid combinations: K-R, D-E, K+R, D+E, K+R-D-E, K+R+D+E, F+W+Y; 8 predicted features: length, isoelectric point, hydrophobicity, net charge of protein at pH=7.0, folding tendency, disordered region, sequence entropy, and β-chain tendency. The linear model combining the 35 features gives a preliminary fit to the solubility data. The predicted return value is between 0 and 1. If it is higher than 0.45 (the average value of the E. coli expressed protein dataset Niwa et al (2009)), it means that the solubility of the protein may be higher than the average solubility of the E. coli expressed protein.
因此,在一个优选实施方案中,SEQ ID NO:9所示的肽的预测的溶解度大于0.45,如基于https://protein-sol.manchester.ac.uk/所预测的。在一个实施方案中,SEQ ID NO:9所示的肽的预测的溶解度等于或大于0.70,如基于https://protein-sol.manchester.ac.uk/所预测的。Thus, in a preferred embodiment, the predicted solubility of the peptide shown in SEQ ID NO: 9 is greater than 0.45, as predicted based on https://protein-sol.manchester.ac.uk/. In one embodiment, the predicted solubility of the peptide shown in SEQ ID NO: 9 is equal to or greater than 0.70, as predicted based on https://protein-sol.manchester.ac.uk/.
在一个实施方案中,所述SEQ ID NO:9的助溶肽的等电点pI等于或小于5.0。In one embodiment, the isoelectric point pI of the soluble peptide of SEQ ID NO: 9 is equal to or less than 5.0.
在一个实施方案中,所述肽包含SEQ ID NO:1所示的氨基酸序列或由SEQ ID NO:1所示的氨基酸序列组成。In one embodiment, the peptide comprises or consists of the amino acid sequence shown in SEQ ID NO: 1.
如本文所用,术语“肽”、“多肽”和“蛋白”可互换使用,并且定义为由通过肽键连接的氨基酸残基组成的生物分子。As used herein, the terms "peptide," "polypeptide," and "protein" are used interchangeably and are defined as biological molecules composed of amino acid residues linked by peptide bonds.
在第二方面,本发明涉及编码第一方面的肽的多核苷酸。In a second aspect, the present invention relates to a polynucleotide encoding the peptide of the first aspect.
如本文所用,术语“核苷酸序列”、“多核苷酸”、“核酸”和“核酸序列”可互换使用,是指多个核苷酸通过3’-5’-磷酸二酯键连接而成的大分子,其中所述核苷酸包括核糖核苷酸和脱氧核糖核苷酸。本发明的多核苷酸的序列可以针对不同的宿主细胞(如大肠杆菌)进行密码子优化,从而改善融合蛋白的表达。进行密码子优化的方法是本领域已知的。As used herein, the terms "nucleotide sequence", "polynucleotide", "nucleic acid" and "nucleic acid sequence" are used interchangeably and refer to a macromolecule composed of multiple nucleotides connected by 3'-5'-phosphodiester bonds, wherein the nucleotides include ribonucleotides and deoxyribonucleotides. The sequence of the polynucleotide of the present invention can be codon-optimized for different host cells (such as Escherichia coli) to improve the expression of the fusion protein. Methods for codon optimization are known in the art.
在第三方面,本发明涉及分离的融合蛋白,其包含第一肽和第二肽,其中所述第一肽是本发明第一方面的肽,所述第二肽是目的多肽。In a third aspect, the present invention relates to an isolated fusion protein comprising a first peptide and a second peptide, wherein the first peptide is a peptide according to the first aspect of the present invention and the second peptide is a polypeptide of interest.
如本文所用,“目的多肽”是指可通过本发明的方法生产并纯化的任何多肽或蛋白质,其非限制性例子包括酶、激素、免疫球蛋白链、诸如抗癌多肽的治疗性多肽、诊断性多肽或者可以用于免疫目的的多肽或其生物学活性片段等等。目的多肽可以来自任何来源,包括微生物来源多肽、哺乳动物来源多肽和人工蛋白质(例如融合蛋白或突变的蛋白质)等等。As used herein, "target polypeptide" refers to any polypeptide or protein that can be produced and purified by the method of the present invention, non-limiting examples of which include enzymes, hormones, immunoglobulin chains, therapeutic polypeptides such as anti-cancer polypeptides, diagnostic polypeptides, or polypeptides that can be used for immunization purposes or biologically active fragments thereof, etc. The target polypeptide can be derived from any source, including microbial-derived polypeptides, mammalian-derived polypeptides, and artificial proteins (e.g., fusion proteins or mutant proteins), etc.
目的多肽可以是任何长度的多肽和蛋白。在一个实施方案中,可通过本发明的方法生产并纯化的目的多肽的长度可以是20-500个氨基酸残基,例如,大约30、40、50、60、70、80、90、100、150、200、250、300、350、400、450、500个氨基酸残基。The target polypeptide can be a polypeptide or protein of any length. In one embodiment, the target polypeptide that can be produced and purified by the method of the present invention can have a length of 20-500 amino acid residues, for example, about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450, 500 amino acid residues.
在一个实施方案中,本文所述目的多肽是具有低于、等于或略大于7.0的酸性或中性或弱碱性等电点,例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0。In one embodiment, the polypeptide of interest described herein has an acidic, neutral or weakly basic isoelectric point lower than, equal to or slightly greater than 7.0, such as equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0.
在一些实施方案中,第一肽可以位于第二肽的上游(N端)或下游(C端),优选位于上游(N端)。在一些实施方案中,第一肽位于第二肽的上游。在一些实施方案中,第一肽位于第二肽的下游。In some embodiments, the first peptide can be located upstream (N-terminal) or downstream (C-terminal) of the second peptide, preferably located upstream (N-terminal). In some embodiments, the first peptide is located upstream of the second peptide. In some embodiments, the first peptide is located downstream of the second peptide.
如本文所用,第一肽位于第二肽的“上游”是指第一肽的C末端残基位于第二肽的N末端残基之前;第一肽位于第二肽的“下游”是指第一肽的N末端残基位于第二肽的C末端残基之后。As used herein, a first peptide being located "upstream" of a second peptide means that the C-terminal residue of the first peptide is located before the N-terminal residue of the second peptide; a first peptide being located "downstream" of a second peptide means that the N-terminal residue of the first peptide is located after the C-terminal residue of the second peptide.
本发明目的多肽的生产和纯化可在较温和的pH变化(例如pH 7-11)条件下实现,对于目的多肽的种类和性质没有特殊的限制,可用于多种不同多肽的表达和纯化,且最终目的多肽的产量和产率均较高。The production and purification of the target polypeptide of the present invention can be achieved under milder pH conditions (e.g., pH 7-11). There are no special restrictions on the type and properties of the target polypeptide. It can be used for the expression and purification of a variety of different polypeptides, and the final output and yield of the target polypeptide are high.
可通过本发明的方法来生产并纯化的“目的多肽”的实例包括但不限于枯草芽孢杆菌脂肪酶A(LipA)、绿色荧光蛋白(GFP)和烟曲霉II型酮胺氧化酶(AMA)、胰高血糖素样肽(GLP-1)、基质细胞衍生因子(SDF-1α)、舍莫瑞林(Sermorelin)、pleurocidin样阳离子抗菌肽NRC-03(PNRC03)和Hinnavin II-Melanocyte(HM)或它们的生物学活性片段等。Examples of "target polypeptides" that can be produced and purified by the method of the present invention include, but are not limited to, Bacillus subtilis lipase A (LipA), green fluorescent protein (GFP) and Aspergillus fumigatus type II ketoamine oxidase (AMA), glucagon-like peptide (GLP-1), stromal cell-derived factor (SDF-1α), sermorelin, pleurocidin-like cationic antimicrobial peptide NRC-03 (PNRC03) and Hinnavin II-Melanocyte (HM) or their biologically active fragments.
在一个实施方案中,所述目的多肽是NsiI蛋白,例如包含SEQ ID NO:5所示的氨基酸序列。In one embodiment, the target polypeptide is NsiI protein, for example comprising the amino acid sequence shown in SEQ ID NO: 5.
在一个实施方案中,所述目的多肽是TEVP蛋白,例如包含SEQ ID NO:8所示的氨基酸序列。In one embodiment, the target polypeptide is a TEVP protein, for example comprising the amino acid sequence shown in SEQ ID NO: 8.
在一些实施方案中,本发明的融合蛋白中的第一肽和第二肽通过间隔物连接。In some embodiments, the first peptide and the second peptide in the fusion protein of the present invention are connected by a spacer.
如本文所用,“间隔物”或“间隔序列”是指具有一定长度的由低疏水性和低电荷效应的氨基酸组成的肽,其用于融合蛋白时可以使所连接的各部分充分展开、互不干扰地充分折叠成各自的天然构象。本领域常用的间隔物包括例如,富含甘氨酸(G)和丝氨酸(S)的柔性的GS型接头;富含脯氨酸(P)和苏氨酸(T)的刚性的PT型接头。由于GS型接头具有较合适的氨基酸长度,同时具有疏水性和延展性并能使功能蛋白具有较好的稳定性和生物活性,因而在本发明中优选使用GS型接头。As used herein, "spacer" or "spacer sequence" refers to a peptide of a certain length composed of amino acids with low hydrophobicity and low charge effect, which, when used in a fusion protein, can allow the connected parts to fully unfold and fully fold into their respective natural conformations without interfering with each other. Commonly used spacers in the art include, for example, flexible GS-type linkers rich in glycine (G) and serine (S); and rigid PT-type linkers rich in proline (P) and threonine (T). Since GS-type linkers have a more suitable amino acid length, are hydrophobic and ductile, and can make the functional protein have better stability and biological activity, GS-type linkers are preferably used in the present invention.
在某些应用中,例如在多肽类药物的生产中,需要重组生产的多肽与目的多肽具有一致的序列,即两端不具有额外的氨基酸残基。为此,在一些实施方案中,本发明的融合蛋白中的间隔物还包含切割位点。通过所述切割位点的切割,可以将目的多肽从融合蛋白中释放。In certain applications, such as the production of polypeptide drugs, the polypeptide produced by recombinant production needs to have a consistent sequence with the target polypeptide, i.e., no additional amino acid residues at both ends. For this reason, in some embodiments, the spacer in the fusion protein of the present invention also includes a cleavage site. The cleavage of the cleavage site can release the target polypeptide from the fusion protein.
合适的切割位点包括可以化学切割、酶法切割或自切割的切割位点,或本领域技术人员已知的其它任何切割位点。本发明中优选的切割位点可以进行自切割,例如,其包含可自切割的内含肽的氨基酸序列。这是因为基于内含肽的切割方法不需要外加酶或使用如化学法中所用的溴化氢等有害物质,而仅仅需要改变聚集体所处的缓冲环境就能简单地诱导切割。本领域已知多种自切割内含肽,例如NEB公司的一系列具有不同自切割特性的内含肽。Suitable cleavage sites include cleavage sites that can be chemically cleaved, enzymatically cleaved or self-cleaved, or any other cleavage sites known to those skilled in the art. Preferred cleavage sites in the present invention can be self-cleaved, for example, they contain the amino acid sequence of a self-cleavable intein. This is because the cleavage method based on intein does not require the addition of enzymes or the use of harmful substances such as hydrogen bromide used in chemical methods, but only requires changing the buffer environment in which the aggregate is located to simply induce cleavage. A variety of self-cleaving inteins are known in the art, such as a series of inteins with different self-cleavage properties from NEB.
在一些实施方案中,间隔序列可以含有能被特异性化学切割或生物酶解的特定序列,例如Met(甲硫氨酸)残基可被CNBr化学切割,Lys(赖氨酸)或Arg(精氨酸)残基可被胰蛋白酶切割,LysArg或ArgArg可被双碱基蛋白酶类如Kex2切割,GluAsnLeuTyrPheGln可被烟草蚀纹病毒蛋白酶(TEV蛋白酶,TEVP)识别和切割,IleGluGlyArg可被Xa因子(Xa蛋白酶)识别和切割,以及其它合适的蛋白酶切位点或者内切肽(Intein)。In some embodiments, the spacer sequence may contain a specific sequence that can be specifically chemically cleaved or biologically enzymatically hydrolyzed, for example, a Met (methionine) residue can be chemically cleaved by CNBr, a Lys (lysine) or Arg (arginine) residue can be cleaved by trypsin, LysArg or ArgArg can be cleaved by a two-base protease such as Kex2, GluAsnLeuTyrPheGln can be recognized and cleaved by tobacco etch virus protease (TEV protease, TEVP), IleGluGlyArg can be recognized and cleaved by factor Xa (Xa protease), as well as other suitable protease cleavage sites or inteins.
在进一步的实施方案中,所述融合蛋白还可以包含与第二肽(目的多肽)连接的用于分离纯化第二肽的部分,例如6组氨酸标签、GST标签、Strep标签、 Twin-Strep标签、MBP标签等。由此,在去除所述第一肽后,可以通过该部分来分离纯化第二肽。In a further embodiment, the fusion protein may further comprise a portion connected to the second peptide (target polypeptide) for separating and purifying the second peptide, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc. Thus, after removing the first peptide, the second peptide can be separated and purified by this portion.
在一个实施方案中,所述融合蛋白包含本发明第一方面所述的肽、间隔物、目的多肽和用于分离纯化第二肽的部分。In one embodiment, the fusion protein comprises the peptide described in the first aspect of the present invention, a spacer, a polypeptide of interest, and a portion for separating and purifying a second peptide.
在第四方面,本发明涉及包含编码所述融合蛋白的核苷酸序列的分离的多核苷酸,以及在第五方面,本发明涉及包含第四方面的多核苷酸的构建体、特别是表达构建体。In a fourth aspect, the present invention relates to an isolated polynucleotide comprising a nucleotide sequence encoding said fusion protein, and in a fifth aspect, the present invention relates to a construct, in particular an expression construct, comprising the polynucleotide of the fourth aspect.
在本发明的表达构建体中,编码所述融合蛋白的多核苷酸的序列与表达控制序列可操纵地连接以进行希望的转录及最终在宿主细胞中生产所述融合蛋白。合适的表达控制序列包括但不限于启动子、增强子、核糖体作用位点如核糖体结合位点、聚腺苷酸化位点、转录剪接序列、转录终止序列和稳定mRNA的序列等等。In the expression construct of the present invention, the sequence of the polynucleotide encoding the fusion protein is operably linked to an expression control sequence to carry out the desired transcription and ultimately produce the fusion protein in a host cell. Suitable expression control sequences include, but are not limited to, promoters, enhancers, ribosome action sites such as ribosome binding sites, polyadenylation sites, transcriptional splicing sequences, transcriptional termination sequences, and sequences that stabilize mRNA, etc.
用于构建本发明的表达构建体的载体包括那些在宿主细胞中自主复制的载体,如质粒载体;还包括能够整合到宿主细胞DNA中并和宿主细胞DNA一起复制的载体。可商购获得许多适于本发明的载体。在一个实施方案中,所述质粒为适用于原核或真核表达系统的质粒。在一个具体实施方案中,所述质粒为或衍生自pET28a。Vectors used to construct the expression constructs of the present invention include those that replicate autonomously in host cells, such as plasmid vectors; and also include vectors that can be integrated into host cell DNA and replicated with the host cell DNA. Many vectors suitable for the present invention are commercially available. In one embodiment, the plasmid is a plasmid suitable for a prokaryotic or eukaryotic expression system. In a specific embodiment, the plasmid is or is derived from pET28a.
在第六方面,本发明涉及能够表达本发明第三方面的融合蛋白的宿主细胞。所述宿主细胞含有本发明第四方面的多核苷酸或本发明第五方面的构建体例如表达构建体,其中所述宿主细胞能够表达所述融合蛋白。In a sixth aspect, the present invention relates to a host cell capable of expressing the fusion protein of the third aspect of the present invention. The host cell contains a polynucleotide of the fourth aspect of the present invention or a construct of the fifth aspect of the present invention, such as an expression construct, wherein the host cell is capable of expressing the fusion protein.
用于表达本发明融合蛋白的宿主细胞包括原核生物、酵母和高等真核细胞。示例性的原核宿主包括埃希氏菌属(Escherichia)、芽孢杆菌属(Bacillus)、沙门氏菌属(Salmonella)以及假单胞菌属(Pseudomonas)和链霉菌属(Streptomyces)的细菌。在优选的实施方案中,宿主细胞是埃希氏菌属细胞、哺乳动物细胞或昆虫细胞。在更优选的实施方案中,宿主细胞是是大肠杆菌(Escherichia coli)、枯草芽抱杆菌(Bacillus subtilis)或巨大芽抱杆菌(Bacillus megaterium)。在本发明的一个具体实施方案中,所使用的宿主细胞为大肠杆菌Rosetta(DE3)菌株细胞。Host cells for expressing the fusion protein of the present invention include prokaryotes, yeast and higher eukaryotic cells. Exemplary prokaryotic hosts include bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces. In a preferred embodiment, the host cell is an Escherichia cell, a mammalian cell or an insect cell. In a more preferred embodiment, the host cell is Escherichia coli, Bacillus subtilis or Bacillus megaterium. In a specific embodiment of the present invention, the host cell used is an Escherichia coli Rosetta (DE3) strain cell.
可以通过许多已熟知的技术之一将本发明的多核苷酸或构建体例如表达构建体导入宿主细胞以表达编码的氨基酸序列,这样的技术包括但不限于:热激转化,电穿孔,DEAE-葡聚糖转染,显微注射,脂质体接介导的转染,磷酸钙沉淀,原生质融合,微粒轰击,病毒转化及类似技术。A polynucleotide or construct of the invention, such as an expression construct, can be introduced into a host cell to express the encoded amino acid sequence by one of many well-known techniques, including but not limited to heat shock transformation, electroporation, DEAE-dextran transfection, microinjection, liposome-mediated transfection, calcium phosphate precipitation, protoplast fusion, microprojectile bombardment, viral transformation and the like.
在一个实施方案中,编码融合蛋白的多核苷酸可以整合入宿主细胞的基因组中,其可以在适当的条件下表达所编码的融合蛋白或者组成性表达所编码的融合蛋白。In one embodiment, the polynucleotide encoding the fusion protein can be integrated into the genome of a host cell, which can express the encoded fusion protein under appropriate conditions or constitutively express the encoded fusion protein.
在一个实施方案中,编码融合蛋白的多核苷酸以染色体外的形式(例如质粒或构建体例如表达载体)存在于宿主细胞中。In one embodiment, the polynucleotide encoding the fusion protein is present in the host cell in an extrachromosomal form (eg, a plasmid or a construct such as an expression vector).
在第七方面,本发明还涉及制备本发明第三方面的融合蛋白的方法,包括:在适合融合蛋白表达的条件下培养本发明第六方面的宿主细胞;和回收融合蛋白, 任选地,通过融合蛋白切割以释放目的多肽和回收所述目的多肽。In a seventh aspect, the present invention also relates to a method for preparing the fusion protein of the third aspect of the present invention, comprising: culturing the host cell of the sixth aspect of the present invention under conditions suitable for expression of the fusion protein; and recovering the fusion protein, optionally, by cutting the fusion protein to release the target polypeptide and recovering the target polypeptide.
在一个实施方案中,所述方法包括:用本发明第五方面的构建体特别是表达构建体转化宿主细胞,在适合融合蛋白表达的条件下培养经转化的宿主细胞以及回收融合蛋白,任选地,通过切割融合蛋白以释放目的多肽和回收所述目的多肽。In one embodiment, the method comprises: transforming a host cell with a construct, in particular an expression construct, of the fifth aspect of the invention, culturing the transformed host cell under conditions suitable for expression of the fusion protein and recovering the fusion protein, optionally, by cutting the fusion protein to release the target polypeptide and recovering the target polypeptide.
本领域已知表达融合蛋白的条件,例如温度、pH、培养基等。本领域已知回收融合蛋白的任何合适方法,包括但不限于例如通过层析、离心、透析等。The conditions for expressing fusion proteins are known in the art, such as temperature, pH, culture medium, etc. Any suitable method for recovering fusion proteins is known in the art, including but not limited to, for example, by chromatography, centrifugation, dialysis, etc.
如本文所述,回收融合蛋白可以采用本领域已知的任何合适方法进行。例如,在融合蛋白表达后,收集宿主细胞,裂解细胞,收集上清(例如通过离心除去细胞碎片),任选地分离(例如通过特定标签或特异性结合分子如抗体)融合蛋白。As described herein, the recovery of the fusion protein can be carried out by any suitable method known in the art. For example, after the fusion protein is expressed, the host cells are collected, the cells are lysed, the supernatant is collected (e.g., by centrifugation to remove cell debris), and the fusion protein is optionally separated (e.g., by a specific tag or a specific binding molecule such as an antibody).
在一个实施方案中,所述方法包括:用本发明第五方面的构建体特别是表达构建体转化宿主细胞,在适合融合蛋白表达的条件下培养经转化的宿主细胞,收集并裂解细胞,收集上清,任选地分离融合蛋白。In one embodiment, the method comprises: transforming host cells with the construct of the fifth aspect of the invention, in particular the expression construct, culturing the transformed host cells under conditions suitable for expression of the fusion protein, collecting and lysing the cells, collecting the supernatant, and optionally isolating the fusion protein.
在优选的实施方案中,在所述融合蛋白的短肽标签和目的蛋白之间包括间隔序列,所述间隔序列包含可被切割的切割位点(如本文所述的切割位点),由此通过切割该切割位点,可以去除短肽标签,由此释放目的蛋白。In a preferred embodiment, a spacer sequence is included between the short peptide tag of the fusion protein and the target protein, and the spacer sequence contains a cleavage site (such as the cleavage site described herein) that can be cut, thereby removing the short peptide tag by cutting the cleavage site, thereby releasing the target protein.
在第八方面,本发明提供了第一方面的肽或第二方面的多核苷酸用于在宿主细胞中准备目的蛋白的用途。In an eighth aspect, the present invention provides use of the peptide of the first aspect or the polynucleotide of the second aspect for preparing a target protein in a host cell.
本发明第一方面的肽可以与目的蛋白形成融合蛋白,从而在宿主细胞中表达后,可以增加融合蛋白的溶解度,提高融合蛋白的表达效率,并且可以保持目的蛋白的生物学活性。The peptide of the first aspect of the present invention can form a fusion protein with the target protein, so that after being expressed in a host cell, the solubility of the fusion protein can be increased, the expression efficiency of the fusion protein can be improved, and the biological activity of the target protein can be maintained.
例如,可以将第二方面的多核苷酸与编码目的蛋白的多核苷酸符合读框地连接在一起,并置于合适的表达调控元件(例如启动子)的控制下,从而在合适的表达系统中表达希望的融合蛋白。For example, the polynucleotide of the second aspect can be linked to a polynucleotide encoding a target protein in frame and placed under the control of a suitable expression regulatory element (eg, a promoter) to express the desired fusion protein in a suitable expression system.
在第九方面,本发明提供了一种生产目的蛋白的方法,包括在宿主细胞中表达本发明第一方面所述的肽与目的蛋白融合形成的例如本发明第三方面所述的融合蛋白,回收融合蛋白,切割所述融合蛋白以释放目的蛋白,任选地分离纯化释放的目的蛋白。In the ninth aspect, the present invention provides a method for producing a target protein, comprising expressing in a host cell a fusion protein formed by fusion of the peptide described in the first aspect of the present invention and the target protein, such as the fusion protein described in the third aspect of the present invention, recovering the fusion protein, cutting the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
在一个实施方案中,所述方法包括:构建表达构建体(例如质粒或病毒载体),其包含编码本发明第一方面所述的肽与目的蛋白融合形成的融合蛋白的核苷酸序列,用所述表达构建体转化宿主细胞,在适合融合蛋白表达的条件下培养经转化的宿主细胞,回收融合蛋白,切割所述融合蛋白以释放目的蛋白,任选地分离纯化释放的目的蛋白。In one embodiment, the method comprises: constructing an expression construct (e.g., a plasmid or a viral vector) comprising a nucleotide sequence encoding a fusion protein formed by fusion of the peptide described in the first aspect of the present invention and a target protein, transforming a host cell with the expression construct, culturing the transformed host cell under conditions suitable for expression of the fusion protein, recovering the fusion protein, cleaving the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
如本文所述,回收融合蛋白可以采用本领域已知的任何合适方法进行。例如,在融合蛋白表达后,收集宿主细胞,裂解细胞,收集上清(例如通过离心除去细胞碎片),任选地分离(例如通过特定标签或特异性结合分子如抗体)融合蛋白。As described herein, the recovery of the fusion protein can be carried out by any suitable method known in the art. For example, after the fusion protein is expressed, the host cells are collected, the cells are lysed, the supernatant is collected (e.g., by centrifugation to remove cell debris), and the fusion protein is optionally separated (e.g., by a specific tag or a specific binding molecule such as an antibody).
在一个实施方案中,所述方法包括:用本发明第五方面的构建体特别是表达构建体转化宿主细胞,在适合融合蛋白表达的条件下培养经转化的宿主细胞,收集并裂解细胞,收集上清,切割所述融合蛋白以释放目的蛋白,任选地分离纯化 释放的目的蛋白。In one embodiment, the method comprises: transforming a host cell with the construct of the fifth aspect of the invention, in particular an expression construct, culturing the transformed host cell under conditions suitable for expression of the fusion protein, collecting and lysing the cells, collecting the supernatant, cutting the fusion protein to release the target protein, and optionally isolating and purifying the released target protein.
如本文所述,切割融合蛋白可以采用本领域已知的任何合适手段进行,例如在短肽标签和目的蛋白之间包括间隔序列,所述间隔序列包含可被切割的切割位点(如本文所述的切割位点),由此通过切割融合蛋白可以去除短肽标签,从而释放目的蛋白。As described herein, cleavage of the fusion protein can be performed by any suitable means known in the art, such as including a spacer sequence between the short peptide tag and the target protein, wherein the spacer sequence comprises a cleavage site that can be cleaved (such as the cleavage site described herein), thereby removing the short peptide tag by cleaving the fusion protein, thereby releasing the target protein.
在进一步的实施方案中,所述融合蛋白还可以包含与目的蛋白连接的用于分离纯化目的的部分,例如6组氨酸标签、GST标签、Strep标签、Twin-Strep标签、MBP标签等。由此,在去除所述第一肽后,可以通过该部分来分离纯化目的蛋白。In a further embodiment, the fusion protein may further comprise a portion connected to the target protein for separation and purification purposes, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag, an MBP tag, etc. Thus, after removing the first peptide, the target protein can be separated and purified by this portion.
在一个实施方案中,所述方法包括构建表达构建体,其包含编码本发明第一方面所述的肽核苷酸序列、编码间隔序列的核苷酸序列、编码目的蛋白的核苷酸序列和编码用于分离纯化第二肽的部分的核苷酸序列。In one embodiment, the method comprises constructing an expression construct comprising a nucleotide sequence encoding the peptide described in the first aspect of the invention, a nucleotide sequence encoding a spacer sequence, a nucleotide sequence encoding a target protein, and a nucleotide sequence encoding a portion for isolating and purifying a second peptide.
在一个实施方案中,所述宿主细胞选自原核生物、酵母和真核细胞例如哺乳动物细胞或昆虫细胞,更优选选自埃希氏菌属、芽孢杆菌属、沙门氏菌属、假单胞菌属和链霉菌属,更优选选自埃希氏菌属,更优选是大肠杆菌、枯草芽抱杆菌或巨大芽抱杆菌,更优选大肠杆菌Rosetta(DE3)。In one embodiment, the host cell is selected from prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3).
在一个实施方案中,所述目的蛋白具有低于、等于或略大于7.0的酸性或中性或弱碱性等电点,例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0,更优选是包含SEQ ID NO:5所示氨基酸序列的NsiI蛋白或包含SEQ ID NO:8所示氨基酸序列的TEVP蛋白。In one embodiment, the target protein has an acidic, neutral or weakly alkaline isoelectric point lower than, equal to or slightly greater than 7.0, for example, equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0, and more preferably is an NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
与现有技术相比,本发明具有如下有益效果:Compared with the prior art, the present invention has the following beneficial effects:
1)本发明与传统技术相比,能够克服目的蛋白在原核表达系统易形成包涵体的缺点,可高效助溶重组蛋白,在提纯过程中稳定重组蛋白构象,提高重组蛋白表达效率,维持其生物学功能活性,可广泛应用于难溶解蛋白的发酵生产领域;1) Compared with the traditional technology, the present invention can overcome the disadvantage that the target protein is easy to form inclusion bodies in the prokaryotic expression system, can effectively dissolve the recombinant protein, stabilize the conformation of the recombinant protein during the purification process, improve the expression efficiency of the recombinant protein, maintain its biological function activity, and can be widely used in the fermentation production field of insoluble proteins;
2)本发明的酸性短肽助溶标签可替代GST、MBP、NeonGreen等助溶标签,比现有已知的常用助溶标签均更短小,且结构稳定,仅含57个氨基酸残基,分子量仅8kD,和重组蛋白融合表达时对重组蛋白潜在的影响更少;2) The acidic short peptide solubility tag of the present invention can replace the solubility tags such as GST, MBP, and NeonGreen. It is shorter and smaller than the commonly known solubility tags, and has a stable structure. It contains only 57 amino acid residues and a molecular weight of only 8kD. It has less potential impact on the recombinant protein when fused with the recombinant protein;
3)与之前已知的正电荷助溶标签(Zbasic)相比,本发明的酸性助溶标签(Sacid)与生物体内广泛存在的偏酸性蛋白之间相互排斥,更不容易发生聚集效应,从而更有利于重组蛋白的溶解和折叠,在提高重组蛋白的溶解性,表达效率和生物活性等方面更具优势;3) Compared with the previously known positively charged solubilizing tag (Zbasic), the acidic solubilizing tag (Sacid) of the present invention is mutually repelled by the acidic proteins widely present in the body, and is less likely to cause aggregation effects, thereby being more conducive to the dissolution and folding of recombinant proteins, and having more advantages in improving the solubility, expression efficiency and biological activity of recombinant proteins;
4)本发明标签在亲和纯化应用中同样能获得高纯度的目标蛋白,说明改造并没有对亲和纯化产生影响。4) The tag of the present invention can also obtain high-purity target protein in affinity purification applications, indicating that the modification has no effect on affinity purification.
如本文所用,“任选”或“任选地”是指随后描述的事件或情况发生或不发生,该描述包括其中所述事件或情况发生及不发生的情况。例如,任选包括的步骤是指该步骤存在或不存在。As used herein, "optional" or "optionally" means that the subsequently described event or circumstance occurs or does not occur, and the description includes instances where the event or circumstance occurs and instances where it does not occur. For example, an optionally included step means that the step exists or does not exist.
如本文所用,术语“约”是指包括具体数值的数值范围,本领域技术人员可以合理认为其类似于具体数值。在一些实施方案中,术语“约”是指在使用本领域通常接受的测量的标准误差内。在一些实施方案中,约是指到具体数值的+/-10%。As used herein, the term "about" refers to a numerical range that includes a specific value that a person skilled in the art would reasonably consider to be similar to the specific value. In some embodiments, the term "about" refers to within the standard error of measurement using commonly accepted measurements in the art. In some embodiments, approximately refers to +/-10% of a specific value.
虽然在此示出并描述了本发明的各个实施方案和各方面,但是本领域技术人员显然了解这些实施方案和各个方面只是举例说明本发明。本领域技术人员在不偏离本发明精神的范围内可以进行多种变化、改变和替代。应理解本文所述的本发明的实施方案的各种替代选择可用于本发明的实施中。Although various embodiments and various aspects of the present invention are shown and described herein, it is apparent to those skilled in the art that these embodiments and various aspects are only examples of the present invention. Those skilled in the art can make various changes, modifications and substitutions without departing from the scope of the spirit of the present invention. It should be understood that the various alternatives of the embodiments of the present invention described herein can be used in the implementation of the present invention.
下面将通过下述非限制性实施例进一步说明本发明,本领域技术人员公知,在不背离本发明精神的情况下,可以对本发明做出许多修改,这样的修改也落入本发明的范围。The present invention will be further described by the following non-limiting examples. It is well known to those skilled in the art that many modifications may be made to the present invention without departing from the spirit of the present invention, and such modifications also fall within the scope of the present invention.
下述实验方法如无特别说明,均为常规方法,所使用的实验材料如无特别说明,均可容易地从商业公司获取。Unless otherwise specified, the following experimental methods are conventional methods, and the experimental materials used are easily available from commercial companies unless otherwise specified.
常规质粒构建所涉及的PCR、酶切、连接等实验,以及蛋白质表达所涉及的转化、细菌培养等实验为本领域研究人员所熟悉,所以具体相关实验细节没有详细注明,具体可参照分子克隆实验指南[J.萨姆布鲁克等,分子克隆实验指南(第三版)[M],科学出版社,2002]所述常规实验条件。The PCR, enzyme digestion, ligation and other experiments involved in conventional plasmid construction, as well as the transformation, bacterial culture and other experiments involved in protein expression are familiar to researchers in this field, so the specific relevant experimental details are not specified in detail. For details, please refer to the conventional experimental conditions described in the Molecular Cloning Experiment Guide [J. Sambrook et al., Molecular Cloning Experiment Guide (3rd Edition) [M], Science Press, 2002].
除实施例中使用的具体方法、设备、材料外,根据本技术领域的技术人员对现有技术的掌握及本发明的记载,还可以使用与本发明实施例中所述的方法、设备、材料相似或等同的现有技术的任何方法、设备和材料来实现本发明。In addition to the specific methods, equipment, and materials used in the embodiments, based on the understanding of the prior art by those skilled in the art and the description of the present invention, any methods, equipment, and materials of the prior art that are similar or equivalent to the methods, equipment, and materials described in the embodiments of the present invention may also be used to implement the present invention.
实施例Example
实施例1:酸性标签设计Example 1: Acid tag design
基于葡萄球菌蛋白A的B结构域所示的氨基酸序列,如SEQ ID NO:4所示;利用计算机辅助计算设计出酸性表面助溶短肽标签(Sacid,SEQ ID NO:1)(网址:https://www.expasy.org/resources/swiss-model),引入带负电荷的氨基酸,让带负电的氨基酸集中在表面。此标签仅为预测酸性标签的一种表现形式。Based on the amino acid sequence of the B domain of Staphylococcus protein A, as shown in SEQ ID NO: 4, an acidic surface solubilizing peptide tag (Sacid, SEQ ID NO: 1) was designed using computer-aided calculations (URL: https://www.expasy.org/resources/swiss-model), which introduced negatively charged amino acids to concentrate them on the surface. This tag is only a form of predicted acidic tag.
Swiss-Model是一款预测蛋白质结构模型的工具,它利用同源建模的方法实现对一段未知序列的三级结构的预测。Swiss-Model is a tool for predicting protein structure models. It uses homology modeling to predict the tertiary structure of an unknown sequence.
实施例2:构建目的蛋白表达载体Example 2: Construction of target protein expression vector
将目的蛋白编码基因序列与酸性表面助溶短肽标签(Sacid)编码基因序列融合后获得目的蛋白表达单元(其中Sacid氨基酸序列位于目的蛋白氨基酸序列上游),并将所述目的蛋白表达单元插入表达质粒pET-28a(Novagen,USA)上,人工合成获得目的蛋白表达载体pET-28a/Sacid-NsiI。The target protein encoding gene sequence was fused with the acidic surface solubilizing short peptide tag (Sacid) encoding gene sequence to obtain the target protein expression unit (wherein the Sacid amino acid sequence is located upstream of the target protein amino acid sequence), and the target protein expression unit was inserted into the expression plasmid pET-28a (Novagen, USA) to obtain the target protein expression vector pET-28a/Sacid-NsiI by artificial synthesis.
实施例3:单独表达NsiI蛋白Example 3: Expression of NsiI protein alone
将NsiI核苷酸序列(编码氨基酸序列SEQ ID NO:5)插入到原核表达载体pET28a,得到表达载体pET28a/NsiI,通过测序验证载体构建正确后,将质粒按照碱裂解法抽提并转化表达宿主菌Rosetta(DE3)(天根生化科技(北京)有限公司)。将含有重组质粒pET28a/NsiI的表达宿主菌单菌落接种于卡那霉素 (50μg/mL)和氯霉素(34μg/mL)的LB液体培养中,37℃,220rpm培养16小时,作为种子菌。The NsiI nucleotide sequence (encoding amino acid sequence SEQ ID NO: 5) was inserted into the prokaryotic expression vector pET28a to obtain the expression vector pET28a/NsiI. After the vector construction was verified to be correct by sequencing, the plasmid was extracted by alkaline lysis and transformed into the expression host bacteria Rosetta (DE3) (Tiangen Biochemical Technology (Beijing) Co., Ltd.). A single colony of the expression host bacteria containing the recombinant plasmid pET28a/NsiI was inoculated in LB liquid culture containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) and cultured at 37°C, 220rpm for 16 hours as seed bacteria.
取种子菌按1:50体积比接种于无菌的卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB培养基中,37℃,200rpm振荡培养至合适对数生长期(OD600=0.6-0.8),加入0.5mM IPTG,27℃诱导表达4h-16h,收集菌液并离心,将菌体沉淀按照10%-20%浓度重悬到20mM Tris-HCl、300mM NaCl pH 8.0的缓冲液中,超声破碎细胞。裂解菌液8000rpm离心30min,分离上清和沉淀,进行SDS-PAGE鉴定目的蛋白的表达情况。Take the seed bacteria and inoculate them in a sterile LB medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a volume ratio of 1:50. Cultivate at 37°C, 200rpm shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), add 0.5mM IPTG, induce expression at 27°C for 4h-16h, collect the bacterial solution and centrifuge, resuspend the bacterial precipitate in 20mM Tris-HCl, 300mM NaCl pH 8.0 buffer at a concentration of 10%-20%, and ultrasonically disrupt the cells. The lysed bacterial solution was centrifuged at 8000rpm for 30min, the supernatant and precipitate were separated, and SDS-PAGE was performed to identify the expression of the target protein.
结果如图5的A图所示,NsiI目的蛋白单独表达,其分子量约50kDa,从SDS-PAGE难以看到箭头所指位置的明显目的条带,其表达量不高,蛋白明显减少甚至完全损失且纯化后杂蛋白较多。The results are shown in Figure 5A. The NsiI target protein was expressed alone, with a molecular weight of about 50 kDa. It was difficult to see the obvious target band at the position indicated by the arrow from SDS-PAGE. Its expression level was not high, the protein was significantly reduced or even completely lost, and there were many impurity proteins after purification.
实施例4:利用NeonGreen作为助溶肽与NsiI融合表达Example 4: Using NeonGreen as a solubilizing peptide and fusion expression with NsiI
(1)NeonGreen与NsiI融合表达载体和宿主菌的构建(1) Construction of NeonGreen and NsiI fusion expression vector and host bacteria
本实验中采用NeonGreen的氨基酸序列如SEQ ID NO:6所示。The amino acid sequence of NeonGreen used in this experiment is shown in SEQ ID NO: 6.
将NeonGreen和NsiI(SEQ ID NO:5)及原核表达载体pET28a的基因通过人工合成连接到一起(其中NeonGreen氨基酸序列位于NsiI蛋白氨基酸序列上游),获得目的蛋白表达载体pET-28a/NeonGreen-NsiI。The genes of NeonGreen and NsiI (SEQ ID NO: 5) and the prokaryotic expression vector pET28a were connected together through artificial synthesis (where the NeonGreen amino acid sequence was located upstream of the NsiI protein amino acid sequence) to obtain the target protein expression vector pET-28a/NeonGreen-NsiI.
(2)重组NeonGreen-NsiI蛋白的表达与纯化(2) Expression and purification of recombinant NeonGreen-NsiI protein
将重组质粒pET-28a/NeonGreen-NsiI转化表达宿主菌Rosetta(DE3),挑取表达宿主菌单菌落接种于卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养中,37℃、220rpm培养12小时,作为种子菌。The recombinant plasmid pET-28a/NeonGreen-NsiI was transformed into the expression host bacteria Rosetta (DE3), and a single colony of the expression host bacteria was picked and inoculated into LB liquid culture containing kanamycin (50 μg/mL) and chloramphenicol (34 μg/mL), and cultured at 37°C and 220 rpm for 12 hours as seed bacteria.
取种子菌按1:50体积比接种于无菌的卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养基中,37℃、200rpm振荡培养至合适对数生长期(OD600=0.6-0.8),添加IPTG诱导剂,将温度调到27℃培养4-16小时。将200mL诱导后的菌液进行离心收菌,用30mL 20mM Tris HCl、300mM NaCl(pH8.0)重悬菌体,然后超声破碎。将裂解菌液8000rpm离心30min,分离上清和沉淀,上清用0.45uM滤器过滤去除不溶性的菌体蛋白,并将沉淀用去离子水或PBS重悬,对表达产物进行SDS-PAGE分析。Take the seed bacteria and inoculate them in a sterile LB liquid medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a volume ratio of 1:50. Cultivate at 37°C and 200rpm with shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), add IPTG inducer, adjust the temperature to 27°C and culture for 4-16 hours. Centrifuge 200mL of the induced bacterial solution to collect the bacteria, resuspend the bacteria with 30mL 20mM Tris HCl, 300mM NaCl (pH8.0), and then ultrasonically disrupt. Centrifuge the lysed bacterial solution at 8000rpm for 30min, separate the supernatant and precipitate, filter the supernatant with a 0.45uM filter to remove insoluble bacterial proteins, and resuspend the precipitate with deionized water or PBS, and perform SDS-PAGE analysis on the expression product.
(3)结果分析(3) Results analysis
如图5的B图所示,采用NeonGreen作为助溶肽,NeonGreen-NsiI融合蛋白的表达情况,分子量约70KD,箭头指向融合蛋白,从图可知,采用NeonGreen作为助溶蛋白融合表达的方式,虽然也能够实现融合蛋白的纯化和富集,但由于自身分子量较大,仍然存在折叠时间较长,助溶效果有限和易干扰重组蛋白活性等缺陷,从图中可看出大部分蛋白以不溶性包涵体存在且融合蛋白易降解。As shown in Figure 5B, NeonGreen is used as a solubilizing peptide, and the expression of the NeonGreen-NsiI fusion protein has a molecular weight of about 70KD. The arrow points to the fusion protein. As can be seen from the figure, although the use of NeonGreen as a solubilizing protein fusion expression method can also achieve the purification and enrichment of the fusion protein, due to its large molecular weight, it still has defects such as long folding time, limited solubilizing effect and easy interference with the activity of the recombinant protein. It can be seen from the figure that most of the protein exists in the form of insoluble inclusion bodies and the fusion protein is easily degraded.
实施例5:已发表的正电荷助溶标签(Zbasic)与NsiI融合表达Example 5: Fusion expression of published positively charged solubility-enhancing tag (Zbasic) and NsiI
(1)已发表的正电荷助溶标签(Zbasic)与NsiI融合表达载体和宿主菌的构建(1) Construction of published positively charged solubilizing tag (Zbasic) and NsiI fusion expression vector and host bacteria
已发表的正电荷助溶标签Zbasic[Hedhammar M,Hober S,Z(basic)--a novel purification tag for efficient protein recovery,Journal of Chromatography A,2007,1161(1-2):22-28]的氨基酸序列如SEQ ID NO:7所示。如实施例4所示方法,将Zbasic编码序列与NsiI编码序列连接获得融合蛋白表达载体pET-28a/Zbasic-NsiI,其中Zbasic氨基酸序列位于NsiI氨基酸序列上游。The amino acid sequence of the published positively charged solubilizing tag Zbasic [Hedhammar M, Hober S, Z(basic)--a novel purification tag for efficient protein recovery, Journal of Chromatography A, 2007, 1161(1-2):22-28] is shown as SEQ ID NO: 7. As shown in Example 4, the Zbasic coding sequence and the NsiI coding sequence were connected to obtain a fusion protein expression vector pET-28a/Zbasic-NsiI, wherein the Zbasic amino acid sequence is located upstream of the NsiI amino acid sequence.
(2)已发表的正电荷助溶标签(Zbasic)与NsiI重组蛋白的表达与纯化(2) Expression and purification of published positively charged solubilizing tags (Zbasic) and NsiI recombinant proteins
将质粒转化表达宿主菌Rosetta(DE3),得到目的工程菌株。具体实施例均采用实验室小规模摇瓶表达。在37℃条件下含卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养基中振荡活化16h,之后将过夜培养物按1:50比例转到新的含有卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB培养基(约200mL)中,37℃振荡培养至合适对数生长期(OD600=0.6-0.8),27℃,0.5mM IPTG诱导表达4h-16h。The plasmid was transformed into the expression host bacteria Rosetta (DE3) to obtain the target engineered strain. The specific embodiments all adopted laboratory small-scale shake flask expression. Oscillating activation was carried out in LB liquid medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at 37°C for 16h, and then the overnight culture was transferred to a new LB medium (about 200mL) containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a ratio of 1:50, and cultured at 37°C with shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), and induced expression at 27°C with 0.5mM IPTG for 4h-16h.
取诱导菌液1mL,8000rpm于4℃离心2min后弃上清,收集菌体,用200μL体积的PBS重悬后直接加入50μL体积的5×SDS上样缓冲液(300mM Tris-HCl(pH 6.8)、20%β-巯基乙醇、20%SDS、25%甘油、0.05%溴酚蓝),沸水浴10min,-20℃保存。Take 1 mL of the induced bacterial solution, centrifuge at 8000 rpm at 4°C for 2 min, discard the supernatant, collect the bacteria, resuspend them with 200 μL of PBS, and directly add 50 μL of 5×SDS loading buffer (300 mM Tris-HCl (pH 6.8), 20% β-mercaptoethanol, 20% SDS, 25% glycerol, 0.05% bromophenol blue), boil in a boiling water bath for 10 min, and store at -20°C.
取诱导样品200mL,8000rpm于4℃离心20min后弃上清,收集菌体,用30mL体积的Tris HCl缓冲液重悬,加入1%Triton X-100,1×Cokitail蛋白酶抑制剂和1mM PMSF后混匀,超声破碎30min。全菌裂解样品12000rpm于4℃离心20min,上清、沉淀分离。取40μL体积的上清,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存。将沉淀用1mL体积的PBS重悬后取其中40μL加入10μL体积5×SDS上样缓冲液,沸水浴10min,-20℃保存。Take 200 mL of the induced sample, centrifuge at 8000 rpm at 4°C for 20 min, discard the supernatant, collect the bacteria, resuspend with 30 mL of Tris HCl buffer, add 1% Triton X-100, 1× Cokitail protease inhibitor and 1 mM PMSF, mix well, and ultrasonically disrupt for 30 min. Centrifuge the whole bacterial lysis sample at 12000 rpm at 4°C for 20 min, separate the supernatant and precipitate. Take 40 μL of the supernatant, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C. Resuspend the precipitate with 1 mL of PBS, take 40 μL of it, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C.
将破碎上清用亲和层析柱获得纯化的融合蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,通过Strep Tag II亲和层析柱纯化得到目的蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,作为目的蛋白样品。上述各种样品在电泳检测中的稀释度相同,其蛋白含量具有直接可比性。The supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃. The target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃ as the target protein sample. The above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
(3)蛋白电泳(SDS-PAGE)(3) Protein electrophoresis (SDS-PAGE)
SDS-PAGE凝胶按常规方法配制。对已制备好的蛋白样品等体积上样,进行10%SDS-PAGE检测。先恒压80V电泳30min,再恒压125V。电泳结束后进行考马斯亮蓝R-250染色,经脱色后得电泳结果。SDS-PAGE gel was prepared according to the conventional method. The prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE. The electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
(4)结果分析(4) Results analysis
图5的C图为采用已发表的正电荷助溶标签(Zbasic)表达的NsiI蛋白,分子量约45kDa,箭头指示位置为目的蛋白条带。从图5的D图可知,已发表的正电荷助溶标签(Zbasic)与Sacid相比,蛋白的表达量相对较差。Figure 5 C shows the NsiI protein expressed using a published positively charged solubilizing tag (Zbasic), with a molecular weight of about 45 kDa, and the arrow indicates the target protein band. As can be seen from Figure 5 D, the protein expression level of the published positively charged solubilizing tag (Zbasic) is relatively poor compared to Sacid.
实施例6:Sacid与目的蛋白NsiI融合表达Example 6: Fusion expression of Sacid and target protein NsiI
(1)Sacid与NsiI融合表达载体和宿主菌的构建(1) Construction of Sacid and NsiI fusion expression vector and host bacteria
将Sacid(SEQ ID NO:1)和NsiI(SEQ ID NO:5)及原核表达载体pET28a的基因通过人工合成连接到一起,得到表达融合蛋白的重组表达质粒 pET-28a/Sacid-NsiI,其中Sacid氨基酸序列位于NsiI蛋白氨基酸序列上游。The genes of Sacid (SEQ ID NO: 1), NsiI (SEQ ID NO: 5) and the prokaryotic expression vector pET28a were connected together by artificial synthesis to obtain the recombinant expression plasmid pET-28a/Sacid-NsiI expressing the fusion protein, in which the Sacid amino acid sequence was located upstream of the NsiI protein amino acid sequence.
(2)重组Sacid-NsiI的表达与纯化(2) Expression and purification of recombinant Sacid-NsiI
将质粒转化表达宿主菌Rosetta(DE3)(购自天根生化科技(北京)有限公司),得到目的工程菌株。具体实施例均采用实验室小规模摇瓶表达。在37℃条件下含卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养基中振荡活化16h,之后将过夜培养物按1:50比例转到新的含有卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB培养基(约200mL)中,37℃振荡培养至合适对数生长期(OD600=0.6-0.8),27℃,0.5mM IPTG诱导表达4h-16h。The plasmid was transformed into the expression host bacteria Rosetta (DE3) (purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.) to obtain the target engineered strain. The specific embodiments all adopted laboratory small-scale shake flask expression. Oscillating activation was carried out in LB liquid culture medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at 37°C for 16h, and then the overnight culture was transferred to a new LB culture medium (about 200mL) containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a ratio of 1:50, and cultured at 37°C with shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), and induced expression at 27°C with 0.5mM IPTG for 4h-16h.
取诱导菌液1mL,8000rpm于4℃离心2min后弃上清,收集菌体,用200μL体积的PBS重悬后直接加入50μL体积的5×SDS上样缓冲液(300mM Tris-HCl(pH 6.8)、20%β-巯基乙醇、20%SDS、25%甘油、0.05%溴酚蓝),沸水浴10min,-20℃保存。Take 1 mL of the induced bacterial solution, centrifuge at 8000 rpm at 4°C for 2 min, discard the supernatant, collect the bacteria, resuspend them with 200 μL of PBS, and directly add 50 μL of 5×SDS loading buffer (300 mM Tris-HCl (pH 6.8), 20% β-mercaptoethanol, 20% SDS, 25% glycerol, 0.05% bromophenol blue), boil in a boiling water bath for 10 min, and store at -20°C.
取诱导样品200mL,8000rpm于4℃离心20min后弃上清,收集菌体,用30mL体积的Tris HCl缓冲液重悬,加入1%Triton X-100,1×Cokitail蛋白酶抑制剂和1mM PMSF后混匀,超声破碎30min。全菌裂解样品12000rpm于4℃离心20min,上清、沉淀分离。取40μL体积的上清,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存。将沉淀用lmL体积的PBS重悬后取其中40μL加入10μL体积5×SDS上样缓冲液,沸水浴10min,-20℃保存。Take 200 mL of the induced sample, centrifuge at 8000 rpm at 4°C for 20 min, discard the supernatant, collect the bacteria, resuspend with 30 mL of Tris HCl buffer, add 1% Triton X-100, 1× Cokitail protease inhibitor and 1 mM PMSF, mix well, and ultrasonically disrupt for 30 min. Centrifuge the whole bacterial lysis sample at 12000 rpm at 4°C for 20 min, separate the supernatant and precipitate. Take 40 μL of the supernatant, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C. Resuspend the precipitate with 1 mL of PBS, take 40 μL of it, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C.
将破碎上清用亲和层析柱获得纯化的融合蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,通过Strep Tag II亲和层析柱纯化得到目的蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,作为目的蛋白样品。上述各种样品在电泳检测中的稀释度相同,其蛋白含量具有直接可比性。The supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃. The target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃ as the target protein sample. The above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
(3)蛋白电泳(SDS-PAGE)(3) Protein electrophoresis (SDS-PAGE)
SDS-PAGE凝胶按常规方法配制。对已制备好的蛋白样品等体积上样,进行10%SDS-PAGE检测。先恒压80V电泳30min,再恒压125V。电泳结束后进行考马斯亮蓝R-250染色,经脱色后得电泳结果。SDS-PAGE gel was prepared according to the conventional method. The prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE. The electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
(4)结果分析(4) Results analysis
图5D为采用本发明方法融合表达Sacid-NsiI,分子量约45kDa,箭头指示位置在纯化后样品中有明显的目的蛋白条带,从图中看出采用本发明融合表达后目的蛋白表达量得到了大幅度提高;与其他标签相比,本发明Sacid标签明显提高了融合蛋白的可溶性。Figure 5D shows the fusion expression of Sacid-NsiI using the method of the present invention, with a molecular weight of about 45 kDa. There is an obvious target protein band in the purified sample at the position indicated by the arrow. It can be seen from the figure that the expression level of the target protein is greatly improved after the fusion expression of the present invention; compared with other tags, the Sacid tag of the present invention significantly improves the solubility of the fusion protein.
实施例7:利用NeonGreen作为助溶肽与TEVP融合表达Example 7: Using NeonGreen as a solubilizing peptide and fusion expression with TEVP
(1)NeonGreen与TEVP融合表达载体和宿主菌的构建(1) Construction of NeonGreen and TEVP fusion expression vector and host bacteria
本实验中采用NeonGreen的氨基酸序列如SEQ ID NO:6所示。The amino acid sequence of NeonGreen used in this experiment is shown in SEQ ID NO: 6.
将NeonGreen和TEVP(SEQ ID NO:8)及原核表达载体pET28a的基因通过人工合成连接到一起(其中NeonGreen氨基酸序列位于TEVP蛋白氨基酸序列上游),获得目的蛋白表达载体pET-28a/NeonGreen-TEVP。The genes of NeonGreen and TEVP (SEQ ID NO: 8) and the prokaryotic expression vector pET28a were connected together through artificial synthesis (where the NeonGreen amino acid sequence was located upstream of the TEVP protein amino acid sequence) to obtain the target protein expression vector pET-28a/NeonGreen-TEVP.
(2)重组NeonGreen-TEVP蛋白的表达与纯化(2) Expression and purification of recombinant NeonGreen-TEVP protein
将重组质粒pET-28a/NeonGreen-TEVP转化表达宿主菌Rosetta(DE3),挑取表达宿主菌单菌落接种于卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养中,37℃、220rpm培养12小时,作为种子菌。The recombinant plasmid pET-28a/NeonGreen-TEVP was transformed into the expression host bacteria Rosetta (DE3), and a single colony of the expression host bacteria was picked and inoculated into LB liquid culture containing kanamycin (50 μg/mL) and chloramphenicol (34 μg/mL), and cultured at 37°C and 220 rpm for 12 hours as seed bacteria.
取种子菌按1:50体积比接种于无菌的卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养基中,37℃、200rpm振荡培养至合适对数生长期(OD600=0.6-0.8),添加IPTG诱导剂,将温度调到27℃培养4-16小时。将200mL诱导后的菌液进行离心收菌,用30mL 20mM Tris HCl、300mM NaCl(pH8.0)重悬菌体,然后超声破碎。将裂解菌液8000rpm离心30min,分离上清和沉淀,上清用0.45uM滤器过滤去除不溶性的菌体蛋白,并将沉淀用去离子水或PBS重悬,对表达产物进行SDS-PAGE及Western印迹分析。Take the seed bacteria and inoculate them in a sterile LB liquid medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a volume ratio of 1:50. Cultivate at 37℃ and 200rpm with shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), add IPTG inducer, adjust the temperature to 27℃ and culture for 4-16 hours. Centrifuge 200mL of the induced bacterial solution to collect the bacteria, resuspend the bacteria with 30mL 20mM Tris HCl, 300mM NaCl (pH8.0), and then ultrasonically disrupt. Centrifuge the lysed bacterial solution at 8000rpm for 30min, separate the supernatant and precipitate, filter the supernatant with a 0.45uM filter to remove insoluble bacterial proteins, and resuspend the precipitate with deionized water or PBS, and perform SDS-PAGE and Western blot analysis on the expression product.
(3)结果分析(3) Result analysis
如图6的A图所示,采用NeonGreen作为助溶肽,NeonGreen-TEVP融合蛋白破碎上清的表达情况,NT分子量约为70KD,箭头指向融合蛋白,从图A的SDS PAGE尚不能明显区分目的条带,为了更好的观察两者的条带,将蛋白进行Western印迹,结果如图6B的NT所示,从Western印迹的结果中可知,NeonGreen-TEVP融合蛋白未表达。As shown in Figure 6A, NeonGreen was used as the solubilizing peptide to express the NeonGreen-TEVP fusion protein in the supernatant. The NT molecular weight was about 70KD, and the arrow pointed to the fusion protein. The target bands could not be clearly distinguished from the SDS PAGE in Figure A. In order to better observe the two bands, the proteins were subjected to Western blotting. The results are shown in Figure 6B, NT. From the results of Western blotting, it can be seen that the NeonGreen-TEVP fusion protein was not expressed.
实施例8:Sacid与目的蛋白TEVP融合表达Example 8: Fusion expression of Sacid and target protein TEVP
(1)Sacid与TEVP融合表达载体和宿主菌的构建(1) Construction of Sacid and TEVP fusion expression vector and host bacteria
将Sacid(SEQ ID NO:1)和TEVP(SEQ ID NO:8)及原核表达载体pET28a的基因通过人工合成连接到一起(其中Sacid氨基酸序列位于TEVP蛋白氨基酸序列上游),得到重组表达质粒pET-28a/Sacid-TEVP。The genes of Sacid (SEQ ID NO: 1), TEVP (SEQ ID NO: 8) and the prokaryotic expression vector pET28a were connected together through artificial synthesis (where the Sacid amino acid sequence was located upstream of the TEVP protein amino acid sequence) to obtain the recombinant expression plasmid pET-28a/Sacid-TEVP.
(2)重组Sacid-TEVP的表达与纯化(2) Expression and purification of recombinant Sacid-TEVP
将质粒转化表达宿主菌Rosetta(DE3)(购自天根生化科技(北京)有限公司),得到目的工程菌株。具体实施例均采用实验室小规模摇瓶表达。在37℃条件下含卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB液体培养基中振荡活化16h,之后将过夜培养物按1:50比例转到新的含有卡那霉素(50μg/mL)和氯霉素(34μg/mL)的LB培养基(约200mL)中,37℃振荡培养至合适对数生长期(OD600=0.6-0.8),27℃,0.5mM IPTG诱导表达4h-16h。The plasmid was transformed into the expression host bacteria Rosetta (DE3) (purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.) to obtain the target engineered strain. The specific embodiments all adopted laboratory small-scale shake flask expression. Oscillating activation was carried out in LB liquid culture medium containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at 37°C for 16h, and then the overnight culture was transferred to a new LB culture medium (about 200mL) containing kanamycin (50μg/mL) and chloramphenicol (34μg/mL) at a ratio of 1:50, and cultured at 37°C with shaking until the appropriate logarithmic growth phase (OD600=0.6-0.8), and induced expression at 27°C with 0.5mM IPTG for 4h-16h.
取诱导菌液1mL,8000rpm于4℃离心2min后弃上清,收集菌体,用200μL体积的PBS重悬后直接加入50μL体积的5×SDS上样缓冲液(300mM Tris-HCl(pH 6.8)、20%β-巯基乙醇、20%SDS、25%甘油、0.05%溴酚蓝),沸水浴10min,-20℃保存。Take 1 mL of the induced bacterial solution, centrifuge at 8000 rpm at 4°C for 2 min, discard the supernatant, collect the bacteria, resuspend them with 200 μL of PBS, and directly add 50 μL of 5×SDS loading buffer (300 mM Tris-HCl (pH 6.8), 20% β-mercaptoethanol, 20% SDS, 25% glycerol, 0.05% bromophenol blue), boil in a boiling water bath for 10 min, and store at -20°C.
取诱导样品200mL,8000rpm于4℃离心20min后弃上清,收集菌体,用30mL体积的Tris HCl缓冲液重悬,加入1%Triton X-100,1×Cokitail蛋白酶抑制剂和1mM PMSF后混匀,超声破碎30min。全菌裂解样品12000rpm于4℃离心20min,上清、沉淀分离。取40μL体积的上清,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存。将沉淀用1mL体积的PBS重悬后取其中40μL加入 10μL体积5×SDS上样缓冲液,沸水浴10min,-20℃保存。Take 200 mL of the induced sample, centrifuge at 8000 rpm at 4°C for 20 min, discard the supernatant, collect the bacteria, resuspend with 30 mL of Tris HCl buffer, add 1% Triton X-100, 1× Cokitail protease inhibitor and 1 mM PMSF, mix well, and ultrasonically disrupt for 30 min. Centrifuge the whole bacterial lysis sample at 12000 rpm at 4°C for 20 min, separate the supernatant and precipitate. Take 40 μL of the supernatant, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C. Resuspend the precipitate with 1 mL of PBS, take 40 μL of it, add 10 μL of 5×SDS loading buffer, boil in water bath for 10 min, and store at -20°C.
将破碎上清用亲和层析柱获得纯化的融合蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,通过Strep Tag II亲和层析柱纯化得到目的蛋白,取40uL洗脱液,加入10μL体积的5×SDS上样缓冲液,沸水浴10min,-20℃保存,作为目的蛋白样品。上述各种样品在电泳检测中的稀释度相同,其蛋白含量具有直接可比性。The supernatant was purified by affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃. The target protein was purified by Strep Tag II affinity chromatography column, 40uL of eluate was added to 10μL of 5×SDS loading buffer, boiled in water bath for 10min, and stored at -20℃ as the target protein sample. The above samples had the same dilution in electrophoresis detection, and their protein contents were directly comparable.
(3)蛋白电泳(SDS-PAGE)及Western印迹(3) Protein electrophoresis (SDS-PAGE) and Western blotting
SDS-PAGE凝胶按常规方法配制。对已制备好的蛋白样品等体积上样,进行10%SDS-PAGE检测。先恒压80V电泳30min,再恒压125V。电泳结束后进行考马斯亮蓝R-250染色,经脱色后得电泳结果。SDS-PAGE gel was prepared according to the conventional method. The prepared protein samples were loaded with equal volumes and tested by 10% SDS-PAGE. The electrophoresis was performed at a constant voltage of 80 V for 30 min, and then at a constant voltage of 125 V. After the electrophoresis, Coomassie Brilliant Blue R-250 staining was performed, and the electrophoresis results were obtained after decolorization.
将胶上的蛋白湿转到PVDF膜,用5%脱脂奶粉封闭2h,一抗4℃孵育过夜,与HRP标记的二抗孵育室温孵育1h,利用化学发光试剂盒和GelView 6000Plus设备进行曝光。目的条带如图6B的ST所示。The proteins on the gel were wet transferred to a PVDF membrane, blocked with 5% skim milk powder for 2 h, incubated with primary antibody at 4°C overnight, incubated with HRP-labeled secondary antibody at room temperature for 1 h, and exposed using a chemiluminescence kit and GelView 6000Plus equipment. The target band is shown in ST in Figure 6B.
(4)结果分析(4) Results analysis
图6A和B图的泳道ST为采用本发明方法融合表达Sacid-TEVP,分子量约39kDa,箭头指示位置在破碎上清样品中有明显的目的蛋白条带且Western印迹结果与其一致。Lane ST in Figures 6A and B is Sacid-TEVP fusion expressed by the method of the present invention, with a molecular weight of about 39 kDa. The arrow indicates the position where there is an obvious target protein band in the supernatant sample and the Western blotting results are consistent with it.
从图6可以看出采用本发明融合表达后目的蛋白表达量得到了大幅度提高;与其他标签相比,本发明Sacid标签明显提高了融合蛋白的可溶性。It can be seen from FIG6 that the expression level of the target protein is greatly improved after the fusion expression of the present invention; compared with other tags, the Sacid tag of the present invention significantly improves the solubility of the fusion protein.
实施例9:纯化蛋白活性测试Example 9: Purified protein activity test
将得到的蛋白洗脱液进行超滤置换缓冲液,将蛋白保存在Tris HCl缓冲液中(20mM Tris HCl,20mM NaCl,50%甘油),通过酶切质粒验证蛋白活性。The obtained protein eluate was subjected to ultrafiltration to replace the buffer, and the protein was stored in Tris HCl buffer (20mM Tris HCl, 20mM NaCl, 50% glycerol), and the protein activity was verified by enzyme digestion of the plasmid.
使用含单个NsiI酶切位点和单个HindIII酶切位点的质粒做底物,进行双酶切反应,对提纯的NsiI酶进行梯度(10
1~10
8倍)稀释后进行双酶切反应,每个稀释度作为一个样品,37℃,酶切1小时,进行琼脂糖凝胶电泳。酶及质粒用量均相同,具有可比性。
The plasmid containing a single NsiI restriction site and a single HindIII restriction site was used as a substrate for double restriction digestion. The purified NsiI enzyme was diluted in a gradient (10 1 to 10 8 times) and then subjected to double restriction digestion reaction. Each dilution was used as a sample. The enzymes were digested for 1 hour at 37°C and then subjected to agarose gel electrophoresis. The enzyme and plasmid dosages were the same and comparable.
表1双酶切反应体系Table 1 Double enzyme digestion reaction system
结果如图7所示,使用含单个NsiI酶切位点和单个HindIII酶切位点的质粒做底物,进行双酶切反应后,得到3392bp和1987bp的目标条带。A图为单独表达的NsiI蛋白和NeonGreen-NsiI蛋白的双酶切鉴定;B图为已发表的正电荷助溶标签(Zbasic)表达的NsiI蛋白和本发明Sacid-NsiI蛋白的双酶切鉴定。从 图中可以看出,单独表达的NsiI蛋白几乎无活性;NeonGreen-NsiI融合蛋白及已发表的正电荷助溶标签(Zbasic)表达的NsiI蛋白活性基本一致;而采用本发明技术以Sacid作为纯化标签,纯化的蛋白Sacid-NsiI的活性得到大幅度提高,比NeonGreen-NsiI融合蛋白及已发表的正电荷助溶标签(Zbasic)表达的NsiI蛋白活性均均较高。The results are shown in Figure 7. Using a plasmid containing a single NsiI restriction site and a single HindIII restriction site as a substrate, after double enzyme digestion reaction, target bands of 3392bp and 1987bp were obtained. Figure A shows the double enzyme digestion identification of the NsiI protein expressed alone and the NeonGreen-NsiI protein; Figure B shows the double enzyme digestion identification of the NsiI protein expressed by the published positive charge solubility tag (Zbasic) and the Sacid-NsiI protein of the present invention. It can be seen from the figure that the NsiI protein expressed alone is almost inactive; the activities of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the published positive charge solubility tag (Zbasic) are basically the same; and the activity of the purified protein Sacid-NsiI is greatly improved by using the technology of the present invention with Sacid as the purification tag, which is higher than the activity of the NsiI protein expressed by the NeonGreen-NsiI fusion protein and the published positive charge solubility tag (Zbasic).
通过测定酶的比活力,结果如图8所示,本发明技术以Sacid作为纯化标签,纯化的蛋白Sacid-NsiI的酶活为496KU/mg、已知的正电荷助溶标签(Zbasic)表达的NsiI蛋白的酶活为5.5KU/mg、NeonGreen-NsiI融合蛋白的酶活为4.9KU/mg,单独表达的NsiI蛋白几乎无活性且存在杂蛋白污染问题。从图中可知,本发明技术以Sacid作为纯化标签,纯化的蛋白Sacid-NsiI的酶活是其他两种已知助溶标签纯化得到的酶的100倍左右。By measuring the specific activity of the enzyme, the results are shown in Figure 8. The technology of the present invention uses Sacid as a purification tag, and the enzyme activity of the purified protein Sacid-NsiI is 496KU/mg, the enzyme activity of the NsiI protein expressed by the known positively charged solubilizing tag (Zbasic) is 5.5KU/mg, and the enzyme activity of the NeonGreen-NsiI fusion protein is 4.9KU/mg. The NsiI protein expressed alone is almost inactive and has the problem of contamination by foreign proteins. As can be seen from the figure, the technology of the present invention uses Sacid as a purification tag, and the enzyme activity of the purified protein Sacid-NsiI is about 100 times that of the enzymes purified by the other two known solubilizing tags.
SEQ ID NO:1(Sacid)SEQ ID NO:1(Sacid)
Val-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asp-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Gln-Glu-Leu-Lys-Asp-Asp-Pro-Glu-Glu-Ser-Asp-Glu-Glu-Leu-Glu-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-LysVal-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asp-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Gln-Glu-Leu-Lys-Asp-Asp-Pro-Glu-Glu-Ser-Asp-Glu-Glu-Leu-Glu-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-Lys
SEQ ID NO:2(Sacid 1)SEQ ID NO:2(Sacid 1)
Val-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asp-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-LysVal-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asp-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-Lys
SEQ ID NO:3(Sacid 2)SEQ ID NO: 3 (Sacid 2)
Val-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Arg-Arg-Arg-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-LysVal-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Arg-Arg-Arg-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Asp-Asp-Leu-Asn-Asp-Ala-Gln-Pro-Lys
SEQ ID NO:4(葡萄球菌蛋白A的B结构域,Wt)SEQ ID NO: 4 (B domain of Staphylococcus protein A, Wt)
Val-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asn-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Gln-Ser-Leu-Lys-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Lys-Lys-Leu-Asn-Asp-Ala-Gln-Pro-LysVal-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Gln-Gln-Asn-Ala-Phe-Tyr-Glu-Ile-Leu-His-Leu-Pro-Asn-Leu-Asn-Glu-Glu-Gln-Arg-Asn-Ala-Phe-Ile-Gln-Ser-Leu-Lys-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Lys-Lys-Leu-Asn-Asp-Ala-Gln-Pro-Lys
SEQ ID NO:5(NsiI)SEQ ID NO: 5(NsiI)
Met-Ile-Asn-His-Ser-Ile-Leu-Lys-His-His-Ser-Phe-Thr-Gly-Lys-Ile-Ile-Ser-Ile-Leu-Lys-Asp-Glu-Phe-Gly-Asp-Asp-Ala-Ile-Tyr-Ile-Phe-Glu-Asn-Ser-Pro-Ile-Leu-Gly-Tyr-Leu-Asn-Ile-Lys-Thr-Lys-Ser-Ala-Glu-Arg-Gly-Ser-Lys-Ser-Arg-Gly-Ser-Phe-Ala-Asn-His-Tyr-Ala-Leu-Tyr-Val-Ile-Ile-Glu-Asp-Tyr-Ile-Asn-Lys-Gly-Tyr-Leu-Gly-As p-Asp-Leu-Asp-Tyr-Ser-Lys-Tyr-Asp-Gly-Ala-Lys-Phe-Thr-Asp-Leu-Phe-Arg-Arg-Gln-Arg-Glu-Leu-Pro-Phe-Gly-Ser-Lys-Leu-Gln-Asn-His-Ala-Leu-Asn-Ser-Arg-Leu-Asn-Asp-Glu-Phe-Lys-Lys-Phe-Phe-Pro-Thr-Leu-Gly-Ile-Val-Pro-Ile-Ile-Arg-Asp-Val-Arg-Thr-Ser-Arg-Tyr-Trp-Ile-Gln-Glu-Asp-Leu-Ile-Lys-Val-Ser-Val-Arg-Asn-Lys-Asn-Gly-Ile-Glu-Arg-Arg-Glu-Asn-Leu-Ala-Pro-Ser-Ile-Ile-Arg-Ile-Ile-Asp-Glu-Tyr-Ile-Ala-Thr-Lys-Lys-Glu-Ser-Phe-Glu-Leu-Phe-Leu-Lys-Thr-Cys-Gln-Glu-Ile-Ala-Asn-Leu-Ser-Ser-Ser-Asp-Pro-His-Ser-Val-Val-Lys-Phe-Ile-Gln-Glu-Gln-Leu-His-Pro-Ser-Ser-Asp-Ala-Arg-Val-Phe-Glu-Ile-Val-Ser-Tyr-Ala-Val-Leu-Lys-Glu-Arg-Tyr-Ser-Asn-Gln-Thr-Ile-Trp-Ile-Gly-Asp-Ser-Arg-Asp-Asp-Val-Ala-Glu-Glu-Ser-Leu-Val-Leu-Tyr-Lys-Thr-Gly-Arg-Thr-Asn-Ala-Asn-Asp-Gly-Gly-Ile-Asp-Phe-Val-Met-Lys-Pro-Leu-Gly-Arg-Phe-Phe-Gln-Val-Thr-Glu-Thr-Ile-Asp-Ala-Asn-Lys-Tyr-Phe-Leu-Asp-Ile-Asp-Lys-Val-Gln-Arg-Phe-Pro-Ile-Thr-Phe-Val-Val-Lys-Thr-Asn-Ser-Ser-Tyr-Glu-Glu-Ile-Glu-lys-Ile-Ile-Lys-Glu-Gln-Ala-Lys-Ala-Lys-Tyr-Asn-Ile-Glu-Ala-Ile-Val-Asn-Ser-Tyr-Met-Asp-Ser-Ile-Glu-Glu-Ile-Ile-Asn-Val-Pro-Asp-Leu-Met-Lys-Tyr-Phe-Glu-Glu-Met-Ile-Tyr-Ser-Asp-Ser-Leu-Lys-Arg-Ile-Met-Asp-Glu-Ile-Ile-Val-Gln-Ser-Lys-Val-Glu-Phe-Asn-Tyr-Glu-Glu-Asp-Val-SerMet-Ile-Asn-His-Ser-Ile-Leu-Lys-His-His-Ser-Phe-Thr-Gly-Lys-Ile-Ile-Ser-Ile-Leu-Lys-Asp-Glu-Phe-Gly-Asp-Asp-Ala-Ile-Tyr-Ile-Phe-Glu-Asn-Ser-Pro-Ile-Leu-Gly-Tyr-Leu-Asn-Ile-Lys-Thr-Lys-Ser-Ala-Glu-Arg-Gly-Ser-Lys-Ser-Arg-Gly-Ser-Phe-Ala-Asn-His-Tyr-Ala-Leu-Tyr-Val-Ile-Ile-Glu-Asp-Tyr-Ile-Asn-Lys-Gly-Tyr-Leu-Gly-As p-Asp-Leu-Asp-Tyr-Ser-Lys-Tyr-Asp-Gly-Ala-Lys-Phe-Thr-Asp-Leu-Phe-Arg-Arg-Gln-Arg-Glu-Leu-Pro-Phe-Gly-Ser-Lys-Leu-Gln-Asn-His-Ala-Leu-Asn-Ser-Arg-Leu-Asn-Asp-Glu-Phe-Lys-Lys-Phe-Phe-Pro-Thr-Leu-Gly-Ile-Val-Pro-Ile-Ile-Arg-Asp-Val-Arg-Thr-Ser-Arg-Tyr-Trp-Ile-Gln-Glu-Asp-Leu-Ile-Lys-Val-Ser-Val-Arg-Asn-Ly s-Asn-Gly-Ile-Glu-Arg-Arg-Glu-Asn-Leu-Ala-Pro-Ser-Ile-Ile-Arg-Ile-Ile-Asp-Glu-Tyr-Ile-Ala-Thr-Lys-Lys-Glu-Ser-Phe-Glu-Leu-Phe-Leu-Lys-Thr-Cys-Gln-Glu-Ile-Ala-Asn-Leu-Ser-Ser-Ser-Asp-Pro-His-Ser-Val-Val-Lys-Phe-Ile-Gln-Glu-Gln-Leu-His-Pro-Ser-Ser-Asp-Ala-Arg-Val-Phe-Glu-Ile-Val-Ser-Tyr-Ala-Val-Leu-Lys-Gl u-Arg-Tyr-Ser-Asn-Gln-Thr-Ile-Trp-Ile-Gly-Asp-Ser-Arg-Asp-Asp-Val-Ala-Glu-Glu-Ser-Leu-Val-Leu-Tyr-Lys-Thr-Gly-Arg-Thr-Asn-Ala-Asn-Asp-Gly-Gly-Ile-Asp-Phe-Val-Met-Lys-Pro-Leu-Gly-Arg-Phe-Phe-Gln-Val-Thr-Glu-Thr-Ile-Asp-Ala-Asn-Lys-Tyr-Phe-Leu-Asp-Ile-Asp-Lys-Val-Gln-Arg-Phe-Pro-Ile-Thr-Phe-Val-Val-Lys-Th r-Asn-Ser-Ser-Tyr-Glu-Glu-Ile-Glu-lys-Ile-Ile-Lys-Glu-Gln-Ala-Lys-Ala-Lys-Tyr-Asn-Ile-Glu-Ala-Ile-Val-Asn-Ser-Tyr-Met-Asp-Ser-Ile-Glu-Glu-Ile-Ile-Asn-Val-Pro-Asp-Leu-Met-Lys-Tyr-Phe-Glu-Glu-Met-Ile-Tyr-Ser-Asp-Ser-Leu-Lys-Arg-Ile-Met-Asp-Glu-Ile-Ile-Val-Gln-Ser-Lys-Val-Glu-Phe-Asn-Tyr-Glu-Glu-Asp-Val-Ser
SEQ ID NO:6(NeonGreen)SEQ ID NO: 6 (NeonGreen)
Met-Leu-Ser-Lys-Gly-Glu-Glu-Asp-Asn-Met-Ala-Ser-Leu-Pro-Ala-Thr-His-Glu-Leu-His-Ile-Phe-Gly-Ser-Ile-Asn-Gly-Val-Asp-Phe-Asp-Met-Val-Gly-Gln-Gly-Thr-Gly-Asn-Pro-Asn-Asp-Gly-Tyr-Glu-Glu-Leu-Asn-Leu-Lys-Ser-Thr-Lys-Gly-Asp-Leu-Gln-Phe-Ser-Pro-Trp-Ile-Leu-Val-Pro-His-Ile-Gly-Tyr-Gly-Phe-His-Gln-Tyr-Leu-Pro-Tyr-Pro-Asp-Gly-Met-Ser-Pro-Phe-Gln-Ala-Ala-Met-Val-Asp-Gly-Ser-Gly-Tyr-Gln-Val-His-Arg-Thr-Met-Gln-Phe-Glu-Asp-Gly-Ala-Ser-Leu-Thr-Val-Asn-Tyr-Arg-Tyr-Thr-Tyr-Glu-Gly-Ser-His-Ile-Lys-Gly-Glu-Ala-Gln-Val-Lys-Gly-Thr-Gly-Phe-Pro-Ala-Asp-Gly-Pro-Val-Met-Thr-Asn-Ser-Leu-Thr-Ala-Ala-Asp-Trp-Cys-Arg-Ser-Lys-Lys-Thr-Tyr-Pro-Asn-Asp-Lys-Thr-Ile-Ile-Ser-Thr-Phe-Lys-Trp-Ser-Tyr-Thr-Thr-Gly-Asn-Gly-Lys-Arg-Tyr-Arg-Ser-Thr-Ala-Arg-Thr-Thr-Tyr-Thr-Phe-Ala-Lys-Pro-Met-Ala-Ala-Asn-Tyr-Leu-Lys-Asn-Gln-Pro-Met-Tyr-Val-Phe-Arg-Lys-Thr-Glu-Leu-Lys-His-Ser-Lys-Thr-Glu-Leu-Asn-Phe-Lys-Glu-Trp-Gln-Lys-Ala-Phe-ThrMet-Leu-Ser-Lys-Gly-Glu-Glu-Asp-Asn-Met-Ala-Ser-Leu-Pro-Ala-Thr-His-Glu-Leu-His-Ile-Phe-Gly-Ser-Ile-Asn-Gly-Val-Asp-Phe-Asp-Met-Val-Gly-Gln-Gly-Thr-Gly-Asn-Pro-Asn-Asp-Gly-Tyr-Glu-Glu-Leu-Asn-Leu-Lys-Ser-Thr-Lys-Gly-Asp-Leu-G ln-Phe-Ser-Pro-Trp-Ile-Leu-Val-Pro-His-Ile-Gly-Tyr-Gly-Phe-His-Gln-Tyr-Leu-Pro-Tyr-Pro-Asp-Gly-Met-Ser-Pro-Phe-Gln-Ala-Ala-Met-Val-Asp-Gly-Ser-Gly-Tyr-Gln-Val-His-Arg-Thr-Met-Gln-Phe-Glu-Asp-Gly-Ala-Ser-Leu-Thr-Val-Asn-Tyr-Arg -Tyr-Thr-Tyr-Glu-Gly-Ser-His-Ile-Lys-Gly-Glu-Ala-Gln-Val-Lys-Gly-Thr-Gly-Phe-Pro-Ala-Asp-Gly-Pro-Val-Met-Thr-Asn-Ser-Leu-Thr-Ala-Ala-Asp-Trp-Cys-Arg-Ser-Lys-Lys-Thr-Tyr-Pro-Asn-Asp-Lys-Thr-Ile-Ile-Ser-Thr-Phe-Lys-Trp-Ser-Tyr-T hr-Thr-Gly-Asn-Gly-Lys-Arg-Tyr-Arg-Ser-Thr-Ala-Arg-Thr-Thr-Tyr-Thr-Phe-Ala-Lys-Pro-Met-Ala-Ala-Asn-Tyr-Leu-Lys-Asn-Gln-Pro-Met-Tyr-Val-Phe-Arg-Lys-Thr-Glu-Leu-Lys-His-Ser-Lys-Thr-Glu-Leu-Asn-Phe-Lys-Glu-Trp-Gln-Lys-Ala-Phe-Thr
SEQ ID NO:7(Zbasic)SEQ ID NO: 7 (Zbasic)
Val-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Arg-Arg-Arg-Ala-Arg-Arg-Glu-Ile-Arg-His-Leu-Pro-Asn-Leu-Asn-Arg-Glu-Gln-Arg-Arg-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Lys-Lys-Leu-Asn-Asp-Ala-Gln-Pro-LysVal-Asp-Asn-Lys-Phe-Asn-Lys-Glu-Arg-Arg-Arg-Ala-Arg-Arg-Glu-Ile-Arg-His-Leu-Pro-Asn-Leu-Asn-Arg-Glu-Gln-Arg-Arg-Ala-Phe-Ile-Arg-Ser-Leu-Arg-Asp-Asp-Pro-Ser-Gln-Ser-Ala-Asn-Leu-Leu-Ala-Glu-Ala-Lys-Lys-Leu-Asn-Asp-Ala-Gln-Pro-Lys
SEQ ID NO:8(TEVP)SEQ ID NO: 8 (TEVP)
Lys-Gly-Pro-Arg-Asp-Tyr-Asn-Pro-Ile-Ser-Ser-Ser-Ile-Cys-His-Leu-Thr-Asn-Glu-Ser-Asp-Gly-His-Thr-Thr-Ser-Leu-Tyr-Gly-Ile-Gly-Phe-Gly-Pro-Phe-Ile-Ile-Thr-Asn-L ys-His-Leu-Phe-Arg-Arg-Asn-Asn-Gly-Thr-Leu-Val-Val-Gln-Ser-Leu-His-Gly-Val-Phe-Lys-Val-Lys-Asp-Thr-Thr-Thr-Leu-Gln-Gln-His-Leu-Val-Asp-Gly-Arg-Asp-Met-Ile-Ile-Ile-Arg-Met-Pro-Lys-Asp-Phe-Pro-Pro-Phe-Pro-Gln-Lys-Leu-Lys-Phe-Arg-Glu-Pro-Gln-Arg-Glu-Glu-Arg-Ile-Cys-Leu-Val-Thr-Thr-Asn-Phe-Gln-Thr-Lys-Ser-Met-Ser-Ser-Met-Val-Ser-Asp-Thr-Ser-Cys-Thr-Phe-Pro-Ser-Gly-Asp-Gly-Ile-Phe-Trp-Lys-His-Trp-Ile-Gln-Thr-Lys-Asp-Gly-Gln-Cys-Gly-Ser-Pro-Leu-Val-Ser-Thr-Arg-Asp-Gly-Phe-Ile-Val-Gly-Ile-His-Ser-Ala-Ser-Asn-Phe-Thr-Asn-Thr-Asn-Asn-Tyr-Phe-Thr-Ser-Val-Pro-Lys-Asn-Phe-Met-Glu-Leu-Leu-Thr-Asn-Gln-Glu-Ala-Gln-Gln-Trp-Val-Ser-Gly-Trp-Arg-Leu-Asn-Ala-Asp-Ser-Val-Leu-Trp-Gly-Gly-His-Lys-Val-Phe-Met-Val-Lys-Pro-Glu-Glu-Pro-Phe-Gln-Pro-Val-Lys-Glu-Ala-Thr-Gln-Leu-Met-Asn-Glu-Gly。Lys-Gly-Pro-Arg-Asp-Tyr-Asn-Pro-Ile-Ser-Ser-Ser-Ile-Cys-His-Leu-Thr-Asn-Glu-Ser-Asp-Gly-His-Thr-Thr-Ser-Leu-Tyr-Gly-Ile-Gly-Phe-Gly-Pro-Phe-Ile-Ile-Thr-Asn-L ys-His-Leu-Phe-Arg-Arg-Asn-Asn-Gly-Thr-Leu-Val-Val-Gln-Ser-Leu-His-Gly-Val-Phe-Lys-Val-Lys-Asp-Thr-Thr-Thr-Leu-Gln-Gln-His-Leu-Val-Asp-Gly-Arg-Asp-Met-Ile-Ile-Ile-Arg-Met-Pro-Lys-Asp-Phe-Pro-Pr o-Phe-Pro-Gln-Lys-Leu-Lys-Phe-Arg-Glu-Pro-Gln-Arg-Glu-Glu-Arg-Ile-Cys-Leu-Val-Thr-Thr-Asn-Phe-Gln-Thr-Lys-Ser-Met-Ser-Ser-Met-Val-Ser-Asp-Thr-Ser-Cys-Thr-Phe-Pro-Ser-Gly-Asp-Gly-Ile-Phe-Trp-Lys- His-Trp-Ile-Gln-Thr-Lys-Asp-Gly-Gln-Cys-Gly-Ser-Pro-Leu-Val-Ser-Thr-Arg-Asp-Gly-Phe-Ile-Val-Gly-Ile-His-Ser-Ala-Ser-Asn-Phe-Thr-Asn-Thr-Asn-Asn-Tyr-Phe-Thr-Ser-Val-Pro-Lys-Asn-Phe-Met-Glu-Leu-Le u-Thr-Asn-Gln-Glu-Ala-Gln-Gln-Trp-Val-Ser-Gly-Trp-Arg-Leu-Asn-Ala-Asp-Ser-Val-Leu-Trp-Gly-Gly-His-Lys-Val-Phe-Met-Val-Lys-Pro-Glu-Glu-Pro-Phe-Gln-Pro-Val-Lys-Glu-Ala-Thr-Gln-Leu-Met-Asn-Glu-Gly.
Claims (13)
- 一种分离的肽,其包含氨基酸序列VDNKFNKEQQX 1AFYEILHLPNLNEEQRNAFIQX 2LKDDPX 3X 4SX 5X 6X 7LX 8EAX 9X 10LNDAQPK(SEQ ID NO:9)其中:X 1-X 10均是带负电荷氨基酸,优选为E或D, An isolated peptide comprises an amino acid sequence VDNKFNKEQQX 1 AFYEILHLPNLNEEQRNAFIQX 2 LKDDPX 3 X 4 SX 5 X 6 X 7 LX 8 EAX 9 X 10 LNDAQPK (SEQ ID NO: 9), wherein: X 1 -X 10 are all negatively charged amino acids, preferably E or D,优选地,所述肽的等电点pI等于或小于5.0。Preferably, the isoelectric point pi of the peptide is equal to or less than 5.0.
- 权利要求1的肽,其包含SEQ ID NO:1所示的氨基酸序列或由SEQ ID NO:1所示的氨基酸序列组成。The peptide of claim 1, which comprises the amino acid sequence shown in SEQ ID NO: 1 or consists of the amino acid sequence shown in SEQ ID NO: 1.
- 一种分离的多核苷酸,其编码权利要求1或2的肽。An isolated polynucleotide encoding the peptide of claim 1 or 2.
- 一种分离的融合蛋白,其包含第一肽和第二肽,其中所述第一肽是权利要求1或2的肽,所述第二肽是目的多肽,任选所述第二肽通过间隔物连接于所述第一肽。An isolated fusion protein comprises a first peptide and a second peptide, wherein the first peptide is the peptide of claim 1 or 2, the second peptide is a target polypeptide, and optionally the second peptide is linked to the first peptide via a spacer.
- 权利要求4的融合蛋白,其中:The fusion protein of claim 4, wherein:所述间隔物包含切割位点,优选选自化学切割位点、酶法切割位点和自切割位点,和/或,The spacer comprises a cleavage site, preferably selected from a chemical cleavage site, an enzymatic cleavage site and a self-cleavage site, and/or,所述融合蛋白还包含与所述第二肽连接的用于分离纯化的部分,例如6组氨酸标签、GST标签、Strep标签、Twin-Strep标签或MBP标签,和/或,The fusion protein further comprises a part connected to the second peptide for separation and purification, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag or an MBP tag, and/or,所述第二肽具有低于、等于或略大于7.0例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0的酸性或中性或弱碱性等电点,更优选所述第二肽的长度为20-500个氨基酸残基,例如包含SEQ ID NO:5所示氨基酸序列的NsiI蛋白或包含SEQ ID NO:8所示氨基酸序列的TEVP蛋白。The second peptide has an acidic, neutral or weakly basic isoelectric point lower than, equal to or slightly greater than 7.0, for example, equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0. More preferably, the length of the second peptide is 20-500 amino acid residues, for example, an NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
- 一种分离的多核苷酸,其包含编码权利要求4或5的融合蛋白的核苷酸序列。An isolated polynucleotide comprising a nucleotide sequence encoding the fusion protein of claim 4 or 5.
- 一种构建体、优选表达构建体,其包含权利要求6的多核苷酸。A construct, preferably an expression construct, comprising the polynucleotide of claim 6.
- 一种宿主细胞,其包含权利要求6的多核苷酸或权利要求7的构建体优选表达构建体,其中所述宿主细胞能够表达所述融合蛋白,优选地,所述宿主细胞选自原核生物、酵母和真核细胞例如哺乳动物细胞或昆虫细胞,更优选选自埃希氏菌属(Escherichia)、芽孢杆菌属(Bacillus)、沙门氏菌属(Salmonella)、假单胞菌属(Pseudomonas)和链霉菌属(Streptomyces),更优选选自埃希氏菌属,更优选是大肠杆菌(Escherichia coli)、枯草芽抱杆菌(Bacillus subtilis)或巨大芽抱杆菌(Bacillus megaterium),更优选大肠杆菌Rosetta(DE3)。A host cell comprising the polynucleotide of claim 6 or the construct of claim 7, preferably an expression construct, wherein the host cell is capable of expressing the fusion protein, preferably, the host cell is selected from prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3).
- 一种生产权利要求4或5的融合蛋白的方法,包括:A method for producing the fusion protein of claim 4 or 5, comprising:(a)在适合融合蛋白表达的条件下培养权利要求8的宿主细胞;和(a) culturing the host cell of claim 8 under conditions suitable for expression of the fusion protein; and(b)回收融合蛋白,(b) recovering the fusion protein,任选地,Optionally,(c)切割融合蛋白,释放目的多肽;和(c) cleaving the fusion protein to release the polypeptide of interest; and(d)回收所述目的多肽,(d) recovering the target polypeptide,优选地,所述目的多肽具有低于、等于或略大于7.0例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0的酸性或中性或弱碱性等电点,更优选是包含SEQ ID NO:5所示氨基酸序列的NsiI蛋白或包含SEQ ID NO:8所示氨基酸序列的TEVP蛋白。Preferably, the target polypeptide has an acidic, neutral or weakly basic isoelectric point lower than, equal to or slightly greater than 7.0, for example equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0, and more preferably is an NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
- 一种生产目的多肽的方法,包括:A method for producing a target polypeptide, comprising:(a)在宿主细胞中表达目的多肽与权利要求1或2的肽融合形成的融合蛋白,优选权利要求1或2的肽位于目的多肽的上游;(a) expressing a fusion protein formed by fusion of a target polypeptide and the peptide of claim 1 or 2 in a host cell, wherein the peptide of claim 1 or 2 is preferably located upstream of the target polypeptide;(b)回收并切割融合蛋白,释放目的多肽;和(b) recovering and cleaving the fusion protein to release the polypeptide of interest; and(c)任选地,分离纯化所述目的多肽,(c) optionally, isolating and purifying the target polypeptide,优选地,所述目的多肽具有低于、等于或略大于7.0例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0的酸性或中性或弱碱性等电点,更优选所述目的多肽的长度为20-500个氨基酸残基,更优选是包含SEQ ID NO:5所示氨基酸序列的NsiI蛋白或包含SEQ ID NO:8所示氨基酸序列的TEVP蛋白。Preferably, the target polypeptide has an acidic, neutral or weakly basic isoelectric point lower than, equal to or slightly greater than 7.0, for example equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0. More preferably, the target polypeptide has a length of 20-500 amino acid residues, more preferably a NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
- 权利要求10的方法,其中:The method of claim 10, wherein:所述目的多肽通过间隔物连接于所述权利要求1或2的肽,优选所述间隔物包含切割位点,优选选自化学切割位点、酶法切割位点和自切割位点,和/或The polypeptide of interest is linked to the peptide of claim 1 or 2 via a spacer, preferably the spacer comprises a cleavage site, preferably selected from a chemical cleavage site, an enzymatic cleavage site and a self-cleavage site, and/or所述融合蛋白还包含与所述目的多肽连接的用于分离纯化的部分,例如6组氨酸标签、GST标签、Strep标签、Twin-Strep标签或MBP标签。The fusion protein further comprises a part connected to the target polypeptide for separation and purification, such as a 6-histidine tag, a GST tag, a Strep tag, a Twin-Strep tag or an MBP tag.
- 权利要求9-11任一项的方法,其中所述宿主细胞选自原核生物、酵母和真核细胞例如哺乳动物细胞或昆虫细胞,更优选选自埃希氏菌属、芽孢杆菌属、沙门氏菌属、假单胞菌属和链霉菌属,更优选选自埃希氏菌属,更优选是大肠杆菌、枯草芽抱杆菌或巨大芽抱杆菌,更优选大肠杆菌Rosetta(DE3)。The method of any one of claims 9 to 11, wherein the host cell is selected from prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3).
- 权利要求1或2的肽或权利要求3的多核苷酸用于在宿主细胞中制备目的蛋白的用途,其中所述宿主细胞优选选自原核生物、酵母和真核细胞例如哺乳动物细胞或昆虫细胞,更优选选自埃希氏菌属、芽孢杆菌属、沙门氏菌属、假单胞菌属和链霉菌属,更优选选自埃希氏菌属,更优选是大肠杆菌、枯草芽抱杆菌或巨大芽抱杆菌,更优选大肠杆菌Rosetta(DE3),Use of the peptide of claim 1 or 2 or the polynucleotide of claim 3 for preparing a target protein in a host cell, wherein the host cell is preferably selected from prokaryotes, yeasts and eukaryotic cells such as mammalian cells or insect cells, more preferably selected from Escherichia, Bacillus, Salmonella, Pseudomonas and Streptomyces, more preferably selected from Escherichia, more preferably Escherichia coli, Bacillus subtilis or Bacillus megaterium, more preferably Escherichia coli Rosetta (DE3),优选地,所述目的蛋白具有低于、等于或略大于7.0例如等于或低于8.0、7.0、6.5、6.0、5.5、5.0、4.5或4.0的酸性或中性或弱碱性等电点,更优选所述目的蛋白的长度为20-500个氨基酸残基,更优选是包含SEQ ID NO:5所示氨基酸序列的NsiI蛋白或包含SEQ ID NO:8所示氨基酸序列的TEVP蛋白。Preferably, the target protein has an acidic, neutral or weakly alkaline isoelectric point lower than, equal to or slightly greater than 7.0, for example equal to or lower than 8.0, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5 or 4.0. More preferably, the target protein has a length of 20-500 amino acid residues, more preferably a NsiI protein comprising the amino acid sequence shown in SEQ ID NO: 5 or a TEVP protein comprising the amino acid sequence shown in SEQ ID NO: 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2022/137539 WO2024119434A1 (en) | 2022-12-08 | 2022-12-08 | Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2022/137539 WO2024119434A1 (en) | 2022-12-08 | 2022-12-08 | Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2024119434A1 true WO2024119434A1 (en) | 2024-06-13 |
WO2024119434A9 WO2024119434A9 (en) | 2024-07-25 |
Family
ID=91378324
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/137539 WO2024119434A1 (en) | 2022-12-08 | 2022-12-08 | Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024119434A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118772291A (en) * | 2022-12-08 | 2024-10-15 | 宜明(济南)生物科技有限公司 | An acidic surface-soluble short peptide tag for improving the expression efficiency of recombinant proteins |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014056199A1 (en) * | 2012-10-12 | 2014-04-17 | 清华大学 | Production and purification methods of polypeptide |
CN107629129A (en) * | 2016-07-19 | 2018-01-26 | 清华大学 | Production and the method for purified polypeptide |
CN114381471A (en) * | 2020-10-19 | 2022-04-22 | 丽珠集团新北江制药股份有限公司 | Application of auxiliary protein in recombinant protein production and fusion expression system |
-
2022
- 2022-12-08 WO PCT/CN2022/137539 patent/WO2024119434A1/en unknown
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014056199A1 (en) * | 2012-10-12 | 2014-04-17 | 清华大学 | Production and purification methods of polypeptide |
CN107629129A (en) * | 2016-07-19 | 2018-01-26 | 清华大学 | Production and the method for purified polypeptide |
CN114381471A (en) * | 2020-10-19 | 2022-04-22 | 丽珠集团新北江制药股份有限公司 | Application of auxiliary protein in recombinant protein production and fusion expression system |
Non-Patent Citations (2)
Title |
---|
HEDHAMMAR, M. HOBER, S.: "Zbasic - A novel purification tag for efficient protein recovery", JOURNAL OF CHROMATOGRAPHY A, ELSEVIER, AMSTERDAM, NL, vol. 1161, no. 1-2, 25 July 2007 (2007-07-25), AMSTERDAM, NL, pages 22 - 28, XP022169415, ISSN: 0021-9673, DOI: 10.1016/j.chroma.2007.05.091 * |
LI, XIANGKUI; FAN, CUI-YING; CUI, YA-JUN; LIU, XUAN: "Recent Advances on the Solubility-Enhancing Fusion Tags for Recombinant Protein Expression", SHENGWU-JISHU = BIOTECHNOLOGY, HEILONGJIANG-SHENG WEISHENGWU XUEHUI; HEILONGJIANG-SHENG SHENGWU GONGCHENG XUEHUI; HEILONGJIANG-SHENG KEXUEYUAN, YINGYONG WEISHENGWU YANJIUSUO, CN, vol. 23, no. 2, 15 April 2013 (2013-04-15), CN , pages 93 - 97, XP009555719, ISSN: 1004-311X, DOI: 10.3969/j.issn.1004-311X.2013.02.049 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118772291A (en) * | 2022-12-08 | 2024-10-15 | 宜明(济南)生物科技有限公司 | An acidic surface-soluble short peptide tag for improving the expression efficiency of recombinant proteins |
Also Published As
Publication number | Publication date |
---|---|
WO2024119434A9 (en) | 2024-07-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Singh et al. | Protein recovery from inclusion bodies of Escherichia coli using mild solubilization process | |
CN104755502B (en) | The generation of polypeptide and purification process | |
CN113195521B (en) | Mtu delta I-CM intein variants and uses thereof | |
Peng et al. | A simple cost-effective methodology for large-scale purification of recombinant non-animal collagens | |
EP0305500A1 (en) | Process for the purification of recombinant polypeptides. | |
Chen et al. | Effect of linker length and flexibility on the Clostridium thermocellum esterase displayed on Bacillus subtilis spores | |
EP2075336B1 (en) | Purification of recombinant proteins fused to multiple epitopes | |
CN101570756B (en) | New-type label protein and application thereof | |
CN107446941A (en) | Cecropin A antibacterial peptide based on self-aggregation short-peptide induction and preparation method thereof | |
WO2024119434A1 (en) | Solubilization-assisting short peptide tag with acidic surface for improving expression efficiency of recombinant protein | |
CN104387473B (en) | Class elastin polypeptide ELP for the non-chromatographic purification method prokaryotic expression fusion protein Prx of non-digestion | |
CN107629129B (en) | Methods of producing and purifying polypeptides | |
US20190352365A1 (en) | Expression construct and method for producing proteins of interest | |
CN104250288B (en) | Amphiphilicα-helix self-assembling peptides and its application | |
US20030134352A1 (en) | Facilitating protein folding and solubility by use of peptide extensions | |
CN118755746A (en) | Recombinant plasmid, construction method and prokaryotic expression method of μ-conotoxin peptide | |
CN110511950B (en) | Application of PAB protein in construction of fusion protein expression vector with chaperone-like protein effect | |
CN109535262B (en) | TrxA-Defensin fusion protein, preparation method and defensin protein further prepared and application | |
CN116496365B (en) | Acidic surface-assisted dissolution short peptide tag for improving recombinant protein expression efficiency | |
CN102776216B (en) | Application of cellulase in soluable and secretion expression of recombinant protein in escherichia coli | |
CN101223275B (en) | Method for extracellular production of target protein through co-expression of OmpF and target protein | |
CN111073925B (en) | High-efficiency polypeptide-polypeptide coupling system and method based on disordered protein coupling enzyme | |
CN116254283A (en) | A method for preparing the transmembrane domain of a single transmembrane protein | |
KR20210079235A (en) | Method for enhancing soluble expression of target proteins by using fusion protein of whep domain | |
CN109096394B (en) | Nano antibody of B subunit of anti-staphylococcal protein A, nucleic acid molecule and application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22967592 Country of ref document: EP Kind code of ref document: A1 |