CN109022478A - A kind of DNA element for high-throughput protein synthesis in vitro - Google Patents
A kind of DNA element for high-throughput protein synthesis in vitro Download PDFInfo
- Publication number
- CN109022478A CN109022478A CN201710430876.3A CN201710430876A CN109022478A CN 109022478 A CN109022478 A CN 109022478A CN 201710430876 A CN201710430876 A CN 201710430876A CN 109022478 A CN109022478 A CN 109022478A
- Authority
- CN
- China
- Prior art keywords
- sequence
- protein
- yeast
- dna
- kluyveromyces
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000014616 translation Effects 0.000 title claims abstract description 58
- 238000001243 protein synthesis Methods 0.000 title claims abstract description 39
- 238000000338 in vitro Methods 0.000 title abstract description 15
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 57
- 108090000623 proteins and genes Proteins 0.000 claims description 75
- 108020004414 DNA Proteins 0.000 claims description 71
- 102000004169 proteins and genes Human genes 0.000 claims description 61
- 238000010276 construction Methods 0.000 claims description 28
- 238000000034 method Methods 0.000 claims description 24
- 241000235649 Kluyveromyces Species 0.000 claims description 22
- 108091081024 Start codon Proteins 0.000 claims description 18
- 150000007523 nucleic acids Chemical class 0.000 claims description 18
- 108020004707 nucleic acids Proteins 0.000 claims description 17
- 102000039446 nucleic acids Human genes 0.000 claims description 17
- 230000014621 translational initiation Effects 0.000 claims description 17
- 108020004705 Codon Proteins 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 15
- 125000003729 nucleotide group Chemical group 0.000 claims description 15
- 108091029865 Exogenous DNA Proteins 0.000 claims description 14
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 14
- 241000723873 Tobacco mosaic virus Species 0.000 claims description 14
- 238000006384 oligomerization reaction Methods 0.000 claims description 11
- 230000015572 biosynthetic process Effects 0.000 claims description 8
- 238000003786 synthesis reaction Methods 0.000 claims description 8
- 108090000331 Firefly luciferases Proteins 0.000 claims description 7
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 7
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 7
- 108060001084 Luciferase Proteins 0.000 claims description 7
- 239000005089 Luciferase Substances 0.000 claims description 7
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 claims description 7
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 claims description 7
- 239000005090 green fluorescent protein Substances 0.000 claims description 7
- 108010053835 Catalase Proteins 0.000 claims description 6
- 102000016938 Catalase Human genes 0.000 claims description 6
- SIIZPVYVXNXXQG-KGXOGWRBSA-N [(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-4-[[(3s,4r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-3-hydroxyoxolan-2-yl]methyl [(2r,4r,5r)-2-(6-aminopurin-9-yl)-4-hydroxy-5-(phosphonooxymethyl)oxolan-3-yl] hydrogen phosphate Polymers C1=NC2=C(N)N=CN=C2N1[C@@H]1O[C@H](COP(O)(=O)OC2[C@@H](O[C@H](COP(O)(O)=O)[C@H]2O)N2C3=NC=NC(N)=C3N=C2)[C@@H](O)[C@H]1OP(O)(=O)OCC([C@@H](O)[C@H]1O)OC1N1C(N=CN=C2N)=C2N=C1 SIIZPVYVXNXXQG-KGXOGWRBSA-N 0.000 claims description 6
- 239000005547 deoxyribonucleotide Substances 0.000 claims description 6
- 102000007469 Actins Human genes 0.000 claims description 5
- 108010085238 Actins Proteins 0.000 claims description 5
- 102000004190 Enzymes Human genes 0.000 claims description 5
- 108090000790 Enzymes Proteins 0.000 claims description 5
- 108010022355 Fibroins Proteins 0.000 claims description 5
- 238000011534 incubation Methods 0.000 claims description 5
- 239000004744 fabric Substances 0.000 claims description 4
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 claims description 3
- 235000018368 Saccharomyces fragilis Nutrition 0.000 claims description 3
- 239000003153 chemical reaction reagent Substances 0.000 claims description 3
- 229940031154 kluyveromyces marxianus Drugs 0.000 claims description 3
- 125000000266 alpha-aminoacyl group Chemical group 0.000 claims description 2
- 241000254158 Lampyridae Species 0.000 claims 1
- 244000253911 Saccharomyces fragilis Species 0.000 claims 1
- 239000006071 cream Substances 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 5
- 235000018102 proteins Nutrition 0.000 description 56
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 50
- 210000004027 cell Anatomy 0.000 description 33
- 241001138401 Kluyveromyces lactis Species 0.000 description 30
- 210000005253 yeast cell Anatomy 0.000 description 19
- 238000006062 fragmentation reaction Methods 0.000 description 15
- 238000013467 fragmentation Methods 0.000 description 14
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- MTHSVFCYNBDYFN-UHFFFAOYSA-N diethylene glycol Chemical compound OCCOCCO MTHSVFCYNBDYFN-UHFFFAOYSA-N 0.000 description 9
- 239000000284 extract Substances 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 239000007788 liquid Substances 0.000 description 7
- 229920001223 polyethylene glycol Polymers 0.000 description 7
- 239000002202 Polyethylene glycol Substances 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 230000000977 initiatory effect Effects 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 229930006000 Sucrose Natural products 0.000 description 5
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 229940024606 amino acid Drugs 0.000 description 5
- 150000001413 amino acids Chemical class 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000005720 sucrose Substances 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 description 4
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 description 4
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 4
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 108010073141 Hepatitis C virus glycoprotein E2 Proteins 0.000 description 4
- 108010078049 Interferon alpha-2 Proteins 0.000 description 4
- 102100040018 Interferon alpha-2 Human genes 0.000 description 4
- 102000000589 Interleukin-1 Human genes 0.000 description 4
- 108010002352 Interleukin-1 Proteins 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 4
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 4
- 102000003425 Tyrosinase Human genes 0.000 description 4
- 108060008724 Tyrosinase Proteins 0.000 description 4
- 102000004139 alpha-Amylases Human genes 0.000 description 4
- 108090000637 alpha-Amylases Proteins 0.000 description 4
- 229940024171 alpha-amylase Drugs 0.000 description 4
- 108010049056 enterocin A Proteins 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 3
- PCDQPRRSZKQHHS-CCXZUQQUSA-N Cytarabine Triphosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-CCXZUQQUSA-N 0.000 description 3
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 3
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 3
- 101710137500 T7 RNA polymerase Proteins 0.000 description 3
- 108010020764 Transposases Proteins 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 229960005305 adenosine Drugs 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- UEGPKNKPLBYCNK-UHFFFAOYSA-L magnesium acetate Chemical compound [Mg+2].CC([O-])=O.CC([O-])=O UEGPKNKPLBYCNK-UHFFFAOYSA-L 0.000 description 3
- 239000011654 magnesium acetate Substances 0.000 description 3
- 229940069446 magnesium acetate Drugs 0.000 description 3
- 235000011285 magnesium acetate Nutrition 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 235000011056 potassium acetate Nutrition 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000003161 proteinsynthetic effect Effects 0.000 description 3
- 210000003705 ribosome Anatomy 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102000004420 Creatine Kinase Human genes 0.000 description 2
- 108010042126 Creatine kinase Proteins 0.000 description 2
- 241000710188 Encephalomyocarditis virus Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 101900264058 Escherichia coli Beta-galactosidase Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 2
- 241000500435 Kluyveromyces dobzhanskii Species 0.000 description 2
- 241000235650 Kluyveromyces marxianus Species 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 101000715512 Mus musculus Catalase Proteins 0.000 description 2
- DRBBFCLWYRJSJZ-UHFFFAOYSA-N N-phosphocreatine Chemical compound OC(=O)CN(C)C(=N)NP(O)(O)=O DRBBFCLWYRJSJZ-UHFFFAOYSA-N 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108010071690 Prealbumin Proteins 0.000 description 2
- 241000288906 Primates Species 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- RZCIEJXAILMSQK-JXOAFFINSA-N TTP Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 RZCIEJXAILMSQK-JXOAFFINSA-N 0.000 description 2
- 108020004566 Transfer RNA Proteins 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- 102000009190 Transthyretin Human genes 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 206010006514 bruxism Diseases 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- MURGITYSBWUQTI-UHFFFAOYSA-N fluorescin Chemical compound OC(=O)C1=CC=CC=C1C1C2=CC=C(O)C=C2OC2=CC(O)=CC=C21 MURGITYSBWUQTI-UHFFFAOYSA-N 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- -1 nucleoside triphosphate Chemical class 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- DWNBOPVKNPVNQG-LURJTMIESA-N (2s)-4-hydroxy-2-(propylamino)butanoic acid Chemical compound CCCN[C@H](C(O)=O)CCO DWNBOPVKNPVNQG-LURJTMIESA-N 0.000 description 1
- DOTSWKOWPPKEIB-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid;potassium Chemical compound [K].OCCN1CCN(CCS(O)(=O)=O)CC1 DOTSWKOWPPKEIB-UHFFFAOYSA-N 0.000 description 1
- CFPWPNDPUSLDPF-UHFFFAOYSA-N 2-[carbamimidoyl(methyl)amino]-2-phosphonoacetic acid Chemical class NC(=N)N(C)C(C(O)=O)P(O)(O)=O CFPWPNDPUSLDPF-UHFFFAOYSA-N 0.000 description 1
- 101001082110 Acanthamoeba polyphaga mimivirus Eukaryotic translation initiation factor 4E homolog Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- MNQZXJOMYWMBOU-VKHMYHEASA-N D-glyceraldehyde Chemical compound OC[C@@H](O)C=O MNQZXJOMYWMBOU-VKHMYHEASA-N 0.000 description 1
- UNXHWFMMPAWVPI-QWWZWVQMSA-N D-threitol Chemical compound OC[C@@H](O)[C@H](O)CO UNXHWFMMPAWVPI-QWWZWVQMSA-N 0.000 description 1
- XUIIKFGFIJCVMT-GFCCVEGCSA-N D-thyroxine Chemical compound IC1=CC(C[C@@H](N)C(O)=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-GFCCVEGCSA-N 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- UNXHWFMMPAWVPI-UHFFFAOYSA-N Erythritol Natural products OCC(O)C(O)CO UNXHWFMMPAWVPI-UHFFFAOYSA-N 0.000 description 1
- 101710091919 Eukaryotic translation initiation factor 4G Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000043856 Leucine-tRNA ligase Human genes 0.000 description 1
- 108010071170 Leucine-tRNA ligase Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 239000008118 PEG 6000 Substances 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 229920002560 Polyethylene Glycol 3000 Polymers 0.000 description 1
- 229920002562 Polyethylene Glycol 3350 Polymers 0.000 description 1
- 229920002584 Polyethylene Glycol 6000 Polymers 0.000 description 1
- 229920002594 Polyethylene Glycol 8000 Polymers 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108091006725 SLCO1C1 Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 102100027229 Solute carrier organic anion transporter family member 1C1 Human genes 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 239000005864 Sulphur Substances 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000003125 aqueous solvent Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009697 arginine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000013065 commercial product Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 210000004907 gland Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 235000014304 histidine Nutrition 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 235000018977 lysine Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 229950007002 phosphocreatine Drugs 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 229960004109 potassium acetate Drugs 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 239000002212 purine nucleoside Substances 0.000 description 1
- 239000002213 purine nucleotide Substances 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 239000003488 releasing hormone Substances 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 229940034208 thyroxine Drugs 0.000 description 1
- XUIIKFGFIJCVMT-UHFFFAOYSA-N thyroxine-binding globulin Natural products IC1=CC(CC([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-UHFFFAOYSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 229940048102 triphosphoric acid Drugs 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- DJJCXFVJDGTHFX-XVFCMESISA-N uridine 5'-monophosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-XVFCMESISA-N 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 230000009614 wildtype growth Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43595—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from coelenteratae, e.g. medusae
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4716—Muscle proteins, e.g. myosin, actin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0065—Oxidoreductases (1.) acting on hydrogen peroxide as acceptor (1.11)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0069—Oxidoreductases (1.) acting on single donors with incorporation of molecular oxygen, i.e. oxygenases (1.13)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/01—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
- C12Y102/01009—Glyceraldehyde-3-phosphate dehydrogenase (NADP+) (1.2.1.9)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y111/00—Oxidoreductases acting on a peroxide as acceptor (1.11)
- C12Y111/01—Peroxidases (1.11.1)
- C12Y111/01006—Catalase (1.11.1.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y113/00—Oxidoreductases acting on single donors with incorporation of molecular oxygen (oxygenases) (1.13)
- C12Y113/12—Oxidoreductases acting on single donors with incorporation of molecular oxygen (oxygenases) (1.13) with incorporation of one atom of oxygen (internal monooxygenases or internal mixed function oxidases)(1.13.12)
- C12Y113/12007—Photinus-luciferin 4-monooxygenase (ATP-hydrolysing) (1.13.12.7), i.e. firefly-luciferase
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Mycology (AREA)
- Toxicology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The present invention provides a kind of DNA elements for high-throughput protein synthesis in vitro, specifically, the present invention has unexpectedly discovered a kind of Novel DNA element that protein translation efficiency can be greatly enhanced for the first time, DNA element of the invention is applied in protein synthesis system outside yeast, the relative light unit value of synthesized uciferase activity enhances about 22.3 times than the DNA element only containing Ω sequence.
Description
Technical field
The present invention relates to field of biotechnology, preferably, being related to a kind of DNA member for high-throughput protein synthesis in vitro
Part.
Background technique
In eukaryocyte, the overwhelming majority all contains " cap " structure at 5 ' ends for the mRNA molecule of coding protein
(m7GpppN) and 3 ' end polyadenosine chain [poly (A)] structure.The steady of mRNA molecule not only can be enhanced in both structures
It is qualitative, and be necessary to protein translation[1].Wherein, 5 ' " cap " structures can be translated initiation factor eIF4E and be known
Not, and a variety of translation initiation factors and ribosomes in downstream, and then initiation protein translation process are recruited[1].It is right in eukaryocyte
MRNA is carried out plus " cap " modification is related to the 3 steps reaction by 3 enzymatics, and this process is urged with by rna plymerase ii
Change transcription simultaneously progress[2]。
It is often poly- using the RNA of bacteriophage or virus in the protein synthesis in vitro system of transcription and translation coupling
Synthase (such as t7 rna polymerase) is transcribed, which dictates that being difficult to carry out mRNA to add " cap " modification.It is being based on eukaryocyte
Protein synthesis in vitro system in, to utilize protein translation system (including the translation initiation factor from eukaryocyte
Son and ribosomes etc.), the mRNA for not having " cap " structure must recruit correlation factor containing other elements, and then originate
The translation of protein[3]。
However, element used in current raising protein synthesis mainly has the translational enhancer Ω sequence of tobacco mosaic virus (TMV)
Column and the internal ribosome of some other viruses enter sequence (Internal Ribosome Entry Sites, IRESs), such as
EMCV IRES(Encephalomyocarditis virus).These elements can in the cell with protein synthesis in vitro body
" cap "-non-dependent protein translation is originated in system.But the efficiency of these element initiation proteins translation often compares
It is low, the time required to will increase protein synthesis, limit the yield of final protein[4]。
Therefore, there is an urgent need in the art to develop a kind of new DNA element that can enhance protein translation efficiency.
Summary of the invention
The purpose of the present invention is to provide the new DNA elements that one kind can enhance protein translation efficiency.
First aspect present invention provides a kind of nucleic acid constructs, and the construction has from 5 ' to 3 ' Formulas I knot
Structure:
Z1-Z2-Z3-Z4 (I)
In formula,
Z1, Z2, Z3, Z4 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Also, described Z2, Z3, Z4 have collectively constituted Kozak sequence, and the Kozak sequence derives from Kluyveromyces
Yeast.
In another preferred example, the Kluyveromyces yeast is selected from the group: Kluyveromyces lactis, Marx's Crewe
Tie up yeast, more cloth kluyveromyces (Kluyveromyces dobzhanskii), or combinations thereof.
In another preferred example, the translation initiation codon is selected from the group: ATG, ATA, ATT, GTG, TTG or its group
It closes.
In another preferred example, the translation initiation codon is ATG.
In another preferred example, the serine codon is selected from the group: TCT, TCC, TCA, TCG, AGT, AGC or its
Combination.
In another preferred example, the serine codon is TCT.
In another preferred example, n 6-12, preferably, 8-10.
In another preferred example, the Ω sequence includes direct replicated blocks (ACAATTAC)m(CAA)pModule.
In another preferred example, the m is 1-6, preferably, 2-4.
In another preferred example, the p is 6-12, preferably, 8-10.
In another preferred example, described (CAA)pModule is 1-5, preferably, 1-3.
In another preferred example, described (CAA)pModule further includes (CAA) of optimizationpModule.
In another preferred example, the sequence of the nucleic acid constructs is as shown in SEQ ID NO.:1-3.
In another preferred example, shown Kozak sequence is as shown in SEQ ID NO.:5.
Second aspect of the present invention provides a kind of nucleic acid constructs, and the construction has from 5 ' to 3 ' Formula II knot
Structure:
Z1-Z2-Z3-Z4-Z5 (II)
In formula,
Z1, Z2, Z3, Z4, Z5 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Z5 is the coded sequence of foreign protein;
Also, described Z2, Z3, Z4 have collectively constituted Kozak sequence, and the Kozak sequence derives from Kluyveromyces
Yeast.
In another preferred example, the coded sequence of the foreign protein comes from prokaryotes, eucaryote.
In another preferred example, the coded sequence of the foreign protein comes from animal, plant, pathogen.
In another preferred example, the coded sequence of the foreign protein comes from mammal, preferably Primate, grinding tooth
Animal, including people, mouse, rat.
In another preferred example, the coded sequence of the foreign protein is selected from the group: coding fluorescence fibroin or fluorescence
Plain enzyme (such as firefly luciferase), green fluorescent protein, yellow fluorescence protein, aminoacyl tRNA synthetase, glyceraldehyde-3-phosphate
Dehydrogenase, catalase, actin, the exogenous DNA of the Variable Area of antibody, the DNA of luciferase mutant or its group
It closes.
In another preferred example, the foreign protein is selected from the group: alpha-amylase, enterocin A, hepatitis C virus E 2
Glycoprotein, insulin precurosor, Interferon α A, interleukin-1 ' beta ', lysozyme element, seralbumin, single-chain antibody section
(scFV), transthyretin, tyrosinase, zytase, or combinations thereof.
In another preferred example, the nucleotide sequence of the coded sequence of the foreign protein such as SEQ ID NO.:6-12 institute
Show.
Third aspect present invention provides a kind of carrier or carrier combination, and the carrier or carrier combination contain the present invention
Nucleic acid constructs described in first aspect or second aspect of the present invention.
Fourth aspect present invention provides a kind of genetically engineered cell, one of the genome of the genetically engineered cell or
Multiple integrations have construction described in first aspect present invention or second aspect of the present invention or the genetically engineered cell
In contain carrier described in third aspect present invention or carrier combination.
In another preferred example, the genetically engineered cell is yeast cells.
In another preferred example, the yeast cells comes from Kluyveromyces yeast.
In another preferred example, the Kluyveromyces yeast is selected from the group: Kluyveromyces lactis, Marx's Crewe
Tie up yeast, more cloth kluyveromyces (Kluyveromyces dobzhanskii), or combinations thereof.
One of fifth aspect present invention provides a kind of kit, and the reagent for including in the kit is selected from the group
Or it is a variety of:
(a) construction described in first aspect present invention or second aspect of the present invention;
(b) carrier described in third aspect present invention or carrier combination;With
(c) genetically engineered cell described in fourth aspect present invention;
In another preferred example, the kit further includes the outer albumen synthetic system of (d) yeast.
In another preferred example, the outer albumen synthetic system of the yeast is the external albumen synthetic system of kluyveromyces
(the preferably external albumen synthetic system of Kluyveromyces lactis).
Sixth aspect present invention provides construction, this hair as described in first aspect present invention or second aspect of the present invention
Genetically engineered cell described in carrier described in the bright third aspect or carrier combination, fourth aspect present invention or the 5th side of the invention
The purposes of kit described in face, for carrying out high-throughput external albumen synthesis.
Seventh aspect present invention provides a kind of foreign protein synthetic method of external high throughput, comprising steps of
(i) outside yeast in the presence of albumen synthetic system, nucleic acid constructs described in second aspect of the present invention is provided;
(ii) under the suitable conditions, the outer albumen synthetic system of the yeast of incubation step (i) T1 for a period of time, to close
At the foreign protein.
In another preferred example, the method also includes (iii) albumen synthetic systems optionally outside the yeast
In, separate or detect the foreign protein.
In another preferred example, the outer albumen synthetic system of the yeast is the external albumen synthetic system of kluyveromyces
(the preferably external albumen synthetic system of Kluyveromyces lactis).
In another preferred example, the coded sequence of the foreign protein comes from prokaryotes, eucaryote.
In another preferred example, the coded sequence of the foreign protein comes from animal, plant, pathogen.
In another preferred example, the coded sequence of the foreign protein comes from mammal, preferably Primate, grinding tooth
Animal, including people, mouse, rat.
In another preferred example, the coded sequence of the foreign protein is selected from the group: coding fluorescence fibroin or fluorescence
Plain enzyme (such as firefly luciferase), green fluorescent protein, yellow fluorescence protein, aminoacyl tRNA synthetase, glyceraldehyde-3-phosphate
Dehydrogenase, catalase, actin, the exogenous DNA of the Variable Area of antibody, the DNA of luciferase mutant or its group
It closes.
In another preferred example, the foreign protein is selected from the group: alpha-amylase, enterocin A, hepatitis C virus E 2
Glycoprotein, insulin precurosor, Interferon α A, interleukin-1 ' beta ', lysozyme element, seralbumin, single-chain antibody section
(scFV), transthyretin, tyrosinase, zytase, or combinations thereof.
In another preferred example, the nucleotide sequence of the coded sequence of the foreign protein such as SEQ ID NO.:6-12 institute
Show.
In another preferred example, in the step (ii), reaction temperature is 20-37 DEG C, preferably, 22-35 DEG C.
In another preferred example, in the step (ii), reaction time 1-10h, preferably, 2-8h.
It should be understood that above-mentioned each technical characteristic of the invention and having in below (eg embodiment) within the scope of the present invention
It can be combined with each other between each technical characteristic of body description, to form a new or preferred technical solution.As space is limited, exist
This no longer tires out one by one states.
Detailed description of the invention
Fig. 1, which is shown, joined the Ω -10A/8A/6A-Fluc containing Kluyveromyces lactis specificity Kozak sequence
3 DNA fragmentations and Ω-Fluc DNA fragmentation without containing Kozak sequence encode in protein synthesis system outside yeast to be closed
At the relative light unit value (Relative Light Unit, RLU) of Fluc protein, and it is added without the ferment of any DNA fragmentation
The relative light unit value RLU of the outer protein system of parent.
Specific embodiment
After extensive and in-depth study, by largely screening and groping, have unexpectedly discovered that one kind can be significantly for the first time
Enhance the Novel DNA element of protein translation efficiency, DNA element of the invention includes Ω sequence, adenine deoxyribonucleoside oligomerization
Chain, translation initiation codon and serine codon, wherein rear three elements constitute kluyveromyces (such as Kluyveromyces Lactis dimension ferment
It is female) the Kozak sequence of specificity, DNA element of the invention is applied in protein synthesis system outside yeast, synthesized is glimmering
The relative light unit value of light element enzymatic activity enhances about 22.3 times than the DNA element only containing Ω sequence.On this basis, of the invention
People completes the present invention.
The outer protein synthesis system of yeast
Yeast (yeast) has both the advantage for cultivating simple efficient protein matter folding and posttranslational modification.It wherein makes wine ferment
Female (Saccharomyces cerevisiae) and pichia yeast (Pichia pastoris) be express complicated eukaryotic protein with
The model organism of memebrane protein, yeast also can be used as the raw material for preparing external translating system.
Kluyveromyces (Kluyveromyces) are a kind of ascospore yeast, kluyveromyces marxianus therein
(Kluyveromyces marxianus) and Kluyveromyces lactis (Kluyveromyces lactis) are industrially to make extensively
Yeast[5].Compared with other yeast, Kluyveromyces lactis is had many advantages, such as superpower secretion capacity, preferably
Large scale fermentation characteristic, the rank of food safety and there is the ability modified after protein translation simultaneously etc.[5]。
In the present invention, the outer protein synthesis system of yeast is not particularly limited, a kind of outer albumen of preferred yeast
Matter synthetic system is kluyveromyces expression system (more preferably, Kluyveromyces lactis expression system).
In the present invention, the outer protein synthesis system of the yeast includes:
(a) yeast cell extract;
(b) polyethylene glycol;
(c) optional Exogenous Sucrose;With
(d) optional solvent, the solvent are water or aqueous solvent.
In a particularly preferred embodiment, external albumen synthetic system provided by the invention includes: that yeast cells mentions
Take object, 4- hydroxyethyl piperazineethanesulfonic acid, potassium acetate, magnesium acetate, adenosine triphyosphate (ATP), guanopterin nucleoside triphosphate
(GTP), cytidine triphosphate (CTP), thymidine triphosphate (TTP), ispol, phosphocreatine, two
Sulphur threitol (DTT), creatine phosphokinase, RNase inhibitor, fluorescein, luciferin enzyme dna, RNA polymerase.
In the present invention, RNA polymerase is not particularly limited, and can be selected from one or more RNA polymerases, typically
RNA polymerase is t7 rna polymerase.
In the present invention, ratio of the yeast cell extract in vitro in albumen synthetic system is not particularly limited,
Shared system is 20-70% to the usual yeast cell extract in protein synthetic proteins synthetic system in vitro, preferably
Ground, 30-60%, more preferably, 40-50%.
In the present invention, the yeast cell extract is free of complete cell, typical yeast cell extract packet
Include the initiation factor and extension of the ribosomes for protein translation, transfer RNA, aminoacyl tRNA synthetase, protein synthesis needs
The factor and termination releasing factor.In addition, also containing other in some cytoplasm from yeast cells in yeast extract
Albumen, especially soluble protein.
In the present invention, protein content contained by the yeast cell extract is 20-100mg/ml, preferably 50-
100mg/ml.The measurement protein content method is Coomassie brilliant blue measuring method.
In the present invention, the preparation method of the yeast cell extract is unrestricted, a kind of preferred preparation method
The following steps are included:
(i) yeast cells is provided;
(ii) carrying out washing treatment is carried out to yeast cells, obtains washed yeast cells;
(iii) broken cell processing is carried out to washed yeast cells, to obtain yeast crude extract;
(iv) the yeast crude extract is separated by solid-liquid separation, obtains liquid portion, as yeast cell extract.
In the present invention, the solid-liquid separation method is not particularly limited, and a kind of preferred mode is centrifugation.
In a preferred embodiment, the centrifugation carries out in the liquid state.
In the present invention, the centrifugal condition is not particularly limited, and a kind of preferred centrifugal condition is 5000-100000
× g, preferably, 8000-30000 × g.
In the present invention, the centrifugation time is not particularly limited, and a kind of preferred centrifugation time is 0.5min-2h, compared with
Goodly, 20min-50min.
In the present invention, the temperature of the centrifugation is not particularly limited, it is preferred that and the centrifugation carries out at 1-10 DEG C,
Preferably, being carried out at 2-6 DEG C.
In the present invention, the carrying out washing treatment mode is not particularly limited, and a kind of preferred carrying out washing treatment mode is to adopt
It is handled with cleaning solution in the case where pH is 7-8 (preferably, 7.4), the cleaning solution is not particularly limited, the typical washing
Liquid is selected from the group: 4- hydroxyethyl piperazineethanesulfonic acid potassium, potassium acetate, magnesium acetate, or combinations thereof.
In the present invention, the mode of broken cell processing is not particularly limited, at a kind of preferred broken cell
Reason includes that high pressure is broken, freeze thawing (such as liquid nitrogen cryogenics) is broken.
Ribonucleoside triphosphote mixture in the protein synthesis in vitro system is adenosine triphyosphate, guanosint
Guanosine triphosphate, cytidine triphosphate and uridine diphosphate guanosine triphosphate.In the present invention, the concentration of various mononucleotides does not have
Especially limitation, the concentration of usual every kind of mononucleotide are 0.5-5mM, preferably 1.0-2.0mM.
Ispol in the protein synthesis in vitro system may include natural or non-natural amino acids, it may include
D type or L-type amino acid.Representative amino acid includes (but being not limited to) 20 kinds of natural amino acids: glycine, alanine, figured silk fabrics
Propylhomoserin, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, day
Winter amide, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine.The concentration of every kind of amino acid
Usually 0.01-0.5mM, preferably 0.02-0.2mM, such as 0.05,0.06,0.07,0.08mM.
In preference, the protein synthesis in vitro system also contains polyethylene glycol or its analog.Polyethylene glycol or
The concentration of its analog is not particularly limited, in general, the concentration (w/v) of polyethylene glycol or its analog is 0.1-8%, preferably
Ground, 0.5-4%, more preferably, 1-2%, with the total weight of the albumen synthetic system.Representative PEG example includes (but simultaneously
It is not limited to): PEG3000, PEG8000, PEG6000 and PEG3350.It should be understood that may also include other various for system of the invention
The polyethylene glycol (such as PEG200,400,1500,2000,4000,6000,8000,10000) of molecular weight.
In preference, the protein synthesis in vitro system also contains sucrose.The concentration of sucrose is not particularly limited, and leads to
Often, the concentration of sucrose is 0.03-40wt%, preferably, 0.08-10wt%, more preferably, 0.1-5wt%, with albumen synthesis
The total weight of system.
A kind of particularly preferred protein synthesis in vitro system also contains following components other than yeast extract:
The 4- hydroxyethyl piperazineethanesulfonic acid that 22mM, pH are 7.4,30-150mM potassium acetate, 1.0-5.0 mM magnesium acetate, 1.5-4mM nucleosides
Triphosphoric acid mixture, the ispol of 0.08-0.24mM, 25 mM phosphocreatines, 1.7mM dithiothreitol (DTT), 0.27mg/
ML creatine phosphokinase, 1%-4% polyethylene glycol, 0.5%-2% sucrose, the DNA of 8-20ng/ μ l firefly luciferase,
0.027-0.054mg/mL T7 RNA polymerase.
The coded sequence (exogenous DNA) of foreign protein
As used herein, term " coded sequence of foreign protein " is used interchangeably with " exogenous DNA ", refers both to the use of external source
In the DNA molecular for instructing protein to synthesize.In general, the DNA molecular is linear or cricoid.The DNA molecular contains
There is the sequence of encoding foreign proteins.
In the present invention, the example of the sequence of the encoding foreign proteins includes (but being not limited to): genome sequence,
CDNA sequence.The sequence of the encoding foreign proteins also contains promoter sequence, 5' non-translated sequence, 3' non-translated sequence.
In the present invention, the selection of the exogenous DNA is not particularly limited, in general, exogenous DNA is selected from the group: coding is glimmering
Light fibroin or luciferase (such as firefly luciferase), green fluorescent protein, yellow fluorescence protein, aminoacyl tRNA synthesis
Enzyme, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, the exogenous DNA of the Variable Area of antibody, luciferase are prominent
The DNA of variant, or combinations thereof.
Exogenous DNA is also selected from the following group: coding alpha-amylase, enterocin A, hepatitis C virus E 2 glycoprotein, pancreas
Island element precursor, Interferon α A, interleukin-1 ' beta ', lysozyme element, seralbumin, single-chain antibody section (scFV), thyroxine
Transporter, tyrosinase, zytase exogenous DNA, or combinations thereof.
The sequence of a kind of representative exogenous DNA is selected from: SEQ ID NO.:6-12.
In a preferred embodiment, the exogenous DNA encodes albumen selected from the group below: green fluorescent protein
(enhanced GFP, eGFP), yellow fluorescence protein (YFP), Escherichia coli beta galactosidase (β-galactosidase,
LacZ), people's lysine-tRNA synzyme (Lysine-tRNA synthetase), human leucine-tRNA synzyme
(Leucine-tRNA synthetase), arabidopsis glyceraldehyde 3 phosphate dehydrogenase (Glyceraldehyde-3-
Phosphate dehydrogenase), mouse catalase (Catalase), or combinations thereof.
In a preferred embodiment, the nucleotide sequence of the green fluorescent protein is as shown in SEQ ID NO.:6;Institute
The nucleotide sequence of yellow fluorescence protein is stated as shown in SEQ ID NO.:7;The nucleotide of the Escherichia coli beta galactosidase
Sequence is as shown in SEQ ID NO.:8;The nucleotide sequence of people's lysine-tRNA synzyme is as shown in SEQ ID NO.:9;
The nucleotide sequence of the human leucine-tRNA synzyme is as shown in SEQ ID NO.:10;The arabidopsis glyceraldehyde 3- phosphorus
The nucleotide sequence of acidohydrogenase is as shown in SEQ ID NO.:11;The nucleotide sequence of the mouse catalase such as SEQ ID
Shown in NO.:12.
Ω sequence
As used herein, term " Ω sequence " is 5 ' end leader sequences of tmv cdna group, is this kind of virus
Translational enhancer.The DNA sequence dna of Ω contains 68 base-pairs, straight by 1-6 (preferably 2-4, more preferable 3) 8 base-pairs
Connecing replicated blocks (ACAATTAC) and 1-5 (preferably 1-3, more preferable 1) (CAA) p modules, wherein p is 6-12, compared with
Goodly, 8-10.The two modules are crucial for the enhancing interpretative function of Ω sequence.The protein outside yeast of the invention
In synthetic system, Ω sequence can originate the protein translation of " cap sequence " dependent/non-dependent, and this function may be by recruiting
What the translation initiation factor eIF4G that raises was realized.But the efficiency of Ω sequence initiation protein translation is relatively low, needs to constitute it
It optimizes, and cooperates other DNA elements or protein to enhance the efficiency of protein translation.
Kozak sequence
By dividing translation initiation codon (AUG) upstream and downstream sequence in known Eukaryotic mRNA molecule
Analysis, the consensus sequence found out are referred to as Kozak sequence.Kozak sequence is proved that the translation initiation efficiency of mRNA can be enhanced.
The Kozak sequence of different plant species is often different, and such as brewing yeast cell (Saccharomyces cerevisiae) and is fed
The Kozak sequence of newborn zooblast there is significant difference.
In the present invention, Kozak sequence used includes 6-12 adenine deoxyribonucleoside oligomerization chain (preferably, 8-
10), translation initiation codon (such as ATG, ATA, ATT, GTG, TTG, preferably ATG) and serine codon (such as TCT, TCC,
TCA, TCG, AGT, AGC etc., preferably TCT), it derives from kluyveromyces (preferably Kluyveromyces lactis).
DNA construction
The present invention provides a kind of DNA construction, the DNA construction contains structure nucleic acid sequence shown in formula I:
Z1-Z2-Z3-Z4 (I)
In formula,
Z1, Z2, Z3, Z4 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Also, described Z2, Z3, Z4 have collectively constituted Kozak sequence, and the Kozak sequence derives from Kluyveromyces
Yeast.
The present invention also provides a kind of DNA construction, the construction has from 5 ' to 3 ' Formula II structure:
Z1-Z2-Z3-Z4-Z5 (II)
In formula,
Z1, Z2, Z3, Z4, Z5 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Z5 is the coded sequence of foreign protein;
In the present invention, the selection of the coded sequence of the foreign protein is not particularly limited, in general, the volume of foreign protein
Code sequence is selected from the group: coding fluorescence fibroin or luciferase (such as firefly luciferase), green fluorescent protein, yellow
The variable region of fluorescin, aminoacyl tRNA synthetase, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, antibody
The exogenous DNA in domain, luciferase mutant DNA, or combinations thereof.
Foreign protein is also selected from the following group: alpha-amylase, enterocin A, hepatitis C virus E 2 glycoprotein, insulin
Precursor, Interferon α A, interleukin-1 ' beta ', lysozyme element, seralbumin, single-chain antibody section (scFV), thyroxine delivery
Albumen, tyrosinase, zytase, or combinations thereof.
In addition, the nucleic acid constructs of the invention can be linear, it is also possible to cricoid.The core of the invention
Acid construct object can be single-stranded, be also possible to double-strand.The nucleic acid constructs of the invention can be DNA, be also possible to
RNA or DNA/RNA heterozygosis.
In another preferred example, the construction further includes element selected from the group below or combinations thereof: promoter, termination
Son, poly (A) element, transhipment element, gene target element, riddled basins, enhancer, resistant gene, transposase coding
Gene.
Multiple choices marker gene can be applied to the present invention, including but not limited to: nutrient defect type mark, resistance mark
Note, reporter gene label.The application of selective key plays a role the screening of recombinant cell (recon), so that recipient cell
Born of the same parents can significantly be distinguished with unconverted cell.Nutrient defect type mark is the marker gene and recipient cell by being transferred to
Mutated gene is complementary, so that recipient cell be made to show wild type growth.Resistance marker, which refers to, is transferred to recipient cell for resistant gene
In, the gene being transferred to makes recipient cell show drug resistance under certain drug concentration.As preferred embodiment of the invention, application
Resistance marker realizes the convenient screening of recombinant cell.
In the present invention, DNA construction of the invention is applied in protein synthesis system outside yeast of the invention, it can
The efficiency of protein translation is significantly improved, specifically, using the opposite of uciferase activity synthesized by DNA construction of the invention
Light unit value enhances about 22.3 times than the DNA element only containing Ω sequence.
Carrier, genetically engineered cell
The present invention also provides a kind of carriers or carrier to combine, and the carrier contains DNA construction of the invention.It is preferred that
Ground, the carrier are selected from: bacterial plasmid, bacteriophage, yeast plasmid or zooblast carrier, shuttle vector;The carrier is
Transposon vector.The method for being used to prepare recombinant vector is well known to those of ordinary skill in the art.As long as it can be in place
Duplication and stabilization in main body, any plasmid and carrier are all can be adopted.
Those of ordinary skill in the art can be used the building of well known method containing promoter of the present invention and/or
The expression vector of objective gene sequence.These methods include recombinant DNA technology in vi, DNA synthetic technology, In vivo recombination technology
Deng.
The present invention also provides a kind of genetically engineered cell, the genetically engineered cell contains the construction or load
Body or carrier combination or the genetically engineered cell chromosomal integration have the construction or carrier.In another preference
In, the genetically engineered cell further includes being integrated with transposase base on the carrier containing encoding transposase gene or its chromosome
Cause.
Preferably, the genetically engineered cell is eukaryocyte.
In another preferred example, the eukaryocyte, including but not limited to: (preferably, kluyveromyces are thin for yeast cells
Born of the same parents, more preferable Kluyveromyces lactis cell).
Construction or carrier of the invention, can be used for converting genetically engineered cell appropriate.Genetically engineered cell can be with
It is prokaryotic cell, such as Escherichia coli, streptomyces, Agrobacterium: or low eukaryocyte, such as yeast cells;Or it is high dynamic
Object cell, such as insect cell.Persons skilled in the art are aware that how to select carrier and genetically engineered cell appropriate.With
Recombinant DNA transformation gene engineering cell can be carried out with routine techniques well known to those skilled in the art.When host is prokaryotes
When (such as Escherichia coli), CaCl can be used2Method processing, it is also possible to which electroporation carries out.When host is eucaryote, can be selected such as
Under DNA transfection method: calcium phosphate precipitation, conventional mechanical methods (such as microinjection, electroporation, liposome packaging).
The methods of Agrobacterium-mediated Transformation or via Particle Bombardment Transformation, such as leaf disk method, rataria conversion method, bud infusion method can also be used in conversion plant
Deng.
External high-throughput protein synthesis methods
The present invention provides a kind of protein synthesis methods of external high throughput, comprising steps of
(i) outside yeast in the presence of albumen synthetic system, nucleic acid constructs described in first aspect present invention is provided, and
The DNA molecular for instructing protein to synthesize of external source is added;Or provide nucleic acid constructs described in second aspect of the present invention;
(ii) under the suitable conditions, the outer albumen synthetic system of the yeast of incubation step (i) T1 for a period of time, to close
At the protein or foreign protein encoded by the exogenous DNA.
Main advantages of the present invention include:
(1) it present invention firstly discovers that, by Ω sequence and derives from kluyveromyces (preferably Kluyveromyces lactis)
Kozak sequence is remarkably improved albumen applied in protein synthesis system outside yeast of the invention as nucleic acid constructs
The efficiency of translation.
(2) present invention firstly discovers that, compared with the DNA element only containing Ω sequence, synthesized by DNA construction of the invention
Uciferase activity relative light unit value enhance about 22.3 times.
(3) present invention firstly discovers that, in terms of promoting protein synthesis, Ω -10A is better than Ω -8A and Ω -6A.
(4) Novel DNA element Ω -10A of the invention derives from the Kozak sequence of Kluyveromyces lactis specificity, energy
Enough significantly increase the efficiency of the outer synthetic system synthetic proteins matter of yeast.With the Ω-Fluc DNA fragmentation for not containing Kozak sequence
It compares, Ω -10A-Fluc increases 22.3 times of protein synthetic quantity.
(5) present invention is by translation initiation codon AUG upstream and downstream sequence in K. lactis gene mRNA
Analysis finds that the sequence at 5 ' ends is that A and U is enriched with, and the DNA fragmentation containing polyadenous purine nucleosides (10A) oligomerization chain increases
The efficiency of strong protein synthesis is better than the DNA fragmentation containing 8A and 6A.The above analysis with the experimental results showed that, the present invention is wrapped
The Novel DNA element contained has the potentiality advanced optimized.
(6) the Ω sequence in the Novel DNA element Ω -10A that the present invention is included is generally applied to external protein conjunction
In architectonical, carry out the synthesis of initiation protein.For the present invention by the understanding to Ω sequence component part, learning has starting albumen
The possible comprising modules of matter translation ability, it is subsequent to be directed between the sequence, quantity and disparate modules of these modules
It puts in order and is further optimized, it is desired to be able to further strengthen the ability of its initiation protein combined coefficient.
Present invention will be further explained below with reference to specific examples.It should be understood that these embodiments are merely to illustrate the present invention
Rather than it limits the scope of the invention.In the following examples, the experimental methods for specific conditions are not specified, usually according to conventional strip
Part, such as Sambrook et al., molecular cloning: laboratory manual (New York:Cold Spring Harbor
Laboratory Press, 1989) condition described in, or according to the normal condition proposed by manufacturer.Unless otherwise stated, no
Then percentage and number are weight percent and parts by weight.
Unless otherwise instructed, then material used in the embodiment of the present invention and reagent are commercial product.
Embodiment 1: Kluyveromyces lactis specificity Kozak sequence analysis
The determination of the Kozak sequence of 1.1 Kluyveromyces lactis specificity: according to analysis saccharomyces cerevisiae
The method of the Kozak sequence of (Saccharomyces cerevisiae) specificity, retrieves and collects Kluyveromyces lactis 107
The mRNA sequence of a gene, and 60 alkali that the Kluyveromyces lactis mRNA translation initiation codon AUG 5 ' filtered out is held
Basic sequence and a Codon sequences in 3 ' end downstreams carry out the content analysis of A, U, C and G, have been surprisingly found that and hold position the 5 ' of AUG
It sets, the ratio of adenine A is dominant, and has been determined that the codon UCU an of serine is dominant at the 3 ' ends of AUG, in this way
The Kozak sequence for just having primarily determined Kluyveromyces lactis specificity, is named as kl-Kozak sequence.
Protein synthesis system matter outside 1.2 yeasts of the building containing Kluyveromyces lactis specificity kl-Kozak sequence
Grain: (the pET21a plasmid modified is obtained from health code (Shanghai) biological section to protein synthesis system plasmid outside original yeast
Skill Co., Ltd) in Ω sequence and reporter gene firefly luciferase Fluc sequence between be added Kluyveromyces Lactis tie up ferment
The kl-Kozak sequence of female specificity inserts at 5 ' ends of Fluc initiation codon containing different number adenine deoxyribonucleoside
The oligomerization chain of acid, including 10A, 8A and 6A, and serine codon TCT is inserted into Fluc initiation codon and second password
Between son, pKL- Ω -10A-Fluc, pKL- containing Kluyveromyces lactis specificity kl-Kozak sequence are ultimately formed
The plasmid of Ω -8A-Fluc and pKL- Ω -6A-Fluc.
Specific implementation process is as follows: use 3 pairs of primers:
O6A-605_f(CAACAATTACCAACAACAACAAACAACAAACAACATTAC
AATTACTATTTACAATTACAAAAAAAATGTCTGAAGACGCCAAAAACAT AAAGAAAGGCC)(SEQ ID NO.:13)
With OK-60_b (TTTTTTTTGTAATTGTAAATAGTAATTGTAATGTTGTTTGTTGTTTGTTGT TG) (SEQ ID
NO.:14);
O8A-605_f(CAATTACCAACAACAACAAACAACAAACAACATTACAAT
TACTATTTACAATTACAAAAAAAAAATGTCTGAAGACGCCAAAAACATA AAGAAAGGCC)(SEQ ID NO.:15)
With OK-60_b;
O10A-605_f(AATTACCAACAACAACAAACAACAAACAACATTACAAT
TACTATTTACAATTACAAAAAAAAAAAATGTCTGAAGACGCCAAAAACA TAAAGAAAGGCC)(SEQ ID NO.:
16) and OK-60_b carries out PCR amplification to plasmid pKL- Ω-Fluc (SEQ ID NO.:4), and 1 μ is added into 20 μ L amplified productions
L Dpn I, 37 DEG C of incubation 6h;4 μ L of product is added in 50 μ L DH5 α competent cells after DpnI is handled, and places on ice
After 30min, 42 DEG C of heat shock 45s, 3min is placed on ice, and 200 37 DEG C of shaken cultivation 4h of μ L LB liquid medium are added, are coated on
It is incubated overnight on LB solid medium containing Amp antibiotic;After 6 monoclonals of picking expand culture, be sequenced really
After recognizing correctly, extracts plasmid and save, be named as pKL- Ω -10A-Fluc (SEQ ID NO.:1), pKL- Ω -8A-Fluc (SEQ
ID NO.:2) and pKL- Ω -6A-Fluc (SEQ ID NO.:3) plasmid.
Application of the embodiment 2:kl-Kozak sequence outside yeast in protein synthesis system
2.1 methods for utilizing PCR, and use primer T7_pET21a_F:CGCGAAATTAATACGACTCACTATAGG
(SEQ ID NO.:17) and T7ter_pET21a_R:TCCGGATATAGTTCCTCCTTTCAG (SEQ ID NO.:18) are by matter
It include Ω-between T7 transcriptional initiation sequence and termination sequence in grain pKL- Ω -10A/8A/6A-Fluc and pKL- Ω-Fluc
The segment and Ω-Fluc segment of 10A/8A/6A-Fluc is expanded.
And the method for the DNA fragmentation ethanol precipitation that amplification obtains is purified and is enriched with: being added into PCR product
Then the 3M sodium acetate (pH5.2) of 1/10 volume adds 2.5-3 times of volume (volume is the volume being added after sodium acetate)
95% ethyl alcohol, be placed in and be incubated for 15min on ice;30min is centrifuged with the speed higher than 14000g under room temperature, is discarded
Clearly;It is cleaned using 70% ethyl alcohol, is then centrifuged 15min again, discarded supernatant, and dissolved precipitating with ultrapure water, measure DNA
Concentration.
The DNA fragmentation of purifying according to operation instruction, is added to the external protein of homemade Kluyveromyces lactis and closed by 2.2
In architectonical.And above-mentioned reaction system is placed in 25-30 DEG C of environment, stationary incubation about 2-6h.After reaction, in 96 holes
Isometric Fluc substrate luciferin (luciferin) is added in blank or 384 hole blanks, is placed in Envision immediately
2120 multi-function microplate readers (Perkin Elmer), reading, detection Fluc activity, relative light unit value (Relative Light
Unit, RLU) it is used as active unit, as shown in Figure 1.
2.3 use the DNA fragmentation (Ω-Fluc) without containing kl-Kozak sequence as control, and each sample designs three
Group independent experiment, and the experimental group without containing any DNA fragmentation is designed as negative control.
Experimental result
1. the kl-Kozak sequence of Kluyveromyces lactis specificity
On the position before initiation codon AUG, gland is fast at 5 ' ends of 107 gene mRNAs of Kluyveromyces lactis
The ratio of purine nucleotide (A) is dominant, has the ratio of uridylate (U) on several positions and approaches, but is less slightly.So with
Saccharomyces cerevisiae is similar, and the 5 ' terminal sequence of gene mRNA initiation codon of Kluyveromyces lactis is that A and U is enriched with.Initiation codon
It is dominant on 3 positions of the codon after sub- AUG to be followed successively by U, C and U, it is the codon of serine.Finally determine
The kl-Kozak sequence of Kluyveromyces lactis specificity is as shown in SEQ ID NO.:5.
Application of the 2.kl-Kozak sequence outside yeast in protein synthesis system
As shown in Figure 1, Ω-the 10A/8A/6A-Fluc 3 containing Kluyveromyces lactis specificity kl-Kozak sequence
A DNA fragmentation encodes the relative light unit value highest that the Fluc albumen of synthesis is released outside yeast in protein synthesis system
Reach 2.72 × 107.And the light relatively of the Fluc albumen of the Ω-Fluc DNA fragmentation coding synthesis without containing kl-Kozak sequence
Unit value only has 1.22 × 106.This shows that the insertion of kl-Kozak sequence can effectively enhance the outer protein compound body of yeast
It is the efficiency of synthetic proteins matter.Because being in the range of linearity of detecting instrument relative light unit value RLU and protein concentration relationship
Interior, the relative light unit value of the highest Ω -10A-Fluc segment of activity is 22.3 times of Ω-Fluc segment, shows Ω -10A energy
The protein synthesis of enough about 22.3 times of enhancings.
The efficiency of DNA fragmentation enhancing protein synthesis containing different length adenosine oligomerization chain is also to have differences
, it joined the outer protein synthesis system synthesis of yeast of Ω -10A-Fluc, Ω -8A-Fluc and Ω -6A-Fluc segment
The relative light unit value of Fluc protein is respectively 2.72 × 107、1.52×107With 1.01 × 107.These are the result shows that promoting
Aspect is synthesized into protein, Ω -10A is better than Ω -8A and Ω -6A.The outer albumen of yeast of any DNA fragmentation is not added
Matter synthetic system only has 190.33 as negative control, relative light unit value.
The present invention the result shows that, the kl-Kozak sequence of Kluyveromyces lactis specificity can significantly increase outside yeast
Protein synthesis system produces protedogenous efficiency, and the relative light unit value of Ω -10A-Fluc DNA fragmentation is Ω-Fluc
22.3 times, reach 2.72 × 107, show that Ω -10A can enhance 22.3 times of protein synthesis.
All references mentioned in the present invention is incorporated herein by reference, independent just as each document
It is incorporated as with reference to such.In addition, it should also be understood that, after reading the above teachings of the present invention, those skilled in the art can
To make various changes or modifications to the present invention, such equivalent forms equally fall within model defined by the application the appended claims
It encloses.
Bibliography:
1.Dever,T.E.,et al.(2016)Mechanism and Regulation of Protein
Synthesis in Saccharomyces cerevisiae.Genetics 203,65-107.
2.Kyrieleis,O.J.,et al.(2014)Crystal structure of vaccinia virus mRNA
capping enzyme provides insights into the mechanism and evolution of the
capping apparatus.Structure 22,452-465.
3.Katzen,F.,et al.(2005)The past,present and future of cell-free
protein synthesis.Trends in biotechnology 23,150-156.
4.Anastasina,M.,et al.(2014)A technique to increase protein yield in
a rabbit reticulocyte lysate translation system.BioTechniques 56,36-39.
5.Spohner,S.C.,et al.(2016)Kluyveromyces lactis:An emerging tool in
biotechnology.Journal of biotechnology 222,104-116.
Sequence table
<110>health code (Shanghai) Biotechnology Co., Ltd
<120>a kind of DNA element for high-throughput protein synthesis in vitro
<130> P2017-1191
<160> 18
<170> PatentIn version 3.5
<210> 1
<211> 1993
<212> DNA
<213>artificial sequence
<400> 1
cgcgaaatta atacgactca ctataggggt atttttacaa caattaccaa caacaacaaa 60
caacaaacaa cattacaatt actatttaca attacaaaaa aaaaaaatgt ctgaagacgc 120
caaaaacata aagaaaggcc cggcgccatt ctatcctcta gaggatggaa ccgctggaga 180
gcaactgcat aaggctatga agagatacgc cctggttcct ggaacaattg cttttacaga 240
tgcacatatc gaggtgaaca tcacgtacgc ggaatacttc gaaatgtccg ttcggttggc 300
agaagctatg aaacgatatg ggctgaatac aaatcacaga atcgtcgtat gcagtgaaaa 360
ctctcttcaa ttctttatgc cggtgttggg cgcgttattt atcggagttg cagttgcgcc 420
cgcgaacgac atttataatg aacgtgaatt gctcaacagt atgaacattt cgcagcctac 480
cgtagtgttt gtttccaaaa aggggttgca aaaaattttg aacgtgcaaa aaaaattacc 540
aataatccag aaaattatta tcatggattc taaaacggat taccagggat ttcagtcgat 600
gtacacgttc gtcacatctc atctacctcc cggttttaat gaatacgatt ttgtaccaga 660
gtcctttgat cgtgacaaaa caattgcact gataatgaat tcctctggat ctactgggtt 720
acctaagggt gtggcccttc cgcatagaac tgcctgcgtc agattctcgc atgccagaga 780
tcctattttt ggcaatcaaa tcattccgga tactgcgatt ttaagtgttg ttccattcca 840
tcacggtttt ggaatgttta ctacactcgg atatttgata tgtggatttc gagtcgtctt 900
aatgtataga tttgaagaag agctgttttt acgatccctt caggattaca aaattcaaag 960
tgcgttgcta gtaccaaccc tattttcatt cttcgccaaa agcactctga ttgacaaata 1020
cgatttatct aatttacacg aaattgcttc tgggggcgca cctctttcga aagaagtcgg 1080
ggaagcggtt gcaaaacgct tccatcttcc agggatacga caaggatatg ggctcactga 1140
gactacatca gctattctga ttacacccga gggggatgat aaaccgggcg cggtcggtaa 1200
agttgttcca ttttttgaag cgaaggttgt ggatctggat accgggaaaa cgctgggcgt 1260
taatcagaga ggcgaattat gtgtcagagg acctatgatt atgtccggtt atgtaaacaa 1320
tccggaagcg accaacgcct tgattgacaa ggatggatgg ctacattctg gagacatagc 1380
ttactgggac gaagacgaac acttcttcat agttgaccgc ttgaagtctt taattaaata 1440
caaaggatat caggtggccc ccgctgaatt ggaatcgata ttgttacaac accccaacat 1500
cttcgacgcg ggcgtggcag gtcttcccga cgatgacgcc ggtgaacttc ccgccgccgt 1560
tgttgttttg gagcacggaa agacgatgac ggaaaaagag atcgtggatt acgtcgccag 1620
tcaagtaaca accgcgaaaa agttgcgcgg aggagttgtg tttgtggacg aagtaccgaa 1680
aggtcttacc ggaaaactcg acgcaagaaa aatcagagag atcctcataa aggccaagaa 1740
gggcggaaag tccaaattgg tttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaactcga 1860
gcaccaccac caccaccact gagatccggc tgctaacaaa gcccgaaagg aagctgagtt 1920
ggctgctgcc accgctgagc aataactagc ataacccctt ggggcctcta aacgggtctt 1980
gaggggtttt ttg 1993
<210> 2
<211> 1991
<212> DNA
<213>artificial sequence
<400> 2
cgcgaaatta atacgactca ctataggggt atttttacaa caattaccaa caacaacaaa 60
caacaaacaa cattacaatt actatttaca attacaaaaa aaaaatgtct gaagacgcca 120
aaaacataaa gaaaggcccg gcgccattct atcctctaga ggatggaacc gctggagagc 180
aactgcataa ggctatgaag agatacgccc tggttcctgg aacaattgct tttacagatg 240
cacatatcga ggtgaacatc acgtacgcgg aatacttcga aatgtccgtt cggttggcag 300
aagctatgaa acgatatggg ctgaatacaa atcacagaat cgtcgtatgc agtgaaaact 360
ctcttcaatt ctttatgccg gtgttgggcg cgttatttat cggagttgca gttgcgcccg 420
cgaacgacat ttataatgaa cgtgaattgc tcaacagtat gaacatttcg cagcctaccg 480
tagtgtttgt ttccaaaaag gggttgcaaa aaattttgaa cgtgcaaaaa aaattaccaa 540
taatccagaa aattattatc atggattcta aaacggatta ccagggattt cagtcgatgt 600
acacgttcgt cacatctcat ctacctcccg gttttaatga atacgatttt gtaccagagt 660
cctttgatcg tgacaaaaca attgcactga taatgaattc ctctggatct actgggttac 720
ctaagggtgt ggcccttccg catagaactg cctgcgtcag attctcgcat gccagagatc 780
ctatttttgg caatcaaatc attccggata ctgcgatttt aagtgttgtt ccattccatc 840
acggttttgg aatgtttact acactcggat atttgatatg tggatttcga gtcgtcttaa 900
tgtatagatt tgaagaagag ctgtttttac gatcccttca ggattacaaa attcaaagtg 960
cgttgctagt accaacccta ttttcattct tcgccaaaag cactctgatt gacaaatacg 1020
atttatctaa tttacacgaa attgcttctg ggggcgcacc tctttcgaaa gaagtcgggg 1080
aagcggttgc aaaacgcttc catcttccag ggatacgaca aggatatggg ctcactgaga 1140
ctacatcagc tattctgatt acacccgagg gggatgataa accgggcgcg gtcggtaaag 1200
ttgttccatt ttttgaagcg aaggttgtgg atctggatac cgggaaaacg ctgggcgtta 1260
atcagagagg cgaattatgt gtcagaggac ctatgattat gtccggttat gtaaacaatc 1320
cggaagcgac caacgccttg attgacaagg atggatggct acattctgga gacatagctt 1380
actgggacga agacgaacac ttcttcatag ttgaccgctt gaagtcttta attaaataca 1440
aaggatatca ggtggccccc gctgaattgg aatcgatatt gttacaacac cccaacatct 1500
tcgacgcggg cgtggcaggt cttcccgacg atgacgccgg tgaacttccc gccgccgttg 1560
ttgttttgga gcacggaaag acgatgacgg aaaaagagat cgtggattac gtcgccagtc 1620
aagtaacaac cgcgaaaaag ttgcgcggag gagttgtgtt tgtggacgaa gtaccgaaag 1680
gtcttaccgg aaaactcgac gcaagaaaaa tcagagagat cctcataaag gccaagaagg 1740
gcggaaagtc caaattggtt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaactcgagc 1860
accaccacca ccaccactga gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg 1920
ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga 1980
ggggtttttt g 1991
<210> 3
<211> 1989
<212> DNA
<213>artificial sequence
<400> 3
cgcgaaatta atacgactca ctataggggt atttttacaa caattaccaa caacaacaaa 60
caacaaacaa cattacaatt actatttaca attacaaaaa aaatgtctga agacgccaaa 120
aacataaaga aaggcccggc gccattctat cctctagagg atggaaccgc tggagagcaa 180
ctgcataagg ctatgaagag atacgccctg gttcctggaa caattgcttt tacagatgca 240
catatcgagg tgaacatcac gtacgcggaa tacttcgaaa tgtccgttcg gttggcagaa 300
gctatgaaac gatatgggct gaatacaaat cacagaatcg tcgtatgcag tgaaaactct 360
cttcaattct ttatgccggt gttgggcgcg ttatttatcg gagttgcagt tgcgcccgcg 420
aacgacattt ataatgaacg tgaattgctc aacagtatga acatttcgca gcctaccgta 480
gtgtttgttt ccaaaaaggg gttgcaaaaa attttgaacg tgcaaaaaaa attaccaata 540
atccagaaaa ttattatcat ggattctaaa acggattacc agggatttca gtcgatgtac 600
acgttcgtca catctcatct acctcccggt tttaatgaat acgattttgt accagagtcc 660
tttgatcgtg acaaaacaat tgcactgata atgaattcct ctggatctac tgggttacct 720
aagggtgtgg cccttccgca tagaactgcc tgcgtcagat tctcgcatgc cagagatcct 780
atttttggca atcaaatcat tccggatact gcgattttaa gtgttgttcc attccatcac 840
ggttttggaa tgtttactac actcggatat ttgatatgtg gatttcgagt cgtcttaatg 900
tatagatttg aagaagagct gtttttacga tcccttcagg attacaaaat tcaaagtgcg 960
ttgctagtac caaccctatt ttcattcttc gccaaaagca ctctgattga caaatacgat 1020
ttatctaatt tacacgaaat tgcttctggg ggcgcacctc tttcgaaaga agtcggggaa 1080
gcggttgcaa aacgcttcca tcttccaggg atacgacaag gatatgggct cactgagact 1140
acatcagcta ttctgattac acccgagggg gatgataaac cgggcgcggt cggtaaagtt 1200
gttccatttt ttgaagcgaa ggttgtggat ctggataccg ggaaaacgct gggcgttaat 1260
cagagaggcg aattatgtgt cagaggacct atgattatgt ccggttatgt aaacaatccg 1320
gaagcgacca acgccttgat tgacaaggat ggatggctac attctggaga catagcttac 1380
tgggacgaag acgaacactt cttcatagtt gaccgcttga agtctttaat taaatacaaa 1440
ggatatcagg tggcccccgc tgaattggaa tcgatattgt tacaacaccc caacatcttc 1500
gacgcgggcg tggcaggtct tcccgacgat gacgccggtg aacttcccgc cgccgttgtt 1560
gttttggagc acggaaagac gatgacggaa aaagagatcg tggattacgt cgccagtcaa 1620
gtaacaaccg cgaaaaagtt gcgcggagga gttgtgtttg tggacgaagt accgaaaggt 1680
cttaccggaa aactcgacgc aagaaaaatc agagagatcc tcataaaggc caagaagggc 1740
ggaaagtcca aattggttta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa actcgagcac 1860
caccaccacc accactgaga tccggctgct aacaaagccc gaaaggaagc tgagttggct 1920
gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg 1980
ggttttttg 1989
<210> 4
<211> 1983
<212> DNA
<213>artificial sequence
<400> 4
cgcgaaatta atacgactca ctataggggt atttttacaa caattaccaa caacaacaaa 60
caacaaacaa cattacaatt actatttaca attacaatgt ctgaagacgc caaaaacata 120
aagaaaggcc cggcgccatt ctatcctcta gaggatggaa ccgctggaga gcaactgcat 180
aaggctatga agagatacgc cctggttcct ggaacaattg cttttacaga tgcacatatc 240
gaggtgaaca tcacgtacgc ggaatacttc gaaatgtccg ttcggttggc agaagctatg 300
aaacgatatg ggctgaatac aaatcacaga atcgtcgtat gcagtgaaaa ctctcttcaa 360
ttctttatgc cggtgttggg cgcgttattt atcggagttg cagttgcgcc cgcgaacgac 420
atttataatg aacgtgaatt gctcaacagt atgaacattt cgcagcctac cgtagtgttt 480
gtttccaaaa aggggttgca aaaaattttg aacgtgcaaa aaaaattacc aataatccag 540
aaaattatta tcatggattc taaaacggat taccagggat ttcagtcgat gtacacgttc 600
gtcacatctc atctacctcc cggttttaat gaatacgatt ttgtaccaga gtcctttgat 660
cgtgacaaaa caattgcact gataatgaat tcctctggat ctactgggtt acctaagggt 720
gtggcccttc cgcatagaac tgcctgcgtc agattctcgc atgccagaga tcctattttt 780
ggcaatcaaa tcattccgga tactgcgatt ttaagtgttg ttccattcca tcacggtttt 840
ggaatgttta ctacactcgg atatttgata tgtggatttc gagtcgtctt aatgtataga 900
tttgaagaag agctgttttt acgatccctt caggattaca aaattcaaag tgcgttgcta 960
gtaccaaccc tattttcatt cttcgccaaa agcactctga ttgacaaata cgatttatct 1020
aatttacacg aaattgcttc tgggggcgca cctctttcga aagaagtcgg ggaagcggtt 1080
gcaaaacgct tccatcttcc agggatacga caaggatatg ggctcactga gactacatca 1140
gctattctga ttacacccga gggggatgat aaaccgggcg cggtcggtaa agttgttcca 1200
ttttttgaag cgaaggttgt ggatctggat accgggaaaa cgctgggcgt taatcagaga 1260
ggcgaattat gtgtcagagg acctatgatt atgtccggtt atgtaaacaa tccggaagcg 1320
accaacgcct tgattgacaa ggatggatgg ctacattctg gagacatagc ttactgggac 1380
gaagacgaac acttcttcat agttgaccgc ttgaagtctt taattaaata caaaggatat 1440
caggtggccc ccgctgaatt ggaatcgata ttgttacaac accccaacat cttcgacgcg 1500
ggcgtggcag gtcttcccga cgatgacgcc ggtgaacttc ccgccgccgt tgttgttttg 1560
gagcacggaa agacgatgac ggaaaaagag atcgtggatt acgtcgccag tcaagtaaca 1620
accgcgaaaa agttgcgcgg aggagttgtg tttgtggacg aagtaccgaa aggtcttacc 1680
ggaaaactcg acgcaagaaa aatcagagag atcctcataa aggccaagaa gggcggaaag 1740
tccaaattgg tttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaactcga gcaccaccac 1860
caccaccact gagatccggc tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc 1920
accgctgagc aataactagc ataacccctt ggggcctcta aacgggtctt gaggggtttt 1980
ttg 1983
<210> 5
<211> 16
<212> RNA
<213>artificial sequence
<400> 5
aaaaaaaaaa augucu 16
<210> 6
<211> 717
<212> DNA
<213>artificial sequence
<400> 6
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180
ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag 240
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaag 717
<210> 7
<211> 717
<212> DNA
<213>artificial sequence
<400> 7
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120
ggcaagctga ccctgaagct gatctgcacc accggcaagc tgcccgtgcc ctggcccacc 180
ctcgtgacca ccctgggcta cggcctgcag tgcttcgccc gctaccccga ccacatgaag 240
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420
aagctggagt acaactacaa cagccacaac gtctatatca ccgccgacaa gcagaagaac 480
ggcatcaagg ccaacttcaa gatccgccac aacatcgagg acggcggcgt gcagctcgcc 540
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600
tacctgagct accagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaag 717
<210> 8
<211> 3048
<212> DNA
<213>artificial sequence
<400> 8
atggtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 60
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 120
tcccaacagt tgcgcagcct gaatggcgaa tggcgctttg cctggtttcc ggcaccagaa 180
gcggtgccgg aaagctggct ggagtgcgat cttcctgagg ccgatactgt cgtcgtcccc 240
tcaaactggc agatgcacgg ttacgatgcg cccatctaca ccaacgtgac ctatcccatt 300
acggtcaatc cgccgtttgt tcccacggag aatccgacgg gttgttactc gctcacattt 360
aatgttgatg aaagctggct acaggaaggc cagacgcgaa ttatttttga tggcgttaac 420
tcggcgtttc atctgtggtg caacgggcgc tgggtcggtt acggccagga cagtcgtttg 480
ccgtctgaat ttgacctgag cgcattttta cgcgccggag aaaaccgcct cgcggtgatg 540
gtgctgcgct ggagtgacgg cagttatctg gaagatcagg atatgtggcg gatgagcggc 600
attttccgtg acgtctcgtt gctgcataaa ccgactacac aaatcagcga tttccatgtt 660
gccactcgct ttaatgatga tttcagccgc gctgtactgg aggctgaagt tcagatgtgc 720
ggcgagttgc gtgactacct acgggtaaca gtttctttat ggcagggtga aacgcaggtc 780
gccagcggca ccgcgccttt cggcggtgaa attatcgatg agcgtggtgg ttatgccgat 840
cgcgtcacac tacgtctgaa cgtcgaaaac ccgaaactgt ggagcgccga aatcccgaat 900
ctctatcgtg cggtggttga actgcacacc gccgacggca cgctgattga agcagaagcc 960
tgcgatgtcg gtttccgcga ggtgcggatt gaaaatggtc tgctgctgct gaacggcaag 1020
ccgttgctga ttcgaggcgt taaccgtcac gagcatcatc ctctgcatgg tcaggtcatg 1080
gatgagcaga cgatggtgca ggatatcctg ctgatgaagc agaacaactt taacgccgtg 1140
cgctgttcgc attatccgaa ccatccgctg tggtacacgc tgtgcgaccg ctacggcctg 1200
tatgtggtgg atgaagccaa tattgaaacc cacggcatgg tgccaatgaa tcgtctgacc 1260
gatgatccgc gctggctacc ggcgatgagc gaacgcgtaa cgcgaatggt gcagcgcgat 1320
cgtaatcacc cgagtgtgat catctggtcg ctggggaatg aatcaggcca cggcgctaat 1380
cacgacgcgc tgtatcgctg gatcaaatct gtcgatcctt cccgcccggt gcagtatgaa 1440
ggcggcggag ccgacaccac ggccaccgat attatttgcc cgatgtacgc gcgcgtggat 1500
gaagaccagc ccttcccggc tgtgccgaaa tggtccatca aaaaatggct ttcgctacct 1560
ggagagacgc gcccgctgat cctttgcgaa tacgcccacg cgatgggtaa cagtcttggc 1620
ggtttcgcta aatactggca ggcgtttcgt cagtatcccc gtttacaggg cggcttcgtc 1680
tgggactggg tggatcagtc gctgattaaa tatgatgaaa acggcaaccc gtggtcggct 1740
tacggcggtg attttggcga tacgccgaac gatcgccagt tctgtatgaa cggtctggtc 1800
tttgccgacc gcacgccgca tccagcgctg acggaagcaa aacaccagca gcagtttttc 1860
cagttccgtt tatccgggca aaccatcgaa gtgaccagcg aatacctgtt ccgtcatagc 1920
gataacgagc tcctgcactg gatggtggcg ctggatggta agccgctggc aagcggtgaa 1980
gtgcctctgg atgtcgctcc acaaggtaaa cagttgattg aactgcctga actaccgcag 2040
ccggagagcg ccgggcaact ctggctcaca gtacgcgtag tgcaaccgaa cgcgaccgca 2100
tggtcagaag ccggacacat cagcgcctgg cagcagtggc gtctggctga aaacctcagc 2160
gtgacactcc ccgccgcgtc ccacgccatc ccgcatctga ccaccagcga aatggatttt 2220
tgcatcgagc tgggtaataa gcgttggcaa tttaaccgcc agtcaggctt tctttcacag 2280
atgtggattg gcgataaaaa acaactgctg acgccgctgc gcgatcagtt cacccgtgca 2340
ccgctggata acgacattgg cgtaagtgaa gcgacccgca ttgaccctaa cgcctgggtc 2400
gaacgctgga aggcggcggg ccattaccag gccgaagcag cgttgttgca gtgcacggca 2460
gatacacttg ctgatgcggt gctgattacg accgctcacg cgtggcagca tcaggggaaa 2520
accttattta tcagccggaa aacctaccgg attgatggta gtggtcaaat ggcgattacc 2580
gttgatgttg aagtggcgag cgatacaccg catccggcgc ggattggcct gaactgccag 2640
ctggcgcagg tagcagagcg ggtaaactgg ctcggattag ggccgcaaga aaactatccc 2700
gaccgcctta ctgccgcctg ttttgaccgc tgggatctgc cattgtcaga catgtatacc 2760
ccgtacgtct tcccgagcga aaacggtctg cgctgcggga cgcgcgaatt gaattatggc 2820
ccacaccagt ggcgcggcga cttccagttc aacatcagcc gctacagtca acagcaactg 2880
atggaaacca gccatcgcca tctgctgcac gcggaagaag gcacatggct gaatatcgac 2940
ggtttccata tggggattgg tggcgacgac tcctggagcc cgtcagtatc ggcggaattc 3000
cagctgagcg ccggtcgcta ccattaccag ttggtctggt gtcaaaaa 3048
<210> 9
<211> 1935
<212> DNA
<213>artificial sequence
<400> 9
atggcggccg tgcaggcggc cgaggtgaaa gtggatggca gcgagccgaa actgagcaag 60
aatgagctga agagacgcct gaaagctgag aagaaagtag cagagaagga ggccaaacag 120
aaagagctca gtgagaaaca gctaagccaa gccactgctg ctgccaccaa ccacaccact 180
gataatggtg tgggtcctga ggaagagagc gtggacccaa atcaatacta caaaatccgc 240
agtcaagcaa ttcatcagct gaaggtcaat ggggaagacc catacccaca caagttccat 300
gtagacatct cactcactga cttcatccaa aaatatagtc acctgcagcc tggggatcac 360
ctgactgaca tcaccttaaa ggtggcaggt aggatccatg ccaaaagagc ttctggggga 420
aagctcatct tctatgatct tcgaggagag ggggtgaagt tgcaagtcat ggccaattcc 480
agaaattata aatcagaaga agaatttatt catattaata acaaactgcg tcggggagac 540
ataattggag ttcaggggaa tcctggtaaa accaagaagg gtgagctgag catcattccg 600
tatgagatca cactgctgtc tccctgtttg catatgttac ctcatcttca ctttggcctc 660
aaagacaagg aaacaaggta tcgccagaga tacttggact tgatcctgaa tgactttgtg 720
aggcagaaat ttatcatccg ctctaagatc atcacatata taagaagttt cttagatgag 780
ctgggattcc tagagattga aactcccatg atgaacatca tcccaggggg agccgtggcc 840
aagcctttca tcacttatca caacgagctg gacatgaact tatatatgag aattgctcca 900
gaactctatc ataagatgct tgtggttggt ggcatcgacc gggtttatga aattggacgc 960
cagttccgga atgaggggat tgatttgacg cacaatcctg agttcaccac ctgtgagttc 1020
tacatggcct atgcagacta tcacgatctc atggaaatca cggagaagat ggtttcaggg 1080
atggtgaagc atattacagg cagttacaag gtcacctacc acccagatgg cccagagggc 1140
caagcctacg atgttgactt caccccaccc ttccggcgaa tcaacatggt agaagagctt 1200
gagaaagccc tggggatgaa gctgccagaa acgaacctct ttgaaactga agaaactcgc 1260
aaaattcttg atgatatctg tgtggcaaaa gctgttgaat gccctccacc tcggaccaca 1320
gccaggctcc ttgacaagct tgttggggag ttcctggaag tgacttgcat caatcctaca 1380
ttcatctgtg atcacccaca gataatgagc cctttggcta aatggcaccg ctctaaagag 1440
ggtctgactg agcgctttga gctgtttgtc atgaagaaag agatatgcaa tgcgtatact 1500
gagctgaatg atcccatgcg gcagcggcag ctttttgaag aacaggccaa ggccaaggct 1560
gcaggtgatg atgaggccat gttcatagat gaaaacttct gtactgccct ggaatatggg 1620
ctgcccccca cagctggctg gggcatgggc attgatcgag tcgccatgtt tctcacggac 1680
tccaacaaca tcaaggaagt acttctgttt cctgccatga aacccgaaga caagaaggag 1740
aatgtagcaa ccactgatac actggaaagc acaacagttg gcacttctgt ctagaaaata 1800
ataattgcaa gttgtataac tcaggcgtct ttgcatttct gcgaaagatc aaggtctgca 1860
agggaattct tgtgtgctgc tttccatttg acaccgcagt tctgttcagc catcagaaga 1920
gagacaagga attaa 1935
<210> 10
<211> 3531
<212> DNA
<213>artificial sequence
<400> 10
atggcggaaa gaaaaggaac agccaaagtg gactttttga agaagattga gaaagaaatc 60
caacagaaat gggatactga gagagtgttt gaggtcaatg catctaattt agagaaacag 120
accagcaagg gcaagtattt tgtaaccttc ccatatccat atatgaatgg acgccttcat 180
ttgggacaca cgttttcttt atccaaatgt gagtttgctg tagggtacca gcgattgaaa 240
ggaaaatgtt gtctgtttcc ctttggcctg cactgtactg gaatgcctat taaggcatgt 300
gctgataagt tgaaaagaga aatagagctg tatggttgcc cccctgattt tccagatgaa 360
gaagaggaag aggaagaaac cagtgttaaa acagaagata taataattaa ggataaagct 420
aaaggaaaaa agagtaaagc tgctgctaaa gctggatctt ctaaatacca gtggggcatt 480
atgaaatccc ttggcctgtc tgatgaagag atagtaaaat tttctgaagc agaacattgg 540
cttgattatt tcccgccact ggctattcag gatttaaaaa gaatgggttt gaaggtagac 600
tggcgtcgtt ccttcatcac cactgatgtt aatccttact atgattcatt tgtcagatgg 660
caatttttaa cattaagaga aagaaacaaa attaaatttg ggaagcggta tacaatttac 720
tctccgaaag atggacagcc ttgcatggat catgatagac aaactggaga gggtgttgga 780
cctcaggaat atactttact caaattgaag gtgcttgagc catacccatc taaattaagt 840
ggcctgaaag gtaaaaatat tttcttggtg gctgctactc tcagacctga gaccatgttt 900
gggcagacaa attgttgggt tcgtcctgat atgaagtaca ttggatttga gacggtgaat 960
ggtgatatat tcatctgtac ccaaaaagca gccaggaata tgtcatacca gggctttacc 1020
aaagacaatg gcgtggtgcc tgttgttaag gaattaatgg gggaggaaat tcttggtgca 1080
tcactttctg cacctttaac atcatacaag gtgatctatg ttctcccaat gctaactatt 1140
aaggaggata aaggcactgg tgtggttaca agtgttcctt ccgactcccc tgatgatatt 1200
gctgccctca gagacttgaa gaaaaagcaa gccttacgag caaaatatgg aattagagat 1260
gacatggtct tgccatttga gccggtgcca gtcattgaaa tcccaggttt tggaaatctt 1320
tctgctgtaa ccatttgtga tgagttgaaa attcagagcc agaatgaccg ggaaaaactt 1380
gcagaagcaa aggagaagat atatctaaaa ggattttatg agggtatcat gttggtggat 1440
ggatttaaag gacagaaggt tcaagatgta aagaagacta ttcagaaaaa gatgattgac 1500
gctggagatg cacttattta catggaacca gagaaacaag tgatgtccag gtcgtcagat 1560
gaatgtgttg tggctctgtg tgaccagtgg tacttggatt atggagaaga gaattggaag 1620
aaacagacat ctcagtgctt gaagaacctg gaaacattct gtgaggagac caggaggaat 1680
tttgaagcca ccttaggttg gctacaagaa catgcttgct caagaactta tggtctaggc 1740
actcacctgc cttgggatga gcagtggctg attgaatcac tttctgactc cactatttac 1800
atggcatttt acacagttgc acacctattg caggggggta acttgcatgg acaggcagag 1860
tctccgctgg gcattagacc gcaacagatg accaaggaag tttgggatta tgttttcttc 1920
aaggaggctc catttcctaa gactcagatt gcaaaggaaa aattagatca gttaaagcag 1980
gagtttgaat tctggtatcc tgttgatctt cgcgtctctg gcaaggatct tgttccaaat 2040
catctttcat attaccttta taatcatgtg gctatgtggc cggaacaaag tgacaaatgg 2100
cctacagctg tgagagcaaa tggacatctc ctcctgaact ctgagaagat gtcaaaatcc 2160
acaggcaact tcctcacttt gacccaagct attgacaaat tttcagcaga tggaatgcgt 2220
ttggctctgg ctgatgctgg tgacactgta gaagatgcca actttgtgga agccatggca 2280
gatgcaggta ttctccgtct gtacacctgg gtagagtggg tgaaagaaat ggttgccaac 2340
tgggacagcc taagaagtgg tcctgccagc actttcaatg atagagtttt tgccagtgaa 2400
ttgaatgcag gaattataaa aacagatcaa aactatgaaa agatgatgtt taaagaagct 2460
ttgaaaacag ggttttttga gtttcaggcc gcaaaagata agtaccgtga attggctgtg 2520
gaagggatgc acagagaact tgtgttccgg tttattgaag ttcagacact tctcctcgct 2580
ccattctgtc cacatttgtg tgagcacatc tggacactcc tgggaaagcc tgactcaatt 2640
atgaatgctt catggcctgt ggcaggtcct gttaatgaag ttttaataca ctcctcacag 2700
tatcttatgg aagtaacaca tgaccttaga ctacgactca agaactatat gatgccagct 2760
aaagggaaga agactgacaa acaacccctg cagaagccct cacattgcac catctatgtg 2820
gcaaagaact atccaccttg gcaacatacc accctgtctg ttctacgtaa acactttgag 2880
gccaataacg gaaaactgcc tgacaacaaa gtcattgcta gtgaactagg cagtatgcca 2940
gaactgaaga aatacatgaa gaaagtcatg ccatttgttg ccatgattaa ggaaaatctg 3000
gagaagatgg ggcctcgtat tctggatttg caattagaat ttgatgaaaa ggctgtgctt 3060
atggagaata tagtctatct gactaattcg cttgagctag aacacataga agtcaagttt 3120
gcctccgaag cagaagataa aatcagggaa gactgctgtc ctgggaaacc acttaatgtt 3180
tttagaatag aacctggtgt gtccgtttct ctggtgaatc cccagccatc caatggccac 3240
ttctcaacca aaattgaaat caggcaagga gataactgtg attccataat caggcgttta 3300
atgaaaatga atcgaggaat taaagacctt tccaaagtga aactgatgag atttgatgat 3360
ccactgttgg ggcctcgacg agttcctgtc ctgggaaagg agtacaccga gaagaccccc 3420
atttctgagc atgctgtttt caatgtggac ctcatgagca agaaaattca tctgactgag 3480
aatgggataa gggtggatat tggcgataca ataatctatc tggttcatta a 3531
<210> 11
<211> 1017
<212> DNA
<213>artificial sequence
<400> 11
atggctgaca agaagattag gatcggaatc aacggattcg gaagaattgg tcgtttggtt 60
gctagagttg ttctccagag ggacgatgtt gagctcgtcg ctgtcaacga ccccttcatc 120
actactgagt acatgaccta catgttcaag tacgacagtg ttcacggtca atggaaacac 180
aatgaactca agatcaagga tgagaagacc cttctcttcg gtgagaagcc agtcactgtt 240
ttcggcatca ggaaccctga ggatatccca tgggccgagg ctggagctga ctacgttgtt 300
gagtctactg gtgtcttcac tgacaaagac aaggctgcag ctcacttgaa gggtggtgcc 360
aagaaggttg ttatctctga acccagcaaa gacgctccaa tgtttgttgt tggtgtcaac 420
gagcacgaat acaagtccga ccttgacatt gtctccaacg ctagctgcac cactaactgc 480
cttgctcccc ttgccaaggt tatcaatgac agatttggaa ttgttgaggg tcttatgact 540
acagtccact caatcactgc tactcagaag actgttgatg ggccttcaat gaaggactgg 600
agaggtggaa gagctgcttc attcaacatt attcccagca gcactggagc tgccaaggct 660
gtcggaaagg tgcttccagc tcttaacgga aagttgactg gaatgtcttt ccgtgtccca 720
accgttgatg tctcagttgt tgaccttact gtcagactcg agaaagctgc tacctacgaa 780
gaaatcaaaa aggctatcaa ggaggaatcc gaaggcaaac tcaagggaat ccttggatac 840
accgaggatg atgttgtctc aactgacttc gttggcgaca acaggtcgag catttttgac 900
gccaaggctg gaattgcatt gagcgacaag tttgtgaaat tggtgtcatg gtacgacaac 960
gaatggggtt acagttcccg tgtggtcgac ttgatcgtcc acatgtcaaa ggcctaa 1017
<210> 12
<211> 1584
<212> DNA
<213>artificial sequence
<400> 12
atgtcggaca gtcgggaccc agccagcgac cagatgaagc agtggaagga gcagcgggcc 60
tcgcagagac cagatgtcct gaccaccgga ggcgggaacc caataggaga taaacttaat 120
atcatgaccg cggggtcccg agggcccctc ctcgttcagg atgtggtttt cactgacgag 180
atggcacact ttgacagaga gcggattcct gagagagtgg tacacgcaaa aggagcaggt 240
gcttttggat actttgaggt cacccacgat atcaccagat actccaaggc aaaggtgttt 300
gagcatattg gaaagaggac ccctattgcc gttcgattct ccacagtcgc tggagagtca 360
ggctcagctg acacagttcg tgaccctcgg gggtttgcag tgaaatttta cactgaagat 420
ggtaactggg atcttgtggg aaacaacacc cctattttct tcatcaggga tgccatattg 480
tttccatcct ttatccatag ccagaagaga aacccacaga ctcacctgaa ggatcctgac 540
atggtctggg acttctggag tcttcgtccc gagtctctcc atcaggtttc tttcttgttc 600
agtgaccgag ggattcccga tggtcaccgg cacatgaatg gctatggatc acacaccttc 660
aagttggtta atgcagatgg agaggcagtc tattgcaagt tccattacaa gaccgaccag 720
ggcatcaaaa acttgcctgt tggagaggca ggaaggcttg ctcaggaaga tccggattat 780
ggcctccgag atcttttcaa tgccatcgcc aatggcaatt acccgtcctg gacgttttac 840
atccaggtca tgacttttaa ggaggcagaa actttcccat ttaatccatt tgatctgacc 900
aaggtttggc ctcacaagga ctaccctctt ataccagttg gcaaactggt tttaaacaaa 960
aatccagtta attactttgc tgaagttgaa cagatggctt ttgacccaag caatatgccc 1020
cctggcatcg agcccagccc tgacaaaatg cttcagggcc gcctttttgc ctacccggac 1080
actcaccgcc accgcctggg acccaactat ctgcagatac ctgtgaactg tccctaccgc 1140
gctcgagtgg ccaactacca gcgtgatggc cccatgtgca tgcatgacaa ccagggtggt 1200
gcccccaact attaccccaa cagcttcagc gcaccagagc agcagcgctc agccctggag 1260
cacagcgtcc agtgcgctgt agatgtgaaa cgcttcaaca gtgctaatga agacaatgtc 1320
actcaggtgc ggacattcta cacaaaggtg ttgaatgagg aggagaggaa acgcctgtgt 1380
gagaacattg ctggccacct gaaggacgct cagcttttca ttcagaagaa agcggtcaag 1440
aatttcactg acgtccaccc tgactatggg gcccgcatcc aggctcttct ggacaagtac 1500
aacgctgaga agcctaagaa cgcaattcac acctacacgc aggccggctc tcacatggct 1560
gcgaagggaa aagctaacct gtaa 1584
<210> 13
<211> 99
<212> DNA
<213>artificial sequence
<400> 13
caacaattac caacaacaac aaacaacaaa caacattaca attactattt acaattacaa 60
aaaaaatgtc tgaagacgcc aaaaacataa agaaaggcc 99
<210> 14
<211> 53
<212> DNA
<213>artificial sequence
<400> 14
ttttttttgt aattgtaaat agtaattgta atgttgtttg ttgtttgttg ttg 53
<210> 15
<211> 98
<212> DNA
<213>artificial sequence
<400> 15
caattaccaa caacaacaaa caacaaacaa cattacaatt actatttaca attacaaaaa 60
aaaaatgtct gaagacgcca aaaacataaa gaaaggcc 98
<210> 16
<211> 99
<212> DNA
<213>artificial sequence
<400> 16
aattaccaac aacaacaaac aacaaacaac attacaatta ctatttacaa ttacaaaaaa 60
aaaaaatgtc tgaagacgcc aaaaacataa agaaaggcc 99
<210> 17
<211> 27
<212> DNA
<213>artificial sequence
<400> 17
cgcgaaatta atacgactca ctatagg 27
<210> 18
<211> 24
<212> DNA
<213>artificial sequence
<400> 18
tccggatata gttcctcctt tcag 24
Claims (10)
1. a kind of nucleic acid constructs, which is characterized in that the construction has from 5 ' to 3 ' Formulas I structure:
Z1-Z2-Z3-Z4 (I)
In formula,
Z1, Z2, Z3, Z4 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Also, described Z2, Z3, Z4 have collectively constituted Kozak sequence, and the Kozak sequence derives from Kluyveromyces yeast.
2. nucleic acid constructs as described in claim 1, which is characterized in that the Kluyveromyces yeast is selected from the group: cream
Sour kluyveromyces, kluyveromyces marxianus, more cloth kluyveromyces, or combinations thereof.
3. a kind of nucleic acid constructs, which is characterized in that the construction has from 5 ' to 3 ' Formula II structure:
Z1-Z2-Z3-Z4-Z5 (II)
In formula,
Z1, Z2, Z3, Z4, Z5 are respectively the element for being used to constitute the construction;
Each "-" independently is key or nucleotide catenation sequence;
Z1 is 5 ' end leader sequence (leading sequence)-Ω sequences of tobacco mosaic virus (TMV);
Z2 is the oligomerization chain [oligo (A)] of adenyl-deoxyribonucleotiden;
Z3 is translation initiation codon;
Z4 is serine codon;
Z5 is the coded sequence of foreign protein;
Also, described Z2, Z3, Z4 have collectively constituted Kozak sequence, and the Kozak sequence derives from Kluyveromyces yeast.
4. nucleic acid constructs as claimed in claim 3, which is characterized in that the coded sequence of the foreign protein is selected from down
Group: coding fluorescence fibroin or luciferase (such as firefly luciferase), green fluorescent protein, yellow fluorescence protein, aminoacyl
TRNA synzyme, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, the exogenous DNA of the Variable Area of antibody, firefly
The DNA of light element enzyme mutant, or combinations thereof.
5. a kind of carrier or carrier combination, which is characterized in that the carrier or carrier combination are containing described in claim 1 or 3
Nucleic acid constructs.
6. a kind of genetically engineered cell, which is characterized in that one or more sites of the genome of the genetically engineered cell are whole
Conjunction have the right to require 1 or 3 described in nucleic acid constructs or the genetically engineered cell containing the load described in claim 5
Body or carrier combination.
7. a kind of kit, which is characterized in that the reagent for including in the kit one of is selected from the group or a variety of:
(a) construction described in claim 1 or 3;
(b) carrier described in claim 5 or carrier combination;With
(c) genetically engineered cell as claimed in claim 6.
8. described in carrier described in construction as claimed in claim 1 or 3, claim 5 or carrier combination, claim 6
Genetically engineered cell or claim 7 described in kit purposes, which is characterized in that for carrying out high-throughput external albumen
Synthesis.
9. a kind of protein synthesis methods of external high throughput, which is characterized in that comprising steps of
(i) outside yeast in the presence of albumen synthetic system, nucleic acid constructs as claimed in claim 3 is provided;
(ii) under the suitable conditions, the outer albumen synthetic system of the yeast of incubation step (i) T1 for a period of time, to synthesize
The foreign protein.
10. method as claimed in claim 9, which is characterized in that the method also includes: (iii) is optionally from the yeast
In external albumen synthetic system, the foreign protein is separated or detected.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710430876.3A CN109022478A (en) | 2017-06-09 | 2017-06-09 | A kind of DNA element for high-throughput protein synthesis in vitro |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710430876.3A CN109022478A (en) | 2017-06-09 | 2017-06-09 | A kind of DNA element for high-throughput protein synthesis in vitro |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109022478A true CN109022478A (en) | 2018-12-18 |
Family
ID=64628733
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710430876.3A Pending CN109022478A (en) | 2017-06-09 | 2017-06-09 | A kind of DNA element for high-throughput protein synthesis in vitro |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109022478A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020239111A1 (en) | 2019-05-30 | 2020-12-03 | 康码(上海)生物科技有限公司 | Method for quantitative co-expressing multiple proteins in vitro and application thereof |
WO2021104435A1 (en) | 2019-11-30 | 2021-06-03 | 康码(上海)生物科技有限公司 | Biomagnetic microsphere and preparation method therefor and use thereof |
CN114085864A (en) * | 2021-09-14 | 2022-02-25 | 福建农林大学 | A specific inducible expression system for peripheral cells of megasporoblasts |
WO2023126009A1 (en) | 2021-12-31 | 2023-07-06 | 康码(上海)生物科技有限公司 | Polymer molecule, monomeric structure and polymeric structure comprising same |
WO2024114738A1 (en) | 2022-11-30 | 2024-06-06 | 康码(上海)生物科技有限公司 | Recombinant hemoglobin |
CN118755749A (en) * | 2024-09-06 | 2024-10-11 | 中国农业科学院北京畜牧兽医研究所 | A method for weakening the expression of gene hemB of Corynebacterium glutamicum and increasing the yield of ALA |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1032367A (en) * | 1987-07-28 | 1989-04-12 | 吉斯特-布罗卡迪斯 | The DAN that contains directing heterologous polypeptide excretory kluyveromyces-factor leader sequence is set up thing |
CN1336436A (en) * | 2001-09-05 | 2002-02-20 | 南京大学 | Method of improving soluble expression of foreign gene in colon bacillus |
CN1778923A (en) * | 2005-10-13 | 2006-05-31 | 上海交通大学 | Gene sequence specifically expressed by human granulocyte-macrophage colony-stimulating factor |
CN102753700A (en) * | 2009-09-04 | 2012-10-24 | 先正达参股股份有限公司 | Stacking of translational enhancer elements to increase polypeptide expression in plants |
-
2017
- 2017-06-09 CN CN201710430876.3A patent/CN109022478A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1032367A (en) * | 1987-07-28 | 1989-04-12 | 吉斯特-布罗卡迪斯 | The DAN that contains directing heterologous polypeptide excretory kluyveromyces-factor leader sequence is set up thing |
CN1336436A (en) * | 2001-09-05 | 2002-02-20 | 南京大学 | Method of improving soluble expression of foreign gene in colon bacillus |
CN1778923A (en) * | 2005-10-13 | 2006-05-31 | 上海交通大学 | Gene sequence specifically expressed by human granulocyte-macrophage colony-stimulating factor |
CN102753700A (en) * | 2009-09-04 | 2012-10-24 | 先正达参股股份有限公司 | Stacking of translational enhancer elements to increase polypeptide expression in plants |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020239111A1 (en) | 2019-05-30 | 2020-12-03 | 康码(上海)生物科技有限公司 | Method for quantitative co-expressing multiple proteins in vitro and application thereof |
WO2021104435A1 (en) | 2019-11-30 | 2021-06-03 | 康码(上海)生物科技有限公司 | Biomagnetic microsphere and preparation method therefor and use thereof |
CN114085864A (en) * | 2021-09-14 | 2022-02-25 | 福建农林大学 | A specific inducible expression system for peripheral cells of megasporoblasts |
CN114085864B (en) * | 2021-09-14 | 2024-01-19 | 福建农林大学 | Megasporocyte peripheral cell specificity induction expression system |
WO2023126009A1 (en) | 2021-12-31 | 2023-07-06 | 康码(上海)生物科技有限公司 | Polymer molecule, monomeric structure and polymeric structure comprising same |
WO2024114738A1 (en) | 2022-11-30 | 2024-06-06 | 康码(上海)生物科技有限公司 | Recombinant hemoglobin |
CN118755749A (en) * | 2024-09-06 | 2024-10-11 | 中国农业科学院北京畜牧兽医研究所 | A method for weakening the expression of gene hemB of Corynebacterium glutamicum and increasing the yield of ALA |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109022478A (en) | A kind of DNA element for high-throughput protein synthesis in vitro | |
US9745588B2 (en) | Transcription terminator sequences | |
CN111778169B (en) | Method for improving in vitro protein synthesis efficiency | |
CN109971775B (en) | Nucleic acid construct and method for regulating protein synthesis by using same | |
JP7058839B2 (en) | Cell-free protein expression using rolling circle amplification products | |
CN110408635B (en) | Application of nucleic acid construct containing streptavidin element in protein expression and purification | |
CN110408636A (en) | The concatenated DNA sequence dna of multiple label and its application in protein expression purification system | |
JP7028986B2 (en) | Tandem DNA element that can increase protein synthesis efficiency | |
US10350307B2 (en) | In vivo RNA or protein expression using double-stranded concatemeric DNA including phosphorothioated nucleotides | |
CN110551745A (en) | Multiple histidine sequence tag and application thereof in protein expression and purification | |
CN110938649A (en) | Protein synthesis system for improving expression quantity of foreign protein and application method thereof | |
KR20200036925A (en) | Cell-free protein expression using double-stranded concatemer DNA | |
CN110845622A (en) | Preparation of fusion protein with deletion of different structural domains and application of fusion protein in improvement of protein synthesis | |
CN109423497A (en) | Enhance the RNA element of protein synthesis efficiency | |
EP3574099B1 (en) | Promoter construct for cell-free protein synthesis | |
CN113215005A (en) | In-vitro cell-free protein synthesis system (D2P system), kit and application thereof | |
CN113493813A (en) | External source magnesium ion-containing in-vitro cell-free protein synthesis system and kit and application thereof | |
CN111778270A (en) | Method for reflecting in vitro cell-free protein expression level by integrating luminescent reporter gene | |
CN111363742B (en) | A method for inhibiting virulence of Streptococcus mutans by point mutation based on codon preference | |
WO2023198006A1 (en) | Method for preparing s-lactoylglutathione | |
WO2024126847A1 (en) | Mrna recombinant capping enzymes | |
WO2025079564A1 (en) | Method for preparing protein sample, method for ligation of nucleic acid, method for preparing ngs library, and kit | |
CN117683804A (en) | Nucleic acid construct and application thereof in IVTT system | |
EP1525305A2 (en) | Methods and compositions for in vitro synthesis of biological macromolecules in a cell-free system enriched with atp-sulfurylase | |
JP2004344029A (en) | Gene-destroyed microorganism and method for producing pyrimidine nucleoside compound therewith |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: Building No. 12, 118 Lane, Furong Hualu, Pudong New Area, Shanghai, 201321 Applicant after: Kang code (Shanghai) Biological Technology Co., Ltd. Address before: 3rd Floor, No. 8 Building, 500 Lane, Furong Hualu, Pudong New Area, Shanghai, 201321 Applicant before: Kang code (Shanghai) Biological Technology Co., Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20181218 |
|
RJ01 | Rejection of invention patent application after publication |