WO2025045247A1 - Nucleic acids encoding crispr-associated proteins and uses thereof - Google Patents
Nucleic acids encoding crispr-associated proteins and uses thereof Download PDFInfo
- Publication number
- WO2025045247A1 WO2025045247A1 PCT/CN2024/116339 CN2024116339W WO2025045247A1 WO 2025045247 A1 WO2025045247 A1 WO 2025045247A1 CN 2024116339 W CN2024116339 W CN 2024116339W WO 2025045247 A1 WO2025045247 A1 WO 2025045247A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- mrna
- sequence
- seq
- acid molecule
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims description 109
- 108020004707 nucleic acids Proteins 0.000 title claims description 79
- 102000039446 nucleic acids Human genes 0.000 title claims description 79
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 title claims description 10
- 108020004999 messenger RNA Proteins 0.000 claims description 120
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 53
- 108090000623 proteins and genes Proteins 0.000 claims description 52
- 239000000203 mixture Substances 0.000 claims description 40
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 39
- 108020004414 DNA Proteins 0.000 claims description 36
- 108020003589 5' Untranslated Regions Proteins 0.000 claims description 28
- 108020005004 Guide RNA Proteins 0.000 claims description 28
- 239000013604 expression vector Substances 0.000 claims description 25
- 238000003259 recombinant expression Methods 0.000 claims description 25
- 108020005345 3' Untranslated Regions Proteins 0.000 claims description 24
- 108700026244 Open Reading Frames Proteins 0.000 claims description 22
- 239000013612 plasmid Substances 0.000 claims description 21
- 230000014509 gene expression Effects 0.000 claims description 14
- 238000000034 method Methods 0.000 claims description 12
- 108091026890 Coding region Proteins 0.000 claims description 11
- 241001465754 Metazoa Species 0.000 claims description 10
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 claims description 8
- 230000001939 inductive effect Effects 0.000 claims description 8
- 239000003814 drug Substances 0.000 claims description 7
- 150000002632 lipids Chemical class 0.000 claims description 7
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 7
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 6
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 6
- 238000001415 gene therapy Methods 0.000 claims description 6
- 201000010099 disease Diseases 0.000 claims description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 5
- 238000000338 in vitro Methods 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 4
- 230000009977 dual effect Effects 0.000 claims description 4
- 238000010362 genome editing Methods 0.000 claims description 4
- 239000002105 nanoparticle Substances 0.000 claims description 4
- 230000008685 targeting Effects 0.000 claims description 4
- 238000011282 treatment Methods 0.000 claims description 4
- 239000007788 liquid Substances 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- RKSLVDIXBGWPIS-UAKXSSHOSA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-iodopyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 RKSLVDIXBGWPIS-UAKXSSHOSA-N 0.000 claims description 2
- UVBYMVOUBXYSFV-XUTVFYLZSA-N 1-methylpseudouridine Chemical compound O=C1NC(=O)N(C)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 UVBYMVOUBXYSFV-XUTVFYLZSA-N 0.000 claims description 2
- ZXIATBNUWJBBGT-JXOAFFINSA-N 5-methoxyuridine Chemical compound O=C1NC(=O)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXIATBNUWJBBGT-JXOAFFINSA-N 0.000 claims description 2
- 230000004568 DNA-binding Effects 0.000 claims description 2
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 2
- 229930185560 Pseudouridine Natural products 0.000 claims description 2
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 claims description 2
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 claims description 2
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 claims description 2
- 239000011230 binding agent Substances 0.000 claims description 2
- 238000010367 cloning Methods 0.000 claims description 2
- 239000008194 pharmaceutical composition Substances 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 claims description 2
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 230000001225 therapeutic effect Effects 0.000 claims description 2
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 claims description 2
- 229940045145 uridine Drugs 0.000 claims description 2
- 239000003981 vehicle Substances 0.000 claims description 2
- 108091027544 Subgenomic mRNA Proteins 0.000 claims 2
- 210000004027 cell Anatomy 0.000 description 49
- 239000002773 nucleotide Substances 0.000 description 35
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 33
- 241000699670 Mus sp. Species 0.000 description 19
- 230000000694 effects Effects 0.000 description 18
- 239000012634 fragment Substances 0.000 description 18
- 108091081024 Start codon Proteins 0.000 description 15
- 108020004705 Codon Proteins 0.000 description 14
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 12
- 108091034057 RNA (poly(A)) Proteins 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 11
- 210000003494 hepatocyte Anatomy 0.000 description 10
- 238000001890 transfection Methods 0.000 description 10
- 101000991410 Homo sapiens Nucleolar and spindle-associated protein 1 Proteins 0.000 description 9
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 8
- 239000012091 fetal bovine serum Substances 0.000 description 8
- 238000007481 next generation sequencing Methods 0.000 description 8
- QGWBEETXHOVFQS-UHFFFAOYSA-N 6-[6-(2-hexyldecanoyloxy)hexyl-(4-hydroxybutyl)amino]hexyl 2-hexyldecanoate Chemical compound CCCCCCCCC(CCCCCC)C(=O)OCCCCCCN(CCCCO)CCCCCCOC(=O)C(CCCCCC)CCCCCCCC QGWBEETXHOVFQS-UHFFFAOYSA-N 0.000 description 7
- 150000001413 amino acids Chemical group 0.000 description 7
- 239000000178 monomer Substances 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 6
- 239000008188 pellet Substances 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 210000003462 vein Anatomy 0.000 description 6
- 108020004566 Transfer RNA Proteins 0.000 description 5
- 238000002296 dynamic light scattering Methods 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 108091033409 CRISPR Proteins 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 239000012098 Lipofectamine RNAiMAX Substances 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- -1 lipid amine Chemical class 0.000 description 4
- 239000002479 lipoplex Substances 0.000 description 4
- 210000004185 liver Anatomy 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 102000004169 proteins and genes Human genes 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 230000014621 translational initiation Effects 0.000 description 4
- 206010002091 Anaesthesia Diseases 0.000 description 3
- 108020005098 Anticodon Proteins 0.000 description 3
- 206010003497 Asphyxia Diseases 0.000 description 3
- 238000007400 DNA extraction Methods 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 206010015719 Exsanguination Diseases 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108020005350 Initiator Codon Proteins 0.000 description 3
- PIWKPBJCKXDKJR-UHFFFAOYSA-N Isoflurane Chemical compound FC(F)OC(Cl)C(F)(F)F PIWKPBJCKXDKJR-UHFFFAOYSA-N 0.000 description 3
- 230000002411 adverse Effects 0.000 description 3
- 230000037005 anaesthesia Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 229960002725 isoflurane Drugs 0.000 description 3
- 210000005228 liver tissue Anatomy 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 238000007480 sanger sequencing Methods 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- NRJAVPSFFCBXDT-HUESYALOSA-N 1,2-distearoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCCCC NRJAVPSFFCBXDT-HUESYALOSA-N 0.000 description 2
- 241000180579 Arca Species 0.000 description 2
- 238000010354 CRISPR gene editing Methods 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- 102000002241 Eukaryotic Initiation Factors Human genes 0.000 description 2
- 108010014863 Eukaryotic Initiation Factors Proteins 0.000 description 2
- 108090000331 Firefly luciferases Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- 229930182816 L-glutamine Natural products 0.000 description 2
- 108091027974 Mature messenger RNA Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000005538 encapsulation Methods 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000003999 initiator Substances 0.000 description 2
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000007747 plating Methods 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000002028 premature Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 239000002356 single layer Substances 0.000 description 2
- 238000001179 sorption measurement Methods 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 238000010257 thawing Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- KHWCHTKSEGGWEX-RRKCRQDMSA-N 2'-deoxyadenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 KHWCHTKSEGGWEX-RRKCRQDMSA-N 0.000 description 1
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 description 1
- LTFMZDNNPPEQNG-KVQBGUIXSA-N 2'-deoxyguanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 LTFMZDNNPPEQNG-KVQBGUIXSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- ASJSAQIRZKANQN-UHFFFAOYSA-N 2-deoxypentose Chemical compound OCC(O)C(O)CC=O ASJSAQIRZKANQN-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 101710172824 CRISPR-associated endonuclease Cas9 Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 1
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- DJJCXFVJDGTHFX-UHFFFAOYSA-N Uridinemonophosphate Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-UHFFFAOYSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 1
- 229960003190 adenosine monophosphate Drugs 0.000 description 1
- LNQVTSROQXJCDD-UHFFFAOYSA-N adenosine monophosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)C(OP(O)(O)=O)C1O LNQVTSROQXJCDD-UHFFFAOYSA-N 0.000 description 1
- 239000012574 advanced DMEM Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- 230000003851 biochemical process Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000002619 cancer immunotherapy Methods 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- IERHLVCPSMICTF-XVFCMESISA-N cytidine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 1
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 description 1
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 210000003002 eukaryotic large ribosome subunit Anatomy 0.000 description 1
- 210000004265 eukaryotic small ribosome subunit Anatomy 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000003198 gene knock in Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 1
- 235000013928 guanylic acid Nutrition 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 1
- 238000003468 luciferase reporter gene assay Methods 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229920002113 octoxynol Polymers 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- AJAMRCUNWLZBDF-UHFFFAOYSA-N propyl octadeca-9,12-dienoate Chemical compound CCCCCC=CCC=CCCCCCCCC(=O)OCCC AJAMRCUNWLZBDF-UHFFFAOYSA-N 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 230000009712 regulation of translation Effects 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000009469 supplementation Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DJJCXFVJDGTHFX-XVFCMESISA-N uridine 5'-monophosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-XVFCMESISA-N 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/0008—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
- A61K48/0025—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition wherein the non-active part clearly interacts with the delivered nucleic acid
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K9/00—Medicinal preparations characterised by special physical form
- A61K9/10—Dispersions; Emulsions
- A61K9/127—Synthetic bilayered vehicles, e.g. liposomes or liposomes with cholesterol as the only non-phosphatidyl surfactant
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/88—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation using microencapsulation, e.g. using amphiphile liposome vesicle
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/0008—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
- A61K48/0025—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition wherein the non-active part clearly interacts with the delivered nucleic acid
- A61K48/0041—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition wherein the non-active part clearly interacts with the delivered nucleic acid the non-active part being polymeric
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
- A61K48/0066—Manipulation of the nucleic acid to modify its expression pattern, e.g. enhance its duration of expression, achieved by the presence of particular introns in the delivered nucleic acid
Definitions
- the present invention relates to artificial nucleic acids, in particular RNAs, encoding polypeptides or proteins of interest, especially CRISPR-associated polypeptides or CRISPR-associated proteins, and (pharmaceutical) compositions and kit-of-parts comprising the same.
- Said artificial nucleic acids, in particular RNAs, (pharmaceutical) composition and kits are for use in medicine, for instance in gene therapy, and in particular in the treatment and/or prophylaxis of disease amenable to treatment with CRISPR-associated proteins, e.g. by gene editing, knock-in, knock-out or modulating the expression of target genes of interest.
- mRNA-based drug technologies therapeutic mRNA has been applied in many fields, such as cancer immunotherapies, infectious disease vaccines, allergy tolerization, protein-replacement and supplementation therapies, genome engineering, gene therapy and genetic reprogramming etc., particularly in CRISPR-associated gene therapy.
- a prolonged presence of editing enzymes translated from DNA-based vectors resulted in off-target effect.
- the transient expression of the nucleases from mRNA would minimize thus nonspecific effect.
- Engineered mRNA encoding ZFNs, TALENs, Cas9 have been applied successfully to edit genomes by disrupting or integrating sequences ex vivo and in vivo. (Sahin, U., Karikó, K. &Türeci, mRNA-based therapeutics -developing a new class of drugs. Nat Rev Drug Discov 13, 759–780 (2014) . doi. org/10.1038/nrd4278) .
- the present invention solves the above-mentioned technical problems through the following technical solutions:
- the present invention provides an mRNA comprising an open reading frame encoding an RNA-guided DNA-binding agent, wherein the open reading frame comprises a sequence having at least 90%identity to any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
- the open reading frame comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to the sequence of any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
- the mRNA further comprises a 5’ UTR with at least 90%identity to any one of SEQ ID NOs: 110-171 or 1-62.
- the mRNA further comprises a 5’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 110-171 or 1-62.
- the mRNA further comprises a 3’ UTR with at least 90%identity to any one of SEQ ID NOs: 172-210 or 63-101.
- the mRNA further comprises a 3’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 172-210 or 63-101.
- the mRNA further comprises a poly-adenylated (poly-A) tail with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to SEQ ID NO: 243, 244, 245 or 246.
- poly-A poly-adenylated
- the present invention provides an mRNA comprising a sequence with at least 98%identity to any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- the mRNA comprises a sequence selected from any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- At least 10%of the uridine is substituted with a modified uridine, wherein the modified uridine is one or more of N1-methyl-pseudouridine, pseudouridine, 5-methoxyuridine, or 5-iodouridine.
- the present invention provides an expression construct comprising a promoter operably linked to a sequence encoding an mRNA mentioned above, wherein the expression construct is optionally a plasmid expression construct.
- the present invention provides an isolated host cell comprising the expression construct mentioned above.
- the present invention provides a composition comprising the mRNA mentioned above and at least one guide RNA.
- the present invention provides a lipid nanoparticle comprising the mRNA mentioned above.
- the present invention provides a pharmaceutical composition
- a pharmaceutical composition comprising the mRNA mentioned above and a pharmaceutically acceptable carrier.
- the present invention provides a method of genome editing or modifying a target gene comprising contacting a cell with the mRNA mentioned above.
- the present invention provides an artificial nucleic acid molecule comprising a. at least one coding region encoding at least one CRISPR-associated protein; b. at least one 5’ untranslated region (5’UTR) element; and c. at least one 3' untranslated region (3’ UTR) element, wherein said coding region comprises a sequence having at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232 and 247.
- said coding region further comprises at least one nuclear localization signal.
- said artificial nucleic acid molecule is an RNA.
- the RNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- said artificial nucleic acid molecule is a DNA.
- the DNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 217-224, and 249-251.
- the present invention provides a recombinant expression vector comprising the artificial nucleic acid molecule mentioned above.
- the artificial nucleic acid molecule is operably linked to a promoter.
- the promoter is an inducible promoter.
- the artificial nucleic acid molecule further comprising a multiple cloning site.
- the present invention provides an in vitro genetically modified host cell comprising the artificial nucleic acid according to any one of claims 18 to 23 or the recombinant expression vector mentioned above.
- the present invention provides a composition comprising the artificial nucleic acid molecule mentioned above or the recombinant expression vector mentioned above and a pharmaceutically acceptable carrier and/or excipient.
- the composition further comprises a guide RNA or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest.
- the guide RNA is a single guide (sgRNA) .
- the guide RNA is a dual guide (dgRNA) .
- the present invention provides a kit comprising the artificial nucleic acid molecule mentioned above or the recombinant expression vector mentioned above, and optionally a liquid vehicle and/or optionally technical instructions with information on the administration and dosage of the artificial nucleic acid molecule or the composition.
- the kit further comprises a guide RNA (gRNA) or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest, or a regulatory element operably linked thereto.
- gRNA guide RNA
- the guide RNA is a single guide (sgRNA) .
- the guide RNA is a dual guide (dgRNA) .
- the present invention provides a method of inducing a double stranded break (DSB) within a gene of interest, and/or modifying the gene of interest, comprising delivering the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above to a cell.
- DSB double stranded break
- the present invention provides a method of treating, preventing or diagnosing diseases associated with a gene of interest comprising administering the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above to a subject in need thereof.
- the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above in the preparation of medicament.
- the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for gene therapy.
- the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
- DSB double-stranded break
- the present invention provides the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for use in inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
- DSB double-stranded break
- the present invention provides the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for use in treating, preventing or diagnosing diseases associated with a gene of interest in a subject.
- the present invention provides an artificial nucleic acid molecule comprising
- 5’ untranslated region comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 110-171 or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 1-62;
- 3’ untranslated region (3’ UTR) element comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 172-210; or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 63-101.
- the 5’ untranslated region (5’UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 110-171, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 1-62.
- the 3’ untranslated region (3’UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 172-210, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 63-101.
- Figure 1 shows percent editing following transfection of PHH cells with lipoplex comprising SpCas9 sgRNA (sg5815-3, SEQ ID NO: 242) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/CAS001D, SEQ ID NO: 236/CAS001F, SEQ ID NO: 238) .
- Figure 2 shows percent editing following transfection of PHH cells with lipoplex comprising SpCas9 sgRNA (sg5815-3, SEQ ID NO: 242) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/CAS001C, SEQ ID NO: 235/CAS001F, SEQ ID NO: 238) .
- Figure 3 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
- LNP LNP
- SpCas9 sgRNA sg5815-1, SEQ ID NO: 241
- SpCas9 mRNA m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
- Figure 4 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
- LNP LNP
- SpCas9 sgRNA sg5815-1, SEQ ID NO: 241
- SpCas9 mRNA m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
- Figure 5 shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A) and 5’UTR and 3’UTR combinations (B) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 12. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- A 5’UTR
- B 3’UTR combinations
- Figure 6A shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 14. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- A&B 5’UTR
- C 3’UTR combinations
- Figure 6B shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 15. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- A&B 5’UTR
- C 3’UTR combinations
- Figure 6C shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 16. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- A&B 5’UTR
- C 3’UTR combinations
- Figure 7A shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- Figure 7B shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- Figure 7C shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
- SpCas9 CAS001F
- Figure 8A shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
- Figure 8B shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
- Figure 8C shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
- Figure 8D shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
- Figure 8E shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
- Figure 9A shows the editing efficiency of mRNA samples tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in primary human hepatocyte (PHH) cell.
- mRNA-790 and mRNA-784 were generated from the same IVT DNA template with poly (A) A60+A60, but different IVT reaction -A and -B.
- mRNA-768 and mRNA-791 were generated from the same IVT DNA template with poly (A) A95, but different IVT reaction -A and -B. All the mRNA samples tested contain CDS sequence of SpCas9-encoding CAS001F.
- sgRNA which is named as sg5815-3 (SEQ ID NO: 242) was used as the guide RNA in this experiment.
- Figure 9B shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in primary human hepatocyte (PHH) cell.
- mRNA-788 and mRNA-802 tailed with poly (A) A60+A60, while with different NLS included.
- mRNA-791 and mRNA-792 tailed with poly (A) A95 and they were generated from the same IVT DNA template. All the mRNA samples tested contain CDS sequence of SpCas9-encoding CAS001F.
- sgRNA which is named as sg5815-3 (SEQ ID NO: 242) was used as the guide RNA in this experiment.
- Figure 9C shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in CD-1 mice.
- mRNA used in this experiment are the same as used in Figure 9B.
- CD1-mice were administrated with two doses, 0.03 mpk (light gray bar) and 0.1 mpk (dark gray bar) , as displayed in the figure.
- sgRNA which is named as sg5815-1 (SEQ ID NO: 241) was used as the guide RNA in this experiment.
- Figure 10 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (CAS001F, SEQ ID NO: 238, mRNA-788/SEQ ID NO: 252) .
- LNP LNP
- SpCas9 sgRNA sg5815-1, SEQ ID NO: 241
- SpCas9 mRNA CAS001F, SEQ ID NO: 238, mRNA-788/SEQ ID NO: 252
- UTR refers to an "untranslated region" flanking the coding sequence of an artificial nucleic acid as defined herein.
- an "UTR element” comprises or consists of a nucleic acid sequence, which is derived from the (naturally occurring, wild-type) UTR of a particular gene, preferably as exemplified herein.
- nucleic acid sequences corresponding to the sequence of said UTR ( "parent UTR” ) or a homolog, variant or fragment of said UTR.
- the term includes sequences corresponding to the entire (full-length) wild-type sequence of said UTR, or a homolog, variant or fragment thereof, including full-length homologs and variants, as well as fragments of said full-length wild-type sequences, homologs and variants, and variants of said fragments.
- the term “corresponds to” means that the nucleic acid sequence derived from the "parent UTR” may be an RNA sequence (e.g. equal to the RNA sequence used for defining said parent UTR sequence) , or a DNA sequence (both sense and antisense strand and both mature and immature) , which corresponds to such RNA sequence.
- the expression “or a homolog, fragment or variant thereof” may refer to the gene, or the UTR, or both.
- homolog in the context of genes (or nucleic acid sequences derived therefrom or comprised by said gene, like a UTR) refers to a gene (or a nucleic acid sequence derived therefrom or comprised by said gene) related to a second gene (or such nucleic acid sequence) by descent from a common ancestral DNA sequence.
- homolog includes genes separated by the event of speciation ( “ortholog” ) and genes separated by the event of genetic duplication ( “paralog” ) .
- An artificial nucleic acid molecule may typically be understood to be a nucleic acid molecule, e.g. a DNA or an RNA, that does not occur naturally.
- an artificial nucleic acid molecule may be understood as a non-natural nucleic acid molecule.
- Such nucleic acid molecule may be non-natural due to its individual sequence (which does not occur naturally) and/or due to other modifications, e.g. structural modifications of nucleotides, which do not occur naturally.
- An artificial nucleic acid molecule may be a DNA molecule, an RNA molecule or a hybrid-molecule comprising DNA and RNA portions.
- artificial nucleic acid molecules may be designed and/or generated by genetic engineering methods to correspond to a desired artificial sequence of nucleotides (heterologous sequence) .
- an artificial sequence is usually a sequence that may not occur naturally, i.e. it differs from the wild type sequence by at least one nucleotide.
- wild type may be understood as a sequence occurring in nature.
- artificial nucleic acid molecule is not restricted to mean “one single molecule” but is, typically, understood to comprise an ensemble of identical molecules. Accordingly, it may relate to a plurality of identical molecules contained in an aliquot.
- Nucleic acid molecule is a molecule comprising, preferably consisting of nucleic acid components.
- the term nucleic acid molecule preferably refers to DNA or RNA molecules. It is preferably used synonymous with the term "polynucleotide” .
- a nucleic acid molecule is a polymer comprising or consisting of nucleotide monomers, which are covalently linked to each other by phosphodiester-bonds of a sugar/phosphate-backbone.
- the term "nucleic acid molecule” also encompasses modified nucleic acid molecules, such as base-modified, sugar-modified or backbone-modified etc. DNA or RNA molecules.
- DNA is the usual abbreviation for deoxy-ribonucleic acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually deoxy-adenosine-monophosphate, deoxy-thymidine-monophosphate, deoxy-guanosine-monophosphate and deoxy-cytidine-monophosphate monomers which are-by themselves-composed of a sugar moiety (deoxyribose) , a base moiety and a phosphate moiety, and polymerise by a characteristic backbone structure.
- the backbone structure is, typically, formed by phosphodiester bonds between the sugar moiety of the nucleotide, i.e. deoxyribose, of a first and a phosphate moiety of a second, adjacent monomer.
- the specific order of the monomers i.e. the order of the bases linked to the sugar/phosphate-backbone, is called the DNA sequence.
- DNA may be single stranded or double stranded. In the double stranded form, the nucleotides of the first strand typically hybridize with the nucleotides of the second strand, e.g. by A/T-base-pairing and G/C-base-pairing.
- RNA is the usual abbreviation for ribonucleic-acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually adenosine-monophosphate, uridine-monophosphate, guanosine-monophosphate and cytidine-monophosphate monomers which are connected to each other along a so-called backbone.
- the backbone is formed by phosphodiester bonds between the sugar, i.e. ribose, of a first and a phosphate moiety of a second, adjacent monomer. The specific succession of the monomers is called the RNA-sequence.
- RNA may be obtainable by transcription of a DNA-sequence, e.g., inside a cell.
- transcription is typically performed inside the nucleus or the mitochondria.
- transcription of DNA usually results in the so-called premature RNA which has to be processed into so-called messenger-RNA, usually abbreviated as mRNA.
- Processing of the premature RNA e.g. in eukaryotic organisms, comprises a variety of different posttranscriptional-modifications such as splicing, 5'-capping, polyadenylation, export from the nucleus or the mitochondria and the like. The sum of these processes is also called maturation of RNA.
- the mature messenger RNA usually provides the nucleotide sequence that may be translated into an amino-acid sequence of a particular peptide or protein.
- a mature mRNA comprises a 5'-cap, a 5'-UTR, an open reading frame, a 3'-UTR and a poly (A) sequence.
- Aside from messenger RNA several non-coding types of RNA exist which may be involved in regulation of transcription and/or translation.
- Heterologous sequence Two sequences are typically understood to be 'heterologous' if they are not derivable from the same gene. i.e., although heterologous sequences may be derivable from the same organism, they naturally (in nature) do not occur in the same nucleic acid molecule, such as in the same mRNA.
- Open reading frame in the context of the invention may typically be a sequence of several nucleotide triplets, which may be translated into a peptide or protein.
- An open reading frame preferably contains a start codon, i.e. a combination of three subsequent nucleotides coding usually for the amino acid methionine (ATG) , at its 5'-end and a subsequent region, which usually exhibits a length which is a multiple of 3 nucleotides.
- An ORF is preferably terminated by a stop-codon (e.g., TAA, TAG, TGA) . Typically, this is the only stop-codon of the open reading frame.
- an open reading frame in the context of the present invention is preferably a nucleotide sequence, consisting of a number of nucleotides that may be divided by three, which starts with a start codon (e.g. ATG) and which preferably terminates with a stop codon (e.g., TAA, TGA, or TAG) .
- the open reading frame may be isolated or it may be incorporated in a longer nucleic acid sequence, for example in a vector or an mRNA.
- An open reading frame may also be termed " (protein) coding sequence” or, preferably, "coding sequence” .
- Codon refers to a sequence of three nucleotides that together form a unit of genetic code in a DNA or RNA molecule.
- a codon is operationally defined by the initial nucleotide from which translation starts and sets the frame for a run of successive nucleotide triplets, which is known as an "open reading frame” (ORF) .
- ORF open reading frame
- the string GGGAAACCC if read from the first position, contains the codons GGG, AAA, and CCC; if read from the second position, it contains the codons GGA and AAC; and if read from the third position, GAA and ACC.
- every nucleic sequence read in its 5' ⁇ 3' direction comprises three reading frames, each producing a possibly distinct amino acid sequence (in the given example, Gly-Lys-Pro, Gly-Asn, or Glu-Thr, respectively) .
- DNA is double-stranded defining six possible reading frames, three in the forward orientation on one strand and three reverse on the opposite strand.
- Open reading frames encoding polypeptides are typically defined by a start codon, usually the first AUG codon in the sequence.
- initiation codon refers to the first codon of an open reading frame that is translated by the ribosome and is comprised of a triplet of linked adenine-uracil-guanine nucleobases.
- the initiation codon is depicted by the first letter codes of adenine (A) , uracil (U) , and guanine (G) and is often written simply as "AUG” .
- initiation codons may use codons other than AUG as the initiation codon, which are referred to herein as "alternative initiation codons"
- the initiation codons of polynucleotides described herein use the AUG codon.
- the sequence comprising the initiation codon is recognized via complementary base-pairing to the anticodon of an initiator tRNA (Met-tRNAi Met ) bound by the ribosome.
- Open reading frames may contain more than one AUG initiation codon, which are referred to herein as "alternate initiation codons” .
- the initiation codon plays a critical role in translation initiation.
- the initiation codon is the first codon of an open reading frame that is translated by the ribosome.
- the initiation codon comprises the nucleotide triplet AUG, however, in some instances translation initiation can occur at other codons comprised of distinct nucleotides.
- the initiation of translation in eukaryotes is a multistep biochemical process that involves numerous protein-protein, protein-RNA, and RNA-RNA interactions between messenger RNA molecules (mRNAs) , the 40S ribosomal subunit, other components of the translation machinery (e.g., eukaryotic initiation factors; eIFs) .
- mRNAs messenger RNA molecules
- eIFs eukaryotic initiation factors
- the current model of mRNA translation initiation postulates that the pre-initiation complex (alternatively "43S pre-initiation complex” ; abbreviated as "PIC” ) translocates from the site of recruitment on the mRNA (typically the 5' cap) to the initiation codon by scanning nucleotides in a 5' to 3' direction until the first AUG codon that resides within a specific translation-promotive nucleotide context (the Kozak sequence) is encountered (Kozak (1989) J Cell Biol 108: 229-241) .
- PIC pre-initiation complex
- Sequence identity Two or more sequences are identical if they exhibit the same length and order of nucleotides or amino acids.
- the percentage of identity typically describes the extent to which two sequences are identical, i.e. it typically describes the percentage of nucleotides that correspond in their sequence position with identical nucleotides of a reference-sequence.
- the sequences to be compared are typically considered to exhibit the same length, i.e. the length of the longest sequence of the sequences to be compared. This means that a first sequence consisting of 8 nucleotides is 80%identical to a second sequence consisting of 10 nucleotides comprising the first sequence.
- identity of sequences preferably relates to the percentage of nucleotides or amino acids of a sequence which have the same position in two or more sequences having the same length.
- the "%identity" of two amino acid sequences or two nucleic acid sequences may be determined by aligning the sequences for optimal comparison purposes (e.g., gaps can be introduced in either sequence for best alignment with the other sequence) and comparing the amino acids or nucleotides at corresponding positions. Gaps are usually regarded as non-identical positions, irrespective of their actual position in an alignment. The " best alignment” is typically an alignment of two sequences that results in the highest percent identity.
- the determination of percent identity between two sequences can be accomplished using a mathematical algorithm known to those of skill in the art.
- variants in the context of nucleic acid sequences of genes refers to nucleic acid sequence variants, i.e. nucleic acid sequences or genes comprising a nucleic acid sequence that differs in at least one nucleic acid from a reference (or “parent” ) nucleic acid sequence of a reference (or “parent” ) nucleic acid or gene.
- Variant nucleic acids or genes may thus preferably comprise, in their nucleic acid sequence, at least one mutation, substitution, insertion or deletion as compared to their respective reference sequence.
- the term “variant” as used herein includes naturally occurring variants, and engineered variants of nucleic acid sequences or genes.
- a "variant" as defined herein can be derived from, isolated from, related to, based on or homologous to the reference nucleic acid sequence/Variants” may preferably have a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90%and most preferably of at least 95%or even 97%, to a nucleic acid sequence of the respective naturally occurring (wild-type) nucleic acid sequence or gene, or a homolog, fragment or derivative thereof.
- fragment in the context of nucleic acid sequences or genes refers to a continuous subsequence of the full-length reference (or “parent” ) nucleic acid sequence or gene.
- a “fragment” may typically be a shorter portion of a full-length nucleic acid sequence or gene.
- a fragment typically, consists of a sequence that is identical to the corresponding stretch within the full-length nucleic acid sequence or gene.
- the term includes naturally occurring fragments as well as engineered fragments.
- a preferred fragment of a sequence in the context of the present invention consists of a continuous stretch of nucleic acids corresponding to a continuous stretch of entities in the nucleic acid or gene the fragment is derived from, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, and most preferably at least 80%of the total (i.e. full-length) nucleic acid sequence or gene from which the fragment is derived.
- a sequence identity indicated with respect to such a fragment preferably refers to the entire nucleic acid sequence or gene.
- a "fragment” may comprise a nucleic acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90%and most preferably of at least 95%or even 97%, to a reference nucleic acid sequence or gene that it is derived from.
- the Cas9 mRNA and sgRNA were encapsulated within lipid nanoparticles (LNP) using a Precision MPF-L2 TM AITESEN.
- the lipid nanoparticle components were dissolved in 100%ethanol with the lipid component (LP01, DSPC, cholesterol and DMG-PEG2000 in molar ratio of 50: 9: 38: 3 or ALC-0315, DSPC, cholesterol and ALC-0159 in molar ratio of 46.3: 42.7: 9.4: 1.6) .
- the RNA cargos were dissolved in 25 mM citrate, 100 mM NaCl, pH 5.0, resulting in a concentration of RNA cargo of approximately 0.45 mg/mL.
- the LNPs were formulated with a lipid amine to RNA phosphate (N: P) molar ratio of about 6, with the ratio of mRNA to gRNA at 2: 1 by weight.
- RNA in aqueous buffer conditions is mixed with ethanolic lipid mix, respectively in the volume ratio of 3 parts of RNA and 1 part of lipid mix.
- the LNPs were collected, and then remaining buffer was exchanged into PH7.5 TSS (contain 5%w/v (g/L) sucrose, 45 mM NaCl, 50 mM Tris , 100-fold excess of sample volume) , overnight at 4°C under gentle stirring using a 10 kDa dialysis bag.
- the resulting mixture was then filtered using a 0.2 um sterile filter.
- the resulting filtrate was stored at 2-8 °C.
- Lp01 is (9Z, 12Z) -3- ( (4, 4-Bis (octyloxy) butanoyl) oxy) -2- ( ( ( (3- (diethylamino) propoxy) carbonyl) oxy) methyl) propyl octadeca-9, 12-dienoate and Lp01 has the structure shown in the below:
- ALC-0315 is 6- ( (2-hexyldecanoyl) oxy) -N- (6- ( (2-hexyldecanoyl) oxy) hexyl) -N- (4-hydroxybutyl) hexan-1-aminium and ALC-0315 has the structure shown in the below:
- Dynamic Light Scattering is used to determine the average particle size and polydispersity index ( “PDI” ) and of LNP samples.
- Average particle size and polydispersity are measured by dynamic light scattering (DLS) using a Malvern Zetasizer DLS instrument. Dilute LNP samples in 1x PBS at volumetric ratios of 1: 99 prior to testing. Measure the mean hydrodynamic diameter of each sample by reporting average particle size along with PDI. Zeta potential of the LNP is also measured by Malvern Zetasizer. Samples are diluted at volumetric ratios of 1: 19 in 10mM NaCl prior to measurement.
- a fluorescence-based assay (ThermoFisher Scientific) is used to determine the encapsulation efficiency, which is calculated as (Total RNA -Free RNA) /Total RNA.
- LNP samples are diluted with1x TE buffer containing 1%Triton-X 100 to a proper concentration to determine total RNA or 1x TE buffer to determine free RNA.
- Standard curves are prepared by utilizing the starting RNA solution used to make the formulations according to the manufacturer's instructions) .
- stain is then added to each of the standards and samples and allowed stain to incubate for approximately 5 minutes at room temperature, in the absence of light.
- a Tecan INFINITE 200 PRO is used to read the samples with excitation and emission wavelength at 480nm, 520nm respectively.
- Total RNA and free RNA are calculated from the appropriate standard curves after subtracting the fluorescence value of the reagent blank from that of each of the samples.
- encapsulation was >80%, particle size was ⁇ 120 nm, and pdi was ⁇ 0.2
- PSH Primary human liver hepatocytes (PHH) cells were thawed and resuspended in hepatocyte thawing medium with supplements (Lonza, Cat. MCHT50) followed by centrifugation at 100 g for 10 minutes. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium (Lonza, Cat. MP100) plus 10%fetal bovine serum. Cells were counted and plated on Ultra Low Adsorption Cell Culture 96-well plates (Liver Biotech, Cat. LV-ULA002-96W) at a density of 40,000 cells/well. Plated cells were allowed to settle and adhere for 24 hours in a tissue culture incubator at 37°C and 5%CO 2 atmosphere.
- Figure 1 and Figure 2 shows the editing efficiency in PHH cells for SpCas9 mRNA indicated m5815-1, SEQ ID NO: 233/CAS001D (327) , SEQ ID NO: 236/CAS001F (331) , SEQ ID NO: 238. The result indicated the highest editing efficiency for SpCas9 mRNA CAS001F.
- Example 3 LNP delivery to mice having WT (murine) TTR.
- mice Selected guide RNA, mRNA and LNP formulation were further tested in mice.
- LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO 2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1.
- liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) . All DNA samples were prepared for Sanger-seq and NGS analysis as described in Example 2.
- mRNA was IVT synthesized from a linear plasmid DNA template.
- the plasmid template was constructed with pVAX1 as backbone.
- T7 promoter was fused to a 5’ UTR, kozak sequence, SpCas9 CDS or SaCas9 CDS or Firefly luciferase reporter gene, a 3’ UTR and poly (A) of less than A100 or separated poly (A) of A60+6dNTP+A60. Plasmid construction was performed by either Gibson assembly or Golden gate assembly.
- antibiotic ampicillin 100 ⁇ g/mL, or kanamycin 100 ⁇ g/mL
- Plasmid was isolated using commercial plasmid prep kit ( (TIANGEN endo-free plasmid midi prep kit (DP108) or (Macherey-Nagel) NucleoBond Xtra Midi EF, Midi kit for endotoxin-free plasmid DNA (740420.10) ) .
- Plasmid template was linearized with BsaI or BspQI or Esp3I or NdeI (NEB) overnight at 37°Caccording to manufacturer’s instruction.
- Linearized plasmid DNA template was treated with proteinase K (100–200 ⁇ g/mL) and 0.5%SDS for 30 min at 50°C to remove potential inhibitors of transcription introduced during plasmid prep, such as RNases. The reaction was followed with incubation at 65°C for 10 min to heat inactivate the enzyme .
- the RE digestion reaction is purified with phenol/chloroform extraction (using an equal volume) and ethanol precipitation.
- IVT reagents and GAG cap from Hongene.
- the 20 ⁇ L IVT reaction contains ATP (10 mM) , UTP (10 mM) , GTP (10 mM) , VTP (10 mM) , GAG or GAG 3’ oMe cap (10 mM) , DNA template (1 ⁇ g) , 5X Reaction buffer (1X) , and T7 enzyme mix (1.5 ⁇ L) .
- the reaction was assembled at room temperature, mixed well by gently flicking the tube and incubated at 37°C for 4 h.
- ARCA cap instead of GAG cap was added to the IVT reaction at final concentration of 8 mM, and GTP was used at final concentration 2 mM. All other components kept the same as in co-transcriptional capping with GAG cap.
- the IVT reaction was treated with DNase I (2 U for 20 ⁇ L reaction) and incubated at 37°C for 15 min.
- mRNA product was purified through lithium chloride precipitation. Briefly,
- RNA concentration Carefully remove the 70%ethanol, and resuspend the RNA in a solution or appropriate for your application. Determine the RNA concentration and store frozen at –20°C or –70°C.
- the final mRNA product was quantified by either fluorescent dye (Qubit) or UV light absorbance (Nanodrop) .
- the purity of mRNA product was checked on both agarose gel and by capillary electrophoresis using 5300 fragment analyzer (Agilent) .
- HepG2 cells were cultured in Advanced Dulbecco’s Modified Eagle’s Medium (Advanced DMEM, Gibco) containing 2 mM L-glutamine (GlutaMAXTM-l, Gibco) , supplemented with 10%fetal bovine serum (FETAL BOVINE SERUM, GEMINI) at 37 °C in 5%CO 2 -buffered incubators.
- Advanced DMEM Modified Eagle’s Medium
- GlutaMAXTM-l 2 mM L-glutamine
- FETAL BOVINE SERUM 10%fetal bovine serum
- RNAiMAX Reagent as per the manufacturer’s instructions (Life Technologies) . After 72 hours of incubation, the cells were harvest and gDNA was isolated from the cells. The editing efficiency at the target was analyzed via NGS or Sanger seq with the PCR product at the target site.
- HEK293T cells were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM, CORNING) containing 2 mM L-glutamine (GlutaMAXTM-l, Gibco) , supplemented with 10%fetal bovine serum (FETAL BOVINE SERUM, GEMINI) at 37°C in 5%CO 2 -buffered incubators.
- DMEM Modified Eagle’s Medium
- GlutaMAXTM-l 2 mM L-glutamine
- FETAL BOVINE SERUM 10%fetal bovine serum
- the cells in each well were treated with Bright-Lumi TM Firefly luciferase Reporter Gene Assay kit (Beyotime Technology, China) according to manufacturer’s introduction. Briefly, 1/1 volume of the Luciferase assay reagent was added into 100 ⁇ L cell culture in the well and mix well. After 2 min reaction at room temperature (around 25°C) , the luminescence signal was detected using a multi-mode microplate reader (INFINITE 200 PRO, TECAN) .
- a multi-mode microplate reader IFINITE 200 PRO, TECAN
- the cap utilized in the mRNA sequences named GEB_mRNA_p318, GEB_mRNA_p917, and GEB_mRNA_p676 was sourced from SyngeneBio.
- the product name is CAP 5 m7G (5') vppp (5') (2'OMeA) pG, with the item number CAP5011.
- the cap has the following structure:
- PSH Primary human liver hepatocytes (PHH) cells were thawed and resuspended in hepatocyte thawing medium with supplements (Lonza, Cat. MCHT50) followed by centrifugation at 100 g for 10 minutes. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium (Lonza, Cat. MP100) plus 10%fetal bovine serum. Cells were counted and plated on Ultra Low Adsorption Cell Culture 96-well plates (Liver Biotech, Cat. LV-ULA002-96W) at a density of 40,000 cells/well. Plated cells were allowed to settle and adhere for 24 hours in a tissue culture incubator at 37°C and 5%CO 2 atmosphere.
- hepatocyte culture medium (Lonza, Cat. CC-3198) plus 10%fetal bovine serum.
- FIG. 9A and Figure 9B shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in PHH cells.
- mice Selected guide RNA, mRNA and LNP formulation were further tested in mice.
- LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO 2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1. For studies involving in vivo editing, liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) .
- Figure 9C shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in CD-1 mice.
- mice Selected guide RNA, mRNA and LNP formulation were further tested in mice.
- LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO 2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1. For studies involving in vivo editing, liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) .
- Figure 10 shows the editing efficiency of mRNA with different UTRs, in CD-1 mice.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Dispersion Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
Provides are artificial nucleic acids, in particular RNA. A composition and kit of parts comprising the same are also provided.
Description
The present invention claims the priorities of the PCT/CN2023/116171, filed on August 31, 2023, and the PCT/CN2024/074906 filed on January 31, 2024, the contents of which are incorporated herein by their entirety.
The present invention relates to artificial nucleic acids, in particular RNAs, encoding polypeptides or proteins of interest, especially CRISPR-associated polypeptides or CRISPR-associated proteins, and (pharmaceutical) compositions and kit-of-parts comprising the same. Said artificial nucleic acids, in particular RNAs, (pharmaceutical) composition and kits are for use in medicine, for instance in gene therapy, and in particular in the treatment and/or prophylaxis of disease amenable to treatment with CRISPR-associated proteins, e.g. by gene editing, knock-in, knock-out or modulating the expression of target genes of interest.
With the development of mRNA-based drug technologies, therapeutic mRNA has been applied in many fields, such as cancer immunotherapies, infectious disease vaccines, allergy tolerization, protein-replacement and supplementation therapies, genome engineering, gene therapy and genetic reprogramming etc., particularly in CRISPR-associated gene therapy. In the field of gene therapy, a prolonged presence of editing enzymes translated from DNA-based vectors resulted in off-target effect. However, the transient expression of the nucleases from mRNA would minimize thus nonspecific effect. Engineered mRNA encoding ZFNs, TALENs, Cas9 have been applied successfully to edit genomes by disrupting or integrating sequences ex vivo and in vivo. (Sahin, U., Karikó, K. &Türeci, mRNA-based therapeutics -developing a new class of drugs. Nat Rev Drug Discov 13, 759–780 (2014) . doi. org/10.1038/nrd4278) .
However, unresolved issues such as the control and regulation of mRNA translation need to be addressed. Complying with these needs is the object of the present invention.
The present invention solves the above-mentioned technical problems through the following technical solutions:
In one aspect, the present invention provides an mRNA comprising an open reading frame encoding an RNA-guided DNA-binding agent, wherein the open reading frame comprises a sequence having at least 90%identity to any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
In some embodiments, the open reading frame comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to the sequence of any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
In some embodiments, the mRNA further comprises a 5’ UTR with at least 90%identity to any one of SEQ ID NOs: 110-171 or 1-62.
In some embodiments, the mRNA further comprises a 5’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 110-171 or 1-62.
In some embodiments, the mRNA further comprises a 3’ UTR with at least 90%identity to any one of SEQ ID NOs: 172-210 or 63-101.
In some embodiments, the mRNA further comprises a 3’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 172-210 or 63-101.
In some embodiments, the mRNA further comprises a poly-adenylated (poly-A) tail with at least 90%identity to SEQ ID NO: 243, 244, 245 or 246.
In some embodiments, the mRNA further comprises a poly-adenylated (poly-A) tail with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to SEQ ID NO: 243, 244, 245 or 246.
In another aspect, the present invention provides an mRNA comprising a sequence with at least 98%identity to any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
In some embodiments, the mRNA comprises a sequence selected from any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
In some embodiments, at least 10%of the uridine is substituted with a modified uridine, wherein the modified uridine is one or more of N1-methyl-pseudouridine, pseudouridine, 5-methoxyuridine, or 5-iodouridine.
In yet another aspect, the present invention provides an expression construct comprising a promoter operably linked to a sequence encoding an mRNA mentioned above, wherein the expression construct is optionally a plasmid expression construct.
In still yet another aspect, the present invention provides an isolated host cell comprising the expression construct mentioned above.
In one aspect, the present invention provides a composition comprising the mRNA mentioned above and at least one guide RNA.
In another aspect, the present invention provides a lipid nanoparticle comprising the mRNA mentioned above.
In yet another aspect, the present invention provides a pharmaceutical composition comprising the mRNA mentioned above and a pharmaceutically acceptable carrier.
In still yet another aspect, the present invention provides a method of genome editing or modifying a target gene comprising contacting a cell with the mRNA mentioned above.
In one aspect, the present invention provides an artificial nucleic acid molecule comprising a. at least one coding region encoding at least one CRISPR-associated protein; b. at least one 5’ untranslated region (5’UTR) element; and c. at least one 3' untranslated region (3’ UTR) element, wherein said coding region comprises a sequence having at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232 and 247.
In some embodiments, said coding region further comprises at least one nuclear localization signal.
In some embodiments, said artificial nucleic acid molecule is an RNA.
In some embodiments, the RNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
In some embodiments, said artificial nucleic acid molecule is a DNA.
In some embodiments, the DNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 217-224, and 249-251.
In another aspect, the present invention provides a recombinant expression vector comprising the artificial nucleic acid molecule mentioned above.
In some embodiments, the artificial nucleic acid molecule is operably linked to a promoter.
In some embodiments, the promoter is an inducible promoter.
In some embodiments, the artificial nucleic acid molecule further comprising a multiple cloning site.
In yet another aspect, the present invention provides an in vitro genetically modified host cell comprising the artificial nucleic acid according to any one of claims 18 to 23 or the recombinant expression vector mentioned above.
In still yet another aspect, the present invention provides a composition comprising the artificial nucleic acid molecule mentioned above or the recombinant expression vector mentioned above and a pharmaceutically acceptable carrier and/or excipient.
In some embodiments, the composition further comprises a guide RNA or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest.
In some embodiments, the guide RNA is a single guide (sgRNA) .
In some embodiments, the guide RNA is a dual guide (dgRNA) .
In one aspect, the present invention provides a kit comprising the artificial nucleic acid molecule mentioned above or the recombinant expression vector mentioned above, and optionally a liquid vehicle and/or optionally technical instructions with information on the administration and dosage of the artificial nucleic acid molecule or the composition.
In some embodiments, the kit further comprises a guide RNA (gRNA) or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest, or a regulatory element operably linked thereto.
In some embodiments, the guide RNA is a single guide (sgRNA) .
In some embodiments, the guide RNA is a dual guide (dgRNA) .
In another aspect, the present invention provides a method of inducing a double stranded break (DSB) within a gene of interest, and/or modifying the gene of interest, comprising delivering the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above to a cell.
In yet another aspect, the present invention provides a method of treating, preventing or diagnosing diseases associated with a gene of interest comprising administering the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above to a subject in need thereof.
In still yet another aspect, the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above in the preparation of medicament.
In one aspect, the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for gene therapy.
In another aspect, the present invention provides use of the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
In yet another aspect, the present invention provides the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or the kit mentioned above for use in inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
In still yet another aspect, the present invention provides the artificial nucleic acid molecule mentioned above, the recombinant expression vector mentioned above, the composition mentioned above, or
the kit mentioned above for use in treating, preventing or diagnosing diseases associated with a gene of interest in a subject.
In one aspect, the present invention provides an artificial nucleic acid molecule comprising
i. at least one coding region encoding at least one peptide of interest;
ii. at least one 5’ untranslated region (5’UTR) element comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 110-171 or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 1-62;
iii. at least one 3’ untranslated region (3’ UTR) element comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 172-210; or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 63-101.
In some embodiments, the 5’ untranslated region (5’UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 110-171, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 1-62.
In some embodiments, the 3’ untranslated region (3’UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 172-210, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 63-101.
Sequence Listing-1
Sequence Listing-2
Sequence Listing-3
Table 1. Detailed description of sequence listing 1-3
Figure 1 shows percent editing following transfection of PHH cells with lipoplex comprising SpCas9 sgRNA (sg5815-3, SEQ ID NO: 242) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/CAS001D, SEQ ID NO: 236/CAS001F, SEQ ID NO: 238) .
Figure 2 shows percent editing following transfection of PHH cells with lipoplex comprising SpCas9 sgRNA (sg5815-3, SEQ ID NO: 242) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/CAS001C, SEQ ID NO: 235/CAS001F, SEQ ID NO: 238) .
Figure 3 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
Figure 4 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238) .
Figure 5 shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A) and 5’UTR and 3’UTR combinations (B) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 12. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 2
Figure 6A shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to
right in the horizontal line are shown in the table 14. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 3
Figure 6B shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 15. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 4
Figure 6C shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR (A&B) and 5’UTR and 3’UTR combinations (C) in HepG2 cells, and the corresponding SEQ ID NOs of the names from left to right in the horizontal line are shown in the table 16. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 5
Figure 7A shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 6
Figure 7B shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 7
Figure 7C shows the activity of SpCas9 (CAS001F) mRNA with different 5’UTR and 3’UTR combinations in HepG2 cells. And the different SpCas9 mRNAs are combined with the sgRNA of SEQ ID NO: 109 respectively to form the different cargos.
Table 8
Figure 8A shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
Table 9
Figure 8B shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
Table 10
Figure 8C shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
Table 11
Figure 8D shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
Table 12
Figure 8E shows the activity of Fluc mRNA with different 5’UTRs (A, B, C&D) and 3’UTRs (E) in HEK293T cells.
Table 13
Figure 9A shows the editing efficiency of mRNA samples tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in primary human hepatocyte (PHH) cell. mRNA-790 and mRNA-784 were generated from the same IVT DNA template with poly (A) A60+A60, but different IVT reaction -A and -B. mRNA-768 and mRNA-791 were generated from the same IVT DNA template with poly (A) A95, but different IVT reaction -A and -B. All the mRNA samples tested contain CDS sequence of SpCas9-encoding CAS001F. sgRNA which is named as sg5815-3 (SEQ ID NO: 242) was used as the guide RNA in this experiment.
Table 14
Figure 9B shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in primary human hepatocyte (PHH) cell. mRNA-788 and mRNA-802 tailed with poly (A) A60+A60, while with different NLS included. mRNA-791 and mRNA-792 tailed with poly (A) A95 and they were generated from the same IVT DNA template. All the mRNA samples tested contain CDS sequence of SpCas9-encoding CAS001F. sgRNA which is named as sg5815-3 (SEQ ID NO: 242) was used as the guide RNA in this experiment.
Table 15
Figure 9C shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in CD-1 mice. mRNA used in this experiment are the same as used in Figure 9B. CD1-mice were administrated with two doses, 0.03 mpk (light gray bar) and 0.1 mpk (dark gray bar) , as displayed in the figure. sgRNA which is named as sg5815-1 (SEQ ID NO: 241) was used as the guide RNA in this experiment.
Table 16
Figure 10 shows percent editing following transfection of WT mice with LNP (ALC-0315) comprising SpCas9 sgRNA (sg5815-1, SEQ ID NO: 241) and SpCas9 mRNA (CAS001F, SEQ ID NO: 238, mRNA-788/SEQ ID NO: 252) .
Definitions
The term "UTR" refers to an "untranslated region" flanking the coding sequence of an artificial nucleic acid as defined herein. In this context, an "UTR element" comprises or consists of a nucleic acid sequence, which is derived from the (naturally occurring, wild-type) UTR of a particular gene, preferably as exemplified herein.
When referring to UTR elements “derived from” a particular UTR, reference is made to nucleic acid sequences corresponding to the sequence of said UTR ( "parent UTR" ) or a homolog, variant or fragment of said UTR. The term includes sequences corresponding to the entire (full-length) wild-type sequence of said UTR, or a homolog, variant or fragment thereof, including full-length homologs and variants, as well as fragments of said full-length wild-type sequences, homologs and variants, and variants of said fragments. The term "corresponds to" means that the nucleic acid sequence derived from the "parent UTR" may be an RNA sequence (e.g. equal to the RNA sequence used for defining said parent UTR sequence) , or a DNA sequence (both sense and antisense strand and both mature and immature) , which corresponds to such RNA sequence.
When referring to an UTR element derived from an UTR of a gene, “or a homolog, fragment or variant thereof” , the expression “or a homolog, fragment or variant thereof” may refer to the gene, or the UTR, or both.
The term "homolog" in the context of genes (or nucleic acid sequences derived therefrom or comprised by said gene, like a UTR) refers to a gene (or a nucleic acid sequence derived therefrom or comprised by said gene) related to a second gene (or such nucleic acid sequence) by descent from a common ancestral DNA sequence. The term, "homolog" includes genes separated by the event of speciation ( "ortholog" ) and genes separated by the event of genetic duplication ( "paralog" ) .
Artificial nucleic acid molecule: An artificial nucleic acid molecule may typically be understood to be a nucleic acid molecule, e.g. a DNA or an RNA, that does not occur naturally. In other words, an artificial nucleic acid molecule may be understood as a non-natural nucleic acid molecule. Such nucleic acid molecule
may be non-natural due to its individual sequence (which does not occur naturally) and/or due to other modifications, e.g. structural modifications of nucleotides, which do not occur naturally. An artificial nucleic acid molecule may be a DNA molecule, an RNA molecule or a hybrid-molecule comprising DNA and RNA portions. Typically, artificial nucleic acid molecules may be designed and/or generated by genetic engineering methods to correspond to a desired artificial sequence of nucleotides (heterologous sequence) . In this context an artificial sequence is usually a sequence that may not occur naturally, i.e. it differs from the wild type sequence by at least one nucleotide. The term "wild type" may be understood as a sequence occurring in nature. Further, the term "artificial nucleic acid molecule" is not restricted to mean "one single molecule" but is, typically, understood to comprise an ensemble of identical molecules. Accordingly, it may relate to a plurality of identical molecules contained in an aliquot.
Nucleic acid molecule: A nucleic acid molecule is a molecule comprising, preferably consisting of nucleic acid components. The term nucleic acid molecule preferably refers to DNA or RNA molecules. It is preferably used synonymous with the term "polynucleotide" . Preferably, a nucleic acid molecule is a polymer comprising or consisting of nucleotide monomers, which are covalently linked to each other by phosphodiester-bonds of a sugar/phosphate-backbone. The term "nucleic acid molecule" also encompasses modified nucleic acid molecules, such as base-modified, sugar-modified or backbone-modified etc. DNA or RNA molecules.
DNA: DNA is the usual abbreviation for deoxy-ribonucleic acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually deoxy-adenosine-monophosphate, deoxy-thymidine-monophosphate, deoxy-guanosine-monophosphate and deoxy-cytidine-monophosphate monomers which are-by themselves-composed of a sugar moiety (deoxyribose) , a base moiety and a phosphate moiety, and polymerise by a characteristic backbone structure. The backbone structure is, typically, formed by phosphodiester bonds between the sugar moiety of the nucleotide, i.e. deoxyribose, of a first and a phosphate moiety of a second, adjacent monomer. The specific order of the monomers, i.e. the order of the bases linked to the sugar/phosphate-backbone, is called the DNA sequence. DNA may be single stranded or double stranded. In the double stranded form, the nucleotides of the first strand typically hybridize with the nucleotides of the second strand, e.g. by A/T-base-pairing and G/C-base-pairing.
RNA, mRNA: RNA is the usual abbreviation for ribonucleic-acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually adenosine-monophosphate, uridine-monophosphate, guanosine-monophosphate and cytidine-monophosphate monomers which are connected to each other along a so-called backbone. The backbone is formed by phosphodiester bonds between the sugar, i.e. ribose, of a first and a phosphate moiety of a second, adjacent monomer. The specific succession of the monomers is called the RNA-sequence. Usually RNA may be obtainable by transcription of a DNA-sequence, e.g., inside a cell. In eukaryotic cells, transcription is typically performed inside the nucleus or the
mitochondria. In vivo, transcription of DNA usually results in the so-called premature RNA which has to be processed into so-called messenger-RNA, usually abbreviated as mRNA. Processing of the premature RNA, e.g. in eukaryotic organisms, comprises a variety of different posttranscriptional-modifications such as splicing, 5'-capping, polyadenylation, export from the nucleus or the mitochondria and the like. The sum of these processes is also called maturation of RNA. The mature messenger RNA usually provides the nucleotide sequence that may be translated into an amino-acid sequence of a particular peptide or protein. Typically, a mature mRNA comprises a 5'-cap, a 5'-UTR, an open reading frame, a 3'-UTR and a poly (A) sequence. Aside from messenger RNA, several non-coding types of RNA exist which may be involved in regulation of transcription and/or translation.
Heterologous sequence: Two sequences are typically understood to be 'heterologous' if they are not derivable from the same gene. i.e., although heterologous sequences may be derivable from the same organism, they naturally (in nature) do not occur in the same nucleic acid molecule, such as in the same mRNA.
Open reading frame: An open reading frame (ORF) in the context of the invention may typically be a sequence of several nucleotide triplets, which may be translated into a peptide or protein. An open reading frame preferably contains a start codon, i.e. a combination of three subsequent nucleotides coding usually for the amino acid methionine (ATG) , at its 5'-end and a subsequent region, which usually exhibits a length which is a multiple of 3 nucleotides. An ORF is preferably terminated by a stop-codon (e.g., TAA, TAG, TGA) . Typically, this is the only stop-codon of the open reading frame. Thus, an open reading frame in the context of the present invention is preferably a nucleotide sequence, consisting of a number of nucleotides that may be divided by three, which starts with a start codon (e.g. ATG) and which preferably terminates with a stop codon (e.g., TAA, TGA, or TAG) . The open reading frame may be isolated or it may be incorporated in a longer nucleic acid sequence, for example in a vector or an mRNA. An open reading frame may also be termed " (protein) coding sequence" or, preferably, "coding sequence" .
Codon: As used herein, the term "codon" refers to a sequence of three nucleotides that together form a unit of genetic code in a DNA or RNA molecule. A codon is operationally defined by the initial nucleotide from which translation starts and sets the frame for a run of successive nucleotide triplets, which is known as an "open reading frame" (ORF) . For example, the string GGGAAACCC, if read from the first position, contains the codons GGG, AAA, and CCC; if read from the second position, it contains the codons GGA and AAC; and if read from the third position, GAA and ACC. Thus, every nucleic sequence read in its 5'→ 3' direction comprises three reading frames, each producing a possibly distinct amino acid sequence (in the given example, Gly-Lys-Pro, Gly-Asn, or Glu-Thr, respectively) . DNA is double-stranded defining six possible reading frames, three in the forward orientation on one strand and three reverse on the opposite strand. Open reading frames encoding polypeptides are typically defined by a start codon, usually the first AUG codon in the sequence.
Initiation Codon: As used herein, the term "initiation codon" , used interchangeably with the term "start codon" , refers to the first codon of an open reading frame that is translated by the ribosome and is comprised of a triplet of linked adenine-uracil-guanine nucleobases. The initiation codon is depicted by the first letter codes of adenine (A) , uracil (U) , and guanine (G) and is often written simply as "AUG" . Although natural mRNAs may use codons other than AUG as the initiation codon, which are referred to herein as "alternative initiation codons" , the initiation codons of polynucleotides described herein use the AUG codon. During the process of translation initiation, the sequence comprising the initiation codon is recognized via complementary base-pairing to the anticodon of an initiator tRNA (Met-tRNAiMet) bound by the ribosome. Open reading frames may contain more than one AUG initiation codon, which are referred to herein as "alternate initiation codons" .
The initiation codon plays a critical role in translation initiation. The initiation codon is the first codon of an open reading frame that is translated by the ribosome. Typically, the initiation codon comprises the nucleotide triplet AUG, however, in some instances translation initiation can occur at other codons comprised of distinct nucleotides. The initiation of translation in eukaryotes is a multistep biochemical process that involves numerous protein-protein, protein-RNA, and RNA-RNA interactions between messenger RNA molecules (mRNAs) , the 40S ribosomal subunit, other components of the translation machinery (e.g., eukaryotic initiation factors; eIFs) . The current model of mRNA translation initiation postulates that the pre-initiation complex (alternatively "43S pre-initiation complex" ; abbreviated as "PIC" ) translocates from the site of recruitment on the mRNA (typically the 5' cap) to the initiation codon by scanning nucleotides in a 5' to 3' direction until the first AUG codon that resides within a specific translation-promotive nucleotide context (the Kozak sequence) is encountered (Kozak (1989) J Cell Biol 108: 229-241) . Scanning by the PIC ends upon complementary base-pairing between nucleotides comprising the anticodon of the initiator Met-tRNAiMet transfer RNA and nucleotides comprising the initiation codon of the mRNA. Productive base-pairing between the AUG codon and the Met-tRNAiMet anticodon elicits a series of structural and biochemical events that culminate in the joining of the large 60S ribosomal subunit to the PIC to form an active ribosome that is competent for translation elongation.
Sequence identity: Two or more sequences are identical if they exhibit the same length and order of nucleotides or amino acids. The percentage of identity typically describes the extent to which two sequences are identical, i.e. it typically describes the percentage of nucleotides that correspond in their sequence position with identical nucleotides of a reference-sequence. For determination of the degree of identity ( "%identity) , the sequences to be compared are typically considered to exhibit the same length, i.e. the length of the longest sequence of the sequences to be compared. This means that a first sequence consisting of 8 nucleotides is 80%identical to a second sequence consisting of 10 nucleotides comprising the first sequence. In other words, in the context of the present invention, identity of sequences preferably relates to the percentage of nucleotides
or amino acids of a sequence which have the same position in two or more sequences having the same length. Specifically, the "%identity" of two amino acid sequences or two nucleic acid sequences may be determined by aligning the sequences for optimal comparison purposes (e.g., gaps can be introduced in either sequence for best alignment with the other sequence) and comparing the amino acids or nucleotides at corresponding positions. Gaps are usually regarded as non-identical positions, irrespective of their actual position in an alignment. The " best alignment" is typically an alignment of two sequences that results in the highest percent identity. The percent identity is determined by the number of identical nucleotides in the sequences being compared (i.e., %identity = #of identical positions/total #of positions x 100) . The determination of percent identity between two sequences can be accomplished using a mathematical algorithm known to those of skill in the art.
The term "variant" in the context of nucleic acid sequences of genes refers to nucleic acid sequence variants, i.e. nucleic acid sequences or genes comprising a nucleic acid sequence that differs in at least one nucleic acid from a reference (or "parent" ) nucleic acid sequence of a reference (or "parent" ) nucleic acid or gene. Variant nucleic acids or genes may thus preferably comprise, in their nucleic acid sequence, at least one mutation, substitution, insertion or deletion as compared to their respective reference sequence. Preferably, the term "variant" as used herein includes naturally occurring variants, and engineered variants of nucleic acid sequences or genes. Therefore, a "variant" as defined herein can be derived from, isolated from, related to, based on or homologous to the reference nucleic acid sequence/Variants" may preferably have a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90%and most preferably of at least 95%or even 97%, to a nucleic acid sequence of the respective naturally occurring (wild-type) nucleic acid sequence or gene, or a homolog, fragment or derivative thereof.
The term "fragment" in the context of nucleic acid sequences or genes refers to a continuous subsequence of the full-length reference (or "parent" ) nucleic acid sequence or gene. In other words, a "fragment" may typically be a shorter portion of a full-length nucleic acid sequence or gene. Accordingly, a fragment, typically, consists of a sequence that is identical to the corresponding stretch within the full-length nucleic acid sequence or gene. The term includes naturally occurring fragments as well as engineered fragments. A preferred fragment of a sequence in the context of the present invention, consists of a continuous stretch of nucleic acids corresponding to a continuous stretch of entities in the nucleic acid or gene the fragment is derived from, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, and most preferably at least 80%of the total (i.e. full-length) nucleic acid sequence or gene from which the fragment is derived. A sequence identity indicated with respect to such a fragment preferably refers to the entire nucleic
acid sequence or gene. Preferably, a "fragment" may comprise a nucleic acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90%and most preferably of at least 95%or even 97%, to a reference nucleic acid sequence or gene that it is derived from.
The following examples further illustrate the present disclosure, but the present disclosure is not limited thereto.
Below presents preferred embodiments of the present disclosure based on the drawings in order to illustrate the technical schemes of the present disclosure in detail.
Example 1. LNP formulation and analytics
The Cas9 mRNA and sgRNA were encapsulated within lipid nanoparticles (LNP) using a Precision MPF-L2TM AITESEN. In general, the lipid nanoparticle components were dissolved in 100%ethanol with the lipid component (LP01, DSPC, cholesterol and DMG-PEG2000 in molar ratio of 50: 9: 38: 3 or ALC-0315, DSPC, cholesterol and ALC-0159 in molar ratio of 46.3: 42.7: 9.4: 1.6) . The RNA cargos were dissolved in 25 mM citrate, 100 mM NaCl, pH 5.0, resulting in a concentration of RNA cargo of approximately 0.45 mg/mL. The LNPs were formulated with a lipid amine to RNA phosphate (N: P) molar ratio of about 6, with the ratio of mRNA to gRNA at 2: 1 by weight. RNA in aqueous buffer conditions is mixed with ethanolic lipid mix, respectively in the volume ratio of 3 parts of RNA and 1 part of lipid mix. After mixing, the LNPs were collected, and then remaining buffer was exchanged into PH7.5 TSS (contain 5%w/v (g/L) sucrose, 45 mM NaCl, 50 mM Tris , 100-fold excess of sample volume) , overnight at 4℃ under gentle stirring using a 10 kDa dialysis bag. The resulting mixture was then filtered using a 0.2 um sterile filter. The resulting filtrate was stored at 2-8 ℃.
Lp01 is (9Z, 12Z) -3- ( (4, 4-Bis (octyloxy) butanoyl) oxy) -2- ( ( ( (3- (diethylamino) propoxy) carbonyl) oxy) methyl) propyl octadeca-9, 12-dienoate and Lp01 has the structure shown in the below:
ALC-0315 is 6- ( (2-hexyldecanoyl) oxy) -N- (6- ( (2-hexyldecanoyl) oxy) hexyl) -N- (4-hydroxybutyl) hexan-1-aminium and ALC-0315 has the structure shown in the below:
Table 17 Prescription of LNP-01
Table 18 Prescription of LNP-0315
Dynamic Light Scattering ( "DLS" ) is used to determine the average particle size and polydispersity index ( "PDI" ) and of LNP samples. Average particle size and polydispersity are measured by dynamic light scattering (DLS) using a Malvern Zetasizer DLS instrument. Dilute LNP samples in 1x PBS at volumetric ratios of 1: 99 prior to testing. Measure the mean hydrodynamic diameter of each sample by reporting average particle size along with PDI. Zeta potential of the LNP is also measured by Malvern Zetasizer. Samples are diluted at volumetric ratios of 1: 19 in 10mM NaCl prior to measurement.
A fluorescence-based assay (ThermoFisher Scientific) is used to determine the encapsulation efficiency, which is calculated as (Total RNA -Free RNA) /Total RNA. LNP samples are diluted with1x TE buffer containing 1%Triton-X 100 to a proper concentration to determine total RNA or 1x TE buffer to determine free RNA. Standard curves are prepared by utilizing the starting RNA solution used to
make the formulations according to the manufacturer's instructions) . stain is then added to each of the standards and samples and allowed stain to incubate for approximately 5 minutes at room temperature, in the absence of light. A Tecan INFINITE 200 PRO is used to read the samples with excitation and emission wavelength at 480nm, 520nm respectively. Total RNA and free RNA are calculated from the appropriate standard curves after subtracting the fluorescence value of the reagent blank from that of each of the samples.
Typically, when preparing LNPs, encapsulation was >80%, particle size was <120 nm, and pdi was < 0.2
Example 2. SpCas9 mRNA sequence optimization
Primary human liver hepatocytes (PHH) cells were thawed and resuspended in hepatocyte thawing medium with supplements (Lonza, Cat. MCHT50) followed by centrifugation at 100 g for 10 minutes. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium (Lonza, Cat. MP100) plus 10%fetal bovine serum. Cells were counted and plated on Ultra Low Adsorption Cell Culture 96-well plates (Liver Biotech, Cat. LV-ULA002-96W) at a density of 40,000 cells/well. Plated cells were allowed to settle and adhere for 24 hours in a tissue culture incubator at 37℃ and 5%CO2 atmosphere. After incubation cells were checked for monolayer formation and media was replaced with hepatocyte culture medium (Lonza, Cat. CC-3198) plus 10%fetal bovine serum. Cells were sequentially transfected with a lipoplex containing SpCas9 mRNA/sgRNA (6.25/1.56 nM sgRNA sg5815-3, mRNA/sgRNA=2: 1 w/w) , Lipofectamine RNAiMax (0.3 μL/well, InvitrogenTM) and OptiMem per the manufacturer's protocol. The plate was placed at 37℃ and 5%CO2 and cells were harvested on day three post-transfection. Total genomic DNA were extracted separately using the QuickExtractTM DNA Extraction Soln 1.0 according to the manufacturer’s instructions. Next-generation sequencing (NGS) was used to measure the editing efficiency.
Figure 1 and Figure 2 shows the editing efficiency in PHH cells for SpCas9 mRNA indicated m5815-1, SEQ ID NO: 233/CAS001D (327) , SEQ ID NO: 236/CAS001F (331) , SEQ ID NO: 238. The result indicated the highest editing efficiency for SpCas9 mRNA CAS001F.
Example 3. LNP delivery to mice having WT (murine) TTR.
Selected guide RNA, mRNA and LNP formulation were further tested in mice. mice from, ranging 6-12 weeks of age, were used in each study involving mice. LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1.
For studies involving in vivo editing, liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) . All DNA samples were prepared for Sanger-seq and NGS analysis as described in Example 2.
The data shown in Figure 3 and Figure 4 are from WT mice (n=5 per group) administered LNP0315, containing SpCas9 mRNA m5815-1, SEQ ID NO: 233/m5815-9, SEQ ID NO: 240/CAS001F, SEQ ID NO: 238 and sgRNA sg5815-1 (SEQ ID NO: 241) at 0.3 MPK (total RNA content, mRNA/sgRNA=2: 1 w/w) and euthanized at 6 days post dose. Samples were processed as described above.
Example 4
Methods
In vitro transcription (IVT)
1. Preparation of template DNA
1.1 Plasmid construction
mRNA was IVT synthesized from a linear plasmid DNA template. The plasmid template was constructed with pVAX1 as backbone. In the plasmid, T7 promoter was fused to a 5’ UTR, kozak sequence, SpCas9 CDS or SaCas9 CDS or Firefly luciferase reporter gene, a 3’ UTR and poly (A) of less than A100 or separated poly (A) of A60+6dNTP+A60. Plasmid construction was performed by either Gibson assembly or Golden gate assembly.
1.2 Plasmid isolation
Plate the E. coli strains carrying the plasmid DNA template from a glycerol stock onto a fresh LB plate containing corresponding antibiotic (ampicillin 100 μg/mL, or kanamycin 100 μg/mL) and incubate at 37℃ overnight in an incubator.
Inoculate 20/200 mL LB liquid medium with a fresh single colony from the plate and cultivate at 37℃, 220 rpm overnight.
Plasmid was isolated using commercial plasmid prep kit ( (TIANGEN endo-free plasmid midi prep kit (DP108) or (Macherey-Nagel) NucleoBond Xtra Midi EF, Midi kit for endotoxin-free plasmid DNA (740420.10) ) .
1.3 Plasmid linearization
Plasmid template was linearized with BsaI or BspQI or Esp3I or NdeI (NEB) overnight at 37℃according to manufacturer’s instruction.
1.4 Proteinase K treatment
Linearized plasmid DNA template was treated with proteinase K (100–200 μg/mL) and 0.5%SDS for 30 min at 50℃ to remove potential inhibitors of transcription introduced during plasmid prep, such as RNases. The reaction was followed with incubation at 65℃ for 10 min to heat inactivate the enzyme .
1.5 Purification of linearized plasmid
Following up, the RE digestion reaction is purified with phenol/chloroform extraction (using an equal volume) and ethanol precipitation.
1) Add 1/10 volume of 3 M sodium acetate, mix well.
2) Add 1 volume of DNA extraction reagent (phenol: chloroform: isoamyl alcohol= 25: 24: 1) , mix well, sit at room temperature for 3 min.
3) Centrifuge at 12000 rpm for 15 min.
4) Carefully transfer the supernatant to a fresh EP tube.
5) Add an equal volume of isopropanol, precipitate at -20℃ for at least 30 min.
6) Centrifuge at 12000 rpm for 15 min to pellet the linear plasmid DNA. Carefully discard the supernatant.
7) Wash the DNA pellet with 1 mL 70%ethanol, centrifuge at 12000 rpm for 1 min.
8) Repeat the wash step.
9) Carefully remove any residual fluid with a fine-tipped pipette.
10) Air dry the pellet for around 10 min, or longer if needed.
11) Resuspend the pellet using 30-50 μL NF water (if 20 μg plasmid was digested) .
12) Check the concentration of the linear plasmid using Qubit or NanoDrop.
1.6 In vitro transcription (IVT)
Co-transcriptional capping was performed using IVT reagents and GAG cap from Hongene. The 20μL IVT reaction contains ATP (10 mM) , UTP (10 mM) , GTP (10 mM) , VTP (10 mM) , GAG or GAG 3’ oMe cap (10 mM) , DNA template (1 μg) , 5X Reaction buffer (1X) , and T7 enzyme mix (1.5 μL) . The reaction was assembled at room temperature, mixed well by gently flicking the tube and incubated at 37℃ for 4 h.
In the case of co-transcriptional capping with ARCA cap, ARCA cap instead of GAG cap was added to the IVT reaction at final concentration of 8 mM, and GTP was used at final concentration 2 mM. All other components kept the same as in co-transcriptional capping with GAG cap.
1.7 DNase digestion
After 4 h incubation at 37℃, the IVT reaction was treated with DNase I (2 U for 20 μL reaction) and incubated at 37℃ for 15 min.
1.8 mRNA purification
mRNA product was purified through lithium chloride precipitation. Briefly,
Stop the reaction (20+1 μL) and precipitate the RNA by adding 30 μL Nuclease-free Water and 30 μL LiCl Precipitation Solution (7.5 M lithium chloride, 50 mM EDTA, final concentration 2.778 M (2.5-2.8 M) ) .
Mix thoroughly. Chill for ≥30 min at –20℃.
Centrifuge at 4℃ for 15 min at maximum speed to pellet the RNA.
Carefully remove the supernatant. Wash the pellet once with ~1 mL pre-cooled 70%ethanol, and re-centrifuge to maximize the removal of unincorporated nucleotides.
Carefully remove the 70%ethanol, and resuspend the RNA in a solution orappropriate for your application. Determine the RNA concentration and store frozen at –20℃ or –70℃.
1.9 mRNA quantitation
The final mRNA product was quantified by either fluorescent dye (Qubit) or UV light absorbance (Nanodrop) . The purity of mRNA product was checked on both agarose gel and by capillary electrophoresis using 5300 fragment analyzer (Agilent) .
1.10 mRNA activity assay in HepG2
HepG2 cells were cultured in Advanced Dulbecco’s Modified Eagle’s Medium (Advanced DMEM, Gibco) containing 2 mM L-glutamine (GlutaMAXTM-l, Gibco) , supplemented with 10%fetal bovine serum (FETAL BOVINE SERUM, GEMINI) at 37 ℃ in 5%CO2-buffered incubators. For transfection of mRNAs, 2.0*105 HepG2 cells were seeded in each well of a 24-well plate. 12.5 nM or 3.125 nM or 1.56 nM gRNA and corresponding mRNA with cargo ratio 1: 1 (w/w) were transfected using Lipofectamine RNAiMAX Reagent as per the manufacturer’s instructions (Life Technologies) . After 72 hours of incubation, the cells were harvest and gDNA was isolated from the cells. The editing efficiency at the target was analyzed via NGS or Sanger seq with the PCR product at the target site.
1.11 Fluc mRNA activity assay in HEK293T
HEK293T cells were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM, CORNING) containing 2 mM L-glutamine (GlutaMAXTM-l, Gibco) , supplemented with 10%fetal bovine serum (FETAL BOVINE SERUM, GEMINI) at 37℃ in 5%CO2-buffered incubators. For transfection of Fluc mRNAs, 3.0*104 HEK293T cells were seeded in each well of a 96-well plate. 50 ng Fluc mRNA was transfected using Lipofectamine RNAiMAX Reagent as per the manufacturer’s instructions (Life Biotechnologies) . At different time points post-transfection, e.g., 4 h, 6 h, 9 h, 21 h and/or 24 h, the cells in each well were treated with Bright-LumiTM Firefly luciferase Reporter Gene Assay kit (Beyotime Technology, China) according to manufacturer’s introduction. Briefly, 1/1 volume of the Luciferase assay reagent was added into 100 μL cell culture in the well and mix well. After 2 min reaction at room temperature (around 25℃) , the luminescence signal was detected using a multi-mode microplate reader (INFINITE 200 PRO, TECAN) .
Example 5
The cap utilized in the mRNA sequences named GEB_mRNA_p318, GEB_mRNA_p917, and GEB_mRNA_p676 was sourced from SyngeneBio. The product name is CAP 5 m7G (5') vppp (5') (2'OMeA) pG, with the item number CAP5011. And the cap has the following structure:
Primary human liver hepatocytes (PHH) cells were thawed and resuspended in hepatocyte thawing medium with supplements (Lonza, Cat. MCHT50) followed by centrifugation at 100 g for 10 minutes. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium (Lonza, Cat. MP100) plus 10%fetal bovine serum. Cells were counted and plated on Ultra Low Adsorption Cell Culture 96-well plates (Liver Biotech, Cat. LV-ULA002-96W) at a density of 40,000 cells/well. Plated cells were allowed to settle and adhere for 24 hours in a tissue culture incubator at 37℃ and 5%CO2 atmosphere. After incubation cells were checked for monolayer formation and media was replaced with hepatocyte culture medium (Lonza, Cat. CC-3198) plus 10%fetal bovine serum. Cells were sequentially transfected with a lipoplex containing SpCas9 mRNA/sgRNA (0.12 nM sgRNA sg5815-3, mRNA/sgRNA=1: 1 w/w) , Lipofectamine RNAiMax (0.3 μL/well, InvitrogenTM) and OptiMem per the manufacturer's protocol. The plate was placed at 37℃ and 5%CO2 and cells were harvested on day three post-transfection. Total genomic DNA were extracted separately using the QuickExtractTM DNA Extraction Soln 1.0 according to the manufacturer’s instructions. Next-generation sequencing (NGS) was used to measure the editing efficiency. Figure 9A and Figure 9B shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95 respectively, in PHH cells.
Selected guide RNA, mRNA and LNP formulation were further tested in mice. mice from, ranging 6-12 weeks of age, were used in each study involving mice. LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1. For studies involving in vivo editing, liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) . All DNA samples were prepared for Sanger-seq and NGS analysis as described in Example 2. Figure 9C shows the editing efficiency of mRNA tailed with different poly (A) structures, segmented A60+A60 and A95
respectively, in CD-1 mice. The data shown in Figure 9C are from WT mice (n=5 per group) administered LNP0315, containing SpCas9 mRNA and sgRNA sg5815-1 (SEQ ID NO: 241) at 0.03 MPK (total RNA content, mRNA/sgRNA=2: 1 w/w) or 0.1 MPK (total RNA content, mRNA/sgRNA=2: 1 w/w) , and euthanized at 6 days post dose. Samples were processed as described above.
Example 6
Selected guide RNA, mRNA and LNP formulation were further tested in mice. mice from, ranging 6-12 weeks of age, were used in each study involving mice. LNPs were dosed via the lateral tail vein injection. The animals were observed post dose for adverse effects. Body weight was measured at twenty-four hours before-administration and animals euthanized post dose via exsanguination under isoflurane anesthesia or CO2 asphyxiation. Blood was collected from inner canthal veins into serum separator tubes. The LNPs were made as described in Example 1. For studies involving in vivo editing, liver tissue was collected from the left lateral lobe from each animal. Genomic DNA was isolated using Nucleic acid extraction Kit (Denogen, DNS033-48) . All DNA samples were prepared for Sanger-seq and NGS analysis as described in Example 2. Figure 10 shows the editing efficiency of mRNA with different UTRs, in CD-1 mice. The data shown in Figure 10 are from WT mice (n=5 per group) administered LNP0315, containing SpCas9 mRNA and sgRNA sg5815-1 (SEQ ID NO: 241) at 0.1 MPK (total RNA content, mRNA/sgRNA=2: 1 w/w) , and euthanized at 6 days post dose. Samples were processed as described above.
Claims (46)
- An mRNA comprising an open reading frame encoding an RNA-guided DNA-binding agent, wherein the open reading frame comprises a sequence having at least 90%identity to any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
- The mRNA of claim 1, wherein the open reading frame comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to the sequence of any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232, and 247.
- The mRNA of claim 1 or 2, wherein the mRNA further comprises a 5’ UTR with at least 90%identity to any one of SEQ ID NOs: 110-171 or 1-62.
- The mRNA of any one of the preceding claims, wherein the mRNA further comprises a 5’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 110-171 or 1-62.
- The mRNA of any one of the preceding claims, wherein the mRNA further comprises a 3’ UTR with at least 90%identity to any one of SEQ ID NOs: 172-210 or 63-101.
- The mRNA of any one of the preceding claims, wherein the mRNA further comprises a 3’ UTR with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to any one of SEQ ID NOs: 172-210 or 63-101.
- The mRNA of any one of the preceding claims, wherein the mRNA further comprises a poly-adenylated (poly-A) tail with at least 90%identity to SEQ ID NO: 243, 244, 245 or 246.
- The mRNA of any one of the preceding claims, wherein the mRNA further comprises a poly-adenylated (poly-A) tail with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to SEQ ID NO: 243, 244, 245 or 246.
- An mRNA comprising a sequence with at least 98%identity to any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- The mRNA of claim 9, wherein the mRNA comprises a sequence selected from any one of SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- The mRNA of any one of claims 1 to 10, wherein at least 10%of the uridine is substituted with a modified uridine, wherein the modified uridine is one or more of N1-methyl-pseudouridine, pseudouridine, 5-methoxyuridine, or 5-iodouridine.
- An expression construct comprising a promoter operably linked to a sequence encoding an mRNA according to any one of claims 1-11, wherein the expression construct is optionally a plasmid expression construct.
- An isolated host cell comprising the expression construct of claim 12.
- A composition comprising the mRNA according to any one of claims 1-11 and at least one guide RNA.
- A lipid nanoparticle comprising the mRNA according to any one of claims 1-11.
- A pharmaceutical composition comprising the mRNA according to any one of claims 1-13 and a pharmaceutically acceptable carrier.
- A method of genome editing or modifying a target gene comprising contacting a cell with the mRNA according to any one of claims 1-13;preferably, the method is in vitro or ex vivo, and/or, the method is not diagnostic, therapeutic, and surgical methods for the treatment of humans or animals.
- An artificial nucleic acid molecule comprising a. at least one coding region encoding at least one CRISPR-associated protein; b. at least one 5’ untranslated region (5’ UTR) element; and c. at least one 3' untranslated region (3’ UTR) element, wherein said coding region comprises a sequence having at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from any one of SEQ ID NOs: 212, 213, 215, 225, 226, 227, 228, 229, 230, 231, 232 and 247.
- The artificial nucleic acid molecule according to claim 18, wherein said coding region further comprises at least one nuclear localization signal.
- The artificial nucleic acid molecule according to claim 18 or 19, wherein said artificial nucleic acid molecule is an RNA.
- The artificial nucleic acid molecule according to claim 20, wherein the RNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 233, 234, 235, 236, 237, 238, 239, 240, 250, 252, and 254.
- The artificial nucleic acid molecule according to claim 18 or 19, wherein said artificial nucleic acid molecule is a DNA.
- The artificial nucleic acid molecule according to claim 22, wherein the DNA comprises a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%sequence identity to a nucleic acid sequence selected from SEQ ID NOs: 217-224, and 249-251.
- A recombinant expression vector comprising the artificial nucleic acid molecule according to any one of claims to 18 to 23.
- The recombinant expression vector according to claim 24, wherein the artificial nucleic acid molecule is operably linked to a promoter.
- The recombinant expression vector according to claim 25, wherein the promoter is an inducible promoter.
- The recombinant expression vector according to any one of the preceding claims, wherein the artificial nucleic acid molecule further comprising a multiple cloning site.
- An in vitro genetically modified host cell comprising the artificial nucleic acid according to any one of claims 18 to 23 or the recombinant expression vector according to any one of claims 24 to 27.
- A composition comprising the artificial nucleic acid molecule according to any one of claims 18 to 23 or the recombinant expression vector according to any one of claims 24 to 27 and a pharmaceutically acceptable carrier and/or excipient.
- The composition according to claim 29, wherein the composition further comprises a guide RNA or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest.
- The composition according to claim 30, wherein the guide RNA is a single guide (sgRNA) .
- The composition according to claim 30, wherein the guide RNA is a dual guide (dgRNA) .
- A kit comprising the artificial nucleic acid molecule according to any one of claims 18 to 23 or the recombinant expression vector according to any one of claims 24 to 27, and optionally a liquid vehicle and/or optionally technical instructions with information on the administration and dosage of the artificial nucleic acid molecule or the composition.
- The kit according to claim 33, further comprising a guide RNA (gRNA) or a nucleic acid encoding the same, said gRNA being capable of targeting the CRISPR-associated protein to a target DNA sequence of interest, or a regulatory element operably linked thereto.
- The kit according to claim 34, wherein the guide RNA is a single guide (sgRNA) .
- The kit according to claim 34, wherein the guide RNA is a dual guide (dgRNA) .
- A method of inducing a double stranded break (DSB) within a gene of interest, and/or modifying the gene of interest, comprising delivering the artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 to a cell.
- A method of treating, preventing or diagnosing diseases associated with a gene of interest comprising administering the artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 to a subject in need thereof.
- Use of the artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 in the preparation of medicament.
- Use of the artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 for gene therapy.
- Use of the artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 for inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
- The artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 for use in inducing a double-stranded break (DSB) within a gene of interest, modifying the gene of interest, and/or modulating the expression of a gene of interest in a cell or subject.
- The artificial nucleic acid molecule according to any one of claims 18-23, the recombinant expression vector according to any one of claims 24 to 27, the composition according to any one of claims 29 to 32, or the kit according to any one of claim 33 to 36 for use in treating, preventing or diagnosing diseases associated with a gene of interest in a subject.
- An artificial nucleic acid molecule comprisingi. at least one coding region encoding at least one peptide of interest;ii. at least one 5’ untranslated region (5’ UTR) element comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 110-171 or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 1-62;iii. at least one 3’ untranslated region (3’ UTR) element comprising a DNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 172-210; or an RNA sequence with at least 90%identity to a sequence selected from any one of SEQ ID NOs: 63-101.
- The artificial nucleic acid molecule of claim 44, wherein the 5’ untranslated region (5’ UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 110-171, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 1-62.
- The artificial nucleic acid molecule of claim 34, wherein the 3’ untranslated region (3’ UTR) element comprising a DNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs: 172-210, or an RNA sequence with at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%identity to a sequence selected from any one of SEQ ID NOs:63-101.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNPCT/CN2023/116171 | 2023-08-31 | ||
CN2023116171 | 2023-08-31 | ||
CN2024074906 | 2024-01-31 | ||
CNPCT/CN2024/074906 | 2024-01-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2025045247A1 true WO2025045247A1 (en) | 2025-03-06 |
Family
ID=94818172
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2024/116339 WO2025045247A1 (en) | 2023-08-31 | 2024-09-02 | Nucleic acids encoding crispr-associated proteins and uses thereof |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2025045247A1 (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019067992A1 (en) * | 2017-09-29 | 2019-04-04 | Intellia Therapeutics, Inc. | Formulations |
CN111405912A (en) * | 2017-09-29 | 2020-07-10 | 因特利亚治疗公司 | Polynucleotides, compositions and methods for genome editing |
US20200268666A1 (en) * | 2017-06-14 | 2020-08-27 | Modernatx, Inc. | Polynucleotides encoding coagulation factor viii |
WO2020219876A1 (en) * | 2019-04-25 | 2020-10-29 | Intellia Therapeutics, Inc. | Ionizable amine lipids and lipid nanoparticles |
CN113056559A (en) * | 2018-09-28 | 2021-06-29 | 因特利亚治疗公司 | Compositions and methods for Lactate Dehydrogenase (LDHA) gene editing |
US20210403925A1 (en) * | 2017-03-24 | 2021-12-30 | Curevac Ag | Nucleic acids encoding crispr-associated proteins and uses thereof |
US20230159926A1 (en) * | 2020-04-09 | 2023-05-25 | Verve Therapeutics, Inc. | Base editing of pcsk9 and methods of using same for treatment of disease |
-
2024
- 2024-09-02 WO PCT/CN2024/116339 patent/WO2025045247A1/en unknown
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210403925A1 (en) * | 2017-03-24 | 2021-12-30 | Curevac Ag | Nucleic acids encoding crispr-associated proteins and uses thereof |
US20200268666A1 (en) * | 2017-06-14 | 2020-08-27 | Modernatx, Inc. | Polynucleotides encoding coagulation factor viii |
WO2019067992A1 (en) * | 2017-09-29 | 2019-04-04 | Intellia Therapeutics, Inc. | Formulations |
CN111405912A (en) * | 2017-09-29 | 2020-07-10 | 因特利亚治疗公司 | Polynucleotides, compositions and methods for genome editing |
CN113056559A (en) * | 2018-09-28 | 2021-06-29 | 因特利亚治疗公司 | Compositions and methods for Lactate Dehydrogenase (LDHA) gene editing |
WO2020219876A1 (en) * | 2019-04-25 | 2020-10-29 | Intellia Therapeutics, Inc. | Ionizable amine lipids and lipid nanoparticles |
US20230159926A1 (en) * | 2020-04-09 | 2023-05-25 | Verve Therapeutics, Inc. | Base editing of pcsk9 and methods of using same for treatment of disease |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111405912B (en) | Polynucleotides, compositions and methods for genome editing | |
US20230250439A1 (en) | Polynucleotide secondary structure | |
ES2958832T3 (en) | Plasmid containing a sequence coding for an mRNA with a cleaved poly(A) tail | |
EP2996697B1 (en) | Intracellular translation of circular rna | |
EP3317415B1 (en) | Utrs increasing the translation efficiency of rna molecules | |
CN109475646A (en) | Lipid Nanoparticle Formulations for CRISPR/CAS Components | |
TW201924724A (en) | Formulations | |
US20230272378A1 (en) | ENCODING AND EXPRESSION OF ACE-tRNAs | |
US20230340523A1 (en) | Site-Specific Gene Modifications | |
CN113811611A (en) | Nontoxic CAS9 enzyme and its application | |
CN117230062A (en) | The 5' UTR sequence designed by artificial optimization can improve the translation expression of exogenous gene | |
WO2021113494A1 (en) | Synthetic guide rna, compositions, methods, and uses thereof | |
TW202102529A (en) | Polynucleotides, compositions, and methods for polypeptide expression | |
US20240376468A1 (en) | CIRCULAR GUIDE RNAs FOR CRISPR/CAS EDITING SYSTEMS | |
CN117645996A (en) | Nucleic acid 5' UTR molecules and uses thereof | |
WO2025045247A1 (en) | Nucleic acids encoding crispr-associated proteins and uses thereof | |
US20220290147A1 (en) | System for regulating gene expression | |
US20240091343A1 (en) | Technology platform of uncapped-linear mrna with unmodified uridine | |
WO2022166771A1 (en) | 3'utr construction method and application | |
EP3502258A1 (en) | Click-modified in vitro transcribed mrna for gene expression | |
CN117460839A (en) | Linear DNA with enhanced resistance to exonucleases | |
EP2329023B1 (en) | Compositions and methods for delivery of protein-coding rnas to correct mitochondrial dysfunction | |
CN118516381B (en) | MRNA encoding luciferase and uses thereof | |
CN118256506B (en) | Artificially designed 5' UTR structure enhances expression of target gene | |
EP4410995A1 (en) | Linear dna with enhanced resistance against exonucleases and methods for the production thereof |