WO2025224107A1 - Method and compositions for detecting off-target editing - Google Patents
Method and compositions for detecting off-target editingInfo
- Publication number
- WO2025224107A1 WO2025224107A1 PCT/EP2025/060933 EP2025060933W WO2025224107A1 WO 2025224107 A1 WO2025224107 A1 WO 2025224107A1 EP 2025060933 W EP2025060933 W EP 2025060933W WO 2025224107 A1 WO2025224107 A1 WO 2025224107A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- site
- synthetic dna
- dna molecule
- attachment
- cryptic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6811—Selection methods for production or design of target specific oligonucleotides or binding molecules
Definitions
- the large serine integrase (LSI) family constitutes a diverse group of site-specific recombinases that play pivotal roles in mediating DNA rearrangements.
- the precise and efficient DNA manipulation capabilities of the large serine integrase family have positioned it as a valuable tool in gene therapy and gene editing applications.
- Therapeutic genome editing with site-specific integrases requires a genotoxic safety evaluation that includes an unbiased and genome-wide characterization of recombination specificity because unintended recombination will result in large genomic rearrangements like off-target transgene insertion and chromosomal translocations.
- the present disclosure provides new technologies for discovering potential off-target editing and profiling the recombination landscapes of integrases.
- This technology can be used to support and validate new drug development involving phage integrases, such as for site-specific genetic engineering using Programmable Addition via Site-Specific Targeting Elements (PASTE) (see lonnidi et al. doi: 10.1101/2021.11.01.466786; U.S. Patent No’s. 11,572,556; 11,834,658; 11,827881; PCT Publication No. WO 2022/087235 each of which is herein incorporated by reference in its entirety) or other suitable gene editing or gene incorporation technology.
- PASTE Site-Specific Targeting Elements
- Cryptic-Seq discovered about 40 cryptic Bxbl attB sites in the human genome and CHANGE-Seq discovered about 250 cryptic Bxbl attB sites in the human genome, while Cryptic-Seq discovered about 40,000 cryptic Bxbl attB sites in the human genome.
- the application of the presently disclosed Cryptic-Seq technology begins with the creation of DNA recombination substrates that contain phage integrase attachment sequences and Illumina next generation sequencing (NGS) compatible PCR amplification priming sequences.
- the Illumina NGS compatible sequencing adapters can be used for PCR enrichment and priming for the Illumina sequencing by synthesis chemistry, which improves NGS library preparation efficiency.
- the optional UMls improve data analysis by reducing PCR biases through the deduplication NGS reads to unique molecules at the start of the reaction.
- Plasmid DNA is a convenient format for the creation, propagation and transfection of Cryptic-Seq recombination substrates.
- Linear synthetic DNA substrates also work well.
- the Cryptic-Seq recombination substrates enable the enrichment, identification, and relative quantification of recombined genomic sequences from any species in either a cell-based or biochemical recombination reaction with a phage integrase.
- the Cryptic-Seq recombination substrates are utilized in a phage integrase-mediated recombination reaction with genomic DNA from any species, which results in the integration of Cryptic-Seq recombination substrates with target genomic DNA at potential cryptic recombination sites.
- the integration of unique DNA sequences from the recombination reaction with Cryptic-Seq substrates allows targeted PCR enrichment and NGS of recombined locations in the genome of interest. Through bioinformatic analysis, the NGS reads are filtered for attL (left) and attR (right) sequences that are associated with the completion of an integrase-mediated recombination reaction.
- the filtered reads are then aligned to the correct reference genome to identify the attL and attR genomic junctions and report the coordinates ranked by reads deduplicated from PCR amplification through the UMIs.
- the list of genomic coordinates completes the discovery phase of the potential recombination landscape for any integrase in any genome. Loci discovered by Cryptic-Seq can then be validated for recombination in the cell type of interest after genome editing with targeted methods.
- the integration enzyme is a large serine integrase.
- the large serine integrase is an integration enzyme disclose in PCT Publication No. W02023/070031, the disclosure of which is incorporated by reference in its entirety.
- FIGs. 1A-1E shows analysis of AttP variants.
- FIG. 1A shows a non-limiting schematic of AttP mutations tested for improving integration efficiency (SEQ ID NOS: 394 and 540-542, respectively, in order of appearance).
- FIG. IB shows integration efficiencies of wildtype and mutant AttP sites across a panel of AttB lengths.
- FIG. 1C shows a non-limiting schematic of multiplexed integration of different cargo sets at specific genomic loci. Three fluorescent cargos (GFP, mCherry, and YFP) are inserted orthogonally at three different loci (ACTB, LMNB1, NOLC1) for in-frame gene tagging.
- FIG. 1A shows a non-limiting schematic of AttP mutations tested for improving integration efficiency (SEQ ID NOS: 394 and 540-542, respectively, in order of appearance).
- FIG. IB shows integration efficiencies of wildtype and mutant AttP sites across a panel of AttB lengths.
- FIG. 1C shows
- FIG. 2 illustrates a schematic of single atgRNA and dual atgRNA approaches for beacon placement (“integration recognition site”).
- FIG. 3 shows a schematic of Cryptic-Seq.
- Cryptic-Seq begins with the construction of DNA sequences that contains an attP or attB attachment site, unique molecular identifiers (UMIs) and PCR primer targeting sites (P7 and N7) that serve dual purposes, PCR amplification and Illumina sequencing by synthesis (SBS).
- UMIs unique molecular identifiers
- P7 and N7 PCR primer targeting sites
- SBS Illumina sequencing by synthesis
- the DNA sequences with attachment sequences are then combined with genomic DNA (gDNA) from any species in a biochemical recombination reaction with any large serine integrase (LSI).
- gDNA genomic DNA
- LSI large serine integrase
- the genomic DNA is simultaneously sequence-tagged and fragmented (tagmented) with the transposase Tn5 to impart a custom DNA sequence tag that includes a partial Illumina P5 sequencing adapter.
- the library is amplified with anchor PCR to enrich recombined DNA sequences and impart the full-length sequencing adapters for sequencing on the Illumina HTS platform.
- the next generation sequencing (NGS) data is converted to FASTQs for each sample that contains the sequences corresponding quality scores associated with each sample.
- FIG. 4 shows a schematic of HIDE-Seq.
- genomic DNA is exposed to an LSI protein and linear DNA attachment substrates, a double strand break is produced with the attachment sequences appended at the break points.
- HIDE-Seq can also detect double-strand DNA breaks and gross chromosomal rearrangements between two endogenous loci that were recombined by an LSI.
- FIG. 5 shows a schematic of Cryptic-seq analysis workflow with tbChaSin.
- FIG. 6 shows a schematic of HIDE-seq analysis workflow with tbDigln.
- FIG. 7 shows a schematic of hybridization capture followed by next generation sequencing (“Hybrid Capture”).
- FIG. 8 shows a schematic of CHASER-Seq.
- FIG. 9 shows a schematic of Att-Seq.
- FIG. 10 shows the potential cryptic attB sites discovered with Cryptic-Seq.
- the attB sequence logo that represents the specificity profile of Bxbl integrase was derived from recombinogenic human sites that match some described priority base pairs in attB from previous functional assessments of attB in isolation.
- FIG. 11 shows the comparison of Bxbl Integrase-dependent recurrent DSBs as discovered by HIDE-Seq with the recurrent DSBs of CRISPR/Cas9.
- FIG. 12 shows the potential cryptic attB sites discovered with HIDE-Seq.
- the attB sequence logo that represents the specificity profile of Bxbl integrase was derived from recombinogenic human sites that match some described priority base pairs in attB from previous functional assessments of attB in isolation.
- Figure discloses SEQ ID NO: 410.
- FIG. 13 shows the highest ranked cryptic attB sites discovered by HIDE-Seq and Cryptic-Seq and evaluated by hybrid capture.
- FIG. 14 shows the heatmap of the cryptic sites validated by hybrid capture.
- FIGs. 15A-15D Cryptic-seq off-target discovery for the LSI Bxbl across multiple central dinucleotides.
- Fig. 15A is a sequence logo that shows genome-wide search with the PWM generated by HIDE-seq with HOMER identified 4,598,283 potential off-target loci.
- Fig. 15A is a sequence logo that shows genome-wide search with the PWM generated by HIDE-seq with HOMER identified 4,598,283 potential off-target loci.
- FIG. 15C is a Manhattan-style scatter plot showing Genomic distribution of cryptic attB sites plotted against the UMI signal discovered by Cryptic-seq.
- Fig. 15D is a DNA sequence motif from the 410,776 cyptic attB sites discovered by Cryptic-seq. Natural Bxbl attB sequence is displayed on the bottom. Figure discloses SEQ ID NO: 426.
- FIGs. 16A-16B Target vs predicted cross-validation plots from IntQuery and a linear regression model.
- FIGs. 17A-17C Large Serine Integrase (LSI) Mechanism of Action.
- Fig. 17A is a schematic diagram showing serine integrases catalyzing recombination between attachment (all) sites on linear or circular DNA substrates. Integrase dimers bind to specific all sequences in the phage attP and bacterial host (attB) DNA. Integrase bound to attP and integrase bound to attB associate to form a synaptic complex that connects the paired homologous sequences.
- the integrase subunits cleave all four DNA strands at the central dinucleotide, forming 5 '-phosphoserine linkages between integrase subunits and DNA halfsites and generating 3 '-dinucleotide overhangs.
- the P' and B'-linked subunits exchange places by rotating 180° about a horizontal axis relative to the P and B-linked subunits. Basepairing between the central dinucleotides promotes ligation of the DNA strands, resulting in formation of two new attachment sites, attachment left (attL) and attachment right (attR).
- Figure discloses SEQ ID NOS 394, 590, 410, and 591, respectively, in order of appearance. Fig.
- FIG. 17B Loci in the human genome with homology to LSI attB or attP sites are often called cryptic attachment sites because they are not obvious but have sufficient homology to catalyze recombination.
- Figure discloses SEQ ID NOS 592 and 410, respectively, in order of appearance.
- Fig. 17C There are 2 classes of off-target editing from an LSI; DNA mutagenesis in the form of indels from FDEs that may occur if recombination is disrupted in between strand cleavage and re-ligation.
- the second class is DNA structural variants, and these can come in two forms, the first is from off-target cargo insertion at a site in the human genome with homology to the attachment sequences. The second appears in the form of gross chromosomal rearrangements that may involve the attachment sequence introduced by genome editing or the interaction between two sites with homology to the attachment sequences.
- FIGs. 18A-18E High-throughput Integrase-mediated DNA Event Sequencing (HIDE-seq).
- Fig. 18A is a graphic of HIDE-seq experimental workflow. Genomic DNA is recombined with linear attB or attP substrates by an LSI and then subjected to WGS.
- Fig. 18B are Venn diagrams of the loci from FDEs detected by HIDE-seq for Bxbl and Digenome-seq for Cas9. Each circle represents a biochemical replicate reaction. Zero recurrent and background levels of FDEs were detected in samples treated with Bxbl alone, Bxbl and attP, or Bxbl and attB. In contrast, 12,597 recurrent FDEs were detected in our inline control using Digenome-seq and Cas9 with the off-target standard sgRNA VEGFA site 2.
- Fig. 18C is an integrative genomics viewer (IGV) of HIDE-seq reads supporting recombination display distinct opposing read alignments.
- the top panel corresponds to on- target attB integration reads supporting recombination with attL (P’) and attR (P) sequences appended to the ends.
- the reads align with lentiviral genomic sequence in the HEK293attB genome and then appear soft-clipped (multicolored) where the linear attP substrates were recombined because these sequences do not exist in the human genome hg38 alignment reference file.
- the bottom panel corresponds to off-target integration where soft clipped reads that correspond to attL (P’) and attR (P) sequences are detected amongst the WGS coverage.
- FIGs. 19A-19E Cryptic-seq
- Fig. 19A is a graphic of Cryptic-seq experimental workflow. First, a library of genomic DNA is created with Tn5 tagmentation that imparts partial Illumina sequencing adapters.
- Fig. 19B is a representative IGV browser of Cryptic-Seq reads supporting recombination at the on-target attB site and a cryptic attB.
- the top panel corresponds to on-target alignment of the left library (N7) with reads supporting the right junction (attR).
- the reads that align with lentiviral genomic sequence in the HEK293 ⁇ //7> genome appear soft-clipped where the linear attP substrates were recombined because these sequences do not exist in the human genome hg38 alignment reference file.
- the bottom panel corresponds to off-target alignment of the N7 library with reads supporting attR (in this view the cryptic site is in the antiparallel orientation such that the GT dinucleotide is on the bottom DNA strand so the attR reads appear on the left).
- the reads align with genomic sequence and then appear soft-clipped where the linear attP substrates were recombined because these sequences do not exist in the human genome hg38 alignment reference file.
- Fig. 19C is a summary table with the number of cryptic attB sites detected in the human genome.
- the "Recurrence” section summarizes the overlap analysis across replicates.
- the number of sites shared between the corresponding set of replicates is indicated, e.g., the top row “Repl,Rep2,Rep3” corresponds to sites observed in all three Cryptic-seq replicates, at increasing UMI levels. For sites shared between replicates, the lowest of the UMI values is used. As shown here, all sites > 50 UMIs were observed in all three replicates.
- the bottom row “Unique sites” section indicates the total number of unique sites across all three replicates, at increasing UMI levels. A total of 44,311 unique cryptic attB sites were detected in the hg38 reference human genome with 1 UMI or more.
- Fig. 19D is a Venn diagram of cryptic attB discovery overlap from HIDE-Seq and Cryptic-seq.
- Fig. 19E is a DNA sequence motif logos, unweighted by UMI count (top) and weighted by UMI count (bottom), created by aligning the 46 bp sequence from the 44,311 cryptic attB sites.
- FIGs. 20A-20G Hybrid Capture NGS Validation of Off-Target Editing
- Fig. 20A is a graphic showing that K562attB cells were edited with Bxbl integrase and the purified gDNA was used for hybrid capture followed by NGS. Hybrid capture probes were designed on the left and right sides of the cryptic attachment loci to enable unbiased detection of indels, cargo integration and genomic rearrangements.
- Fig. 20B consists of graphs showing the sensitivity of off-target insertion detection with hybrid capture NGS was 0.1% when using a synthetic DNA standard spike-in from one of the off-target loci discovered, CAS031.
- r 2 Pearson coefficient of determination Fig.
- FIG. 20C is a graph of K562attB cells edited with Bxbl had 45- 48% levels of insertion measured by ddPCR.
- Fig. 20D is a scatter plot showing hybrid capture NGS with single-sided capture probes detected 52 sites with off-target integration in K562attB cells and most validated off-target frequencies were at or below our lower limit of detection.
- Fig. 20E is a graph showing quantification of off-target insertion at CAS421 in K562attB cells using ddPCR confirms the frequency of off-target integration detected by hybrid capture NGS.
- Fig. 20F is a graph of 5 cryptic attB sites validated by WGS.
- Fig. 20G is a graph Discovery assay classification errors for HIDE-seq and Cryptic-Seq reveal high level of false positive rate for Cryptic-seq but achieve the target goal of 0% false negative rate.
- this disclosure features a method for identifying cryptic attachment sites that are recognizable by a selected large serine integrase (LSI) in a target genome, the method comprising:
- the present disclosure features a method for identifying DNA recombination events of a selected integrase in a target genome, the method comprising:
- the integration site is the canonical phage attachment site attP or attB and is functionally capable of supporting recombination, resulting in integration of attP or attB containing DNA cargo into the genome.
- the phage integrase can mediate the integration of plasmids bearing the canonical phage attachment site into native sequences that have partial sequence identity with attP or attB.
- Gene editor is a protein that that can be used to perform gene editing, gene modification, gene insertion, gene deletion, or gene inversion.
- gene editor polynucleotide refers to polynucleotide sequence encoding the gene editor protein.
- the gene editor comprises DNA- or RNA-targetable nuclease protein (i.e., Cas protein) wherein target specificity is mediated by a complexed nucleic acid (i.e., guide RNA).
- the gene editor is a DNA-targetable or RNA-targetable protein in which target specificity is mediated by internal, conjugated, fused, or linked amino acids, such as within TALENs, ZFNs, or meganucleases.
- target specificity is mediated by internal, conjugated, fused, or linked amino acids, such as within TALENs, ZFNs, or meganucleases.
- the skilled person in the art would appreciate that the gene editor can demonstrate targeted nuclease activity, targeted binding with no nuclease activity, targeted nickase activity (or cleavase activity).
- a gene editor comprising a targetable protein may be fused, linked, complexed, operate in cis or trans to one or more proteins or protein fragment motifs.
- Gene editors may be fused or linked to one or more integrase, recombinase, polymerase, telomerase, reverse transcriptase, or invertase.
- a gene editor can be an alpha editor fusion protein or a gene writer fusion protein.
- Base editors such as ADAR or AD AT may be delivered as cargo in various embodiments, as described herein.
- Alpha editor protein or fusion protein is a gene editor protein.
- Alpha editor system as used herein describes the components used in alpha editing and alpha editing and alpha editor are used interchangeably herein with the terms “prime editing or prime editor (PE).
- Alpha editing uses a CRISPR protein, which has enzymatic activity that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; the nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts.
- Prime editing (PE) and prime editing components (pegRNA) can be utilized as well.
- the nickase is programmed (targeted) with an alpha-editing guide RNA (aeg RNA or a pegRNA).
- aeg RNA or a pegRNA alpha-editing guide RNA
- pegRNA both specifies the target site and encodes the desired edit.
- Attachment site containing guide RNA (atgRNA) that both specifies the target and encodes for the desired integrase target recognition site are provided.
- the nickase may be programmed (targeted) with an atgRNA.
- the nickase is a catalytically impaired Cas9 endonuclease, i.e., a Cas9 nickase, that is fused to a reverse transcriptase.
- the reverser transcriptase can also be provided in the alpha editor system split from the Cas9 nickase.
- the Cas9 nickase part of the protein is guided to the DNA target site by the atgRNA (or pegRNA), whereby a nick or single stranded cut occurs.
- the reverse transcriptase domain then uses the atgRNA (or pegRNA) to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand.
- the edited DNA strand replaces the original DNA strand, creating a heteroduplex containing one edited strand and one unedited strand.
- the alpha editor guides resolution of the heteroduplex to favor copying the edit onto the unedited strand, completing the process (typically achieved with a nickase gRNA).
- Other enzymes that can be used to nick or cut only a single strand of double stranded DNA includes a cleavase (e.g., cleavase I enzyme).
- an additional agent or agents may be added that improve the efficiency and outcome purity of the alpha or prime edit.
- the agent may be chemical or biological and disrupt DNA mismatch repair (MMR) processes at or near the edit site (i.e., PE4 and PE5 and PEmax architecture by Chen et al. Cell, 184, 1-18, October 28, 2021; Chen et al. is incorporated herein by reference).
- MMR DNA mismatch repair
- the agent is a MMR-inhibiting protein.
- the MMR-inhibiting protein is dominant negative MMR protein.
- the dominant negative MMR protein is MLHldn.
- the MMR-inhibiting agent is incorporated into the multicomponent delivery method described herein.
- the MMR- inhibiting agent is linked or fused to the alpha editor protein fusion, which may or may not have a linked or fused integrase.
- the MMR-inhibiting agent is linked or fused to the Gene WriterTM protein, which may or may not have a linked or fused integrase.
- the alpha editor or gene editor system can be used to achieve DNA deletion and replacement.
- the DNA deletion replacement is induced using a pair of atgRNAs or pegRNA that target opposite DNA strands, programming not only the sites that are nicked but also the outcome of the repair (i.e., PrimeDel by Choi et al. Nat.
- the DNA deletion is induced using a single atgRNA.
- the DNA deletion and replacement is induced using a wild type Cas9 alpha or prime editor (AE-Cas9 or PE-Cas9) system (i.e., PED AR by Jiang et al. Nat. Biotechnology, October 14, 2021; Jiang et al. is incorporated herein by reference in its entirety).
- the DNA replacement is an integrase target recognition site or recombinase target recognition site.
- the constructs and methods described herein may be utilized to incorporate the pair of pegRNAs (or atgRNAs) used in PrimeDel, TwinPE (WO2021226558 incorporated by reference herein in its entirety), or PED AR, the alpha editor fusion protein or Gene Writer protein, optionally a nickase guide RNA (ngRNA), an integrase, a nucleic acid cargo, and optionally a recombinase into a LNP delivery system or vector delivery system (e.g., AAV or Adenovirus).
- the integrase may be directly linked, for example by a peptide linker, to the prime editor fusion or gene writer protein.
- the alpha editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a CRISPR enzyme nickase such as a Cas9 H840A nickase, a Cas9nickase.
- the alpha editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a cleavase.
- M-MLV Moloney Murine Leukemia Virus
- RT Moloney Murine Leukemia Virus
- the RT can be fused at, near or to the C- terminus of a Cas9nickase, e.g., Cas9 H840A. Fusing the RT to the C-terminus region, e.g., to the C-terminus, of the Cas9 nickase may result in higher editing efficiency.
- a complex is called PEI.
- the CRISPR enzyme nickase e.g., Cas9(H840A), i.e., a Cas9nickase
- the CRISPR enzyme nickase instead of being a Cas9 (H840A), i.e., instead of being a Cas9 nickase, the CRISPR enzyme nickase instead can be a CRISPR enzyme that naturally is a nickase or cuts a single strand of double stranded DNA; for instance, the CRISPR enzyme nickase can be Casl2a/b. Alternatively, the CRISPR enzyme nickase can be another mutation of Cas9, such as Cas9(D10A).
- a CRISPR enzyme such as a CRISPR enzyme nickase, such as Cas9 (wild type), Cas9(H840A), Cas9(D10A) or Cas 12a/b nickase can be fused in some embodiments to a pentamutant of M-MLV RT (D200N/ L603W/ T330P/ T306K/ W313F), whereby there can be up to about 45-fold higher efficiency, and this is called PE2.
- a CRISPR enzyme nickase such as Cas9 (wild type), Cas9(H840A), Cas9(D10A) or Cas 12a/b nickase
- a pentamutant of M-MLV RT D200N/ L603W/ T330P/ T306K/ W313F
- the M- MLV RT comprise one or more of the mutations Y8H, P51L, S56A, S67R, E69K, V129P, L139P, T197A, H204R, V223H, T246E, N249D, E286R, Q2911, E302K, E302R, F309N, M320L, P330E, L435G, L435R, N454K, D524A, D524G, D524N, E562Q, D583N, H594Q, E607K, D653N, and L671P. Specific M-MLV RT mutations are shown in Table 1.
- the reverse transcriptase can also be a wild-type or modified transcription xenopolymerase (RTX), avian myeloblastosis virus reverse transcriptase (AMV RT), Feline Immunodeficiency Virus reverse transcriptase (FIV-RT), FeLV-RT (Feline leukemia virus reverse transcriptase), HIV-RT (Human Immunodeficiency Virus reverse transcriptase).
- RTX transcription xenopolymerase
- AMV RT avian myeloblastosis virus reverse transcriptase
- FV-RT Feline Immunodeficiency Virus reverse transcriptase
- FeLV-RT FeLV-RT
- Feline leukemia virus reverse transcriptase FeLV-RT
- HIV-RT Human Immunodeficiency Virus reverse transcriptase
- the reverse transcriptase can be a fusion of MMuLV to the Sto7d DNA binding domain (see lonnidi et al.
- PE3, PE3b, PE4, PE5, and/or PEmax which a skilled person can incorporate into the co-delivery system described herein, involves nicking the non-edited strand, potentially causing the cell to remake that strand using the edited strand as the template to induce HR.
- the nicking of the non-edited strand can involve the use of a nicking guide RNA (ngRNA).
- Prime editors can be found in the following: W02020/191153, W02020/191171, WO2020/191233, WO2020/191234, WO2020/191239, W02020/191241, WO2020/191242, WO2020/191243, WO2020/191245, WO2020/191246, WO2020/191248, WO2020/191249, each of which is incorporated by reference herein in its entirety.
- the prime editor protein Prior to RT -mediated edit incorporation, the prime editor protein (or system) (1) site-specifically targets a genomic locus and (2) performs a catalytic cut or nick. These steps are typically performed by a CRISPR-Cas.
- the Cas protein may be substituted by other nucleic acid programmable DNA binding proteins (napDNAbp) such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), or meganucleases.
- ZFNs zinc finger nucleases
- TALENs transcription activator-like effector nucleases
- meganucleases meganucleases
- a Gene Writer protein comprises: (A) a polypeptide or a nucleic acid encoding a polypeptide, wherein the polypeptide comprises (i) a reverse transcriptase domain, and either (x) an endonuclease domain that contains DNA binding functionality or (y) an endonuclease domain and separate DNA binding domain; and (B) a template RNA comprising (i) a sequence that binds the polypeptide and (ii) a heterologous insert sequence. Examples of such Gene WriterTM proteins and related systems can be found in US20200109398, which is incorporated by reference herein in its entirety.
- the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct.
- the split construct in reconstituted in a cell.
- the split construct can be fused or ligated via intein protein splicing.
- the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions.
- the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization.
- the split construct can be adapted into one or more delivery vectors described herein.
- an integrase or recombinase is directly linked or fused, for example by a peptide linker, which may be cleavable or non-cleavable, to the prime editor fusion protein (i.e., fused Cas9 nickase-reverse transcriptase) or Gene Writer protein.
- a peptide linker which may be cleavable or non-cleavable, to the prime editor fusion protein (i.e., fused Cas9 nickase-reverse transcriptase) or Gene Writer protein.
- Suitable linkers for example between the Cas9, RT, and integrase, may be selected from Table 3:
- the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct.
- the split construct in reconstituted in a cell.
- the split construct can be fused or ligated via intein protein splicing.
- the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions.
- the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization.
- the split construct can be adapted into one or more nucleic acid constructs described herein.
- SpCas9 Streptococcus pyogenes Cas9
- REC recognition
- NUC nuclease
- the REC lobe can be divided into three regions, a long a helix referred to as the bridge helix (residues 60-93), the RECI (residues 94-179 and 308-713) domain, and the REC2 (residues 180-307) domain.
- the NUC lobe consists of the RuvC (residues 1-59, 718-769, and 909-1098), HNH (residues 775- 908), and PAM-interacting (PI) (residues 1099-1368) domains.
- the negatively charged sgRNA:target DNA heteroduplex is accommodated in a positively charged groove at the interface between the REC and NUC lobes.
- the RuvC domain is assembled from the three split RuvC motifs (RuvC I— III) and interfaces with the PI domain to form a positively charged surface that interacts with the 30 tail of the sgRNA.
- the HNH domain lies between the RuvC II— III motifs and forms only a few contacts with the rest of the protein. Structural aspects of SpCas9 are described by Nishimasu et al., Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA, Cell 156, 935-949, February 27, 2014.
- the REC lobe includes the RECI and REC2 domains.
- the REC2 domain does not contact the bound guide Target heteroduplex, indicating that truncation of REC lobe may be tolerated by SpCas9.
- SpCas9 mutant lacking the REC2 domain (D175-307) retained -50% of the wild-type Cas9 activity, indicating that the REC2 domain is not critical for DNA cleavage.
- PAM-Inter acting domain' The NUC lobe contains the PAM-interacting (PI) domain that is positioned to recognize the PAM sequence on the noncomplementary DNA strand.
- the PI domain of SpCas9 is required for the recognition of 5’-NGG-3’ PAM, and deletion of the PI domain (A1099-1368) abolished the cleavage activity, indicating that the PI domain is critical for SpCas9 function and a major determinant for the PAM specificity.
- RuvC domain' The RuvC nucleases of SpCas9 have an RNase H fold and four catalytic residues, AsplO (Ala), Glu762, His983, and Asp986, that are critical for the two- metal cleavage of the noncomplementary strand of the target DNA.
- AsplO AsplO
- Glu762, His983, and Asp986, that are critical for the two- metal cleavage of the noncomplementary strand of the target DNA.
- the Cas9 RuvC domain has other structural elements involved in interactions with the guide:target heteroduplex (an end-capping loop between a42 and a43) and the PI domain/stem loop 3 (P hairpin formed by P3 and [34).
- SpCas9 HNH nucleases have three catalytic residues, Asp839, His840, and Asn863 and cleave the complementary strand of the target DNA through a single-metal mechanism.
- sgRNA:DNA recognition' The sgRNA guide region is primarily recognized by the REC lobe.
- the backbone phosphate groups of the guide region interact with the RECI domain (Argl65, Glyl66, Arg403, Asn407, Lys510, Tyr515, and Arg661) and the bridge helix (Arg63, Arg66, Arg70, Arg71, Arg74, and Arg78).
- the 20- hydroxyl groups of Gl, C15, U16, and G19 hydrogen bond with Vai 1009, Tyr450, Arg447/Ile448, and Thr404, respectively.
- the alanine mutations of the repeat anti -repeat duplex-interacting residues (Arg75 and Lysl63) and the stemloop- 1 -interacting residue (Arg69) resulted in decreased DNA cleavage activity, confirming the functional importance of the recognition of the repeat: anti -repeat duplex and stem loop 1 by Cas9.
- SpCas9 recognizes the guide:target heteroduplex in a sequence-independent manner.
- the backbone phosphate groups of the target DNA (nucleotides 1, 9-11, 13, and 20) interact with the RECI (Asn497, Trp659, Arg661, and Gln695), RuvC (Gln926), and PI (Glul 108) domains.
- the C2’ atoms of the target DNA form van der Waals interactions with the RECI domain (Leul69, Tyr450, Met495, Met694, and His698) and the RuvC domain (Ala728).
- the terminal base pair of the guide:target heteroduplex (G1 :C20’) is recognized by the RuvC domain via end-capping interactions; the sgRNA G1 and target DNA C20’ nucleobases interact with the Tyrl013 and Vai 1015 side chains, respectively, whereas the 20-hydroxyl and phosphate groups of sgRNA G1 interact with Vall009 and Gln926, respectively.
- A51 adopts the syn conformation and is oriented in the direction opposite to U50.
- the nucleobase of A51 is sandwiched between Phel 105 and U63, with its Nl, N6, and N7 atoms hydrogen bonded with G62, Glyl 103, and Phel 105, respectively.
- Stem-loop recognition' Stem loop 1 is primarily recognized by the REC lobe, together with the PI domain.
- the backbone phosphate groups of stem loop 1 interact with the RECI domain (Leu455, Ser460, Arg467, Thr472, and Ile473), the PI domain (Lysl 123 and Lysl 124), and the bridge helix (Arg70 and Arg74), with the 20- hydroxyl group of G58 hydrogen bonded with Leu455.
- A52 interacts with Phel 105 through a face-to-edge p-p stacking interaction, and the flipped U59 nucleobase hydrogen bonds with Asn77.
- the single-stranded linker and stem loops 2 and 3 are primarily recognized by the NUC lobe.
- the backbone phosphate groups of the linker (nucleotides 63-65 and 67) interact with the RuvC domain (Glu57, Lys742, and Lysl097), the PI domain (Thrl 102), and the bridge helix (Arg69), with the 20-hydroxyl groups of U64 and A65 hydrogen bonded with Glu57 and His721, respectively.
- the C67 nucleobase forms two hydrogen bonds with Vail 100.
- Stem loop 2 is recognized by Cas9 via the interactions between the NUC lobe and the non-Watson-Crick A68:G81 pair, which is formed by direct (between the A68 N6 and G81 06 atoms) and water-mediated (between the A68 N1 and G81 N1 atoms) hydrogen-bonding interactions.
- the A68 and G81 nucleobases contact Serl351 and Tyrl356, respectively, whereas the A68:G81 pair interacts with Thrl358 via a water-mediated hydrogen bond.
- the 20-hydroxyl group of A68 hydrogen bonds with Hisl349, whereas the G81 nucleobase hydrogen bonds with Lys33.
- Stem loop 3 interacts with the NUC lobe more extensively, as compared to stem loop 2.
- the backbone phosphate group of G92 interacts with the RuvC domain (Arg40 and Lys44), whereas the G89 and U90 nucleobases hydrogen bond with Gin 1272 and Glul225/Alal227, respectively.
- the A88 and C91 nucleobases are recognized by Asn46 via multiple hydrogen-bonding interactions.
- Cas9 proteins smaller than SpCas9 allow more efficient packaging of nucleic acids encoding CRISPR systems, e.g., Cas9 and sgRNA into one rAAV (“all-in-one-AAV”) particle.
- efficient packaging of CRISPR systems can be achieved in other viral vector systems (i.e., lentiviral, integration deficient lentiviral, hd-AAV, etc.) and non-viral vector systems (i.e., lipid nanoparticle).
- Small Cas9 proteins can be advantageous for multidomain-Cas-nuclease-based systems for prime editing.
- Cas9 proteins include Staphylococcus aureus (SauCas9, 1053 amino acid residues) and Campylobacter jejuni (CjCas9, 984 amino residues).
- SerCas9 Staphylococcus aureus
- CjCas9 Campylobacter jejuni
- Staphylococcus lugdunensis (Siu) Cas9 as having genome-editing activity and provided homology mapping to SpCas9 and SauCas9 to facilitate generation of nickases and inactive (“dead”) enzymes (Schmidt et al., 2021, Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases. Nat Commun 12, 4219. doi.org/10.1038/s41467- 021-24454-5) and engineered nucleases with higher cleavage activity by fragmenting and shuffling Cas9 DNAs.
- the small Cas9s and nickases are useful in the instant invention.
- the Cas9 proteins used herein may also include other “Cas9 variants” having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 protein, including any wild type Cas9, or mutant Cas9 (e.g., a dead Cas9 or Cas9 nickase), or fragment Cas9, or circular permutant Cas9, or other variant of Cas9 disclosed herein or known in the art.
- Cas9 variants having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 protein, including any wild
- a Cas9 variant may have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 21, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more amino acid changes compared to a reference Cas9.
- the Cas9 variant comprises a fragment of a reference Cas9 (e.g., a gRNA binding domain or a DNA-cleavage domain), such that the fragment is at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to the corresponding fragment of wild type Cas9.
- a reference Cas9 e.g., a gRNA binding domain or a DNA-cleavage domain
- the fragment is at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% identical, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% of the amino acid length of a corresponding wild type Cas9 (e.g., SEQ ID NO: 18).
- the disclosure also may utilize Cas9 fragments that retain their functionality and that are fragments of any herein disclosed Cas9 protein.
- the Cas9 fragment is at least 100 amino acids in length.
- the fragment is at least 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150, 1200, 1250, or at least 1300 amino acids in length.
- the prime editors disclosed herein may comprise one of the Cas9 variants described as follows, or a Cas9 variant thereof having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 variants.
- prime editors utilized herein comprise CRISPR-Cas system enzymes other than type II enzymes.
- prime editors comprise type V or type VI CRISPR-Cas system enzymes. It will be appreciated that certain CRISPR enzymes exhibit promiscuous ssDNA cleavage activity and appropriate precautions should be considered.
- prime editors comprise a nickase or a dead CRISPR with nuclease function comprised in a different component.
- the nucleic acid programmable DNA binding proteins utilized herein include, without limitation, Cas9 (e.g., dCas9 and nCas9), Casl2a (Cpfl), Casl2bl (C2cl), Casl2b2, Casl2c (C2c3), Casl2d (CasY), Casl2e (CasX), C2c4, C2c5, C2c8, C2c9, C2cl0, Cast 3a (C2c2), Cast 3b (C2c6), Cast 3c (C2c7), Cast 3d, and Argonaute.
- Cas9 e.g., dCas9 and nCas9
- Casl2a Casl2a
- Casl2a Casl2a
- Casl2bl Casl2b2cl
- Casl2c3 Casl2d
- CasY Casl2e
- CasX
- Cas-equivalents further include those described in Makarova et al., “C2c2 is a singlecomponent programmable RNA-guided RNA-targeting CRISPR effector,” Science 2016; 353(6299) and Makarova et al., “Classification and Nomenclature of CRISPR-Cas Systems: Where from Here?,” The CRISPR Journal, Vol.l. No.5, 2018, the contents of which are incorporated herein by reference.
- One example of a nucleic acid programmable DNA-binding protein that has different PAM specificity than Cas9 is Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 (i.e, Casl2a (Cpfl)).
- Casl2a (Cpfl) is also a Class 2 CRISPR effector, but it is a member of type V subgroup of enzymes, rather than the type II subgroup. It has been shown that Casl2a (Cpfl) mediates robust DNA interference with features distinct from Cas9.
- Casl2a (Cpfl) is a single RNA- guided endonuclease lacking tracrRNA, and it utilizes a T-rich protospacer-adjacent motif (TTN, TTTN, or YTN). Moreover, Cpfl cleaves DNA via a staggered DNA double-stranded break.
- Cpfl proteins are known in the art and have been described previously, for example Yamano et al., “Crystal structure of Cpfl in complex with guide RNA and target DNA.” Cell (165) 2016, p.949-962; the entire contents of which is hereby incorporated by reference.
- prime editors used herein comprise the type V CRISPR family includes Francisella novicida U112 Cpfl (FnCpfl) also known as FnCasl2a.
- FnCpfl adopts a bilobed architecture with the two lobes connected by the wedge (WED) domain.
- the N- terminal REC lobe consists of two a-helical domains (RECI and REC2) that have been shown to coordinate the crRNA-target DNA heteroduplex.
- the C-terminal NUC lobe consists of the C-terminal RuvC and Nuc domains involved in target cleavage, the arginine-rich bridge helix (BH), and the PAM-interacting (PI) domain.
- the repeat-derived segment of the crRNA forms a pseudoknot stabilized by intra-molecular base-pairing and hydrogen-bonding interactions.
- the pseudoknot is coordinated by residues from the WED, RuvC, and REC2 domains, as well as by two hydrated magnesium cations.
- nucleotides 1-5 of the crRNA are ordered in the central cavity of FnCasl2a and adopt an A-form-like helical conformation. Conformational ordering of the seed sequence is facilitated by multiple interactions between the ribose and phosphate moieties of the crRNA backbone and FnCpfl residues in the WED and RECI domains.
- FnCasl2a-crRNA complex further reveals that the bases of the seed sequence are solvent exposed and poised for hybridization with target DNA.
- Structural aspects of FnCpfl are described by Swarts et al., Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Casl2a, Molecular Cell 66, 221-233, April 20, 2017.
- the crRNA-target DNA strand heteroduplex is enclosed in the central cavity formed by the REC and NUC lobes and interacts extensively with the RECI and REC2 domains.
- the PAM-containing DNA duplex comprises target strand nucleotides dT0-dT8 and non-target strand nucleotides dA(8)*-dA0* and is contacted by the PI, WED, and RECI domains.
- the 5’-TTN-3’ PAM is recognized in FnCasl2a by a mechanism combining the shape-specific recognition of a narrowed minor groove, with base-specific recognition of the PAM bases by two invariant residues, Lys671 and Lys613.
- the duplex of the target DNA is disrupted by the side chain of residue Lys667, which is inserted between the DNA strands and forms a cation-7t stacking interaction with the dA0-dT0* base pair.
- the phosphate group linking target strand residues dT(-l) and dTO is coordinated by hydrogen-bonding interactions with the side chain of Lys823 and the backbone amide of Gly826.
- Target strand residue dT(-l) bends away from residue TO, allowing the target strand to interact with the seed sequence of the crRNA.
- the non-target strand nucleotides dTl*-dT5* interact with the Arg692-Ser702 loop in FnCasl2a through hydrogen-bonding and ionic interactions between backbone phosphate groups and side chains of Arg692, Asn700, Ser702, and Gln704, as well as main-chain amide groups of Lys699, Asn700, and Ser702.
- Alanine substitution of Q704 or replacement of residues Thr698-Ser702 in FnCasl2a with the sequence Ala-Gly3 (SEQ ID NO: 115) substantially reduced DNA cleavage activity, suggesting that these residues contribute to R-loop formation by stabilizing the displaced conformation of the nontarget DNA strand.
- the crRNA-target strand heteroduplex is terminated by a stacking interaction with a conserved aromatic residue (Tyr410). This prevents base pairing between the crRNA and the target strand beyond nucleotides U20 and dA(-20), respectively. Beyond this point, the target DNA strand nucleotides re-engage the non-target DNA strand, forming a PAM-distal DNA duplex comprising nucleotides dC(-21)-dA(-27) and dG21*-dT27*, respectively. The duplex is confined between the REC2 and Nuc domains at the end of the central channel formed by the REC and NUC lobes.
- FnCpfl can independently accommodate both the target and non-target DNA strands in the catalytic pocket of the RuvC domain.
- the RuvC active site contains three catalytic residues (D917, E1006, and D1255). Structural observations suggest that both the target and non-target DNA strands are cleaved by the same catalytic mechanism in a single active site in Cpfl/Casl2a enzymes.
- Another type V CRISPR is AsCpfl from Acidaminococcus sp BV3L6 (Yamano et al., Crystal structure of Cpfl in complex with guide RNA and target DNA, Cell 165, 949-962, May 5, 2016)
- the nuclease comprises a Casl2f effector.
- Small CRISPR- associated effector proteins belonging to the type V-F subtype have been identified through the mining of sequence databases and members classified into Casl2fl (Casl4a and type V- U3), Casl2f2 (Casl4b) and Casl2f3 (Casl4c, type V-U2 and U4).
- Casl2fl Casl2fl
- Casl4b Casl2f2
- Casl4c type V-U2 and U4
- CRISPR-Cas proteins and enzymes used in the prime editors herein include the following without limitation.
- protospacer adjacent sequence or “protospacer adjacent motif’ or “PAM” refers to an approximately 2-6 base pair DNA sequence (or a 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 11-, 12-long nucleotide sequence) that is an important targeting component of a Cas9 nuclease.
- PAM sequence is on either strand, and is downstream in the 5' to 3' direction of Cas9 cut site.
- the canonical PAM sequence (i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes or SpCas9) is 5'-NGG-3' wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases.
- N is any nucleobase followed by two guanine (“G”) nucleobases.
- G guanine
- Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms.
- any given Cas9 nuclease may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes alternative PAM sequence.
- the PAM specificity can be modified by introducing one or more mutations, including (a) DI 135V, R1335Q, and T1337R “the VQR variant”, which alters the PAM specificity to NGAN or NGNG, (b) DI 135E, R1335Q, and T1337R “the EQR variant”, which alters the PAM specificity to NGAG, and (c) DI 135V, G1218R, R1335E, and T1337R “the VRER variant”, which alters the PAM specificity to NGCG.
- the DI 135E variant of canonical SpCas9 still recognizes NGG, but it is more selective compared to the wild type SpCas9 protein.
- Cas9 enzymes from different bacterial species can have varying PAM specificities and some embodiments are therefore chosen based on the desired PAM recognition.
- Cas9 from Staphylococcus aureus (SaCas9) recognizes NGRRT or NGRRN.
- Cas9 from Neisseria meningitis (NmCas) recognizes NNNNGATT.
- Cas9 from Streptococcus thermophilis (StCas9) recognizes NNAGAAW.
- Cas9 from Treponema denticola (TdCas) recognizes NAAAAC. These examples are not meant to be limiting.
- non-SpCas9s bind a variety of PAM sequences, which makes them useful to expand the range of sequences that can be targeted according to the invention.
- non-SpCas9s may have other characteristics that make them more useful than SpCas9.
- Cas9 from Staphylococcus aureus (SaCas9) is about 1 kilobase smaller than SpCas9, so it can be packaged into adeno-associated virus (AAV).
- AAV adeno-associated virus
- Oh, Y. et al. describe linking reverse transcriptase to a Francisella novicida Cas9 [FnCas9(H969A)] nickase module.
- FnCas9(H969A) Francisella novicida Cas9
- nickase module By increasing the distance to the PAM, the FnCas9(H969A) nickase module expands the region of a reverse transcription template (RTT) following the primer binding site.
- Prime editor fusion protein describes a protein that is used in prime editing.
- Prime editing uses CRISPR enzyme that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; and a nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts.
- Such an enzyme can be a catalytically-impaired Cas9 endonuclease (a nickase).
- a nickase can be a Casl2a/b, MAD7, or variant thereof.
- the nickase is fused to an engineered reverse transcriptase (RT).
- the nickase is programmed (directed) with a prime-editing guide RNA (pegRNA).
- pegRNA prime-editing guide RNA
- the pegRNA both specifies the target site and encodes the desired edit.
- the nickase is a catalytically-impaired Cas9 endonuclease, a Cas9 nickase, that is fused to the reverse transcriptase.
- the Cas9 nickase part of the protein is guided to the DNA target site by the pegRNA, whereby a nick or single stranded cut occurs.
- the reverse transcriptase domain then uses the pegRNA to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand.
- the edited DNA strand replaces the original DNA strand, creating a heteroduplex containing one edited strand and one unedited strand.
- the prime editor guides resolution of the heteroduplex to favor copying the edit onto the unedited strand, completing the process (typically achieved with a nickase gRNA).
- PEI refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a wild type MMLV RT having the following N-terminus to C-terminus structure: [NLS]-[Cas9(H840A)]- [linker]-[MMLV_RT(wt)] + a desired atgRNA (or PEgRNA).
- the prime editors disclosed herein is comprised of PEI.
- PE2 refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a variant MMLV RT having the following N-terminus to C-terminus structure:
- the prime editors disclosed herein are comprised of PE2.
- the prime editors disclosed herein is comprised of PE2 and co-expression of MMR protein MLHldn, that is PE4.
- PE3 refers to PE2 plus a second-strand nicking guide RNA that complexes with the PE2 and introduces a nick in the non-edited DNA strand. The induction of the second nick increases the chances of the unedited strand, rather than the edited strand, to be repaired.
- the prime editors disclosed herein are comprised of PE3.
- the prime editors disclosed herein are comprised of PE3 and co-expression of MMR protein MLHldn, that is PE5.
- PE3b refers to PE3 but wherein the second-strand nicking guide RNA is designed for temporal control such that the second strand nick is not introduced until after the installation of the desired edit. This is achieved by designing a gRNA with a spacer sequence with mismatches to the unedited original allele that matches only the edited strand. Using this strategy, mismatches between the protospacer and the unedited allele should disfavor nicking by the sgRNA until after the editing event on the PAM strand takes place.
- a prime editing complex consists of a type II CRISPR PE protein containing an RNA-guided DNA-nicking domain fused to a reverse transcriptase (RT) domain and complexed with a pegRNA.
- the pegRNA comprises (5’ to 3’) a spacer that is complementary to the target sequence of a genomic DNA, a nickase (e.g. Cas9) binding site, a reverse transcriptase template including editing positions, and primer binding site (PBS).
- the PE-pegRNA complex binds the target DNA and the CRISPR protein nicks the PAM-containing strand.
- the resulting 3' end of the nicked target hybridizes to the primer-binding site (PBS) of the pegRNA, then primes reverse transcription of new DNA containing the desired edit using the RT template of the pegRNA.
- PBS primer-binding site
- the overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end. The structure leaves the PBS at the 3’ end of the pegRNA free to bind to the nicked strand complementary to the target which forms the primer for reverse transcription.
- Guide RNAs of CRISPRs differ in overall structure. For example, while the spacer of a type II gRNA is located at the 5’ end, the spacer of a type V gRNA is located towards the 3’ end, with the CRISPR protein (e.g. Casl2a) binding region located toward the 5’ end. Accordingly, the regions of a type V pegRNA are rearranged compared to a type II pegRNA.
- the overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end.
- the pegRNA comprises (5’ to 3’) a CRISPR protein-binding region, a spacer which is complementary to the target sequence of a genomic DNA, a reverse transcriptase template including editing positions, and primer binding site (PBS).
- an atgRNA comprises a reverse transcriptase template that encodes, partially or in its entirety, an integration recognition site (also referred to as an integration target recognition site) or a recombinase recognition site (also referred to as a recombinase target recognition site).
- the integration target recognition site which is to be placed at a desired location in the genome or intracellular nucleic acid, is referred to as a “beacon” site or an “attachment site” or a “landing pad” or “landing site.”
- An integration target recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA).
- the term “attachment site-containing guide RNA” refers to an extended single guide RNA (sgRNA) comprising a primer binding site (PBS), a reverse transcriptase (RT) template sequence, and wherein the RT template encodes for an integration recognition site or a recombinase recognition site that can be recognized by a recombinase, integrase, or transposase.
- the RT template comprises a clamp sequence and an integration recognition site.
- an atgRNA may be referred to as a guide RNA.
- An integration recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA).
- first integration recognition site e.g., any of the integration recognition sites described herein
- second recognition site e.g., any of the integration recognition sites described herein
- the term “cognate pair” refers to a first integration recognition site (e.g., any of the integration recognition sites described herein) and a functionally symmetric second integration recognition site (e.g., any of the integration recognition sites described herein) that can be recombined.
- Each of the first and second integration recognition sites is a “cognate”, or “integration cognate”, or “integration cognate site”, of the other member of the cognate pair.
- a non-limiting example of a cognate pair are an attB site and an attP site, which are capable of recombination by the large serine integrase BxBl.
- an atgRNA comprises a reverse transcriptase template that encodes, partially or in its entirety, an integration recognition site (also referred to as an integration target recognition site) or a recombinase recognition site (also referred to as a recombinase target recognition site).
- the integration target recognition site which is to be placed at a desired location in the genome or intracellular nucleic acid, is referred to as a “beacon,” a “beacon” site or an “attachment site” or a “landing pad” or “landing site.”
- An integration target recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA).
- the primer binding site allows the 3’ end of the nicked DNA strand to hybridize to the atgRNA, while the RT template serves as a template for the synthesis of edited genetic information.
- the atgRNA is capable for instance, without limitation, of (i) identifying the target nucleotide sequence to be edited and (ii) encoding new genetic information that replaces (or in some cases adds) the targeted sequence.
- the atgRNA is capable of (i) identifying the target nucleotide sequence to be edited and (ii) encoding an integration site that replaces (or inserts/deletes within) the targeted sequences.
- the co-delivery system described herein includes a polynucleotide sequence encoding an attachment site-containing guide RNA (atgRNA) packaged in an LNP.
- the co-delivery system described herein includes a vector comprising a polynucleotide sequence encoding an atgRNA.
- the atgRNA comprises a domain that is capable of guiding the prime editor fusion protein to a target sequence, thereby identifying the target nucleotide sequence to be edited; and a reverse transcriptase (RT) template that comprises a first integration recognition site.
- the atgRNA comprises a domain that is capable of guiding the prime editor fusion protein (or prime editor system) to a target sequence, thereby identifying the target nucleotide sequence to be edited; and a reverse transcriptase (RT) template that comprises at least a portion first integration recognition site.
- a domain that is capable of guiding the prime editor fusion protein (or prime editor system) to a target sequence, thereby identifying the target nucleotide sequence to be edited
- RT reverse transcriptase
- the co-delivery system described herein includes a polynucleotide sequence encoding a first attachment site-containing guide RNA (atgRNA) and a polynucleotide nucleotide sequence encoding a second attachment site-containing guide RNA (atgRNA) packaged into the same LNP.
- the co-delivery system described herein includes a polynucleotide sequence encoding a first attachment sitecontaining guide RNA (atgRNA) packaged into a first LNP and a polynucleotide nucleotide sequence encoding a second attachment site-containing guide RNA (atgRNA) packaged into a second LNP.
- the co-delivery system described herein includes a vector comprising a polynucleotide sequence encoding a first attachment site-containing guide RNA (atgRNA), a polynucleotide sequence encoding a second atgRNA, or both.
- atgRNA attachment site-containing guide RNA
- the co-delivery system described herein includes a polynucleotide sequence encoding a first attachment site-containing guide RNA (atgRNA) packaged into a first LNP and a vector comprising a polynucleotide sequence encoding a second atgRNA.
- atgRNA attachment site-containing guide RNA
- the co-delivery system contains a first atgRNA and a second atgRNA
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, where the at least first pair of atgRNAs have domains that are capable of guiding the gene editor protein or prime editor fusion protein to a target sequence
- the first atgRNA further includes a first RT template that comprises at least a portion of the first integration recognition site
- the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site
- the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA’ s reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprising a second reverse transcriptase template.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 5 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second singlestranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site.
- Use of two guide RNAs that are (or encode DNA that is) partially complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site-containing guide RNAs (atgRNAs).
- RNAs that are (or encode DNA that is) full complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site-containing guide RNAs (atgRNAs).
- atgRNAs dual, paired, annealing, complementary, or twin attachment site-containing guide RNAs
- the first atgRNA upon introducing the nucleic acid construct into a cell, incorporates the first integration recognition site into the cell’s genome at the target sequence.
- Table 9 includes atgRNAs, sgRNAs and nicking guides that can be used herein. Spacers are labeled in capital font (SPACER), RT regions in bold capital (RT REGION), AttB sites in bold lower case (attB site), and PBS in capital italics (PBS). Unless otherwise denoted, the AttB is for Bxbl. 5.8. Integrases/Recombinases and Integration/Recombination Sites
- the co-delivery system described herein contains an integrase and/or a recombinase.
- the co-delivery system includes an integrase and/or a recombinase packaged in a LNP.
- the co-delivery system includes a polynucleotide encoding an integrase and/or a recombinase.
- the co-delivery system includes an integrase or a recombinase packaged in a vector (e.g., a viral vector).
- the co-delivery system includes at least a first integrase (e.g., a first integrase and a second integrase) and/or at least a first recombinase (e.g., a first recombinase and a second recombinase).
- a first integrase e.g., a first integrase and a second integrase
- a first recombinase e.g., a first recombinase and a second recombinase
- the integration enzyme e.g., the integrase or recombinase
- the integration enzyme is selected from the group consisting of Dre, Vika, Bxbl, ⁇ pC31, RDF, cpBTl, Rl, R2, R3, R4, R5, TP901-1, Al 18, cpFCl, cpCl, MR11, TGI, cp370.1, W , BL3, SPBc, K38, Peaches, Veracruz, Rebeuca, Theia, Benedict, KSSJEB, PattyP, Doom, Scowl, Lockley, Switzer, Bob3, Troube, Abrogate, Anglerfish, Sarfire, SkiPole, Conceptll, Museum, Severus, Airmid, Benedict, Hinder, ICleared, Sheen, Mundrea, BxZ2, cpRV, retrotransposases encoded by a Tcl/mariner family member including but not limited to retrotransposases encoded by
- Xu et al describes methods for evaluating integrase activity in E. coli and mammalian cells and confirmed at least R4, cpC31, (pBTl, Bxbl, SPBc, TP901-1 and WP integrases to be active on substrates integrated into the genome of HT1080 cells (Xu et al., 2013, Accuracy and efficiency define Bxbl integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome. BMC Biotechnol. 2013 Oct 20; 13:87.
- LSRs serine recombinases
- embodiments can include any serine recombinase such as BceINT, SSCINT, SACINT, and INT10 (see lonnidi et al, 2021; Drag-and-drop genome insertion without DNA cleavage with CRISPR directed integrases.
- the integration site can be selected from an attB site, an attP site, an attL site, an attR site, a lox71 site a Vox site, or a FRT site.
- the Cre-lox system is referred to either as a control for programmable gene insertion or as a tool for a recombinase- mediated event separate and distinct from insertion of the donor polynucleotide template (or exogenous nucleic acid) into the integrated recognition site.
- integrases, transposases and the like can depend on nuclear localization.
- prokaryotic enzymes are adapted to modulate nuclear localization.
- eukaryotic or vertebrate enzymes are adapted to modulate nuclear localization.
- the invention provides fusion or hybrid proteins. Such modulation can comprise addition or removal of one or more nuclear localization signal (NLS) and/or addition or removal of one or more nuclear export signal (NES).
- NLS nuclear localization signal
- NES nuclear export signal
- nuclear export signal (NES) of transposases affects the transposition activity of mariner-like elements Ppmarl and Ppmar2 ofmoso bamboo. Mob DNA. 2019 Aug 19;10:35. doi: 10.1186/sl3100-019-0179-y).
- the methods and constructs are used to modulate nuclear localization of system components of the invention.
- the integrase used herein is selected from below (Table 10).
- FIGs. 14A-14E shows analysis of effect of variant AttP sites on integration efficiency.
- This disclosure features methods of delivering (e.g., co-delivery or dual delivery) a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, where the methods includes delivering to a (i) gene editor construct and a (ii) template polynucleotide, and (iii) at least a first attachment site-containing guide (atgRNA).
- a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, where the methods includes delivering to a (i) gene editor construct and a (ii) template polynucleotide, and (iii) at least a first attachment site-containing guide (atgRNA).
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes: delivering a lipid nanoparticle (LNP) comprising a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and a vector comprising a template polynucleotide and at least a first attachment site-containing guide RNA (atgRNA).
- the first atgRNA comprises (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of a first integration recognition site.
- the RT template comprises the entirety of the first integration recognition site. In some embodiments, where the vector comprises a polynucleotide encoding a first atgRNA, the vector also includes a sequence encoding a nicking guide RNA (ngRNA).
- ngRNA nicking guide RNA
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes: delivering a lipid nanoparticle (LNP) comprising a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and a vector comprising a template polynucleotide and a first attachment site-containing guide RNA (atgRNA) and a second attachment sitecontaining guide RNA (atgRNA).
- LNP lipid nanoparticle
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the at least first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap (e.g., 6bp of complementarity).
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes: delivering into a cell a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct), and (ii) a first attachment sitecontaining guide RNA (atgRNA); and a vector comprising: (i) a template polynucleotide, and (ii) a second atgRNA.
- LNP lipid nanoparticle
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the a first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap (e.g., 6bp of complementarity).
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes delivering: a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct), (ii) a first attachment site-containing guide RNA (atgRNA), and (iii) a second atgRNA; and a vector comprising (i) a template polynucleotide.
- LNP lipid nanoparticle
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the at least first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the at least first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap (e.g., 6bp of complementarity).
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes delivering: a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and (ii) a first attachment site-containing guide RNA (atgRNA); and a vector comprising: (i) a template polynucleotide, and (ii) a nicking atgRNA.
- LNP lipid nanoparticle
- a gene editor polynucleotide e.g., a gene editor polynucleotide construct
- atgRNA first attachment site-containing guide RNA
- a vector comprising: (i) a template polynucleotide, and (ii) a nicking atgRNA.
- the first atgRNA comprises (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of a first integration recognition site.
- the RT template comprises the entirety of the first integration recognition site.
- the LNP and the first vector are delivered at least 1 day, at least 2 days, at least 3 days, at least 4 days, at least 5 days, at least 6 days, at least 7 days, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, or at least 8 weeks apart.
- the LNP and the second vector are delivered a different times on the same day, at least 1 day, at least 2 days, at least 3 days, at least 4 days, at least 5 days, at least 6 days, at least 7 days, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, or 8 weeks apart.
- the LNP and the first vector are delivered about 6 weeks apart.
- This disclosure also features a method for delivering a system capable of site- specifically integrating a template polynucleotide into the genome of a cell, where the method includes delivering the system in vivo.
- the system is delivered to a fetus or a neonate to site-specifically integrate in vivo a template polynucleotide into the genome of a cell.
- Delivering the system to a fetus or a neonate provides advantages over delivering the system later in life (e.g., after the neonate phase ends), including: (i) fewer number of cells that need to be treated (e.g., in the adult, there are trillions of cells, but in a fetus, there are significantly fewer cells); (ii) developmental benefits: the early stage of development of a fetus or a neonate means that if a genetic disease is treated successfully, the individual could potentially develop normally, with significant reduction or even complete removal of any of the disease manifestations; (iii) preventing disease progression: in certain genetic conditions the physiological damage is irreversible damage and in some instances is exacerbated as the disease progresses, therefore, intervening at the fetal (or neonate) stage, it is possible to prevent or reduce the progression of the disease and potentially prevent any irreversible damage from occurring; (iv) higher cell turnover and cell division rate: in a developing fetus, cells are dividing rapidly as the
- the method includes delivering an LNP and a first vector, the LNP and the first vector are delivered to a cell in vivo.
- the in vivo cells are present in a fetus or a neonate.
- the LNP is delivered between age 0 (day of birth) and age 7 days and the vector is delivered between age 5 weeks and age 7 weeks.
- the LNP is delivered at about at 2 days and the vector is delivered at about age 6 weeks.
- the LNP and the first vector are delivered to a cell in vivo
- the LNP can be delivered to a fetus at a first time point and the vector is delivered to the fetus after the fetus is born (referred to after birth as a neonate).
- the LNP is delivered to a fetus and the vector is delivered to the fetus after birth (i.e., at the neonate stage) at any point between birth and up to age 8 weeks.
- the LNP is delivered to the fetus and the vector is delivered at about age 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 7 weeks or 8 weeks.
- the LNP and the first vector are delivered to a cell in vivo
- the LNP can be delivered to a fetus at a first time point and the vector is delivered to the fetus (child) after the fetus (child) is bom, for example, when the child is age 90 days or older (e.g., age 6 months, age 9 months, age 1 year, age 2 years, age 3 years, age 4 years, age 5 years, age 6 years, or older).
- the term “fetus” refers to an unborn offspring.
- the term “neonate” refers to a newborn infant, which includes the first 90 days of life.
- This disclosure also features a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, the system comprising: a lipid nanoparticle (LNP) comprising a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and a vector comprising a template polynucleotide and at least a first attachment sitecontaining guide RNA (atgRNA).
- the first atgRNA comprises (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of a first integration recognition site.
- the RT template comprises the entirety of the first integration recognition site. In some embodiments, where the vector comprises a polynucleotide encoding a first atgRNA, the vector also includes a sequence encoding a nicking guide RNA (ngRNA).
- ngRNA nicking guide RNA
- This disclosure also features a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, the system comprising: a lipid nanoparticle (LNP) comprising a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and a vector comprising a template polynucleotide and a first attachment site-containing guide RNA (atgRNA) and a second attachment site-containing guide RNA (atgRNA).
- LNP lipid nanoparticle
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the a first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- This disclosure also features a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, the system comprising: a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct), and (ii) a first attachment site-containing guide RNA (atgRNA); and a vector comprising: (i) a template polynucleotide, and (ii) a second atgRNA.
- LNP lipid nanoparticle
- a gene editor polynucleotide e.g., a gene editor polynucleotide construct
- atgRNA first attachment site-containing guide RNA
- a vector comprising: (i) a template polynucleotide, and (ii) a second atgRNA.
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the a first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- This disclosure also features a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, the system comprising: co-delivering: a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct), (ii) a first attachment site-containing guide RNA (atgRNA), and (iii) a second atgRNA; and a vector comprising (i) a template polynucleotide.
- LNP lipid nanoparticle
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the a first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- This disclosure also features a system capable of site-specifically integrating a template polynucleotide into the genome of a cell, the system comprising: a lipid nanoparticle (LNP) comprising: (i) a gene editor polynucleotide (e.g., a gene editor polynucleotide construct) and (ii) a first attachment site-containing guide RNA (atgRNA); and a vector comprising: (i) a template polynucleotide, and (ii) a nicking atgRNA.
- LNP lipid nanoparticle
- a gene editor polynucleotide e.g., a gene editor polynucleotide construct
- atgRNA first attachment site-containing guide RNA
- a vector comprising: (i) a template polynucleotide, and (ii) a nicking atgRNA.
- the first atgRNA comprises (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of a first integration recognition site.
- the RT template comprises the entirety of the first integration recognition site.
- the LNP comprising a gene editor polynucleotide construct is capable delivering to a cell cytoplasm the gene editor polynucleotide construct. In some embodiments, the LNP comprising a gene editor polynucleotide construct is capable delivering to a cell nucleus the gene editor polynucleotide construct. In some embodiments, the LNP comprises a gene editor protein and associated guide nucleic acids. In some embodiments, the LNP comprises a gene editor protein and associated guide nucleic acids that are capable of localizing to cell nucleus.
- a gene editor polynucleotide construct is delivered to a cell by a fusosome.
- a gene editor polynucleotide construct is delivered to a cell cytoplasm by a fusosome.
- the fusosome comprises a gene editor protein and associated guide nucleic acids.
- a gene editor polynucleotide construct is delivered to a cell by an exosome.
- a gene editor polynucleotide construct is delivered to a cell cytoplasm by an exosome.
- the exosome comprises a gene editor protein and associated guide nucleic acids.
- the prime editor or Gene Writer protein fusion is incorporated (i.e., packaged) into LNP as protein.
- associated atgRNA and optional ngRNAs may be co-packaged with gene editor proteins in LNP.
- the gene editor polynucleotide construct comprises (a) a polynucleotide sequence encoding a prime editor fusion protein or a Gene WriterTM protein,
- ngRNA nickase guide RNA
- e a polynucleotide sequence encoding an integrase
- the prime editor or Gene Writer protein fusion is expressed as a split construct.
- the split construct in reconstituted in a cell.
- the split construct can be fused or ligated via intein protein splicing.
- the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions.
- the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization.
- the split construct can be adapted into one or more nucleic acid constructs described herein.
- the systems described include a gene editor polynucleotide that is delivered to a cell using the methods described herein.
- the gene editor polynucleotide is delivered as a polynucleotide (e.g., an mRNA).
- the gene editor polynucleotide is delivered as a protein.
- the gene editor polynucleotide or protein is packaged, and thereby vectorized, within a lipid nanoparticle (LNP).
- LNP lipid nanoparticle
- the gene editor polynucleotide or protein is packaged in a LNP and is co-delivered with a template polynucleotide (i.e., nucleic acid “cargo” or nucleic acid “payload”) packaged into a separate vector (e.g., a viral vector (e.g., an AAV or adenovirus)) or a second lipid nanoparticle (LNP).
- a template polynucleotide i.e., nucleic acid “cargo” or nucleic acid “payload” packaged into a separate vector (e.g., a viral vector (e.g., an AAV or adenovirus)) or a second lipid nanoparticle (LNP).
- a separate vector e.g., a viral vector (e.g., an AAV or adenovirus)
- LNP second lipid nanoparticle
- the gene editor polynucleotide is delivered to the cells as a polynucleotide.
- the gene editor polynucleotide is delivered to the cells as an mRNA encoding the gene editor polynucleotide (e.g., the gene editor protein or the prime editor system).
- the mRNA comprises one or more modified uridines.
- the mRNA comprises a sequence where each of the uridines is a modified uridine.
- the mRNA is uridine depleted.
- the mRNA encoding the nickase comprises one or more modified uridines.
- the mRNA encoding the reverse transcriptase comprises one or more modified uridines. In some embodiments, the mRNA encoding the nickase comprises one or more modified uridines, and the mRNA encoding the reverse transcriptase comprises one or more modified uridines. In some embodiments, where the integrase is encoded in an mRNA, the mRNA comprises modified uridines. In some embodiments, a modified uridine is a Nl- Methylpseudouridine-5’ -Triphosphate. In some embodiments, a modified uridine is a pseudouridine. In some embodiments, the mRNA comprises a 5’ cap. In some embodiments, the 5’ cap comprises a molecular formula of C32H43N15O24P4 (free acid).
- the gene editor polynucleotide (e.g., a gene editor polynucleotide construct) comprises a polynucleotide sequence encoding a primer editor system (e.g., any of the prime editor systems described herein).
- the prime editor system comprises a nucleotide sequence encoding a nickase (e.g., any of the Cas proteins or variants thereof (e.g., nickases) and nickases described herein, see Tables 4-8) and a nucleotide sequence encoding a reverse transcriptase (e.g., any of the reverse transcriptases described herein).
- the nucleotide sequence encoding the nickase and the nucleotide sequence encoding the reverse transcriptase are positioned in the construct such that when expressed the nickase is linked to the reverse transcriptase.
- the nickase is linked to the reverse transcriptase by in-frame fusion.
- the nickase is linked to the reverse transcriptase by a linker.
- the linker is a peptide fused in-frame between the nickase and reverse transcriptase.
- the gene editor polynucleotide (e.g., a gene editor polynucleotide construct) further comprises a polynucleotide sequence encoding at least a first integrase (e.g., any of the integrases described herein, e.g., as described in Table 10 and also in Yarnall et al., Nat. Biotechnol., 2022, doi.org/10.1038/s41587-022-01527-4 and Durrant et al., Nat. Biotechnol., 2022, doi.org/10.1038/s41587-022-01494-w, each of which are herein incorporated by reference in their entireties).
- a first integrase e.g., any of the integrases described herein, e.g., as described in Table 10 and also in Yarnall et al., Nat. Biotechnol., 2022, doi.org/10.1038/s4
- the linked nickase-reverse transcriptase are further linked to the first integrase.
- the gene editor polynucleotide construct further comprises a polynucleotide sequence encoding at least a first recombinase (e.g., any of the recombinases described herein).
- the systems and methods described herein include a vector that is capable of co-delivering a template polynucleotide, one or more attachment site-containing gRNA, one or more integrases, one or more recombinases, a gene editor polynucleotide, one or more integration recognition sites, one or more recombinase recognition sites, or a combination thereof.
- Non-limiting examples of vectors that can be used in the methods or systems described herein include the vectors described in FIGs. 3-6.
- the vector includes a polynucleotide sequence encoding an attachment site-containing guide RNA (atgRNA).
- the polynucleotide sequence encoding the attachment site-containing guide RNA (atgRNA) is operably linked to a regulatory element (e.g., a U6 promoter) that is capable of driving expression of the atgRNA.
- the atgRNA comprises (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of a first integration recognition site.
- the RT template comprises the entirety of the first integration recognition site.
- the vector or the LNP includes a polynucleotide sequence encoding a nicking gRNA.
- the vector includes a polynucleotide sequence encoding a first attachment site-containing guide RNA (atgRNA) and a polynucleotide sequence encoding a second attachment site-containing guide RNA (atgRNA).
- atgRNA first attachment site-containing guide RNA
- atgRNA second attachment site-containing guide RNA
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the a first integration recognition site; the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- the vector includes a template polynucleotide and a sequence that is an integration cognate of an integration recognition site site-specifically incorporated into the genome of a cell.
- the vector includes a template polynucleotide and a second integration recognition site that is a cognate pair with the first integration recognition site site-specifically incorporated into the genome of the cell.
- the sequence that is an integration cognate e.g., a second integration recognition site
- the vector comprising a template polynucleotide is a recombinant adenovirus, a helper dependent adenovirus, an AAV, a lentivirus, an HSV, an annelovirus, a retrovirus, a DoggyboneTM DNA (dbDNA), a minicircle, a plasmid, a miniDNA, an exosome, a fusosome, or an nanoplasmid.
- the vector is capable of localizing to the nucleus.
- the template polynucleotide is delivered to the cytoplasm and localizes to the nucleus. In certain embodiments, the template polynucleotide is delivered to the cytoplasm by LNP. In certain embodiments, the donor template polynucleotide construct comprises a recognition sequence that is recognized by a DNA binding protein (DNA binding domain) or a transcription factor binding domain. In certain embodiments, the donor template polynucleotide construct is delivered to the nucleus by an integrase or recombinase.
- the template polynucleotide is delivered to the mitochondria.
- the donor template polynucleotide construct comprises a mitochondria targeting sequence.
- the vector comprising a template polynucleotide is AAV.
- the AAV contains a 5’ inverted terminal repeat (ITR).
- the AAV contains a 3’ inverted terminal repeat (ITR).
- the AAV contains a 5’ and a 3’ ITR.
- the 5’ and 3’ ITR are not derived from the same serotype of virus.
- the ITRs are derived from adenovirus, AAV2, and/or AAV5.
- the vector comprising a template polynucleotide is single stranded AAV (ssAAV).
- the vector comprising a donor template polynucleotide construct is self-complementary AAV (scAAV).
- a vector comprises an attachment site-containing guideRNA (atgRNA), a nicking-guideRNA (ngRNA), and template polynucleotide.
- the vector comprising an attachment site-containing guideRNA (atgRNA), a nicking-guideRNA (ngRNA), and template polynucleotide is recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, DoggyboneTM DNA (dbDNA), minicircle, plasmid, miniDNA, exosome, fusosome, or nanoplasmid.
- the vector is capable of localizing to the nucleus.
- the attachment site-containing guideRNA (atgRNA) sequence and the nicking-guideRNA (ngRNA) sequence contain a terminal poly dT.
- a vector comprises an attachment site-containing guideRNA (atgRNA), and donor template.
- the vector comprising an attachment site-containing guideRNA (atgRNA) and donor template is recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, DoggyboneTM DNA (dbDNA), minicircle, plasmid, miniDNA, exosome, fusosome, or nanoplasmid.
- the vector is capable of localizing to the nucleus.
- the attachment site-containing guideRNA (atgRNA) sequence contain a terminal poly dT.
- the template polynucleotide is capable of being integrated into a genomic locus that contains an integrase target recognition site or a recombinase target recognition site.
- the template polynucleotide comprises at least one of the following: a gene, a gene fragment, an expression cassette, a logic gate system, or any combination thereof. In some embodiments, the template polynucleotide comprises at least one intron or exon.
- the template polynucleotide further comprises at least one integrase target recognition site or a recombinase target integrase site.
- at least one integrase target recognition site or a recombinase target integrase site is placed within the donor template vector inverted terminal repeat.
- the delivery system (e.g., co-delivery system) includes a vector having a sub-sequence that is capable of self-circularizing to form a self-circular nucleic acid.
- the vector comprises a physical portion or region of the vector that is capable of self-circularizing to form a circular construct.
- sequence refers to a portion of the vector that is capable of self-circularizing, where the subsequence is flanked by integration recognition sites or recombinase recognition sites positioned to enable self-circularization.
- self-circular nucleic acid refers to a double-stranded, circular nucleic acid construct produced as a result of recombination of a cognate pair of integrase or recombinase recognition sites present on the vector. Recombination occurs when the vector is contacted with an integrase or a recombinase under conditions that allow for recombination of the cognate pair of integrase or recombinase recognition sites.
- the sub-sequence of the vector includes a first recombinase recognition site and a second recombinase recognition site, wherein the first and second recombinase recognition sites are capable of being recombined by a recombinase.
- the sub-sequence of the vector includes a first recombinase recognition site, a second recombinase recognition site, and a second integration recognition site (e.g., the second integration recognition site is a cognate pair of the first integration recognition site), where the first and second recombinase recognition sites flank the integration recognition site.
- the first recombinase recognition site, the second recombinase recognition, and a recombinase enable the self-circularizing and formation of the circular construct.
- the sub-sequence of the vector includes a third integration recognition site and a fourth integration recognition site, wherein the third and fourth integration recognition sites are a cognate pair.
- the subsequence of the vector includes the second integration recognition site, the third integration recognition site, the fourth integration recognition site, where the third and fourth integration recognition sites flank the second integration recognition site (where the second integration recognition site is a cognate pair of the first integration recognition site).
- the third integration recognition site, the fourth integration recognition site, and an integrase enable self - circularization and formation of the circular construct.
- the third integration recognition site and/or the fourth integration recognition sites cannot recombine with the first integration recognition site and/or the second integration recognition site due, in part, to having different central dinucleotides than the first and second integration recognition sites.
- each integration recognition site or each pair of integration recognition is capable of being recognized by a different integrase.
- each integration recognition site or each pair of integration recognition comprises a different central dinucleotide.
- self-circularizing is mediated at the integration recognition sites or recombinase recognition sites. In some embodiments, the self-circularizing is mediated by an integrase or a recombinase.
- the self-circular nucleic acid comprising the second integration recognition site is capable of being integrated into the cell’s genome at the target sequence that contains the first integration recognition site.
- the self-circular nucleic acid comprises one or more additional integration recognition sites that enable integration of an additional nucleic acid cargo.
- the additional nucleic acid cargo includes a sequence that is a cognate pair with one or more of the additional integration recognition sites in the self-circular nucleic acid.
- integration of the self-circular nucleic acid into the genome of a cell results in integration of the one or more additional integration recognition sites into the genome along with the nucleic acid cargo.
- the integrated one or more additional integration recognition sites serve as an integration recognition site (beacon) for placing the additional nucleic acid cargo.
- the self-circularized nucleic acid comprises a DNA cargo
- the DNA cargo is a gene or gene fragment.
- the DNA cargo is an expression cassette.
- the DNA cargo is a logic gate or logic gate system.
- the logic gate or logic gate system may be DNA based, RNA based, protein based, or a mix of DNA, RNA, and protein.
- the nucleic acid cargo is a genetic, protein, or peptide tag and/or barcode.
- the system or methods described herein include a second vector.
- the gene editor polynucleotide encodes a prime editor system comprising a nickase (e.g., any of the Cas proteins or variants thereof (e.g., nickases) and nickases described herein, see Tables 4-8) and a reverse transcriptase (e.g., any of the reverse transcriptase described herein)
- the second vector comprises a polynucleotide sequence encoding an integrase (e.g., any of the integrases described herein, e.g., as described in Table 10 and also in Yarnall et al., Nat.
- the second vector comprises a polynucleotide sequence encoding at least a first recombinase.
- the gene editor polynucleotide encodes a prime editor system comprising a nickase, a reverse transcriptase, and an integrase the second vector comprises a polynucleotide sequence encoding at least a first recombinase.
- the second vector comprises a polynucleotide sequence encoding at least a second integrase.
- the second vector includes a template polynucleotide and a sequence that is an integration cognate of an integration recognition site site-specifically incorporated into the genome of a cell.
- the second vector includes a template polynucleotide and a second integration recognition site that is a cognate pair with the first integration recognition site site-specifically incorporated into the genome of the cell.
- the sequence that is an integration cognate e.g., a second integration recognition site
- the second vector is a vector selected from: adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, DoggyboneTM DNA (dbDNA), minicircle, plasmid, miniDNA, exosome, fusosome, or nanoplasmid.
- the polynucleotide sequence encoding the prime editor system is encoded on at least two different vectors.
- a first vector comprises a polynucleotide sequence encoding a nickase and a second vector comprises a polynucleotide sequence encoding a reverse transcriptase. In such cases, the first vector and second are delivered concurrently.
- the polynucleotide sequence(s) encoding the prime editor system is encoded on at least two (non-contiguous) polynucleotide sequences.
- a first polynucleotide sequence encodes a nickase and a second polynucleotide sequence encodes a reverse transcriptase.
- the first vector and second are delivered concurrently (e.g., in a first LNP).
- the method includes co-delivering to a cell a first gene editor polynucleotide construct and a first attachment site-containing guide RNA (atgRNA) are packaged, and thereby vectorized, within the first LNP, and a second gene editor polynucleotide construct and a second attachment site containing guide RNR (atgRNA) are packaged, and thereby vectorized, within the second LNP, where the first atgRNA and the second atgRNA are an at least first pair of atgRNA.
- the at least first pair of atgRNAs comprise domains that are capable of guiding the prime editor system to a target sequence.
- the first atgRNA further includes a first RT template that comprises at least a portion of a first integration recognition site.
- the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site.
- the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- the method includes delivering a first LNP (e.g., a first LNP comprising a first gene editor polynucleotide construct and a first atgRNA) and a second LNP (e.g., a second LNP comprising a second gene editor polynucleotide construct and a second atgRNA)
- the first LNP and the second LNP are mixed prior to delivering to a cell.
- the first LNP and the second LNP are mixed at a ratio of first LNP to second LNP of 1:10, 1:9, 1:8, 1:7, 1:6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1,2:1,3:1,4:1,5:1, 6:1, 7:1, 8:1, 9:1, or 10:1.
- the first LNP and the second LNP are mixed at a ratio of 1 : 1.
- a first LNP comprising a first gene editor polynucleotide construct and a first attachment site-containing guide RNA comprises a ratio of ratio of gene editor polynucleotide construct (e.g., mRNA) to atgRNAl of 1 : 10, 1:9, 1:8, 1:7, 1:6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1, 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1, or 10:1.
- the first LNP comprises a ratio of mRNA to atgRNAl of 2: 1.
- a second LNP comprising a second gene editor polynucleotide construct and a second attachment site-containing guide RNA (atgRNA2) comprises a ratio of gene editor polynucleotide construct (e.g., mRNA) to atgRNA2 of 1 : 10, 1:9, 1:8, 1 :7, 1 :6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1, 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1, or 10:1.
- the second LNP comprises a ratio of mRNA to atgRNA2 of 2: 1.
- the method includes delivering a first LNP (e.g., a first LNP comprising a first gene editor polynucleotide construct and a first atgRNA) and a second LNP (e.g., a second LNP comprising a second gene editor polynucleotide construct and a second atgRNA)
- the first LNP and the second LNP are mixed such that the ratio of gene editor polynucleotide construct (e.g., mRNA) to first atgRNA (atgRNAl) to second atgRNA (atgRNA2) is 1:0.25:0.25, l:0.5:0.5, 1:0.75:0.75, or 1:1:1.
- the method of co-delivering to a cell a mixture of LNPs includes co-delivering three or more LNPs, four or more LNPs, five or more LNPs, six or more LNPs, seven or more LNPs, eight or more LNPs, nine or more LNPs, or ten or more LNPs.
- a system capable of site-specifically integrating at least a first integration recognition site into the genome of a cell, the system comprising: a first gene editor polynucleotide construct and a first attachment site-containing guide RNA (atgRNA) are packaged, and thereby vectorized, within the first LNP, and a second gene editor polynucleotide construct and a second attachment site containing guide RNR (atgRNA) are packaged, and thereby vectorized, within the second LNP, where the first atgRNA and the second atgRNA are an at least first pair of atgRNA.
- the at least first pair of atgRNAs comprise domains that are capable of guiding the prime editor system to a target sequence.
- the first atgRNA further includes a first RT template that comprises at least a portion of a first integration recognition site.
- the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site.
- the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA and second atgRNA include at least a 6bp overlap.
- the system comprises a first LNP (e.g., any of the first LNPs described herein) and a second LNP (e.g., any of the second LNPs described herein) at a ratio of first LNP to second LNP of 1:10, 1:9, 1:8, 1:7, 1:6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1, 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1, or 10:1.
- the system comprise the first LNP and the second LNP at a ratio of 1 : 1.
- the system comprises a first LNP having a ratio of a first gene editor polynucleotide construct to a first attachment site-containing guide RNA (atgRNAl) of 1:10, 1:9, 1:8, 1:7, 1:6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1, 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9: 1, or 10: 1.
- the system includes a first LNP having a ratio of mRNA (i.e., mRNA encoding the gene editor protein) to atgRNAl of 2: 1.
- the system comprise a second LNP having a ratio of a second gene editor polynucleotide construct to a second attachment site-containing guide RNA (atgRNA2) of 1:10, 1:9, 1:8, 1:7, 1:6, 1:5, 1:4, 1:3, 1:2, 1:1, 1:0.75, 0.75:1,2:1,3:1,4:1,5:1, 6:1, 7:1, 8:1, 9:1, or 10:1.
- the system includes a second LNP having a ratio of mRNA (i.e., mRNA encoding the gene editor protein) to atgRNA2 of 2: 1.
- the system comprises a ratio of gene editor polynucleotide construct (e.g., mRNA encoding the gene editor protein) to first atgRNA (atgRNAl) to second atgRNA (atgRNA2) of 1:0.25:0.25, l:0.5:0.5, 1:0.75:0.75, or 1:1:1.
- gene editor polynucleotide construct e.g., mRNA encoding the gene editor protein
- atgRNAl first atgRNA
- atgRNA2 second atgRNA
- the system comprises a mixture of LNPs comprising three or more LNPs, four or more LNPs, five or more LNPs, six or more LNPs, seven or more LNPs, eight or more LNPs, nine or more LNPs, or ten or more LNPs.
- a vector comprising a template polynucleotide and a sequence that is an integration cognate (i.e., cognate to an integration recognition site site- specifically incorporated into the genome of a cell) can be delivered to the cell concurrently with the split LNPs or after delivery of the split LNPs.
- a vector that includes a template polynucleotide and a second integration recognition site that is a cognate pair with the first integration recognition site is delivered to the cell.
- the sequence that is an integration cognate e.g., a second integration recognition site
- a second integration recognition site enables integration of the template polynucleotide or portion thereof when contacted with an integrase and the site-specifically incorporated first integration recognition site.
- the invention involves vectors, e.g. for delivering or introducing in a cell, but also for propagating these components (e.g. in prokaryotic cells).
- a "vector” is a tool that allows or facilitates the transfer of an entity from one environment to another. It is a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment.
- a vector is capable of replication when associated with the proper control elements.
- the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art.
- a ’’plasmid refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques.
- viral vector wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g.
- Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
- Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors).
- Other vectors e.g., non-episomal mammalian vectors are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
- vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as "expression vectors.” Vectors for and that result in expression in a eukaryotic cell can be referred to herein as “eukaryotic expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
- Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed.
- "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
- Vector delivery e.g., plasmid, viral delivery:
- the CRISPR enzyme for instance a Type V protein such as C2cl or C2c3, and/or any of the present RNAs, for instance a guide RNA
- Effector proteins and one or more guide RNAs can be packaged into one or more vectors, e.g., plasmid or viral vectors.
- the vector e.g., plasmid or viral vector is delivered to the tissue of interest by, for example, an intramuscular injection, while other times the delivery is via intravenous, transdermal, intranasal, oral, mucosal, or other delivery methods. Such delivery may be either via a single dose, or multiple doses.
- the actual dosage to be delivered herein may vary greatly depending upon a variety of factors, such as the vector choice, the target cell, organism, or tissue, the general condition of the subject to be treated, the degree of transformation/modification sought, the administration route, the administration mode, the type of transformation/modification sought, etc.
- Such a dosage may further contain, for example, a carrier (water, saline, ethanol, glycerol, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, etc.), a diluent, a pharmaceutically-acceptable carrier (e.g., phosphate-buffered saline), a pharmaceutically-acceptable excipient, and/or other compounds known in the art.
- a carrier water, saline, ethanol, glycerol, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, etc.
- a pharmaceutically-acceptable carrier e.g., phosphate-buffered saline
- a pharmaceutically-acceptable excipient e.g., phosphate-buffered saline
- the dosage may further contain one or more pharmaceutically acceptable salts such as, for example, a mineral acid salt such as a hydrochloride, a hydrobromide, a phosphate, a sulfate, etc.; and the salts of organic acids such as acetates, propionates, mal onates, benzoates, etc.
- auxiliary substances such as wetting or emulsifying agents, pH buffering substances, gels or gelling materials, flavorings, colorants, microspheres, polymers, suspension agents, etc. may also be present herein.
- Suitable exemplary ingredients include microcrystalline cellulose, carboxymethylcellulose sodium, polysorbate 80, phenylethyl alcohol, chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, parachlorophenol, gelatin, albumin and a combination thereof.
- the delivery is via an adenovirus, which may be at a single booster dose containing at least 1 x 10 5 particles (also referred to as particle units, pu) of adenoviral vector.
- the dose preferably is at least about 1 x 10 6 particles (for example, about 1 x 10 6 - 1 x 10 11 particles), more preferably at least about 1 x 10 7 particles, more preferably at least about 1 x 10 8 particles (e.g., about 1 x 10 8 -l x 10 11 particles or about 1 x 10 9 -l x 10 12 particles), and most preferably at least about 1 x IO 10 particles (e.g., about 1 x 10 9 -l x IO 10 particles or about 1 x 10 9 -l x 10 12 particles), or even at least about 1 x IO 10 particles (e.g., about 1 x 10 10 -l x 10 12 particles) of the adenoviral vector.
- the dose comprises no more than about 1 x 10 14 particles, preferably no more than about 1 x 10 13 particles, even more preferably no more than about 1 x 10 12 particles, even more preferably no more than about 1 x 10 11 particles, and most preferably no more than about 1 x IO 10 particles (e.g., no more than about 1 x 10 9 particles).
- the dose may contain a single dose of adenoviral vector with, for example, about 1 x 10 6 particle units (pu), about 2 x 10 6 pu, about 4 x 10 6 pu, about 1 x 10 7 pu, about 2 x 10 7 pu, about 4 x 10 7 pu, about 1 x 10 8 pu, about 2 x 10 8 pu, about 4 x 10 8 pu, about 1 x 10 9 pu, about 2 x 10 9 pu, about 4 x 10 9 pu, about 1 x IO 10 pu, about 2 x IO 10 pu, about 4 x IO 10 pu, about 1 x 10 11 pu, about 2 x 10 11 pu, about 4 x 10 11 pu, about 1 x 10 12 pu, about 2 x 10 12 pu, or about 4 x 10 12 pu of adenoviral vector.
- adenoviral vector with, for example, about 1 x 10 6 particle units (pu), about 2 x 10 6 pu, about 4 x 10 6 pu, about 1 x 10 7 pu, about 2 x
- the adenoviral vectors in U.S. Pat. No. 8,454,972 B2 to Nabel, et. al., granted on Jun. 4, 2013; incorporated by reference herein, and the dosages at col 29, lines 36-58 thereof.
- the adenovirus is delivered via multiple doses.
- the delivery is via an AAV.
- a therapeutically effective dosage for in vivo delivery of the AAV to a human is believed to be in the range of from about 20 to about 50 ml of saline solution containing from about 1 x 10 10 to about 1 x 10 50 functional AAV/ml solution. The dosage may be adjusted to balance the therapeutic benefit against any side effects.
- the AAV dose is generally in the range of concentrations of from about 1 x 10 5 to 1 x 10 50 genomes AAV (sometimes referred to herein as “vector genomes” or “vg”), from about 1 x 10 8 to 1 x IO 20 genomes AAV, from about 1 x 10 10 to about 1 x 10 16 genomes, or about 1 x 10 11 to about 1 x 10 16 genomes AAV.
- a human dosage may be about 1 x 10 13 genomes AAV.
- concentrations may be delivered in from about 0.001 ml to about 100 ml, about 0.05 to about 50 ml, or about 10 to about 25 ml of a carrier solution.
- the promoter used to drive nucleic acid-targeting effector protein coding nucleic acid molecule expression can include: AAV ITR can serve as a promoter: this is advantageous for eliminating the need for an additional promoter element (which can take up space in the vector). The additional space freed up can be used to drive the expression of additional elements (gRNA, etc.). Also, ITR activity is relatively weaker, so can be used to reduce potential toxicity due to over expression of nucleic acid-targeting effector protein. For ubiquitous expression, can use promoters: CMV, CAG, CBh, PGK, SV40, Ferritin heavy or light chains, etc.
- the promoter used to drive guide RNA can include: Pol III promoters such as U6 or Hl Use of Pol II promoter and intronic cassettes to express guide RNA Adeno Associated Virus (AAV).
- AAV AdoAdeno Associated Virus
- Nucleic acid-targeting effector protein and one or more guide RNA can be delivered using adeno associated virus (AAV), lentivirus, adenovirus or other plasmid or viral vector types, in particular, using formulations and doses from, for example, U.S. Pat. No. 8,454,972 (formulations, doses for adenovirus), U.S. Pat. No. 8,404,658 (formulations, doses for AAV) and U.S. Pat. No. 5,846,946 (formulations, doses for DNA plasmids) and from clinical trials and publications regarding the clinical trials involving lentivirus, AAV and adenovirus.
- AAV adeno associated virus
- lentivirus lentivirus
- adenovirus or other plasmid or viral vector types in particular, using formulations and doses from, for example, U.S. Pat. No. 8,454,972 (formulations, doses for adenovirus), U.S.
- the route of administration, formulation and dose can be as in U.S. Pat. No. 8,454,972 and as in clinical trials involving AAV.
- the route of administration, formulation and dose can be as in U.S. Pat. No. 8,404,658 and as in clinical trials involving adenovirus.
- the route of administration, formulation and dose can be as in U.S. Pat. No. 5,846,946 and as in clinical studies involving plasmids.
- Doses may be based on or extrapolated to an average 70 kg individual (e.g., a male adult human), and can be adjusted for patients, subjects, mammals of different weight and species.
- Frequency of administration is within the ambit of the medical or veterinary practitioner (e.g., physician, veterinarian), depending on usual factors including the age, sex, general health, other conditions of the patient or subject and the particular condition or symptoms being addressed.
- the viral vectors can be injected into the tissue of interest.
- the expression of nucleic acid-targeting effector can be driven by a cell-type specific promoter.
- liver-specific expression might use the Albumin promoter and neuron-specific expression (e.g., for targeting CNS disorders) might use the Synapsin I promoter.
- AAV is advantageous over other viral vectors for a couple of reasons: Low toxicity (this may be due to the purification method not requiring ultra centrifugation of cell particles that can activate the immune response) and Low probability of causing insertional mutagenesis because it doesn't integrate into the host genome.
- AAV has a packaging limit of 4.5 or 4.75 Kb.
- nucleic acid-targeting effector protein such as a Type V protein such as C2cl or C2c3
- a promoter and transcription terminator have to be all fit into the same viral vector. Therefore embodiments of the invention include utilizing homologs of nucleic acid-targeting effector protein (such as a Type V protein such as C2cl or C2c3) that are shorter.
- the AAV can be AAV1, AAV2, AAV5 or any combination thereof.
- AAV8 is useful for delivery to the liver. The herein promoters and vectors are preferred individually.
- Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and psi2 cells or PA317 cells, which package retrovirus.
- Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line may also be infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference.
- Millington-Ward et al. (Molecular Therapy, vol. 19 no. 4, 642-649 April 2011) describes adeno-associated virus (AAV) vectors to deliver an RNA interference (RNAi)- based rhodopsin suppressor and a codon-modified rhodopsin replacement gene resistant to suppression due to nucleotide alterations at degenerate positions over the RNAi target site.
- RNAi RNA interference
- An injection of either 6.0 x 10 8 vp or 1.8 x 10 10 vp AAV were subretinally injected into the eyes by Millington-Ward et al.
- the AAV vectors of Millington-Ward et al. may be applied to the system of the present invention, contemplating a dose of about 2 x 10 11 to about 6 x 10 11 vp administered to a human.
- Dalkara et al. also relates to in vivo directed evolution to fashion an AAV vector that delivers wild-type versions of defective genes throughout the retina after noninjurious injection into the eyes' vitreous humor.
- Dalkara describes a 7 mer peptide display library and an AAV library constructed by DNA shuffling of cap genes from AAV1, 2, 4, 5, 6, 8, and 9.
- the rcAAV libraries and rAAV vectors expressing GFP under a CAG or Rho promoter were packaged and deoxyribonucleaseresistant genomic titers were obtained through quantitative PCR.
- the libraries were pooled, and two rounds of evolution were performed, each consisting of initial library diversification followed by three in vivo selection steps.
- P30 rho-GFP mice were intravitreally injected with 2 ml of iodixanol-purified, phosphate-buffered saline (PBS)- dialyzed library with a genomic titer of about 1. times.10. sup.12 vg/ml.
- PBS phosphate-buffered saline
- the AAV vectors of Dalkara et al. may be applied to the nucleic acid-targeting system of the present invention, contemplating a dose of about 1 x 10 15 to about 1 x 10 16 vg/ml administered to a human.
- Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression.
- Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SW), human immuno deficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66: 1635-1640 (1992); Sommnerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol.
- MiLV murine leukemia virus
- GaLV gibbon ape leukemia virus
- SW Simian Immuno deficiency virus
- HAV human immuno deficiency virus
- adenoviral based systems may be used.
- Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.
- Adeno-associated virus vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94: 1351 (1994). Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No.
- Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and yr2 cells or PA317 cells, which package retrovirus.
- Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line may also be infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference.
- a host cell is transiently or non-transiently transfected with one or more vectors described herein.
- a cell is transfected as it naturally occurs in a subject.
- a cell that is transfected is taken from a subject.
- Cells taken from a subject include, but are not limited to, hepatocytes or cells isolated from muscle, the CNS, eye or lung.
- Immunological cells are also contemplated, such as but not limited to T cells, HSCs, B-cells and NK cells.
- mRNA delivery methods and compositions that may be utilized in the present disclosure including, for example, PCT/US2014/028330, US8822663B2, NZ700688A, ES2740248T3, EP2755693A4, EP2755986A4, WO2014152940A1, EP3450553B1, BRI 12016030852A2, and EP3362461A1.
- Expression of CRISPR systems in particular is described by W02020014577.
- Each of these publications are incorporated herein by reference in their entireties. Additional disclosure hereby incorporated by reference can be found in Kowalski et al., “Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery,” Mol Therap., 2019; 27(4): 710-728.
- the cell is derived from cells taken from a subject, such as a cell line.
- a cell line A wide variety of cell lines for tissue culture are known in the art. Examples of cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD-3, NHDF, HeLa- S3, Huhl, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panel, PC-3, TF1, CTLL-2, CIR, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calul, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHL231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep
- a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences.
- one or more vectors described herein are used to produce a non-human transgenic animal or transgenic plant.
- the transgenic animal is a mammal, such as a mouse, rat, or rabbit.
- the organism or subject is a plant.
- the organism or subject or plant is algae. Methods for producing transgenic plants and animals are known in the art, and generally begin with a method of cell transfection, such as described herein.
- the invention provides for methods of modifying a target polynucleotide in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo or in vitro.
- the method comprises sampling a cell or population of cells from a human or non-human animal or plant (including micro-algae) and modifying the cell or cells. Culturing may occur at any stage ex vivo. The cell or cells may even be re-introduced into the non-human animal or plant (including micro-algae).
- pathogens are often host-specific.
- Fusariumn oxysporum f. sp. lycopersici causes tomato wilt but attacks only tomato
- Plants have existing and induced defenses to resist most pathogens. Mutations and recombination events across plant generations lead to genetic variability that gives rise to susceptibility, especially as pathogens reproduce with more frequency than plants.
- there can be non-host resistance e.g., the host and pathogen are incompatible.
- Horizontal Resistance e.g., partial resistance against all races of a pathogen, typically controlled by many genes
- Vertical Resistance e.g., complete resistance to some races of a pathogen but not to other races, typically controlled by a few genes.
- Plant and pathogens evolve together, and the genetic changes in one balance changes in other. Accordingly, using Natural Variability, breeders combine most useful genes for Yield. Quality, Uniformity, Hardiness, Resistance.
- the sources of resistance genes include native or foreign Varieties, Heirloom Varieties, Wild Plant Relatives, and Induced Mutations, e.g., treating plant material with mutagenic agents.
- plant breeders are provided with a new tool to induce mutations. Accordingly, one skilled in the art can analyze the genome of sources of resistance genes, and in Varieties having desired characteristics or traits employ the present invention to induce the rise of resistance genes, with more precision than previous mutagenic agents and hence accelerate and improve plant breeding programs.
- target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide.
- target polynucleotides include a disease associated gene or polynucleotide.
- a “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non disease control.
- a disease- associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease.
- the transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.
- the delivery system is packaged in one or more LNPs and administered intravenously.
- the co-delivery system is packaged in one or more LNPs and administered intrathecally.
- the co-delivery system is packaged in one or more LNPs and administered by intracerebral ventricular injection.
- the co-delivery system is packaged in one or more LNPs and administered by intracistemal magna administration.
- the co-delivery system is packaged in one or more LNPs and administered by intravitreal injection.
- lipidmucleic acid complexes including targeted liposomes such as immunolipid complexes
- crystal Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos.
- the LNP formulations are selected from LP01 (Cas No. 1799316-64-5), ALC-0315 (Cas No. 2036272-55-4), and cKK-E12 (Cas No. 1432494-65-9).
- the LNP formulation is LP01.
- the LNP formulation is ALC-0315.
- the LNP formulation is cKK-E12.
- LNP doses range from about 0.1 mg/kg to about 100 mg/kg (or any of the values or subranges therein). In some embodiments, LNP doses is about 0.1 mg/kg, about 0.2 mg/kg, about 0.3 mg/kg, about 0.4 mg/kg, about 0.5 mg/kg, about 0.6 mg/kg, about 0.7 mg/kg, about 0.8 mg/kg, about 0.9 mg/kg, about 1.0 mg/kg, 1.5 mg/kg, about 2 mg/kg, about 2.5 mg/kg, about 3 mg/kg, about 3.5 mg/kg, about 4 mg/kg, about 4.5 mg/kg, about 5 mg/kg, about 6 mg/kg, about7 mg/kg, about 7 mg/kg, about 8 mg/kg, about 9 mg/kg, about 10 mg/kg, about 15 mg/kg, about 20 mg/kg, about 25 mg/kg, about 30 mg/kg, about 35 mg/kg, about 40 mg/kg, about 45 mg/kg, or about 50 mg/kg or more
- LNP doses of about 0.01 to about 1 mg per kg of body weight administered intravenously are contemplated.
- Medications to reduce the risk of infusion-related reactions are contemplated, such as dexamethasone, acetaminophen, diphenhydramine or cetirizine, and ranitidine are contemplated.
- Multiple doses of about 0.3 mg per kilogram every 4 weeks for five doses are also contemplated.
- the charge of the LNP must be taken into consideration. As cationic lipids combined with negatively charged lipids to induce nonbilayer structures that facilitate intracellular delivery. Because charged LNPs are rapidly cleared from circulation following intravenous injection, ionizable cationic lipids with pKa values below 7 were developed (see, e.g., Rosin et al, Molecular Therapy, vol. 19, no. 12, pages 1286-2200, December 2011). Negatively charged polymers such as RNA may be loaded into LNPs at low pH values (e.g., pH 4) where the ionizable lipids display a positive charge. However, at physiological pH values, the LNPs exhibit a low surface charge compatible with longer circulation times.
- pH 4 e.g., pH 4
- ionizable cationic lipids Four species of ionizable cationic lipids have been focused upon, namely l,2-dilineoyl-3- dimethylammonium-propane (DLinDAP), l,2-dilinoleyloxy-3-N,N-dimethylaminopropane (DLinDMA), l,2-dilinoleyloxy-keto-N,N-dimethyl-3 -aminopropane (DLinKDMA), and 1,2- dilinoleyl-4-(2-dimethylaminoethyl)-[l,3]-dioxolane (DLinKC2-DMA).
- DLinDAP l,2-dilineoyl-3- dimethylammonium-propane
- DLinDMA l,2-dilinoleyloxy-3-N,N-dimethylaminopropane
- DLinKDMA l,2-dilinoleyl
- LNP siRNA systems containing these lipids exhibit remarkably different gene silencing properties in hepatocytes in vivo, with potencies varying according to the series DLinKC2- DMA>DLinKDMA>DLinDMA»DLinDAP employing a Factor VII gene silencing model (see, e.g., Rosin et al, Molecular Therapy, vol. 19, no. 12, pages 1286-2200, December 2011).
- a dosage of 1 pg/ml of LNP in or associated with the LNP may be contemplated, especially for a formulation containing DLinKC2-DMA.
- the LNP composition comprises one or more one or more ionizable lipids.
- ionizable lipid has its ordinary meaning in the art and may refer to a lipid comprising one or more charged moieties.
- an ionizable lipid may be positively charged or negatively charged.
- the one or more ionizable lipids are selected from the group consisting of 3-(didodecylamino)-Nl,Nl,4-tridodecyl-l-piperazineethanamine (KL10), Nl-[2- (didodecylamino)ethyl]-Nl,N4,N4-tridodecyl-l,4-piperazinediethanami- ne (KL22), 14,25- ditridecyl- 15, 18,21 ,24-tetraaza-octatriacontane (KL25), 1 ,2-dilinoleyloxy-N,N- dimethylaminopropane (DLin-DMA), 2, 2-dilinoleyl-4-dimethylaminomethyl-[l,3]-di oxolane (DLin-K-DMA), heptatriaconta-6,9,28,31-tetraen-19-yl
- the lipid nanoparticle may include one or more (e.g., 1, 2, 3, 4, 5, 6, 7, or 8) cationic and/or ionizable lipids.
- cationic and/or ionizable lipids include, but are not limited to, 3-(didodecylamino)-Nl,Nl,4-tridodecyl-l-piperazineethanamine (KL10), Nl-[2-(didodecylamino)ethyl]-Nl,N4,N4-tridodecyl-l,4-piperazinediethanami- ne (KL22), 14,25-ditridecyl-15,18,21,24-tetraaza-octatriacontane (KL25), l,2-dilinoleyloxy-N,N- dimethylaminopropane (DLin-DMA), 2, 2-dilinoleyl-4-dimethylamin
- lipids e.g., LIPOFECTIN.RTM. (including DOTMA and DOPE, available from GIBCO/BRL), and LIPOFECTAMINE.RTM. (including DOSPA and DOPE, available from GIBCO/BRL).
- LIPOFECTIN.RTM including DOTMA and DOPE, available from GIBCO/BRL
- LIPOFECTAMINE.RTM. including DOSPA and DOPE, available from GIBCO/BRL
- KL10, KL22, and KL25 are described, for example, in U.S. Pat. No. 8,691,750.
- the LNP composition comprises one or more amino lipids.
- amino lipid and “cationic lipid” are used interchangeably herein to include those lipids and salts thereof having one, two, three, or more fatty acid or fatty alkyl chains and a pH-titratable amino head group (e.g., an alkylamino or dialkylamino head group).
- a pH-titratable amino head group e.g., an alkylamino or dialkylamino head group.
- the cationic lipid is typically protonated (i.e., positively charged) at a pH below the pKa of the cationic lipid and is substantially neutral at a pH above the pKa.
- the cationic lipids can also be termed titratable cationic lipids.
- the one or more cationic lipids include: a protonatable tertiary amine (e.g., pH- titratable) head group; alkyl chains, wherein each alkyl chain independently has 0 to 3 (e.g., 0, 1, 2, or 3) double bonds; and ether, ester, or ketal linkages between the head group and alkyl chains.
- cationic lipids include, but are not limited to, DSDMA, DODMA, DOTMA, DLinDMA, DLenDMA, .gamma.
- -DLenDMA DLin-K-DMA
- DLin-K-C2-DMA also known as DLin-C2K-DMA, XTC2, and C2K
- DLin-K-C3-DMA also known as DLin-K-C4-DMA
- DLen-C2K-DMA y-DLen-C2-DMA
- C12-200 cKK-E12, cKK-A12, cKK-012
- DLin-MC2- DMA also known as MC2
- DLin-MC3-DMA also known as MC3
- Anionic lipids suitable for use in lipid nanoparticles include, but are not limited to, phosphatidylglycerol, cardiolipin, diacylphosphatidylserine, diacylphosphatidic acid, N- dodecanoyl phosphatidylethanoloamine, N-succinyl phosphatidylethanolamine, N-glutaryl phosphatidylethanolamine, lysylphosphatidylglycerol, and other anionic modifying groups joined to neutral lipids.
- Neutral lipids suitable for use in lipid nanoparticles include, but are not limited to, diacylphosphatidylcholine, diacylphosphatidylethanolamine, ceramide, sphingomyelin, dihydrosphingomyelin, cephalin, sterols (e.g., cholesterol) and cerebrosides.
- the lipid nanoparticle comprises cholesterol.
- Lipids having a variety of acyl chain groups of varying chain length and degree of saturation are available or may be isolated or synthesized by well-known techniques. Additionally, lipids having mixtures of saturated and unsaturated fatty acid chains and cyclic regions can be used.
- the neutral lipids used in the disclosure are DOPE, DSPC, DPPC, POPC, or any related phosphatidylcholine.
- the neutral lipid may be composed of sphingomyelin, dihydrosphingomyeline, or phospholipids with other head groups, such as serine and inositol.
- amphipathic lipids are included in nanoparticles.
- Exemplary amphipathic lipids suitable for use in nanoparticles include, but are not limited to, sphingolipids, phospholipids, fatty acids, and amino lipids.
- the lipid composition of the pharmaceutical composition may comprise one or more phospholipids, for example, one or more saturated or (poly)unsaturated phospholipids or a combination thereof.
- phospholipids comprise a phospholipid moiety and one or more fatty acid moieties.
- a phospholipid moiety can be selected, for example, from the non-limiting group consisting of phosphatidyl choline, phosphatidyl ethanolamine, phosphatidyl glycerol, phosphatidyl serine, phosphatidic acid, 2-lysophosphatidyl choline, and a sphingomyelin.
- a fatty acid moiety can be selected, for example, from the non-limiting group consisting of lauric acid, myristic acid, myristoleic acid, palmitic acid, palmitoleic acid, stearic acid, oleic acid, linoleic acid, alpha-linolenic acid, erucic acid, phytanoic acid, arachidic acid, arachidonic acid, eicosapentaenoic acid, behenic acid, docosapentaenoic acid, and docosahexaenoic acid.
- Particular amphipathic lipids can facilitate fusion to a membrane.
- a cationic phospholipid can interact with one or more negatively charged phospholipids of a membrane (e.g., a cellular or intracellular membrane). Fusion of a phospholipid to a membrane can allow one or more elements (e.g., a therapeutic agent) of a lipid-containing composition (e.g., LNPs) to pass through the membrane permitting, e.g., delivery of the one or more elements to a target tissue.
- a cationic phospholipid can interact with one or more negatively charged phospholipids of a membrane (e.g., a cellular or intracellular membrane). Fusion of a phospholipid to a membrane can allow one or more elements (e.g., a therapeutic agent) of a lipid-containing composition (e.g., LNPs) to pass through the membrane permitting, e.g., delivery of the one or more elements to a target tissue.
- elements e.g., a
- Non-natural amphipathic lipid species including natural species with modifications and substitutions including branching, oxidation, cyclization, and alkynes are also contemplated.
- a phospholipid can be functionalized with or cross-linked to one or more alkynes (e.g., an alkenyl group in which one or more double bonds is replaced with a triple bond).
- alkynes e.g., an alkenyl group in which one or more double bonds is replaced with a triple bond.
- an alkyne group can undergo a copper- catalyzed cycloaddition upon exposure to an azide.
- Such reactions can be useful in functionalizing a lipid bilayer of a nanoparticle composition to facilitate membrane permeation or cellular recognition or in conjugating a nanoparticle composition to a useful component such as a targeting or imaging moiety (e.g., a dye).
- a targeting or imaging moiety e.g., a dye
- Phospholipids include, but are not limited to, glycerophospholipids such as phosphatidylcholines, phosphatidylethanolamines, phosphatidylserines, phosphatidylinositols, phosphatidy glycerols, and phosphatidic acids. Phospholipids also include phosphosphingolipid, such as sphingomyelin.
- the LNP composition comprises one or more phospholipids.
- the phospholipid is selected from the group consisting of 1,2-dilinoleoyl- sn-glycero-3 -phosphocholine (DLPC), 1,2-dimyristoyl-sn-glycero-phosphocholine (DMPC), l,2-dioleoyl-sn-glycero-3 -phosphocholine (DOPC), l,2-dipalmitoyl-sn-glycero-3- phosphocholine (DPPC), l,2-distearoyl-sn-glycero-3-phosphocholine (DSPC), 1,2- diundecanoyl-sn-glycero-phosphocholine (DUPC), l-palmitoyl-2-oleoyl-sn-glycero-3- phosphocholine (POPC), l,2-di-O-octadecenyl-sn
- DLPC 1,2-dilino
- phosphorus-lacking compounds such as sphingolipids, glycosphingolipid families, diacylglycerols, and .beta.-acyloxyacids, may also be used. Additionally, such amphipathic lipids can be readily mixed with other lipids, such as triglycerides and sterols.
- the LNP composition comprises one or more helper lipids.
- helper lipid refers to lipids that enhance transfection (e.g., transfection of an LNP comprising an mRNA that encodes a site-directed endonuclease, such as a SpCas9 polypeptide).
- the mechanism by which the helper lipid enhances transfection includes enhancing particle stability.
- the helper lipid enhances membrane fusogenicity.
- helper lipid of the LNP compositions disclosure herein can be any helper lipid known in the art.
- helper lipids suitable for the compositions and methods include steroids, sterols, and alkyl resorcinols.
- helper lipids suitable for use in the present disclosure include, but are not limited to, saturated phosphatidylcholine (PC) such as distearoyl-PC (DSPC) and dipalymitoyl-PC (DPPC), di oleoylphosphatidylethanolamine (DOPE), 1 ,2-dilinoleoyl-sn-glycero-3 -phosphocholine (DLPC), cholesterol, 5-heptadecylresorcinol, and cholesterol hemisuccinate.
- PC saturated phosphatidylcholine
- DSPC distearoyl-PC
- DPPC dipalymitoyl-PC
- DOPE di oleoylphosphatidylethanolamine
- DLPC 1 ,2-dilinoleoyl-sn-glycero-3 -phosphocholine
- cholesterol 5-heptadecylresorcinol
- cholesterol hemisuccinate hemisuccinate.
- the LNP composition comprises one or more structural lipids.
- structural lipid refers to sterols and also to lipids containing sterol moieties. Without being bound to any particular theory, it is believed that the incorporation of structural lipids into the LNPs mitigates aggregation of other lipids in the particle.
- Structural lipids can be selected from the group including but not limited to, cholesterol, fecosterol, sitosterol, ergosterol, campesterol, stigmasterol, brassicasterol, tomatidine, tomatine, ursolic acid, alpha-tocopherol, hopanoids, phytosterols, steroids, and mixtures thereof.
- the structural lipid is a sterol.
- sterols are a subgroup of steroids consisting of steroid alcohols.
- the structural lipid is a steroid.
- the structural lipid is cholesterol.
- the structural lipid is an analog of cholesterol.
- the lipid component of a lipid nanoparticle composition may include one or more molecules comprising polyethylene glycol, such as PEG or PEG-modified lipids.
- the LNP composition disclosed herein comprise one or more polyethylene glycol (PEG) lipid.
- PEG-lipid refers to polyethylene glycol (PEG)-modified lipids. Such lipids are also referred to as PEGylated lipids.
- Non-limiting examples of PEG- lipids include PEG-modified phosphatidylethanolamine and phosphatidic acid, PEG- ceramide conjugates (e.g., PEG-CerC14 or PEG-CerC20), PEG-modified dialkylamines and PEG-modified l,2-diacyloxypropan-3 -amines
- a PEG lipid can be PEG-c- DOMG, PEG-DMG, PEG-DLPE, PEG-DMPE, PEG-DPPC, or a PEG-DSPE lipid.
- the PEG-lipid includes, but not limited to 1,2-dimyristoyl-sn-glycerol methoxypolyethylene glycol (PEG-DMG), l,2-distearoyl-sn-glycero-3- phosphoethanolamine-N-[amino(polyethylene glycol)] (PEG-DSPE), PEG-disteryl glycerol (PEG-DSG), PEG-dipalmetoleyl, PEG-dioleyl, PEG-distearyl, PEG-diacylglycamide (PEGDAG), PEG-dipalmitoyl phosphatidylethanolamine (PEG-DPPE), or PEG-1, 2- dimyristyloxlpropyl-3-amine (PEG-c-DMA).
- PEG-DMG 1,2-dimyristoyl-sn-glycerol methoxypolyethylene glycol
- PEG-DSPE l,2-distearoyl-sn
- the PEG-lipid is selected from the group consisting of a PEG-modified phosphatidylethanolamine, a PEG-modified phosphatidic acid, a PEG-modified ceramide, a PEG-modified dialkylamine, a PEG-modified diacylglycerol, a PEG-modified dialkylglycerol, and mixtures thereof.
- the lipid moiety of the PEG-lipids includes those having lengths of from about C. sub.14 to about C. sub.22, preferably from about C. sub.14 to about C. sub.16.
- a PEG moiety for example a mPEG-NH.sub.2, has a size of about 1000, 2000, 5000, 10,000, 15,000 or 20,000 daltons.
- the PEG-lipid is PEG2k-DMG.
- the one or more PEG lipids of the LNP composition comprises PEG-DMPE.
- the one or more PEG lipids of the LNP composition comprises PEG- DMG.
- the ratio between the lipid components and the nucleic acid molecules of the LNP composition is sufficient for (i) formation of LNPs with desired characteristics, e.g., size, charge, and (ii) delivery of a sufficient dose of nucleic acid at a dose of the lipid component(s) that is tolerable for in vivo administration as readily ascertained by one of skill in the art.
- a nanoparticle e.g., a lipid nanoparticle
- a targeting moiety that is specific to a cell type and/or tissue type.
- a nanoparticle may be targeted to a particular cell, tissue, and/or organ using a targeting moiety.
- a nanoparticle comprises a targeting moiety.
- targeting moieties include ligands, cell surface receptors, glycoproteins, vitamins (e.g., riboflavin) and antibodies (e.g., full-length antibodies, antibody fragments (e.g., Fv fragments, single chain Fv (scFv) fragments, Fab' fragments, or F(ab')2 fragments), single domain antibodies, camelid antibodies and fragments thereof, human antibodies and fragments thereof, monoclonal antibodies, and multispecific antibodies (e.g., bispecific antibodies)).
- the targeting moiety may be a polypeptide.
- the targeting moiety may include the entire polypeptide (e.g., peptide or protein) or fragments thereof.
- a targeting moiety is typically positioned on the outer surface of the nanoparticle in such a manner that the targeting moiety is available for interaction with the target, for example, a cell surface receptor.
- a variety of different targeting moieties and methods are known and available in the art, including those described, e.g., in Sapra et al., Prog. Lipid Res. 42(5):439-62, 2003 and Abra et al., J. Liposome Res. 12: 1-3, 2002.
- a lipid nanoparticle may include a surface coating of hydrophilic polymer chains, such as polyethylene glycol (PEG) chains (see, e.g., Allen et al., Biochimica et Biophysica Acta 1237: 99-108, 1995; DeFrees et al., Journal of the American Chemistry Society 118: 6101-6104, 1996; Blume et al., Biochimica et Biophysica Acta 1149: 180-184,1993; Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; U.S. Pat. No.
- PEG polyethylene glycol
- a targeting moiety for targeting the lipid nanoparticle is linked to the polar head group of lipids forming the nanoparticle.
- the targeting moiety is attached to the distal ends of the PEG chains forming the hydrophilic polymer coating (see, e.g., Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; Kirpotin et al., FEBS Letters 388: 115-118, 1996).
- Standard methods for coupling the targeting moiety or moi eties may be used.
- phosphatidylethanolamine which can be activated for attachment of targeting moieties, or derivatized lipophilic compounds, such as lipid-derivatized bleomycin, can be used.
- Antibody -targeted liposomes can be constructed using, for instance, liposomes that incorporate protein A (see, e.g., Renneisen et al., J. Bio. Chem., 265: 16337-16342, 1990 and Leonetti et al., Proc. Natl. Acad. Sci. (USA), 87:2448-2451, 1990).
- Other examples of antibody conjugation are disclosed in U.S. Pat. No. 6,027,726.
- targeting moieties can also include other polypeptides that are specific to cellular components, including antigens associated with neoplasms or tumors.
- Polypeptides used as targeting moieties can be attached to the liposomes via covalent bonds (see, for example Heath, Covalent Attachment of Proteins to Liposomes, 149 Methods in Enzymology 111-119 (Academic Press, Inc. 1987)).
- Other targeting methods include the biotin-avidin system.
- a lipid nanoparticle includes a targeting moiety that targets the lipid nanoparticle to a cell including, but not limited to, hepatocytes, colon cells, epithelial cells, hematopoietic cells, epithelial cells, endothelial cells, lung cells, bone cells, stem cells, mesenchymal cells, neural cells, cardiac cells, adipocytes, vascular smooth muscle cells, cardiomyocytes, skeletal muscle cells, beta cells, pituitary cells, synovial lining cells, ovarian cells, testicular cells, fibroblasts, B cells, T cells, reticulocytes, leukocytes, granulocytes, and tumor cells (including primary tumor cells and metastatic tumor cells).
- the targeting moiety targets the lipid nanoparticle to a hepatocyte.
- the lipid nanoparticles described herein may be lipidoid-based.
- the synthesis of lipidoids has been extensively described and formulations containing these compounds are particularly suited for delivery of polynucleotides (see Mahon et al., Bioconjug Chem. 2010 21 : 1448-1454; Schroeder et al., J Intern Med. 2010 267:9-21; Akinc et al., Nat. Biotechnol. 2008 26:561-569; Love et al., Proc Natl Acad Sci USA. 2010 107: 1864-1869; Siegwart et al., Proc Natl Acad Sci USA. 2011 108: 12996-3001).
- lipidoid formulations for intramuscular or subcutaneous routes may vary significantly depending on the target cell type and the ability of formulations to diffuse through the extracellular matrix into the blood stream. While a particle size of less than 150 nm may be desired for effective hepatocyte delivery due to the size of the endothelial fenestrae (see e.g., Akinc et al., Mol Ther. 2009 17:872-879), use of lipidoid oligonucleotides to deliver the formulation to other cells types including, but not limited to, endothelial cells, myeloid cells, and muscle cells may not be similarly size-limited.
- lipidoid formulations may have a similar component molar ratio.
- Different ratios of lipidoids and other components including, but not limited to, a neutral lipid (e.g., diacylphosphatidylcholine), cholesterol, a PEGylated lipid (e.g., PEG-DMPE), and a fatty acid (e.g., an omega-3 fatty acid) may be used to optimize the formulation of the mRNA or system for delivery to different cell types including, but not limited to, hepatocytes, myeloid cells, muscle cells, etc.
- a neutral lipid e.g., diacylphosphatidylcholine
- cholesterol e.g., a PEGylated lipid
- PEG-DMPE PEGylated lipid
- a fatty acid e.g., an omega-3 fatty acid
- Exemplary lipidoids include, but are not limited to, DLin-DMA, DLin-K-DMA, DLin-KC2-DMA, 98N12-5, C 12-200 (including variants and derivatives), DLin-MC3-DMA and analogs thereof.
- lipidoid formulations for the localized delivery of nucleic acids to cells may also not require all of the formulation components which may be required for systemic delivery, and as such may comprise the lipidoid and the mRNA or system.
- a system described herein may be formulated by mixing the mRNA or system, or individual components of the system, with the lipidoid at a set ratio prior to addition to cells.
- In vivo formulations may require the addition of extra ingredients to facilitate circulation throughout the body.
- a system or individual components of a system is added and allowed to integrate with the complex. The encapsulation efficiency is determined using a standard dye exclusion assays.
- In vivo delivery of systems may be affected by many parameters, including, but not limited to, the formulation composition, nature of particle PEGylation, degree of loading, oligonucleotide to lipid ratio, and biophysical parameters such as particle size (Akinc et al., Mol Ther. 2009 17:872-879; herein incorporated by reference in its entirety).
- particle size Akinc et al., Mol Ther. 2009 17:872-879; herein incorporated by reference in its entirety.
- small changes in the anchor chain length of poly(ethylene glycol) (PEG) lipids may result in significant effects on in vivo efficacy.
- Formulations with the different lipidoids including, but not limited to penta[3-(l-laurylaminopropionyl)]-triethylenetetramine hydrochloride (TETA-5LAP; aka 98N12-5, see Murugaiah et al., Analytical Biochemistry, 401 :61 (2010)), Cl 2-200 (including derivatives and variants), MD1, DLin-DMA, DLin-K-DMA, DLin-KC2- DMA and DLin-MC3-DMA can be tested for in vivo activity.
- the lipidoid referred to herein as "98N12-5" is disclosed by Akinc et al., Mol Ther. 2009 17:872-879).
- the lipidoid referred to herein as "C12-200” is disclosed by Love et al., Proc Natl Acad Sci USA. 2010 107: 1864- 1869 and Liu and Huang, Molecular Therapy. 2010 669-670.
- the LNPs of the present disclosure in which a nucleic acid is entrapped within the lipid portion of the particle and is protected from degradation, can be formed by any method known in the art including, but not limited to, a continuous mixing method, a direct dilution process, and an in-line dilution process. Additional techniques and methods suitable for the preparation of the LNPs described herein include coacervation, microemulsions, supercritical fluid technologies, phase-inversion temperature (PIT) techniques.
- PIT phase-inversion temperature
- the LNPs used herein are produced via a continuous mixing method, e.g., a process that includes providing an aqueous solution a nucleic acid described herein in a first reservoir, providing an organic lipid solution in a second reservoir (wherein the lipids present in the organic lipid solution are solubilized in an organic solvent, e.g., a lower alkanol such as ethanol), and mixing the aqueous solution with the organic lipid solution such that the organic lipid solution mixes with the aqueous solution so as to substantially instantaneously produce a lipid vesicle (e.g., liposome) encapsulating the nucleic acid molecule within the lipid vesicle.
- a continuous mixing method e.g., a process that includes providing an aqueous solution a nucleic acid described herein in a first reservoir, providing an organic lipid solution in a second reservoir (wherein the lipids present in the organic lipid solution are solubilized in an organic solvent
- the LNPs used herein are produced via a direct dilution process that includes forming a lipid vesicle (e.g., liposome) solution and immediately and directly introducing the lipid vesicle solution into a collection vessel containing a controlled amount of dilution buffer.
- the collection vessel includes one or more elements configured to stir the contents of the collection vessel to facilitate dilution.
- the amount of dilution buffer present in the collection vessel is substantially equal to the volume of lipid vesicle solution introduced thereto.
- the LNPs are produced via an in-line dilution process in which a third reservoir containing dilution buffer is fluidly coupled to a second mixing region.
- the lipid vesicle (e.g., liposome) solution formed in a first mixing region is immediately and directly mixed with dilution buffer in the second mixing region.
- This disclosure is not limited to systems and methods described herein. Any delivery method that is capable of delivering the systems described herein can be used as long as it is capable of site-specifically integrating a template polynucleotide into the genome of a cell.
- compositions and co-delivery methods for correcting or replacing genes or gene fragments (including introns or exons) or inserting genes in new locations.
- a method comprises recombination or integration into a safe harbor site (SHS).
- SHS safe harbor site
- a frequently used human SHS is the AAVS1 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion.
- Another locus comprises the human homolog of the murine Rosa26 locus.
- Yet another SHS comprises the human Hl 1 locus on chromosome 22.
- a complete gene may be prohibitively large and replacement of an entire gene impractical.
- a method of the invention comprises recombining corrective gene fragments into a defective locus.
- the methods and compositions can be used to target, without limitation, stem cells for example induced pluripotent stem cells (iPSCs), HSCs, HSPCs, mesenchymal stem cells, or neuronal stem cells and cells at various stages of differentiation.
- stem cells for example induced pluripotent stem cells (iPSCs), HSCs, HSPCs, mesenchymal stem cells, or neuronal stem cells and cells at various stages of differentiation.
- methods and compositions of the invention are adapted to target organoids, including patient derived organoids.
- methods and compositions of the invention are adapted to treat muscle cells, not limited to cardiomyocytes for Duchene Muscular Dystrophy (DMD).
- the dystrophin gene is the largest gene in the human genome, spanning ⁇ 2.3 Mb of DNA. DMD is composed of 79 exons resulting in a 14-kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon.
- An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss-of-function mutations, thereby restoring the open reading frame (ORFs).
- the methods and systems described herein are used to treat DMD by site-specifically integrating in the genome a polynucleotide template that repairs or replaces all or a portion of the defective DMD gene.
- ANCA Anti-Neutrophil Cytoplasmic Antibody
- SLE Systemic Lupus Erythematosus
- LN Lupus Nephritis
- MN Membranous Nephropathy
- HCU Homocystinuria
- CFTR cystic fibrosis transmembrane conductance regulator
- the most common cystic fibrosis (CF) mutation F508del removes a single amino acid.
- recombining human CFTR into an SHS of a cell that expresses CFTR F508del is a corrective treatment path.
- the methods and systems described herein are used to CF by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing CF. Proposed validation is detection of persistent CFTR mRNA and protein expression in transduced cells.
- Sickle cell disease is caused by mutation of a specific amino acid — valine to glutamic acid at amino acid position 6.
- SCD is corrected by recombination of the HBB gene into a safe harbor site (SHS) and by demonstrating correction in a proportion of target cells that is high enough to produce a substantial benefit.
- the methods and systems described herein are used to sickle cell disease by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the disease.
- validation is detection of persistent HBB mRNA and protein expression in transduced cells.
- DMD Duchenne Muscular Dystrophy.
- the dystrophin gene is the largest gene in the human genome, spanning ⁇ 2.3 Mb of DNA. DMD is composed of 79 exons resulting in a 14-kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon. An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss- of-function mutations, thereby restoring the open reading frame (ORFs).
- recombination will be into safe harbor sites (SHS).
- SHS safe harbor sites
- a frequently used human SHS is the A4P57 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion.
- the site is the human homolog of the e murine Rosa26 locus (pubmed.ncbi.nlm.nih.gov/18037879).
- the site is the human Hl 1 locus on chromosome 22.
- Proposed target cells for recombination include stem cells for example induced pluripotent stem cells (iPSCs) and cells at various stages of differentiation. In some cases, a complete gene may be prohibitively large and replacement of an entire gene impractical. In such instances, rescuing mutants by recombining in corrected gene fragments with the methods and systems described herein is a corrective option.
- iPSCs induced pluripotent stem cells
- correcting mutations in exon 44 (or 51) by recombining in a corrective coding sequence downstream of exon 43 (or 50), using the methods and systems described herein is a corrective option.
- Proposed validation is detection of persistent DMD mRNA and protein expression in transduced cells.
- F8 Factor VIII
- F8 Factor VIII
- F8 A large proportion of severe hemophilia A patients harbor one of two types of chromosomal inversions in the FVIII gene.
- the recombinase technology and methods described herein are well suited to correcting such inversions (and other mutations) by recombining of the FVIII gene into a SHS.
- correcting factor VIII deficiency by recombining the FVIII gene into an SHS is a corrective path.
- the methods and systems described herein are used to correct factor VIII deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency. Proposed validation is detection of persistent FVIII mRNA and protein expression in transduced cells.
- Factor 9 (Factor IX) Hemophilia B, also called factor IX (FIX) deficiency is a genetic disorder caused by missing or defective factor IX, a clotting protein.
- the methods and systems described herein are used to correct factor IX deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency.
- Proposed validation is detection of persistent FiX mRNA and protein expression in transduced cells.
- Ornithine transcarbamylase deficiency is a rare genetic condition that causes ammonia to build up in the blood.
- the condition - more commonly called OTC deficiency - is more common in boys than girls and tends to be more severe when symptoms emerge shortly after birth.
- the methods and systems described herein are used to correct OTC deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the OTC deficiency or integrates a polynucleotide encoding a functional ornithine transcarbamylase enzyme.
- Proposed validation is detection of persistent OTC mRNA and protein expression in transduced cells.
- Phenylketonuria also called PKU, is a rare inherited disorder that causes an amino acid called phenylalanine to build up in the body. PKU is caused by a change in the phenylalanine hydroxylase (PAH) gene. This gene helps create the enzyme needed to break down phenylalanine.
- PKU phenylalanine hydroxylase
- the methods and systems described herein are used to correct PKU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the PKU deficiency or integrates a polynucleotide encoding a functional phenylalanine hydroxylase (PAH) gene.
- Proposed validation is detection of persistent PAH mRNA and protein expression in transduced cells.
- Homocystinuria is elevation of the amino acid, homocysteine (protein building block coming from our diet) in the urine or blood.
- Common causes of HCU include: problems with the enzyme cystathionine beta synthase (CBS), which converts homocysteine to the amino acid cystathionine (which then becomes cysteine) and needs the vitamin B6 (pyridoxine); and problems with converting homocysteine to the amino acid methionine.
- CBS cystathionine beta synthase
- pyridoxine pyridoxine
- the methods and systems described herein are used to correct HCU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the HCU or integrates a polynucleotide encoding a functional copy of a gene (e.g., CBS) able to reduce or prevent buildup of homocysteine in the urine.
- Proposed validation is detection of persistent CBS mRNA and protein expression in transduced cells.
- IgA Nephropathy (Berger’s disease). IgA nephropathy, also known as Berger's disease, is a kidney/autoimmune disease that occurs when an antibody called immunoglobulin A (IgA) builds up in the kidneys.
- IgA immunoglobulin A
- the methods and systems described herein are used to treat Berger’s disease by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of Berger’s disease.
- ANCA vasculitis is an autoimmune disease affecting small blood vessels in the body. It is caused by autoantibodies called ANCAs, or Anti-Neutrophilic Cytoplasmic Autoantibodies. ANCAs target and attack a certain kind of white blood cells called neutrophils.
- the methods and systems described herein are used to treat ANCA vasculitis by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of ANCA vasculitis.
- Lupus is an autoimmune — a disorder in which the body’s immune system attacks the body’s own cells and organs.
- the methods and systems described herein are used to treat SLE/LN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of SLE/LN.
- MN Membranous Nephropathy
- the methods and systems described herein are used to treat MN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of MN.
- C3 glomerulonephritis C3GN.
- C3 glomerulopathy is a group of related conditions that cause the kidneys to malfunction.
- the major features of C3 glomerulopathy include high levels of protein in the urine (proteinuria), blood in the urine (hematuria), reduced amounts of urine, low levels of protein in the blood, and swelling in many areas of the body.
- Affected individuals may have particularly low levels of a protein called complement component 3 (or C3) in the blood.
- the methods and systems described herein are used to treat C3 glomerulopathy by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of C3 glomerulopathy.
- Glucagon-like peptide 1 (GLP-1) is a small peptide component of the prohormone, proglucagon, that is produced in the gut.
- the methods and systems described herein include administering, and thereby site-specifically integrating, a polynucleotide encoding GLP-1 or a GLP-1 agonist. 5.12. Methods of treatment
- methods of treatment comprises administering an effective amount of the pharmaceutical composition comprising the nucleic acid construct or vectorized nucleic acid construct described above to a patient in need thereof.
- the system e.g., any of the systems described herein
- the systems are delivered to a patient, thereby delivering to a cell in vivo.
- DNA or RNA viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients ex vivo).
- Conventional viral based systems to be used herein could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno- associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
- the co-delivery system described herein e.g., a gene editor construct packaged in a LNP and a donor template packaged in a vector
- the co-delivery system described herein is administered intravenously.
- the co-delivery system described herein e.g., a gene editor construct packaged in a LNP and a donor template packaged in a vector
- the co-delivery system described herein e.g., a gene editor construct packaged in a LNP and a donor template packaged in a vector
- the co-delivery system described herein e.g., a gene editor construct packaged in a LNP and a donor template packaged in a vector
- the co-delivery system described herein is administered by intracistemal magna administration.
- the co-delivery system described herein e.g., a gene editor construct packaged in a LNP and a donor template packaged in a vector
- Methods of non-viral delivery of the donor DNA template described herein include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipidmucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA.
- Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., TransfectamTM and LipofectinTM).
- Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Feigner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
- mRNA delivery methods and compositions that may be utilized in the present disclosure including, for example, PCT/US2014/028330, US8822663B2, NZ700688A, ES2740248T3, EP2755693A4, EP2755986A4, WO2014152940A1, EP3450553B1, BRI 12016030852A2, and EP3362461A1.
- Expression of CRISPR systems in particular is described by W02020014577.
- Each of these publications are incorporated herein by reference in their entireties. Additional disclosure hereby incorporated by reference can be found in Kowalski et al., “Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery,” Mol Therap., 2019; 27(4): 710-728.
- the method for identifying off-target editing is Cryptic site sequencing (Cryptic-Seq).
- Cryptic-Seq can be used to identify off-target editing of integration/recombination enzymes that recognize an attachment site, such as sequence specific integrase, transposase, insertion element, resolvase, invertase, or recombinase.
- the integrase is an integrase of the serine or tyrosine family.
- the integrase is a large serine integrase (LSI).
- the transposase is a Mu transposase.
- the recombinase is a recombinase of the serine or tyrosine family. In some embodiments, the recombinase is a RAG family recombinase. [0273] In some embodiments, Cryptic-Seq is used to identify the off-target editing of a large serine integrase. In certain embodiments, the large serine integrase is an integration enzyme disclosed in PCT Publication No. W02023/070031, the disclosure of which is incorporated by reference in its entirety.
- Cryptic-Seq is used with one large serine integrase (LSI) multiplexed with attachment sites that differ only in the central dinucleotide sequence. In some embodiments, Cryptic-Seq is used with multiple large serine integrases (LSIs) simultaneously with each of the LSI’s specific attachment site DNA substrates.
- LSI large serine integrase
- a method for identifying cryptic attachment sites that are recognizable by a selected large serine integrase (LSI) in a target genome comprising: (a) fragmenting genomic DNA isolated from target cells; (b) tagging the 5’ and 3’ ends of the DNA fragments with a first oligonucleotide adapter; (c) contacting the tagged genomic DNA fragments with (i) a first species of synthetic DNA molecule, wherein the first species of synthetic DNA molecule comprises a first attachment site known to be recognized by the selected LSI, and a second oligonucleotide adapter; (ii) optionally at least a second species of synthetic DNA molecule, wherein the second species of synthetic DNA molecule comprises a first attachment site known to be recognized by the selected LSI, wherein the first attachment site of the at least second species of synthetic DNA molecule is different in sequence in the central dinucleotide from the first attachment site of the first species of synthetic DNA
- the first oligonucleotide adapter comprises a primer binding site.
- the second oligonucleotide adapter comprises a primer binding site.
- the primer binding site of the first oligonucleotide adapter, the primer binding site of the second oligonucleotide adapter, or both are capable of mediating PCR amplification.
- the primer binding site of the first oligonucleotide adapter, the primer binding site of the second oligonucleotide adapter, or both are capable of mediating sequencing-by-synthesis.
- the primer binding site of the first oligonucleotide adapter, the primer binding site of the second oligonucleotide adapter, or both are capable of mediating PCR amplification and sequencing-by-synthesis.
- the synthetic DNA molecule further comprises a unique molecular identifier (UMI).
- UMI unique molecular identifier
- the UMI is selected from: a random UMI, a variable length UMI, a variable length non-random UMI, a fixed length random UMI, a dual-indexed UMI, and a duplex UMI.
- the UMI is positioned in the synthetic DNA molecule between the first attachment site and the second oligonucleotide adapter.
- the synthetic DNA molecule further comprises a barcode.
- the barcode is selected from: a predefined set of DNA barcodes, a variable length barcode, and a dual-indexed barcode.
- the synthetic DNA molecule is circular. In some embodiments, the synthetic DNA molecule is linear.
- the large serine integrase is Bxbl.
- the first attachment site is an attP or a modified attP site. In some embodiments, the first attachment site is an attB or a modified attB site.
- generating a sequencing library comprises an initial step of amplifying the tagged genomic DNA fragment after recombination with the synthetic DNA molecule.
- identifying a cryptic attachment site comprises determining the sequence of the genomic site into which the DNA molecule has been recombined or the sequence of DNA flanking the cryptic attachment site. In some embodiments, identifying a cryptic attachment site comprises detecting an attL-genomic DNA junction, an attR-genomic DNA junction, or both. In some embodiments, identifying a cryptic attachment site comprises: aligning the sequencing data to a reference genome; detecting an attachment site- genomic DNA junction; and reporting the coordinates and using the coordinates to identify cryptic attachment site.
- detecting an attachment site-genomic DNA junction comprises detecting an attL-genomic DNA junction, an attR-genomic DNA junction, or both.
- reporting the coordinates comprises ranking the coordinates based on sequencing reads de-duplicated from PCR amplification using the UMIs.
- the UMI count represents the recombination efficacy of the first attachment site.
- the identified cryptic attachment site was not previously known to be an attachment site of the selected LSI.
- a multiplexed method for identifying cryptic attachment sites that are separately recognizable by each of a plurality of large serine integrases (LSIs) in a target genome comprising: (a) fragmenting genomic DNA isolated from target cells; (b) tagging the 5’ and 3’ ends of the DNA fragments with a first oligonucleotide adapter; (c) contacting the tagged genomic DNA fragments with (i) a first species of synthetic DNA molecule, wherein the first species of synthetic DNA molecule comprises a first attachment site known to be recognized by a first LSI, and a second oligonucleotide adapter; (ii) at least a second species of synthetic DNA molecule, wherein the second species of synthetic DNA molecule comprises a first attachment site known to be recognized by a second LSI, wherein the first attachment site of the at least second species of synthetic DNA molecule is different in sequence from the first attachment site of the first species of synthetic DNA
- the present disclosure also provides a synthetic DNA molecule for identifying cryptic attachment sites that are recognizable by a selected large serine integrase (LSI) in a target genome, comprising: a first attachment site and an oligonucleotide adapter, wherein the first attachment site is known to be recognized by the selected LSI.
- LSI large serine integrase
- the oligonucleotide adapter comprises a primer binding site.
- the primer binding site is capable of mediating PCR amplification.
- the primer binding site is capable of mediating sequencing-by-synthesis.
- the primer binding site is capable of mediating PCR amplification and sequencing-by-synthesis.
- the synthetic DNA molecule further comprises a unique molecular identifier (UMI).
- UMI unique molecular identifier
- the UMI is selected from: a random UMI, a variable length UMI, a variable length non-random UMI, a fixed length random UMI, a dual-indexed UMI, and a duplex UMI.
- the UMI is positioned in the synthetic DNA molecule between the first attachment site and the oligonucleotide adapter.
- the synthetic DNA molecule further comprises a barcode.
- the barcode is selected from: a predefined set of DNA barcodes, a variable length barcode, and a dual-indexed barcode.
- the synthetic DNA molecule is circular. In some embodiments, the synthetic DNA molecule is linear.
- the large serine integrase is Bxbl.
- the first integration recognition site is an attP or a modified attP site.
- the first integration recognition site is an attB or a modified attB site.
- a set of synthetic DNA molecules for identifying cryptic attachment sites that are recognizable by a selected large serine integrase comprising: a first species of synthetic DNA molecule, comprising a first attachment site known to be recognized by the selected LSI, an oligonucleotide adapter, and a first DNA barcode; and at least a second species of synthetic DNA molecule, comprising a first attachment site known to be recognized by the selected LSI, wherein the first attachment site of the at least second species of synthetic DNA molecule is different in sequence in the central dinucleotide from the first attachment site of the first species of synthetic DNA molecule, an oligonucleotide adapter, and a second DNA barcode; wherein the first DNA barcode and the second DNA barcode allow for multiplexing of multiple first attachment sites of different sequences in one reaction.
- LSI large serine integrase
- the oligonucleotide adapter comprises a primer binding site.
- the primer binding site is capable of mediating PCR amplification. In some embodiments, the primer binding site is capable of mediating sequencing-by-synthesis. In some embodiments, the primer binding site is capable of mediating PCR amplification and sequencing-by-synthesis.
- the synthetic DNA molecule further comprises a unique molecular identifier (UMI).
- UMI unique molecular identifier
- the UMI is selected from: a random UMI, a variable length UMI, a variable length non-random UMI, a fixed length random UMI, a dual-indexed UMI, and a duplex UMI.
- the UMI is positioned in the synthetic DNA molecule between the first attachment site and the oligonucleotide adapter.
- the barcode is selected from: a predefined set of DNA barcodes, a variable length barcode, and a dual-indexed barcode.
- the synthetic DNA molecule is circular. In some embodiments, the synthetic DNA molecule is linear.
- the large serine integrase is Bxbl.
- the first integration recognition site is an attP or a modified attP site.
- the first integration recognition site is an attB or a modified attB site.
- the present disclosure also provides a set of synthetic DNA molecules for identifying cryptic attachment sites that are separately recognizable by each of a plurality of large serine integrases (LSIs), comprising: a first species of synthetic DNA molecule, comprising a first attachment site known to be recognized by a first LSI, an oligonucleotide adapter, and a first DNA barcode; and at least a second species of synthetic DNA molecule, comprising a first attachment site known to be recognized by a second LSI, wherein the first attachment site of the at least second species of synthetic DNA molecule is different in sequence from the first attachment site of the first species of synthetic DNA molecule, an oligonucleotide adapter, and a second DNA barcode; wherein the first DNA barcode and the second DNA barcode allow for identifying the respective cryptic attachment sites of each of the plurality of LSIs in one reaction.
- LSIs large serine integrases
- kits for identifying cryptic attachment sites that are recognizable by a selected large serine integrase (LSI) or by each of a plurality of large serine integrases (LSIs).
- the kit comprises the synthetic DNA molecule of the present disclosure and instructions for performing the method of the present disclosure. 5.13.2. HIDE-Seq
- the method for identifying off-target editing is High- throughput Integrase-mediated DNA Event Sequencing (HIDE-Seq).
- HIDE-Seq can identify loci of recombination with DNA substrates, loci of recombination without substrates, loci with double-strand break formation from abortive recombination events, and loci with intra- genomic recombination independent of the synthetic DNA substrates.
- a method for identifying DNA recombination events of a selected integrase in a target genome comprising: (a) contacting genomic DNA isolated from target cells with (i) a first species of synthetic DNA molecule, wherein the first species of synthetic DNA molecule is linear and comprises a first attachment site known to be recognized by the selected integrase, and (ii) the selected integrase, under conditions suitable for the integrase to effect the recombination of the genomic DNA; (b) generating a sequencing library comprising the recombined genomic DNA; (c) sequencing the sequencing library; and (d) detecting DNA recombination events based on the sequencing data.
- the DNA recombination events are selected from: a DNA double strand break, a recombination of the attachment site with a cryptic site in the genomic DNA, and a recombination between two cryptic sites in the genomic DNA.
- the integrase is a large serine integrase (LSI). In some embodiments, the large serine integrase is Bxbl. In some embodiments, the first integration recognition site is an attP or a modified attP site. In some embodiments, the first integration recognition site is an attB or a modified attB site.
- generating a sequencing library does not comprise amplifying the genomic DNA after recombination of the synthetic DNA molecule. In some embodiments, generating a sequencing library comprises generating a whole genome sequencing library. In some embodiments, sequencing the sequencing library comprises whole genome sequencing.
- detecting an DNA recombination event comprising: aligning the sequencing data to a reference genome; and detecting a DNA double strand break, a recombination of the integration recognition site with a cryptic site in the genomic DNA, or a recombination between two cryptic sites in the genomic DNA.
- a synthetic DNA molecule for identifying DNA recombination events of a selected integrase in a target genome comprising: a first attachment site, wherein the first attachment site is known to be recognized by the selected integrase, wherein the synthetic DNA molecule is linear.
- the DNA recombination events are selected from: a DNA double strand break, a recombination of the attachment site with a cryptic site in the genomic DNA, and a recombination between two cryptic sites in the genomic DNA.
- the integrase is a large serine integrase (LSI). In some embodiments, the large serine integrase is Bxbl. In some embodiments, the first integration recognition site is an attP or a modified attP site. In some embodiments, the first integration recognition site is an attB or a modified attB site.
- kits for identifying DNA recombination events of a selected integrase comprising: (a) the synthetic DNA molecule of the present disclosure; and (b) instructions for performing the method of the present disclosure.
- the method for validating off-target editing is hybridization capture followed deep sequencing of capture targets by with NGS (hybrid capture NGS).
- Hybrid capture NGS can be used to determine the risk of off-target editing across the potential ‘landscape’ of cryptic attachment sequences in the human genome (FIG. 7).
- Hybridization capture followed by next generation sequencing is a target enrichment approach for comprehensive and quantitative validation of off-target sites in edited genomic DNA isolated from edited cells.
- probes can be designed and ordered from vendors such as IDT or Twist Biosciences by submitting hg38 genome coordinates for the registered dinucleotide breakpoint and allow for probe design directly 5’ or 3’ adjacent to the breakpoint and distances up to 180 nucleotides upstream (5’) or downstream (3’) of the LSI site breakpoint.
- the probe is designed to be adjacent, but not overlapping, with the LSI dinucleotide breakpoint in the off-target site to be queried.
- two sets of hybrid capture validation probes can be designed to enable comprehensive validation of potential off-target editing; probe set 1 hybridizes 5’ (i.e., upstream) of the potential off-target site and probe set 2 hybridizes 3’ (i.e., downstream) of the potential off-target site which allows the detection of off-target insertion, intra- or inter-chromosomal translocations and indels. Derisking of any validated off-target involves assessing gene function and cancer relevance, guiding decisions on therapeutic suitability.
- the method for identifying off-target editing is Circularization for High-throughput Analysis of Large SErine Recombinase by Sequencing (CHASER-Seq) (FIG. 8).
- This technology is an amplification-based method that takes advantage of the sequencing of double stranded DNA circles generated from ligated fragments of genomic DNA that linearize upon recombination with the unique attachment sequences in combination with PCR amplification of linearized circles followed by next genome sequencing to discover potential LSI off-target sites.
- CHASER-Seq can also detect double-strand DNA breaks.
- the method for identifying off-target editing is Attachment site sequencing (Att-Seq) (FIG. 9).
- Att-Seq is a targeted but amplification-free sequencing method that takes advantage of direct DNA sequencing of small attP and attB containing double stranded oligonucleotides post-recombination by Bxb 1 integrase in a biochemical reaction.
- Att-Seq is performed with known attB and attP oligonucleotide substrates.
- Att-Seq provides quantitative information on site recombination preferences.
- Att-Seq detects rare events, such as DSBs or antiparallel recombination that can occur with mismatched attP and attB substrates.
- multiple attP and or attB substrates of known quantity and sequence are inputted to survey the recombination potential in the presence of LSI or mutated variants of an LSI (e.g., high fidelity LSI protein are compared to WT LSI proteins in the ability to recombine multiple off-target attB substrates with a wildtype attP substrate).
- Genomic DNA from internally developed cell line HEK Clone 12 was isolated using the Qiagen Blood and Tissue Kit.
- the gDNA was incubated with Tn5 transposase (custom recombinant protein purification by Genscript, pre-annealed with MES_Rev_3InvdR and MES_AmpSeq_P5 oligos) for 7 minutes at 55°C to tagment the gDNA and shear it to an average size of around 700bp.
- Tn5 transposase custom recombinant protein purification by Genscript, pre-annealed with MES_Rev_3InvdR and MES_AmpSeq_P5 oligos
- PCR was used to amplify the regions of integration using Q5 polymerase and primers for P5 containing distinct barcodes for each sample (LM P5 F1-F6) and a single N7 primer (GN037_CrypticSeq_N7_i7_N701). Following PCR, a 1.5x bead clean-up was performed and products were run on tape station using DI 000 tape to confirm amplification. Library concentration was determined using the NEB Next Library Quant Kit for Illumina. Libraries were normalized to 2nM, loaded at a concentration of 750pM and sequenced via Illumina Next Seq according to the manufacturer's instructions.
- Recombination reactions with 8 micrograms of purified human genomic DNA (gDNA) from PBMCs (Qiagen Blood and Tissue kit) or, in separate reactions, internal HEK293 cell line C12 that contains an Bxbl integrase attB site in an integrated lentiviral construct on chromosome 5 was incubated with 10 nM attB_66bp annealed oligonucleotide (for cryptic attP site or DSB discovery) or 10 nM attP_72bp annealed oligonucleotide (for cryptic attB site or DSB discovery) and 400 nM of recombinant Bxbl integrase (Genscript) in recombination buffer (10 nM Tris-HCl, pH 8.0, 100 mM KC1, 5% glycerol from) for 8 hours at 37°C and then held at 4°C.
- recombination buffer 10 nM Tris
- AttB_66 and attP_72 oligonucleotide substrates for HIDE-seq 25 pl of attB_66bp_P_Top (lOOpM) and 25 pl of attB_66bp_P_Bot (lOOpM) or 25 pl of attP_72bp_P_Top (lOOpM) and 25 pl of attP_72bp_P_Bot (lOOpM), respectively were mixed together in wells of a 20 pl 96 well plate.
- Oligonucleotides were resuspended in IDT duplex buffer (IDT). The plate with oligonucleotide samples were placed into a thermocycler, and the following anneal program was performed with a 105°C heated lid:
- iPSCs induced pluripotent stem cells harboring an on-target Bxbl integrase attB sequence at the B2M gene (Tome iPSC clone 52) that were edited by electroporation of Bxbl integrase mRNA and plasmid DNA cargo containing an Bxbl integrase attP sequence (EXP23001648) were treated with the Qiagen Blood and Tissue DNA extraction kit to isolate gDNA.
- iPSCs induced pluripotent stem cells harboring an on-target Bxbl integrase attB sequence at the B2M gene (Tome iPSC clone 52) that were edited by electroporation of Bxbl integrase mRNA and plasmid DNA cargo containing an Bxbl integrase attP sequence (EXP23001648) were treated with the Qiagen Blood and Tissue DNA extraction kit to isolate gDNA.
- tbHCA Tome Bio Hybrid Capture Analysis, github.com/tomebio/tbHCA
- FASTQ files from Illumina sequencing were loaded into a custom bioinformatic pipeline called tbHCA (Tome Bio Hybrid Capture Analysis, github.com/tomebio/tbHCA) developed to quantify edited reads corresponding to unedited reads, reads containing indels, reads containing cargo DNA sequence and thus represent recombined reads, or structural variants such as translocations.
- input data in FASTQ or BAM format underwent quality control using FASTP, which included adapter trimming, quality filtering, and extraction of unique molecular identifiers (UMIs). Additionally, paired-end reads were merged into a single file.
- UMIs unique molecular identifiers
- the processed reads were aligned to the hg38 reference genome using BWA aligner.
- UMIs were deduplicated from the aligned BAM files to remove PCR duplicates.
- Target information was collected and used to generate reference amplicons.
- Target reads were extracted from the deduplicated BAM files and aligned to the reference amplicons using BWA aligner.
- Editing events were quantified using a custom Python script designed to analyze the aligned target reads. Visualization of editing events was performed using a custom script to generate graphical representations. Editing sites were collated across samples are outputted into an excel spreadsheet using a custom script.
- An HTML report was generated using papermill (papermill. readthedocs.io/en/latest/index.html) to summarize the results of the analysis.
- LSIs Large Serine Integrases
- IntQuery a deep learning model that can predict LSI activity genome-wide.
- IntQuery was trained on quantitative off-target data from 410,776 cryptic attB sequences discovered by Cryptic-seq, an unbiased in vitro discovery technology for LSI off-target recombination.
- IntQuery can accurately predict in vitro LSI activity, providing a tool for in silico off-target prediction of large serine integrases to advance therapeutic applications.
- the large serine integrase (LSI) family constitutes a diverse group of site-specific recombinases that play pivotal roles in mediating DNA rearrangements 1 ' 3 .
- Serine integrases in contrast to their tyrosine recombinase counterparts, utilize a serine residue for catalysis, leading to distinct mechanistic features4.
- This large family encompasses integrases with varying sizes and functionalities, with notable members including PhiC31 integrase from Streptomyces bacteriophage PhiC31 5 and Bxbl integrase discovered in my cobacteriophage Bxbl 6 .
- PhiC31 and Bxbl integrases are well-recognized for their utility in site-specific recombination applications by virtue of direct recombination between phage attachment site attP and bacterial attachment site attB with the requirement of no co-factors or DNA supercoiling 7 ' 9 .
- the precise and efficient DNA manipulation capabilities of the large serine integrase family have positioned it as an attractive tool for synthetic biology and genome editing applicationslO-14.
- This approach to gene insertion has the advantage of minimizing unintended editing at the on-target locus, but it still presents the risk of potential off-target insertion and gross chromosomal rearrangements related to LSI-mediated recombination at ‘cryptic’ or ‘pseudo’ attachment sequences that may be present in the human genome 20 ' 22 .
- LSI attachment sequences are demarcated by a canonical central dinucleotide 2 which promotes annealing and ligation after 180-degree rotation to complete the recombination reaction. While the central dinucleotide does not directly interact with the LSI protein, it plays a critical role in determining recombination specificity because the landscape of cryptic attachment sites aligns with the central dinucleotide of the complementary attachment sequence. When applied to CRISPR-directed integrases, this feature allows simultaneous and specific multiplex gene insertion of unique cargos by utilizing orthogonal central dinucleotidesl4.
- Cryptic-seq is a quantitative biochemical off-target discovery assay because each recombination event imparts a unique molecular identifier (UMI) to the NGS reads 20 .
- UMI unique molecular identifier
- IntQuery provides a simple method for predicting LSI cryptic attachment site identity and activity directly from DNA sequence. We anticipate that IntQuery will be valuable as an in-silico method for discovery of off-target sites and prioritization of potential high-activity sites for verification in LSI-edited cells of interest. 6.4.2. Discussion
- IntQuery is a machine learning approach to predict the potential for LSI recombination at any site in the human genome.
- the approach taken by IntQuery is applicable to empirical LSI off-target discovery data from any genome.
- Computational off- target prediction with IntQuery complements empirical discovery data generated in the laboratory to help scientists prioritize potential off-target sites for experimental verification in edited cells. This two-step strategy was inspired by the approaches pioneered for the first Cas9 genome editing therapies 24, 26, 31 , because the risk of off-target editing from Cas9 and LSIs are both dependent on DNA sequence homology 22, 32 .
- IntQuery can also be used for variant-aware off-target prediction 33 for any large serine integrase, including naturally discovered 12, 14 and engineered 21, 34, 35 versions, by generating a large quantitative discovery dataset with technologies like Cryptic-Seq 20 .
- Experimental verification of IntQuery off-target predictions in edited cells will increase its relevance in supporting the non-clinical studies required for a new drug application.
- We hope our communication of a deep learning off-target prediction tool will help enable the safe development of LSI-based therapeutics.
- the position weight matrix discovered by HIDE-Seq 20 was used to predict locations in the hg38 human reference genome 36 that might be loci for off-target recombination by Bxbl. Briefly, we aligned the sequences flanking the integration sites discovered by HIDE- seq20 and generated a custom motif file based on the position frequency matrix and ran HOMER v4.11 motif analysis (scanMotifGenomeWide.pl) 29 with default parameters to predict binding at all sites in the hg38 human reference genome.
- Cryptic-seq was performed as previously described 20 with the following modifications.
- End prep and dA tailing of the gDNA was performed by using NEBNext® UltraTM II DNA Library Prep Kit for Illumina® (E7645L) as protocol instructed, after which Illumina TruSeq sequencing annealed Y adaptors containing 8 bp UMIs were ligated to gDNA listed below.
- Reaction 1 contains the following plasmids at equimolar ratios: PL2327 AA, PL2341 TT, PL2337 GG, PL2332 CC, PL2339 TC, PL2335 GA, with a final total plasmid concentration of 30 nM.
- the plasmid pool was then reacted with 1 pg sheared/adapter-ligated gDNA and 1 pM Bxbl integrase at 37°C for 4h in a recombination buffer 7 .
- Reaction 2 contains the following plasmids at equimolar ratios: PL2312 GT, PL2328 AC, PL2331 CA, PL2340 TG, PL2334 CT, PL2329 AG and was reacted under the same conditions as reaction described above.
- a representative sequence map of the cryptic-seq plasmid PL2312 GT can be found in our previous publication 20 with all plasmids in this study being identical to PL2312 except for the indicated central attP dinucleotide.
- the reaction was stopped by adding sodium dodecyl sulfate to a final concentration of 0.1% and the products were cleaned using the Zymo clean and concentrator kit (Zymo research).
- PCR was used to amplify the regions of integration using Q5 polymerase (NEB) and common primer for P5 (LM_P5) and N7 primer (GN037_CrypticSeq_N7_i7_N701). Following PCR, a 1.5x AMPure bead clean-up was performed, and products were run on a Tapestation 4200 (Agilent Technologies) using DI 000 tape to confirm amplification. Library concentration was determined using the Next Library Quant Kit (NEB) for Illumina. Libraries were sequenced on an Illumina NovaSeq (Fulgent Genetics).
- Bxbl integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome.
- HIDE-Seq High- throughput Integrase-mediated DNA Event Sequencing
- Cryptic-Seq is a PCR-based unbiased genomewide biochemical or cellular-based assay that is more sensitive than HIDE-Seq but is limited to the discovery of sites with off-target recombination.
- HIDE-Seq and Cryptic-Seq discovered 38 and 44,311 potential off-target sites respectively. 2,455 sites were prioritized for validation by hybrid capture NGS in LSI-edited K562 cells and off-target integration was detected at 52 of the sites.
- WGS whole genome sequencing
- the large serine integrase (LSI) family constitutes a diverse group of site-specific recombinases that play pivotal roles in mediating DNA rearrangements 1 ' 3 .
- Serine integrases in contrast to their tyrosine recombinase counterparts, utilize a serine residue for catalysis, leading to distinct mechanistic features 4 .
- This large family encompasses integrases with varying sizes and functionalities, with notable members including PhiC31 integrase from Streptomyces bacteriophage PhiC31 5 and Bxbl integrase discovered in my cobacteriophage Bxbl6.
- PhiC31 and Bxbl integrases are well-recognized for their utility in site-specific recombination applications by virtue of direct recombination between phage attachment site attP and bacterial attachment site attB with the requirement of no co-factors or DNA supercoiling 7 ' 9 .
- the precise and efficient DNA manipulation capabilities of the large serine integrase family have positioned it as an attractive tool for synthetic biology and genome editing applications 10 ' 14 .
- This cleavage mechanism covalently bonds DNA strands to integrase, avoiding free DNA ends and the opportunity for mis-repair by host DNA machinery. Subunits exchange places, promoting irreversible ligation through the formation of new attachment sites, attL and attR. Unlike CRISPR/Cas9 knock-in approaches that depend on cellular DNA repair machineries5, 15, LSI integration is deterministic and independent of host cell factors 5 ‘ 15 .
- I- PGI integrase-mediated programmable genomic integration
- HIDE-Seq High-throughput Integrase- mediated DNA Event Sequencing
- Cryptic-Seq is a PCR-based unbiased genome-wide biochemical or cellular-based assay that is more sensitive than HIDE-Seq but is limited to the discovery of sites with off-target recombination.
- HIDE-Seq is an unbiased and genome-wide LSI off- target discovery technology
- LSI Bxbl integrase 6, 22 and isolated human genomic DNA from two sources (1) a HEK293 cell line that contains two copies of the Bxbl attB site in a lentiviral vector integrated into a single locus on chromosome 5 (referred to from here on as HEK293attB) and (2) commercially available peripheral blood mononuclear cells (PBMCs).
- a HEK293 cell line that contains two copies of the Bxbl attB site in a lentiviral vector integrated into a single locus on chromosome 5
- PBMCs peripheral blood mononuclear cells
- HIDE-seq allows for tuning and supraphy siological saturation of LSI and DNA substrates in the reaction to enable sensitive discovery of potential off-target events to deeply map the landscape of potential off-target events across any given DNA sample.
- 8 pg of PBMC or HEK293 ⁇ //7> gDNA was incubated with 10 nM attP or attB linear substrate and 400 nM of recombinant Bxbl integrase in various reactions in triplicate for 8 hours at 37C° as outlined in Table 14.
- HIDE-seq identified high levels of on-target integration between the attP donor and the endogenous attB sites in the HEK293 ⁇ //7> gDNA samples, with integration read count frequencies of 100%, 100%, and 94% for HEK293attS Bxbl + attP donor replicates 1, 2, and 3 respectively (quantified as percentage of alii. and attR reads versus attB total read counts, data not shown).
- An example visualization image of HEK293attB Bxbl + attP donor replicate 1 displaying 100% integration of the attP donor in integrative genomics viewer 25 (IGV) is shown in Figure 18C.
- HIDE-seq is a sensitive and unbiased approach to identify any potential FDEs or off-target integration events by LSIs.
- HIDE-seq can identify potential LSI cryptic sites via sequencing whole genomes
- This approach Cryptic-seq, and it relies on the application of a specialized DNA donor substrate to enable one-step PCR enrichment and sequencing of recombined cryptic sites ( Figure 19A).
- gDNA is first tagmented with Tn5 transposase21 loaded DNA oligonucleotides containing Illumina P5 sequencing adaptors, i5 indexes, and a unique molecule identifier (UMI) 27 ' 29 ( Figure 19A).
- This pre-tagmentation of the gDNA is a crucial step to increase the sensitivity of the assay as it prevents sequencing of any unrecombined donor DNA substrate.
- gDNA is incubated with recombinant Bxbl integrase and a Cryptic-seq plasmid-based vector that contains either attP (GT dinucleotide) flanked by a unique barcode sequence (BC1) and an Illumina N7 primer binding site 5’ of the P half-site of attP, and on 3’ side of the P’ site of the attP a second unique barcode sequence (BC2) and an Illumina P7 primer binding site (Figure 19A).
- attP GT dinucleotide
- the gDNA is purified and PCR amplified with i7 indexed primers that prime off either the N7 or P7 primer binding site in the cryptic-seq vector for one-step amplification of recombined DNA fragments (Figure 19A), followed by NGS and bioinformatic identification of genomic reads sequence-tagged with either the P or P’ half-sites from the attP donor.
- Cryptic-seq is a biochemical-based assay and the reactions are performed with supraphy siological concentrations of Bxbl integrase and DNA donor, conditions far higher than we can achieve in a cell, for maximum sensitivity to enable deep discovery of any potential off-target events.
- the synthetic DNA fragment was titrated into human gDNA at specific copy number concentrations (95%, 50%, 10%, 1%, 0.1%, 0.05%, 0.001%, and 0%) to generate an 8-point standard curve.
- DNA standards were processed in triplicate by NGS library prep and target enrichment with hybrid capture panels containing probes hybridizing either left or right of CAS031.
- Illumina short-read sequencing and bioinformatic analysis was performed via our custom hybrid capture analysis pipeline.
- hybrid capture NGS affords accurate quantitation across standard curve and reliable detection of recombined reads (as defined as detection of the edit in > 2/3 replicates) down to 0.1%and single-point detection with both left and right CAS031 probes and single point detection as low as 0.05% with the left probe at the level of sequencing performed in this experiment.
- a read UMI count
- K562 cell line that contains two atlB-Gl sequences in different genomic locations. This line was generated by transducing K562 cells with a lentivirus carrying insert in a PGK-attB-EFla-PuroR cassette that integrated at unique genomic sites in both chromosome 6 and 17 (K562attB cells).
- the probe panel consists of discovered cryptic sites and does not contain an on-target probe targeting the lentiviral cassette that contains the canonical attB sequence and, given the concordance in detection by left and right probes shown in Figure 20B, we designed the panel to contain only a single capture probe (either left or right) for each target (i.e., single sided panels). This inclusion criteria resulted in a single-sided probe panel size of 2,455 genomic sites.
- HIDE-seq discovered 36 potential cryptic attB sites (Figure 18E) with 16 of these sites present in the hybrid capture panel and 5 of the 16 tested sites displaying off-target editing in K562attB cells. As a result, HIDE-seq displays a false positive rate of 68.8% (l-(5 validated sites 16 tested discovered sites) and false negative rate of 90.6% (l-(5 validated sites ⁇ -53 validated sites) ( Figure 20G).
- Cryptic-seq discovered 44,311 unique cryptic attBs with 2,455 of these sites present in the hybrid capture panel and 53 of these sites displaying off-target editing in K562attB cells. Accordingly, Cryptic-seq displays a false positive rate of 97.8% ( 1 -(53 validated sites 2,455 tested discovered sites) and false negative rate of 0% ( 1 -(53 validated sites 53 validated sites) ( Figure 20G).
- LSIs can be used in combination with Cas9 nickases and writing enzymes (e.g. reverse transcriptase 11 , 14 or ligases 38 ) to enable therapeutic applications including endogenous gene replacement and the development of highly engineered cell-based medicines.
- LSIs allow for the directional, seamless integration of large DNA sequences without depending on FDEs or cellular DNA repair pathways. To date, comprehensive methods for detecting and validating LSI specificity have not been developed but will be required to develop integrase-based therapeutics.
- Standard genotoxicity evaluation approaches like the Ames test or Comet assays 39 , are not suitable for homology dependent off-target editing from an LSI or an RNA-guided nuclease because they are designed to detect the effects of homology independent sources DNA damage, such as chemical or physical agents.
- Homology dependent off-target editing poises the risk of recurrent gene disruption of a tumor suppressor that would be undetectable with standard approaches. Therefore, we have sought to contribute fit for purpose and sensitive methods to evaluate the specificity any LSI in the human genome to help advance the therapeutic application of endogenous gene replacement.
- the approaches we advance here comprise a comprehensive and sensitive suite of assays to discover and validate LSI off-target events to directly address regulatory and genotoxicity considerations for LSI-based strategies in gene and cell therapy approaches.
- HIDE-seq is a readily applicable and unbiased biochemicalbased WGS approach for identification of potential LSI off-target outcomes such as indels, cryptic integrations, and large structural variants.
- Cryptic-seq uses supraphy si ologi cal amounts of integrase and DNA template, significantly higher than a human cell would ever be exposed to in a therapeutic setting, followed by PCR enrichment, to identify a deep pool of off target sites for expansive analysis.
- Both HIDE-seq and Cryptic-seq can be adapted to accommodate multiple LSIs (i.e., multiplexing) or novelly designed LSIs.
- both assays can be performed with genomic DNA isolated from therapeutically relevant cell types (e.g., human donor derived T cells, CD34+ hematopoietic stem and progenitor cells, primary human hepatocytes) or mixed populations of genomic DNA for more genetically relevant discovery of potential off-target sites.
- hybrid capture NGS To readily perform scalable off-target validation of up to 1000s of potential LSI off-target sites in relevant edited cell types, we present a customized application of hybrid capture NGS that is tailored to accurately detect and quantify LSI off-target events, such as indels and cryptic integrations, which we benchmark to bulk WGS.
- hybrid capture NGS has greater than 98% apparent validation hit rate (52/53 validated sites detected) compared to an apparent hit rate of WGS (at 40x coverage) of 9.4% (5/53 validated sites detected).
- lentivirus containing a transfer plasmid with an EFla-PuroR- WPRE backbone containing a 46 bp Bxbl attB insert (PL312) was transduced into HEK293 cells.
- Cells with Low MOI were plated in sterile 96 well plates under puromycin selection via serial dilutions for clone selection.
- confirmation of lentiviral copy and identification of the lentivirus insertion site was performed using ligation mediated PCR with primers targeting the 5’ and 3’ LTRs along with Cergentis TLA mapping confirmed single lentiviral insertion on chromosome 5 (data not shown).
- Lentiviral plasmid (PL2811) with a PGK-EFla-PuroR backbone containing a 46 bp Bxbl attB inserts was acquired from GenScript and used for lentiviral production at Azenta.
- K562 cells (ATCC, Cat# CCL- 243) were transduced with lentivirus doses of 5 pL, 10 pL, 20 pL, 30 pL and 50 pL infected by spinfection at 1000g for 30 minutes at 33 °C to find appropriate dose of lentivirus with low multiplicity of infection (MOI).
- HIDE-seq reactions with 8 pg of purified gDNA from PBMCs (Qiagen Blood and Tissue kit) or, in separate reactions, 8 pg of purified gDNA HEK293attB cells was incubated with 10 nM attB_66bp annealed oligonucleotide (for cryptic attP site or FDE discovery) or 10 nM attP_72bp annealed oligonucleotide (for cryptic attB site or FDE discovery) and 400 nM of recombinant Bxbl integrase (GenScript) in recombination buffer?
- AttB_66 and attP_72 oligonucleotide substrates for HIDE-seq 25 pl of attB_66bp_P_Top (100 mM) and 25 pl of attB_66bp_P_Bot (100 mM) or 25 pl of attP_72bp_P_Top (100 mM) and 25 pl of attP_72bp_P_Bot (100 mM), respectively were mixed together in wells of a 20 pl 96 well plate.
- oligonucleotides were resuspended in duplex buffer (Integrated DNA technologies (IDT)).
- the plate with oligonucleotide samples were placed into a thermocycler, and the following anneal program was performed with a 105°C heated lid: 95°C/2 minutes, 750 cycles at -0.1°C per cycle/ls, 4°C hold.
- Reactions were inactivated with RNAase A (New England Biolabs (NEB) Monarch) and then Proteinase K (NEB) and gDNA was purified with 0.7x AMPure (Beckmann Coulter) bead cleanup.
- Control Digenome-seq reactions were performed with gDNA and spCas9 (IDT) with VEGFA S2 sgRNA (IDT) were performed according to the published protocol 21 .
- Isolated gDNA from both HIDE-seq and Digenome-seq reactions was quantified by Qubit fluorometry (Thermo Fisher Scientific) and submitted for PCR-free WGS at 30x coverage (Azenta).
- tbDigln github.com/didacs/tbDigln
- Tome Biosciences and Fulcrum Genomics which quantified the number of unclipped reads (for FDE detection) or soft-clipped reads (recombined sites).
- the bioinformatic workflow for tbDigln starts with the alignment of sequencing reads from FASTQ files against the hg38 human reference genome41 with appended attP and attB sequences by BWA aligner42 to create mapped BAM files for each sample.
- the resulting BAM files are deduplicated by Picard 43 and inputted into HIDE, a modified Digenomitas 24 pipeline to output .csv files containing sites per sample as clipped counts for integration events and unclipped counts for FDEs.
- HEK293attB gDNA was incubated with Tn5 transposase 27 (custom purification by GenScript) pre-annealed with MES_Rev_3InvdR and MES_AmpSeq_P5 oligos) for 7 minutes at 55°C to tagment the gDNA and shear it to an average size of approximately 700bp.
- the Cryptic-seq donor plasmid includes an attP integrase attachment site with a GT dinucleotide flanked by a unique 12 bp barcode sequence (BC1) and Illumina N7 primer binding site (PBS) on the 5’ side of the attP and a unique barcode sequence (BC2) sequence on and an Illumina P7 sequencing primer binding site (PBS) on the 3’ side of the attP.
- BC1 and Illumina N7 primer binding site PBS
- BC2 unique barcode sequence
- PBS Illumina P7 sequencing primer binding site
- PCR was used to amplify the regions of integration using Q5 polymerase (NEB) and primers for P5 containing distinct barcodes for each sample (LM P5 F1-F3) and a single N7 primer (GN037_CrypticSeq_N7_i7_N701).
- NEB Q5 polymerase
- primers for P5 containing distinct barcodes for each sample LM P5 F1-F3
- a 1.5x AMPure bead clean-up was performed, and products were run on a Tapestation 4200 (Agilent Technologies) using DI 000 tape to confirm amplification.
- Library concentration was determined using the Next Library Quant Kit (NEB) for Illumina. Libraries were normalized to 2nM, loaded at a concentration of 750pM and sequenced via Illumina Next Seq according to the manufacturer's instructions.
- Reads trimmed for att and ME sequences are aligned against the hg38 human reference genome41 with appended attP and attB sequences by BWA aligner42 to create mapped BAM files for each sample.
- the resulting BAM files are deduplicated by Picard43 and queried for integration sites with to generate output .csv files containing sites per sample along with collation of sites across samples to generate output .csv files containing collated sites.
- gBlocks were designed to include approximately 200 bp left and right of the recombination junction (attL or attK) dinucleotide followed by approximately 1300 bp of random DNA stuff er sequence at each end to achieve a fragment length of approximately 3000 bp for optimal DNA fragmentation (CHR6_RC_Off_PL753_attL, CHR6_RC_Off_PL753_attR).
- Standard curve titrations were prepared by spiking in gBlocks into human gDNA (Promega catalog #G304A) at molar ratios representing 95%, 50%, 10%, 1%, 0.1%, 0.05%, 0.001%, and 0% I-PGI. Samples were fragmented to an average size of 550 bp via sonication (Covaris ME220) and NGS libraries prepared according to the IDT xGen DNA Library Prep Kit MC UNI (Version 2) protocol using xGenTM UDI-UMI Adapters (IDT catalog #10005903).
- Target enrichment was performed according to xGenTM hybridization capture of DNA libraries (Version 7) protocol using custom hybrid capture panels containing probes either left (5’) or right (3’) of CAS031.
- Hybrid capture libraries were sequenced by Illumina short-read sequencing via NextSeq 2000 paired-end (2 x 150 bp) sequencing using standard NextSeq 2000 P3 Reagents (300-Cycles) (Illumina catalog #20040561) and analyzed using a custom hybrid capture bioinformatics pipeline called tbREVEAL developed at Tome Biosciences (described below).
- K562 ⁇ //7> cells were maintained in RMPI1640 media (Gibco) +10% fetal bovine serum (Gibco) + 2ug/ml puromycin (Life Technologies) and passaged at ratio of 0.1 million cells per ml every 3-5 days.
- 200,000 cells were electroporated with 3 pg mRNA for Bxb 1 integrase and 3 pg of a donor construct containing attP integrase attachment sites and a GT central dinucleotide using the Lonza 4D-NucleofectorTM X Unit according to the manufacturer’s instructions for K562 cells. After 3 days, 80% of the cells were collected for gDNA extraction using the Qiagen blood and tissue kit and the remaining 20% were expanded then banked.
- Genomic DNA extracted from edited K562attB cells were used as input for hybridization capture NGS.
- a Covaris ME220 focused ultrasonicator was used to shear 400 ng of Rep 1, Rep 2, Control 1, and Control 2 gDNA samples to an average size of 400 bp.
- Fragmented DNA was end repaired and A-tailed followed by ligation of Illumina sequencing adapters using Twist library preparation kit (cat# 104177) to create Illumina paired end sequencing libraries.
- Hybridization, capture and post capture amplification of the prepared libraries to probes was performed using Twist target enrichment standard hybridization kit as per manufacturer’s instructions (Twist Library Preparation Kit 1, Mechanical Fragmentation, cat # 100876).
- paired-end reads are merged into a single file.
- the processed reads are aligned to the hg38 reference genome 41 using BWA aligner 42 .
- UMIs are deduplicated from the aligned BAM files to remove PCR duplicates.
- Target information is collected and used to generate reference amplicons.
- Target reads are extracted from the deduplicated BAM files and aligned to the reference amplicons using BWA aligner 42 .
- Editing events are quantified using a custom Python script designed to analyze the aligned target reads. Visualization of editing events is performed using a custom script to generate graphical representations. Editing sites are collated across samples are outputted into an excel spreadsheet using a custom script. HTML reports by papermill (papermill. readthedocs.io/en/latest/index.html) are generated to summarize the results of the analysis.
- Bxbl integrase-edited Rep 1 and Rep 2 K562a//// gDNA genomic DNA was extracted using DNeasy blood and tissue kit (Qiagen) and samples library preparation and Illumina short read sequencing with a target of 60x genomic coverage.
- genomic DNA was sheared using a Covaris ME220 sonicator to average size of 400 bp, sheared fragments were end repaired, A-tailed and ligated with Illumina sequencing adapters and the resulting library was sequenced on an Illumina NovaSeq platform (Azenta).
- FASTQ files from Illumina sequencing were mapped to the hg38 reference genome and cargo reference using BWA 42 followed by fgsv 26 from Fulcrum Genomics was used to predict potential chimeric reads. Chimeric reads with one breakpoint at the reference genome and with the second breakpoint at the attP dinucleotides in the DNA cargo reference were retained for further investigation.
- Bxbl integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome.
- Ligase-mediated programmable genomic integration (L-PGI): an efficient site-specific gene editing system that overcomes the limitations of reverse transcriptase-based editing systems. bioRxiv, 2024.2009.2027.615478 (2024).
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP25721812.3A EP4677108A1 (en) | 2024-04-22 | 2025-04-22 | Method and compositions for detecting off-target editing |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202463637351P | 2024-04-22 | 2024-04-22 | |
| US63/637,351 | 2024-04-22 | ||
| US202463643820P | 2024-05-07 | 2024-05-07 | |
| US63/643,820 | 2024-05-07 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025224107A1 true WO2025224107A1 (en) | 2025-10-30 |
Family
ID=95559067
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/EP2025/060933 Pending WO2025224107A1 (en) | 2024-04-22 | 2025-04-22 | Method and compositions for detecting off-target editing |
Country Status (2)
| Country | Link |
|---|---|
| EP (1) | EP4677108A1 (en) |
| WO (1) | WO2025224107A1 (en) |
Citations (104)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4186183A (en) | 1978-03-29 | 1980-01-29 | The United States Of America As Represented By The Secretary Of The Army | Liposome carriers in chemotherapy of leishmaniasis |
| US4217344A (en) | 1976-06-23 | 1980-08-12 | L'oreal | Compositions containing aqueous dispersions of lipid spheres |
| US4235871A (en) | 1978-02-24 | 1980-11-25 | Papahadjopoulos Demetrios P | Method of encapsulating biologically active materials in lipid vesicles |
| US4261975A (en) | 1979-09-19 | 1981-04-14 | Merck & Co., Inc. | Viral liposome particle |
| US4485054A (en) | 1982-10-04 | 1984-11-27 | Lipoderm Pharmaceuticals Limited | Method of encapsulating biologically active materials in multilamellar lipid vesicles (MLV) |
| US4501728A (en) | 1983-01-06 | 1985-02-26 | Technology Unlimited, Inc. | Masking of liposomes from RES recognition |
| US4774085A (en) | 1985-07-09 | 1988-09-27 | 501 Board of Regents, Univ. of Texas | Pharmaceutical administration systems containing a mixture of immunomodulators |
| US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
| US4837028A (en) | 1986-12-24 | 1989-06-06 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
| US4897355A (en) | 1985-01-07 | 1990-01-30 | Syntex (U.S.A.) Inc. | N[ω,(ω-1)-dialkyloxy]- and N-[ω,(ω-1)-dialkenyloxy]-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| US4946787A (en) | 1985-01-07 | 1990-08-07 | Syntex (U.S.A.) Inc. | N-(ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| US5013556A (en) | 1989-10-20 | 1991-05-07 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
| US5049386A (en) | 1985-01-07 | 1991-09-17 | Syntex (U.S.A.) Inc. | N-ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)Alk-1-YL-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| WO1991016024A1 (en) | 1990-04-19 | 1991-10-31 | Vical, Inc. | Cationic lipids for intracellular delivery of biologically active molecules |
| WO1991017424A1 (en) | 1990-05-03 | 1991-11-14 | Vical, Inc. | Intracellular delivery of biologically active substances by means of self-assembling lipid complexes |
| US5173414A (en) | 1990-10-30 | 1992-12-22 | Applied Immune Sciences, Inc. | Production of recombinant adeno-associated virus vectors |
| WO1993024641A2 (en) | 1992-06-02 | 1993-12-09 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Adeno-associated virus with inverted terminal repeat sequences as promoter |
| US5846946A (en) | 1996-06-14 | 1998-12-08 | Pasteur Merieux Serums Et Vaccins | Compositions and methods for administering Borrelia DNA |
| US6027726A (en) | 1994-09-30 | 2000-02-22 | Inex Phamaceuticals Corp. | Glycosylated protein-liposome conjugates and methods for their preparation |
| US20030087817A1 (en) | 1999-01-12 | 2003-05-08 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
| US20040142025A1 (en) | 2002-06-28 | 2004-07-22 | Protiva Biotherapeutics Ltd. | Liposomal apparatus and manufacturing methods |
| US20040171156A1 (en) | 1995-06-07 | 2004-09-02 | Invitrogen Corporation | Recombinational cloning using nucleic acids having recombination sites |
| US20070042031A1 (en) | 2005-07-27 | 2007-02-22 | Protiva Biotherapeutics, Inc. | Systems and methods for manufacturing liposomes |
| US8404658B2 (en) | 2007-12-31 | 2013-03-26 | Nanocor Therapeutics, Inc. | RNA interference for the treatment of heart failure |
| US8454972B2 (en) | 2004-07-16 | 2013-06-04 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Method for inducing a multiclade immune response against HIV utilizing a multigene and multiclade immunogen |
| WO2013086354A1 (en) | 2011-12-07 | 2013-06-13 | Alnylam Pharmaceuticals, Inc. | Biodegradable lipids for the delivery of active agents |
| WO2013116126A1 (en) | 2012-02-01 | 2013-08-08 | Merck Sharp & Dohme Corp. | Novel low molecular weight, biodegradable cationic lipids for oligonucleotide delivery |
| US8691750B2 (en) | 2011-05-17 | 2014-04-08 | Axolabs Gmbh | Lipids and compositions for intracellular delivery of biologically active compounds |
| US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
| EP2755986A1 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| EP2755693A2 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| US8795965B2 (en) | 2012-12-12 | 2014-08-05 | The Broad Institute, Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US8822663B2 (en) | 2010-08-06 | 2014-09-02 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| WO2014152940A1 (en) | 2013-03-14 | 2014-09-25 | Shire Human Genetic Therapies, Inc. | Mrna therapeutic compositions and use to treat diseases and disorders |
| US8865406B2 (en) | 2012-12-12 | 2014-10-21 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
| US8889356B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
| US8906616B2 (en) | 2012-12-12 | 2014-12-09 | The Broad Institute Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
| US8993233B2 (en) | 2012-12-12 | 2015-03-31 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
| US9023649B2 (en) | 2012-12-17 | 2015-05-05 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US9068179B1 (en) | 2013-12-12 | 2015-06-30 | President And Fellows Of Harvard College | Methods for correcting presenilin point mutations |
| US9074199B1 (en) | 2013-11-19 | 2015-07-07 | President And Fellows Of Harvard College | Mutant Cas9 proteins |
| US9163284B2 (en) | 2013-08-09 | 2015-10-20 | President And Fellows Of Harvard College | Methods for identifying a target site of a Cas9 nuclease |
| US9228207B2 (en) | 2013-09-06 | 2016-01-05 | President And Fellows Of Harvard College | Switchable gRNAs comprising aptamers |
| US9267135B2 (en) | 2013-06-04 | 2016-02-23 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
| NZ700688A (en) | 2009-12-01 | 2016-02-26 | Shire Human Genetic Therapies | Delivery of mrna for the augmentation of proteins and enzymes in human genetic diseases |
| US9322037B2 (en) | 2013-09-06 | 2016-04-26 | President And Fellows Of Harvard College | Cas9-FokI fusion proteins and uses thereof |
| US9322006B2 (en) | 2011-07-22 | 2016-04-26 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
| US9359599B2 (en) | 2013-08-22 | 2016-06-07 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
| US9405700B2 (en) | 2010-11-04 | 2016-08-02 | Sonics, Inc. | Methods and apparatus for virtualization in an integrated circuit |
| US9526784B2 (en) | 2013-09-06 | 2016-12-27 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
| US9587252B2 (en) | 2013-07-10 | 2017-03-07 | President And Fellows Of Harvard College | Orthogonal Cas9 proteins for RNA-guided gene regulation and editing |
| US9777262B2 (en) | 2013-03-13 | 2017-10-03 | President And Fellows Of Harvard College | Mutants of Cre recombinase |
| US9790490B2 (en) | 2015-06-18 | 2017-10-17 | The Broad Institute Inc. | CRISPR enzymes and systems |
| BR112016030852A2 (en) | 2014-07-02 | 2018-01-16 | Shire Human Genetic Therapies | rna messenger encapsulation |
| US9914939B2 (en) | 2013-07-26 | 2018-03-13 | President And Fellows Of Harvard College | Genome engineering |
| US10000772B2 (en) | 2012-05-25 | 2018-06-19 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| EP3362461A1 (en) | 2015-10-16 | 2018-08-22 | Modernatx, Inc. | Mrna cap analogs with modified phosphate linkage |
| US10077453B2 (en) | 2014-07-30 | 2018-09-18 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
| US10113163B2 (en) | 2016-08-03 | 2018-10-30 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
| US10167457B2 (en) | 2015-10-23 | 2019-01-01 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
| US10190137B2 (en) | 2013-11-07 | 2019-01-29 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAS |
| US10266887B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
| US10377998B2 (en) | 2013-12-12 | 2019-08-13 | The Broad Institute, Inc. | CRISPR-CAS systems and methods for altering expression of gene products, structural information and inducible modular CAS enzymes |
| US10375938B2 (en) | 2015-10-08 | 2019-08-13 | President And Fellows Of Harvard College | Multiplexed genome editing |
| US10385336B2 (en) | 2014-09-05 | 2019-08-20 | Vilnius University | Programmable RNA shredding by the type III-A CRISPR-Cas system of Streptococcus thermophilus |
| US10494621B2 (en) | 2015-06-18 | 2019-12-03 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
| EP3450553B1 (en) | 2014-03-24 | 2019-12-25 | Translate Bio, Inc. | Mrna therapy for treatment of ocular diseases |
| US10519454B2 (en) | 2014-08-06 | 2019-12-31 | Toolgen Incorporated | Genome editing using Campylobacter jejuni CRISPR/CAS system-derived RGEN |
| WO2020014577A1 (en) | 2018-07-13 | 2020-01-16 | Allele Biotechnology And Pharmaceuticals, Inc. | Methods of achieving high specificity of genome editing |
| US10550372B2 (en) | 2013-12-12 | 2020-02-04 | The Broad Institute, Inc. | Systems, methods and compositions for sequence manipulation with optimized functional CRISPR-Cas systems |
| ES2740248T3 (en) | 2011-06-08 | 2020-02-05 | Translate Bio Inc | Lipid nanoparticle compositions and methods for mRNA administration |
| US10577630B2 (en) | 2013-06-17 | 2020-03-03 | The Broad Institute, Inc. | Delivery and use of the CRISPR-Cas systems, vectors and compositions for hepatic targeting and therapy |
| US10612011B2 (en) | 2015-07-30 | 2020-04-07 | President And Fellows Of Harvard College | Evolution of TALENs |
| CA3118135A1 (en) * | 2018-11-02 | 2020-05-07 | Greenvenus, Llc | Serine recombinases mediating stable integration into plant genomes |
| US10648020B2 (en) | 2015-06-18 | 2020-05-12 | The Broad Institute, Inc. | CRISPR enzymes and systems |
| US10689691B2 (en) | 2014-12-19 | 2020-06-23 | The Broad Institute, Inc. | Unbiased identification of double-strand breaks and genomic rearrangement by genome-wide insert capture sequencing |
| US10711285B2 (en) | 2013-06-17 | 2020-07-14 | The Broad Institute, Inc. | Optimized CRISPR-Cas double nickase systems, methods and compositions for sequence manipulation |
| US10731181B2 (en) | 2012-12-06 | 2020-08-04 | Sigma, Aldrich Co. LLC | CRISPR-based genome modification and regulation |
| US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
| US10781444B2 (en) | 2013-06-17 | 2020-09-22 | The Broad Institute, Inc. | Functional genomics using CRISPR-Cas systems, compositions, methods, screens and applications thereof |
| WO2020191241A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| US10787684B2 (en) | 2013-11-19 | 2020-09-29 | President And Fellows Of Harvard College | Large gene excision and insertion |
| US10851380B2 (en) | 2012-10-23 | 2020-12-01 | Toolgen Incorporated | Methods for cleaving a target DNA using a guide RNA specific for the target DNA and Cas protein-encoding nucleic acid or Cas protein |
| US10851369B2 (en) | 2016-06-21 | 2020-12-01 | President And Fellows Of Harvard College | Frequency-based modulation of diverse species in a nucleic acid library |
| US10851357B2 (en) | 2013-12-12 | 2020-12-01 | The Broad Institute, Inc. | Compositions and methods of use of CRISPR-Cas systems in nucleotide repeat disorders |
| US10930367B2 (en) | 2012-12-12 | 2021-02-23 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for Cas enzymes or CRISPR-Cas systems for target sequences and conveying results thereof |
| US10946108B2 (en) | 2013-06-17 | 2021-03-16 | The Broad Institute, Inc. | Delivery, use and therapeutic applications of the CRISPR-Cas systems and compositions for targeting disorders and diseases using viral components |
| US10954514B2 (en) | 2014-12-12 | 2021-03-23 | The Broad Institute, Inc. | Escorted and functionalized guides for CRISPR-Cas systems |
| US10968257B2 (en) | 2018-04-03 | 2021-04-06 | The Broad Institute, Inc. | Target recognition motifs and uses thereof |
| US11001829B2 (en) | 2014-09-25 | 2021-05-11 | The Broad Institute, Inc. | Functional screening with optimized functional CRISPR-Cas systems |
| US11008588B2 (en) | 2013-06-17 | 2021-05-18 | The Broad Institute, Inc. | Delivery, engineering and optimization of tandem guide systems, methods and compositions for sequence manipulation |
| US11021740B2 (en) | 2017-03-15 | 2021-06-01 | The Broad Institute, Inc. | Devices for CRISPR effector system based diagnostics |
| US11041173B2 (en) | 2012-12-12 | 2021-06-22 | The Broad Institute, Inc. | Delivery, engineering and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
| US11060115B2 (en) | 2015-06-18 | 2021-07-13 | The Broad Institute, Inc. | CRISPR enzymes and systems |
| US11071790B2 (en) | 2014-10-29 | 2021-07-27 | Massachusetts Eye And Ear Infirmary | Method for efficient delivery of therapeutic molecules in vitro and in vivo |
| US11085072B2 (en) | 2016-08-31 | 2021-08-10 | President And Fellows Of Harvard College | Methods of generating libraries of nucleic acid sequences for detection via fluorescent in situ sequencing |
| US11104967B2 (en) | 2015-07-22 | 2021-08-31 | President And Fellows Of Harvard College | Evolution of site-specific recombinases |
| US11104937B2 (en) | 2017-03-15 | 2021-08-31 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
| US11111521B2 (en) | 2011-12-22 | 2021-09-07 | President And Fellows Of Harvard College | Compositions and methods for analyte detection |
| US11111472B2 (en) | 2014-10-31 | 2021-09-07 | Massachusetts Institute Of Technology | Delivery of biomolecules to immune cells |
| WO2021226558A1 (en) | 2020-05-08 | 2021-11-11 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
| WO2022087235A1 (en) | 2020-10-21 | 2022-04-28 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
| WO2023070031A2 (en) | 2021-10-21 | 2023-04-27 | Massachusetts Institute Of Technology | Discovery and engineering of integrases for high-efficiency gene integration |
| WO2023122764A1 (en) * | 2021-12-22 | 2023-06-29 | Tome Biosciences, Inc. | Co-delivery of a gene editor construct and a donor template |
-
2025
- 2025-04-22 EP EP25721812.3A patent/EP4677108A1/en active Pending
- 2025-04-22 WO PCT/EP2025/060933 patent/WO2025224107A1/en active Pending
Patent Citations (209)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4217344A (en) | 1976-06-23 | 1980-08-12 | L'oreal | Compositions containing aqueous dispersions of lipid spheres |
| US4235871A (en) | 1978-02-24 | 1980-11-25 | Papahadjopoulos Demetrios P | Method of encapsulating biologically active materials in lipid vesicles |
| US4186183A (en) | 1978-03-29 | 1980-01-29 | The United States Of America As Represented By The Secretary Of The Army | Liposome carriers in chemotherapy of leishmaniasis |
| US4261975A (en) | 1979-09-19 | 1981-04-14 | Merck & Co., Inc. | Viral liposome particle |
| US4485054A (en) | 1982-10-04 | 1984-11-27 | Lipoderm Pharmaceuticals Limited | Method of encapsulating biologically active materials in multilamellar lipid vesicles (MLV) |
| US4501728A (en) | 1983-01-06 | 1985-02-26 | Technology Unlimited, Inc. | Masking of liposomes from RES recognition |
| US5049386A (en) | 1985-01-07 | 1991-09-17 | Syntex (U.S.A.) Inc. | N-ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)Alk-1-YL-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| US4897355A (en) | 1985-01-07 | 1990-01-30 | Syntex (U.S.A.) Inc. | N[ω,(ω-1)-dialkyloxy]- and N-[ω,(ω-1)-dialkenyloxy]-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| US4946787A (en) | 1985-01-07 | 1990-08-07 | Syntex (U.S.A.) Inc. | N-(ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
| US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
| US4774085A (en) | 1985-07-09 | 1988-09-27 | 501 Board of Regents, Univ. of Texas | Pharmaceutical administration systems containing a mixture of immunomodulators |
| US4837028A (en) | 1986-12-24 | 1989-06-06 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
| US5013556A (en) | 1989-10-20 | 1991-05-07 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
| WO1991016024A1 (en) | 1990-04-19 | 1991-10-31 | Vical, Inc. | Cationic lipids for intracellular delivery of biologically active molecules |
| WO1991017424A1 (en) | 1990-05-03 | 1991-11-14 | Vical, Inc. | Intracellular delivery of biologically active substances by means of self-assembling lipid complexes |
| US5173414A (en) | 1990-10-30 | 1992-12-22 | Applied Immune Sciences, Inc. | Production of recombinant adeno-associated virus vectors |
| WO1993024641A2 (en) | 1992-06-02 | 1993-12-09 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Adeno-associated virus with inverted terminal repeat sequences as promoter |
| US6027726A (en) | 1994-09-30 | 2000-02-22 | Inex Phamaceuticals Corp. | Glycosylated protein-liposome conjugates and methods for their preparation |
| US20040171156A1 (en) | 1995-06-07 | 2004-09-02 | Invitrogen Corporation | Recombinational cloning using nucleic acids having recombination sites |
| US5846946A (en) | 1996-06-14 | 1998-12-08 | Pasteur Merieux Serums Et Vaccins | Compositions and methods for administering Borrelia DNA |
| US20030087817A1 (en) | 1999-01-12 | 2003-05-08 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
| US20040142025A1 (en) | 2002-06-28 | 2004-07-22 | Protiva Biotherapeutics Ltd. | Liposomal apparatus and manufacturing methods |
| US8454972B2 (en) | 2004-07-16 | 2013-06-04 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Method for inducing a multiclade immune response against HIV utilizing a multigene and multiclade immunogen |
| US20070042031A1 (en) | 2005-07-27 | 2007-02-22 | Protiva Biotherapeutics, Inc. | Systems and methods for manufacturing liposomes |
| US8404658B2 (en) | 2007-12-31 | 2013-03-26 | Nanocor Therapeutics, Inc. | RNA interference for the treatment of heart failure |
| NZ700688A (en) | 2009-12-01 | 2016-02-26 | Shire Human Genetic Therapies | Delivery of mrna for the augmentation of proteins and enzymes in human genetic diseases |
| US8822663B2 (en) | 2010-08-06 | 2014-09-02 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| US9405700B2 (en) | 2010-11-04 | 2016-08-02 | Sonics, Inc. | Methods and apparatus for virtualization in an integrated circuit |
| US8691750B2 (en) | 2011-05-17 | 2014-04-08 | Axolabs Gmbh | Lipids and compositions for intracellular delivery of biologically active compounds |
| ES2740248T3 (en) | 2011-06-08 | 2020-02-05 | Translate Bio Inc | Lipid nanoparticle compositions and methods for mRNA administration |
| US10323236B2 (en) | 2011-07-22 | 2019-06-18 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
| US9322006B2 (en) | 2011-07-22 | 2016-04-26 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
| EP2755986A1 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| EP2755693A2 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
| WO2013086354A1 (en) | 2011-12-07 | 2013-06-13 | Alnylam Pharmaceuticals, Inc. | Biodegradable lipids for the delivery of active agents |
| US11111521B2 (en) | 2011-12-22 | 2021-09-07 | President And Fellows Of Harvard College | Compositions and methods for analyte detection |
| WO2013116126A1 (en) | 2012-02-01 | 2013-08-08 | Merck Sharp & Dohme Corp. | Novel low molecular weight, biodegradable cationic lipids for oligonucleotide delivery |
| US11008589B2 (en) | 2012-05-25 | 2021-05-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10337029B2 (en) | 2012-05-25 | 2019-07-02 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10487341B2 (en) | 2012-05-25 | 2019-11-26 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10519467B2 (en) | 2012-05-25 | 2019-12-31 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10443076B2 (en) | 2012-05-25 | 2019-10-15 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10526619B2 (en) | 2012-05-25 | 2020-01-07 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10428352B2 (en) | 2012-05-25 | 2019-10-01 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10421980B2 (en) | 2012-05-25 | 2019-09-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10415061B2 (en) | 2012-05-25 | 2019-09-17 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10407697B2 (en) | 2012-05-25 | 2019-09-10 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10400253B2 (en) | 2012-05-25 | 2019-09-03 | The Regents Of The University Of California | Methods and compositions or RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US11028412B2 (en) | 2012-05-25 | 2021-06-08 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US11008590B2 (en) | 2012-05-25 | 2021-05-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10533190B2 (en) | 2012-05-25 | 2020-01-14 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10385360B2 (en) | 2012-05-25 | 2019-08-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10550407B2 (en) | 2012-05-25 | 2020-02-04 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10513712B2 (en) | 2012-05-25 | 2019-12-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10358658B2 (en) | 2012-05-25 | 2019-07-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US11001863B2 (en) | 2012-05-25 | 2021-05-11 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10988782B2 (en) | 2012-05-25 | 2021-04-27 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10988780B2 (en) | 2012-05-25 | 2021-04-27 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10982230B2 (en) | 2012-05-25 | 2021-04-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10358659B2 (en) | 2012-05-25 | 2019-07-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10982231B2 (en) | 2012-05-25 | 2021-04-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10351878B2 (en) | 2012-05-25 | 2019-07-16 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10900054B2 (en) | 2012-05-25 | 2021-01-26 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10563227B2 (en) | 2012-05-25 | 2020-02-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10308961B2 (en) | 2012-05-25 | 2019-06-04 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10301651B2 (en) | 2012-05-25 | 2019-05-28 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10570419B2 (en) | 2012-05-25 | 2020-02-25 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10793878B1 (en) | 2012-05-25 | 2020-10-06 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10774344B1 (en) | 2012-05-25 | 2020-09-15 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10752920B2 (en) | 2012-05-25 | 2020-08-25 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10577631B2 (en) | 2012-05-25 | 2020-03-03 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10000772B2 (en) | 2012-05-25 | 2018-06-19 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10266850B2 (en) | 2012-05-25 | 2019-04-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10227611B2 (en) | 2012-05-25 | 2019-03-12 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10676759B2 (en) | 2012-05-25 | 2020-06-09 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10669560B2 (en) | 2012-05-25 | 2020-06-02 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10113167B2 (en) | 2012-05-25 | 2018-10-30 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10640791B2 (en) | 2012-05-25 | 2020-05-05 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10626419B2 (en) | 2012-05-25 | 2020-04-21 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10612045B2 (en) | 2012-05-25 | 2020-04-07 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10597680B2 (en) | 2012-05-25 | 2020-03-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
| US10851380B2 (en) | 2012-10-23 | 2020-12-01 | Toolgen Incorporated | Methods for cleaving a target DNA using a guide RNA specific for the target DNA and Cas protein-encoding nucleic acid or Cas protein |
| US10731181B2 (en) | 2012-12-06 | 2020-08-04 | Sigma, Aldrich Co. LLC | CRISPR-based genome modification and regulation |
| US8889418B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
| US8993233B2 (en) | 2012-12-12 | 2015-03-31 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
| US8895308B1 (en) | 2012-12-12 | 2014-11-25 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
| US9822372B2 (en) | 2012-12-12 | 2017-11-21 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US8906616B2 (en) | 2012-12-12 | 2014-12-09 | The Broad Institute Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
| US8771945B1 (en) | 2012-12-12 | 2014-07-08 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
| US8932814B2 (en) | 2012-12-12 | 2015-01-13 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
| US8889356B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
| US10930367B2 (en) | 2012-12-12 | 2021-02-23 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for Cas enzymes or CRISPR-Cas systems for target sequences and conveying results thereof |
| US8795965B2 (en) | 2012-12-12 | 2014-08-05 | The Broad Institute, Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US8871445B2 (en) | 2012-12-12 | 2014-10-28 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
| US9840713B2 (en) | 2012-12-12 | 2017-12-12 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US8945839B2 (en) | 2012-12-12 | 2015-02-03 | The Broad Institute Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
| US8865406B2 (en) | 2012-12-12 | 2014-10-21 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
| US11041173B2 (en) | 2012-12-12 | 2021-06-22 | The Broad Institute, Inc. | Delivery, engineering and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
| US8999641B2 (en) | 2012-12-12 | 2015-04-07 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
| US9023649B2 (en) | 2012-12-17 | 2015-05-05 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US10717990B2 (en) | 2012-12-17 | 2020-07-21 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US9260723B2 (en) | 2012-12-17 | 2016-02-16 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US10435708B2 (en) | 2012-12-17 | 2019-10-08 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US9970024B2 (en) | 2012-12-17 | 2018-05-15 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US10273501B2 (en) | 2012-12-17 | 2019-04-30 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US9777262B2 (en) | 2013-03-13 | 2017-10-03 | President And Fellows Of Harvard College | Mutants of Cre recombinase |
| WO2014152940A1 (en) | 2013-03-14 | 2014-09-25 | Shire Human Genetic Therapies, Inc. | Mrna therapeutic compositions and use to treat diseases and disorders |
| US9267135B2 (en) | 2013-06-04 | 2016-02-23 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
| US10640789B2 (en) | 2013-06-04 | 2020-05-05 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
| US10767194B2 (en) | 2013-06-04 | 2020-09-08 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
| US11008588B2 (en) | 2013-06-17 | 2021-05-18 | The Broad Institute, Inc. | Delivery, engineering and optimization of tandem guide systems, methods and compositions for sequence manipulation |
| US10946108B2 (en) | 2013-06-17 | 2021-03-16 | The Broad Institute, Inc. | Delivery, use and therapeutic applications of the CRISPR-Cas systems and compositions for targeting disorders and diseases using viral components |
| US10577630B2 (en) | 2013-06-17 | 2020-03-03 | The Broad Institute, Inc. | Delivery and use of the CRISPR-Cas systems, vectors and compositions for hepatic targeting and therapy |
| US10781444B2 (en) | 2013-06-17 | 2020-09-22 | The Broad Institute, Inc. | Functional genomics using CRISPR-Cas systems, compositions, methods, screens and applications thereof |
| US10711285B2 (en) | 2013-06-17 | 2020-07-14 | The Broad Institute, Inc. | Optimized CRISPR-Cas double nickase systems, methods and compositions for sequence manipulation |
| US9587252B2 (en) | 2013-07-10 | 2017-03-07 | President And Fellows Of Harvard College | Orthogonal Cas9 proteins for RNA-guided gene regulation and editing |
| US10329587B2 (en) | 2013-07-10 | 2019-06-25 | President And Fellows Of Harvard College | Orthogonal Cas9 proteins for RNA-guided gene regulation and editing |
| US10563225B2 (en) | 2013-07-26 | 2020-02-18 | President And Fellows Of Harvard College | Genome engineering |
| US9914939B2 (en) | 2013-07-26 | 2018-03-13 | President And Fellows Of Harvard College | Genome engineering |
| US10508298B2 (en) | 2013-08-09 | 2019-12-17 | President And Fellows Of Harvard College | Methods for identifying a target site of a CAS9 nuclease |
| US9163284B2 (en) | 2013-08-09 | 2015-10-20 | President And Fellows Of Harvard College | Methods for identifying a target site of a Cas9 nuclease |
| US10954548B2 (en) | 2013-08-09 | 2021-03-23 | President And Fellows Of Harvard College | Nuclease profiling system |
| US9359599B2 (en) | 2013-08-22 | 2016-06-07 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
| US11046948B2 (en) | 2013-08-22 | 2021-06-29 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
| US10227581B2 (en) | 2013-08-22 | 2019-03-12 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
| US9340799B2 (en) | 2013-09-06 | 2016-05-17 | President And Fellows Of Harvard College | MRNA-sensing switchable gRNAs |
| US10912833B2 (en) | 2013-09-06 | 2021-02-09 | President And Fellows Of Harvard College | Delivery of negatively charged proteins using cationic lipids |
| US9388430B2 (en) | 2013-09-06 | 2016-07-12 | President And Fellows Of Harvard College | Cas9-recombinase fusion proteins and uses thereof |
| US10597679B2 (en) | 2013-09-06 | 2020-03-24 | President And Fellows Of Harvard College | Switchable Cas9 nucleases and uses thereof |
| US9322037B2 (en) | 2013-09-06 | 2016-04-26 | President And Fellows Of Harvard College | Cas9-FokI fusion proteins and uses thereof |
| US9228207B2 (en) | 2013-09-06 | 2016-01-05 | President And Fellows Of Harvard College | Switchable gRNAs comprising aptamers |
| US9340800B2 (en) | 2013-09-06 | 2016-05-17 | President And Fellows Of Harvard College | Extended DNA-sensing GRNAS |
| US10858639B2 (en) | 2013-09-06 | 2020-12-08 | President And Fellows Of Harvard College | CAS9 variants and uses thereof |
| US9737604B2 (en) | 2013-09-06 | 2017-08-22 | President And Fellows Of Harvard College | Use of cationic lipids to deliver CAS9 |
| US9999671B2 (en) | 2013-09-06 | 2018-06-19 | President And Fellows Of Harvard College | Delivery of negatively charged proteins using cationic lipids |
| US9526784B2 (en) | 2013-09-06 | 2016-12-27 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
| US10682410B2 (en) | 2013-09-06 | 2020-06-16 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
| US10640788B2 (en) | 2013-11-07 | 2020-05-05 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAs |
| US10190137B2 (en) | 2013-11-07 | 2019-01-29 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAS |
| US10100291B2 (en) | 2013-11-19 | 2018-10-16 | President And Fellows Of Harvard College | Mutant Cas9 proteins |
| US9074199B1 (en) | 2013-11-19 | 2015-07-07 | President And Fellows Of Harvard College | Mutant Cas9 proteins |
| US10683490B2 (en) | 2013-11-19 | 2020-06-16 | President And Fellows Of Harvard College | Mutant Cas9 proteins |
| US10435679B2 (en) | 2013-11-19 | 2019-10-08 | President And Fellows Of Harvard College | Mutant Cas9 proteins |
| US10787684B2 (en) | 2013-11-19 | 2020-09-29 | President And Fellows Of Harvard College | Large gene excision and insertion |
| US10465176B2 (en) | 2013-12-12 | 2019-11-05 | President And Fellows Of Harvard College | Cas variants for gene editing |
| US9068179B1 (en) | 2013-12-12 | 2015-06-30 | President And Fellows Of Harvard College | Methods for correcting presenilin point mutations |
| US10851357B2 (en) | 2013-12-12 | 2020-12-01 | The Broad Institute, Inc. | Compositions and methods of use of CRISPR-Cas systems in nucleotide repeat disorders |
| US10550372B2 (en) | 2013-12-12 | 2020-02-04 | The Broad Institute, Inc. | Systems, methods and compositions for sequence manipulation with optimized functional CRISPR-Cas systems |
| US9840699B2 (en) | 2013-12-12 | 2017-12-12 | President And Fellows Of Harvard College | Methods for nucleic acid editing |
| US11053481B2 (en) | 2013-12-12 | 2021-07-06 | President And Fellows Of Harvard College | Fusions of Cas9 domains and nucleic acid-editing domains |
| US10377998B2 (en) | 2013-12-12 | 2019-08-13 | The Broad Institute, Inc. | CRISPR-CAS systems and methods for altering expression of gene products, structural information and inducible modular CAS enzymes |
| EP3450553B1 (en) | 2014-03-24 | 2019-12-25 | Translate Bio, Inc. | Mrna therapy for treatment of ocular diseases |
| BR112016030852A2 (en) | 2014-07-02 | 2018-01-16 | Shire Human Genetic Therapies | rna messenger encapsulation |
| US10077453B2 (en) | 2014-07-30 | 2018-09-18 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
| US10704062B2 (en) | 2014-07-30 | 2020-07-07 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
| US10519454B2 (en) | 2014-08-06 | 2019-12-31 | Toolgen Incorporated | Genome editing using Campylobacter jejuni CRISPR/CAS system-derived RGEN |
| US10385336B2 (en) | 2014-09-05 | 2019-08-20 | Vilnius University | Programmable RNA shredding by the type III-A CRISPR-Cas system of Streptococcus thermophilus |
| US11001829B2 (en) | 2014-09-25 | 2021-05-11 | The Broad Institute, Inc. | Functional screening with optimized functional CRISPR-Cas systems |
| US11071790B2 (en) | 2014-10-29 | 2021-07-27 | Massachusetts Eye And Ear Infirmary | Method for efficient delivery of therapeutic molecules in vitro and in vivo |
| US11111472B2 (en) | 2014-10-31 | 2021-09-07 | Massachusetts Institute Of Technology | Delivery of biomolecules to immune cells |
| US10954514B2 (en) | 2014-12-12 | 2021-03-23 | The Broad Institute, Inc. | Escorted and functionalized guides for CRISPR-Cas systems |
| US10689691B2 (en) | 2014-12-19 | 2020-06-23 | The Broad Institute, Inc. | Unbiased identification of double-strand breaks and genomic rearrangement by genome-wide insert capture sequencing |
| US10648020B2 (en) | 2015-06-18 | 2020-05-12 | The Broad Institute, Inc. | CRISPR enzymes and systems |
| US11060115B2 (en) | 2015-06-18 | 2021-07-13 | The Broad Institute, Inc. | CRISPR enzymes and systems |
| US9790490B2 (en) | 2015-06-18 | 2017-10-17 | The Broad Institute Inc. | CRISPR enzymes and systems |
| US10494621B2 (en) | 2015-06-18 | 2019-12-03 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
| US10876100B2 (en) | 2015-06-18 | 2020-12-29 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
| US11091798B2 (en) | 2015-06-18 | 2021-08-17 | The Broad Institute, Inc. | CRISPR enzymes and systems |
| US11104967B2 (en) | 2015-07-22 | 2021-08-31 | President And Fellows Of Harvard College | Evolution of site-specific recombinases |
| US11078469B2 (en) | 2015-07-30 | 2021-08-03 | President And Fellows Of Harvard College | Evolution of TALENs |
| US10612011B2 (en) | 2015-07-30 | 2020-04-07 | President And Fellows Of Harvard College | Evolution of TALENs |
| US10925263B2 (en) | 2015-10-08 | 2021-02-23 | President And Fellows Of Harvard College | Multiplexed genome editing |
| US10375938B2 (en) | 2015-10-08 | 2019-08-13 | President And Fellows Of Harvard College | Multiplexed genome editing |
| US11064684B2 (en) | 2015-10-08 | 2021-07-20 | President And Fellows Of Harvard College | Multiplexed genome editing |
| US10959413B2 (en) | 2015-10-08 | 2021-03-30 | President And Fellows Of Harvard College | Multiplexed genome editing |
| EP3362461A1 (en) | 2015-10-16 | 2018-08-22 | Modernatx, Inc. | Mrna cap analogs with modified phosphate linkage |
| US10167457B2 (en) | 2015-10-23 | 2019-01-01 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
| US10851369B2 (en) | 2016-06-21 | 2020-12-01 | President And Fellows Of Harvard College | Frequency-based modulation of diverse species in a nucleic acid library |
| US10113163B2 (en) | 2016-08-03 | 2018-10-30 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
| US10947530B2 (en) | 2016-08-03 | 2021-03-16 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
| US11085072B2 (en) | 2016-08-31 | 2021-08-10 | President And Fellows Of Harvard College | Methods of generating libraries of nucleic acid sequences for detection via fluorescent in situ sequencing |
| US10266887B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
| US10266886B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
| US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
| US11104937B2 (en) | 2017-03-15 | 2021-08-31 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
| US11021740B2 (en) | 2017-03-15 | 2021-06-01 | The Broad Institute, Inc. | Devices for CRISPR effector system based diagnostics |
| US10968257B2 (en) | 2018-04-03 | 2021-04-06 | The Broad Institute, Inc. | Target recognition motifs and uses thereof |
| WO2020014577A1 (en) | 2018-07-13 | 2020-01-16 | Allele Biotechnology And Pharmaceuticals, Inc. | Methods of achieving high specificity of genome editing |
| CA3118135A1 (en) * | 2018-11-02 | 2020-05-07 | Greenvenus, Llc | Serine recombinases mediating stable integration into plant genomes |
| WO2020191246A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191241A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191245A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191234A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191242A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191243A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191171A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191153A2 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191239A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191248A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Method and compositions for editing nucleotide sequences |
| WO2020191233A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2020191249A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
| WO2021226558A1 (en) | 2020-05-08 | 2021-11-11 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
| WO2022087235A1 (en) | 2020-10-21 | 2022-04-28 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
| US11572556B2 (en) | 2020-10-21 | 2023-02-07 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
| US11827881B2 (en) | 2020-10-21 | 2023-11-28 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
| US11834658B2 (en) | 2020-10-21 | 2023-12-05 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
| WO2023070031A2 (en) | 2021-10-21 | 2023-04-27 | Massachusetts Institute Of Technology | Discovery and engineering of integrases for high-efficiency gene integration |
| WO2023122764A1 (en) * | 2021-12-22 | 2023-06-29 | Tome Biosciences, Inc. | Co-delivery of a gene editor construct and a donor template |
Non-Patent Citations (121)
| Title |
|---|
| "Methods in Enzymology", vol. 149, 1987, ACADEMIC PRESS, INC., article "Heath, Covalent Attachment of Proteins to Liposomes", pages: 111 - 119 |
| ABRA ET AL., J. LIPOSOME RES, vol. 12, 2002, pages 1 - 3 |
| AHMAD ET AL., CANCER RES, vol. 52, 1992, pages 4817 - 4820 |
| AKINC ET AL., MOL THER, vol. 17, 2009, pages 872 - 879 |
| AKINC ET AL., NAT. BIOTECHNOL., vol. 26, 2008, pages 561 - 569 |
| ALLEN ET AL., BIOCHIMICA ET BIOPHYSICA ACTA, vol. 1237, 1995, pages 99 - 108 |
| ALLEN, D. ET AL.: "CRISPR-Cas9 engineering of the RAG2 locus via complete coding sequence replacement for therapeutic applications", NAT COMMUN, vol. 14, 2023, pages 6771 |
| AMIT, I. ET AL.: "CRISPECTOR provides accurate estimation of genome editing translocation and off-target activity from comparative NGS data", NAT COMMUN, vol. 12, 2021, pages 3042, XP055852901, DOI: 10.1038/s41467-021-22417-4 |
| ANZALONE ET AL., BIORXIV, 2 November 2021 (2021-11-02) |
| ANZALONE ET AL., NATURE, vol. 576, 2019, pages 149 |
| ANZALONE, A.V. ET AL.: "Programmable deletion, replacement, integration and inversion of large DNA sequences with twin prime editing", NAT BIOTECHNOL, vol. 40, 2022, pages 731 - 740, XP037927032, DOI: 10.1038/s41587-021-01133-w |
| BLAESE ET AL., CANCER GENE THER, vol. 2, 1995, pages 291 - 297 |
| BLANCH-ASENSIO, A. ET AL.: "STRAIGHT-IN enables high-throughput targeting of large DNA payloads in human pluripotent stem cells", CELL REP METHODS, vol. 2, 2022, pages 100300, XP093107487, DOI: 10.1016/j.crmeth.2022.100300 |
| BLUME ET AL., BIOCHIMICA ET BIOPHYSICA ACTA, vol. 1149, 1993, pages 180 - 184 |
| BUCHSCHER ET AL., J. VIROL., vol. 66, 1992, pages 1635 - 1640 |
| CANCELLIERI, S. ET AL.: "Human genetic diversity alters off-target outcomes of therapeutic gene editing", NATURE GENETICS, vol. 55, 2023, pages 34 - 43, XP093241103, DOI: 10.1038/s41588-022-01257-y |
| CHALBERG, T.W. ET AL.: "Integration specificity of phage phiC31 integrase in the human genome", J MOLBIOL, vol. 357, 2006, pages 28 - 48, XP024950758, DOI: 10.1016/j.jmb.2005.11.098 |
| CHAUDHARI, H.G. ET AL.: "Evaluation of Homology-Independent CRISPR-Cas9 Off-Target Assessment Methods", CRISPR J, vol. 3, 2020, pages 440 - 453 |
| CHEN ET AL., CELL, vol. 184, 28 October 2021 (2021-10-28), pages 1 - 18 |
| CRYSTAL, SCIENCE, vol. 270, 1995, pages 404 - 410 |
| DALKARA ET AL., SCI TRANSL MED, vol. 5, 2013, pages 189 - 76 |
| DEFREES ET AL., JOURNAL OF THE AMERICAN CHEMISTRY SOCIETY, vol. 118, 1996, pages 6101 - 6104 |
| DOENCH, J.G. ET AL.: "Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9", NAT BIOTECHNOL, vol. 34, 2016, pages 184 - 191, XP093235579, DOI: 10.1038/nbt.3437 |
| DURRANT ET AL., NAT. BIOTECHNOL., 2022 |
| DURRANT MATTHEW G. ET AL: "Large-scale discovery of recombinases for integrating DNA into the human genome", BIORXIV, 9 November 2021 (2021-11-09), pages 1 - 49, XP093185806, Retrieved from the Internet <URL:https://www.biorxiv.org/content/biorxiv/early/2021/11/06/2021.11.05.467528.full.pdf> DOI: 10.1101/2021.11.05.467528 * |
| DURRANT, M.G. ET AL.: "Systematic discovery of recombinases for efficient integration of large DNA sequences into the human genome", NAT BIOTECHNOL, vol. 41, 2023, pages 488 - 499, XP093042676, DOI: 10.1038/s41587-022-01494-w |
| EHRHARDT, A.ENGLER, J.A.XU, H.CHERRY, A.M.KAY, M.A.: "Molecular analysis of chromosomal rearrangements in mammalian cells after phiC31-mediated integration", HUM GENE THER, vol. 17, 2006, pages 1077 - 1094 |
| FARRUGGIO, A.P.BHAKTA, M.S.DU BOIS, H.MA, J.M, P.C.: "Genomic integration of the full-length dystrophin coding sequence in Duchenne muscular dystrophy induced pluripotent stem cells", BIOTECHNOL J, vol. 12, 2017 |
| FAUSER, F. ET AL.: "Systematic Development of Reprogrammed Modular Integrases Enables Precise Genomic Integration of Large DNA Sequences", BIORXIV, 2024 |
| FINDLAY, S.D.VINCENT, K.M.BERMAN, J.R.POSTOVIT, L.M.: "A Digital PCR-Based Method for Efficient and Highly Specific Screening of Genome Edited Cells", PLOS ONE, vol. 11, 2016, pages 0153901 |
| FRANGOUL, H. ET AL.: "CRISPR-Cas9 Gene Editing for Sickle Cell Disease and beta-Thalassemia", N ENGL J MED, vol. 384, 2021, pages 252 - 260 |
| FRANGOUL, H. ET AL.: "CRISPR-Cas9 Gene Editing for Sickle Cell Disease and beta-Thalassemia", N ENGLJ MED, vol. 384, 2021, pages 252 - 260 |
| GAO ET AL., GENE THERAPY, vol. 2, 1995, pages 710 - 722 |
| GASIUNAS ET AL.: "A catalogue of biochemically diverse CRISPR-Cas9 orthologs", NATURE COMMUNICATIONS, vol. 11, pages 5512 |
| GHOSH, P.KIM, A.I.HATFULL, G.F.: "The orientation of mycobacteriophage Bxb 1 integration is solely dependent on the central dinucleotide of attP and attB", MOL CELL, vol. 12, 2003, pages 1101 - 1111, XP002464453, DOI: 10.1016/S1097-2765(03)00444-1 |
| GIANNOUKOS, G. ET AL.: "UDiTaS, a genome editing detection method for indels and genome rearrangements", BMC GENOMICS, vol. 19, 2018, pages 212 |
| GILLMORE, J.D. ET AL.: "CRISPR-Cas9 In Vivo Gene Editing for Transthyretin Amyloidosis", N ENGL J MED, vol. 385, 2021, pages 493 - 502, XP055978811, DOI: 10.1056/NEJMoa2107454 |
| GNIRKE, A. ET AL.: "Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing", NAT BIOTECHNOL, vol. 27, 2009, pages 182 - 189 |
| GRINDLEY, N.D.WHITESON, K.L.RICE, P.A.: "Mechanisms of site-specific recombination", ANNU REV BIOCHEM, vol. 75, 2006, pages 567 - 605, XP002527516, DOI: 10.1146/ANNUREV.BIOCHEM.73.011303.073908 |
| GROTH, A.C.CALOS, M.P.: "Phage integrases: biology and applications", J MOL BIOL, vol. 335, no. 14693, 2004, pages 667 - 678, XP055359406, DOI: 10.1016/j.jmb.2003.09.082 |
| GROTH, A.C.OLIVARES, E.C.THYAGARAJAN, B.CALOS, M.P.: "A phage integrase directs efficient site-specific integration in human cells", PROC NATL ACAD SCI, vol. 97, 2000, pages 5995 - 6000, XP002221880, DOI: 10.1073/pnas.090527097 |
| HAZELBAKER DANE Z. ET AL: "Large Serine Integrase Off-target Discovery and Validation for Therapeutic Genome Editing", BIORXIV, 10 October 2024 (2024-10-10), XP093292047, Retrieved from the Internet <URL:https://batavia.internal.epo.org/search-service-layer/master/v4/api/images/documents/XP/093292047/formats/original-pdf> DOI: 10.1101/2024.08.23.609471 * |
| HAZELBAKER, D.Z. ET AL.: "Large Serine Integrase Off-target Discovery and Validation for Therapeutic Genome Editing", BIORXIV, 2024 |
| HEINZ, S. ET AL.: "Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities", MOL CELL, vol. 38, 2010, pages 576 - 589 |
| HENNIG, B.P. ET AL.: "Large-Scale Low-Cost NGS Library Preparation Using a Robust Tn5 Purification and Tagmentation Protocol", G3, vol. 8, 2018, pages 79 - 89, XP093002331, DOI: 10.1534/g3.117.300257 |
| HERMONATMUZYCZKA, PNAS, vol. 81, 1984, pages 6466 - 6470 |
| HEW, B.E. ET AL.: "Directed evolution of hyperactive integrases for site specific insertion of transgenes", BIORXIV, 2024 |
| HSU, P.D. ET AL.: "DNA targeting specificity of RNA-guided Cas9 nucleases", NAT BIOTECHNOL, vol. 31, 2013, pages 827 - 832, XP055219426, DOI: 10.1038/nbt.2647 |
| IONNIDI ET AL.: "Drag-and-drop genome insertion without DNA cleavage with CRISPR directed integrases", BIORXIV 2021.11.01.466786, 2021 |
| JIANG ET AL., NAT. BIOTECHNOLOGY, 14 October 2021 (2021-10-14) |
| KANTER, J. ET AL.: "Biologic and Clinical Efficacy of LentiGlobin for Sickle Cell Disease", N ENGLJ MED, vol. 386, 2022, pages 617 - 628, XP093018358, DOI: 10.1056/NEJMoa2117175 |
| KARVELIS ET AL.: "PAM recognition by miniature CRISPR-Cas12f nucleases triggers programmable double-stranded DNA target cleavage", NUCLEIC ACIDS RESEARCH, vol. 48, no. 9, 21 May 2020 (2020-05-21), pages 5016 - 23, XP055920188, DOI: 10.1093/nar/gkaa208 |
| KASAI, F.MIZUKOSHI, K.NAKAMURA, Y.: "Variable characteristics overlooked in human K-562 leukemia cell lines with a common signature", SCI REP, vol. 14, 2024, pages 9619 |
| KIM ET AL.: "Digenome-seq: genome-wide profiling of CRISPR-Cas9 off-target effects in human cells", NAT. METHODS, vol. 12, 2015, pages 237 - 243, XP055554961, DOI: 10.1038/nmeth.3284 |
| KIM, A.I. ET AL.: "Mycobacteriophage Bxb1 integrates into the Mycobacterium smegmatis groEL1 gene", MOL MICROBIOL, vol. 50, 2003, pages 463 - 473, XP055013652, DOI: 10.1046/j.1365-2958.2003.03723.x |
| KIM, D. ET AL.: "Digenome-seq: genome-wide profiling of CRISPR-Cas9 off-target effects in human cells", NAT METHODS, vol. 12, 2015, pages 237 - 243,231 |
| KIRPOTIN ET AL., FEBS LETTERS, vol. 388, 1996, pages 115 - 118 |
| KLIBANOV ET AL., JOURNAL OF LIPOSOME RESEARCH, vol. 2, 1992, pages 321 - 334 |
| KOTIN, HUMAN GENE THERAPY, vol. 5, 1994, pages 793 - 801 |
| KOWALSKI ET AL.: "Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery", MOL THERAP, vol. 27, no. 4, 2019, pages 710 - 728, XP055634628, DOI: 10.1016/j.ymthe.2019.02.012 |
| LAZZAROTTO ET AL.: "CHANGE-seq reveals genetic and epigenetic effects on CRISPR-Cas9 genome-wide activity", NAT. BIOTECHNOL., vol. 38, 2020, pages 1317 - 1327, XP037614246, DOI: 10.1038/s41587-020-0555-7 |
| LEONETTI ET AL., PROC. NATL. ACAD. SCI., vol. 87, 1990, pages 2448 - 2451 |
| LI, H.DURBIN, R.: "Fast and accurate short read alignment with Burrows-Wheeler transform", BIOINFORMATICS, vol. 25, 2009, pages 1754 - 1760, XP055553969, DOI: 10.1093/bioinformatics/btp324 |
| LIUHUANG, MOLECULAR THERAPY, vol. 2010, pages 669 - 670 |
| LONGHURST, H.J. ET AL.: "CRISPR-Cas9 In Vivo Gene Editing of KLKB 1 for Hereditary Angioedema", N ENGLJ MED, vol. 390, 2024, pages 432 - 441 |
| LOVE ET AL., PROC NATL ACAD SCI USA., vol. 107, 2010, pages 1864 - 1869 |
| LOVE ET AL., PROC NATL ACAD SCI, vol. 107, 2010, pages 1864 - 1869 |
| MAEDER, M.L. ET AL.: "Development of a gene-editing approach to restore vision loss in Leber congenital amaurosis type 10", NATMED, vol. 25, 2019, pages 229 - 233, XP036693196, DOI: 10.1038/s41591-018-0327-9 |
| MAHON ET AL., BIOCONJUG CHEM, vol. 21, 2010, pages 1448 - 1454 |
| MALI, P. ET AL.: "RNA-guided human genome engineering via Cas9", SCIENCE, vol. 339, 2013, pages 823 - 826, XP055469277, DOI: 10.1126/science.1232033 |
| MEDIAVILLA, J. ET AL.: "Genome organization and characterization of mycobacteriophage Bxb1", MOL MICROBIOL, vol. 38, 2000, pages 955 - 970, XP093110617, DOI: 10.1046/j.1365-2958.2000.02183.x |
| MENZ, J. ET AL.: "Genotoxicity assessment: opportunities, challenges and perspectives for quantitative evaluations of dose-response data", ARCH TOXICOL, vol. 97, 2023, pages 2303 - 2328 |
| MILLER ET AL., J. VIROL., vol. 65, 1991, pages 2220 - 2224 |
| MILLINGTON-WARD ET AL., MOLECULAR THERAPY, vol. 19, no. 4, April 2011 (2011-04-01), pages 642 - 649 |
| MURUGAIAH ET AL., ANALYTICAL BIOCHEMISTRY, vol. 401, 2010, pages 61 |
| MUZYCZKA, J. CLIN. INVEST., vol. 94, 1994, pages 1351 |
| NAMBIAR, T.S.BAUDRIER, L.BILLON, P.CICCIA, A.: "CRISPR-based genome editing through the lens of DNA repair", MOL CELL, vol. 82, 2022, pages 348 - 388, XP086933265, DOI: 10.1016/j.molcel.2021.12.026 |
| NAN, A.X. ET AL.: "Ligase-mediated programmable genomic integration (L-PGI): an efficient site-specific gene editing system that overcomes the limitations of reverse transcriptase-based editing systems", BIORXIV, 2024 |
| NILS HOMER, P.R.TIM FENNELLCLINT VALENTINEJOHN DIDIONMATT STONE, FULCRUMGENOMICS, 2024 |
| NISHIMASU ET AL.: "Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA", CELL, vol. 156, 27 February 2014 (2014-02-27), pages 935 - 949, XP028667665, DOI: 10.1016/j.cell.2014.02.001 |
| OH, Y. ET AL.: "Expansion of the prime editing modality with Cas9 from Francisella novicida", BIORXIV 2021.05.25.445577 |
| OLORUNNIJI, F.J. ET AL.: "Control of serine integrase recombination directionality by fusion with the directionality factor", NUCLEIC ACIDS RES, vol. 45, 2017, pages 8635 - 8645, XP055702912, DOI: 10.1093/nar/gkx567 |
| PANDEY, S. ET AL.: "Efficient site-specific integration of large genes in mammalian cells via continuously evolved recombinases and prime editing", NAT BIOMED ENG, 2024 |
| PICELLI, S. ET AL.: "Tn5 transposase and tagmentation procedures for massively scaled sequencing projects", GENOME RES, vol. 24, 2014, pages 2033 - 2040, XP055236186, DOI: 10.1101/gr.177881.114 |
| RAMAKRISHNAN ET AL.: "Nuclear export signal (NES) of transposases affects the transposition activity of mariner-like elements Ppmar 1 and Ppmar2 of moso bamboo", MOB DNA, vol. 10, 19 August 2019 (2019-08-19), pages 35 |
| REMY ET AL., BIOCONJUGATE CHEM, vol. 5, 1994, pages 647 - 654 |
| RENNEISEN ET AL., J. BIO. CHEM., vol. 265, 1990, pages 16337 - 16342 |
| ROBINSON, J.T. ET AL.: "Integrative genomics viewer", NAT BIOTECHNOL, vol. 29, 2011, pages 24 - 26, XP037104061, DOI: 10.1038/nbt.1754 |
| ROSIN ET AL., MOLECULAR THERAPY, vol. 19, no. 1432494-65-9, December 2011 (2011-12-01), pages 1286 - 2200 |
| RUMELHART, D.E.HINTON, G.E.WILLIAMS, R.J.: "Learning representations by back-propagating errors", NATURE, vol. 323, 1986, pages 533 - 536, XP037129009, DOI: 10.1038/323533a0 |
| SAMULSKI ET AL., J. VIROL., vol. 63, 1989, pages 03822 - 3828 |
| SAPRA ET AL., PROG. LIPID RES., vol. 42, no. 5, 2003, pages 439 - 62 |
| SCHMIDT ET AL.: "Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases", NAT COMMUN, vol. 12, 2021, pages 4219 |
| SCHNEIDER, V.A. ET AL.: "Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly", GENOME RES, vol. 27, 2017, pages 849 - 864 |
| SCHROEDER ET AL., J INTERN MED, vol. 267, 2010, pages 9 - 21 |
| SHAH ET AL.: "Protospacer recognition motifs: mixed identities and functional diversity", RNA BIOLOGY, vol. 10, no. 5, pages 891 - 899 |
| SIEGWART ET AL., PROC NATL ACAD SCI, vol. 108, 2011, pages 12996 - 3001 |
| SMITH, M.C.M.: "Phage-encoded Serine Integrases and Other Large Serine Recombinases", MICROBIOL SPECTR, vol. 3, 2015, pages 3, XP093110438, DOI: 10.1128/microbiolspec.MDNA3-0059-2014 |
| SOMMNERFELT ET AL., VIROL, vol. 176, 1990, pages 58 - 59 |
| STARK, W.M.: "The Serine Recombinases", MICROBIOL SPECTR, vol. 2, 2014, pages 2 |
| SWARTS ET AL.: "Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a", MOLECULAR CELL, vol. 66, 20 April 2017 (2017-04-20), pages 221 - 233, XP055569665, DOI: 10.1016/j.molcel.2017.03.016 |
| THORPE, H.M.SMITH, M.C.: "In vitro site-specific integration of bacteriophage DNA catalyzed by a recombinase of the resolvase/invertase family", PROC NATL ACAD SCI, vol. 95, 1998, pages 5505 - 5510, XP002268753, DOI: 10.1073/pnas.95.10.5505 |
| TOU, C.J.KLEINSTIVER, B.P.: "Programmable RNA-guided enzymes for next-generation genome editing", NATURE, vol. 630, 2024, pages 827 - 828 |
| TRATSCHIN ET AL., MOL. CELL. BIOL., vol. 4, 1984, pages 2072 - 2081 |
| TRATSCHIN ET AL., MOL. CELL. BIOL., vol. 5, 1985, pages 3251 - 3260 |
| TSAI, S.Q. ET AL.: "GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases", NAT BIOTECHNOL, vol. 33, 2015, pages 187 - 197, XP055555627, DOI: 10.1038/nbt.3117 |
| WANG, J.Y.DOUDNA, J.A.: "CRISPR technology: A decade of genome editing is only the beginning", SCIENCE, vol. 379, 2023, pages 8643 |
| WANG, X. ET AL.: "Bxb 1 integrase serves as a highly efficient DNA recombinase in rapid metabolite pathway assembly", ACTA BIOCHIM BIOPHYS SIN (SHANGHAI, vol. 49, 2017, pages 44 - 50 |
| WANG, X. ET AL.: "Bxb 1 integrase serves as a highly efficient DNA recombinase in rapid metabolite pathway assembly", ACTA BIOCHIM BIOPHYS SIN, vol. 49, 2017, pages 44 - 50 |
| WEST ET AL., VIROLOGY, vol. 160, 1987, pages 38 - 47 |
| XU ET AL.: "Accuracy and efficiency define Bxb1 integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome", BMC BIOTECHNOL, vol. 13, 20 October 2013 (2013-10-20), pages 87 |
| XU ET AL.: "Comparison and optimization of ten phage encoded serine integrases for genome engineering in Saccharomyces cerevisiae", BMC BIOTECHNOL, vol. 16, 9 February 2016 (2016-02-09), pages 13 |
| XU, X. ET AL.: "Engineered Miniature CRISPR-Cas System for Mammalian Genome Regulation and Editing", MOLECULAR CELL, vol. 81, no. 20, 21 October 2021 (2021-10-21), pages 4333 - 45 |
| XU, Z. ET AL.: "Accuracy and efficiency define Bxb 1 integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome", BMC BIOTECHNOL, vol. 13, 2013, pages 87 |
| YAMANAKA, S.: "Pluripotent Stem Cell-Based Cell Therapy-Promise and Challenges", CELL STEM CELL, vol. 27, 2020, pages 523 - 531, XP086279810, DOI: 10.1016/j.stem.2020.09.014 |
| YAMANO ET AL.: "Crystal structure of Cpf1 in complex with guide RNA and target DNA", CELL, vol. 165, 5 May 2016 (2016-05-05), pages 949 - 962, XP029530759, DOI: 10.1016/j.cell.2016.04.003 |
| YARNALL MATTHEW T. ET AL: "Drag-and-drop genome insertion of large sequences without double-strand DNA cleavage using CRISPR-directed integrases", NATURE BIOTECHNOLOGY, vol. 41, no. 4, 24 November 2022 (2022-11-24), New York, pages 500 - 512, XP093203561, ISSN: 1087-0156, Retrieved from the Internet <URL:https://www.nature.com/articles/s41587-022-01527-4> DOI: 10.1038/s41587-022-01527-4 * |
| YARNALL, M.T.N. ET AL.: "Drag-and-drop genome insertion of large sequences without double-strand DNA cleavage using CRISPR-directed integrases", NAT BIOTECHNOL, vol. 41, 2023, pages 500 - 512, XP093203561, DOI: 10.1038/s41587-022-01527-4 |
| ZALIPSKY, BIOCONJUGATE CHEMISTRY, vol. 4, 1993, pages 296 - 299 |
| ZALIPSKY, FEBS LETTERS, vol. 353, 1994, pages 71 - 74 |
| ZALIPSKY: "Stealth Liposomes", 1995, CRC PRESS |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4677108A1 (en) | 2026-01-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7536053B2 (en) | Systems, methods and compositions for sequence manipulation with optimized CRISPR-Cas systems | |
| JP7280905B2 (en) | Crystal structure of CRISPRCPF1 | |
| US20240093193A1 (en) | Dead guides for crispr transcription factors | |
| US20250057980A1 (en) | Co-Delivery of a Gene Editor Construct and a Donor Template | |
| US12215318B2 (en) | Crispr enzymes and systems | |
| US20170306335A1 (en) | Rna-targeting system | |
| US20260022386A1 (en) | Single Construct Platform for Simultaneous Delivery of Gene Editing Machinery and Nucleic Acid Cargo | |
| US20240263153A1 (en) | Integrase compositions and methods | |
| US12077775B2 (en) | Effector proteins and methods of use | |
| JP2025537750A (en) | RNA compositions containing lipid nanoparticles or lipid-reconstituted natural messenger packs | |
| JP2024534207A (en) | RNA-guided genome recombineering on the kilobase scale | |
| JP2025505732A (en) | RNA-guided genome recombineering on the kilobase scale | |
| WO2023225670A2 (en) | Ex vivo programmable gene insertion | |
| WO2023205744A1 (en) | Programmable gene insertion compositions | |
| WO2023215831A1 (en) | Guide rna compositions for programmable gene insertion | |
| WO2024234006A1 (en) | Systems, compositions, and methods for targeting liver sinusodial endothelial cells (lsecs) | |
| WO2024138194A1 (en) | Platforms, compositions, and methods for in vivo programmable gene insertion | |
| EP4677108A1 (en) | Method and compositions for detecting off-target editing | |
| WO2025050069A1 (en) | Programmable gene insertion using engineered integration enzymes | |
| WO2025224182A2 (en) | Single construct platform for simultaneous delivery of gene editing machinery and nucleic acid cargo | |
| CN118829727A (en) | A single construct platform for simultaneous delivery of gene editing machinery and nucleic acid cargo | |
| WO2025201481A1 (en) | Crispr-cas systems | |
| JP2025514304A (en) | Identifying tissue-specific extragenic safe harbors for gene therapy | |
| WO2024168265A1 (en) | Aav delivery of rna guided recombination system | |
| WO2024192211A2 (en) | Effector proteins and uses thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| WWE | Wipo information: entry into national phase |
Ref document number: 2025721812 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2025721812 Country of ref document: EP Effective date: 20251009 |
|
| ENP | Entry into the national phase |
Ref document number: 2025721812 Country of ref document: EP Effective date: 20251009 |
|
| ENP | Entry into the national phase |
Ref document number: 2025721812 Country of ref document: EP Effective date: 20251009 |
|
| ENP | Entry into the national phase |
Ref document number: 2025721812 Country of ref document: EP Effective date: 20251009 |