WO2009134714A2 - Fusion molecules of rationally-designed dna-binding proteins and effector domains - Google Patents
Fusion molecules of rationally-designed dna-binding proteins and effector domains Download PDFInfo
- Publication number
- WO2009134714A2 WO2009134714A2 PCT/US2009/041796 US2009041796W WO2009134714A2 WO 2009134714 A2 WO2009134714 A2 WO 2009134714A2 US 2009041796 W US2009041796 W US 2009041796W WO 2009134714 A2 WO2009134714 A2 WO 2009134714A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sense strand
- meganuclease
- group
- modification
- altered
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Definitions
- the invention relates to the field of molecular biology and recombinant nucleic acid technology.
- the invention relates to rationally-designed, non-naturally- occurring meganucleases with altered DNA recognition sequence specificity and/or altered affinity.
- the invention also relates to methods of producing such meganucleases, and methods of producing recombinant nucleic acids and organisms using such meganucleases.
- Genome engineering requires the ability to insert, delete, substitute and otherwise manipulate specific genetic sequences within a genome, and has numerous therapeutic and biotechnological applications.
- the development of effective means for genome modification remains a major goal in gene therapy, agrotechnology, and synthetic biology (Porteus et al. (2005), Nat. Biotechnol. 23: 967-73; Tzfira et al. (2005), Trends Biotechnol. 23: 567-9; McDaniel et al. (2005), Curr. Opin. Biotechnol. 16: 476-83).
- a common method for inserting or modifying a DNA sequence involves introducing a transgenic DNA sequence flanked by sequences homologous to the genomic target and selecting or screening for a successful homologous recombination event. Recombination with the transgenic DNA occurs rarely but can be stimulated by a double-stranded break in the genomic DNA at the target site. Numerous methods have been employed to create DNA double-stranded breaks, including irradiation and chemical treatments. Although these methods efficiently stimulate recombination, the double-stranded breaks are randomly dispersed in the genome, which can be highly mutagenic and toxic. At present, the inability to target gene modifications to unique sites within a chromosomal background is a major impediment to successful genome engineering.
- One approach to achieving this goal is stimulating homologous recombination at a double-stranded break in a target locus using a nuclease with specificity for a sequence that is sufficiently large to be present at only a single site within the genome (see, e.g., Porteus et al. (2005), Nat. Biotechnol. 23: 967-73).
- the effectiveness of this strategy has been demonstrated in a variety of organisms using chimeric fusions between an engineered zinc finger DNA-binding domain and the non-specific nuclease domain of the Fokl restriction enzyme (Porteus (2006), MoI Ther 13: 438-46; Wright et al. (2005), Plant J.
- a group of naturally-occurring nucleases which recognize 15-40 base-pair cleavage sites commonly found in the genomes of plants and fungi may provide a less toxic genome engineering alternative.
- Such "meganucleases” or “homing endonucleases” are frequently associated with parasitic DNA elements, such as group 1 self-splicing introns and inteins. They naturally promote homologous recombination or gene insertion at specific locations in the host genome by producing a double-stranded break in the chromosome, which recruits the cellular DNA-repair machinery (Stoddard (2006), Q. Rev. Biophys. 38: 49- 95).
- LAGLIDADG Meganucleases are commonly grouped into four families: the LAGLIDADG family, the GIY-YIG family, the His-Cys box family and the HNH family. These families are characterized by structural motifs, which affect catalytic activity and recognition sequence. For instance, members of the LAGLIDADG family are characterized by having either one or two copies of the conserved LAGLIDADG motif (see Chevalier et al. (2001), Nucleic Acids Res. 29(18): 3757-3774). The LAGLIDADG meganucleases with a single copy of the LAGLIDADG motif form homodimers, whereas members with two copies of the LAGLIDADG motif are found as monomers.
- the GIY-YIG family members have a GIY-YIG module, which is 70-100 residues long and includes four or five conserved sequence motifs with four invariant residues, two of which are required for activity (see Van Roey et al. (2002), Nature Struct. Biol. 9: 806-811).
- the His-Cys box meganucleases are characterized by a highly conserved series of histidines and cysteines over a region encompassing several hundred amino acid residues (see Chevalier et al. (2001), Nucleic Acids Res. 29(18): 3757-3774).
- the members are defined by motifs containing two pairs of conserved histidines surrounded by asparagine residues (see Chevalier et al. (2001), Nucleic Acids Res. 29(18): 3757-3774).
- the four families of meganucleases are widely separated from one another with respect to conserved structural elements and, consequently, DNA recognition sequence specificity and catalytic activity.
- the meganuclease I-Crel from Chlamydomonas reinhardtii is a member of the LAGLIDADG family which recognizes and cuts a 22 base-pair recognition sequence in the chloroplast chromosome, and which presents an attractive target for meganuclease redesign.
- the wild-type enzyme is a homodimer in which each monomer makes direct contacts with 9 base pairs in the full-length recognition sequence. Genetic selection techniques have been used to identify mutations in I-Crel that alter base preference at a single position in this recognition sequence (Sussman et al. (2004), J. MoI. Biol. 342: 31-41; Chames et al. (2005), Nucleic Acids Res.
- the I-Crel protein-DNA interface contains nine amino acids that contact the DNA bases directly and at least an additional five positions that can form potential contacts in modified interfaces. The size of this interface imposes a combinatorial complexity that is unlikely to be sampled adequately in sequence libraries constructed to select for enzymes with drastically altered cleavage sites.
- the present invention is based, in part, upon the identification and characterization of specific amino acid residues in the LAGLIDADG family of meganucleases that make contacts with DNA bases and the DNA backbone when the meganucleases associate with a double-stranded DNA recognition sequence, and thereby affect the specificity and activity of the enzymes.
- This discovery has been used, as described in detail below, to identify amino acid substitutions which can alter the recognition sequence specificity and/or DNA-binding affinity of the meganucleases, and to rationally design and develop non-naturally-occurring meganucleases that can recognize a desired DNA sequence that naturally-occurring meganucleases do not recognize.
- non-naturally-occurring, rationally-designed meganucleases can be used in conjunction with regulatory or effector domains to regulate cellular process in vivo and in vitro.
- non-naturally occurring, rationally-designed meganucleases can be used in conjunction with a transcription effector domain to provide a targeted transcriptional activator for regulation of gene expression in vivo or in vitro.
- the invention provides a targeted transcriptional effector comprising:
- targeted transcriptional effector further comprises a domain linker joining the meganuclease DNA-binding domain and the transcription effector domain.
- the domain linker can comprise a polypeptide.
- the meganuclease DNA-binding domain is altered from a naturally-occurring meganuclease by at least one point mutation which reduces or abolishes endonuclease cleavage activity.
- the targeted transcriptional effector can further comprise a nuclear localization signal.
- the transcriptional effector domain is a transcription activator or a transcription repressor.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Crel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 2-153 of the I- Crel meganuclease of SEQ ID NO: 1; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Crel meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5; wherein said recombinant meganuclease comprises at least one modification of Table 1 and a modification which reduces or abolishes said endonuclease cleavage activity.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Msol meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 6-160 of the I- Msol meganuclease of SEQ ID NO: 6; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Msol meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 7 and SEQ ID NO: 8; wherein said recombinant meganuclease comprises at least one modification of Table 2 and a modification which reduces or abolishes said endonuclease cleavage activity.
- the modification which reduces or abolishes said endonuclease cleavage activity is D22
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for a recognition sequence relative to a wild-type I-Scel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 3-186 of the I- Scel meganuclease of SEQ ID NO: 9; and having specificity for a recognition sequence which differs by at least one base pair from an I-Scel meganuclease recognition sequence of SEQ ID NO: 10 and SEQ ID NO: i i; wherein said recombinant meganuclease comprises at least one modification of Table 3 and a modification which reduces or abolishes said endonuclease cleavage activity.
- the modification which reduces or abolishes said endonuclease cleavage activity is D44N or D145N.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Ceul meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 5-211 of the I- Ceul meganuclease of SEQ ID NO: 12; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Ceul meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 13 and SEQ ID NO: 14; wherein said recombinant meganuclease comprises at least one modification of Table 4 and a modification which reduces said endonuclease cleavage activity.
- the modification which reduces said endonuclease cleavage activity is E66Q.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Crel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 2-153 of the I-Crel meganuclease of SEQ ID NO: 1; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Crel meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5; wherein:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Msol meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 6-160 of the I- Msol meganuclease of SEQ ID NO: 6; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Msol meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 7 and SEQ ID NO: 8; wherein: (1) specificity at position -1 has been altered:
- the meganuclease DNA-binding domain comprises recombinant meganuclease having altered specificity for a recognition sequence relative to a wild-type I-Scel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 3-186 of the I- Scel meganuclease of SEQ ID NO: 9; and having specificity for a recognition sequence which differs by at least one base pair from an I-Scel meganuclease recognition sequence of SEQ ID NO: 10 and SEQ ID NO: i i; wherein:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered specificity for at least one recognition sequence half-site relative to a wild-type I-Ceul meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 5-211 of the I- Ceul meganuclease of SEQ ID NO: 12; and having specificity for a recognition sequence half-site which differs by at least one base pair from a half-site within an I-Ceul meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 13 and SEQ ID NO: 14; wherein:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Crel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 2-153 of the I- Crel meganuclease of SEQ ID NO: 1; wherein DNA-binding affinity has been increased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Crel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 2-153 of the I- Crel meganuclease of SEQ ID NO: 1; wherein DNA-binding affinity has been decreased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Msol meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 6-160 of the I- Msol meganuclease of SEQ ID NO: 6; wherein DNA-binding affinity has been increased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Msol meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 6-160 of the I- Msol meganuclease of SEQ ID NO: 6; wherein DNA-binding affinity has been decreased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Scel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 3-186 of the I- Scel meganuclease of SEQ ID NO: 9; wherein DNA-binding affinity has been increased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Scel meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 3-186 of the I- Scel meganuclease of SEQ ID NO: 9; wherein DNA-binding affinity has been decreased by at least one modification corresponding to:
- meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Ceul meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 5-211 of the I- Ceul meganuclease of SEQ ID NO: 12; wherein DNA-binding affinity has been increased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease having altered binding affinity for double-stranded DNA relative to a wild-type I-Ceul meganuclease, comprising: a polypeptide having at least 85% sequence similarity to residues 5-211 of the I- Ceul meganuclease of SEQ ID NO: 12; wherein DNA-binding affinity has been decreased by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease monomer having altered affinity for dimer formation with a reference meganuclease monomer, comprising: a polypeptide having at least 85% sequence similarity to residues 2-153 of the I- Crel meganuclease of SEQ ID NO: 1; wherein affinity for dimer formation has been altered by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease heterodimer comprising: a first polypeptide having at least 85% sequence similarity to residues 2-153 of the I-Crel meganuclease of SEQ ID NO: 1; wherein affinity for dimer formation has been altered by at least one modification corresponding to substitution of K7, K57 or K96 with D or E; and a second polypeptide having at least 85% sequence similarity to residues 2-153 of the I-Crel meganuclease of SEQ ID NO: 1; wherein affinity for dimer formation has been altered by at least one modification corresponding to a substitution of E8 or E61 with K or R.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease monomer having altered affinity for dimer formation with a reference meganuclease monomer, comprising: a polypeptide having at least 85% sequence similarity to residues 6-160 of the I- Msol meganuclease of SEQ ID NO: 6; wherein affinity for dimer formation has been altered by at least one modification corresponding to:
- the meganuclease DNA-binding domain comprises a recombinant meganuclease heterodimer comprising: a first polypeptide having at least 85% sequence similarity to residues 6-160 of the I-Msol meganuclease of SEQ ID NO: 6; wherein affinity for dimer formation has been altered by at least one modification corresponding to a substitution of R302 with D or E; and a second polypeptide having at least 85% sequence similarity to residues 6-160 of the I-Msol meganuclease of SEQ ID NO: 6; wherein affinity for dimer formation has been altered by at least one modification corresponding to a substitution of D20, El 1 or Q64 with K or R.
- the meganuclease DNA-binding domain comprises a recombinant meganuclease monomer having altered affinity for dimer formation with a reference meganuclease monomer, comprising: a polypeptide having at least 85% sequence similarity to residues 5-211 of the I- Ceul meganuclease of SEQ ID NO: 12; wherein affinity for dimer formation has been altered by at least one modification corresponding to:
- meganuclease DNA-binding domain comprises a recombinant meganuclease heterodimer comprising: a first polypeptide having at least 85% sequence similarity to residues 5-211 of the I-Ceul meganuclease of SEQ ID NO: 12; wherein affinity for dimer formation has been altered by at least one modification corresponding to a substitution of R93 with D or E; and a second polypeptide having at least 85% sequence similarity to residues 5-211 of the I-Ceul meganuclease of SEQ ID NO: 12; wherein affinity for dimer formation has been altered by at least one modification corresponding to a substitution of E 152 with K or R.
- the recombinant meganuclease monomer or heterodimer further comprises at least one modification selected from Table 1.
- the invention provides a nucleic acid encoding the targeted transcriptional effector.
- the invention provides a method for treating a disease or condition in a subject in need thereof, the method comprising: introducing the nucleic acid encoding the targeted transcriptional effector into a subject, whereby the polypeptide encoded by the nucleic acid binds to the target site and affects transcription of the gene of interest.
- the invention provides a method for treating a disease or condition in a subject in need thereof, the method comprising: introducing the targeted transcriptional effector of claims 1-34 into a subject, whereby the polypeptide binds to the target site and affects transcription of the gene of interest.
- Figure l(A) illustrates the interactions between the I-Crel homodimer and its naturally-occurring double-stranded recognition sequence, based upon crystallographic data.
- This schematic representation depicts the recognition sequence (SEQ ID NO: 2 and SEQ ID NO: 3), shown as unwound for illustration purposes only, bound by the homodimer, shown as two ovals.
- the bases of each DNA half-site are numbered -1 through -9, and the amino acid residues of I-Crel which form the recognition surface are indicated by one-letter amino acid designations and numbers indicating residue position.
- Solid black lines hydrogen bonds to DNA bases.
- Dashed lines amino acid positions that form additional contacts in enzyme designs but do not contact the DNA in the wild-type complex.
- Figure l(B) illustrates the wild-type contacts between the A-T base pair at position -4 of the cleavage half- site on the right side of Figure 1(A). Specifically, the residue Q26 is shown to interact with the A base. Residue 177 is in proximity to the base pair but not specifically interacting.
- Figure l(C) illustrates the interactions between a non-naturally-occurring, rationally-designed variant of the I-Crel meganuclease in which residue 177 has been modified to E77.
- a G-C base pair is preferred at position -4.
- the interaction between Q26 and the G base is mediated by a water molecule, as has been observed crystallographically for the cleavage half-site on the left side of Figure 1(A).
- Figure l(D) illustrates the interactions between a non-naturally-occurring, rationally-designed variant of the I-Crel meganuclease in which residue Q26 has been modified to E26 and residue 177 has been modified to R77.
- a C-G base pair is preferred at position -4.
- Figure l(E) illustrates the interactions between a non-naturally-occurring, rationally-designed variant of the I-Crel meganuclease in which residue Q26 has been modified to A26 and residue 177 has been modified to Q77. As a result of this change, a T-A base pair is preferred at position -4.
- Figure 2(A) shows a comparison of one recognition sequence for each of the wild type I-Crel meganuclease (WT) and 11 non-naturally-occurring, rationally-designed meganuclease heterodimers described herein. Bases that are conserved relative to the WT recognition sequence are shaded. The 9 bp half- sites are bolded.
- WT wild type I-Crel meganuclease
- WT wild-type (SEQ ID NO: 4); CF: ⁇ F508 allele of the human CFTR gene responsible for most cases of cystic fibrosis (SEQ ID NO: 25); MYD: the human DM kinase gene associated with myotonic dystrophy (SEQ ID NO: 27); CCR: the human CCR5 gene (a major HIV co-receptor) (SEQ ID NO: 26); ACH: the human FGFR3 gene correlated with achondroplasia (SEQ ID NO: 23); TAT: the HIV-I TAT/REV gene (SEQ ID NO: 15); HSV: the HSV-I UL36 gene (SEQ ID NO: 28); LAM: the bacteriophage ⁇ pO5 gene (SEQ ID NO: 22); POX: the Variola (smallpox) virus gp009 gene (SEQ ID NO: 30); URA: the Saccharomyces cerevisiae URA3 gene (SEQ ID NO: 36); GLA
- Figure 2(B) illustrates the results of incubation of each of wild-type I-Crel (WT) and 11 non-naturally-occurring, rationally-designed meganuclease heterodimers with plasmids harboring the recognition sites for all 12 enzymes for 6 hours at 37 0 C. Percent cleavage is indicated in each box.
- Figure 3 illustrates cleavage patterns of wild-type and non-naturally-occurring, rationally-designed I-Crel homodimers.
- A wild type I-Crel.
- B I-Crel Kl 16D.
- C-L non- naturally-occurring, rationally-designed meganucleases described herein.
- Enzymes were incubated with a set of plasmids harboring palindromes of the intended cleavage half- site the 27 corresponding single -base pair variations. Bar graphs show fractional cleavage (F) in 4 hours at 37 0 C. Black bars: expected cleavage patterns based on Table 1. Gray bars: DNA sites that deviate from expected cleavage patterns. White squares indicate bases in the intended recognition site. Also shown are cleavage time-courses over two hours. The open circle time-course plots in C and L correspond to cleavage by the CCRl and BRP2 enzymes lacking the E80Q mutation.
- Figure 4 demonstrates DNA recognition by Endo-TNF.
- Purified Endo-TNFsc was incubated with pUC-19 plasmid substrates (linearized with Seal) for 2 hours at 37 0 C.
- Lanes 1 and 2 molecular weight markers.
- Lanes 3 and 4 Endo-TNFsc incubated with empty plasmid (lane 3) or plasmid harboring the wild-type I-Crel site (lane 4).
- Lanes 5-7 linearized plasmid harboring the Endo-TNFsc recognition site incubated with buffer only (lane 5), Endo-TNFsc (lane 6), or the inactivated Endo-TNF ⁇ o- Bands of 0.9 and 1.8 kb in length in lane 6 indicate cleavage by Endo-TNFsc of its intended recognition site.
- Figure 5 shows the results of a chromatin immunoprecipitation (ChIP) assay with Endo-TNF ⁇ o- Cultured HEK 293 cells were transfected with either GFP or Endo-TNF ⁇ o and a ChIP assay was performed.
- ChIP chromatin immunoprecipitation
- PCR was performed on DNA isolated from input cell lysates (In) or on DNA isolated from cell lysates immunoprecipitated with I-Crel antiserum (IP) or fetal bovine serum (-AB) using primers specific for TNF- ⁇ .
- IP I-Crel antiserum
- -AB fetal bovine serum
- FIG. 6 demonstrates activity of the CCR2 REP transcription repressor.
- GFP transfection efficiency
- the present invention is based, in part, upon the identification and characterization of specific amino acids in the LAGLIDADG family of meganucleases that make specific contacts with DNA bases and non-specific contacts with the DNA backbone when the meganucleases associate with a double-stranded DNA recognition sequence, and which thereby affect the recognition sequence specificity and DNA-binding affinity of the enzymes.
- the invention provides methods for generating non-naturally- occurring, rationally-designed LAGLIDADG meganucleases containing altered amino acid residues at sites within the meganuc lease that are responsible for (1) sequence-specific binding to individual bases in the double-stranded DNA recognition sequence, or (2) non- sequence-specific binding to the phosphodiester backbone of a double-stranded DNA molecule.
- Altering the amino acids involved in binding to the DNA backbone can alter not only the activity of the enzyme, but also the degree of specificity or degeneracy of binding to the recognition sequence by increasing or decreasing overall binding affinity for the double- stranded DNA.
- specific residues can be altered to reduce or eliminate catalytic activity.
- the rules include both steric considerations relating to the distances in a meganuclease-DNA complex between the amino acid side chains of the meganuclease and the bases in the sense and anti-sense strands of the DNA, and considerations relating to the non-covalent chemical interactions between functional groups of the amino acid side chains and the desired DNA base at the relevant position.
- the invention provides non-naturally-occurring, rationally-designed meganucleases in which monomers differing by at least one amino acid position are dimerized to form heterodimers.
- both monomers are rationally-designed to form a heterodimer which recognizes a non-palindromic recognition sequence.
- a mixture of two different monomers can result in up to three active forms of meganuclease dimer: the two homodimers and the heterodimer.
- amino acid residues are altered at the interfaces at which monomers can interact to form dimers, in order to increase or decrease the likelihood of formation of homodimers or heterodimers.
- a linker such as a polypeptide is added between the monomer domains to aid in heterodimer formation.
- the invention provide methods for rationally designing non- naturally-occurring LAGLIDADG meganucleases containing amino acid changes that alter the specificity and/or affinity of the enzymes for DNA-binding.
- the invention provides the non-naturally-occurring, rationally-designed meganucleases resulting from these methods and their use as sequence-specific DNA-binding proteins to target effector domains to specific loci in a genome.
- the invention provides methods that use such fusion molecules of non-naturally-occurring, rationally-designed meganucleases and effector domains to regulate gene expression in vivo or in vitro.
- the invention provides methods for treating conditions which can be treated by increasing or decreasing the expression of a gene, by administering a fusion molecule provided by the invention.
- meganuclease refers to an endonuclease that binds double-stranded DNA at a recognition sequence that is greater than 12 base pairs.
- Naturally- occurring meganucleases can be monomeric (e.g., I-Scel) or dimeric (e.g., I-Crel).
- the term meganuclease, as used herein, can be used to refer to monomeric meganucleases, dimeric meganucleases, or to the monomers which associate to form a dimeric meganuclease.
- the term "homing endonuclease” is synonymous with the term “meganuclease.”
- the meganucleases can be catalytically active (i.e., capable of binding and cleaving double- stranded DNA at their recognition sequence) or can be inactivated by way of rational design. For most embodiments described herein, the meganuclease will be inactivated, although catalytically active meganucleases can be employed as intermediates and controls while developing inactive meganucleases.
- LAGLIDADG meganuclease refers either to meganucleases including a single LAGLIDADG motif, which are naturally dimeric, or to meganucleases including two LAGLIDADG motifs, which are naturally monomeric.
- the term "mono-LAGLIDADG meganuclease” is used herein to refer to meganucleases including a single LAGLIDADG motif
- the term "di-LAGLIDADG meganuclease” is used herein to refer to meganucleases including two LAGLIDADG motifs, when it is necessary to distinguish between the two.
- Each of the two structural domains of a di-LAGLIDADG meganuclease which includes a LAGLIDADG motif can be referred to as a LAGLIDADG subunit.
- rationally-designed means non-naturally occurring and/or genetically engineered.
- the rationally-designed meganucleases described herein differ from wild-type or naturally-occurring meganucleases in their amino acid sequence or primary structure, and may also differ in their secondary, tertiary or quaternary structure.
- the rationally-designed meganucleases described herein also differ from wild-type or naturally-occurring meganucleases in recognition sequence-specificity, affinity and/or activity.
- the term “recombinant” means having an altered amino acid sequence as a result of the application of genetic engineering techniques to nucleic acids which encode the protein, and cells or organisms which express the protein.
- nucleic acid the term “recombinant” means having an altered nucleic acid sequence as a result of the application of genetic engineering techniques. Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies; homologous recombination; site-directed mutagenesis; and gene fusion.
- Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies; homologous recombination; site-directed mutagenesis; and gene fusion.
- a protein having an amino acid sequence identical to a naturally-occurring protein, but produced by cloning and expression in a heterologous host is not considered recombinant.
- modification means any insertion, deletion or substitution of an amino acid residue in the recombinant sequence relative to a reference sequence (e.g., a wild-type).
- the term “genetically-modified” refers to a cell or organism in which, or in an ancestor of which, a genomic DNA sequence has been deliberately modified by recombinant technology. As used herein, the term “genetically-modified” encompasses the term "transgenic.”
- wild-type refers to any naturally-occurring form of a meganuclease.
- wild-type is not intended to mean the most common allelic variant of the enzyme in nature but, rather, any allelic variant found in nature. Wild-type meganucleases are distinguished from recombinant or non-naturally-occurring meganucleases.
- the term "recognition sequence half-site” or simply “half site” means a nucleic acid sequence in a double-stranded DNA molecule which is recognized by a monomer of a mono-LAGLID ADG meganuclease or by one LAGLIDADG subunit of a di- LAGLIDADG meganuclease.
- the term "recognition sequence” refers to a pair of half-sites which is bound by either a mono-LAGLID ADG meganuclease dimer or a di-LAGLIDADG meganuclease monomer.
- the two half-sites may or may not be separated by base pairs that are not specifically recognized by the enzyme.
- the recognition sequence half-site of each monomer spans 9 base pairs, and the two half-sites are separated by four base pairs which are not recognized specifically but which constitute the actual cleavage site (which has a 4 base pair overhang).
- the combined recognition sequences of the I-Crel, I-Msol and I-Ceul meganuclease dimers normally span 22 base pairs, including two 9 base pair half-sites flanking a 4 base pair cleavage site.
- the base pairs of each half- site are designated -9 through -1, with the -9 position being most distal from the cleavage site and the -1 position being adjacent to the 4 central base pairs, which are designated N 1 -N 4 .
- the strand of each half-site which is oriented 5' to 3' in the direction from -9 to -1 (i.e., towards the cleavage site), is designated the "sense" strand and the opposite strand is designated the "antisense strand", although neither strand may encode protein.
- the "sense" strand of one half-site is the antisense strand of the other half-site. See, for example, Figure l(A).
- the recognition sequence is an approximately 18 bp non- palindromic sequence, and there are no central base pairs which are not specifically recognized.
- one of the two strands is referred to as the "sense” strand and the other the “antisense” strand, although neither strand may encode protein.
- the term "specificity" means the ability of a meganuclease to recognize double-stranded DNA molecules only at a particular sequence of base pairs referred to as the recognition sequence, or only at a particular set of recognition sequences.
- the set of recognition sequences will share certain conserved positions or sequence motifs, but may be degenerate at one or more positions.
- a highly-specific meganuclease is capable of binding only one or a very few recognition sequences.
- a meganuclease For catalytically active meganucleases, specificity can be determined in a cleavage assay as described in Example 1. For inactive meganucleases, binding assays can be substituted.
- a meganuclease has "altered" specificity if it binds to a recognition sequence which is not bound to by a reference meganuclease (e.g., a wild-type) or if the affinity of binding of a recognition sequence is increased or decreased by a significant (10-fold or more) amount relative to a reference meganuclease.
- the term "degeneracy” means the opposite of "specificity.”
- a highly-degenerate meganuclease is capable of binding a large number of divergent recognition sequences.
- a meganuclease can have sequence degeneracy at a single position within a half-site or at multiple, even all, positions within a half-site.
- sequence degeneracy can result from (i) the inability of any amino acid in the DNA-binding domain of a meganuclease to make a specific contact with any base at one or more positions in the recognition sequence, (ii) the ability of one or more amino acids in the DNA-binding domain of a meganuclease to make specific contacts with more than one base at one or more positions in the recognition sequence, and/or (iii) sufficient non-specific DNA binding affinity.
- a "completely" degenerate position can be occupied by any of the four bases and can be designated with an "N" in a half-site.
- a “partially” degenerate position can be occupied by two or three of the four bases (e.g., either purine (Pu), either pyrimidine (Py), or not G).
- DNA-binding affinity means the tendency of a meganuclease to non-covalently associate with a reference DNA molecule (e.g., a recognition sequence or an arbitrary sequence). Binding affinity can be measured by a dissociation constant, K D (e.g., the K D of I-Crel for the WT recognition sequence is approximately 0.1 nM).
- a meganuclease has "altered" binding affinity if the K D of the recombinant meganuclease for a reference recognition sequence is increased or decreased by a significant (10-fold or more) amount relative to a reference meganuclease.
- the DNA-binding affinity of a polypeptide can be determined, for example, by filter-binding, electrophoretic mobility-shift, or immunoprecipitation assays, as well as by any other methods known in the art.
- affinity for dimer formation means the tendency of a meganuclease monomer to non-covalently associate with a reference meganuclease monomer.
- the affinity for dimer formation can be measured with the same monomer (i.e., homodimer formation) or with a different monomer (i.e., heterodimer formation) such as a reference wild-type meganuclease. Binding affinity can be measured by a dissociation constant, K D .
- a meganuclease has "altered" affinity for dimer formation if the K D of the recombinant meganuclease monomer for a reference meganuclease monomer is increased or decreased by a significant (10-fold or more) amount relative to a reference meganuclease monomer.
- the term "palindromic” refers to a recognition sequence consisting of inverted repeats of identical half-sites. In this case, however, the palindromic sequence need not be palindromic with respect to the four central base pairs, which are not contacted by the enzyme. In the case of dimeric meganucleases, palindromic DNA sequences are recognized by homodimers in which the two monomers make contacts with identical half- sites. [0081] As used herein, the term “pseudo-palindromic” refers to a recognition sequence consisting of inverted repeats of non-identical or imperfectly palindromic half- sites.
- the pseudo-palindromic sequence not only need not be palindromic with respect to the four central base pairs, but also can deviate from a palindromic sequence between the two half-sites.
- Pseudo-palindromic DNA sequences are typical of the natural DNA sites recognized by wild-type homodimeric meganucleases in which two identical enzyme monomers make contacts with different half- sites.
- non-palindromic refers to a recognition sequence composed of two unrelated half-sites of a meganuc lease.
- the non-palindromic sequence need not be palindromic with respect to either the four central base pairs or the two monomer half-sites.
- Non-palindromic DNA sequences are recognized by either di- LAGLIDADG meganucleases, highly degenerate mono-LAGLIDADG meganucleases (e.g., I-Ceul) or by heterodimers of mono-LAGLIDADG meganuclease monomers that recognize non-identical half-sites.
- the term "activity" refers to the rate at which a meganuclease of described herein cleaves a particular recognition sequence. Such activity is a measurable enzymatic reaction, involving the hydrolysis of phosphodiester bonds of double-stranded DNA.
- the activity of a meganuclease acting on a particular DNA substrate is affected by the affinity or avidity of the meganuclease for that particular DNA substrate which is, in turn, affected by both sequence-specific and non-sequence-specific interactions with the DNA. In inactive meganucleases, this activity is lacking.
- a meganuclease which is "inactive,” “inactivated” or “lacks catalytic activity” refers to a genetically-engineered meganuclease DNA-binding domain which cleaves the cleavage site of the wild-type enzyme at a rate that is reduced at least 10- fold, at least 100-fold, or at least 1, 000-fold, when compared to the wild-type enzyme under the same cleavage conditions, or which does not cleave the cleavage site of the wild-type enzyme at all. If no cleavage of the cleavage site of the wild-type enzyme can be observed, it is said that such cleavage is "abolished.”
- homologous recombination refers to the natural, cellular process in which a double-stranded DNA-break is repaired using a homologous DNA sequence as the repair template (see, e.g. Cahill et al. (2006), Front. Biosci. 11 : 1958-1976).
- the homologous DNA sequence may be an endogenous chromosomal sequence or an exogenous nucleic acid that was delivered to the cell.
- a catalytically active meganuclease can be used to cleave a recognition sequence within a target sequence and an exogenous nucleic acid with homology to or substantial sequence similarity with the target sequence can be delivered into the cell and used as a template for repair by homologous recombination.
- the DNA sequence of the exogenous nucleic acid which may differ significantly from the target sequence, is thereby incorporated into the chromosomal sequence.
- the process of homologous recombination occurs primarily in eukaryotic organisms.
- the term "homology" is used herein as equivalent to "sequence similarity" and is not intended to require identity by descent or phylogenetic relatedness.
- non-homologous end-joining refers to the natural, cellular process in which a double-stranded DNA-break is repaired by the direct joining of two non-homologous DNA segments (see, e.g. Cahill et al. (2006), Front. Biosci. 11 :1958- 1976). DNA repair by non-homologous end-joining is error-prone and frequently results in the untemplated addition or deletion of DNA sequences at the site of repair.
- a catalytically active meganuclease can be used to produce a double-stranded break at a meganuclease recognition sequence within a target sequence to disrupt a gene ⁇ e.g., by introducing base insertions, base deletions, or frameshift mutations) by non-homologous end- joining.
- An exogenous nucleic acid lacking homology to or substantial sequence similarity with the target sequence may be captured at the site of a meganuclease-stimulated double- stranded DNA break by non-homologous end-joining (see, e.g. Salomon, et al. (1998), EMBOJ. 17:6086-6095).
- the process of non-homologous end-joining occurs in both eukaryotes and prokaryotes such as bacteria.
- sequence of interest means any nucleic acid sequence, whether it codes for a protein, RNA, or regulatory element ⁇ e.g., an enhancer, silencer, or promoter sequence), that can be inserted into a genome or used to replace a genomic DNA sequence using a catalytically active meganuclease protein.
- Sequences of interest can have heterologous DNA sequences that allow for tagging a protein or RNA that is expressed from the sequence of interest.
- a protein can be tagged with tags including, but not limited to, an epitope ⁇ e.g., c-myc, FLAG) or other ligand ⁇ e.g., poly-His).
- a sequence of interest can encode a fusion protein, according to techniques known in the art (see, e.g., Ausubel et al, Current Protocols in Molecular Biology, Wiley 1999).
- the sequence of interest is flanked by a DNA sequence that is recognized by a catalytically active meganuclease for cleavage.
- the flanking sequences are cleaved allowing for proper insertion of the sequence of interest into genomic recognition sequences cleaved by the active meganuclease.
- the entire sequence of interest is homologous to or has substantial sequence similarity with the a target sequence in the genome such that homologous recombination effectively replaces the target sequence with the sequence of interest.
- the sequence of interest is flanked by DNA sequences with homology to or substantial sequence similarity with the target sequence such that homologous recombination inserts the sequence of interest within the genome at the locus of the target sequence.
- the sequence of interest is substantially identical to the target sequence except for mutations or other modifications in a meganuclease recognition sequence such that an active meganuclease can not cleave the target sequence after it has been modified by the sequence of interest.
- targeted transcriptional effector refers to a non-natural protein comprising a first domain comprising a non-naturally-occurring, rationally-designed meganuclease that has been modified relative to a wild-type meganuclease and a second domain comprising a natural or non-natural transcription effector domain.
- the first domain comprises a non-naturally-occurring, rationally-designed meganuclease that has been modified relative to a wild-type meganuclease with respect to DNA-binding specificity, DNA-binding affinity, and/or the ability to form heterodimers, and which has been inactivated with respect to its ability to cleave DNA.
- Such an inactive meganuclease is referred to as a "meganuclease DNA-binding domain.”
- the second domain comprises a natural or non-natural transcription effector domain.
- Such a transcription effector domain is able to interact directly or indirectly with the transcription machinery of a cell to either increase or decrease gene expression.
- the first and the second domains of a targeted transcriptional effectors can be fused together, or they can be connected through a flexible linker.
- domain linker means a chemical moiety which co valently joins a rationally-designed meganuclease DNA-binding domain and an effector domain (e.g., a transcription effector domain), having a backbone of chemical bonds forming a continuous connection between the peptides, and having a plurality of freely rotating bonds along that backbone.
- the domain linkers described herein have a backbone length (i.e., the sum of the bond lengths forming a continuous connection between the peptides) of at least about 13 A.
- a domain linker comprises a plurality of amino acid residues but this need not be the case.
- domain linkers are polypeptide linkers comprising 3-15 amino acid residues. Such domain linkers will have backbone lengths of approximately 13-65 A.
- the domain linkers can be substantially linear, biochemically inert, hydrophilic and/or non-cleavable by proteases, but branched domain linkers, or linkers with reactive moieties, hydrophobic residues and protease cleavage sites may be suitable for certain embodiments.
- the domain linkers can also be designed to lack secondary structure under physiological conditions.
- the domain linker sequences can be composed of a plurality of residues selected from the group consisting of glycine, serine, threonine, cysteine, asparagine, glutamine, and proline.
- domain linkers consist essentially of glycine and serine residues. Domain linkers including the larger, aromatic residues may also be included, although they may cause steric hindrance. Similarly, the charged amino acids may be included, but they may interact to form secondary structures, and the nonpolar amino acids may be included, but they may decrease solubility. Domain linkers which do not satisfy one or more of these criteria may prove to be at least as effective in some embodiments. [0092] For chemical synthesis of domain linkers, one of skill in the art of organic synthesis may design a wide variety of linkers which satisfy the requirements discussed above.
- appropriate end groups are chosen for the linker such that the linker may be joined to the chosen termini of the two proteins to be fused (e.g., using a naturally occurring amino acid, D-isomer amino acid, or modified amino acid, such as sarcosine or D-alanine, at one or both ends).
- domain linkers include polymers or copolymers of organic acids, aldehydes, alcohols, thiols, amines and the like.
- polymers or copolymers of hydroxy-, amino-, or di-carboxylic acids, such as glycolic acid, lactic acid, sebacic acid, or sarcosine may be employed.
- polymers or copolymers of saturated or unsaturated hydrocarbons such as ethylene glycol, propylene glycol, saccharides, and the like may be employed.
- One example of such a domain linker is polyethylene glycol (with or without, e.g., D-alanine at the ends), available from Shearwater Polymers, Inc.
- linkers can optionally have amide linkages, sulfhydryl linkages, or hetero functional linkages.
- Other examples include polymers or copolymers of non-naturally occurring amino acids (including, for example, D-isomers). Certain non-naturally occurring amino acids have characteristics which may be advantageous in connection with the present invention.
- N-methyl glycine sarcosine
- a polysarcosine linker (with or without, e.g., lysine at the ends) may be employed.
- domain linkers can be rationally designed using computer program capable of modeling both DNA- binding sites and the peptides themselves (Desjarlais & Berg (1993), Proc. Natl. Acad. Sci. USA 90:2256-2260 (1993), Desjarlais & Berg (1994), Proc. Natl. Acad. Sci. USA 91 : 11099-11103), or by phage display methods.
- non-covalent methods can be used to produce molecules with meganuclease DNA-binding domains associated with effector domains.
- a meganuclease DNA-binding domain can be expressed as a fusion protein such as maltose binding protein ("MBP"), glutathione S transferase (GST), hexahistidine, c-myc, and the FLAG epitope, for ease of purification, monitoring expression, or monitoring cellular and subcellular localization.
- MBP maltose binding protein
- GST glutathione S transferase
- hexahistidine hexahistidine
- c-myc hexahistidine
- FLAG epitope the term "single-chain meganuclease” refers to a non-naturally- occurring meganuclease comprising a pair of mono-L AGLID ADG meganuc leases that are co valently joined into a single polypeptide using an amino acid linker.
- a pair of rationally-designed meganucleases derived from I-Crel may be joined using an amino acid linker to join a first rationally-designed meganuclease monomer with a second rationally designed meganuclease monomer to produce a single-chain heterodimer (see, e.g., Example 5).
- Single-chain meganucleases typically comprise a pair of rationally-designed meganuclease subunits that recognize different half-sites such that the recognition sequence for a single-chain meganuclease is non-palindromic.
- percentage similarity and “sequence similarity” refer to a measure of the degree of similarity of two sequences based upon an alignment of the sequences which maximizes similarity between aligned amino acid residues or nucleotides, and which is a function of the number of identical or similar residues or nucleotides, the number of total residues or nucleotides, and the presence and length of gaps in the sequence alignment.
- a variety of algorithms and computer programs are available for determining sequence similarity using standard parameters.
- sequence similarity is measured using the BLASTp program for amino acid sequences and the BLASTn program for nucleic acid sequences, both of which are available through the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov), and are described in, for example, Altschul et al. (1990), J. MoI. Biol. 215:403 -410; Gish and States (1993), Nature Genet. 3:266-272; Madden et al. (1996), Meth. Enzymol. 266:131-141; Altschul et al. (1997), Nucleic Acids Res. 25:33 89-3402); Zhang et al. (2000), J. Comput. Biol.
- the term "corresponding to” is used to indicate that a specified modification in the first protein is a substitution of the same amino acid residue as in the modification in the second protein, and that the amino acid position of the modification in the first proteins corresponds to or aligns with the amino acid position of the modification in the second protein when the two proteins are subjected to standard sequence alignments (e.g., using the BLASTp program).
- the modification of residue "X” to amino acid "A” in the first protein will correspond to the modification of residue "Y” to amino acid "A” in the second protein if residues X and Y correspond to each other in a sequence alignment, and despite the fact that X and Y may be different numbers.
- variable As used herein, the recitation of a numerical range for a variable is intended to convey that the invention may be practiced with the variable equal to any of the values within that range. Thus, for a variable which is inherently discrete, the variable can be equal to any integer value within the numerical range, including the end-points of the range. Similarly, for a variable which is inherently continuous, the variable can be equal to any real value within the numerical range, including the end-points of the range.
- a variable which is described as having values between 0 and 2 can take the values 0, 1 or 2 if the variable is inherently discrete, and can take the values 0.0, 0.1, 0.01, 0.001, or any other real values > 0 and ⁇ 2 if the variable is inherently continuous.
- the word "or” is used in the inclusive sense of "and/or” and not the exclusive sense of "either/or.”
- recombinant meganucleases are rationally-designed by first predicting amino acid substitutions that can alter base preference at each position in the half-site. These substitutions can be experimentally validated individually or in combinations to produce meganucleases with the desired cleavage specificity.
- amino acid substitutions that can cause a desired change in base preference are predicted by determining the amino acid side chains of a reference meganuclease (e.g., a wild-type meganuclease, or a non-naturally-occurring reference meganuclease) that are able to participate in making contacts with the nucleic acid bases of the meganuclease's DNA recognition sequence and the DNA phosphodiester backbone, and the spatial and chemical nature of those contacts.
- a reference meganuclease e.g., a wild-type meganuclease, or a non-naturally-occurring reference meganuclease
- These amino acids include but are not limited to side chains involved in contacting the reference DNA half-site.
- this determination requires having knowledge of the structure of the complex between the meganuclease and its double-stranded DNA recognition sequence, or knowledge of the structure of a highly similar complex (e.g., between the same meganuclease and an alternative DNA recognition sequence, or between an allelic or phylogenetic variant of the meganuclease and its DNA recognition sequence).
- Three-dimensional structures, as described by atomic coordinates data, of a polypeptide or complex of two or more polypeptides can be obtained in several ways.
- protein structure determinations can be made using techniques including, but not limited to, X-ray crystallography, NMR, and computer simulations.
- Another approach is to analyze databases of existing structural co-ordinates for the meganuclease of interest or a related meganuclease.
- Such structural data is often available from databases in the form of three-dimensional coordinates. Often this data is accessible through online databases (e.g., the RCSB Protein Data Bank at www.rcsb.org/pdb).
- Structural information can be obtained experimentally by analyzing the diffraction patterns of, for example, X-rays or electrons, created by regular two- or three-dimensional arrays (e.g., crystals) of proteins or protein complexes. Computational methods are used to transform the diffraction data into three-dimensional atomic co-ordinates in space. For example, the field of X-ray crystallography has been used to generate three-dimensional structural information on many protein-DNA complexes, including meganucleases (see, e.g., Chevalier et ⁇ l. (2001), Nucleic Acids Res. 29(18): 3757-3774).
- NMR Nuclear Magnetic Resonance
- Multi-dimensional NMR methods combined with computational methods have succeeded in determining the atomic co-ordinates of polypeptides of increasing size (see, e.g., Tzakos et ⁇ l. (2006), Annu. Rev. Biophys. Biomol. Struct. 35:19-42.).
- computational modeling can be used by applying algorithms based on the known primary structures and, when available, secondary, tertiary and/or quaternary structures of the protein/DNA, as well as the known physiochemical nature of the amino acid side chains, nucleic acid bases, and bond interactions.
- Such methods can optionally include iterative approaches, or experimentally-derived constraints.
- An example of such computational software is the CNS program described in Adams et al. (1999), Acta Crystallogr. D. Biol. Crystallogr. 55 (Pt 1): 181-90.
- a variety of other computational programs have been developed that predict the spatial arrangement of amino acids in a protein structure and predict the interaction of the amino acid side chains of the protein with various target molecules (see, e.g., U.S. Pat. No. 6,988,041).
- computational models are used to identify specific amino acid residues that specifically interact with DNA nucleic acid bases and/or facilitate non-specific phosphodiester backbone interactions.
- computer models of the totality of the potential meganuclease-DNA interaction can be produced using a suitable software program, including, but not limited to, MOLSCRIPTTM 2.0 (Avatar Software AB, Sweden), the graphical display program O (Jones et. al. (1991), Acta Crystallography, A47: 110), the graphical display program GRASPTM (Nicholls et al.
- PROTEINS Structure, Function and Genetics 11(4): 28 Iff
- INSIGHTTM graphical display program
- Computer hardware suitable for producing, viewing and manipulating three-dimensional structural representations of protein- DNA complexes are commercially available and well known in the art ⁇ e.g., Silicon Graphics Workstation, Silicon Graphics, Inc., Mountainview, CA).
- interactions between a meganuclease and its double-stranded DNA recognition sequences can be resolved using methods known in the art.
- a representation, or model, of the three dimensional structure of a multi-component complex structure, for which a crystal has been produced can be determined using techniques which include molecular replacement or SIR/MIR (single/multiple isomorphous replacement) (see, e.g., Brunger (1997), Meth. Enzym. 276: 558-580; Navaza and Saludjian (1997), Meth. Enzym. 276: 581-594; Tong and Rossmann (1997), Meth. Enzym. 276: 594-611; and Bentley (1997), Meth.
- SIR/MIR single/multiple isomorphous replacement
- Enzym. 276: 611-619 can be performed using a software program, such as AMoRe/Mosflm (Navaza (1994), Acta Cryst. A50: 157-163; CCP4 (1994), Acta Cryst. D50: 760-763) or XPLOR (see, Brunger et al. (1992), X-PLOR Version 3.1. A System for X-ray Crystallography and NMR, Yale University Press, New Haven, CT). [0110] The determination of protein structure and potential meganuclease-DNA interaction allows for rational choices concerning the amino acids that can be changed to affect enzyme activity and specificity.
- amino acid side chain interactions with a particular base or DNA phosphodiester backbone.
- Chemical interactions used to determine appropriate amino acid substitutions include, but are not limited to, van der Waals forces, steric hindrance, ionic bonding, hydrogen bonding, and hydrophobic interactions.
- Amino acid substitutions can be selected which either favor or disfavor specific interactions of the meganuclease with a particular base in a potential recognition sequence half-site in order to increase or decrease specificity for that sequence and, to some degree, overall binding affinity and activity.
- amino acid substitutions can be selected which either increase or decrease binding affinity for the phosphodiester backbone of double-stranded DNA in order to increase or decrease overall activity and, to some degree, to decrease or increase specificity.
- a three-dimensional structure of a meganuclease- DNA complex is determined and a "contact surface" is defined for each base-pair in a DNA recognition sequence half-site.
- the contact surface comprises those amino acids in the enzyme with ⁇ -carbons less than 9.0 A from a major groove hydrogen- bond donor or acceptor on either base in the pair, and with side chains oriented toward the DNA, irrespective of whether the residues make base contacts in the wild-type meganuclease-DNA complex.
- residues can be excluded if the residues do not make contact in the wild-type meganuclease-DNA complex, or residues can be included or excluded at the discretion of the designer to alter the number or identity of the residues considered.
- the contact surfaces were limited to the amino acid positions that actually interact in the wild-type enzyme-DNA complex.
- the contact surfaces were defined to contain additional amino acid positions that are not involved in wild-type contacts but which could potentially contact a base if substituted with a different amino acid.
- residues are selected which are sufficiently close to contact the sense strand at position X and which favor the presence of an A, and/or residues are selected which are sufficiently close to contact the antisense strand at position X and which favor the presence of a T.
- a residue is considered sufficiently close if the ⁇ -carbon of the residue is within 9 A of the closest atom of the relevant base.
- an amino acid with a ⁇ -carbon within 9A of the DNA sense strand but greater than 9A from the antisense strand is considered for potential interactions with only the sense strand.
- an amino acid with a ⁇ -carbon within 9 A of the DNA antisense strand but greater than 9A from the sense strand is considered for potential interactions with only the antisense strand.
- Amino acids with ⁇ -carbons that are within 9A of both DNA strands are considered for potential interactions with either strand.
- potential amino acid substitutions are selected based on their predicted ability to interact favorably with one or more of the four DNA bases.
- the selection process is based upon two primary criteria: (i) the size of the amino acid side chains, which will affect their steric interactions with different nucleic acid bases, and (ii) the chemical nature of the amino acid side chains, which will affect their electrostatic and bonding interactions with the different nucleic acid bases.
- amino acids with shorter and/or smaller side chains can be selected if an amino acid ⁇ -carbon in a contact surface is ⁇ 6 A from a base, and amino acids with longer and/or larger side chains can be selected if an amino acid ⁇ - carbon in a contact surface is >6 A from a base.
- Amino acids with side chains that are intermediate in size can be selected if an amino acid ⁇ -carbon in a contact surface is 5-8 A from a base.
- the amino acids with relatively shorter and smaller side chains can be assigned to Group 1, including glycine (G), alanine (A), serine (S), threonine (T), cysteine (C), valine (V), leucine (L), isoleucine (I), aspartate (D), asparagine (N) and proline (P).
- G glycine
- A alanine
- S serine
- T threonine
- V valine
- L leucine
- I isoleucine
- D aspartate
- N asparagine
- proline proline
- Proline is expected to be used less frequently because of its relative inflexibility.
- glycine is expected to be used less frequently because it introduces unwanted flexibility in the peptide backbone and its very small size reduces the likelihood of effective contacts when it replaces a larger residue.
- glycine can be used in some instances for promoting a degenerate position.
- the amino acids with side chains of relatively intermediate length and size can be assigned to Group 2, including lysine (K), methionine (M), arginine (R), glutamate (E) and glutamine (Q).
- the amino acids with relatively longer and/or larger side chains can be assigned to Group 3, including lysine (K), methionine (M), arginine (R), histidine (H), phenylalanine (F), tyrosine (Y), and tryptophan (W). Tryptophan, however, is expected to be used less frequently because of its relative inflexibility.
- the side chain flexibility of lysine, arginine, and methionine allow these amino acids to make base contacts from long or intermediate distances, warranting their inclusion in both Groups 2 and 3. These groups are also shown in tabular form below:
- the different amino acids are evaluated for their potential interactions with the different nucleic acid bases (e.g. , van der Waals forces, ionic bonding, hydrogen bonding, and hydrophobic interactions) and residues are selected which either favor or disfavor specific interactions of the meganuclease with a particular base at a particular position in the double-stranded DNA recognition sequence half-site.
- residues are selected which either favor or disfavor specific interactions of the meganuclease with a particular base at a particular position in the double-stranded DNA recognition sequence half-site.
- residues which favor the presence of two or more bases, or residues which disfavor one or more bases can be achieved by sterically hindering a pyrimidine at a sense or antisense position.
- G bases Recognition of guanine (G) bases is achieved using amino acids with basic side chains that form hydrogen bonds to N7 and 06 of the base. Cytosine (C) specificity is conferred by negatively-charged side chains which interact unfavorably with the major groove electronegative groups present on all bases except C. Thymine (T) recognition is rationally-designed using hydrophobic and van der Waals interactions between hydrophobic side chains and the major groove methyl group on the base. Finally, adenine (A) bases are recognized using the carboxamide side chains Asn and GIn or the hydroxyl side chain of Tyr through a pair of hydrogen bonds to N7 and N6 of the base.
- His can be used to confer specificity for a purine base (A or G) by donating a hydrogen bond to N7.
- each amino acid residue can be assigned to one or more different groups corresponding to the different bases they favor (i.e., G, C, T or A).
- Group G includes arginine (R), lysine (K) and histidine (H);
- Group C includes aspartate (D) and glutamate (E);
- Group T includes alanine (A), valine (V), leucine (L), isoleucine (I), cysteine (C), threonine (T), methionine (M) and phenylalanine (F); and
- Group A includes asparagine (N), glutamine (N), tyrosine (Y) and histidine (H).
- More than one such residue comprising the contact surface can be selected for analysis and modification and, in some embodiments, each such residue is analyzed and multiple residues are modified.
- the distance between the ⁇ -carbon of a residue included in the contact surface and each of the two bases of the base pair at position X can be determined and, if the residue is within 9 A of both bases, then different substitutions can be made to affect the two bases of the pair (e.g., a residue from Group 1 to affect a proximal base on one strand, or a residue from Group 3 to affect a distal base on the other strand).
- a combination of residue substitutions capable of interacting with both bases in a pair can affect the specificity (e.g.
- non- random or site-directed mutagenesis techniques are used to create specific sequence modifications.
- Non-limiting examples of non-random mutagenesis techniques include overlapping primer PCR (see, e.g., Wang et al. (2006), Nucleic Acids Res. 34(2): 517-527), site-directed mutagenesis (see, e.g., U.S. Pat. No. 7,041,814), cassette mutagenesis (see, e.g., U.S. Pat. No. 7,041,814), and the manufacturer's protocol for the Altered Sites® II Mutagenesis Systems kit commercially available from Promega Biosciences, Inc. (San Luis Obispo, CA).
- the recognition and cleavage of a specific DNA sequence by a rationally- designed meganuclease can be assayed by any method known by one skilled in the art (see, e.g., U.S. Pat. Pub. No. 2006/0078552).
- the determination of meganuclease cleavage is determined by in vitro cleavage assays.
- Such assays use in vitro cleavage of a polynucleotide substrate comprising the intended recognition sequence of the assayed meganuclease and, in certain embodiments, variations of the intended recognition sequence in which one or more bases in one or both half-sites have been changed to a different base.
- the polynucleotide substrate is a double-stranded DNA molecule comprising a target site which has been synthesized and cloned into a vector.
- the polynucleotide substrate can be linear or circular, and typically comprises only one recognition sequence.
- the meganuclease is incubated with the polynucleotide substrate under appropriate conditions, and the resulting polynucleotides are analyzed by known methods for identifying cleavage products ⁇ e.g., electrophoresis or chromatography). If there is a single recognition sequence in a linear, double-strand DNA substrate, the meganuclease activity is detected by the appearance of two bands (products) and the disappearance of the initial full-length substrate band.
- meganuclease activity can be assayed as described in, for example, Wang et al. (1997), Nucleic Acid Res., 25: 3767-3776.
- the cleavage pattern of the meganuclease is determined using in vivo cleavage assays (see, e.g., U.S. Pat. Pub. No. 2006/0078552).
- the in vivo test is a single-strand annealing recombination test (SSA). This kind of test is known to those of skill in the art (Rudin et al. (1989), Genetics 122: 519-534; Fishman-Lobell et al. (1992), Science 258: 480-4).
- substitutions can be made to domains of the meganuclease enzymes other than those involved in DNA recognition and binding without complete loss of activity.
- Substitutions can be conservative substitutions of similar amino acid residues at structurally or functionally constrained positions, or can be non-conservative substitutions at positions which are less structurally or functionally constrained.
- Such substitutions, insertions and deletions can be identified by one of ordinary skill in the art by routine experimentation without undue effort.
- the recombinant meganucleases described herein include proteins having anywhere from 85% to 99% sequence similarity (e.g., 85%, 87.5%, 90%, 92.5%, 95%, 97.5%, 99%) to a reference meganuclease sequence.
- sequence similarity e.g., 85%, 87.5%, 90%, 92.5%, 95%, 97.5%, 99%
- the most N-terminal and C-terminal sequences are not clearly visible in X-ray crystallography studies, suggesting that these positions are not structurally or functionally constrained.
- residues 2-153 of SEQ ID NO: 1 for I-Crel residues 6-160 of SEQ ID NO: 6 for I-Msol, residues 3-186 of SEQ ID NO: 9 for I-Scel, and residues 5-211 of SEQ ID NO: 12 for I-CeuI.
- the LAGLIDADG meganuclease family is composed of more than 200 members from a diverse phylogenetic group of host organisms. All members of this family have one or two copies of a highly conserved LAGLIDADG motif along with other structural motifs involved in cleavage of specific DNA sequences. Enzymes that have a single copy of the LAGLIDADG motif (i.e., mono-LAGLIDADG meganucleases) function as dimers, whereas the enzymes that have two copies of this motif (i.e., di-LAGLIDADG meganucleases) function as monomers.
- LAGLIDADG family members recognize and cleave relatively long sequences (> 12 bp), leaving four nucleotide 3' overhangs. These enzymes also share a number of structural motifs in addition to the LAGLIDADG motif, including a similar arrangement of anti-parallel ⁇ -strands at the protein-DNA interface. Amino acids within these conserved structural motifs are responsible for interacting with the DNA bases to confer recognition sequence specificity.
- the overall structural similarity between some members of the family e.g., I-Crel, I-Msol, I-Scel and I-Ceul has been elucidated by X-ray crystallography.
- the members of this family can be modified at particular amino acids within such structural motifs to change the overall activity or sequence- specificity of the enzymes, and corresponding modifications can reasonable be expected to have similar results in other family members. See, generally, Chevalier et al. (2001), Nucleic Acid Res.29(18): 3757-3774).
- the present invention relates to non-naturally-occurring, rationally- designed meganucleases which are based upon or derived from the I-Crel meganuclease of Chlamydomonas reinhardtii.
- the wild-type amino acid sequence of the I-Crel meganuclease is shown in SEQ ID NO: 1, which corresponds to Genbank Accession # PO5725.
- Two recognition sequence half sites of the wild-type I-Crel meganuclease from crystal structure having PDB identifier (PDB ID) 1BP7 are shown below:
- Wild-type I-Crel also recognizes and cuts the following perfectly palindromic (except for the central N1-N4 bases) sequence:
- the palindromic sequence of SEQ ID NO: 4 and SEQ ID NO: 5 is considered to be a better substrate for the wild-type I-Crel because the enzyme binds this site with higher affinity and cleaves it more efficiently than the natural DNA sequence.
- this palindromic sequence cleaved by wild-type I-Crel is referred to as "WT" (see, e.g., Figure 2(A)).
- WT this palindromic sequence cleaved by wild-type I-Crel is referred to as "WT" (see, e.g., Figure 2(A)).
- the two recognition sequence half-sites are shown in bold on their respective sense strands.
- Figure l(A) depicts the interactions of a wild-type I-Crel meganuclease homodimer with a double-stranded DNA recognition sequence
- Figures l(B) shows the specific interactions between amino acid residues of the enzyme and bases at the -4 position of one half-site for a wild-type enzyme and one wild-type recognition sequence
- Figures 1(C)-(E) show the specific interactions between amino acid residues of the enzyme and bases at the -4 position of one half-site for three rationally-designed meganucleases described herein with altered specificity at position -4 of the half-site.
- the base preference at any specified base position of the half-site can be rationally altered to each of the other three base pairs using the methods disclosed herein.
- the wild-type recognition surface at the specified base position is determined (e.g. , by analyzing meganuclease-DNA complex co-crystal structures; or by computer modeling of the meganuclease-DNA complexes).
- Second, existing and potential contact residues are determined based on the distances between the ⁇ -carbons of the surrounding amino acid positions and the nucleic acid bases on each DNA strand at the specified base position.
- the I-Crel wild type meganuclease- DNA contact residues at position -4 involve a glutamine at position 26 which hydrogen bonds to an A base on the antisense DNA strand.
- Residue 77 was also identified as potentially being able to contact the -4 base on the DNA sense strand.
- the ⁇ -carbon of residue 26 is 5.9 A away from N7 of the A base on the antisense DNA strand, and the ⁇ -carbon of residue 77 is 7.15 A away from the C5 -methyl of the T on the sense strand.
- a C on the sense strand could hydrogen bond with a glutamic acid at position 77 and a G on the antisense strand could bond with glutamine at position 26 (mediated by a water molecule, as observed in the wild-type I-Crel crystal structure) (see Fig 1(C)); a G on the sense strand could hydrogen bond with an arginine at position 77 and a C on the antisense strand could hydrogen bond with a glutamic acid at position 26 (see Fig 1(D)); an A on the sense strand could hydrogen bond with a glutamine at position 77 and a T on the antisense strand could form hydrophobic contacts with an alanine at position 26 (see Fig.
- the wild-type contact Q26
- the wild-type contact Q26
- can be substituted e.g., with a serine residue
- complementary mutations at positions 26 and 77 can be combined to specify a particular base pair (e.g. , A26 specifies a T on the antisense strand and Q77 specifies an A on the sense strand (Fig. 1(E)).
- Bold entries are wild-type contact residues and do not constitute "modifications" as used herein.
- the present invention relates to non-naturally-occurring, rationally-designed meganucleases which are based upon or derived from the I-Msol meganuclease of Monomastix sp.
- the wild-type amino acid sequence of the I-Msol meganuclease is shown in SEQ ID NO: 6, which corresponds to Genbank Accession # AAL34387.
- Two recognition sequence half-sites of the wild-type I-Msol meganuclease from crystal structure having PDB identifier (PDB ID) 1M5X are shown below:
- recognition sequence is not perfectly palindromic, even outside the central four base pairs.
- the two recognition sequence half- sites are shown in bold on their respective sense strands.
- Bold entries are represent wild-type contact residues and do not constitute "modifications" as used herein.
- An asterisk indicates that the residue contacts the base on the antisense strand.
- the present invention relates to non-naturally-occurring, rationally-designed meganucleases which are based upon or derived from the I-Scel meganuclease of Saccharomyces cerevisiae.
- the wild-type amino acid sequence of the I- Scel meganuclease is shown in SEQ ID NO: 9, which corresponds to Genbank Accession # CAA09843.
- the recognition sequence of the wild-type I-Scel meganuclease from crystal structure having PDB identifier (PDB ID) 1R7M is shown below:
- recognition sequence is non-palindromic and there are not four base pairs separating half-sites.
- Bold entries are wild-type contact residues and do not constitute "modifications" as used herein. An asterisk indicates that the residue contacts the base on the antisense strand.
- the present invention relates to non-naturally-occurring, rationally-designed meganucleases which are based upon or derived from the I-Ceul meganuclease of Chlamydomonas eugametos.
- the wild-type amino acid sequence of the I- Ceul meganuclease is shown in SEQ ID NO: 12, which corresponds to Genbank Accession # P32761.
- Two recognition sequence half sites of the wild-type I-Ceul meganuclease from crystal structure having PDB identifier (PDB ID) 2EX5 are shown below:
- Bold entries are wild-type contact residues and do not constitute "modifications" as used herein. An asterisk indicates that the residue contacts the base on the antisense strand.
- the present invention is not intended to embrace certain recombinant meganucleases which have been described in the prior art, and which have been developed by alternative methods. These excluded meganucleases include those described by Arnould et al. (2006), J. MoI. Biol. 355: 443-58; Sussman et al. (2004), J. MoI. Biol. 342: 31- 41; Chames et al. (2005), Nucleic Acids Res. 33: el78; Seligman et al. (2002), Nucleic Acids Res. 30: 3870-9; and Ashworth et al.
- the present invention relates to non-naturally-occurring, rationally-designed meganucleases which are produced by combining two or more amino acid modifications as described in sections 2.2.1-2.2.4 above, in order to alter half-site preference at two or more positions in a DNA recognition sequence half-site.
- the enzyme DJl was derived from I- Crel by incorporating the modifications R30/E38 (which favor C at position -7), R40 (which favors G at position -6), R42 (which favors at G at position -5), and N32 (which favors complete degeneracy at position -9).
- the rationally-designed DJl meganuclease invariantly recognizes C -7 G_6 G_5 compared to the wild-type preference for A_ 7 A_ 6 C_5, and has increased tolerance for A at position -9.
- the ability to combine residue substitutions that affect different base positions is due in part to the modular nature of the LAGLIDADG meganucleases.
- a majority of the base contacts in the LAGLIDADG recognition interfaces are made by individual amino acid side chains, and the interface is relatively free of interconnectivity or hydrogen bonding networks between side chains that interact with adjacent bases. This generally allows manipulation of residues that interact with one base position without affecting side chain interactions at adjacent bases.
- the additive nature of the mutations listed in sections 2.2.1- 2.2.4 above is also a direct result of the method used to identify these mutations. The method predicts side chain substitutions that interact directly with a single base. Interconnectivity or hydrogen bonding networks between side chains is generally avoided to maintain the independence of the substitutions within the recognition interface.
- R40 and R42 can be combined with a serine or aspartic acid at position 28.
- Combinations of amino substitutions, identified as described herein, can be used to rationally alter the specificity of a wild-type meganuclease (or a previously modified meganuclease) from an original recognition sequence to a desired recognition sequence which may be present in a nucleic acid of interest (e.g., a genome).
- Figure 2A shows the "sense" strand of the I-Crel meganuclease recognition sequence WT (SEQ ID NO: 4 ) as well as a number of other sequences for which a rationally-designed meganuclease would be useful.
- recombinant meganucleases based on the I-Crel meganuclease can be rationally-designed for each of these desired recognition sequences, as well as any others, by suitable amino acid substitutions as described herein.
- the DNA-binding affinity of the recombinant meganucleases described herein can be modulated by altering certain amino acids that form the contact surface with the phosphodiester backbone of DNA.
- the contact surface comprises those amino acids in the enzyme with ⁇ -carbons less than 9 A from the DNA backbone, and with side chains oriented toward the DNA, irrespective of whether the residues make contacts with the DNA backbone in the wild-type meganuclease-DNA complex. Because DNA-binding is a necessary precursor to enzyme activity, increases/decreases in DNA-binding affinity have been shown to cause increases/decreases, respectively, in enzyme activity.
- an amino acid with a negatively-charged side chain e.g., glutamic acid, aspartic acid
- a negatively-charged side chain e.g., glutamic acid, aspartic acid
- the glutamic acid at position 80 in the I-Crel meganuclease is altered to either a lysine or a glutamine to increase activity.
- the tyrosine at position 66 of I-Crel is changed to arginine or lysine, which increases the activity of the meganuclease.
- enzyme activity is decreased by changing the lysine at position 34 of I-Crel to aspartic acid, changing the tyrosine at position 66 to aspartic acid, and/or changing the lysine at position 116 to aspartic acid.
- the activities of the recombinant meganucleases can be modulated such that the recombinant enzyme has anywhere from no activity to very high activity with respect to a particular recognition sequence.
- the DJl recombinant meganuclease when carrying glutamic acid mutation at position 26 loses activity completely.
- the combination of the glutamic acid substitution at position 26 and a glutamine substitution at position 80 creates a recombinant meganuclease with high specificity and activity toward a guanine at -4 within the recognition sequence half- site (see Figure 1(D)).
- amino acids at various positions in proximity to the phosphodiester DNA backbone can be changed to simultaneously affect both meganuclease activity and specificity.
- This "tuning" of the enzyme specificity and activity is accomplished by increasing or decreasing the number of contacts made by amino acids with the phosphodiester backbone.
- a variety of contacts with the phosphodiester backbone can be facilitated by amino acid side chains.
- ionic bonds, salt bridges, hydrogen bonds, and steric hindrance affect the association of amino acid side chains with the phosphodiester backbone.
- amino acid is negatively-charged or hydrophobic, mutate it to uncharged/polar (less effect) or positively-charged (K or R, more effect).
- amino acid is hydrophobic or uncharged/polar, mutate it to negatively-charged.
- the invention provides rationally-designed, non-naturally- occurring meganucleases which are heterodimers formed by the association of two monomers, one of which may be a wild-type and one or both of which may be a non- naturally-occurring or recombinant form.
- wild-type I-Crel meganuclease is normally a homodimer composed of two monomers that each bind to one half-site in the pseudo-palindromic recognition sequence.
- a heterodimeric recombinant meganuclease can be produced by combining two meganucleases that recognize different half-sites, for example by co-expressing the two meganucleases in a cell or by mixing two meganucleases in solution.
- the formation of heterodimers can be favored over the formation of homodimers by altering amino acids on each of the two monomers that affect their association into dimers.
- certain amino acids at the interface of the two monomers are altered from negatively-charged amino acids (D or E) to positively charged amino acids (K or R) on a first monomer and from positively charged amino acids to negatively-charged amino acids on a second monomer (Table 6).
- lysines at positions 7 and 57 are mutated to glutamic acids in the first monomer and glutamic acids at positions 8 and 61 are mutated to lysines in the second monomer.
- the result of this process is a pair of monomers in which the first monomer has an excess of positively-charged residues at the dimer interface and the second monomer has an excess of negatively-charged residues at the dimer interface.
- the first and second monomer will, therefore, associate preferentially over their identical monomer pairs due to the electrostatic interactions between the altered amino acids at the interface.
- amino acids at the interface of the two monomers can be altered to sterically hinder homodimer formation.
- amino acids in the dimer interface of one monomer are substituted with larger or bulkier residues that will sterically prevent the homodimer.
- Amino acids in the dimer interface of the second monomer optionally can be substituted with smaller residues to compensate for the bulkier residues in the first monomer and remove any clashes in the heterodimer, or can be unmodified.
- an ionic bridge or hydrogen bond can be buried in the hydrophobic core of a heterodimeric interface.
- a hydrophobic residue on one monomer at the core of the interface can be substituted with a positively charged residue.
- a hydrophobic residue on the second monomer, that interacts in the wild type homodimer with the hydrophobic residue substituted in the first monomer can be substituted with a negatively charged residue.
- the two substituted residues can form an ionic bridge or hydrogen bond.
- the electrostatic repulsion of an unsatisfied charge buried in a hydrophobic interface should disfavor homodimer formation.
- each monomer of the heterodimer can have different amino acids substituted in the DNA recognition region such that each has a different DNA half-site and the combined dimeric DNA recognition sequence is non-palindromic.
- the catalytic activity of a non-naturally-occurring, rationally-designed meganuclease can be reduced or eliminated by mutating amino acids involved in catalysis (e.g., the mutation of Q47 to E in I-Crel, see Chevalier et al. (2001), Biochemistry. 43:14015- 14026); the mutation of D44 or D 145 to N in I-Scel; the mutation of E66 to Q in I-Ceul; the mutation of D22 to N in I-Msol).
- the inactivated meganuclease can then be fused to an effector domain from another protein including, but not limited to, a transcription activator ⁇ e.g., the GAL4 transactivation domain or the VP 16 transactivation domain), a transcription repressor ⁇ e.g., the KRAB domain from the Kruppel protein), a DNA methylase domain ⁇ e.g., M.CviPI or M.SssI), or a histone acetyltransferase domain ⁇ e.g., HDACl or HDAC2).
- Chimeric proteins consisting of an engineered DNA-binding domain, most notably an engineered zinc finger domain, and an effector domain are known in the art (see, e.g., Papworth et al. (2006), Gene 366:27-38).
- the meganuclease will also comprise a nuclear localization signal ⁇ e.g. the SV40 NLS (SEQ ID NO. 38), which can be added to the N-terminus of the meganuclease domain).
- the meganuclease DNA-binding domain may comprise a mono- LAGLIDADG meganuclease domain which recognizes a palindromic or pseudo-palindromic DNA sequence.
- the meganuclease DNA-binding domain may comprise a single-chain meganuclease in which a pair of mono-LAGLIDADG subunits derived from I-Crel are joined into a single polypeptide. The latter embodiment is useful for the recognition of non-palindromic DNA sites.
- the engineered meganuclease DNA-binding domain (“meganuclease DNA-binding domain”) can recognize a DNA site in the gene or in the gene promoter. If the goal is gene activation, the meganuclease DNA- binding domain can recognize a DNA site in the promoter that is upstream from the start of gene transcription. If the goal is gene repression, the meganuclease DNA-binding domain can recognize a DNA site which is upstream or downstream from the transcription start site in either the promoter of the gene itself. In some embodiments, the meganuclease DNA- binding domain will recognize a DNA site that is within 2,000 bases of the transcription start site.
- the meganuclease DNA-binding domain will recognize a DNA site that is within 500 bases of the transcription start site. In the case of a meganuclease DNA-binding domain intended to repress gene expression, it may be useful if the meganuclease DNA-binding domain recognizes a DNA site which is as close to the transcription start site as possible.
- transcription start sites of many genes of interest are known in the art and can be readily found in the scientific literature or in databases such as GenBank (http://www.ncbi.nlm.nih.gov/Genbank/).
- the transcription start site for a gene of interest may be determined experimentally by RT-PCR or other methods that are known in the art (see, e.g., Ohara, et al. (1990), Nuc. Acids Res. 23:6997-7002).
- the meganuclease DNA-binding domain can be designed to bind a recognition sequence which is known in advance to be in an accessible region of the chromatin.
- the accessibility of a particular recognition sequence can be determined by DNaseI hypersensitivity analysis. Such analyses have been performed for many genes of interest and are well-known in the scientific literature. In cases where such data are not already publicly available, DNaseI sensitivity may be determined experimentally using standard protocols ⁇ e.g., Lu and Richardson (2004), Methods MoI. Biol. 287:77-86).
- a meganuclease DNA-binding domain may be produced that binds to a recognition sequence in or near the recognition sequence for a known, native transcription factor.
- the DNA sequences recognized by many native transcription factors are known in the art (see, e.g., the TRANSFAC database, www.gene-regulation.com). Where such DNA sequences appear in the promoters of genes, it is generally believed that those sites, as well as the immediately flanking regions, are accessible within the chromatin structure.
- a transcription effector domain will affect gene expression by interacting, directly or indirectly, with the cellular transcription machinery. Effector domains can be found as part of natural transcription factors and are distinguished by their ability to either activate or repress gene transcription.
- Many transcription activator domains are known in the art and include the GAL4 activation domain (comprising amino acids 768-881 of the S. cerevisiae GAL4 protein, SEQ ID NO: 39) and the Herpes virus VP 16 activation domain (comprising amino acids 413-490 of the HSV-I VP16 protein, SEQ ID NO: 40).
- Transcription repressor domains are also known in the art and include the KRAB (Kruppel Associated Box) family of repressor domains.
- KRAB domains are ubiquitous in nature where they are typically found as components Of CyS 2 HiS 2 zinc finger transcription factors (see, e.g., Huntley et al. (2006), Genome Res. 16:669-677).
- one KRAB domain suitable for some embodiments of the invention comprises amino acids 12-74 of the Rattus norvegicus Kid-1 protein (GenBank accession number Q02975, SEQ ID NO: 41).
- Transcription effector domains may be fused to either the N- or C-terminus of a meganuclease-derived DNA-binding domain.
- a meganuclease-derived DNA-binding domain In the case of meganuclease DNA-binding domains derived from I-Crel, it may be preferable to fuse the effector domain to the C- terminus.
- the use of flexible linkers rich in glycine and serine amino acids to join protein domains is known in the art ⁇ e.g., Mack et al.
- Domain linkers other than short, flexible amino acid linkers can, as described above, also be used.
- Targeted transcriptional effectors described herein can be used to control gene expression in isolated cells or organisms. For most applications, a targeted transcriptional effector will be produced to bind to and regulate a native promoter/gene in a prokaryotic or eukaryotic cell. In some cases, however, it may be desirable to produce a targeted transcriptional effector which binds to and regulates an exogenous promoter/gene that has been introduced into the cell. Such an exogenous promoter/gene could exist in the cell extrachromosomally ⁇ e.g., on a plasmid) or it could be integrated into the genome of the cell ⁇ e.g., by viral transduction).
- a targeted transcriptional effector may be produced to bind and regulate the genes of a virus ⁇ e.g. HIV or HSV-I) such that the pathogenicity of the virus is reduced.
- a targeted transcriptional effector may be used to reduce the expression of viral genes necessary for integration into the host genome, replication, the emergence from latency, virus particle formation, cell exit, or the evasion of host defenses.
- Targeted transcriptional effectors can be delivered to cells as protein or in the form of a nucleic acid which encodes the protein.
- the effects that a targeted transcriptional effector exert on the expression of a gene of interest will persist only as long as the targeted transcriptional effector itself exists within the cell.
- delivery of a targeted transcriptional effector in protein form can be expected to yield a transient effect on gene transcription ⁇ e.g., a few days).
- Delivery of a targeted transcriptional effector gene carried on a non-replicating nucleic acid ⁇ e.g., non-replicating plasmid DNA) to a cell can be expected to effect the transcription of the gene of interest for a longer period of time ⁇ e.g., days to weeks).
- a targeted transcriptional effector gene carried on a replicating nucleic acid ⁇ e.g. , a replicating plasmid or a virus that integrates into the genome
- Delivery of a targeted transcriptional effector gene carried on a replicating nucleic acid can be expected to effect the expression of a gene of interest for the greatest length of time and can be made permanent.
- the present disclosure provides targeted transcriptional effectors that have been engineered to specifically recognize, with high efficacy, endogenous cellular genes.
- targeted transcriptional effectors based on engineered meganucleases can be used to regulate expression of an endogenous cellular gene that is present in its native chromatin environment.
- the methods of regulation use targeted transcriptional effectors with a IQ for the targeted recognition sequence of less than about 25 nM to activate or repress gene transcription.
- the targeted transcriptional repressors can be used to decrease transcription of an endogenous cellular gene by 20% or more, and targeted transcriptional activators can be used to increase transcription of an endogenous cellular gene by 20% or more (as measured by changes in transcript number during the first half-life of the targeted transcriptional effector after administration).
- the methods described herein for regulating gene expression allow for novel human and mammalian therapeutic applications, e.g. , treatment of genetic diseases; cancer; fungal, protozoal, bacterial, and viral infection; ischemia; vascular disease; arthritis; immunological disorders; etc., as well as providing means for functional genomics assays, and means for developing plants with altered phenotypes, including disease resistance, fruit ripening, sugar and oil composition, yield, and color.
- targeted transcriptional activators can be designed to recognize any suitable target site, for regulation of expression of any endogenous gene of choice.
- endogenous genes suitable for regulation include VEGF, CCR5, ERa, Her2/Neu, Tat, Rev, HBV C, S, X, and P, LDL-R, PEPCK, CYP7, Fibrinogen, ApoB, Apo E, Apo(a), renin, NF- ⁇ B, I- ⁇ B, TNF- ⁇ , FAS ligand, amyloid precursor protein, atrial naturetic factor, ob-leptin, ucp-1, IL-I, IL-2, IL-3, IL-4, IL-5, IL-6, IL- 12, G-CSF, GM-CSF, Epo, PDGF, PAF, p53, Rb, fetal hemoglobin, dystrophin, eutrophin, GDNF, NGF, IGF-I, VEGF receptors fit and
- suitable genes to be regulated include cytokines, lymphokines, growth factors, mitogenic factors, chemotactic factors, onco-active factors, receptors, potassium channels, G-proteins, signal transduction molecules, and other disease-related genes.
- a general theme in transcription factor regulation of gene expression is that simple binding and sufficient proximity to the promoter are all that is generally needed. Exact positioning relative to the promoter, orientation and, within limits, distance do not matter greatly. This feature allows considerable flexibility in choosing sites for constructing artificial transcription factors. Therefore, the target site recognized by the targeted transcriptional effector can be any suitable site in the target gene that will allow activation or repression of gene expression by a targeted transcriptional effector, optionally linked to a regulatory domain.
- Possible target sites include regions adjacent to, downstream, or upstream of the transcription start site.
- specific regulatory sites e.g., SP-I sites, hypoxia response elements, nuclear receptor recognition elements, p53 binding sites
- sites in the cDNA encoding region or in an expressed sequence tag (EST) coding region e.g., EST, expressed sequence tag (EST) coding region.
- the targeted transcriptional activator is linked to at least one or more regulatory domains, described below.
- regulatory domains include transcription factor repressor or activator domains such as KRAB and VP 16, co-repressor and co-activator domains, DNA methyl transferases, histone acetyltransferases, histone deacetylases, and endonucleases such as Fokl.
- transcription factor repressor or activator domains such as KRAB and VP 16
- co-repressor and co-activator domains DNA methyl transferases
- histone acetyltransferases histone deacetylases
- endonucleases such as Fokl.
- the expression of the gene is reduced by about 20% (i.e., 80% of non-targeted transcriptional activator modulated expression), about 50% (i.e., 50% of non-targeted transcriptional activator modulated expression), or about 75-100% (i.e., 25% to 0% of non- targeted transcriptional activator modulated expression).
- typically expression is activated by about 20% (i.e., 120% of non-targeted transcriptional activator modulated expression), about 50% (i.e., 150% of non-targeted transcriptional activator modulated expression), about 100% (i.e., 200% of non-targeted transcriptional activator modulated expression), about 5-10 fold (i.e., 500-1000% of non-targeted transcriptional activators modulated expression), up to at least 100 fold or more.
- targeted transcriptional effectors activators and repressors
- tet-regulated systems and the RU-486 system see, e.g., Gossen & Bujard (1992), Proc. Natl. Acad. ScL USA 89:5547; Oligino et al. (1998), Gene Ther. 5:491-496; Wang et al. (1997), Gene Ther. 4:432-441; Neering et al. (1996), Blood 88:1147-1155; and Rendahl et al. (1998), Nat. Biotechnol. 16:757-761).
- a promoter can be a normal cellular promoter or, for example, a promoter of an infecting microorganism such as, for example, a bacterium or a virus.
- the long terminal repeat (LTR) of retroviruses is a promoter region which may be a target for a modified zinc finger binding polypeptide.
- Promoters from members of the Lentivirus group which include such pathogens as human T-cell lymphotrophic virus (HTLV) 1 and 2, or human immunodeficiency virus (HIV) 1 or 2 are examples of viral promoter regions which may be targeted for transcriptional modulation by a modified zinc finger binding polypeptide as described herein.
- a "promoter” is defined as an array of nucleic acid control sequences that direct transcription.
- a promoter typically includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of certain RNA polymerase II type promoters, a TATA element, enhancer, CCAAT box, SP-I site, etc.
- a promoter also optionally includes distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription.
- the promoters often have an element that is responsive to transactivation by a DNA-binding moiety such as a polypeptide, e.g., a nuclear receptor, Gal4, the lac repressor and the like.
- a "transcriptional activator” and a “transcriptional repressor” refer to proteins or functional fragments of proteins that have the ability to modulate transcription.
- proteins include, e.g., transcription factors and co-factors (e.g., KRAB, MAD, ERD, SID, nuclear factor kappa B subunit p65, early growth response factor 1, and nuclear hormone receptors, VP 16, VP64), endonucleases, integrases, recombinases, methyltransferases, histone acetyltransferases, histone deacetylases etc.
- transcription factors and co-factors e.g., KRAB, MAD, ERD, SID, nuclear factor kappa B subunit p65, early growth response factor 1, and nuclear hormone receptors, VP 16, VP64
- endonucleases e.g., integrases, recombinases, methyltransferases, histone acetyltransfer
- Activators and repressors include co-activators and co-repressors (see, e.g., Utley et ⁇ l. (1998), Nature 394: 498-502).
- a "fusion molecule” is a molecule in which two or more subunit molecules are physically joined or linked (e.g., covalently).
- the subunit molecules can be the same chemical type of molecule, or can be different chemical types of molecules.
- Examples of the first type of fusion molecule include, but are not limited to, fusion polypeptides (for example, a fusion between an engineered meganuclease DNA-binding domain and a transcriptional effector domain) and fusion nucleic acids (for example, a nucleic acid encoding the fusion polypeptide described herein).
- An example of the second type of fusion molecule includes, but is not limited to, a fusion between a DNA-binding protein and a nucleic acid.
- the invention provides a targeted transcriptional effector comprising: (i) an engineered meganuclease DNA-binding domain lacking endonuclease cleavage activity that is engineered to bind to a target site in a gene of interest; and (ii) a regulatory domain, wherein the targeted regulator binds to the target site and regulates a desired function.
- the engineered meganuclease DNA-binding domain can be covalently or non-covalently associated with one or more regulatory domains, alternatively two or more regulatory domains, with the two or more domains being two copies of the same domain, or two different domains.
- the regulatory domains can be covalently linked to the engineered meganuclease DNA-binding domain, e.g., via an amino acid linker, as part of a fusion protein.
- the engineered meganuclease DNA-binding domains can also be associated with a regulatory domain via a non-covalent dimerization domain, e.g., a leucine zipper, a STAT protein N terminal domain, or an FK506 binding protein (see, e.g., O'Shea, Science. 254: 539 (1991), Barahmand-Pour et al, Curr. Top. Microbiol. Immunol. 211 : 121-128 (1996); Klemm etal., Annu. Rev. Immunol.
- the regulatory domain can be associated with the engineered meganuclease DNA- binding domain at any suitable position, including the C-or N-terminus of the engineered meganuclease DNA-binding domain.
- Common regulatory domains for addition to the engineered meganuclease DNA- binding domain include, e.g., effector domains from transcription factors (activators, repressors, co-activators, co-repressors), silencers, nuclear hormone receptors, oncogene transcription factors ⁇ e.g., myc, jun, fos, myb, max, mad, rel, ets, bcl, myb, mos family members etc.
- Transcription factor polypeptides from which one can obtain a regulatory domain include those that are involved in regulated and basal transcription.
- Such polypeptides include transcription factors, their effector domains, coactivators, silencers, nuclear hormone receptors (see, e.g., Goodrich et al, Cell 84: 825-30 (1996) for a review of proteins and nucleic acid elements involved in transcription; transcription factors in general are reviewed in Barnes & Adcock, Clin. Exp. Allergy. 25 Suppl. 2: 46-9 (1995) and Roeder, Methods Enzymol. 273: 165-71 (1996)).
- Databases dedicated to transcription factors are known (see, e.g., Science. 269: 630 (1995)).
- Nuclear hormone receptor transcription factors are described in, for example, Rosen et al, J. Med. Chem. 38: 4855- 74 (1995).
- TATA box binding protein TBP
- TAF polypeptides which include TAF30, TAF55, TAF80, TAFl 10, TAF150, and TAF250
- TAF30, TAF55, TAF80, TAFl 10, TAF150, and TAF250 TAF30, TAF55, TAF80, TAFl 10, TAF150, and TAF250
- TAF30, TAF55, TAF80, TAFl 10, TAF150, and TAF250 TAF30, TAF55, TAF80, TAFl 10, TAF150, and TAF250
- the STAT family of transcription factors are reviewed in, for example, Barahmand- Pour et al., Curr. Top. Microbiol. Immunol. 211 : 121-8 (1996). Transcription factors involved in disease are reviewed in Aso et al., J. Clin. Invest. 97: 1561-9 (1996).
- the KRAB repression domain from the human KOX-I protein is used as a transcriptional repressor (Thiesen et al., New Biologist. 2: 363-374 (1990); Margolin et al., PNAS. 91 : 4509-4513 (1994); Pengue et al., Nucl. Acids Res. 22: 2908- 2914 (1994); Witzgall et al, PNAS. 91 : 4514-4518 (1994)).
- KAP-I a KRAB co-repressor
- KRAB is used with KRAB (Friedman et al, Genes Dev. 10: 2067-2078 (1996)).
- KAP-I can be used alone with a engineered meganuclease DNA- binding domain.
- Other transcription factors and transcription factor domains that act as transcriptional repressors include MAD (see, e.g., Sommer et al, J. Biol. Chem. 273: 6632- 6642 (1998); Gupta et al, Oncogene. 16: 1149-1159 (1998); Queva et al, Oncogene. 16: 967-977 (1998); Larsson et al, Oncogene. 15: 737-748 (1997); Laherty et al, Cell. 89: 349- 356 (1997); and Cultraro et al, MoI. Cell. Biol.
- the HSV VP 16 activation domain is used as a transcriptional activator (see, e.g., Hagmann et al, J. Virol. 71 : 5952-5962 (1997)).
- Other transcription factors that could supply activation domains include the VP64 activation domain (Seipel et al, EMBO J I l : 4961-4968 (1996)) ; nuclear hormone receptors (see, e.g., Torchia et al, Curr. Opin. Cell. Biol. 10: 373-383 (1998)) ; the p65 subunit of nuclear factor kappa B (Bitko & Barik, J. Virol.
- kinases, phosphatases, and other proteins that modify polypeptides involved in gene regulation are also useful as regulatory domains for engineered meganuclease DNA- binding domains. Such modifiers are often involved in switching on or off transcription mediated by, for example, hormones.
- useful domains can also be obtained from the gene products of oncogenes (e.g., myc, jun, fos, myb, max, mad, rel, ets, bcl, myb, mos family members) and their associated factors and modifiers.
- Oncogenes are described in, for example, Cooper, Oncogenes, 2nd ed., The Jones and Bartlett Series in Biology, Boston, MA, Jones and Bartlett Publishers, 1995.
- the ets transcription factors are reviewed in Waslylk et al., Eur. J. Biochem. 211 : 7-18 (1993) and Crepieux et al, Crit. Rev. Oncog. 5: 615-38 (1994).
- Myc oncogenes are reviewed in, for example, Ryan et al., Biochem. J. 314: 713- 21 (1996).
- the jun and fos transcription factors are described in, for example, The Fos and Jun Families of Transcription Factors, Angel & Herrlich, eds. (1994).
- the max oncogene is reviewed in Hurlin et al., Cold Spring Harb. Symp. Quant. Biol. 59: 109-16.
- the myb gene family is reviewed in Kanei-Ishii et al., Curr. Top. Microbiol. Immunol. 211 :89-98 (1996).
- the mos family is reviewed in Yew et al., Curr. Opin. Genet. Dev.
- Engineered meganuclease DNA-binding domains can include regulatory domains obtained from DNA repair enzymes and their associated factors and modifiers. DNA repair systems are reviewed in, for example, Vos, Curr. Opin. Cell Biol. 4: 385-95 (1992); Sancar, Ann. Rev. Genet. 29: 69-105 (1995); Lehmann, Genet. Eng. 17: 1-19 (1995); and Wood, Ann. Rev. Biochem. 65: 135-67 (1996).
- DNA rearrangement enzymes and their associated factors and modifiers can also be used as regulatory domains (see, e.g., Gangloff et al., Experientia. 50: 261-9 (1994); Sadowski, FASEB J. 7: 760-7 (1993)).
- regulatory domains can be derived from DNA modifying enzymes ⁇ e.g., DNA methy transferases, topoisomerases, helicases, ligases, kinases, phosphatases, polymerases) and their associated factors and modifiers.
- Helicases are reviewed in Matson et al., Bioessays, 16: 13-22 (1994), and methyltransferases are described in Cheng, Curr. Opin. Struct. Biol. 5: 4-10 (1995).
- Chromatin associated proteins and their modifiers ⁇ e.g., kinases, acetylases and deacetylases), such as histone deacetylase (Wolffe, Science.
- the regulatory domain is a DNA methyl transferase that acts as a transcriptional repressor (see, e.g., Van den Wyngaert et al, FEBS Lett. 426: 283-289 (1998); Flynn et al, J. MoL Biol. 279: 101-116 (1998); Okano et al., Nucleic Acids Res. 26: 2536-2540 (1998); and Zardo & Caiafa, J. Biol. Chem. 273: 16517-16520 (1998)).
- Factors that control chromatin and DNA structure, movement and localization and their associated factors and modifiers; factors derived from microbes (e.g. , prokaryotes, eukaryotes and virus) and factors that associate with or modify them can also be used to obtain chimeric proteins.
- recombinases and integrases are used as regulatory domains.
- histone acetyltransferase is used as a transcriptional activator (see, e.g., Jin & Scotto, MoI. Cell. Biol. 18: 4377-4384 (1998); Wolffe, Science. 272: 371-372 (1996); Taunton et al, Science.
- histone deacetylase is used as a transcriptional repressor (see, e.g., Jin & Scotto, MoI. Cell. Biol. 18: 4377- 4384 (1998); Syntichaki & Thireos, J. Biol. Chem. 273: 24414-24419 (1998); Sakaguchi et al, Genes Dev. 12: 2831-2841 (1998) ; and Martinez et al, J. Biol. Chem. 273: 23781- 23785 (1998)).
- MBD- 2B methyl binding domain protein 2B
- Another useful repression domain is that associated with the v-ErbA protein (see infra). See, for example, Damm, et al (1989) Nature. 339: 593-597; Evans (1989) Int. J. Cancer Suppl. 4: 26-28; Pain et al (1990) New Biol. 2: 284-294 ; Sap et al (1989) Nature. 340: 242-244; Zenke et al. (1988) Cell.
- Additional exemplary repression domains include, but are not limited to, thyroid hormone receptor (TR, see inf7a), SID, MBDl, MBD2, MBD3, MBD4, MBD- like proteins, members of the DNMT family (e.g., DNMTl, DNMT3A, DNMT3B), Rb, MeCPl and MeCP2.
- TR thyroid hormone receptor
- Additional exemplary repression domains include, but are not limited to, R0M2 and AtHD2A. See, for example, Chern et al (1996) Plant Cell. 8: 305-321; and Wu et al (2000) Plant J. 22: 19-27.
- NHR nuclear hormone receptor
- TRs thyroid hormone receptors
- RARs retinoic acid receptors
- a nuclear hormone receptor transcription control domain can be fused to a engineered meganuclease DNA-binding domain such that the transcriptional regulatory activity of the receptor can be targeted to a chromosomal region of interest (e.g., a gene) by virtue of the engineered meganuclease DNA-binding domain.
- a chromosomal region of interest e.g., a gene
- the structure of TR and other nuclear hormone receptors can be altered, either naturally or through recombinant techniques, such that it loses all capacity to respond to hormone (thus losing its ability to drive transcriptional activation), but retains the ability to effect transcriptional repression. This approach is exemplified by the transcriptional regulatory properties of the oncoprotein v-ErbA.
- the v-ErbA protein is one of the two proteins required for leukemic transformation of immature red blood cell precursors in young chicks by the avian erythroblastosis virus.
- TR is a major regulator of erythropoiesis (Beug et al, Biochim Biophys Acta. 1288 (3): M35-47 (1996); in particular, in its unliganded state, it represses genes required for cell cycle arrest and the differentiated state.
- the administration of thyroid hormone to immature erythroblasts leads to their rapid differentiation.
- the v-ErbA oncoprotein is an extensively mutated version of TR; these mutations include: (i) deletion of 12 amino- terminal amino acids; (ii) fusion to the gag oncoprotein; (iii) several point mutations in the DNA binding domain that alter the DNA binding specificity of the protein relative to its parent, TR, and impair its ability to heterodimerize with the retinoid X receptor; (iv) multiple point mutations in the ligand- binding domain of the protein that effectively eliminate the capacity to bind thyroid hormone; and (v) a deletion of a carboxy-terminal stretch of amino acids that is essential for transcriptional activation. Stunnenberg et al, Biochim Biophys Acta. 1423 (1): F15-33 (1999).
- v- ErbA retains the capacity to bind to naturally occurring TR target genes and is an effective transcriptional repressor when bound (Umov et al, supra ; Sap et al, Nature. 340: 242-244 (1989) ; and Ciana et al, EMBO J. 17 (24): 7382- 7394 (1999).
- TR Reactive Regulatory Commission
- v-ErbA is completely insensitive to thyroid hormone, and thus maintains transcriptional repression in the face of a challenge from any concentration of thyroids or retinoids, whether endogenous to the medium, or added by the investigator.
- v-ErbA or its functional fragments are used as a repression domain.
- TR or its functional domains are used as a repression domain in the absence of ligand and/or as an activation domain in the presence of ligand ⁇ e.g., 3,5, 3'-triiodo-L-thyronine or T3).
- TR can be used as a switchable functional domain (i.e., a bifunctional domain); its activity (activation or repression) being dependent upon the presence or absence (respectively) of ligand.
- Additional exemplary repression domains are obtained from the DAX protein and its functional fragments. Zazopoulos et al, Nature. 390: 311-315 (1997). In particular, the C-terminal portion of DAX-I, including amino acids 245-470, has been shown to possess repression activity. Altincicek et al, J. Biol. Ther. 275: 7662-7667 (2000).
- a further exemplary repression domain is the RBPl protein and its functional fragments.
- RBPl polypeptide 21 2918-2932 (2001) and WO 01/04296.
- the full-length RBPl polypeptide contains 1257 amino acids.
- Exemplary functional fragments of RBPl are a polypeptide comprising amino acids 1114-1257, and a polypeptide comprising amino acids 243-452.
- Members of the TIEG family of transcription factors contain three repression domains known as Rl, R2 and R3. Repression by TIEG family proteins is achieved at least in part through recruitment of mSIN3A histone deacetylases complexes. Cook et al. (1999) J. Biol. Chem. 21 A: 29,500-29, 504; Zhang et al. (200I) Mo/. Cell. Biol.
- repression domains or their functional fragments
- HCMV human cytomegalovirus
- the product of the human cytomegalovirus (HCMV) UL34 open reading frame acts as a transcriptional repressor of certain HCMV genes, for example, the US3 gene.
- HCMV human cytomegalovirus
- the UL34 gene product, or functional fragments thereof can be used as a component of a fusion polypeptide also comprising a zinc finger binding domain. Nucleic acids encoding such fusions are also useful in the methods and compositions disclosed herein.
- CDF-I transcription factor and/or its functional fragments. See, for example, WO 99/27092.
- the Ikaros family of proteins are involved in the regulation of lymphocyte development, at least in part by transcriptional repression. Accordingly, an Ikaros family member (e.g., Ikaros, Aiolos) or a functional fragment thereof, can be used as a repression domain. See, for example, Sabbattini et al. (2001) EMBOJ. 20: 2812-2822.
- the yeast Ashlp protein comprises a transcriptional repression domain. Maxon et al. (2001) Proc. Natl. Acad. ScL USA 98 : 1495-1500.
- Ashlp protein its functional fragments, and homologues of Ashlp, such as those found, for example, in, vertebrate, mammalian, and plant cells, can serve as a repression domain for use in the methods and compositions disclosed herein.
- Additional exemplary repression domains include those derived from histone deacetylases (HDACs, e.g., Class I HDACs, Class II HDACs, SIR-2 homologues), HDAC- interacting proteins ⁇ e.g., SIN3, SAP30, SAP15, NCoR, SMRT, RB, plO7, pl30, RBAP46/48, MTA, Mi-2, Brgl, Brm), DNA-cytosine methyltransferases ⁇ e.g., Dnmtl, Dnmt3a, Dnmt3b), proteins that bind methylated DNA (e.g., MBDl, MBD2, MBD3, MBD4, MeCP2, DMAPl), protein methyltransferases (e.g., lysine and arginine methylases, SuVar homologues such as Suv39Hl), polycomb-type repressors (e.g., Bmi- 1, eedl, RINGl
- exemplary repression domains include members of the polycomb complex and their homologues, HPHl, HPH2, HPC2, NC2, groucho, Eve, tramtrak, mHPl, SIPl, ZEBl, ZEB2, and Enxl/Ezh2.
- HPHl full- length protein
- HPC2 HPC2
- NC2 NC2
- groucho Eve
- tramtrak mHPl
- SIPl ZEBl
- ZEB2 ZEB2
- Enxl/Ezh2 Enxl/Ezh2
- any homologues of the aforementioned proteins can also be used as repression domains, as can proteins (or their functional fragments) that interact with any of the aforementioned proteins.
- Hesl is a human homologue of the Drosophila hairy gene product and comprises a functional fragment encompassing amino acids 910-1014.
- a WRPW (trp-arg- pro-trp) motif can act as a repression domain. Fisher et al (1996) MoI. Cell. Biol. 16: 2670- 2677.
- TLEl, TLE2 and TLE3 proteins are human homologues of the Drosophila groucho gene product. Functional fragments of these proteins possessing repression activity reside between amino acids 1-400. Fisher et al., supra.
- the Tbx3 protein possesses a functional repression domain between amino acids 524-721. He et al. (1999) Proc. Natl. Acad. Sci. USA 96: 10,212-10, 217.
- the Tbx2 gene product is involved in repression of the pl4/pl6 genes and contains a region between amino acids 504-702 that is homologous to the repression domain of Tbx3; accordingly Tbx2 and/or this functional fragment can be used as a repression domain.
- Carreira et al. (1998) MoI. Cell. Biol. 18: 5,099-5, 108.
- the human Ezh2 protein is a homologue of Drosophila e71ha71cer of zeste and recruits the eedl poly comb-type repressor.
- a region of the Ezh2 protein comprising amino acids 1-193 can interact with eedl and repress transcription ; accordingly Ezh2 and/or this functional fragment can be used as a repression domain. Denisenko et al. (1998) MoI. Cell. Biol. 18: 5634-5642.
- the RYBP protein is a corepressor that interacts with polycomb complex members and with the YYl transcription factor.
- a region of RYBP comprising amino acids 42-208 has been identified as functional repression domain.
- the RING finger protein RING 1 A is a member of two different vertebrate polycomb-type complexes, contains multiple binding sites for various components of the polycomb complex, and possesses transcriptional repression activity. Accordingly, RING 1 A or its functional fragments can serve as a repression domain. Satjin et al. (1997) MoI. Cell. Biol. 17: 4105-4113.
- the Bmi-1 protein is a member of a vertebratepolycomb complex and is involved in transcriptional silencing. It contains multiple binding sites for various polycomb complex components. Accordingly, Bmi-1 and its functional fragments are useful as repression domains. Gunster et al. (1997) MoI. Cell. Biol. 17: 2326-2335; Hemenway et al. (1998) Oncogen. 16: 2541-2547.
- the E2F6 protein is a member of the mammalian Bmi-1-containingpolycomb complex and is a transcriptional repressor that is capable or recruiting RYBP, Bmi-1 and RINGlA.
- a functional fragment of E2F6 comprising amino acids 129-281 acts as a transcriptional repression domain. Accordingly, E2F6 and its functional fragments can be used as repression domains. Trimarchi et al. (2001) Proc Natl. Acad. Sci. USA 98: 1519- 1524.
- the eedl protein represses transcription at least in part through recruitment of histone deacetylases ⁇ e.g., HDAC2). Repression activity resides in both the N-and C- terminal regions of the protein. Accordingly, eedl and its functional fragments can be used as repression domains, van der Vlag et al. (1999) Nature Genet. 23: 474-478.
- the CTB P2 protein represses transcription at least in part through recruitment of an HPC2-polycomb complex. Accordingly, CTBP2 and its functional fragments are useful as repression domains. Richard et al. (1999) MoI. Cell. Biol. 19: 777-787.
- Neuron-restrictive silencer factors are proteins that repress expression of neuron- specific genes. Accordingly, a NRSF or functional fragment thereof can serve as a repression domain. See, for example, US Patent No. 6,270,990.
- any repressor or a molecule that interacts with a repressor is suitable as a functional domain.
- any molecule capable of recruiting a repressive complex and/or repressive activity (such as, for example, histone deacetylation) to the target gene is useful as a repression domain of a fusion protein.
- Additional exemplary activation domains include, but are not limited to, p300, CBP, PCAF, SRCl PvALF, AtHD2A and ERF-2. See, for example, Robyr et al. (2000) MoI. Endocrinol. 14: 329-347; Collingwood et al. (1999) J.
- Additional exemplary activation domains include, but are not limited to, OsGAI, HALF-I, Cl, API, ARF-5, -6,-7, and-8, CPRFl, CPRF4, MYC-RP/GP, and TRABl.
- OsGAI OsGAI
- HALF-I HLF-I
- Cl CIR-5
- -6,-7 CRF4
- CPRFl CPRF4
- MYC-RP/GP TRABl.
- any activator or a molecule that interacts with an activator is suitable as a functional domain.
- any molecule capable of recruiting an activating complex and/or activating activity (such as, for example, histone acetylation) to the target gene is useful as an activating domain of a fusion protein.
- Insulator domains, chromatin remodeling proteins such as ISWI-containing domains and/or methyl binding domain proteins suitable for use as functional domains in fusion molecules are described, for example, in co-owned WO 01/83793; WO 02/26959; WO 02/26960 and WO 02/44376.
- an engineered meganuclease DNA-binding domain is fused to a bifunctional domain (BFD).
- a bifunctional domain is a transcriptional regulatory domain whose activity depends upon interaction of the BFD with a second molecule.
- the second molecule can be any type of molecule capable of influencing the functional properties of the BFD including, but not limited to, a compound, a small molecule, a peptide, a protein, a polysaccharide or a nucleic acid.
- An exemplary BFD is the ligand binding domain of the estrogen receptor (ER).
- the ER ligand binding domain acts as a transcriptional activator; while, in the absence of estradiol and the presence of tamoxifen or 4-hydroxy-tamoxifen, it acts as a transcriptional repressor.
- Another example of a BFD is the thyroid hormone receptor (TR) ligand binding domain which, in the absence of ligand, acts as a transcriptional repressor and in the presence of thyroid hormone (T3), acts as a transcriptional activator.
- TR thyroid hormone receptor
- An additional BFD is the glucocorticoid receptor (GR) ligand binding domain. In the presence of dexamethasone, this domain acts as a transcriptional activator; while, in the presence of RU486, it acts as a transcriptional repressor.
- An additional exemplary BFD is the ligand binding domain of the retinoic acid receptor. In the presence of its ligand all- trans- retinoic acid, the retinoic acid receptor recruits a number of co-activator complexes and activates transcription. In the absence of ligand, the retinoic acid receptor is not capable of recruiting transcriptional co-activators. Additional BFDs are known to those of skill in the art. See, for example, US Patent Nos.
- Another class of functional domains derived from nuclear receptors, are those whose functional activity is regulated by a non-natural ligand. These are often mutants or modified versions of naturally-occurring receptors and are sometimes referred to as "switchable" domains. For example, certain mutants of the progesterone receptor (PR) are unable to interact with their natural ligand, and are therefore incapable of being transcriptionally activated by progesterone. Certain of these mutants, however, can be activated by binding small molecules other than progesterone (one example of which is the antiprogestin mifepristone).
- PR progesterone receptor
- a fusion comprising a targeted engineered meganuclease DNA- binding domain, a functional domain, and a mutant PR ligand binding domain of this type can be used for mifepristone-dependent activation or repression of an endogenous gene of choice, by designing the engineered meganuclease DNA-binding domain such that it binds in or near the gene of choice. If the fusion contains an activation domain, mifepristone- dependent activation of gene expression is obtained; if the fusion contains a repression domain, mifepristone- dependent repression of gene expression is obtained.
- polynucleotides encoding such fusion proteins are provided, as are vectors comprising such polynucleotides and cells comprising such polynucleotides and vectors. It will be clear to those of skill in the art that modified or mutant versions of receptors other than PR can also be used as switchable domains. See, for example, Tora et al. (1989) EMBOJ. 8: 1981-1986.
- the nucleic acid encoding the targeted transcriptional effector of choice is typically cloned into intermediate vectors for transformation into prokaryotic or eukaryotic cells for replication and/or expression, e.g., for determination of IQ.
- Intermediate vectors are typically prokaryote vectors, e.g., plasmids, or shuttle vectors, or insect vectors, for storage or manipulation of the nucleic acid encoding engineered meganuclease DNA-binding domain or production of protein.
- the nucleic acid encoding a engineered meganuclease DNA-binding domain is also typically cloned into an expression vector, for administration to a plant cell, animal cell ⁇ e.g., a human or other mammalian cell), fungal cell, bacterial cell, or protozoal cell.
- a engineered meganuclease DNA-binding domain is typically subcloned into an expression vector that contains a promoter to direct transcription.
- Suitable bacterial and eukaryotic promoters are well known in the art and described, e.g., in Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, Gene Trtisfei-and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular Biology (Ausubel et al., eds. , 1994).
- Bacterial expression systems for expressing the ZFP are available in, e.g., E. coli, Bacillus sp., and Salmonella (Palva et al., Gene. 22: 229-235 (1983)). Kits for such expression systems are commercially available.
- Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well known in the art and are also commercially available.
- the promoter used to direct expression of a targeted transcriptional effector nucleic acid depends on the particular application. For example, a strong constitutive promoter can be used for expression and purification of targeted transcriptional effector. In contrast, when a targeted transcriptional effector is administered in vivo for gene regulation, either a constitutive or an inducible promoter can be used, depending on the particular use of the targeted transcriptional effector.
- a promoter for administration of a targeted transcriptional effector can be a weak promoter, such as HSV TK, or a promoter having similar activity.
- the promoter also can include elements that are responsive to transactivation, e.g., hypoxia response elements, Gal4 response elements, lac repressor response element, and small molecule control systems such as tet-regulated systems and the RU-486 system (see, e.g., Gossen & Bujard, PNAS. 89: 5547 (1992); Oligino et al., Gene Ther. 5: 491-496 (1998); Wang et al., Gene Ther. 4: 432-441 (1997); Neering et al., Blood. 88: 1147-1155 (1996); and Rendahl et al., Nat. Biotechnol. 16: 757-761 (1998)).
- elements that are responsive to transactivation e.g., hypoxia response elements, Gal4 response elements, lac repressor response element, and small molecule control systems such as tet-regulated systems and the RU-486 system (see, e.g., Gossen & Bujard, P
- the expression vector can contain a transcription unit or expression cassette that contains all the additional elements required for the expression of the nucleic acid in host cells, either prokaryotic or eukaryotic.
- An expression cassette can contain a promoter operably linked, e.g., to the nucleic acid sequence encoding the targeted transcriptional effector, and signals required, e.g., for efficient polyadenylation of the transcript, transcriptional termination, ribosome binding sites, or translation termination.
- Additional elements of the cassette may include, e.g., enhancers, and heterologous spliced intronic signals.
- the particular expression vector used to transport the genetic information into the cell is selected with regard to the intended use of the targeted transcriptional effector, e.g., expression in plants, animals, bacteria, fungus, protozoa etc.
- Standard bacterial expression vectors include plasmids such as pBR322 based plasmids, pSKF, pET23D, and commercially available fusion expression systems such as GST and LacZ.
- a common fusion protein is the maltose binding protein, "MBP. "Such fusion proteins are used for purification of the targeted transcriptional effector.
- Epitope tags can also be added to recombinant proteins to provide convenient methods of isolation, for monitoring expression, and for monitoring cellular and subcellular localization, e.g., c-myc or FLAG.
- Expression vectors containing regulatory elements from eukaryotic viruses are often used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors, and vectors derived from Epstein-Barr virus.
- exemplary eukaryotic vectors include pMSG, pAV009/A+, pMT010/A+, pMAMneo-5, baculovirus pDSVE, and any other vector allowing expression of proteins under the direction of the SV40 early promoter, SV40 late promoter, metallothionein promoter, murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown effective for expression in eukaryotic cells.
- Some expression systems have markers for selection of stably transfected cell lines such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase.
- High yield expression systems are also suitable, such as using a baculovirus vector in insect cells, with a targeted transcriptional effector encoding sequence under the direction of the polyhedrin promoter or other strong baculovirus promoters.
- the elements that are typically included in expression vectors also include a replicon that functions in E. coli, a gene encoding antibiotic resistance to permit selection of bacteria that harbor recombinant plasmids, and unique restriction sites in nonessential regions of the plasmid to allow insertion of recombinant sequences.
- Standard transfection methods are used to produce bacterial, mammalian, yeast or insect cell lines that express large quantities of protein, which are then purified using standard techniques (see, e.g., Colley et al., J. Biol. Chem. 264: 17619-17622 (1989); Guide to Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, ed., 1990) ). Transformation of eukaryotic and prokaryotic cells are performed according to standard techniques (see, e.g. , Morrison, J. Bad. 132: 349-351 (1977); Clark-Curtiss & Curtiss, Methods in Enzymology 101 : 347-362 (Wu et al., eds, 1983).
- Any of the well known procedures for introducing foreign nucleotide sequences into host cells may be used. These include the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection, naked DNA, plasmid vectors, viral vectors, both episomal and integrative, and any of the other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook et al., supra). It is only necessary that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into the host cell capable of expressing the protein of choice. 12. Assays for determining regulation of gene expression
- a variety of assays can be used to determine the level of gene expression regulation by targeted transcriptional effectors.
- the activity of a particular targeted transcriptional effector can be assessed using a variety of ill vitro and in vivo assays, by measuring, e.g., protein or mRNA levels, product levels, enzyme activity, tumor growth; transcriptional activation or repression of a reporter gene; second messenger levels ⁇ e.g.
- cytokine and hormone production levels e.g., IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12, IL-12
- Targeted transcriptional effectors can be tested for activity in vitro using cultured cells, e.g., HEK 293 cells, CHO cells, VERO cells, BHK cells, HeLa cells, COS cells, and the like.
- the targeted transcriptional effectors is often first tested using a transient expression system with a reporter gene, and then regulation of the target endogenous gene is tested in cells and in animals, both in vivo and ex vivo.
- the targeted transcriptional effector can be recombinantly expressed in a cell, recombinantly expressed in cells transplanted into an animal, or recombinantly expressed in a transgenic animal, as well as administered as a protein to an animal or cell using delivery vehicles described below.
- the cells can be immobilized, be in solution, be injected into an animal, or be naturally occurring in a transgenic or non-transgenic animal.
- Modulation of gene expression is tested using one of the in vitro or in vivo assays described herein. Samples or assays are treated with a targeted transcriptional effector and compared to control samples without the test compound, to examine the extent of modulation. As described above, for regulation of endogenous gene expression, the targeted transcriptional effector typically has a K d of 200 nM or less, or 100 nM or less, or 50 nM or less, or 25 nM or less.
- the effects of the targeted transcriptional effectors can be measured by examining any of the parameters described above. Any suitable gene expression, phenotypic, or physiological change can be used to assess the influence of a targeted transcriptional effector.
- Any suitable gene expression, phenotypic, or physiological change can be used to assess the influence of a targeted transcriptional effector.
- the functional consequences are determined using intact cells or animals, one can also measure a variety of effects such as tumor growth, neovascularization, hormone release, transcriptional changes to both known and uncharacterized genetic markers ⁇ e.g., northern blots or oligonucleotide array studies), changes in cell metabolism such as cell growth or pH changes, and changes in intracellular second messengers such as cGMP.
- Assays for targeted transcriptional effector regulation of endogenous gene expression can be performed in vitro.
- targeted transcriptional effector regulation of endogenous gene expression in cultured cells is measured by examining protein production using an ELISA assay.
- the test sample is compared to control cells treated with an empty vector or an unrelated targeted transcriptional effector that is targeted to another gene.
- targeted transcriptional effector regulation of endogenous gene expression is determined in vitro by measuring the level of target gene mRNA expression.
- the level of gene expression is measured using amplification, e.g., using PCR, LCR, or hybridization assays, e.g., northern hybridization, RNase protection, dot blotting. RNase protection is used in one embodiment.
- the level of protein or mRNA is detected using directly or indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or enzymatically labeled antibodies, and the like, as described herein.
- a reporter gene system can be devised using the target gene promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, CAT, or p-gal.
- the reporter construct is typically co-transfected into a cultured cell.
- the amount of reporter gene transcription, translation, or activity is measured according to standard techniques known to those of skill in the art.
- an assay format useful for monitoring targeted transcriptional effector regulation of endogenous gene expression is performed in vivo.
- This assay is particularly useful for examining targeted transcriptional effectors that inhibit expression of tumor promoting genes, genes involved in tumor support, such as neovascularization ⁇ e.g., VEGF), or that activate tumor suppressor genes such as p53.
- cultured tumor cells expressing the targeted transcriptional effector of choice are injected subcutaneous Iy into an immune compromised mouse such as an athymic mouse, an irradiated mouse, or a SCID mouse.
- tumor growth is measured, e.g., by volume or by its two largest dimensions, and compared to the control. Tumors that have statistically significant reduction (using, e.g., Student's T test) are said to have inhibited growth. Alternatively, the extent of tumor neovascularization can also be measured. Immunoassays using endothelial cell specific antibodies are used to stain for vascularization of the tumor and the number of vessels in the tumor. Tumors that have a statistically significant reduction in the number of vessels (using, e.g., Student's T test) are said to have inhibited neovascularization.
- Transgenic and non-transgenic animals are also used in some embodiments for examining regulation of endogenous gene expression in vivo.
- Transgenic animals typically express the targeted transcriptional effector of choice.
- animals that transiently express the ZFP of choice, or to which the targeted transcriptional effector has been administered in a delivery vehicle can be used. Regulation of endogenous gene expression is tested using any one of the assays described herein.
- Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome.
- Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell.
- Methods of non- viral delivery of nucleic acids encoding targeted transcriptional effectors include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid: nucleic acid conjugates, naked DNA, artificial virions, and agent- enhanced uptake of DNA.
- Lipofection is described in e.g., U.S. Patent Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially ⁇ e.g., TransfectamTM and Lipofectin).
- Cationic and neutral lipids that are suitable for efficient receptor- recognition lipofection of polynucleotides include those of Feigner, WO 91/17424 and WO 91/16024. Delivery can be to cells (ex vivo administration) or target tissues (in vivo administration).
- lipid nucleic acid complexes, including targeted liposomes such as immunolipid complexes
- the preparation of lipid: nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science. 270: 404-410 (1995); Blaese et al, Cancer Gene Ther. 2: 291-297 (1995); Behr et al, Bioconjugate Chem. 5: 382-389 (1994); Remy et al, Bioconjugate Chem. 5: 647-654 (1994); Gao et al, Gene Therapy. 2: 710-722 (1995); Ahmad et al, Cancer Res. 52: 4817- 4820 (1992); U. S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).
- RNA or DNA viral based systems for the delivery of nucleic acids encoding a targeted transcriptional effector take advantage of highly evolved processes for targeting a virus to specific cells in the body and trafficking the viral payload to the nucleus.
- Viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro and the modified cells are administered to patients (ex vivo).
- Conventional viral based systems for the delivery of targeted transcriptional effectors could include retroviral, lentivirus, adenoviral, adeno- associated and herpes simplex virus vectors for gene transfer. Viral vectors are currently the most efficient and versatile method of gene transfer in target cells and tissues.
- Lentiviral vectors are retroviral vector that are able, to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis- acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression.
- Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof (see, e.g., Buchscheretal., J. Virol 66: 2731-2739 (1992); Johann et al., J. Virol 66: 1635-1640 (1992); Sommerfelt et al, Virol 176: 58-59 (1990); Wilson et al., J. Virol 63: 2374-2378 (1989); Miller et al., J. Virol 65: 2220-2224 (1991); PCT/US94/05700).
- MiLV murine leukemia virus
- GaLV gibbon ape leukemia virus
- SIV Simian Immuno deficiency virus
- HAV human immuno deficiency virus
- Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.
- Adeno-associated virus (“AAV”) vectors are also used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al, Virology. 160: 38-47 (1987); U. S. Patent No.
- At least six viral vector approaches are currently available for gene transfer in clinical trials, with retroviral vectors by far the most frequently used system.
- All of these viral vectors utilize approaches that involve complementation of defective vectors by genes inserted into helper cell lines to generate the transducing agent.
- pLASN and MFG-S are examples are retroviral vectors that have been used in clinical trials (Dunbar et al., Blood. 85: 3048-305 (1995); Kohn et al., Nat. Med. 1 : 1017- 102 (1995); Malech et al, PNAS. 94 : 22 12133-12138 (1997)).
- PA317/pLASN was the first therapeutic vector used in a gene therapy trial.
- rAAV Recombinant adeno-associated virus vectors
- All vectors are derived from a plasmid that retains only the AAV 145 bp inverted terminal repeats flanking the transgene expression cassette. Efficient gene transfer and stable transgene delivery due to integration into the genomes of the transduced cell are key features for this vector system.
- Ad vectors Replication-deficient recombinant adenoviral vectors (Ad) are predominantly used for colon cancer gene therapy, because they can be produced at high titer and they readily infect a number of different cell types. Most adenovirus vectors are engineered such that a transgene replaces the Ad EIa, EIb, and E3 genes; subsequently the replication defector vector is propagated in human 293 cells that supply deleted gene function in trans. Ad vectors can transduce multiply types of tissues in vivo, including nondividing, differentiated cells such as those found in the liver, kidney and muscle system tissues.
- Ad vectors have a large carrying capacity.
- An example of the use of an Ad vector in a clinical trial involved polynucleotide therapy for antitumor immunization with intramuscular injection (Sterman et al, Hum. Gene Ther. 7: 1083-9 (1998)).
- Additional examples of the use of adenovirus vectors for gene transfer in clinical trials include Rosenecker et al, Infection. 24: 1 5-10 (1996); Sterman et al, Hum. Gene Ther. 9: 7 1083- 1089 (1998); Welsh et al, Hum. Gene Ther. 2: 205-18 (1995); Alvarez et al, Hum. Gene Ther. 5: 597-613 (1997); Topf et al, Gene Ther. 5: 507-513 (1998); Sterman et al, Hum. Gene Ther. 7: 1083-1089 (1998).
- Packaging cells are used to form virus particles that are capable of infecting a host cell. Such cells include HEK 293 cells, which package adenovirus, and W2 cells or PA317 cells, which package retrovirus.
- Viral vectors used in gene therapy are usually generated by producer cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the protein to be expressed. The missing viral functions are supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line is also infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV.
- the gene therapy vector be delivered with a high degree of specificity to a particular tissue type.
- a viral vector is typically modified to have specificity for a given cell type by expressing a ligand as a fusion protein with a viral coat protein on the viruses outer surface.
- the ligand is chosen to have affinity for a receptor known to be present on the cell type of interest.
- Han et al., PNAS 92: 9747-9751 (1995) reported that Moloney murine leukemia virus can be modified to express human heregulin fused to gp70, and the recombinant virus infects certain human breast cancer cells expressing human epidermal growth factor receptor.
- filamentous phage can be engineered to display antibody fragments ⁇ e.g., FAB or Fv) having specific binding affinity for virtually any chosen cellular receptor.
- FAB fragment-binding protein
- Fv protein-binding protein
- Gene therapy vectors can be delivered in vivo by administration to an individual patient, typically by systemic administration ⁇ e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described below.
- vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient ⁇ e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
- cells ex vivo such as cells explanted from an individual patient ⁇ e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
- Ex vivo cell transfection for diagnostics, research, or for gene therapy ⁇ e.g., via re- infusion of the transfected cells into the host organism) is well known to those of skill in the art.
- cells are isolated from the subject organism, transfected with a targeted transcriptional effector nucleic acid (gene or cDNA), and re-infused back into the subject organism (such as a patient).
- a targeted transcriptional effector nucleic acid gene or cDNA
- Various cell types suitable for ex vivo transfection are well known to those of skill in the art (see, e.g., Freshney et al., Culture of Animal Cells, A Manual of Basic Sechnique (3rd ed. 1994)) and the references cited therein for a discussion of how to isolate and culture cells from patients).
- stem cells are used in ex vivo procedures for cell transfection and gene therapy.
- the advantage to using stem cells is that they can be differentiated into other cell types in vitro, or can be introduced into a mammal (such as the donor of the cells) where they will engraft in the bone marrow.
- Methods for differentiating CD34+ cells in vitro into clinically important immune cell types using cytokines such a GM-CSF, IFN-y and TNF- a are known (see Inaba et al, J. Exp. Med. 176: 1693-1702 (1992)).
- cytokines such as GM-CSF, IFN-y and TNF- a are known (see Inaba et al, J. Exp. Med. 176: 1693-1702 (1992)).
- Stem cells are isolated for transduction and differentiation using known methods.
- stem cells are isolated from bone marrow cells by panning the bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and CD8+ (T cells), CD45+ (panB cells), GR-I (granulocytes), and lad (differentiated antigen presenting cells) (see Inaba et al, J. Exp. Med. 176: 1693-1702 (1992)).
- unwanted cells such as CD4+ and CD8+ (T cells), CD45+ (panB cells), GR-I (granulocytes), and lad (differentiated antigen presenting cells) (see Inaba et al, J. Exp. Med. 176: 1693-1702 (1992)).
- Vectors ⁇ e.g., retroviruses, adenoviruses, liposomes, etc.
- therapeutic targeted transcriptional effector nucleic acids can be also administered directly to the organism for transduction of cells in vivo.
- naked DNA can be administered.
- Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells. Suitable methods of administering such nucleic acids are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- compositions are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions available, as described below (see, e.g., Remington's Pharmaceutical Sciences, 17th ed., 1989).
- polypeptide compounds such as the targeted transcriptional effectors
- the polypeptide has the ability to traverse the plasma membrane of a cell, or the membrane of an intra-cellular compartment such as the nucleus.
- Cellular membranes are composed of lipid-protein bilayers that are freely permeable to small, nonionic lipophilic compounds and are inherently impermeable to polar compounds, macromolecules, and therapeutic or diagnostic agents.
- proteins and other compounds such as liposomes have been described, which have the ability to translocate polypeptides such as targeted transcriptional effectors across a cell membrane.
- membrane translocation polypeptides have amphiphilic or hydrophobic amino acid subsequences that have the ability to act as membrane-translocating carriers.
- homeodomain proteins have the ability to translocate across cell membranes.
- the shortest intemalizable peptide of a homeodomain protein, Antennapedia was found to be the third helix of the protein, from amino acid position 43 to 58 (see, e.g., Prochiantz, Current Opinion in Neurobiology 6: 629-634 (1996)).
- peptide sequences which can be linked to a protein, for facilitating uptake of the protein into cells include, but are not limited to: an 11 amino acid peptide of the tat protein of HIV; a 20 residue peptide sequence which corresponds to amino acids 84- 103 of the pl6 protein (see Fahraeus et al, Current Biology.
- Toxin molecules also have the ability to transport polypeptides across cell membranes.
- such molecules are composed of at least two parts (called “binary toxins”): a translocation or binding domain or polypeptide and a separate toxin domain or polypeptide.
- the translocation domain or polypeptide binds to a cellular receptor, and then the toxin is transported into the cell.
- Clostridium perfrisagens iota toxin diphtheria toxin (DT), Pseudomonas exotoxin A (PE), pertussis toxin (PT), Bacillus aitthracis toxin, and pertussis adenylate cyclase (CYA)
- DT diphtheria toxin
- PE Pseudomonas exotoxin A
- PT pertussis toxin
- CYA pertussis adenylate cyclase
- Amino acid sequences which facilitate internalization of linked polypeptides into cells can be selected from libraries of randomized peptide sequences. See, for example, Yeh et al (2003) Molecular Therapy. 7 (5): S461 (Abstract #1191). Such "internalization peptides" can be fused to a targeted transcriptional effector to facilitate entry of the protein into a cell.
- Such subsequences can be used to translocate targeted transcriptional effectors across a cell membrane.
- ZFPs can be conveniently fused to or derivatized with such sequences.
- the translocation sequence is provided as part of a fusion protein.
- a linker can be used to link the targeted transcriptional effector and the translocation sequence. Any suitable linker can be used, e.g., a peptide linker.
- the targeted transcriptional effector can also be introduced into an animal cell (e.g., a mammalian cell) via a liposomes and liposome derivatives such as immuno liposomes.
- liposome refers to vesicles comprised of one or more concentrically ordered lipid bilayers, which encapsulate an aqueous phase.
- the aqueous phase typically contains the compound to be delivered to the cell, i.e., a targeted transcriptional effector.
- the liposome fuses with the plasma membrane, thereby releasing the drug into the cytosol.
- the liposome is phagocytosed or taken up by the cell in a transport vesicle. Once in the endosome or phagosome, the liposome either degrades or fuses with the membrane of the transport vesicle and releases its contents.
- the liposome In current methods of drug delivery via liposomes, the liposome ultimately becomes permeable and releases the encapsulated compound (in this case, a targeted transcriptional effector) at the target tissue or cell. For systemic or tissue specific delivery, this can be accomplished, for example, in a passive manner wherein the liposome bilayer degrades over time through the action of various agents in the body. Alternatively, active drug release involves using an agent to induce a permeability change in the liposome vesicle. [0289] Liposome membranes can be constructed so that they become destabilized when the environment becomes acidic near the liposome membrane (see, e.g., PNAS. 84: 7851 (1987); Biochemistry. 28: 908 (1989)).
- DOPE Dioleoylphosphatidylethanolamine
- Such liposomes typically comprise a targeted transcriptional effector and a lipid component, e.g., a neutral and/or cationic lipid, optionally including a receptor-recognition molecule such as an antibody that binds to a predetermined cell surface receptor or ligand ⁇ e.g., an antigen).
- a lipid component e.g., a neutral and/or cationic lipid, optionally including a receptor-recognition molecule such as an antibody that binds to a predetermined cell surface receptor or ligand ⁇ e.g., an antigen).
- Suitable methods include, for example, sonication, extrusion, high pressure/homogenization, microfluidization, detergent dialysis, calcium- induced fusion of small liposome vesicles and ether- fusion methods, all of which are well known in the art.
- targeting moieties that are specific to a particular cell type, tissue, and the like. Targeting of liposomes using a variety of targeting moieties ⁇ e.g., ligands, receptors, and monoclonal antibodies) has been previously described (see, e.g., U.S. Patent Nos. 4,957,773 and 4,603,044).
- targeting moieties include monoclonal antibodies specific to antigens associated with neoplasms, such as prostate cancer specific antigen and MAGE.
- Tumors can also be diagnosed by detecting gene products resulting from the activation or over- expression of oncogenes, such as ras or c-erbB2.
- oncogenes such as ras or c-erbB2.
- many tumors express antigens normally expressed by fetal tissue, such as the alphafetoprotein (AFP) and carcinoembryonic antigen (CEA).
- Sites of viral infection can be diagnosed using various viral antigens such as hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human immunodeficiency type-1 virus (HIVl) and papilloma virus antigens.
- Inflammation can be detected using molecules specifically recognized by surface molecules which are expressed at sites of inflammation such as integrins ⁇ e.g., VCAM-I), selectin receptors ⁇ e.g., ELAM-I) and the like.
- Standard methods for coupling targeting agents to liposomes can be used. These methods generally involve incorporation into liposomes lipid components, e.g., phosphatidylethanolamine, which can be activated for attachment of targeting agents, or derivatized lipophilic compounds, such as lipid derivatized bleomycin.
- lipid components e.g., phosphatidylethanolamine
- Antibody targeted liposomes can be constructed using, for instance, liposomes which incorporate protein A (see Renneisen et al, J. Biol. Chem. 265: 16337-16342 (1990) and Leonetti et al, PNAS. 87: 2448-2451 (1990).
- the dose administered to a patient should be sufficient to effect a beneficial therapeutic response in the patient over time.
- particular dosage regimens can be useful for determining phenotypic changes in an experimental setting, e.g., in functional genomics studies, and in cell or animal models.
- the dose will be determined by the efficacy and IQ of the particular engineered DNS-binding domain employed, the nuclear volume of the target cell, and the condition of the patient, as well as the body weight or surface area of the patient to be treated.
- the size of the dose also will be determined by the existence, nature, and extent of any adverse side-effects that accompany the administration of a particular compound or vector in a particular patient.
- the maximum therapeutically effective dosage of targeted transcriptional effector for approximately 99% binding to target sites is calculated to be in the range of less than about 1. 5x10 5 to 1. 5x10 6 copies of the specific targeted transcriptional effector molecule per cell.
- the number of targeted transcriptional effector s per cell for this level of binding is calculated as follows, using the volume of a HeLa cell nucleus (approximately 1000 ⁇ m 3 or 10 "12 L; Cell Biology, (Altman & Katz, eds. (1976)). As the HeLa nucleus is relatively large, this dosage number is recalculated as needed using the volume of the target cell nucleus. This calculation also does not take into account competition for targeted transcriptional effector binding by other sites.
- the appropriate dose of an expression vector encoding a targeted transcriptional effector can also be calculated by taking into account the average rate of targeted transcriptional effector expression from the promoter and the average rate of targeted transcriptional effector degradation in the cell.
- a weak promoter such as a wild-type or mutant HSV TK can be used, as described above.
- the dose of targeted transcriptional effector in micrograms is calculated by taking into account the molecular weight of the particular targeted transcriptional effector being employed.
- the physician evaluates circulating plasma levels of the targeted transcriptional effector or nucleic acid encoding the targeted transcriptional effector, potential targeted transcriptional effector toxicities, progression of the disease, and the production of anti-targeted transcriptional effector antibodies. Administration can be accomplished via single or divided doses.
- Targeted transcriptional effector s and expression vectors encoding targeted transcriptional effectors can be administered directly to the patient for modulation of gene expression and for therapeutic or prophylactic applications, for example, cancer, ischemia, diabetic retinopathy, macular degeneration, rheumatoid arthritis, psoriasis, HIV infection, sickle cell anemia, Alzheimer's disease, muscular dystrophy, neurodegenerative diseases, vascular disease, cystic fibrosis, stroke, and the like.
- pathogenic bacteria e.g.,
- viral diseases e.g., hepatitis (A, B, or C), herpes virus (e.g., VZV, HSV-I, HSV-6, HSV-11, CMV, and EBV), HIV, Ebola, adenovirus, influenza virus, flaviviruses, echovirus, rhinovirus, coxsackie virus, cornovirus, respiratory syncytial virus, mumps virus, rotavirus, measles virus, rubella virus, parvovirus, vaccinia virus, HTLV virus, dengue virus, papillomavirus, poliovirus, rabies virus, and arboviral encephalitis virus, etc.
- herpes virus e.g., VZV, HSV-I, HSV-6, HSV-11, CMV, and EBV
- HIV Ebola
- adenovirus e.g., influenza virus, flaviviruses, echovirus, rhinovirus, coxsackie virus, cornovirus, respiratory s
- Administration of therapeutically effective amounts is by any of the routes normally used for introducing targeted transcriptional effector into ultimate contact with the tissue to be treated.
- the targeted transcriptional effectors are administered in any suitable manner, optionally with pharmaceutically acceptable carriers. Suitable methods of administering such modulators are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions that are available (see, e.g., Remiyakon's Pharfncaceutical Sciences, 1 7th ed. 1985)).
- Aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.
- Formulations suitable for parenteral administration include aqueous and nonaqueous, isotonic sterile injection solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives.
- the disclosed compositions can be administered, for example, by intravenous infusion, orally, topically, intraperitoneally, intravesically or intrathecally .
- the formulations of compounds can be presented in unit-dose or multi-dose sealed containers, such as ampules and vials. Injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind previously described.
- Regulation of gene expression in plants targeted transcriptional effectors can be used to engineer plants for traits such as increased disease resistance, modification of structural and storage polysaccharides, flavors, proteins, and fatty acids, fruit ripening, yield, color, nutritional characteristics, improved storage capability, and the like.
- traits such as increased disease resistance, modification of structural and storage polysaccharides, flavors, proteins, and fatty acids, fruit ripening, yield, color, nutritional characteristics, improved storage capability, and the like.
- the engineering of crop species for enhanced oil production e.g., the modification of the fatty acids produced in oilseeds, is of interest.
- Seed oils are composed primarily of triacylglycerols (TAGs), which are glycerol esters of fatty acids. Commercial production of these vegetable oils is accounted for primarily by six major oil crops (soybean, oil palm, rapeseed, sunflower, cotton seed, and peanut. ) Vegetable oils are used predominantly (90%) for human consumption as margarine, shortening, salad oils, and frying oil. The remaining 10% is used for non-food applications such as lubricants, oleochemicals, biofuels, detergents, and other industrial applications.
- the desired characteristics of the oil used in each of these applications varies widely, particularly in terms of the chain length and number of double bonds present in the fatty acids making up the TAGs. These properties are manipulated by the plant in order to control membrane fluidity and temperature sensitivity. The same properties can be controlled using targeted transcriptional effectors to produce oils with improved characteristics for food and industrial uses.
- the primary fatty acids in the TAGs of oilseed crops are 16 to 18 carbons in length and contain 0 to 3 double bonds. Palmitic acid (16: 0 [16 carbons: 0 double bonds] ), oleic acid (18: 1), linoleic acid (18: 2), and linolenic acid (18: 3) predominate. The number of double bonds, or degree of saturation, determines the melting temperature, reactivity, cooking performance, and health attributes of the resulting oil.
- A12-oleate desaturase also referred to as omega-6 desaturase.
- a block at this step in the fatty acid desaturation pathway should result in the accumulation of oleic acid at the expense of polyunsaturates.
- targeted transcriptional effectors are used to regulate expression of the FAD2-1 gene in soybeans. Two genes encoding microsomal A6 desaturases have been cloned recently from soybean, and are referred to as FAD2-1 and FAD2-2 (Heppard et al., Plant Physiol. 110: 311-319 (1996) ).
- FAD2-1 (delta 12 desaturase) appears to control the bulk of oleic acid desaturation in the soybean seed.
- Targeted transcriptional effectors can thus be used to modulate gene expression of FAD2-1 in plants.
- targeted transcriptional effectors can be used to inhibit expression of the FAD2-1 gene in soybean in order to increase the accumulation of oleic acid (18: 1) in the oil seed.
- targeted transcriptional effectors can be used to modulate expression of any other plant gene, such as delta-9 desaturase, delta- 12 desaturases from other plants, delta- 15 desaturase, acetyl- CoA carboxylase, acyl-ACP-thioesterase, ADP-glucose pyrophosphorylase, starch synthase, cellulose synthase, sucrose synthase, senescence-associated genes, heavy metal chelators, fatty acid hydroperoxide lyase, polygalacturonase, EPSP synthase, plant viral genes, plant fungal pathogen genes, and plant bacterial pathogen genes.
- delta-9 desaturase delta- 12 desaturases from other plants
- delta- 15 desaturase acetyl- CoA carboxylase
- acyl-ACP-thioesterase ADP-glucose pyrophosphorylase
- starch synthase cellulose synthase
- sucrose synthase sucrose
- Recombinant DNA vectors suitable for transformation of plant cells are also used to deliver protein (e.g. , targeted transcriptional effector) -encoding nucleic acids to plant cells.
- protein e.g. , targeted transcriptional effector
- Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature (see, e.g., Weising et al. Ann. Rev. Genet. 22: 421-477 (1988) ).
- a DNA sequence coding for the desired targeted transcriptional effectors is combined with transcriptional and translational initiation regulatory sequences which will direct the transcription of the targeted transcriptional effectors in the intended tissues of the transformed plant.
- a plant promoter fragment may be employed which will direct expression of the targeted transcriptional effectors in all tissues of a regenerated plant.
- Such promoters are referred to herein a "constitutive" promoters and are active under most environmental conditions and states of development or cell differentiation.
- constitutive promoters include the cauliflower mosaic virus (CaMV) 35 S transcription initiation region, the 1'- or 2'-promoter derived from T-DNA of Agrobacteriuna turnafaciefas, and other transcription initiation regions from various plant genes known to those of skill.
- the plant promoter may direct expression of the targeted transcriptional effectors in a specific tissue or may be otherwise under more precise environmental or developmental control.
- inducible promoters are referred to here as "inducible" promoters.
- environmental conditions that may effect transcription by inducible promoters include anaerobic conditions or the presence of light.
- promoters under developmental control include promoters that initiate transcription only in certain tissues, such as fruit, seeds, or flowers.
- a polygalacturonase promoter can direct expression of the targeted transcriptional effectors in the fruit
- a CHS-A (chalcone synthase A from petunia) promoter can direct expression of the ZFP in flower of a plant.
- the vector comprising a targeted transcriptional effector sequence will typically comprise a marker gene which confers a selectable phenotype on plant cells.
- the marker may encode biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosuforon or Basta.
- DNA constructs may be introduced into the genome of the desired plant host by a variety of conventional techniques.
- the DNA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be introduced directly to plant tissue using biolistic methods, such as DNA particle bombardment.
- biolistic methods such as DNA particle bombardment.
- DNA constructs may be combined with suitable T- DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector.
- Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA when the cell is infected by the bacteria.
- Electroporation techniques are described in Fromm et al. PNAS. 82: 5824 (1985).
- Transformed plant cells which are derived by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed genotype and thus the desired targeted transcriptional effector-controlled phenotype.
- Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the ZFP nucleotide sequences.
- Plant regeneration from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176 (1983); and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73 (1985). Regeneration can also be obtained from plant callus, explants, organs, or parts thereof.
- Such regeneration techniques are described generally in Klee et al. Ann. Rev. of Plant Plays. 38: 467-486 (1987).
- Engineered targeted transcriptional effectors can be readily used to up or down-regulate any endogenous target gene. Very little sequence information is required to create a gene-specific DNA binding domain. This makes the targeted transcriptional effectors technology ideal for analysis of long lists of poorly characterized differentially expressed genes.
- a further application of the targeted transcriptional effector technology is manipulating gene expression in transgenic animals.
- over-expression of an endogenous gene or the introduction of a heterologous gene to a transgenic animal, such as a transgenic mouse is a fairly straightforward process.
- the targeted transcriptional effector technology is an improvement in these types of methods because one can circumvent the need for generating full-length cDNA clones of the gene under study.
- conventional down-regulation of gene expression in transgenic animals is plagued by technical difficulties. Gene knockout by homologous recombination is the method most commonly applied currently. This method requires a relatively long genomic clone of the gene to be knocked out (ca. 10 kb).
- a selectable marker is inserted into an exon of the gene of interest to effect the gene disruption, and a second counter-selectable marker provided outside of the region of homology to select homologous versus non-homologous recombinants.
- This construct is transfected into embryonic stem cells and recombinants selected in culture.
- Recombinant stem cells are combined with very early stage embryos generating chimeric animals. If the chimerism extends to the germline homozygous knockout animals can be isolated by back-crossing. When the technology is successfully applied, knockout animals can be generated in approximately one year. Unfortunately two common issues often prevent the successful application of the knockout technology; embryonic lethality and developmental compensation.
- Embryonic lethality results when the gene to be knocked out plays an essential role in development. This can manifest itself as a lack of chimerism, lack of germline transmission or the inability to generate homozygous back crosses. Genes can play significantly different physiological roles during development versus in adult animals. Therefore, embryonic lethality is not considered a rationale for dismissing a gene target as a useful target for therapeutic intervention in adults.
- Embryonic lethality most often simply means that the gene of interest can not be easily studied in mouse models, using conventional methods.
- the recognition elements are inserted into a gene of interest using homologous recombination (as described above) and the expression of the recombinase induced in adult mice post-development. This causes the deletion of a portion of the target gene and avoids developmental complications.
- the method is labor intensive and suffers form chimerism due to non-uniform induction of the recombinase.
- targeted transcriptional effectors to manipulate gene expression can be restricted to adult animals using the small molecule regulated systems described in the previous section. Expression and/or function of a zinc finger-based repressor can be switched off during development and switched on at will in the adult animals. This approach relies on the addition of the targeted transcriptional effectors expressing module only; homologous recombination is not required. Because the targeted transcriptional effectors repressors are trans dominant, there is no concern about germline transmission or homozygosity. These issues dramatically affect the time and labor required to go from a poorly characterized gene candidate (a cDNA or EST clone) to a mouse model.
- a poorly characterized gene candidate a cDNA or EST clone
- Chimeric targeted mice can be derived according to Hogan et al., Manipulating the Mouse Embryo : A Laboratory Manual, (1988); Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, Robertson, ed., Oxford University Press (1987); and Capecchi et al, Science. 244: 1288 (1989).
- TAT Gene In accordance with Table 1, two meganucleases, TATl and TAT2, were designed to bind the half-sites 5'-GAAGAGCTC-3' (SEQ ID NO: 16) and 5'-TGACTGTTC-
- the two enzymes were cloned, expressed in E. coli, and assayed for enzyme activity against the corresponding DNA recognition sequence as described below. In both cases, the rationally-designed meganucleases were found to be inactive. A second generation of each was then produced in which E80 was mutated to Q to improve contacts with the DNA backbone. The second generation TAT2 enzyme was found to be active against its intended recognition sequence while the second generation TATl enzyme remained inactive. Visual inspection of the wild-type I-Crel co-crystal structure suggested that TATl was inactive due to a steric clash between R40 and K28.
- TATl variants were produced in which K28 was mutated to an amino acid with a smaller side chain (A, S, T, or C) while maintaining the Q80 mutation.
- these enzymes were produced in E. coli and assayed, the TATl variants with S28 and T28 were both found to be active against the intended recognition sequence while maintaining the desired base preference at position -7.
- Wild-type I-Crel and all engineered meganucleases were expressed and purified using the following method.
- the constructs cloned into a pET21a vector were transformed into chemically competent BL21 (DE3) pLysS, and plated on standard 2xYT plates containing 200 ⁇ g/ml carbanicillin. Following overnight growth, transformed bacterial colonies were scraped from the plates and used to inoculate 50 ml of 2XYT broth. Cells were grown at 37°C with shaking until they reached an optical density of 0.9 at a wavelength of 600 nm. The growth temperature was then reduced from 37°C to 22°C.
- Protein expression was induced by the addition of ImM IPTG, and the cells were incubated with agitation for two and a half hours. Cells were then pelleted by centrifugation for 10 min. at 6000 xg. Pellets were resuspended in ImI binding buffer (20 mM Tris-HCL, pH 8.0, 500 mM NaCl, 10 mM imidazole) by vortexing. The cells were then disrupted with 12 pulses of sonication at 50% power and the cell debris was pelleted by centrifugation for 15 min. at 14,000 xg. Cell supernatants were diluted in 4 ml binding buffer and loaded onto a 200 ⁇ l nickel-charged metal-chelating Sepharose column (Pharmacia).
- the column was subsequently washed with 4 ml wash buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 60 mM imidazole) and with 0.2 ml elution buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 400 mM imidazole).
- Meganuclease enzymes were eluted with an additional 0.6 ml of elution buffer and concentrated to 50-130 ⁇ l using Vivospin disposable concentrators (ISC, Inc., Kaysville, UT).
- the enzymes were exchanged into SA buffer (25 mM Tris-HCL, pH 8.0, 100 mM NaCl, 5 mM MgCl 2 , 5mM EDTA) for assays and storage using Zeba spin desalting columns (Pierce Biotechnology, Inc., Rockford, IL).
- SA buffer 25 mM Tris-HCL, pH 8.0, 100 mM NaCl, 5 mM MgCl 2 , 5mM EDTA
- SA buffer 25 mM Tris-HCL, pH 8.0, 100 mM NaCl, 5 mM MgCl 2 , 5mM EDTA
- SA buffer 25 mM Tris-HCL, pH 8.0, 100 mM NaCl, 5 mM MgCl 2 , 5mM EDTA
- the enzyme concentration was determined by absorbance at 280 nm using an extinction coefficient of 23,590 M 1 Cm "1 . Purity and molecular weight of the enzymes
- Heterodimeric enzymes were produced either by purifying the two proteins independently, and mixing them in vitro or by constructing an artificial operon for tandem expression of the two proteins in E. coli.
- the purified meganucleases were mixed 1 : 1 in solution and pre-incubated at 42°C for 20 minutes prior to the addition of DNA substrate.
- the two genes were cloned sequentially into the pET-21a expression vector using Ndel/EcoRI and EcoRI/Hindlll.
- the first gene in the operon ends with two stop codons to prevent read-through errors during transcription.
- a 12-base pair nucleic acid spacer and a Shine-Dalgarno sequence from the pET21 vector separated the first and second genes in the artificial operon.
- the enzyme digests contained 5 ⁇ l 0.05 ⁇ M DNA substrate, 2.5 ⁇ l 5 ⁇ M recombinant I-Crel meganuclease, 9.5 ⁇ l SA buffer, and 0.5 ⁇ l Xmnl, Seal, or Bpml.
- Digests were incubated at either 37°C, or 42°C for certain meganuclease enzymes, for four hours.
- Digests were stopped by adding 0.3 mg/ml Proteinase K and 0.5% SDS, and incubated for one hour at 37°C. Digests were analyzed on 1.5% agarose and visualized by ethidium bromide staining.
- TATl and TAT2 meganucleases recognized DNA sequences that were distinct from the wild-type meganuclease recognition sequence (Fig. 2(B)).
- the wild-type I-Crel meganuclease cleaves the WT recognition sequence, but cuts neither the intended sequence for TATl nor the intended sequence for TAT2.
- TATl and TAT2 likewise, cut their intended recognition sequences but not the wild-type sequence.
- the meganucleases were then evaluated for half-site preference and overall specificity (Fig 3). Wild-type I-Crel was found to be highly tolerant of single-base-pair substitutions in its natural half-site.
- TATl and TAT2 were found to be highly-specific and completely intolerant of base substitutions at positions -1, -2, -3, -6, and -8 in the case of TATl, and positions -1, -2, and -6 in the case of TAT2.
- Both enzymes were expressed in E. coli, purified, and assayed as in Example 1. Both first generation enzymes were found to cleave their intended recognition sequences with rates that were considerably below that of wild-type I-Crel with its natural recognition sequence. To alleviate this loss in activity, the DNA-binding affinity of CCRl and BRP2 was increased by mutating E80 to Q in both enzymes. These second-generation versions of CCRl and BRP2 were found to cleave their intended recognition sequences with substantially increased catalytic rates. 2. Rationally-designed meganucleases with decreased DNA-binding affinity and decreased activity but increased specificity.
- Wild-type I-Crel was found to be highly-tolerant of substitutions to its half-site (Fig. 3(A)).
- the lysine at position 116 of the enzyme which normally makes a salt-bridge with a phosphate in the DNA backbone, was mutated to aspartic acid to reduce DNA-binding affinity.
- This rationally-designed enzyme was found to cleave the wild-type recognition sequence with substantially reduced activity but the recombinant enzyme was considerably more specific than wild-type.
- the half-site preference of the Kl 16D variant was evaluated as in Example 1 and the enzyme was found to be entirely intolerant of deviation from its natural half-site at positions -1, -2, and -3, and displayed at least partial base preference at the remaining 6 positions in the half-site (Fig. 3(B)).
- LAMl and LAM2 Two meganucleases, LAMl and LAM2, were rationally-designed to cleave the half-sites 5'-TGCGGTGTC-3' (SEQ ID NO: 20) and 5'-CAGGCTGTC-S' (SEQ ID NO: 21), respectively.
- the heterodimer of these two enzymes was expected to recognize the DNA sequence 5 '-TGCGGTGTCCGGCGACAGCCTG-S ' (SEQ ID NO: 22) found in the bacteriophage ⁇ pO5 gene.
- LAMl and LAM 2 were cloned, expressed in E. coli, and purified individually as described in Example 1. The two enzymes were then mixed 1 : 1 and incubated at 42 0 C for 20 minutes to allow them to exchange subunits and re-equilibrate. The resulting enzyme solution, expected to be a mixture of LAMl homodimer, LAM2 homodimer, and LAMl /L AM2 heterodimer, was incubated with three different recognition sequences corresponding to the perfect palindrome of the LAMl half-site, the perfect palindrome of the LAM2 half-site, and the non-palindromic hybrid site found in the bacteriophage ⁇ genome.
- the purified LAMl enzyme alone cuts the LAMl palindromic site, but neither the LAM2 palindromic site, nor the LAMl /L AM2 hybrid site.
- the purified LAM2 enzyme alone cuts the LAM2 palindromic site but neither the LAMl palindromic site nor the LAM1/LAM2 hybrid site.
- the 1 :1 mixture of LAMl and LAM2 cleaves all three DNA sites. Cleavage of the LAM1/LAM2 hybrid site indicates that two distinct re-designed meganucleases can be mixed in solution to form a heterodimeric enzyme capable of cleaving a non-palindromic DNA site.
- LAMl homodimer formation of the LAMl homodimer was expected to be reduced due to electrostatic repulsion between E7, E57, and E96 in one monomer and E8 and E61 in the other monomer.
- formation of the LAM2 homodimer was expected to be reduced due to electrostatic repulsion between K7, K57, and K96 on one monomer and K8 and K61 on the other monomer.
- the LAMl /L AM2 heterodimer was expected to be favored due to electrostatic attraction between E7, E57, and E96 in LAMl and K8 and K61 in LAM2.
- the LAMl /L AM2 hybrid site was found to be cleaved preferentially over the two palindromic sites, indicating that substitutions in the meganuclease protein-protein interface can drive the preferential formation of heterodimers.
- a rationally-designed meganuclease heterodimer (ACH1/ACH2) can be produced that cleaves the sequence 5 '-CTGGGAGTCTCAGGACAGCCTG-S ' (SEQ ID NO: 23) in the human FGFR3 gene, mutations in which cause achondroplasia.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites: ACHl:
- a rationally-designed meganuclease heterodimer (HGH1/HGH2) can be produced that cleaves the sequence 5 '-CCAGGTGTCTCTGGACTCCTCC-S ' (SEQ ID NO: 24) in the promoter of the Human Growth Hormone gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (CF1/CF2) can be produced that cleaves the sequence 5 '-GAAAATATCATTGGTGTTTCCT-S ' (SEQ ID NO: 25) in the ⁇ F508 allele of the human CFTR gene.
- a meganuclease was rationally- designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites:
- a rationally-designed meganuclease heterodimer (CCR1/CCR2) can be produced that cleaves the sequence 5 '-AACCCTCTCCAGTGAGATGCCT-S ' (SEQ ID NO: 26) in the human CCR5 gene (an HIV co-receptor).
- a meganuclease was rationally- designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites:
- a rationally-designed meganuclease heterodimer (MYDl /M YD2) can be produced that cleaves the sequence 5 '-GACCTCGTCCTCCGACTCGCTG-S ' (SEQ ID NO: 27) in the 3' untranslated region of the human DM kinase gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (HSV1/HSV2) can be produced that cleaves the sequence 5 '-CTCGATGTCGGACGACACGGCA-S ' (SEQ ID NO: 28) in the UL36 gene of Herpes Simplex Virus- 1 and Herpes Simplex Virus-2.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites:
- a rationally-designed meganuclease heterodimer (ANT1/ANT2) can be produced that cleaves the sequence 5 '-ACAAGTGTCTATGGACAGTTTA-S ' (SEQ ID NO: 29) in the Bacillus anthracis genome.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites: ANTl :
- a rationally-designed meganuclease heterodimer (P0X1/P0X2) can be produced that cleaves the sequence 5 '-AAAACTGTCAAATGACATCGCA-S ' (SEQ ID NO: 30) in the Variola (smallpox) virus gp009 gene.
- a meganuclease was designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites: POXl :
- a rationally-designed meganuclease homodimer (EBB1/EBB1) can be produced that cleaves the pseudo-palindromic sequence 5 '-CGGGGTCTCGTGCGAGGCCTCC-S ' (SEQ ID NO: 31) in the Epstein-Barr Virus BALF2 gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (GLA1/GLA2) can be produced that cleaves the sequence 5 '-CACTAACTCGTATGAGTCGGTG-S ' (SEQ ID NO: 32) in the Arabidopsis thalianna GL2 gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (BRP1/BRP2) can be produced that cleaves the sequence 5 '-TGCCTCCTCTAGAGACCCGGAG-S ' (SEQ ID NO: 33) in the Arabidopsis thalianna BPl gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (MGC 1/MGC2) can be produced that cleaves the sequence 5 '-TAAAATCTCTAAGGTCTGTGCA-S ' (SEQ ID NO: 34) in the Nicotiana tabacum Magnesium Chelatase gene.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- a rationally-designed meganuclease heterodimer (CYP/HGH2) can be produced that cleaves the sequence 5 '-CAAGAATTCAAGCGAGCATTAA-S ' (SEQ ID NO: 35) in the Nicotiana tabacum CYP82E4 gene.
- a meganuclease was rationally- designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites:
- a rationally-designed meganuclease heterodimer (URA1/URA2) can be produced that cleaves the sequence 5 '-TTAGATGACAAGGGAGACGCAT-S ' (SEQ ID NO: 36) in the Saccharomyces cerevisiae URA3 gene.
- a meganuclease was rationally- designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half- sites:
- Rheumatoid arthritis is a chronic inflammatory disease that targets synovial joints and is primarily characterize by joint destruction. The prevalence of the disease is estimated to be as high as 1% in adults and greatly diminishes the quality of life of affected individuals. Although the exact cause of the disease has yet to be determined, the immunological basis of the synovial inflammation and joint destruction is well understood. Activated monocytes and macrophages within the synovial cavity produce high levels of cytokines including interleukin-1 (IL-I) and tumor necrosis factor ⁇ (TNF- ⁇ ). These proinflammatory cytokines induce a cascade of events that ultimately lead to the production of matrix metalloproteinases and osteoclasts, which result in severe damage to cartilage and bone.
- IL-1 interleukin-1
- TNF- ⁇ tumor necrosis factor ⁇
- TNF- ⁇ antagonists as therapy for RA.
- DMARDs disease modifying antirheumatic drugs
- sulphasalazine sulphasalazine
- cyclosporine A cyclosporine A
- methotrexate methotrexate
- TNF- ⁇ antagonists There are currently three TNF- ⁇ antagonists available for clinical use: two are anti-TNF antibodies (Infliximab and Adalimumab) and the third is a soluble TNF-receptor fusion protein (Etanercept). These antagonists effectively block the downstream actions of TNF- ⁇ , and have demonstrated success in reducing the clinical manifestations of RA.
- TNF- ⁇ antagonists Despite the clinical success of TNF- ⁇ antagonists, there are serious adverse effects associated with these agents, including an increased risk of tuberculosis, increased incidence of lymphoma, autoimmune responses, and demyelinating syndromes. These adverse effects are likely due to the systemic inhibition of TNF- ⁇ . Given the serious nature of these side effects, there are considerable efforts to develop alternative and/or complementary strategies to treat RA and other rheumatic diseases. [0370] Targeting TNF- ⁇ at the transcriptional level. TNF- ⁇ inhibitors currently target this important cytokine at either the protein level or the RNA level.
- TNF- ⁇ at the transcriptional level, by engineering a transcriptional repressor that recognizes a DNA sequence unique to the TNF- ⁇ gene.
- This approach has several major advantages over current tactics to inhibit TNF- ⁇ .
- the risk of adverse effects caused by systemic inhibition of TNF- ⁇ should be reduced.
- the minimum effective dose should be significantly less for an engineered transcription factor, because there are only two copies of the TNF- ⁇ promoter in a cell and, thus, only two targets for an engineered transcription factor.
- inhibitors that act at the RNA or protein level there will be hundreds or thousands of targets which, necessarily, require high levels of inhibitors.
- a rationally-designed meganuclease heterodimer (TNF1/TNF2) can be produced that cleaves the sequence 5 '-AATGGAGACGCAAGAGAGGGAG -3' (SEQ ID NO: 42) in the human tumor necrosis factor alpha (TNF- ⁇ ) gene 436bp downstream from the transcription start site.
- a meganuclease was rationally-designed based on the I-Crel meganuclease, as described above, with the following contact residues and recognition sequence half-sites:
- TNFl and TNF2 meganuclease monomers were then arranged into a single- chain meganuclease by joining an N-terminal TNFl monomer, terminated at L155, with a C- terminal TNF2 initiated at K7 using a 38 amino acid linker (SEQ ID NO: 37).
- the SV40 nuclear localization signal (SEQ ID NO: 38) was added to the N-terminus.
- Endo-TNFsc SEQ ID NO: 43
- Endo-TNFsc was expressed in E. coli and purified as described in Example 1.
- the purified meganuclease was then incubated with a plasmid substrate harboring its intended recognition sequence (SEQ ID NO: 42) and cleavage activity was determined as in Example 1.
- Endo-TNF ⁇ n DNA-binding domain Production and evaluation of the Endo-TNF ⁇ n DNA-binding domain.
- the DNA cleavage activity of Endo-TNFsc was eliminated by mutating the glutamine amino acids in positions 57 and 244 to glutamic acid. Q57 and Q244 in TNFsc correspond to Q47 in wild-type I-Crel.
- the resulting protein, Endo-TNF K o (SEQ ID NO: 44), was expressed in E. coli, purified, and tested for cleavage activity as above. No DNA cleavage activity was detected ( Figure 4). Endo-TNF ⁇ o was then cloned into a mammalian expression vector (pCI, Promega).
- This plasmid was used to transfect HEK-293 cells and binding of the Endo-TNF K o protein to its intended recognition sequence in the human TNF- ⁇ gene was confirmed by chromatin immunoprecipitation using standard protocols ⁇ e.g. , the protocol below).
- ChIP Chromatin Immunoprecipitation Protocol
- wash beads 3X in wash buffer 1 (1% Triton X-100, 0.1% SDS, 150 mM NaCL, 2 mM EDTA, 20 mM Tris-HCl, pH 8.0) with protease inhibitors.
- LiCL buffer (0.25 M LiCl, 1% NP4o, 1% deoxycholate, 1 mM EDTA, 10 mM Tris-HCl, pH 8.0).
- Elute immune complexes by adding 150 ⁇ L elution buffer (1% SDS, 100 mM NaHCO 3 ), Proteinase K (500 ⁇ g/mL) and RNase A (500 ⁇ g/ml) and incubating at 37°C for 30 minutes.
- elution buffer 1% SDS, 100 mM NaHCO 3
- Proteinase K 500 ⁇ g/mL
- RNase A 500 ⁇ g/ml
- FIG. 5 shows the results of this ChIP analysis which confirms that the Endo- TNFK O protein does, indeed, bind to its intended site in the TNF- ⁇ gene.
- Endo-TNF ⁇ o is a suitable DNA-binding domain for the production of targeted transcriptional effector intended to regulate expression of the human TNF- ⁇ gene.
- a TNF- ⁇ repressor can be produced by linking Endo-TNF ⁇ o to a KRAB repressor domain (e.g. SEQ ID NO: 41) together using a short (3-15 amino acid) linker rich in glysine and serine residues.
- a transcription factor can be delivered to human cells and its ability to repress transcription of the TNF- ⁇ gene can be determined by RT-PCR to evaluate TNF- ⁇ transcript levels or by ELISA to evaluate TNF- ⁇ protein levels.
- EXAMPLE 6 A Targeted Transcriptional Activator Derived from a Rationally Designed Meganuclease 1. Production of the CCR2 ⁇ o DNA-binding domain.
- the DNA-contacting amino acids of the CCR2 meganuclease are presented in Example 4.
- the CCR2 meganuclease homodimer recognizes the palindromic DNA sequence 5 '-AGGCATCTCGTACGAGATGCCT-S ' (SEQ ID NO: 45).
- the CCR2 K0 meganuclease DNA-binding domain was produced by i) mutating Q47 to E (Q47E) to eliminate DNA cleavage activity ii) adding an N-terminal nuclear- localization signal (SEQ ID NO: 38).
- a KRAB domain from the R. norvegicus Kid-1 protein (SEQ ID NO: 41) was fused to the C-terminus of CCR2 K o using a 9 amino acid linker (GSSGSSGSS).
- the resulting targeted transcriptional activator is referred to as CCR2 REP (SEQ ID NO 46).
- E. coli beta-galactosidase (LacZ) gene was inserted into the mammalian expression vector pCI (Promega) between Pstl and Notl.
- LacZ expression is driven by a truncated CMV promoter (corresponding to the 3' 442 bp of the canonical CMV promoter, SEQ ID NO: 47).
- a CCR2 recognition sequence (SEQ ID NO: 45) was then inserted at the 5' end of this promoter (see Figure 6A).
- HEK 293 cells (IXlO 5 ) were transfected first with either the pCI empty vector or pCI carrying the CCR2 REP gene under the control of a constitutive CMV promoter using Lipofectamine 2000 according to the manufacturer's instructions (Invitrogen). 6 hours post- transfection, transfection complexes were removed and replaced with fresh media. 24 hours post-transfection, the cells were re -transfected with the LacZ reporter plasmid using Lipofectamine 2000. As a measure of transfection efficiency, additional cells were transfected at both time points with pCI eGFP.
- Lysates from transfected cells were subjected to a standard o-nitrophenyl- ⁇ -D- galactoside (ONPG) assay (Current Protocols in Molecular Biology, ed. V. B. Chanda. Vol. 2. 2004, John Wiley & Sons, Inc). Briefly, an aliquot of each lysate was diluted in 300 ⁇ L Z Buffer (60 mM Na 2 HPO 4 , 40 mM NaH 2 PO 4 , 10 mM KCl, 1 mM MgSO 4 , 50 mM 2- mercaptoethanol) in 1.5 mL Eppendorf tubes.
- Z Buffer 60 mM Na 2 HPO 4 , 40 mM NaH 2 PO 4 , 10 mM KCl, 1 mM MgSO 4 , 50 mM 2- mercaptoethanol
- SEQ ID NO: 2 wild-type I-Crel recognition sequence
- SEQ ID NO: 3 wild-type I-Crel recognition sequence
- SEQ ID NO: 4 wild-type I-Crel recognition sequence
- SEQ ID NO: 5 wild-type I-Crel recognition sequence
- SEQ ID NO: 6 wild-type I-Msol, Genbank Accession # AAL34387
- SEQ ID NO: 7 wild-type I-Msol, recognition sequence
- SEQ ID NO: 8 wild-type I-Msol, recognition sequence
- SEQ ID NO: 9 wild-type I-Scel, Genbank Accession # CAA09843
- SEQ ID NO: 10 wild-type I-Scel, recognition sequence
- SEQ ID NO: 11 wild-type I-Scel, recognition sequence 1 CTAGGGATAA CAGGGTAA
- SEQ ID NO: 12 wild-type I-Ceul, Genbank Accession # P32761
- SEQ ID NO: 13 wild-type I-Ceul, recognition sequence
- SEQ ID NO: 14 wild-type I-Ceul, recognition sequence
- SEQ ID NO: 15 HAV-I TAT gene, partial sequence
- SEQ ID NO: 16 (rationally-designed TATl recognition sequence half- site)
- SEQ ID NO: 17 (rationally-designed TAT2 recognition sequence half- site)
- SEQ ID NO: 18 (rationally-designed CCRl recognition sequence half-site)
- SEQ ID NO: 20 (rationally-designed LAMl recognition sequence half- site)
- SEQ ID NO: 21 (rationally-designed LAM2 recognition sequence half- site)
- SEQ ID NO: 22 (LAMl /L AM2 recognition sequence in bacteriophage ⁇ pO5 gene)
- SEQ ID NO: 24 potential recognition sequence in human growth hormone promoter
- SEQ ID NO: 25 (potential recognition sequence in human CFTR gene ⁇ F508 allele)
- SEQ ID NO: 26 potential recognition sequence in human CCR5 gene
- SEQ ID NO: 27 potential recognition sequence in human DM kinase gene 3' UTR
- SEQ ID NO: 28 potential recognition sequence in Herpes Simplex Virus- 1 and Herpes Simplex Virus-2 UL36 gene
- SEQ ID NO: 29 potential recognition sequence in Bacillus anthracis genome
- SEQ ID NO: 30 (potential recognition sequence in the Variola (smallpox) virus gp009 gene) 1 AAAACTGTCA AATGACATCG CA
- SEQ ID NO: 31 potential recognition sequence in the Epstein-Barr Virus BALF2 gene
- SEQ ID NO: 32 potential recognition sequence in the Arabidopsis thalianna GL2 gene
- SEQ ID NO: 33 potential recognition sequence in the Arabidopsis thalianna BPl gene
- SEQ ID NO: 35 potential recognition sequence in the Nicotiana tabacum CYP82E4 gene
- SEQ ID NO: 36 potential recognition sequence in the Saccharomyces cerevisiae URA3 gene
- SEQ ID NO: 37 (I-Crel single-chain linker amino acid sequence)
- SEQ ID NO: 38 (SV40 nuclear localization signal)
- SEQ ID NO: 40 (VP 16 activation domain amino acid sequence)
- SEQ ID NO: 45 CCR2 Homodimer Recognition Sequence
- SEQ ID NO: 46 CCR2 REP Amino Acid Sequence
- SEQ ID NO: 47 (Truncated CMV Promoter Sequence)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Pharmacology & Pharmacy (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Enzymes And Modification Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
Claims
Priority Applications (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2011507554A JP2011519558A (en) | 2008-04-28 | 2009-04-27 | A rationally designed fusion molecule of DNA-binding protein and effector domain |
EP09739516A EP2279250A4 (en) | 2008-04-28 | 2009-04-27 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
AU2009241351A AU2009241351A1 (en) | 2008-04-28 | 2009-04-27 | Fusion molecules of rationally-designed DNA-binding proteins and effector domains |
CA2722797A CA2722797A1 (en) | 2008-04-28 | 2009-04-27 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
US12/914,014 US20110123509A1 (en) | 2008-04-28 | 2010-10-28 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
US13/623,017 US20140010797A1 (en) | 2005-10-18 | 2012-09-19 | Fusion Molecules of Rationally-Designed DNA-Binding Proteins and Effector Domains |
US14/679,733 US20150315556A1 (en) | 2005-10-18 | 2015-04-06 | Fusion Molecules of Rationally-Designed DNA-Binding Proteins and Effector Domains |
US15/666,425 US20180023065A1 (en) | 2005-10-18 | 2017-08-01 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
US16/658,987 US20200109384A1 (en) | 2005-10-18 | 2019-10-21 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
US17/224,054 US20210332338A1 (en) | 2005-10-18 | 2021-04-06 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
US18/352,037 US20240228992A1 (en) | 2005-10-18 | 2023-07-13 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US4849908P | 2008-04-28 | 2008-04-28 | |
US61/048,499 | 2008-04-28 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/223,852 Continuation-In-Part US20120030783A1 (en) | 2005-10-18 | 2011-09-01 | Rationally designed meganucleases with altered sequence specificity and dna-binding affinity |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/914,014 Continuation US20110123509A1 (en) | 2005-10-18 | 2010-10-28 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2009134714A2 true WO2009134714A2 (en) | 2009-11-05 |
WO2009134714A3 WO2009134714A3 (en) | 2010-01-28 |
Family
ID=41255712
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2009/041796 WO2009134714A2 (en) | 2005-10-18 | 2009-04-27 | Fusion molecules of rationally-designed dna-binding proteins and effector domains |
Country Status (6)
Country | Link |
---|---|
US (1) | US20110123509A1 (en) |
EP (1) | EP2279250A4 (en) |
JP (1) | JP2011519558A (en) |
AU (1) | AU2009241351A1 (en) |
CA (1) | CA2722797A1 (en) |
WO (1) | WO2009134714A2 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010136981A3 (en) * | 2009-05-26 | 2011-04-28 | Cellectis | Meganuclease variants cleaving the genome of a pathogenic non-integrating virus and uses thereof |
WO2011064736A1 (en) | 2009-11-27 | 2011-06-03 | Basf Plant Science Company Gmbh | Optimized endonucleases and uses thereof |
WO2011064750A1 (en) | 2009-11-27 | 2011-06-03 | Basf Plant Science Company Gmbh | Chimeric endonucleases and uses thereof |
WO2011104382A1 (en) * | 2010-02-26 | 2011-09-01 | Cellectis | Use of endonucleases for inserting transgenes into safe harbor loci |
WO2012136476A1 (en) | 2011-04-07 | 2012-10-11 | Ludwig-Maximilians-Universität München | Method of activating a target gene in a cell |
DE112010004584T5 (en) | 2009-11-27 | 2012-11-29 | Basf Plant Science Company Gmbh | Chimeric endonucleases and applications thereof |
EP2612918A1 (en) | 2012-01-06 | 2013-07-10 | BASF Plant Science Company GmbH | In planta recombination |
WO2013136274A1 (en) | 2012-03-13 | 2013-09-19 | University Of Guelph | Myb55 promoter and use thereof |
WO2013136273A2 (en) | 2012-03-13 | 2013-09-19 | University Of Guelph | Methods of increasing tolerance to heat stress and amino acid content of plants |
WO2016100333A1 (en) | 2014-12-15 | 2016-06-23 | Syngenta Participations Ag | Pesticidal microrna carriers and use thereof |
WO2018078390A1 (en) | 2016-10-31 | 2018-05-03 | Institute Of Genetics And Developmental Biology Chinese Academy Of Sciences | Compositions and methods for enhancing abiotic stress tolerance |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102076719B1 (en) * | 2012-02-02 | 2020-02-12 | 다우 아그로사이언시즈 엘엘씨 | Plant transactivation interaction motifs and uses thereof |
UA118090C2 (en) * | 2012-09-07 | 2018-11-26 | ДАУ АГРОСАЙЄНСІЗ ЕлЕлСі | Fad2 performance loci and corresponding target site specific binding proteins capable of inducing targeted breaks |
ES2745566T3 (en) * | 2013-06-11 | 2020-03-02 | Portage Pharmaceuticals Ltd | Structure, Manufacture, and Uses of Peptides that Can Penetrate Human-derived Cells Conjugated to Specific Biologically Active Charging Peptides |
US20150044772A1 (en) * | 2013-08-09 | 2015-02-12 | Sage Labs, Inc. | Crispr/cas system-based novel fusion protein and its applications in genome editing |
US9388430B2 (en) * | 2013-09-06 | 2016-07-12 | President And Fellows Of Harvard College | Cas9-recombinase fusion proteins and uses thereof |
JP7249620B2 (en) * | 2018-10-26 | 2023-03-31 | 国立大学法人 香川大学 | nucleic acid binding factor |
JP2023529503A (en) * | 2020-06-13 | 2023-07-10 | オクロジェネクス インコーポレイテッド | AAV-mediated gene transfer for retinopathy |
EP4284823A1 (en) | 2021-01-28 | 2023-12-06 | Precision BioSciences, Inc. | Modulation of tgf beta signaling in genetically-modified eukaryotic cells |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2646438B1 (en) * | 1989-03-20 | 2007-11-02 | Pasteur Institut | A METHOD FOR SPECIFIC REPLACEMENT OF A COPY OF A GENE PRESENT IN THE RECEIVER GENOME BY INTEGRATION OF A GENE DIFFERENT FROM THAT OR INTEGRATION |
WO1993004169A1 (en) * | 1991-08-20 | 1993-03-04 | Genpharm International, Inc. | Gene targeting in animal cells using isogenic dna constructs |
US5792640A (en) * | 1992-04-03 | 1998-08-11 | The Johns Hopkins University | General method to clone hybrid restriction endonucleases using lig gene |
US5436150A (en) * | 1992-04-03 | 1995-07-25 | The Johns Hopkins University | Functional domains in flavobacterium okeanokoities (foki) restriction endonuclease |
US5474896A (en) * | 1992-05-05 | 1995-12-12 | Institut Pasteur | Nucleotide sequence encoding the enzyme I-SceI and the uses thereof |
US6395959B1 (en) * | 1992-05-05 | 2002-05-28 | Institut Pasteur | Nucleotide sequence encoding the enzyme I SceI and the use thereof |
US5792632A (en) * | 1992-05-05 | 1998-08-11 | Institut Pasteur | Nucleotide sequence encoding the enzyme I-SceI and the uses thereof |
US5527695A (en) * | 1993-01-29 | 1996-06-18 | Purdue Research Foundation | Controlled modification of eukaryotic genomes |
US6051409A (en) * | 1995-09-25 | 2000-04-18 | Novartis Finance Corporation | Method for achieving integration of exogenous DNA delivered by non-biological means to plant cells |
US5830729A (en) * | 1996-04-18 | 1998-11-03 | Institut Pasteur | I Sce I-induced gene replacement and gene conversion in embryonic stem cells |
US6265196B1 (en) * | 1996-05-07 | 2001-07-24 | Johns Hopkins University | Methods for inactivating target DNA and for detecting conformational change in a nucleic acid |
US7135608B1 (en) * | 1997-08-28 | 2006-11-14 | The Salk Institute For Biological Studies | Site-specific recombination in eukaryotes and constructs useful therefor |
US6114600A (en) * | 1997-10-31 | 2000-09-05 | The United States Of America As Represented By The Secretary Of Agriculture | Resolution of complex integration patterns to obtain single copy transgenes |
US6781032B1 (en) * | 1998-08-14 | 2004-08-24 | The Regents Of The University Of California | Homologous recombination in plants |
WO2000046385A1 (en) * | 1999-02-03 | 2000-08-10 | The Children's Medical Center Corporation | Gene repair involving in vivo excision of targeting dna |
WO2000046386A2 (en) * | 1999-02-03 | 2000-08-10 | The Children's Medical Center Corporation | Gene repair involving the induction of double-stranded dna cleavage at a chromosomal target site |
US6794136B1 (en) * | 2000-11-20 | 2004-09-21 | Sangamo Biosciences, Inc. | Iterative optimization in the design of binding proteins |
US6746870B1 (en) * | 1999-07-23 | 2004-06-08 | The Regents Of The University Of California | DNA recombination in eukaryotic cells by the bacteriophage PHIC31 recombination system |
AU7447000A (en) * | 1999-09-30 | 2001-04-30 | Izumu Saito | Dna containing variant frt sequences |
US7126041B1 (en) * | 1999-12-10 | 2006-10-24 | North Carolina State Unversity | High efficiency gene targeting in plants |
BR0112633A (en) * | 2000-07-21 | 2003-09-16 | Us Agriculture | Method for replacing, translocating and stacking DNA in eukaryotic genomes |
DE10131786A1 (en) * | 2001-07-04 | 2003-01-16 | Sungene Gmbh & Co Kgaa | Recombination systems and methods for removing nucleic acid sequences from the genome of eukaryotic organisms |
WO2003012036A2 (en) * | 2001-07-27 | 2003-02-13 | The Government Of The United States Of America As Represented By The Secretary Of Health And Human Services | Systems for in vivo site-directed mutagenesis using oligonucleotides |
EP1427828B1 (en) * | 2001-09-14 | 2010-04-28 | Cellectis | Random integration of a polynucleotide after in vivo linearization |
CA2474486C (en) * | 2002-01-23 | 2013-05-14 | The University Of Utah Research Foundation | Targeted chromosomal mutagenesis using zinc finger nucleases |
DK1485475T3 (en) * | 2002-03-15 | 2008-01-21 | Cellectis | Hybrid meganuclease and single-chain maganuclease and their use |
US20060130179A1 (en) * | 2002-03-29 | 2006-06-15 | Suttie Janet L | Lambda integrase mediated recombination in plants |
CA2497913C (en) * | 2002-09-05 | 2014-06-03 | California Institute Of Technology | Use of chimeric nucleases to stimulate gene targeting |
US7285699B2 (en) * | 2002-10-07 | 2007-10-23 | Stowers Institute For Medical Research | Ends-out gene targeting method |
US20030175968A1 (en) * | 2002-10-30 | 2003-09-18 | Golic Kent G. | Gene targeting method |
JP2006518372A (en) * | 2003-01-28 | 2006-08-10 | セレクティス | Use of meganuclease and its application to induce homologous recombination ex vivo and into vivo in vertebrate body tissues |
US7888121B2 (en) * | 2003-08-08 | 2011-02-15 | Sangamo Biosciences, Inc. | Methods and compositions for targeted cleavage and recombination |
ES2320228T3 (en) * | 2003-11-18 | 2009-05-20 | Bayer Bioscience N.V. | IMPROVED ADDRESSED DNA INSERT IN PLANTS. |
US7935862B2 (en) * | 2003-12-02 | 2011-05-03 | Syngenta Participations Ag | Targeted integration and stacking of DNA through homologous recombination |
US20060288444A1 (en) * | 2004-08-13 | 2006-12-21 | Mccarroll Robert | Soybean polymorphisms and methods of genotyping |
EP2327771A1 (en) * | 2005-03-15 | 2011-06-01 | Cellectis | I-crel meganuclease variants with modified specificity, method of preparation and uses thereof |
AP2693A (en) * | 2005-05-27 | 2013-07-16 | Monsanto Technology Llc | Soybean event MON89788 and methods for detection thereof |
AU2006304668B2 (en) * | 2005-10-18 | 2013-03-07 | Duke University | Rationally-designed meganucleases with altered sequence specificity and DNA-binding affinity |
US20080083042A1 (en) * | 2006-08-14 | 2008-04-03 | David Butruille | Maize polymorphisms and methods of genotyping |
AR067114A1 (en) * | 2007-06-22 | 2009-09-30 | Monsanto Technology Llc | METHODS AND COMPOSITIONS TO SELECT LOCI BASED ON PERFORMANCE AND EXCESSION OF CHARACTERISTICS |
-
2009
- 2009-04-27 EP EP09739516A patent/EP2279250A4/en not_active Withdrawn
- 2009-04-27 CA CA2722797A patent/CA2722797A1/en not_active Abandoned
- 2009-04-27 AU AU2009241351A patent/AU2009241351A1/en not_active Abandoned
- 2009-04-27 WO PCT/US2009/041796 patent/WO2009134714A2/en active Application Filing
- 2009-04-27 JP JP2011507554A patent/JP2011519558A/en active Pending
-
2010
- 2010-10-28 US US12/914,014 patent/US20110123509A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of EP2279250A4 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010136981A3 (en) * | 2009-05-26 | 2011-04-28 | Cellectis | Meganuclease variants cleaving the genome of a pathogenic non-integrating virus and uses thereof |
US9404099B2 (en) | 2009-11-27 | 2016-08-02 | Basf Plant Science Company Gmbh | Optimized endonucleases and uses thereof |
WO2011064736A1 (en) | 2009-11-27 | 2011-06-03 | Basf Plant Science Company Gmbh | Optimized endonucleases and uses thereof |
WO2011064750A1 (en) | 2009-11-27 | 2011-06-03 | Basf Plant Science Company Gmbh | Chimeric endonucleases and uses thereof |
DE112010004583T5 (en) | 2009-11-27 | 2012-10-18 | Basf Plant Science Company Gmbh | Chimeric endonucleases and applications thereof |
DE112010004584T5 (en) | 2009-11-27 | 2012-11-29 | Basf Plant Science Company Gmbh | Chimeric endonucleases and applications thereof |
DE112010004582T5 (en) | 2009-11-27 | 2012-11-29 | Basf Plant Science Company Gmbh | Optimized endonucleases and applications thereof |
US10316304B2 (en) | 2009-11-27 | 2019-06-11 | Basf Plant Science Company Gmbh | Chimeric endonucleases and uses thereof |
AU2010325549B2 (en) * | 2009-11-27 | 2017-04-20 | Basf Plant Science Company Gmbh | Optimized endonucleases and uses thereof |
WO2011104382A1 (en) * | 2010-02-26 | 2011-09-01 | Cellectis | Use of endonucleases for inserting transgenes into safe harbor loci |
WO2012136476A1 (en) | 2011-04-07 | 2012-10-11 | Ludwig-Maximilians-Universität München | Method of activating a target gene in a cell |
EP2612918A1 (en) | 2012-01-06 | 2013-07-10 | BASF Plant Science Company GmbH | In planta recombination |
WO2013102875A1 (en) | 2012-01-06 | 2013-07-11 | Basf Plant Science Company Gmbh | In planta recombination |
WO2013136273A2 (en) | 2012-03-13 | 2013-09-19 | University Of Guelph | Methods of increasing tolerance to heat stress and amino acid content of plants |
WO2013136274A1 (en) | 2012-03-13 | 2013-09-19 | University Of Guelph | Myb55 promoter and use thereof |
WO2016100333A1 (en) | 2014-12-15 | 2016-06-23 | Syngenta Participations Ag | Pesticidal microrna carriers and use thereof |
WO2018078390A1 (en) | 2016-10-31 | 2018-05-03 | Institute Of Genetics And Developmental Biology Chinese Academy Of Sciences | Compositions and methods for enhancing abiotic stress tolerance |
Also Published As
Publication number | Publication date |
---|---|
CA2722797A1 (en) | 2009-11-05 |
EP2279250A4 (en) | 2011-10-12 |
EP2279250A2 (en) | 2011-02-02 |
JP2011519558A (en) | 2011-07-14 |
US20110123509A1 (en) | 2011-05-26 |
WO2009134714A3 (en) | 2010-01-28 |
AU2009241351A1 (en) | 2009-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20110123509A1 (en) | Fusion molecules of rationally-designed dna-binding proteins and effector domains | |
US20220356493A1 (en) | Dna-binding proteins and uses thereof | |
US9833479B2 (en) | Gene correction of SCID-related genes in hematopoietic stem and progenitor cells | |
AU2017315414B2 (en) | Engineered target specific nucleases | |
JP6231595B2 (en) | Engineered truncated half-domain | |
US20240228992A1 (en) | Fusion molecules of rationally-designed dna-binding proteins and effector domains | |
Grizot et al. | Efficient targeting of a SCID gene by an engineered single-chain homing endonuclease | |
JP2013518602A (en) | Targeted genome modification by partially single-stranded donor molecules | |
WO2018081775A1 (en) | Gene correction of scid-related genes in hematopoietic stem and progenitor cells | |
US20180238877A1 (en) | Isolation of antigen specific b-cells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09739516 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011507554 Country of ref document: JP Ref document number: 2722797 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009241351 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009739516 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2009241351 Country of ref document: AU Date of ref document: 20090427 Kind code of ref document: A |