CA2354228A1 - Plants and seeds containing synthetic promoters - Google Patents
Plants and seeds containing synthetic promoters Download PDFInfo
- Publication number
- CA2354228A1 CA2354228A1 CA002354228A CA2354228A CA2354228A1 CA 2354228 A1 CA2354228 A1 CA 2354228A1 CA 002354228 A CA002354228 A CA 002354228A CA 2354228 A CA2354228 A CA 2354228A CA 2354228 A1 CA2354228 A1 CA 2354228A1
- Authority
- CA
- Canada
- Prior art keywords
- promoter
- plant
- gene
- expression
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Synthetic elements for enhancing expression of genes in plant cells are disclosed. These include a promoter with a "TATA to start" sequence containing 64% or greater GC content and a synthetic upstream element incorporating several OCS
binding motifs and novel flanking sequences. Upstream activating regions (UARs) are also disclosed that can further increase the constitutive transcriptional activity when they are operably linked to said promoter and/or the synthetic upstream element. In particular, the nucleotide sequence of the UAR of the maize Ubi-1 gene is provided and its use in expression cassettes and vectors containing these promoter elements. Cells and plants transformed with these vectors are further provided. These include a transgenic sunflower expressing an exogenous oxalate oxidase gene at a high level under the transcriptional control of a recombinant promoter having at least one upstream activating region of the 35S CaMV
promoter.
binding motifs and novel flanking sequences. Upstream activating regions (UARs) are also disclosed that can further increase the constitutive transcriptional activity when they are operably linked to said promoter and/or the synthetic upstream element. In particular, the nucleotide sequence of the UAR of the maize Ubi-1 gene is provided and its use in expression cassettes and vectors containing these promoter elements. Cells and plants transformed with these vectors are further provided. These include a transgenic sunflower expressing an exogenous oxalate oxidase gene at a high level under the transcriptional control of a recombinant promoter having at least one upstream activating region of the 35S CaMV
promoter.
Description
62451-854 (D) PLANTS AND SEEDS CONTAINING
SYNTHETIC PROMOTERS
RELATED APPLICATIONS
This application is a di..visional application of .'~ Canadian app,~ication 2,314,598 which entered the Canadian National Phase on August 15, 2000 stemming from PCT application No. PCT/US99/03863 filed on February 23, 1999.
FIELD OF THE INVENTION
This invention relates generally to the field of plant molecular biology and in particular to enhanced expression of desired structural genes in both monocotyledonous and dicotyledonous plants.
BACKGROUND OF THE INVENTION
Gene expression encompasses a number of steps originating .from the DNA template ultimately to the final protein or protein product. Control and regulation of gene expression can occur 1=hrough numerous mechanisms. The initiation o:f transcr_i_ption of a gene is generally thought of as the predominant control of gene expression. The transcriptional controls (or promoters) are generally relegated to relatively short sequences imbedded in the 5'-flanking or upstream region of the transcribed gene. There are DNA
sequences which affect gene expression in response to environmental stimuli, nutrient availability, or adverse conditions including heat shock, anaerobiosis or the presence of heavy metals. There are also DNA sequences which control gene expression during development or in a tissue, or organ specific fashion.
Promoters con.t:ain the signals for RNA polymerase to begin transcription so that protein synthesis can proceed. DNA
62451-854 (D) binding, nuclear protE:i~:zs interact specifically with these cognate promoter DNA sequences to promote the formation of the transcriptional complex and eventually initiate the gene expression to process.
.'i One of the most common sequence motifs present in the promoters of genes transcribed by eukaryotic RNA polymerase II
(polII) system is the ~"rATA" element which resides upstream of the start of transcription. Eukaryotic promoters are complex and are comprised of c.o:mponents which include a TATA box consensus sequence at about 35 base pairs 5' relative to the transcription start site or cap site which is defined as +1.
The TATA motif is the site where the TATA-binding-protein (TBP) as part of a complex of several polypeptides (TFIID complex) binds and productively interacts !directly or indirectly) with factors bound to other sequence elements of the promoter. This TFIID comple:~ in turn recruits the RNA polymerase II complex to be positioned for the start of transcription generally 25 to 30 base pairs downstream of the TATA element and promotes elongation thus producing RNA molecules. The sequences around 21) the start of transcription (designated INR) of some polII genes seem to provide an alternate binding site for factors that also recruit members of they TF IID complex and thus ~~ activate"
transcription. These INR sequences are particularly relevant in promoters that. lack functional TATA elements providing the 2.~ core promoter binding sites for eventual transcription. It has been proposed that promoters containing both a functional TATA
and INR motif are the most efficient in transcriptional activity. (Zenzi_e-Gregory et al. (1992) J. Biol. Chem.
267:2823-2830).
30 In most instances sequence elements other than the TATA motif are required..for accurate transcription. Such elements are often located upstream of the TATA motif and a subset may have homology to the consensus sequence CCAAT.
62451-854(D) Other DNA sequences have been found to elevate the overall leve7_ of expression of the nearby genes. One of the more common elements that. have been described reside far upstream from the initiation site and seem to exhibit position '.> and orientation independent characteristics. These far upstream elements have :peen designated enhancers.
One of the less common elements by virtue of their specificities are sequences that interact with specific DNA
binding factors. These sequence motifs are collectively known as upstream elements which are usually position and orientation dependent.
Many upstream elements have been identified in a number of plant promoters based initially on function and secondarily on sequence homologies. These promoter upstream elements range widely in type of control: from environmental responses like temper<~ture, moisture, wounding, etc., developmental cues, (germination, seed maturation, flowering, etc.) to spatial information (tissue specificity). These elements also seem to exhibit modularity in that they may be exchanged with other elements while maintaining their characteristic contro=1_ ever gene expression.
Promoters are usually positioned 5' or upstream relative to the start of the coding region of the corresponding gene, and the entire region containing all the ancillary 2.~ elements affecting regulation or absolute levels of transcription may be comprised of less than 100 base pairs or as much as 1 kilobase pair.
A number of promoters which are active in plant cells have been described in the literature. These include nopaline synthase (NOS) and octcpine synthase (OCS) promoters (which are carried on tumor indu~:ir~g plasmids of Agrobacterium 62451-854(D) tumefaciens). The cauliflower mosaic virus (CaMV) 19S and 35S
promoters, the light-inducible promoter from the small subunit of ribulose bisphosphat~s carboxylase (ssRUBICSO, a very abundant plant polypeptide), and the sucrose synthase promoter .'i are also inc~~uded. A11 of these promoters have been used to create various types of DNA constructs which have been expressed in plants. (See for example PCT publication W084/02913 Rogers, et al).
Two promoters that have been widely used in plant cell transformations are those of the genes encoding alcohol dehydrogenase, AdhI and AdhII. Both genes are induced after the onset of anaerobiosis. Maize AdhI has been cloned and sequenced as has been AdhII. Formation of an AdhI chimeric gene, Adh-Cat compris,~.ng the AdhI promoter links to the chloramphenicol acetyl.transferase (CAT) coding sequences and nopaline synthase (NOS) 3' signal caused CAT expression at approximately 4-fold higher levels at low oxygen concentrations than under control conditions. Sequence elements necessary for anaerobic induction of the ADH-CAT chimeric have also been identified. The existence of anaerobic regulatory element (ARE) between positions -140 and --99 of the maize AdhI promoter composed of at least twe sequence elements, positions -133 to -124 and positions -113 to 99 botr~ of which have found to be necessary and are sufficient for low oxygen expression of ADH-2.'~ CAT gene activity. Tlne Adh promoter however responds to anaerobiosis and is not a constitutive promoter drastically limiting its effectiveness.
Another commonly used promoter is the 35S promoter of Cauliflower Mosaic Virus. The (CaMV) 35S promoter is a dicot virus promoter howeve:r_ it directs expression of genes introduced into protoplasts of both dicots and monocots. The 35S promoter is a very ~;trong promoter and this accounts for its widespread use fo:r high level expression of traits in 62451-854 (D) transgenic plants. The CaMV 35S promoter however has also demonstrated relatively l.ow activity in several agriculturally significant grami.naceous plants such as wheat. While these promoters all give high expression in dicots, few give high '.i levels of expression in monocots. A need exists for synthetic promoters and other elements that induce expression in transformed monocot protoplast cel7_s.
SUMMARY OF THE INVENTION
Methods and compositions for the expression of heterologous sequences in host cells are pravided. The compositions find particular use in controlling the expression of sequences in plants. The compositions of the invention comprise promoter sequences. In particular, a novel synthetic core promoter molecule and regulatory elements useful in controlling expression in target cells are provided. The core promoter comprises a ~.L'ATA box and a start of transcription.
Further, the "TATA to start" region is 640 or greater GC rich.
The regulatory elements include a novel upstream element and upstream activating regions. The upstream activating region is different from the synthetic upstream element. The elements can be used together or with other promoter elements to control expression of sequences of interest.
It is a primary object of the invention to provide synthetic regulatory elements that enhance expression of introduced genes in plant cells and plant tissues.
It is an objects of the invention to provide a recombinant promoter molecule that provides for reliably high levels of expression of introduced genes in target cells. It is yet another object of the invention to provide heterologous upstream enhancer elements that can enhance the activity of any promoter.
SYNTHETIC PROMOTERS
RELATED APPLICATIONS
This application is a di..visional application of .'~ Canadian app,~ication 2,314,598 which entered the Canadian National Phase on August 15, 2000 stemming from PCT application No. PCT/US99/03863 filed on February 23, 1999.
FIELD OF THE INVENTION
This invention relates generally to the field of plant molecular biology and in particular to enhanced expression of desired structural genes in both monocotyledonous and dicotyledonous plants.
BACKGROUND OF THE INVENTION
Gene expression encompasses a number of steps originating .from the DNA template ultimately to the final protein or protein product. Control and regulation of gene expression can occur 1=hrough numerous mechanisms. The initiation o:f transcr_i_ption of a gene is generally thought of as the predominant control of gene expression. The transcriptional controls (or promoters) are generally relegated to relatively short sequences imbedded in the 5'-flanking or upstream region of the transcribed gene. There are DNA
sequences which affect gene expression in response to environmental stimuli, nutrient availability, or adverse conditions including heat shock, anaerobiosis or the presence of heavy metals. There are also DNA sequences which control gene expression during development or in a tissue, or organ specific fashion.
Promoters con.t:ain the signals for RNA polymerase to begin transcription so that protein synthesis can proceed. DNA
62451-854 (D) binding, nuclear protE:i~:zs interact specifically with these cognate promoter DNA sequences to promote the formation of the transcriptional complex and eventually initiate the gene expression to process.
.'i One of the most common sequence motifs present in the promoters of genes transcribed by eukaryotic RNA polymerase II
(polII) system is the ~"rATA" element which resides upstream of the start of transcription. Eukaryotic promoters are complex and are comprised of c.o:mponents which include a TATA box consensus sequence at about 35 base pairs 5' relative to the transcription start site or cap site which is defined as +1.
The TATA motif is the site where the TATA-binding-protein (TBP) as part of a complex of several polypeptides (TFIID complex) binds and productively interacts !directly or indirectly) with factors bound to other sequence elements of the promoter. This TFIID comple:~ in turn recruits the RNA polymerase II complex to be positioned for the start of transcription generally 25 to 30 base pairs downstream of the TATA element and promotes elongation thus producing RNA molecules. The sequences around 21) the start of transcription (designated INR) of some polII genes seem to provide an alternate binding site for factors that also recruit members of they TF IID complex and thus ~~ activate"
transcription. These INR sequences are particularly relevant in promoters that. lack functional TATA elements providing the 2.~ core promoter binding sites for eventual transcription. It has been proposed that promoters containing both a functional TATA
and INR motif are the most efficient in transcriptional activity. (Zenzi_e-Gregory et al. (1992) J. Biol. Chem.
267:2823-2830).
30 In most instances sequence elements other than the TATA motif are required..for accurate transcription. Such elements are often located upstream of the TATA motif and a subset may have homology to the consensus sequence CCAAT.
62451-854(D) Other DNA sequences have been found to elevate the overall leve7_ of expression of the nearby genes. One of the more common elements that. have been described reside far upstream from the initiation site and seem to exhibit position '.> and orientation independent characteristics. These far upstream elements have :peen designated enhancers.
One of the less common elements by virtue of their specificities are sequences that interact with specific DNA
binding factors. These sequence motifs are collectively known as upstream elements which are usually position and orientation dependent.
Many upstream elements have been identified in a number of plant promoters based initially on function and secondarily on sequence homologies. These promoter upstream elements range widely in type of control: from environmental responses like temper<~ture, moisture, wounding, etc., developmental cues, (germination, seed maturation, flowering, etc.) to spatial information (tissue specificity). These elements also seem to exhibit modularity in that they may be exchanged with other elements while maintaining their characteristic contro=1_ ever gene expression.
Promoters are usually positioned 5' or upstream relative to the start of the coding region of the corresponding gene, and the entire region containing all the ancillary 2.~ elements affecting regulation or absolute levels of transcription may be comprised of less than 100 base pairs or as much as 1 kilobase pair.
A number of promoters which are active in plant cells have been described in the literature. These include nopaline synthase (NOS) and octcpine synthase (OCS) promoters (which are carried on tumor indu~:ir~g plasmids of Agrobacterium 62451-854(D) tumefaciens). The cauliflower mosaic virus (CaMV) 19S and 35S
promoters, the light-inducible promoter from the small subunit of ribulose bisphosphat~s carboxylase (ssRUBICSO, a very abundant plant polypeptide), and the sucrose synthase promoter .'i are also inc~~uded. A11 of these promoters have been used to create various types of DNA constructs which have been expressed in plants. (See for example PCT publication W084/02913 Rogers, et al).
Two promoters that have been widely used in plant cell transformations are those of the genes encoding alcohol dehydrogenase, AdhI and AdhII. Both genes are induced after the onset of anaerobiosis. Maize AdhI has been cloned and sequenced as has been AdhII. Formation of an AdhI chimeric gene, Adh-Cat compris,~.ng the AdhI promoter links to the chloramphenicol acetyl.transferase (CAT) coding sequences and nopaline synthase (NOS) 3' signal caused CAT expression at approximately 4-fold higher levels at low oxygen concentrations than under control conditions. Sequence elements necessary for anaerobic induction of the ADH-CAT chimeric have also been identified. The existence of anaerobic regulatory element (ARE) between positions -140 and --99 of the maize AdhI promoter composed of at least twe sequence elements, positions -133 to -124 and positions -113 to 99 botr~ of which have found to be necessary and are sufficient for low oxygen expression of ADH-2.'~ CAT gene activity. Tlne Adh promoter however responds to anaerobiosis and is not a constitutive promoter drastically limiting its effectiveness.
Another commonly used promoter is the 35S promoter of Cauliflower Mosaic Virus. The (CaMV) 35S promoter is a dicot virus promoter howeve:r_ it directs expression of genes introduced into protoplasts of both dicots and monocots. The 35S promoter is a very ~;trong promoter and this accounts for its widespread use fo:r high level expression of traits in 62451-854 (D) transgenic plants. The CaMV 35S promoter however has also demonstrated relatively l.ow activity in several agriculturally significant grami.naceous plants such as wheat. While these promoters all give high expression in dicots, few give high '.i levels of expression in monocots. A need exists for synthetic promoters and other elements that induce expression in transformed monocot protoplast cel7_s.
SUMMARY OF THE INVENTION
Methods and compositions for the expression of heterologous sequences in host cells are pravided. The compositions find particular use in controlling the expression of sequences in plants. The compositions of the invention comprise promoter sequences. In particular, a novel synthetic core promoter molecule and regulatory elements useful in controlling expression in target cells are provided. The core promoter comprises a ~.L'ATA box and a start of transcription.
Further, the "TATA to start" region is 640 or greater GC rich.
The regulatory elements include a novel upstream element and upstream activating regions. The upstream activating region is different from the synthetic upstream element. The elements can be used together or with other promoter elements to control expression of sequences of interest.
It is a primary object of the invention to provide synthetic regulatory elements that enhance expression of introduced genes in plant cells and plant tissues.
It is an objects of the invention to provide a recombinant promoter molecule that provides for reliably high levels of expression of introduced genes in target cells. It is yet another object of the invention to provide heterologous upstream enhancer elements that can enhance the activity of any promoter.
62451-854(D) It is yet another object of the invention to provide plants, plant. cells and plant tissues containing either or both of the recombinant promoter or upstream element of the invention.
.'i It is yet another object of the invention to provide vehicles for transformation of plant cells including viral or plasmid vectors and expression cassettes incorporating the synthetic promoter and upstream elements of the invention.
It is yet another object of the invention to provide 1() bacterial cells comprising such vectors for maintenance, and plant transformation.
Other objects of the invention will become apparent from the description of the invention which follows.
DESCRIPTION OF THE FIGURES
15 Figure 1 is a depiction of a typical nucleotide base arrangement of a core promoter containing the consensus sequences of TATA and INR motifs present in plant promotors. A
designates +1 of the t:.ranscribed region.
Figure 2 is a depiction of the complete Syn II Core 20 Promoter Sequence with an example of a plant promoter and both are aligned at the ma=jor start of transcription (bold letter).
The TATA motif is underlined. The CaMV 35S promoter is shown with percent GC content sequences shown in parentheses.
Figure 3 is the DNA sequence of the Rsyn7 upstream 25 element. The TGACG motifs are indicated in bold.
Figure 4 is a plasmid map of one embodiment of the invention comprising the Syn II Core promoter and Rsyn7 elements driving a GUS containing construct.
.'i It is yet another object of the invention to provide vehicles for transformation of plant cells including viral or plasmid vectors and expression cassettes incorporating the synthetic promoter and upstream elements of the invention.
It is yet another object of the invention to provide 1() bacterial cells comprising such vectors for maintenance, and plant transformation.
Other objects of the invention will become apparent from the description of the invention which follows.
DESCRIPTION OF THE FIGURES
15 Figure 1 is a depiction of a typical nucleotide base arrangement of a core promoter containing the consensus sequences of TATA and INR motifs present in plant promotors. A
designates +1 of the t:.ranscribed region.
Figure 2 is a depiction of the complete Syn II Core 20 Promoter Sequence with an example of a plant promoter and both are aligned at the ma=jor start of transcription (bold letter).
The TATA motif is underlined. The CaMV 35S promoter is shown with percent GC content sequences shown in parentheses.
Figure 3 is the DNA sequence of the Rsyn7 upstream 25 element. The TGACG motifs are indicated in bold.
Figure 4 is a plasmid map of one embodiment of the invention comprising the Syn II Core promoter and Rsyn7 elements driving a GUS containing construct.
62451-854 (D) Figure 5 depicts several schematics of synthetic promoters according to the present invention tested in transient and stable transformants.
Figure 6 is a depiction of transient assay data using .'i the plasmids incorporating 'the promoter sequences of the invention.
Figure 7(A). Rsyn7::GUS (PHP6086) activity to TO
maize plants.
Figure 7(B) is a schematic of VT stage corn plants with sites of tissue samples indicated.
Figure 8 depicts GUS activity in root segments of a segregating population of maize T1 transgenic seedlings containing the Rsyn7::GUS (PHP6086) or the LJBI:GUS (PHP3953) construct.
1'~ Figures 9A and 9B depict GUS expression of three synthetic promoters in TO transgenic maize plants including the promoter sequences of the invention as comparison.
Figure 10 shows the comparison of the activities of the Rsyn7 promoter, the CaMV 35S promoter, and the 35SU-Rsyn7 promoter in transient expression in sunflower cotyledons.
Figure 11 shows the effect of the Ubi-1 upstream activating region on the strength of the Rsyn7 promoter in transient expression :in sunflower cotyledons.
Figure 12 shcws that in the stably transformed sunflower callus, GUS expression behind the control of the 35SU-Rsyn7 is CaMV 20'~ higher than when behind the control of the 35S promoter.
62451-854 (D) Figure 13 shows the effect of the Ubi-I upstream activating region on t:h~~ activity of the Rsyn7 promoter in transgenic sunflower callus assay.
DETAILED DESCRIPTION OF THE INVENTION
In the description that follows a number of terms are used extensively. The following definitions are provided in order to remove ambiguities in thE: intent or. scope of their usage in the specification and claims, and to facilitate understanding of the invention.
1() A structural gene is a DNA sequence that is transcribed into messenger RNA (mRNA) which is then translated into a sequence of amino acids characteristic of a specific polypeptide.
A promoter is a DNA sequence that directs the 15 transcription of a structural gene. Typically a promoter is located in the 5' region of a gene, proximal to the transcriptional start site of a structural gene. The promoter of the invention comprises at least a core promoter as defined below. Additionally, the promoter may also include at least 20 one upstream element. Such elements include UARs and optionally, other DNA sequences that affect transcription of a structural gene such as a synthetic upstream element.
A core promoter or minimal promoter contains the essential nucleotide sequences for expression of the operably 2.'~ linked coding sequence, including the TATA box and start of transcription. E3y this definition, a core promoter may or may not have detectable activity in the absence of specific sequences that may enhance the activity or confer tissue specific activity. For example, the maize SGB6 gene core 30 promoter consists of about 37 nucleotides 5' of the transcriptional start site of the SGB6 gene, while the g 62451-854(D) Cauliflower Mosaic Virus (CaMV) 35S core promoter consists of about 33 nucleotides 5' of the transcriptional start site of the 35S genome.
ADH refers generally to a plant expressible alcohol _'> dehydrogenase gene and specifically to the alcohol dehydrogenase gene from maize.
ADH 1 UAR refers to the DNA fragment spanning the region between nucleotide positions about -1094 to about -106 of the alcohol dehydrogenase gene 1 from maize, or homologous fragment than is functionally equivalent. The sequence is numbered with the start of transcription site designated as +1 according to the correction published by Ellis et al. (1987) supra.
"TATA to start" shall mean the sequence between the primary TATA motif and the start of transcription.
A synthetic DNA is an artificially created DNA
sequence that is not produced naturally, and must be introduced to an organism or to an ancestor of that organism to control or to be expressed.
OCS element refers to the TGACG motif identified from the octopine synthase gene, histone genes, enzyme genes for agropine biosynthesis, the mannopi.ne synthase gene, the CaMV
35S gene, histone H3 gene and nopaline synthase gene. As used herein the term includes any sequence capable of binding the ASF-1 factor as identi.fi.ed in U.S. Patent No. 4,990,607 by Katagiri.
UAR is typically a position or orientation dependent element that primarily directs tissue, cell type, or regulated expression.
62451-854(D) An enhancer is a DNA regulatory element that can increase efficiency of transcription regardless of the distance or orientation of the enhancer relative to the start site of transcription.
The term expression refers to biosynthesis of a gene product. In the case of a structural gene, expression involves transcription of the structural. gene into mRNA and then translation of the mRNA into one or more polypeptides.
A cloning vector is a DNA molecule such as a plasmid, cosmid or bacterial phage that has the capability of replicating autonomously in a host cell. Cloning vectors typically contain one or a small number of restriction endonuclease recognition sites at which foreign DNA sequences can be inserted in a determinable fashion without loss of 1.'~ essential biological function of the vector, as well as a marker gene that is suitable for use in the identification and selection of cells transformed with the cloning vector. Marker genes typically include genes that provide tetracycline resistance, hygromycin resistance or ampicillin resistance.
An expression vector is a DNA molecule comprising a gene that is expr_essec~ in a host cell. Typically gene expression is placed under the control of certain regulatory elements including promoters, tissue specific regulatory elements, and enhancers. Such a gene is said to be "operably 2.5 linked to" the regulatory elements.
A recombinant host may be any prokaryotic or eukaryotic cell that contains either a cloning vector or an expression vector. This; term also includes those prokaryotic or eukaryotic cells that: have been genetically engineered to contain the cloned genes in the chromosome or genome of the host cell.
62451-854(D) A transgenic plant is a ~>lant having one or more plant cells that contain an expression vector.
It will be un~.~erstood that there may be minor sequence variations within sequence or fragments used or .'i disclosed in this app7.ication. By ~~minor variations" is intended that the sequences have at least 800, preferably 90%
sequence identity. These variations may be determined by standard techniques to enable those of ordinary skill in the art to manipulate and bring into utility the functional units of the promoter elements necessary to direct initiation of transcription in the structural gene followed by a plant expressible transcription termination (and perhaps polyadenylation) signal.
Plant tissue includes differentiated and undifferentiated tissues or plants, including but not limited to roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of r_ells and culture such as single cells, protoplast, embryos, and callus tissue. The plant tissue may be in plants or i.n organ, tissue or cell culture.
One embodiment of the invention, the core promoter, is shown in SEQ ID N0:1. The core promoter is capable of driving expression of a coding sequence in a target cell, particularly plant cells. The core promoter finds use in driving expression of sequences which are only needed at 2.~ minimal levels in the target cells. Also disclosed is a novel upstream element, SEQ ID N0:2 that helps to potentiate transcription. 'The synthetic core promoter can be used with combinations of enhancer, upstream elements, and/or activating sequences from the 5'-flanking regions of plant expressible structural genes. Similarly the upstream element can be used in combination with various plant core promoter sequences. In one embodiment the core promoter and upstream element are used 62451-854(D) together to obtain ten-fold higher expression of an introduced marker gene in monocot transgenic plants than is obtained with the maize ubiquitin 1 promoter.
The core promoter comprises a TATA motif and a GC
.'i rich ~~TATA to start of transcription" region (64% or greater GC
content) that is generally characteristic of animal promoters.
The sequence is placed 5' of a structural gene and will promote constitutive expression which is non-tissue specific in transgenic plant ce.ll:.; .
The invention also comprises an expression cassette comprising (the upstream element) the synthetic core promoter, a structural gene, the expression of which is desired in plant cells, and a polyadenylation or stop signal. The expression cassette can be encompassed in plasmid or viral vectors for transformation of plant protoplast cells.
The invention also encompasses transformed bacterial cells for maintenance and replication of the vector, as well as transformed monocot or. dicot cells and ultimately transgenic plants.
In another embodiment, the invention encompasses an upstream element that can be used in combination with the synthetic promoter or with other known promoters in the art.
The upstream element comprises at least 3 OCS binding motifs (TGACG) with a novel intervening sequence. One embodiment is 2~ disclosed in SEQ ID Nc:):2 and is placed 5' to a core promoter sequence to enhance the transcription levels of the resulting gene product. Thus the invention comprises an expression cassette comprising the synthetic upstream element of the invention, 5' to a plant inducible promoter which is 5' to a structural gene. This expression cassette can be embodied in vectors and plasmids as earlier described.
62451-854 (D) In a preferred embodiment the synthetic upstream element is used in combination with the synthetic core promoter sequence to achieve non-tissue specific constitutive expression of the gene product which is a ten-fold enhancement of the .'i maize Ubi-1 promoter.
The present invention also encompasses a promoter construct comprising the synthetic core promoter described above and an upstream activating region. The upstream activating region is different from the synthetic upstream element. Preferably the upstream activating region is an upstream activating region (UAR) having substantial sequence similarity to the UAR of CaMV 35S or maize Ubi-I. Promoter constructs of the invention may comprise the synthetic core promoter in combination with at least one UAR and optionally at least one synthetic upstream element.
The promoter construct can be contained for convenience in an expression cassette. This expression cassette can be embodied in transformation vectors.
The sequence cf the upstream activating region (UAR) of the maize Ubi-1 gene is also provided. This UAR can be used in combination with any core promoter to enhance the activity of the promoter.
The promoter of the invention as seen in SEQ ID NO:l and/or SEQ ID NO:10 (modified core promoter), can be used to obtain high levels of expression o.f structural genes.
Similarly the upstream element of the invention (SEQ ID N0:2) can be used in combination with other promoters or the promoter of the invention to potentiate levels of transcription in genetically modified plants. Production of a genetically modified plant tissue expressing a structural gene under the control of the regulatory elements of the invention combines 62451-854(D) teachings of the present disclosure with a variety of techniques and expedients known in the art. In most instances alternate expedients exist for each stage of the overall process. The choice of expedients depends on the variables '.i such as the plasmid vector system chosen for the cloning and introduction of the recombinant DNA molecule, the plant species to be modified, the particular structural gene, promoter elements and upstream elements used. Persons skilled in the art are able to select and use appropriate alternatives to achieve functionality. Culture conditions for expressing desired structural genes and cultured cells are known in the art. Also as known in the art, a number of both monocotyledonous and dicotyledonous plant species are transformable and regenerable such that whole plants containing and expressing desired genes under regulatory control of the promoter molecules and upstream elements of the invention may be obtained. As is known to those of skill in the art, expression in transformed plants may be tissue specific and/or specific to certain developmental stages or environmental influences. Truncated promoter selection and structural gene selection are other parameters which may be optimized to achieve desired plant expression as is known to those of skill in the art and taught herein.
The nuc:leoti.de sequences of the invention can be 2.'~ introduced into any p_1_ant. The genes to be introduced can be conveniently used in expression cassettes for introduction and expression in any plant of interest.
Such expression cassettes will comprise the transcriptional initiation region of the invention linked to a 3) nucleotide sequence oa i.nterest. Such an expression cassette is provided with a plurality of restriction sites for insertion of the gene of interest to be under the transcriptional 62451-854(D) regulation of the regulatory regions. The expression cassette may additionally contain selectable marker genes.
The transcri.ptional cassette will include in the 5'-3' direction of transcription, a transcriptional and translational initiation region, a DNA sequence of interest, and a transcriptional and translational termination region functional in plants. The termination region may be native with the transcri_ptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau et al., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon eat al. (1991) Genes Dev. 5:141-149; Mogen et a1 (1990) Plarnt Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639.
The genes of. the invention are provided in expression cassettes for expression in the plant of interest. The cassette will include 5' and 3' regulatory sequences operably linked to the gene of interest. i'he cassette may additionally contain at least one additional gene to be cotransformed into the organism. Alternatively, the additional genes) can be provided on another expression cassette. Where appropriate, 2~ the genes) may be op~imized for increased expression in the transformed plant. That is, the genes can be synthesized using plant preferred codons for improved expression. Methods are available in the art for synthesizing plant preferred genes.
See, for example, U.S. Patent Nos. 5,380,831, 5,436,391 and Murray et al. (1989) Nucleic Acids Res. 17:477-498.
Additional ,sequence modifications are known to enhance gene expression in a cellular host. These include 62451-854(D) elimination of sequences encoding spurious polyadenylation signals, exonintron splice site signals, transposon-like repeats, and other such well-characterized sequences which may be deleterious to gene expression. The G-C content of the .'i sequence may be adjusted to levels average f.or a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
The selection of an appropriate expression vector 1() will depend upon the method of introducing the expression vector into host cells. Typically an expression vector contains (1) prokaryotic DNA elements coding for a bacterial replication origin and an antibiotic resistance gene to provide for the amplification and selection of the expression vector in 15 a bacterial host; (2) DNA elements that control initiation of transcription such as a promoter; (3) DNA elements that control the processing of transcripts such as introns, transcription termination/polyadenylation sequence; and (4) a reported gene that is operatively linked to the DNA elements to control 20 transcription in3_tiation. Useful reported genes include (3-glucuronidase, ~i-galactosidase, chloramphenicol acetyl transferase, luciferase, green fluorescent protein (GFP) and the like. Preferably the reported gene is either (3-glucuronidase (GUS), GFf or luciferase. The general 2~ descriptions of plant expression vectors and reporter genes can be found in Gruber, et~. a.l., "Vectors for Plant Transformation, in Methods in Plant Molecular Biology & Biotechnology" in Glich et al., (Eds. pp. 89-119, CRC Press, 1993). Moreover GUS
expression vectors and C;US gene cassettes are available from 3~ Clonetech Laboratories, Inc.., Palo Alto, California while luciferase expression vectors and luciferase gene cassettes are available from Promega C:orp. (Madison, Wisconsin).
62451-854(D) Expression vectors containing genomic or synthetic fragments can be introduced into protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced into intact. tissue. General methods of culturing plant tissues are provided for example by Maki et al.
"Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular Biology & Biotechnology, Glich et al. (Eds.
pp. 67-88 CRC Press, x.993); and by Phillips et al. "Cell-Tissue Culture and In-Vitro Manipulation" in Corn & Corn Improvement, 3rd Edit~.on Sprague et al. (Eds. pp. 345-387) American Society of Agronomy Inc. et al. 1988.
Methods of introducing expression vectors into plant tissue include the direct infection or co-cultivation of plant cell with Agrobacterium tumefaciens, Horsch et al., Science, 227:1229 (1985). Desc:ripti.ons of Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer provided by Gruber, et al. supra.
Preferably, expression vectors are introduced into maize or other plant tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA
injection, e.lectroporation and the like. More preferably expression vectors are introduced into plant tissues using the microprojectile media delivery with the biolistic device. See, for example, Tomes et al.. "Direct DNA transfer into intact plant cells via rnicroprojectile bombardment" In: Gamborg and Phillips (Eds.) Plant Cell, Tissue and Organ Culture:
Fundamental Methods, :~pringer-Verlag, Berlin (1995).
The vectors ef~ the invention can not only be used for expression of structural. genes but may also be used in exon trap cloning, or promoter trap procedures to detect differential gene expression in varieties of tissues, K.
Lindsey et al., 1993 "'fagging Genomic Sequences That Direct 62451-854(D) Transgene Expression by Activation of a Promoter Trap in Plants", Transgenic Research 2:33-47. D. Auch & Reth, et al., "Exon Trap Cloning: Using PCR to Rapidly Detect and Clone Exons from Genomic DNA Fragments", Nucleic Acids Research, Vol. 18, No. 22, p. 6743.
This inventive promoter is based in part on the discovery that a GC rich "TATA to start" region i.n a plant promoter acts as a very strong nontissue specific core promoter inducing constitutive expression in plant cells. The TATA
element of plant promoters of polII genes generally have the sequence TATA(A/T)A(AiT)A, SEQ ID N0:3, whereas the consensus of the start of transcription consists of the sequence 5'...TYYTCAT(A/C)AA...3'. SEQ ID N0:3, where the A designates the starting base for transcription. The typical plant promoter sequence is depicted in Figure 1.
Sequences intervening the TATA element and the start of transcription have been shown to play a significant role in transcriptional activation efficiency. The TATA binding protein has been shown to interact with the minor groove of the double helix binding too the TATA motif bending it towards the major groove side (Kim, et al. 1993, Nature, 365:512-520). It thus follows that sequences downstream of the TATA motif that impact this finding will affect the efficiency of stable transcriptional complex formation and ultimately expression.
Surveys of the "TATA t.o start" regions of plant promoters show a significantly higher level of AT-rich sequences leading to the potential of minor groove compression (Yaurawj et al Biological Abstracts Vol. 47, Issue 8, Ref. 144712, "Consensus Sequences for Plant Minimal Promoters" Annual Meeting of the American Society of Plant Physiologists, July 29-August 2, 1995, Plant Physiology 108 [2 Supp.] 1995, 114). Generally animal promoters show a GC-rich "TATA to start" sequence that leads to a major groove compression suggesting that average 62451-854(D) plant and animal core promoter transcriptional complexes recognize and interact with a somewhat different TATA to start structure with the corresponding sequence difference. Quite surprisingly the applicant has found that a GC-rich animal type .'~ synthetic promoter works very well in plants.
While the invention is not bound by any theory, it is possible that the AT-rich TATA motif present in a GC-rich sequence may "present itself" more prominently to the TATA-binding complex by a sharp demarcation of the TATA motif that would interact more tightly with the TATA-binding complex.
This would improve the start of transcription efficiency, by shifting the equilibria of binding to a more stabilized form, whereas the "non-bounded TATA" version, i.e. having a higher level of AT-rich sequences flanking the TATA motif, the TATA-1_'i binding complex would potentially slide or stutter 5' or 3' to the start site and effectively reduce the efficiency of binding ultimately reducing tz:anscription. Little data regarding this region of plant promoters is available except crude deletions and some point mutations. The obvious design of a synthetic 2C) core promoter for plant expression would include the AT-rich "TATA to start" sequence based on surveys of known plant promoters. However, based on the "bounded" mechanism, it is postulated by the mechanism of the invention that a more efficient core promoter is a result of a TATA motif imbedded in 2'i a GC-rich sequence.
Figure 2 depi~~ts the Syn II Core promoter sequence, SEQ ID N0:1 of the inve~:ztion with examples of plant core promoters aligned to the mayor start of transcription. Another example of a plant promoter 35S of CaMV (SEQ ID N0:4) are shown 30 with percent GC-rich sequences shown at the right in parentheses. The Syn II Core sequence does not show any significant sequence homology to sequences in the public sequence databases.
62451-854(D) The synthetic Syn II Core promoter sequence shows a 64o GC-rich "TATA to start:" sequence different from the overall 40o GC-rich sequence present in traditional plant promoters (CaMV35S for example). The naturally occurring and isolated .'~ UBI core promoter which potentiates very high levels of activity in monocots usually shows a 64o GC-rich "TATA to start" sequence more similar to animal promoters. Such examples provided the impetus to design a high GC-rich "TATA to start" sequence for efficient transcription in opposition to the current dogma of plant core promoters.
Thus the invention comprises a synthetic plant core promoter sequence comprising a TATA motif and a "TATA to start"
region that is 64o GC-rich or greater. In a preferred embodiment, the promoter may include restriction endonuclease 1'i target sites for ease of cloning. In the most preferred embodiment, the sequence is that of SEQ ID N0:1. As will be appreciated by those of skill in the art, several base transversions within SEQ ID N0:1 may occur which will maintain the percent GC-content and are intended within the scope of 2C) this invention. For example guanines could be replaced with cytosines and vice-versa without affecting the overall efficacy of the promoter, so lon~~ as the percent GC-content is maintained.
In another embodiment, the invention comprises a 25 synthetic upstream element positioned 5' to any naturally occurring or synthetic promoter for use in plants, particularly maize gene expression.
From the activity of numerous promoters, basic elements (binding sites) have been defined. These include for 30 example AT-rich regions from heat shock promoters, and ASF-1 binding site (AS-1) elements present in octopine synthase (OCS) and Cauliflower Mosaic Virus promoters. AS-1 is one of the 62451-854 (D) better known upstream elements and its binding sequence (OCS
element) is present in many consts.t=utive plant promoters such as the CaMV35S, A. tumefaciens, NGS and OCS wheat histone promoters. The OCS element was first isolated as an enhancer .'~ element in the promoter of the OCS gene where it was identified as a 16-base pair pali.ndromic sequence (Ellis et al. (1987) EMBO J. 6:11-16), but has been reduced to its essential features as a TGACG motif. See U.S. Patent No. 4,990,607. The upstream element of the invention has a 71% identity to the promoter enhances element disclosed in U.S. Patent 5,023,179 to Lam et al. The two sequences are quite different in their flanking sequences surrounding the TGACG motif, which regions have been shown to impact the level of transcription enhancement. The transcriptional enhancing activity of the OCS
1.'i element correlates with the in-vitro binding of a transcriptional factor'. Similar elements were also identified in the promoter regions of six other cDNA genes involved in opine synthesis and throve plant viral promoters including the CaMV 35S promoter (Bouchez, et al 1989) supra. These elements were shown to bind the ~DCS transcription factor in-vitro and enhance transcription in plant cells.
In tobacco a DNA binding factor, TGA1, was shown to interact specifically with the AS-1 element either alone or in conjunction with other ,~oromoter elements. (Katagiri et al, 2'i 1989, Nature 340:727-730). This factor was also shown to be expressed in a root-preferred manner in tobacco plants. Core promoters with one or two copies of the OCS upstream element tend to potentiate gene expression whereas 4 or more repeats of this element produce more or less constitutive activity albeit low relative to intact: 35S promoters.
Thus the invention incorporates a synthetic upstream element which can be used with the core promoter of the invention or other core promoters to enhance gene expression.
62451-854 (D) The element incorporates t=hree OCS-like motifs and novel intervening sequences which enhance gene expression.
Figure 3, SEQ ID N0:2 shows the complete sequence of one embodiment (Rsyn7) of the synthetic upstream element which .'~ incorporates at least three TGACG SEQ ID N0:5 OSC-like motifs which are indicated in bold.
Sequences flanking many elements such as the TGACG
SEQ ID N0:5 motif have been shown to have profound impacts on binding affinities of DNA binding factors and thus play as an important role as the central motifs themselves. (Burrows et al. 1992, Plant Molecular Biology 19:665-675, Shinder et al.
1992, Plant Cell 4:1309-1319, Foster et al. 1994, FASEBJ 8:192-200). The novel sequences flanking the TGACG motifs in the Rsyn7 promoter have been determined and established clear 1.'i enhancement of transcri:ptional activity with various promoters, particularly when used with the Syn II Core promoter.
Rsyn7 upstrea;n element has been cloned upstream of the Syn II Core promoter driving a GUS construct and has yielded levels of GUS activity in transgenic maize plants approximately ten-fold higher than the ubiquitin promoter, the strongest maize promoter to date.
In yet another aspect of the present invention, at least one upstream activating region (UAR), which is different from the synthetic upstream element, is operably linked to the 2.'i synthetic core promoter. The UAR may be used alone or in combination with the synthetic upstream element described herein. Preferably, th~~ upstream activating regions of the cauliflower mosaic virus (CaMV) 35S promoter and the maize Ubi-1 gene promoter are utilized. Additionally, sequences having sequence similarity to these UARs may be utilized as long as such sequences retain the ability to enhance promoter 62451-854 (D) activity. Enhancements can be measured by assaying for levels of transcripts or alternatively protein production.
CaMV 35S UARs have been well studied in the art. The complete nucleotide sequence of the CaMV circular double-strand .'i DNA has been established in the art.. See Guilley et al. (1980) Cell 21:285-294. The 35S promoter transcribes the major 35S
RNA transcript from the circular viral genome by nucleus RNA
polymerase II. See Guilley et al. (1982) Cell 30:763-773.
Moreover, the 35S UARs can function with a heterologous promoter and increase expression of a gene of interest in cells and transgenic plants. Shah et al. (1986) Science 233:478-481.
Multiple cis regulatory elements for the activity of the CaMV
35S promoter have been identified. See Odell et al. (1985) Nature 313:810-812; Fang et al. (1989) Plant Cell 1:141-150.
1.'i In the present invention, a large fragment of the upstream activating regions (UARs) of the CaMV 35S promoter can be utilized to enhance the activity of the core synthetic promoter. The size of the UAR can, for example, range from about 15 base pairs to about 850 base pairs, preferably from about 20 to about 500, more preferably from about 20-25 to about 50-200 base pairs. A preferred region of the 35S CaMV
upstream region includes sequences from about -421 to about -90. It is recognized that modifications, i.n length and nucleotide sequence can be made to the region and still result 2_'> in enhanced activity of the core synthetic promoter. Such modifications can be tested for effect on activity by using expression systems as set forth in the Experimental Section of the present application. The numbers on the UAR sequence diagram indicate the position upstream from the transcription 3C) start site, or +1 position of the 35S structural gene. For example, -25 means a po~s:ition 25 base pairs upstream from the transcription start site= of the 35S structural gene.
62451-854(D) The upstream activating region of the maize ubiquitin gene Ubi-1 can also be utilized in the invention. The sequence of the Ubi-1 gene transcription regulatory region is disclosed in US Patent No. 5,510,474. See also Christensen et al. (1992) Plant Mol. Biol. 18:675-689; Cornejo et a1. (1993) Plant Mol.
Biol. 23:567-581; Takimoto et al. (1994) Plant Mol. Biol.
26:1007-1012; and Chri.stensen et al. (1996) Transgenic Res.
5:213-218. The UAR of the Ubi-1 gene promoter comprises preferably from about -867 to about -54. As indicated above for the 35S UAR, modifications of the Ubi-1 UAR that still function to enhance the activity of the core promoter are encompassed.
While the full sequence of the ubiquitin promoter has been published, this is the first disclosure of the Ubi UAR.
1.'i Thus, the invention discloses the UAR of the ubiquitin promoter as well as the Ubi UAR in combination with any promoter.
Additionally, methods for using the Ubi UAR to enhance activity of promoters are encompassed.
The upstream activating regions as described herein can be linked with the synthetic core promoter and/or other upstream elements by any conventional method that is generally known in the art as long as an operative element or promoter is constructed. The upstream activating regions are generally operably linked to the 5' end of the core promoter. When the 2'i synthetic upstream element is also present, the upstream activating regions can be linked to either the 5' end of the synthetic upstream element, the 3' end of the core synthetic promoter or inserted between the synthetic upstream element and the synthetic core promoter. In a preferred embodiment the 3C) upstream activating regions are linked in close proximity to the synthetic upstream element, if present, and the synthetic core promoter. By close proximity is intended within from about 1 to about 50 nucleotides. However, it is recognized 62451-854 (D) that more than 50 nucleotides may separate the elements. The upstream activating regions can be in the 5' to 3' direction or the 3' to 5' direction, but preferably in the 5' to 3' direction at the 5' end of the synthetic core promoter or the .'~ synthetic upstream element.
One or multiple copies of the upstream activating regions can be used. When multiple copies are utilized, they can be tandem repeats of one UAR or combinations of several UARs. In this manner, the level of expression of a nucleotide 1c) sequence of interest r_an be controlled by the number of UARs present in the promoter construction since the results indicate that increased expression levels are obtained with increased numbers of UARs. Thus, the invention provides methods for regulating levels of Expression of a gene or nucleotide l.'~ sequence of interest .
As indicated, multiple copies of a UAR can be used to enhance the activity of the operably linked promoter. As noted, multiple copies of the same or different UARs can be utilized. For example, any combination of CaMV 35S UARs and 20 maize Ubi-1 gene UARs can be utilized.
The promoters of this invention having one or more UARs as described above can be provided in expression cassettes and such cassettes contained in p1_asmid or viral vectors. Such vectors can be used for transformation of bacteria and plant 25 cells. Transgenic plants can be ultimately regenerated from such transformed plant ce:Lls.
The UARs incorporated into plant promoters can substantially enhance transcription activity in transgenic plants. For example, ene or more copies of the upstream 30 activating region of the maize Uby-1 gene can be operably linked to a promoter having the core synthetic promoter 62451-854(D) sequence and the synthetic upstream element of the invention.
The promoter constructs of the invention can be operably linked to any nucleotide sequence or gene of interest. The promoter construct, for example, can be used to enhance oxalate oxidase gene expression i.n transgenic plants. Oxalate oxidase is a plant enzyme implicated in plant defense mechanisms against a pathogens attack. The enzyme degrades the chemical compound oxalic acid secreted by plant pathogens. See e.g., PCT
Publication No. WO 92/14824. Increasing the oxalate oxidase 1() level in plants such as sunflower will lead to increased plant resistance to plant pathogens.
The following examples are for illustration purposes only and are intended in no way to limit the scope or application of the present invention. Those of skill in the 1.'i art will appreciate that many permutations can be achieved and are in fact intended to be within the scope of the invention.
Plasmids were designed using the multiple cloning site of pBlueScri.ptIIKS+* from Stratagene. (To facilitate 2C) cloning of the different combination of elements).
Oligonucleotides containing the sequences of the elements were synthesized with restriction endonuclease sites at the ends.
Thus elements could be added or removed and replaced as needed.
GUS and Luciferase were used for reporter genes.
2-'i For transient assays, plasmid DNA was introduced into intact 3-day-old maize .seedlings by particle bombardment.
Following 16 hours incubation at 25EC in the dark, expression was assayed by measuring GUS enzyme activity in root and shoot extracts from each seedling to determine if any tissue-3C1 preferred expression wa;s demonstrated. GUS activity was *Trade-mark 62451-854(D) measured using a GUS-Light assay kit from Tropix (47 Wiggins Avenue, Bedford, MA 01730).
Constructs that gave high levels of expression were introduced into a cell line to produce stable transformants.
.'i These stable transformants (TO) were assayed by PCR to determine the presence of the GUS gene by MUG (4-methylumelliferyl.-glucuronide) assay to quantify the activity level of the GUS protein being produced. When the plants were ready to be transferred to the greenhouse they were assayed histochemically with X-gluc* to determine where the GUS product was being synthesized. Plants demonstrating preferred expression levels were grown in the greenhouse to V6 stage.
Construction of Plasmids Containing the Syn II Core 1'. Promoter.
Standard molecular biological techniques were carried out according to Maniantis et al. (1982) Molecular Cloning: A
Laboratory Manual, Cold .Spring Harbor Laboratory, Cold Spring Harbor, N.Y. All plasm:ids utilized in the invention can be prepared according to the directions of the specification by a person of ordinary skil=L in the art without undue experimentation employing materials readily available in the art.
Oligos N306 SEQ ID N0:6 5'-TCGACACTGC AGCTCTAGGG
ATGGTAGCGC AGGGTGCGTA GC~'rACGTATT TATAGCCGCT CGAGTG-3' and N307 SEQ ID N0:7 5'-GATCCACTC:G AGCGGCTATA AATACGTACC TACGCACCCT
GCGCTACCAT CCTAGAGCT GCAGTG-3' were synthesized according to directions on an automated DNA synthesizer (such as Applied Biosystems Inc. DNA Synthesizer (Model 380B). These automated synthesizers are commercially available. The oligos were then *Trade-mark 62451-854(D) ligated to the BamHI fragment of the pBlueScriptIIKS+ plasmid comprising of the (3-gl.ucuronidase gene interrupted by the maize ADhI intron 1 region. .A map of a plasmid incorporating both the Syn II Core promoter and the upstream e1_ement is disclosed :i as Figure 4. Several. other embodiments are shown in other plasmids depicted in figure 5. Plasmid numbers are shown to the right of each promoter diagram with the corresponding legend placed below the diagrams. The top diagram shows the complete transplant transcriptional unit with the subsequent diagrams focusing on the salient differences between 35S and Syn II Core promoters. The legend shows the number and nature of the various promoter subelements, the sequence if relatively short, the source of the element and position relative to the start of transcription.
The sequence of the core promoter consists of 35 base pairs with enzyme sites upstream of a TATA box and a start of transcription with 10 to 15 base pairs downstream. Upstream elements (Gal 4-binding sites, Rsyn, AT-GBL etc.) were fused to the core sequence with ADhI-intron and different marker genes (LUC or GUS) and were demonstrated functional both in transient assays (Figure 6) and Rsyn stably transformed plants (Figure 7) .
Construction of Upstream Element Rsyn7 Fused to Syn 2.5 II Core Promoter Resulting in Plasmids PHP5903 and PHP6086.
Oligos for constructing the Rsyn7 promoter subelement N1965: (SEQ ID N0:8) GA.TCCTATGA CGTATGGTAT GACGTGTGTT
CAAGATGATG ACTTCAAACC TF~CCTATGAC G'rATGGTATG ACGTGTGTCG
ACTGATGACT TA-3' and N1966: (SEQ :ID N0:9) GATCTAAGTC ATCAGTCGAC
ACACGTCATA CCATACGTCA fAGGTAGGTT TGAAGTCATC ATCTTGAACA
CACGTCATAC CATACGTCA TP,CJ-3' were synthesized as earlier 62451-854(D) described. The oligos were annealed and cloned into a PHP3398 plasmid upstream of the Syn II Core sequence and resulted in several versions of the original Rsyn7 sequence due to spontaneous deletions. The Rsyn7-2 version involved a single _'i base deletion resulting in a 3X reiterative TGACG motif upstream of the Syn I7: ;:ore promoter (Rsyn7::LUC, P5903). The LUC coding sequence was replaced by GUS coding sequence to produce the Rsyn7::GUS construct P6086. P6086 was later introduced into transgenic maize resulting in high levels of constitutive activity in four of the six active events examined (Figure 7).
The progeny from TO plants from several transformation events were examined and GUS activity ranging from 1 to 400 PPM (micrograms GUS enzyme/GFW in root tissue of 7-day old seedlings) Figure 8. These TO and Tl plants generally produced 4X-lOX greated GUS activity than plants harboring the ubiquiti.n::GUS report=er gene.
Thus from the foregoing, it can be seen that the invention accomplishes at least all of its objectives.
21) EXAMPLE 4 Transformation. and Expression with Syn II Core Promoter and/or Rsyn7 Upstream Element.
Using transient bombardment assays the Syn II Core promoter sequence was compared against the 35S core sequence 2.~ either alone or in conjunction with numerous activation elements. Figure 6 is a depiction of transient assay data using the plasmids incorporating the promoter sequences of the invention and shows transient GUS or LUC activity in three-day old maize roots or BMS callus bombarded with chimeric 30 promoter::GUS or LUC constructs. The -33 CaMV35S in the Syn II
Core promoter versions of the synthetic promoter:: GUS (or LUC) 62451-854(D) constructs were bombarded into three-day old roots (or cultured BMS calli as described hereinafter) and assayed for enzyme activity 20 hours after bombardment=s. The data shown are the raw enzyme units of a compilation of at least three experiments and have not been normalized in any fashion due to the inherent variability of the transient assays. Control plasmids 1654 and 3537 are the LUC constructs tested in maize BMS calli. There is approximately 4 to 20 fold difference in transient activity between the 35S and Syn II Core versions. The Y axis is in log scale. Both core promoters were driving a GUS containing construct (Figures 4 and 5) and generated a basal level of activity (Figure 6). However when activator elements were placed upstream of the TATA motif, the Syn II Core provided generally higher levels of activity (2-4 fold better) in corn cells than when the activator elements were placed upstream of the 35S core (Figure 6).
The Syn II Core sequence has been shown to enhance activity in stably transformed plants. Further with certain activator sequences upstream of the TATA element activity levels in stably transformed corn plants reached levels ten-fold greater than maize ubiquitin constructs which produces extremely high levels of activity.
Figures 7 and 8 show GUS activity levels from isolated tissues of VT stage TO plants and root tissue from T1 2.~ seedlings, respectively. These data demonstrate that this core sequence can participate in potentiating very high levels of activity as a functiorual partner for the active chimeric promoters. Figure 7 shows the Rsyn7::GUS (6086) activity in TO
maize plants. VT stage plants with ears post pollinated 3 to 8 days were dissected and assayed for GUS activity. 7A depicts GUS expression in designated tissues. 7B depicts a schematic of a corn plant with sites of measurement indicated. Plants from TO events that derr~onstrated a range of activities with the 62451-854(D) Rsyn7 promoter were ass<~yed. Log scale again noted. The activity range for UBI::GUS plants is indicated at the right of graph for comparisons. These data demonstrate that the Rsyn7 promoter can increase activity to ten-fold above levels of the ubiquitin promoter yet.;shows little tissue preference making the Rsyn7 suitable as a strong constitutive promoter.
Figure 8 depi~~ts GUS activity in root segments of a segregating population of maize Tl transgenic seedlings containing the Rsyn7::G!JS (6086) or the UBI::GUS (3953) construct. 1 cm root segments from six to seven-day old transgenic maize seedlings were dissected, weighed and assayed for GUS using GUS-light kit. Activity is represented as parts per million of fresh weight. The root activity of several T1 plants harboring the Rsyn7::GUS promoter shows higher activity 1.'i than much of the activity levels produced by the UBI promoter.
This is consistent with data from TO transgenic plant.
Activity levels in Rsyn7::GUS containing young leaves are also much higher than the activity levels of UBI::GUS-containing young leaves (data not shown). The Syn II Core sequence was shown to function well. with a variety of upstream elements including GAL 4 binding sites, Rsyn7 elements, GBL elements, etc.
Figure 9 shows GUS expression of three synthetic promoters in TO transgenic maize plants. Dissected tissues 2.'~ (See Figure '7B) from VT stage transgenic TO plants harboring Rsyn7 (Rsyn), Atsyn or the Syn II Core alone (syn-core) promoter::GUS constructs were quantitatively assayed for GUS
activity. Each circle represents an average of tissue activity of transgenic maize plants from a single transformation event.
The TGACG-motif corresponds to the Rsyn7 sequence and the "AT-com" motif refers to they consensus AT-like composite element, Atcom, from W. Gurley, et al. 1993. In: Control of Plant Gene Expression. ed. by Desh Pal Verma. CRC press, Boca Raton, FL.
62451-854(D) pp. 103-123. Syn-core refers to the Syn II Core promoter sequence containing the TATA element and the start of transcription.
Construction of Rsyn7 promoter having the upstream activating regions from the CaMV 35S gene and the maize Ubi-1 gen a .
To construct an expression vector having 35SU
(upstream activating regions from 35S gene)--Rsyn7 promoter, 1() PHP413 was digested with BglII and EcoRV. The staggered/sticky ends of the linearized vector were filled in by Klenow in the presence of dNTP. The 2x CaMV fragment was blunt end ligated into BamHl digested PHP6086 after filling the BamHl ends. The CaMV 35S-Rsyn7 fragment was then cut out from the new construct 15 by digestion with Xba1 and Pst l, and ligated into the 4 kb XbaI-Pstl vector from PHP9925 to form the expression vector of 35SU-Rsyn7::GUS (PHP9'778).
To construct. an expression vector having UbiU
(upstream activat=ing elements from the maize Ubi-I gene)-Rsyn7 20 promoter, the Xbal-Spel fragment from PHP82'77 was ligated into the Xbal site of PHP608F.~ to form PHP10539, into the Xbal site of PHP10970 to form PHP10971, and into the Xbal site of PHP10971 to form the expression vector of Ubi-1-Rsyn7::GUS
(PHP10972).
25 Those sequences not referenced otherwise include:
SEQ ID N0:11 sets forth the 35S UAR.
SEQ ID N0:12 sets forth the SCPl promoter sequence, (35S UAR operably linked to core promoter o.f SEQ ID N0:1).
SEQ ID N0:13 sets forth the Ubil UAR.
62451-854 (D) SEQ ID N0:14 sets forth SCP1 operably linked to the oxalate oxidase coding sequence operably linked with the PinII
terminator.
SEQ ID N0:16 sets forth the UCP2 promoter sequence (2 .') copies of Ub_i1 UAR operably with the core promoter).
SEQ ID N0:18 set forth the UCP4 promoter sequence (4 copies of Ubil UAR operably with the core promoter).
Transformation and Expression of promoter constructs.
11) The various promoters:: GUS fragments were cloned into a Bin9 binary vector that contains ALS3::NPTII as selection marker for generating transgenic sunflower callus or Arabidopsis.
For transient expression, SMF3 sunflower seeds were 1.'S planted in greenhouse. 15-day-old seeds after pollination were collected from the plants and used in the transient expression system. After removing the pericarp, the cotyledons with seed coats were sterilized by incubation in 20o bleach at RT for 15 minutes, and washed faun times with sterile double distilled 20 water. The cotyledons were then incubated on 3MM filter wetted with MS medium overnight, before they were bombarded according to the method diclosed t~y Klein et al. (1989) Proc. Natl. Acad.
Sci. USA 86:6681-6685. GUS activity was analyzed 20 hours after bombardment usirug the GUS-Light assay kit from Tropix 25 according to the manufacaurer's protocol.
For leaf disc transformation, young expanded SMF3 sunflower leaf from 30-day-old sunflower was harvested and sterilized in 20~ bleach with a couple of drops of Tween 20 for 20 minutes. Leaf discs were prepared from the sterile leaves 30 after washing them with sterile double distilled water 4 times.
62451-854(D) The leaf discs were then incubated for 10 minutes in inoculation medium (12.5 mM MES, 1. g/1 NH4C1, and C.3 g/1 MgS04) containing Agrobacteri.u:m (EHA105) transformed with the vector constructs to be tested at A600=0.75. The leaf discs were then .'i grown for 3 days in non-selection medium and were then transferred to selection medium.
To transform A.rabidopsis, Arabidopsis were grown in greenhouse to the stage when bolts start to emerge at 15 plants/pot. The emerging bolts were clipped off to encourage the growth of multiple secondary bolts. After 7 days, the plants were ready for infiltration. Agrobacterium (EHA105) carrying the construct to be tested was cultured at 28°C to when A600 was between 0.65 and 0.8. The cells were harvested in inoculation medium (4.3 g/1 of MS salt, 0.5 mg/1 of nicotinic acid, 0.5 mg/1 of pyridoxine-HCl, 1 mg/1 of Thiamine-HC1, 0.1 g of myo-inositol, 1 g/1 of casamino acids, 0.01 mg of BAP, 68.5 g/1 of sucrose, and 36 g/1 of glucose) at A600 of 7.5.
The clipped plants to be transformed were inverted into a 250 ml beaker c:on.taining the above Agrobacterium solution. The beaker was placed into a bell jar and was vacuumed until bubbles formed on leaf and stem surface. After 15 minutes of infiltration, the vacuum was released and the plants were removed from the beaker, laid on its side in a plastic flat, and covered with plastic wrap. The plants were 2~ set upright and grown in greenhouse for four weeks before seeds were harvested. Transgenic seeds were selected by planting the seeds on a medium plate containing 65 ~g/ml of kanamycin.
In transient expression assays, the Rsyn7 promoter in PHP10464 has 15% of the 35S promoter (PHP9925) activity, whereas the 35SU-Rsyn°7 promoter (PHP9778) (hereinafter SCPl promoter) has about 107°. of the 35S promoter activity (Figure 62451-854 (D) 10). Thus, the upstream activating region of the CaMV 35S gene increased the Rsyn7 a<:ti.vity by about 6 fold.
The maize Ubi-1 upstream element {UbiU) has similar effects on the Rsyn7 promoter in transient assays. When the .'~ upstream activating region (UAR) of the maize Ubi-1 was fused to Rsyn7, the GUS enzyme activity increased with the number of copy of the UAR. In the presence of three copies of UbiU, GUS
activity increased by about 4-fold. This additive effect of UbiU was not observed when placed in the context of the maize 1() ubiquitin promoter (PHP11974). This suggests that replacement of the maize Ubil core promoter with Rsyn7 may convert the monocot Ubil promoter into a highly active promoter in dicot plants (Figure 11).
In the stably transformed sunflower callus, GUS
1!~ expression is 20~ higher behind the control of the 35SU-Rsyn7 (SCP1 promoter) promoter than when behind the control of the 35S CaMV promoter (Figure 12).
The results of- the transgenic callus assay are given in Figure 13. The Rsyn7 promoter containing a single copy of 20 UbiU (PHP10991) (hereinafter UCPl promoter) (SEQ ID N0: 15) exhibited promoter activity of about 3 times that of the 35SU-Rsyn7 (SCP1) promoter (PHP10940). Three copies of UbiU
(PHP10993) increased Rsyn'7 promoter activity to about 7 times that of the 35SU-Rsyn'l promoter. The 3xUbiU-Rsyn7 (UCP3 25 promoter) (SEQ ID NO: 17) is by far the strongest promoter in sunflower tissues.
To determine the activi~y and tissue-specificity of the enhanced Rsyn7 promoters, stably-transformed sunflower and Arabidopsis were generated through Agrobacterium-mediated 30 transformation. The histochemical staining of GUS expression in transgenic T1 Arab_idopsis indicates that 35SU-Rsyn7 (SCP1 62451-854 (D) promoter) (PHP10940) has identical tissue-specificity and similar activity as 35S CaMV promoter (PHP10989). Both promoters express GUS in leaf, stem, petiole, and floral parts.
UbiU-Rsyn7 (USCP1 promoter) (PHP10991) exhibits higher activity ~i than maize Ubi-1 promoter (PHP11031) in Arabidopsis stem and leaf tissues.
All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains.
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.
SEQUENCE LISTING
<110> Wesley, Bruce <120> Synthetic Promoters <130> 62451-854D
<140> PCT/US99/03863 <141> 1999-02-23 <150> 08/661,601 <151> 1996-06-11 <160> 18 <170> PatentIn Ver. 2.0 <210> 1 <211> 72 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic core promoter sequence <400> 1 ggatccactc gagcggctat aaatacgt.ac ctacgcacgc tgcgctacca tcccgagcac 60 tgcagtgtcg ac 72 <210> 2 <211> 96 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: a synthetic upstream element that helps potentiate transcription <400> 2 ggatcctatg cgtatggtat gacgtgtc~tt caagatgatg acttcaaacc tacctatgac 60 gtatggtatg acgtgtgtcg actgatga.ct tagatc 96 <210> 3 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: A typical nucleotide base arrangement of a core promoter containing the consensus. sequences of TATA and INR
motifs present in plant promoters.
<220>
<221> unsure <222> (5) <223> The nucleotide at this position may be an a or t.
<220>
<221> unsure <222> (7) <223> The nucleotide at this xaosition may be an a or t.
<220>
<221> unsure <222> (10)..(11) <223> The nucleotides at these positions may be a t or c.
<220>
<221> unsure <222> (16) <223> The nucleotide at this position may be a or c.
<400> 3 tatawawaty ytcatmaa 18 <210> 4 <211> 40 <212> DNA
<213> Cauliflower mosaic virus <400> 4 cctctatata agcaagttca tttcatttgg agaggaaacg 40 <210> 5 <211> 5 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: OCS element <400> 5 tgacg 5 <210> 6 <211> 66 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 6 tcgacactgc agctctaggg atggtagcgc agggtgcgta ggtacgtatt tatagccgct 60 cgagtg 66 <210> 7 <211> 66 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 7 gatccactcg agcggctata aatacgta.cc tacgcaccct gcgctaccat ccctagagct 60 gcagtg 66 <210> 8 <211> 92 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 8 gatcctatga cgtatggtat gacgtgta~tt caagatgatg acttcaaacc tacctatgac 60 gtatggtatg acgtgtgtcg actgatga.ct to 92 <210> 9 <211> 92 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 9 gatctaagtc atcagtcgac acacgtcata ccatacgtca taggtaggtt tgaagtcatc 60 atcttgaaca cacgtcatac catacgtcat ag 92 <210> 10 <211> 72 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic nucleic acid core promoter with G/C transversions <220>
<223> The nucleotides at positions 26, 27, 30, 31, 34, 35, 36, 38, 39, 40, 42, 43, 44, 45, 48, and 49, may be g or c.
<400> 10 ggatccactc gagcggctat aaatasstas stasssasss tsssstassa tcccgagcac 60 tgcagtgtcg ac 72 <210> 11 <211> 332 <212> DNA
<213> Cauliflower mosaic virus <400> 11 cgtcaacatg gtggagcacg acactctcgt ctactccaag aatatcaaag atacagtctc 60 agaagaccaa agggctattg agacttttca acaaagggta atatcgggaa acctcctcgg 120 attccattgc ccagctatct gtcacttcat caaaaggaca gtagaaaagg aaggtggcac 180 ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt caagatgcct ctgccgacag 240 tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 300 cacgtcttca aagcaagtgg attgatgtga tg 332 <210> 12 <211> 499 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: 35S UAR of cauliflower mosaic virus operably linked to the core promoter sequence <400> 12 cgtcaacatg gtggagcacg acactctcgt ctactccaag aatatcaaag atacagtctc 60 agaagaccaa agggctattg agacttttca acaaagggta atatcgggaa acctcctcgg 120 attccattgc ccagctatct gtcacttcat caaaaggaca gtagaaaagg aaggtggcac 180 ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt. caagatgcct ctgccgacag 240 tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 300 cacgtcttca aagcaagtgg attgatgtga tgatcctatg cgtatggtat gacgtgtgtt 360 caagatgatg acttcaaacc tacctatgac gtatggtatg acgtgtgtcg actgatgact 420 tagatccact cgagcggcta taaatacgta cctacgcacc ctgcgctacc atccctagag 480 ctgcatgctt atttttaca 499 <210> 13 <211> 813 <212> DNA
<213> Zea mays <400> 13 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cct 813 <210> 14 <211> 1600 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: 35S UAR of cauliflower mosaic virus operably linked to the core promoter operably linked to the oxalate oxidase coding sequence operably linked to PinII
terminator.
<400> 14 gatctgagtc tagaaatccg tcaacatggt ggagcacgac actctcgtct actccaagaa 60 tatcaaagat acagtctcag aagaccaa.a.g ggctattgag acttttcaac aaagggtaat 120 atcgggaaac ctcctcggat tccattgrcc agctatctgt cacttcatca aaaggacagt 180 agaaaaggaa ggtggcacct acaaatgcca tcattgcgat aaaggaaagg ctatcgttca 240 agatgcctct gccgacagtg gtcccaaaga tggaccccca cccacgagga gcatcgtgga 300 aaaagaagac gttccaacca cgtcttcaaa gcaagtggat tgatgtgatg atcctatgcg 360 tatggtatga cgtgtgttca agatgatgac ttcaaaccta cctatgacgt atggtatgaa 420 cgtgtgtcga ctgatgactt agatccactc gagcggctat aaatacgtac ctacgcaccc 480 tgcgctacca tccctagagc tgcagcttat ttttacaaca attaccaaca acaacaaaca 540 acaaacaaca ttacaattac tatttacaat tacagtcgac ccgggatcca tggggtactc 600 caaaacccta gtagctggcc tgttcgcaat gctgttacta gctccggccg tcttggccac 660 cgacccagac cctctccagg acttctgtgt cgccgacctc gacggcaagg cggtctcggt 720 gaacgggcac acgtgcaagc ccatgtcgga ggccggcgac gacttcctct tctcgtccaa 780 gttggccaag gccggcaaca cgtccacccc gaacggctcc gccgtgacgg agctcgacgt 840 ggccgagtgg cccggtacca acaagctggg tggtgtcatg aaccgcgtgg attttggtcc 900 cggagggacc aacccaccac acatccaccc gcgtgccacc gagatcggca tcgtgatgaa 960 aggtgagctt ctcgtgggaa tccttggcag cctcgactcc gggaacaagc tctactcgag 1020 ggtggtgcgc gccggagaga cgttcctcat cccacggggc ctcatgcact tccagttcaa 1080 cgtcggtaag accgaggcct ccatggtcgt ctccttcaac agccagaacc ccggcattgt 1140 cttcgtgccc ctcacgctct tcggctccaa cccgcccatc~ ccaacgccgg tgctcaccaa 1200 ggcactccgg gtggaggcca gggtcgtgga acttctcaag tccaagtttg ccgctgggtt 1260 ttaatttcta ggatcctcta gagtcgaacc tagacttgtc catcttctgg attggccaac 1320 ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca ctataatgtg 1380 ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga gaaagagatc 1440 atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg atgaaccaga 1500 tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa ttaatatcaa 1560 ttgggttagc aaaacaaatc tagtctaggt gtgttttgcc 1600 <210> 15 <211> 994 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: Rsyn7 promoter containing 1 copy of the zea mays ubi-1 upstream element (UbiU) <400> 15 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagaa ctagtggatc ctatgcgtat 840 ggtatgacgt gtgttcaaga tgatgacttc aaacctacct atgacgtatg gtatgacgtg 900 tgtcgactga tgacttagat ccactcgagc ggctataaat acgtacctac gcaccctgcg 960 ctaccatccc tagagctgca tgcttatttt taca 994 <210> 16 <211> 1807 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: U~P2 promoter sequence containing 2 copies of the zea mays ubil UAR operably linked with the core promoter <400> 16 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct_. cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct: cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct at.tttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gaactagtgg atcctatgcg tatggtatga cgtgtgttca agatgatgac 1680 ttcaaaccta cctatgacgt atggtatgac gtgtgtcgac tgatgactta gatccactcg 1740 agcggctata aatacgtacc tacgcaccct gcgctaccat ccctagagct gcatgcttat 1800 ttttaca 1807 <210> 17 <211> 2620 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:Rsyn7 promoter containing 3 copies of zea mays UbiU upstream element <400> 17 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct attttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gagataatga gcattgcatg tctaagttat aaaaaattac cacatatttt 1680 ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt taaactttac 1740 tctacgaata atataatcta tagtactaca ataatatcag tgttttagag aatcatataa 1800 atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg actctacagt 1860 tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc acctatataa 1920 tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt ttttatagac 1980 taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga aaactaaaac 2040 tctattttag tttttttatt taataattta gatataaaat agaataaaat aaagtgacta 2100 aaaattaaac aaataccctt taagaaatta aaaaaactaa. ggaaacattt ttcttgtttc 2160 gagtagataa tgccagcctg ttaaacgccg tcgacgagtc taacggacac caaccagcga 2220 accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc 2280 tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa 2340 attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac 2400 ggcacggcag ctacggggga ttcctttccc accgctccta ctagaactag tggatcctat 2460 gcgtatggta tgacgtgtgt tcaagatgat gacttcaaac ctacctatga cgtatggtat 2520 gacgtgtgtc gactgatgac ttagatccac tcgagcggct ataaatacgt acctacgcac 2580 cctgcgctac catccctaga gctgcatgct tatttttaca 2620 <210> 18 <211> 3433 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: UCP4 promoter sequence containing 4 copies of ubil UAR operably linked with the core promoter.
<400> 18 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagat:ata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt. tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct attttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gagataatga gcattgcatg tctaagttat aaaaaattac cacatatttt 1680 ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt taaactttac 1740 tctacgaata atataatcta tagtactaca ataatatcag tgttttagag aatcatataa 1800 atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg actctacagt 1860 tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc acctatataa 1920 tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt ttttatagac 1980 taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga aa.actaaaac 2040 tctattttag tttttttatt taataattta gatataaaat agaataaaat aaagtgacta 2100 aaaattaaac aaataccctt taagaaatta aaaaaactaa ggaaacattt ttcttgtttc 2160 gagtagataa tgccagcctg ttaaacgccg tcgacgagtc taacggacac caaccagcga 2220 accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc 2280 tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa 2340 attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac 2400 ggcacggcag ctacggggga ttcctttccc accgctccta ctagagataa tgagcattgc 2460 atgtctaagt tataaaaaat taccacatat tttttttgtc acacttgttt gaagtgcagt 2520 ttatctatct ttatacatat atttaaactt tactctacga ataatataat ctatagtact 2580 acaataatat cagtgtttta gagaatcata taaatgaaca gttagacatg gtctaaagga 2640 caattgagta ttttgacaac aggactctac agttttatct ttttagtgtg catgtgttct 2700 cctttttttt tgcaaatagc ttcacctata taatacttca tccattttat tagtacatcc 2760 atttagggtt tagggttaat ggtttttata gactaatttt tttagtacat ctattttatt 2820 ctattttagc ctctaaatta agaaaactaa aactctattt tagttttttt atttaataat 2880 ttagatataa aatagaataa aataaagtga ctaaaaatta aacaaatacc ctttaagaaa 2940 ttaaaaaaac taaggaaaca tttttcttgt ttcgagtaga taatgccagc ctgttaaacg 3000 ccgtcgacga gtctaacgga caccaaccag cgaaccagca gcgtcgcgtc gggccaagcg 3060 aagcagacgg cacggcatct ctgtcgctgc ctctggaccc ctctcgagag ttccgctcca 3120 ccgttggact tgctccgctg tcggcatcca gaaattgcgt ggcggagcgg cagacgtgag 3180 ccggcacggc aggcggcctc ctcctcctct cacggcacgg cagctacggg ggattccttt 3240 cccaccgctc ctactagaac tagtggatcc tatgcgtatg gtatgacgtg tgttcaagat 3300 gatgacttca aacctaccta tgacgtatgg tatgacgtgt gtcgactgat gacttagatc 3360 cactcgagcg gctataaata cgtacctacg caccctgcgc taccatccct agagctgcat 3420 gcttattttt aca 3433
Figure 6 is a depiction of transient assay data using .'i the plasmids incorporating 'the promoter sequences of the invention.
Figure 7(A). Rsyn7::GUS (PHP6086) activity to TO
maize plants.
Figure 7(B) is a schematic of VT stage corn plants with sites of tissue samples indicated.
Figure 8 depicts GUS activity in root segments of a segregating population of maize T1 transgenic seedlings containing the Rsyn7::GUS (PHP6086) or the LJBI:GUS (PHP3953) construct.
1'~ Figures 9A and 9B depict GUS expression of three synthetic promoters in TO transgenic maize plants including the promoter sequences of the invention as comparison.
Figure 10 shows the comparison of the activities of the Rsyn7 promoter, the CaMV 35S promoter, and the 35SU-Rsyn7 promoter in transient expression in sunflower cotyledons.
Figure 11 shows the effect of the Ubi-1 upstream activating region on the strength of the Rsyn7 promoter in transient expression :in sunflower cotyledons.
Figure 12 shcws that in the stably transformed sunflower callus, GUS expression behind the control of the 35SU-Rsyn7 is CaMV 20'~ higher than when behind the control of the 35S promoter.
62451-854 (D) Figure 13 shows the effect of the Ubi-I upstream activating region on t:h~~ activity of the Rsyn7 promoter in transgenic sunflower callus assay.
DETAILED DESCRIPTION OF THE INVENTION
In the description that follows a number of terms are used extensively. The following definitions are provided in order to remove ambiguities in thE: intent or. scope of their usage in the specification and claims, and to facilitate understanding of the invention.
1() A structural gene is a DNA sequence that is transcribed into messenger RNA (mRNA) which is then translated into a sequence of amino acids characteristic of a specific polypeptide.
A promoter is a DNA sequence that directs the 15 transcription of a structural gene. Typically a promoter is located in the 5' region of a gene, proximal to the transcriptional start site of a structural gene. The promoter of the invention comprises at least a core promoter as defined below. Additionally, the promoter may also include at least 20 one upstream element. Such elements include UARs and optionally, other DNA sequences that affect transcription of a structural gene such as a synthetic upstream element.
A core promoter or minimal promoter contains the essential nucleotide sequences for expression of the operably 2.'~ linked coding sequence, including the TATA box and start of transcription. E3y this definition, a core promoter may or may not have detectable activity in the absence of specific sequences that may enhance the activity or confer tissue specific activity. For example, the maize SGB6 gene core 30 promoter consists of about 37 nucleotides 5' of the transcriptional start site of the SGB6 gene, while the g 62451-854(D) Cauliflower Mosaic Virus (CaMV) 35S core promoter consists of about 33 nucleotides 5' of the transcriptional start site of the 35S genome.
ADH refers generally to a plant expressible alcohol _'> dehydrogenase gene and specifically to the alcohol dehydrogenase gene from maize.
ADH 1 UAR refers to the DNA fragment spanning the region between nucleotide positions about -1094 to about -106 of the alcohol dehydrogenase gene 1 from maize, or homologous fragment than is functionally equivalent. The sequence is numbered with the start of transcription site designated as +1 according to the correction published by Ellis et al. (1987) supra.
"TATA to start" shall mean the sequence between the primary TATA motif and the start of transcription.
A synthetic DNA is an artificially created DNA
sequence that is not produced naturally, and must be introduced to an organism or to an ancestor of that organism to control or to be expressed.
OCS element refers to the TGACG motif identified from the octopine synthase gene, histone genes, enzyme genes for agropine biosynthesis, the mannopi.ne synthase gene, the CaMV
35S gene, histone H3 gene and nopaline synthase gene. As used herein the term includes any sequence capable of binding the ASF-1 factor as identi.fi.ed in U.S. Patent No. 4,990,607 by Katagiri.
UAR is typically a position or orientation dependent element that primarily directs tissue, cell type, or regulated expression.
62451-854(D) An enhancer is a DNA regulatory element that can increase efficiency of transcription regardless of the distance or orientation of the enhancer relative to the start site of transcription.
The term expression refers to biosynthesis of a gene product. In the case of a structural gene, expression involves transcription of the structural. gene into mRNA and then translation of the mRNA into one or more polypeptides.
A cloning vector is a DNA molecule such as a plasmid, cosmid or bacterial phage that has the capability of replicating autonomously in a host cell. Cloning vectors typically contain one or a small number of restriction endonuclease recognition sites at which foreign DNA sequences can be inserted in a determinable fashion without loss of 1.'~ essential biological function of the vector, as well as a marker gene that is suitable for use in the identification and selection of cells transformed with the cloning vector. Marker genes typically include genes that provide tetracycline resistance, hygromycin resistance or ampicillin resistance.
An expression vector is a DNA molecule comprising a gene that is expr_essec~ in a host cell. Typically gene expression is placed under the control of certain regulatory elements including promoters, tissue specific regulatory elements, and enhancers. Such a gene is said to be "operably 2.5 linked to" the regulatory elements.
A recombinant host may be any prokaryotic or eukaryotic cell that contains either a cloning vector or an expression vector. This; term also includes those prokaryotic or eukaryotic cells that: have been genetically engineered to contain the cloned genes in the chromosome or genome of the host cell.
62451-854(D) A transgenic plant is a ~>lant having one or more plant cells that contain an expression vector.
It will be un~.~erstood that there may be minor sequence variations within sequence or fragments used or .'i disclosed in this app7.ication. By ~~minor variations" is intended that the sequences have at least 800, preferably 90%
sequence identity. These variations may be determined by standard techniques to enable those of ordinary skill in the art to manipulate and bring into utility the functional units of the promoter elements necessary to direct initiation of transcription in the structural gene followed by a plant expressible transcription termination (and perhaps polyadenylation) signal.
Plant tissue includes differentiated and undifferentiated tissues or plants, including but not limited to roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of r_ells and culture such as single cells, protoplast, embryos, and callus tissue. The plant tissue may be in plants or i.n organ, tissue or cell culture.
One embodiment of the invention, the core promoter, is shown in SEQ ID N0:1. The core promoter is capable of driving expression of a coding sequence in a target cell, particularly plant cells. The core promoter finds use in driving expression of sequences which are only needed at 2.~ minimal levels in the target cells. Also disclosed is a novel upstream element, SEQ ID N0:2 that helps to potentiate transcription. 'The synthetic core promoter can be used with combinations of enhancer, upstream elements, and/or activating sequences from the 5'-flanking regions of plant expressible structural genes. Similarly the upstream element can be used in combination with various plant core promoter sequences. In one embodiment the core promoter and upstream element are used 62451-854(D) together to obtain ten-fold higher expression of an introduced marker gene in monocot transgenic plants than is obtained with the maize ubiquitin 1 promoter.
The core promoter comprises a TATA motif and a GC
.'i rich ~~TATA to start of transcription" region (64% or greater GC
content) that is generally characteristic of animal promoters.
The sequence is placed 5' of a structural gene and will promote constitutive expression which is non-tissue specific in transgenic plant ce.ll:.; .
The invention also comprises an expression cassette comprising (the upstream element) the synthetic core promoter, a structural gene, the expression of which is desired in plant cells, and a polyadenylation or stop signal. The expression cassette can be encompassed in plasmid or viral vectors for transformation of plant protoplast cells.
The invention also encompasses transformed bacterial cells for maintenance and replication of the vector, as well as transformed monocot or. dicot cells and ultimately transgenic plants.
In another embodiment, the invention encompasses an upstream element that can be used in combination with the synthetic promoter or with other known promoters in the art.
The upstream element comprises at least 3 OCS binding motifs (TGACG) with a novel intervening sequence. One embodiment is 2~ disclosed in SEQ ID Nc:):2 and is placed 5' to a core promoter sequence to enhance the transcription levels of the resulting gene product. Thus the invention comprises an expression cassette comprising the synthetic upstream element of the invention, 5' to a plant inducible promoter which is 5' to a structural gene. This expression cassette can be embodied in vectors and plasmids as earlier described.
62451-854 (D) In a preferred embodiment the synthetic upstream element is used in combination with the synthetic core promoter sequence to achieve non-tissue specific constitutive expression of the gene product which is a ten-fold enhancement of the .'i maize Ubi-1 promoter.
The present invention also encompasses a promoter construct comprising the synthetic core promoter described above and an upstream activating region. The upstream activating region is different from the synthetic upstream element. Preferably the upstream activating region is an upstream activating region (UAR) having substantial sequence similarity to the UAR of CaMV 35S or maize Ubi-I. Promoter constructs of the invention may comprise the synthetic core promoter in combination with at least one UAR and optionally at least one synthetic upstream element.
The promoter construct can be contained for convenience in an expression cassette. This expression cassette can be embodied in transformation vectors.
The sequence cf the upstream activating region (UAR) of the maize Ubi-1 gene is also provided. This UAR can be used in combination with any core promoter to enhance the activity of the promoter.
The promoter of the invention as seen in SEQ ID NO:l and/or SEQ ID NO:10 (modified core promoter), can be used to obtain high levels of expression o.f structural genes.
Similarly the upstream element of the invention (SEQ ID N0:2) can be used in combination with other promoters or the promoter of the invention to potentiate levels of transcription in genetically modified plants. Production of a genetically modified plant tissue expressing a structural gene under the control of the regulatory elements of the invention combines 62451-854(D) teachings of the present disclosure with a variety of techniques and expedients known in the art. In most instances alternate expedients exist for each stage of the overall process. The choice of expedients depends on the variables '.i such as the plasmid vector system chosen for the cloning and introduction of the recombinant DNA molecule, the plant species to be modified, the particular structural gene, promoter elements and upstream elements used. Persons skilled in the art are able to select and use appropriate alternatives to achieve functionality. Culture conditions for expressing desired structural genes and cultured cells are known in the art. Also as known in the art, a number of both monocotyledonous and dicotyledonous plant species are transformable and regenerable such that whole plants containing and expressing desired genes under regulatory control of the promoter molecules and upstream elements of the invention may be obtained. As is known to those of skill in the art, expression in transformed plants may be tissue specific and/or specific to certain developmental stages or environmental influences. Truncated promoter selection and structural gene selection are other parameters which may be optimized to achieve desired plant expression as is known to those of skill in the art and taught herein.
The nuc:leoti.de sequences of the invention can be 2.'~ introduced into any p_1_ant. The genes to be introduced can be conveniently used in expression cassettes for introduction and expression in any plant of interest.
Such expression cassettes will comprise the transcriptional initiation region of the invention linked to a 3) nucleotide sequence oa i.nterest. Such an expression cassette is provided with a plurality of restriction sites for insertion of the gene of interest to be under the transcriptional 62451-854(D) regulation of the regulatory regions. The expression cassette may additionally contain selectable marker genes.
The transcri.ptional cassette will include in the 5'-3' direction of transcription, a transcriptional and translational initiation region, a DNA sequence of interest, and a transcriptional and translational termination region functional in plants. The termination region may be native with the transcri_ptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau et al., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon eat al. (1991) Genes Dev. 5:141-149; Mogen et a1 (1990) Plarnt Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639.
The genes of. the invention are provided in expression cassettes for expression in the plant of interest. The cassette will include 5' and 3' regulatory sequences operably linked to the gene of interest. i'he cassette may additionally contain at least one additional gene to be cotransformed into the organism. Alternatively, the additional genes) can be provided on another expression cassette. Where appropriate, 2~ the genes) may be op~imized for increased expression in the transformed plant. That is, the genes can be synthesized using plant preferred codons for improved expression. Methods are available in the art for synthesizing plant preferred genes.
See, for example, U.S. Patent Nos. 5,380,831, 5,436,391 and Murray et al. (1989) Nucleic Acids Res. 17:477-498.
Additional ,sequence modifications are known to enhance gene expression in a cellular host. These include 62451-854(D) elimination of sequences encoding spurious polyadenylation signals, exonintron splice site signals, transposon-like repeats, and other such well-characterized sequences which may be deleterious to gene expression. The G-C content of the .'i sequence may be adjusted to levels average f.or a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
The selection of an appropriate expression vector 1() will depend upon the method of introducing the expression vector into host cells. Typically an expression vector contains (1) prokaryotic DNA elements coding for a bacterial replication origin and an antibiotic resistance gene to provide for the amplification and selection of the expression vector in 15 a bacterial host; (2) DNA elements that control initiation of transcription such as a promoter; (3) DNA elements that control the processing of transcripts such as introns, transcription termination/polyadenylation sequence; and (4) a reported gene that is operatively linked to the DNA elements to control 20 transcription in3_tiation. Useful reported genes include (3-glucuronidase, ~i-galactosidase, chloramphenicol acetyl transferase, luciferase, green fluorescent protein (GFP) and the like. Preferably the reported gene is either (3-glucuronidase (GUS), GFf or luciferase. The general 2~ descriptions of plant expression vectors and reporter genes can be found in Gruber, et~. a.l., "Vectors for Plant Transformation, in Methods in Plant Molecular Biology & Biotechnology" in Glich et al., (Eds. pp. 89-119, CRC Press, 1993). Moreover GUS
expression vectors and C;US gene cassettes are available from 3~ Clonetech Laboratories, Inc.., Palo Alto, California while luciferase expression vectors and luciferase gene cassettes are available from Promega C:orp. (Madison, Wisconsin).
62451-854(D) Expression vectors containing genomic or synthetic fragments can be introduced into protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced into intact. tissue. General methods of culturing plant tissues are provided for example by Maki et al.
"Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular Biology & Biotechnology, Glich et al. (Eds.
pp. 67-88 CRC Press, x.993); and by Phillips et al. "Cell-Tissue Culture and In-Vitro Manipulation" in Corn & Corn Improvement, 3rd Edit~.on Sprague et al. (Eds. pp. 345-387) American Society of Agronomy Inc. et al. 1988.
Methods of introducing expression vectors into plant tissue include the direct infection or co-cultivation of plant cell with Agrobacterium tumefaciens, Horsch et al., Science, 227:1229 (1985). Desc:ripti.ons of Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer provided by Gruber, et al. supra.
Preferably, expression vectors are introduced into maize or other plant tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA
injection, e.lectroporation and the like. More preferably expression vectors are introduced into plant tissues using the microprojectile media delivery with the biolistic device. See, for example, Tomes et al.. "Direct DNA transfer into intact plant cells via rnicroprojectile bombardment" In: Gamborg and Phillips (Eds.) Plant Cell, Tissue and Organ Culture:
Fundamental Methods, :~pringer-Verlag, Berlin (1995).
The vectors ef~ the invention can not only be used for expression of structural. genes but may also be used in exon trap cloning, or promoter trap procedures to detect differential gene expression in varieties of tissues, K.
Lindsey et al., 1993 "'fagging Genomic Sequences That Direct 62451-854(D) Transgene Expression by Activation of a Promoter Trap in Plants", Transgenic Research 2:33-47. D. Auch & Reth, et al., "Exon Trap Cloning: Using PCR to Rapidly Detect and Clone Exons from Genomic DNA Fragments", Nucleic Acids Research, Vol. 18, No. 22, p. 6743.
This inventive promoter is based in part on the discovery that a GC rich "TATA to start" region i.n a plant promoter acts as a very strong nontissue specific core promoter inducing constitutive expression in plant cells. The TATA
element of plant promoters of polII genes generally have the sequence TATA(A/T)A(AiT)A, SEQ ID N0:3, whereas the consensus of the start of transcription consists of the sequence 5'...TYYTCAT(A/C)AA...3'. SEQ ID N0:3, where the A designates the starting base for transcription. The typical plant promoter sequence is depicted in Figure 1.
Sequences intervening the TATA element and the start of transcription have been shown to play a significant role in transcriptional activation efficiency. The TATA binding protein has been shown to interact with the minor groove of the double helix binding too the TATA motif bending it towards the major groove side (Kim, et al. 1993, Nature, 365:512-520). It thus follows that sequences downstream of the TATA motif that impact this finding will affect the efficiency of stable transcriptional complex formation and ultimately expression.
Surveys of the "TATA t.o start" regions of plant promoters show a significantly higher level of AT-rich sequences leading to the potential of minor groove compression (Yaurawj et al Biological Abstracts Vol. 47, Issue 8, Ref. 144712, "Consensus Sequences for Plant Minimal Promoters" Annual Meeting of the American Society of Plant Physiologists, July 29-August 2, 1995, Plant Physiology 108 [2 Supp.] 1995, 114). Generally animal promoters show a GC-rich "TATA to start" sequence that leads to a major groove compression suggesting that average 62451-854(D) plant and animal core promoter transcriptional complexes recognize and interact with a somewhat different TATA to start structure with the corresponding sequence difference. Quite surprisingly the applicant has found that a GC-rich animal type .'~ synthetic promoter works very well in plants.
While the invention is not bound by any theory, it is possible that the AT-rich TATA motif present in a GC-rich sequence may "present itself" more prominently to the TATA-binding complex by a sharp demarcation of the TATA motif that would interact more tightly with the TATA-binding complex.
This would improve the start of transcription efficiency, by shifting the equilibria of binding to a more stabilized form, whereas the "non-bounded TATA" version, i.e. having a higher level of AT-rich sequences flanking the TATA motif, the TATA-1_'i binding complex would potentially slide or stutter 5' or 3' to the start site and effectively reduce the efficiency of binding ultimately reducing tz:anscription. Little data regarding this region of plant promoters is available except crude deletions and some point mutations. The obvious design of a synthetic 2C) core promoter for plant expression would include the AT-rich "TATA to start" sequence based on surveys of known plant promoters. However, based on the "bounded" mechanism, it is postulated by the mechanism of the invention that a more efficient core promoter is a result of a TATA motif imbedded in 2'i a GC-rich sequence.
Figure 2 depi~~ts the Syn II Core promoter sequence, SEQ ID N0:1 of the inve~:ztion with examples of plant core promoters aligned to the mayor start of transcription. Another example of a plant promoter 35S of CaMV (SEQ ID N0:4) are shown 30 with percent GC-rich sequences shown at the right in parentheses. The Syn II Core sequence does not show any significant sequence homology to sequences in the public sequence databases.
62451-854(D) The synthetic Syn II Core promoter sequence shows a 64o GC-rich "TATA to start:" sequence different from the overall 40o GC-rich sequence present in traditional plant promoters (CaMV35S for example). The naturally occurring and isolated .'~ UBI core promoter which potentiates very high levels of activity in monocots usually shows a 64o GC-rich "TATA to start" sequence more similar to animal promoters. Such examples provided the impetus to design a high GC-rich "TATA to start" sequence for efficient transcription in opposition to the current dogma of plant core promoters.
Thus the invention comprises a synthetic plant core promoter sequence comprising a TATA motif and a "TATA to start"
region that is 64o GC-rich or greater. In a preferred embodiment, the promoter may include restriction endonuclease 1'i target sites for ease of cloning. In the most preferred embodiment, the sequence is that of SEQ ID N0:1. As will be appreciated by those of skill in the art, several base transversions within SEQ ID N0:1 may occur which will maintain the percent GC-content and are intended within the scope of 2C) this invention. For example guanines could be replaced with cytosines and vice-versa without affecting the overall efficacy of the promoter, so lon~~ as the percent GC-content is maintained.
In another embodiment, the invention comprises a 25 synthetic upstream element positioned 5' to any naturally occurring or synthetic promoter for use in plants, particularly maize gene expression.
From the activity of numerous promoters, basic elements (binding sites) have been defined. These include for 30 example AT-rich regions from heat shock promoters, and ASF-1 binding site (AS-1) elements present in octopine synthase (OCS) and Cauliflower Mosaic Virus promoters. AS-1 is one of the 62451-854 (D) better known upstream elements and its binding sequence (OCS
element) is present in many consts.t=utive plant promoters such as the CaMV35S, A. tumefaciens, NGS and OCS wheat histone promoters. The OCS element was first isolated as an enhancer .'~ element in the promoter of the OCS gene where it was identified as a 16-base pair pali.ndromic sequence (Ellis et al. (1987) EMBO J. 6:11-16), but has been reduced to its essential features as a TGACG motif. See U.S. Patent No. 4,990,607. The upstream element of the invention has a 71% identity to the promoter enhances element disclosed in U.S. Patent 5,023,179 to Lam et al. The two sequences are quite different in their flanking sequences surrounding the TGACG motif, which regions have been shown to impact the level of transcription enhancement. The transcriptional enhancing activity of the OCS
1.'i element correlates with the in-vitro binding of a transcriptional factor'. Similar elements were also identified in the promoter regions of six other cDNA genes involved in opine synthesis and throve plant viral promoters including the CaMV 35S promoter (Bouchez, et al 1989) supra. These elements were shown to bind the ~DCS transcription factor in-vitro and enhance transcription in plant cells.
In tobacco a DNA binding factor, TGA1, was shown to interact specifically with the AS-1 element either alone or in conjunction with other ,~oromoter elements. (Katagiri et al, 2'i 1989, Nature 340:727-730). This factor was also shown to be expressed in a root-preferred manner in tobacco plants. Core promoters with one or two copies of the OCS upstream element tend to potentiate gene expression whereas 4 or more repeats of this element produce more or less constitutive activity albeit low relative to intact: 35S promoters.
Thus the invention incorporates a synthetic upstream element which can be used with the core promoter of the invention or other core promoters to enhance gene expression.
62451-854 (D) The element incorporates t=hree OCS-like motifs and novel intervening sequences which enhance gene expression.
Figure 3, SEQ ID N0:2 shows the complete sequence of one embodiment (Rsyn7) of the synthetic upstream element which .'~ incorporates at least three TGACG SEQ ID N0:5 OSC-like motifs which are indicated in bold.
Sequences flanking many elements such as the TGACG
SEQ ID N0:5 motif have been shown to have profound impacts on binding affinities of DNA binding factors and thus play as an important role as the central motifs themselves. (Burrows et al. 1992, Plant Molecular Biology 19:665-675, Shinder et al.
1992, Plant Cell 4:1309-1319, Foster et al. 1994, FASEBJ 8:192-200). The novel sequences flanking the TGACG motifs in the Rsyn7 promoter have been determined and established clear 1.'i enhancement of transcri:ptional activity with various promoters, particularly when used with the Syn II Core promoter.
Rsyn7 upstrea;n element has been cloned upstream of the Syn II Core promoter driving a GUS construct and has yielded levels of GUS activity in transgenic maize plants approximately ten-fold higher than the ubiquitin promoter, the strongest maize promoter to date.
In yet another aspect of the present invention, at least one upstream activating region (UAR), which is different from the synthetic upstream element, is operably linked to the 2.'i synthetic core promoter. The UAR may be used alone or in combination with the synthetic upstream element described herein. Preferably, th~~ upstream activating regions of the cauliflower mosaic virus (CaMV) 35S promoter and the maize Ubi-1 gene promoter are utilized. Additionally, sequences having sequence similarity to these UARs may be utilized as long as such sequences retain the ability to enhance promoter 62451-854 (D) activity. Enhancements can be measured by assaying for levels of transcripts or alternatively protein production.
CaMV 35S UARs have been well studied in the art. The complete nucleotide sequence of the CaMV circular double-strand .'i DNA has been established in the art.. See Guilley et al. (1980) Cell 21:285-294. The 35S promoter transcribes the major 35S
RNA transcript from the circular viral genome by nucleus RNA
polymerase II. See Guilley et al. (1982) Cell 30:763-773.
Moreover, the 35S UARs can function with a heterologous promoter and increase expression of a gene of interest in cells and transgenic plants. Shah et al. (1986) Science 233:478-481.
Multiple cis regulatory elements for the activity of the CaMV
35S promoter have been identified. See Odell et al. (1985) Nature 313:810-812; Fang et al. (1989) Plant Cell 1:141-150.
1.'i In the present invention, a large fragment of the upstream activating regions (UARs) of the CaMV 35S promoter can be utilized to enhance the activity of the core synthetic promoter. The size of the UAR can, for example, range from about 15 base pairs to about 850 base pairs, preferably from about 20 to about 500, more preferably from about 20-25 to about 50-200 base pairs. A preferred region of the 35S CaMV
upstream region includes sequences from about -421 to about -90. It is recognized that modifications, i.n length and nucleotide sequence can be made to the region and still result 2_'> in enhanced activity of the core synthetic promoter. Such modifications can be tested for effect on activity by using expression systems as set forth in the Experimental Section of the present application. The numbers on the UAR sequence diagram indicate the position upstream from the transcription 3C) start site, or +1 position of the 35S structural gene. For example, -25 means a po~s:ition 25 base pairs upstream from the transcription start site= of the 35S structural gene.
62451-854(D) The upstream activating region of the maize ubiquitin gene Ubi-1 can also be utilized in the invention. The sequence of the Ubi-1 gene transcription regulatory region is disclosed in US Patent No. 5,510,474. See also Christensen et al. (1992) Plant Mol. Biol. 18:675-689; Cornejo et a1. (1993) Plant Mol.
Biol. 23:567-581; Takimoto et al. (1994) Plant Mol. Biol.
26:1007-1012; and Chri.stensen et al. (1996) Transgenic Res.
5:213-218. The UAR of the Ubi-1 gene promoter comprises preferably from about -867 to about -54. As indicated above for the 35S UAR, modifications of the Ubi-1 UAR that still function to enhance the activity of the core promoter are encompassed.
While the full sequence of the ubiquitin promoter has been published, this is the first disclosure of the Ubi UAR.
1.'i Thus, the invention discloses the UAR of the ubiquitin promoter as well as the Ubi UAR in combination with any promoter.
Additionally, methods for using the Ubi UAR to enhance activity of promoters are encompassed.
The upstream activating regions as described herein can be linked with the synthetic core promoter and/or other upstream elements by any conventional method that is generally known in the art as long as an operative element or promoter is constructed. The upstream activating regions are generally operably linked to the 5' end of the core promoter. When the 2'i synthetic upstream element is also present, the upstream activating regions can be linked to either the 5' end of the synthetic upstream element, the 3' end of the core synthetic promoter or inserted between the synthetic upstream element and the synthetic core promoter. In a preferred embodiment the 3C) upstream activating regions are linked in close proximity to the synthetic upstream element, if present, and the synthetic core promoter. By close proximity is intended within from about 1 to about 50 nucleotides. However, it is recognized 62451-854 (D) that more than 50 nucleotides may separate the elements. The upstream activating regions can be in the 5' to 3' direction or the 3' to 5' direction, but preferably in the 5' to 3' direction at the 5' end of the synthetic core promoter or the .'~ synthetic upstream element.
One or multiple copies of the upstream activating regions can be used. When multiple copies are utilized, they can be tandem repeats of one UAR or combinations of several UARs. In this manner, the level of expression of a nucleotide 1c) sequence of interest r_an be controlled by the number of UARs present in the promoter construction since the results indicate that increased expression levels are obtained with increased numbers of UARs. Thus, the invention provides methods for regulating levels of Expression of a gene or nucleotide l.'~ sequence of interest .
As indicated, multiple copies of a UAR can be used to enhance the activity of the operably linked promoter. As noted, multiple copies of the same or different UARs can be utilized. For example, any combination of CaMV 35S UARs and 20 maize Ubi-1 gene UARs can be utilized.
The promoters of this invention having one or more UARs as described above can be provided in expression cassettes and such cassettes contained in p1_asmid or viral vectors. Such vectors can be used for transformation of bacteria and plant 25 cells. Transgenic plants can be ultimately regenerated from such transformed plant ce:Lls.
The UARs incorporated into plant promoters can substantially enhance transcription activity in transgenic plants. For example, ene or more copies of the upstream 30 activating region of the maize Uby-1 gene can be operably linked to a promoter having the core synthetic promoter 62451-854(D) sequence and the synthetic upstream element of the invention.
The promoter constructs of the invention can be operably linked to any nucleotide sequence or gene of interest. The promoter construct, for example, can be used to enhance oxalate oxidase gene expression i.n transgenic plants. Oxalate oxidase is a plant enzyme implicated in plant defense mechanisms against a pathogens attack. The enzyme degrades the chemical compound oxalic acid secreted by plant pathogens. See e.g., PCT
Publication No. WO 92/14824. Increasing the oxalate oxidase 1() level in plants such as sunflower will lead to increased plant resistance to plant pathogens.
The following examples are for illustration purposes only and are intended in no way to limit the scope or application of the present invention. Those of skill in the 1.'i art will appreciate that many permutations can be achieved and are in fact intended to be within the scope of the invention.
Plasmids were designed using the multiple cloning site of pBlueScri.ptIIKS+* from Stratagene. (To facilitate 2C) cloning of the different combination of elements).
Oligonucleotides containing the sequences of the elements were synthesized with restriction endonuclease sites at the ends.
Thus elements could be added or removed and replaced as needed.
GUS and Luciferase were used for reporter genes.
2-'i For transient assays, plasmid DNA was introduced into intact 3-day-old maize .seedlings by particle bombardment.
Following 16 hours incubation at 25EC in the dark, expression was assayed by measuring GUS enzyme activity in root and shoot extracts from each seedling to determine if any tissue-3C1 preferred expression wa;s demonstrated. GUS activity was *Trade-mark 62451-854(D) measured using a GUS-Light assay kit from Tropix (47 Wiggins Avenue, Bedford, MA 01730).
Constructs that gave high levels of expression were introduced into a cell line to produce stable transformants.
.'i These stable transformants (TO) were assayed by PCR to determine the presence of the GUS gene by MUG (4-methylumelliferyl.-glucuronide) assay to quantify the activity level of the GUS protein being produced. When the plants were ready to be transferred to the greenhouse they were assayed histochemically with X-gluc* to determine where the GUS product was being synthesized. Plants demonstrating preferred expression levels were grown in the greenhouse to V6 stage.
Construction of Plasmids Containing the Syn II Core 1'. Promoter.
Standard molecular biological techniques were carried out according to Maniantis et al. (1982) Molecular Cloning: A
Laboratory Manual, Cold .Spring Harbor Laboratory, Cold Spring Harbor, N.Y. All plasm:ids utilized in the invention can be prepared according to the directions of the specification by a person of ordinary skil=L in the art without undue experimentation employing materials readily available in the art.
Oligos N306 SEQ ID N0:6 5'-TCGACACTGC AGCTCTAGGG
ATGGTAGCGC AGGGTGCGTA GC~'rACGTATT TATAGCCGCT CGAGTG-3' and N307 SEQ ID N0:7 5'-GATCCACTC:G AGCGGCTATA AATACGTACC TACGCACCCT
GCGCTACCAT CCTAGAGCT GCAGTG-3' were synthesized according to directions on an automated DNA synthesizer (such as Applied Biosystems Inc. DNA Synthesizer (Model 380B). These automated synthesizers are commercially available. The oligos were then *Trade-mark 62451-854(D) ligated to the BamHI fragment of the pBlueScriptIIKS+ plasmid comprising of the (3-gl.ucuronidase gene interrupted by the maize ADhI intron 1 region. .A map of a plasmid incorporating both the Syn II Core promoter and the upstream e1_ement is disclosed :i as Figure 4. Several. other embodiments are shown in other plasmids depicted in figure 5. Plasmid numbers are shown to the right of each promoter diagram with the corresponding legend placed below the diagrams. The top diagram shows the complete transplant transcriptional unit with the subsequent diagrams focusing on the salient differences between 35S and Syn II Core promoters. The legend shows the number and nature of the various promoter subelements, the sequence if relatively short, the source of the element and position relative to the start of transcription.
The sequence of the core promoter consists of 35 base pairs with enzyme sites upstream of a TATA box and a start of transcription with 10 to 15 base pairs downstream. Upstream elements (Gal 4-binding sites, Rsyn, AT-GBL etc.) were fused to the core sequence with ADhI-intron and different marker genes (LUC or GUS) and were demonstrated functional both in transient assays (Figure 6) and Rsyn stably transformed plants (Figure 7) .
Construction of Upstream Element Rsyn7 Fused to Syn 2.5 II Core Promoter Resulting in Plasmids PHP5903 and PHP6086.
Oligos for constructing the Rsyn7 promoter subelement N1965: (SEQ ID N0:8) GA.TCCTATGA CGTATGGTAT GACGTGTGTT
CAAGATGATG ACTTCAAACC TF~CCTATGAC G'rATGGTATG ACGTGTGTCG
ACTGATGACT TA-3' and N1966: (SEQ :ID N0:9) GATCTAAGTC ATCAGTCGAC
ACACGTCATA CCATACGTCA fAGGTAGGTT TGAAGTCATC ATCTTGAACA
CACGTCATAC CATACGTCA TP,CJ-3' were synthesized as earlier 62451-854(D) described. The oligos were annealed and cloned into a PHP3398 plasmid upstream of the Syn II Core sequence and resulted in several versions of the original Rsyn7 sequence due to spontaneous deletions. The Rsyn7-2 version involved a single _'i base deletion resulting in a 3X reiterative TGACG motif upstream of the Syn I7: ;:ore promoter (Rsyn7::LUC, P5903). The LUC coding sequence was replaced by GUS coding sequence to produce the Rsyn7::GUS construct P6086. P6086 was later introduced into transgenic maize resulting in high levels of constitutive activity in four of the six active events examined (Figure 7).
The progeny from TO plants from several transformation events were examined and GUS activity ranging from 1 to 400 PPM (micrograms GUS enzyme/GFW in root tissue of 7-day old seedlings) Figure 8. These TO and Tl plants generally produced 4X-lOX greated GUS activity than plants harboring the ubiquiti.n::GUS report=er gene.
Thus from the foregoing, it can be seen that the invention accomplishes at least all of its objectives.
21) EXAMPLE 4 Transformation. and Expression with Syn II Core Promoter and/or Rsyn7 Upstream Element.
Using transient bombardment assays the Syn II Core promoter sequence was compared against the 35S core sequence 2.~ either alone or in conjunction with numerous activation elements. Figure 6 is a depiction of transient assay data using the plasmids incorporating the promoter sequences of the invention and shows transient GUS or LUC activity in three-day old maize roots or BMS callus bombarded with chimeric 30 promoter::GUS or LUC constructs. The -33 CaMV35S in the Syn II
Core promoter versions of the synthetic promoter:: GUS (or LUC) 62451-854(D) constructs were bombarded into three-day old roots (or cultured BMS calli as described hereinafter) and assayed for enzyme activity 20 hours after bombardment=s. The data shown are the raw enzyme units of a compilation of at least three experiments and have not been normalized in any fashion due to the inherent variability of the transient assays. Control plasmids 1654 and 3537 are the LUC constructs tested in maize BMS calli. There is approximately 4 to 20 fold difference in transient activity between the 35S and Syn II Core versions. The Y axis is in log scale. Both core promoters were driving a GUS containing construct (Figures 4 and 5) and generated a basal level of activity (Figure 6). However when activator elements were placed upstream of the TATA motif, the Syn II Core provided generally higher levels of activity (2-4 fold better) in corn cells than when the activator elements were placed upstream of the 35S core (Figure 6).
The Syn II Core sequence has been shown to enhance activity in stably transformed plants. Further with certain activator sequences upstream of the TATA element activity levels in stably transformed corn plants reached levels ten-fold greater than maize ubiquitin constructs which produces extremely high levels of activity.
Figures 7 and 8 show GUS activity levels from isolated tissues of VT stage TO plants and root tissue from T1 2.~ seedlings, respectively. These data demonstrate that this core sequence can participate in potentiating very high levels of activity as a functiorual partner for the active chimeric promoters. Figure 7 shows the Rsyn7::GUS (6086) activity in TO
maize plants. VT stage plants with ears post pollinated 3 to 8 days were dissected and assayed for GUS activity. 7A depicts GUS expression in designated tissues. 7B depicts a schematic of a corn plant with sites of measurement indicated. Plants from TO events that derr~onstrated a range of activities with the 62451-854(D) Rsyn7 promoter were ass<~yed. Log scale again noted. The activity range for UBI::GUS plants is indicated at the right of graph for comparisons. These data demonstrate that the Rsyn7 promoter can increase activity to ten-fold above levels of the ubiquitin promoter yet.;shows little tissue preference making the Rsyn7 suitable as a strong constitutive promoter.
Figure 8 depi~~ts GUS activity in root segments of a segregating population of maize Tl transgenic seedlings containing the Rsyn7::G!JS (6086) or the UBI::GUS (3953) construct. 1 cm root segments from six to seven-day old transgenic maize seedlings were dissected, weighed and assayed for GUS using GUS-light kit. Activity is represented as parts per million of fresh weight. The root activity of several T1 plants harboring the Rsyn7::GUS promoter shows higher activity 1.'i than much of the activity levels produced by the UBI promoter.
This is consistent with data from TO transgenic plant.
Activity levels in Rsyn7::GUS containing young leaves are also much higher than the activity levels of UBI::GUS-containing young leaves (data not shown). The Syn II Core sequence was shown to function well. with a variety of upstream elements including GAL 4 binding sites, Rsyn7 elements, GBL elements, etc.
Figure 9 shows GUS expression of three synthetic promoters in TO transgenic maize plants. Dissected tissues 2.'~ (See Figure '7B) from VT stage transgenic TO plants harboring Rsyn7 (Rsyn), Atsyn or the Syn II Core alone (syn-core) promoter::GUS constructs were quantitatively assayed for GUS
activity. Each circle represents an average of tissue activity of transgenic maize plants from a single transformation event.
The TGACG-motif corresponds to the Rsyn7 sequence and the "AT-com" motif refers to they consensus AT-like composite element, Atcom, from W. Gurley, et al. 1993. In: Control of Plant Gene Expression. ed. by Desh Pal Verma. CRC press, Boca Raton, FL.
62451-854(D) pp. 103-123. Syn-core refers to the Syn II Core promoter sequence containing the TATA element and the start of transcription.
Construction of Rsyn7 promoter having the upstream activating regions from the CaMV 35S gene and the maize Ubi-1 gen a .
To construct an expression vector having 35SU
(upstream activating regions from 35S gene)--Rsyn7 promoter, 1() PHP413 was digested with BglII and EcoRV. The staggered/sticky ends of the linearized vector were filled in by Klenow in the presence of dNTP. The 2x CaMV fragment was blunt end ligated into BamHl digested PHP6086 after filling the BamHl ends. The CaMV 35S-Rsyn7 fragment was then cut out from the new construct 15 by digestion with Xba1 and Pst l, and ligated into the 4 kb XbaI-Pstl vector from PHP9925 to form the expression vector of 35SU-Rsyn7::GUS (PHP9'778).
To construct. an expression vector having UbiU
(upstream activat=ing elements from the maize Ubi-I gene)-Rsyn7 20 promoter, the Xbal-Spel fragment from PHP82'77 was ligated into the Xbal site of PHP608F.~ to form PHP10539, into the Xbal site of PHP10970 to form PHP10971, and into the Xbal site of PHP10971 to form the expression vector of Ubi-1-Rsyn7::GUS
(PHP10972).
25 Those sequences not referenced otherwise include:
SEQ ID N0:11 sets forth the 35S UAR.
SEQ ID N0:12 sets forth the SCPl promoter sequence, (35S UAR operably linked to core promoter o.f SEQ ID N0:1).
SEQ ID N0:13 sets forth the Ubil UAR.
62451-854 (D) SEQ ID N0:14 sets forth SCP1 operably linked to the oxalate oxidase coding sequence operably linked with the PinII
terminator.
SEQ ID N0:16 sets forth the UCP2 promoter sequence (2 .') copies of Ub_i1 UAR operably with the core promoter).
SEQ ID N0:18 set forth the UCP4 promoter sequence (4 copies of Ubil UAR operably with the core promoter).
Transformation and Expression of promoter constructs.
11) The various promoters:: GUS fragments were cloned into a Bin9 binary vector that contains ALS3::NPTII as selection marker for generating transgenic sunflower callus or Arabidopsis.
For transient expression, SMF3 sunflower seeds were 1.'S planted in greenhouse. 15-day-old seeds after pollination were collected from the plants and used in the transient expression system. After removing the pericarp, the cotyledons with seed coats were sterilized by incubation in 20o bleach at RT for 15 minutes, and washed faun times with sterile double distilled 20 water. The cotyledons were then incubated on 3MM filter wetted with MS medium overnight, before they were bombarded according to the method diclosed t~y Klein et al. (1989) Proc. Natl. Acad.
Sci. USA 86:6681-6685. GUS activity was analyzed 20 hours after bombardment usirug the GUS-Light assay kit from Tropix 25 according to the manufacaurer's protocol.
For leaf disc transformation, young expanded SMF3 sunflower leaf from 30-day-old sunflower was harvested and sterilized in 20~ bleach with a couple of drops of Tween 20 for 20 minutes. Leaf discs were prepared from the sterile leaves 30 after washing them with sterile double distilled water 4 times.
62451-854(D) The leaf discs were then incubated for 10 minutes in inoculation medium (12.5 mM MES, 1. g/1 NH4C1, and C.3 g/1 MgS04) containing Agrobacteri.u:m (EHA105) transformed with the vector constructs to be tested at A600=0.75. The leaf discs were then .'i grown for 3 days in non-selection medium and were then transferred to selection medium.
To transform A.rabidopsis, Arabidopsis were grown in greenhouse to the stage when bolts start to emerge at 15 plants/pot. The emerging bolts were clipped off to encourage the growth of multiple secondary bolts. After 7 days, the plants were ready for infiltration. Agrobacterium (EHA105) carrying the construct to be tested was cultured at 28°C to when A600 was between 0.65 and 0.8. The cells were harvested in inoculation medium (4.3 g/1 of MS salt, 0.5 mg/1 of nicotinic acid, 0.5 mg/1 of pyridoxine-HCl, 1 mg/1 of Thiamine-HC1, 0.1 g of myo-inositol, 1 g/1 of casamino acids, 0.01 mg of BAP, 68.5 g/1 of sucrose, and 36 g/1 of glucose) at A600 of 7.5.
The clipped plants to be transformed were inverted into a 250 ml beaker c:on.taining the above Agrobacterium solution. The beaker was placed into a bell jar and was vacuumed until bubbles formed on leaf and stem surface. After 15 minutes of infiltration, the vacuum was released and the plants were removed from the beaker, laid on its side in a plastic flat, and covered with plastic wrap. The plants were 2~ set upright and grown in greenhouse for four weeks before seeds were harvested. Transgenic seeds were selected by planting the seeds on a medium plate containing 65 ~g/ml of kanamycin.
In transient expression assays, the Rsyn7 promoter in PHP10464 has 15% of the 35S promoter (PHP9925) activity, whereas the 35SU-Rsyn°7 promoter (PHP9778) (hereinafter SCPl promoter) has about 107°. of the 35S promoter activity (Figure 62451-854 (D) 10). Thus, the upstream activating region of the CaMV 35S gene increased the Rsyn7 a<:ti.vity by about 6 fold.
The maize Ubi-1 upstream element {UbiU) has similar effects on the Rsyn7 promoter in transient assays. When the .'~ upstream activating region (UAR) of the maize Ubi-1 was fused to Rsyn7, the GUS enzyme activity increased with the number of copy of the UAR. In the presence of three copies of UbiU, GUS
activity increased by about 4-fold. This additive effect of UbiU was not observed when placed in the context of the maize 1() ubiquitin promoter (PHP11974). This suggests that replacement of the maize Ubil core promoter with Rsyn7 may convert the monocot Ubil promoter into a highly active promoter in dicot plants (Figure 11).
In the stably transformed sunflower callus, GUS
1!~ expression is 20~ higher behind the control of the 35SU-Rsyn7 (SCP1 promoter) promoter than when behind the control of the 35S CaMV promoter (Figure 12).
The results of- the transgenic callus assay are given in Figure 13. The Rsyn7 promoter containing a single copy of 20 UbiU (PHP10991) (hereinafter UCPl promoter) (SEQ ID N0: 15) exhibited promoter activity of about 3 times that of the 35SU-Rsyn7 (SCP1) promoter (PHP10940). Three copies of UbiU
(PHP10993) increased Rsyn'7 promoter activity to about 7 times that of the 35SU-Rsyn'l promoter. The 3xUbiU-Rsyn7 (UCP3 25 promoter) (SEQ ID NO: 17) is by far the strongest promoter in sunflower tissues.
To determine the activi~y and tissue-specificity of the enhanced Rsyn7 promoters, stably-transformed sunflower and Arabidopsis were generated through Agrobacterium-mediated 30 transformation. The histochemical staining of GUS expression in transgenic T1 Arab_idopsis indicates that 35SU-Rsyn7 (SCP1 62451-854 (D) promoter) (PHP10940) has identical tissue-specificity and similar activity as 35S CaMV promoter (PHP10989). Both promoters express GUS in leaf, stem, petiole, and floral parts.
UbiU-Rsyn7 (USCP1 promoter) (PHP10991) exhibits higher activity ~i than maize Ubi-1 promoter (PHP11031) in Arabidopsis stem and leaf tissues.
All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains.
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.
SEQUENCE LISTING
<110> Wesley, Bruce <120> Synthetic Promoters <130> 62451-854D
<140> PCT/US99/03863 <141> 1999-02-23 <150> 08/661,601 <151> 1996-06-11 <160> 18 <170> PatentIn Ver. 2.0 <210> 1 <211> 72 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic core promoter sequence <400> 1 ggatccactc gagcggctat aaatacgt.ac ctacgcacgc tgcgctacca tcccgagcac 60 tgcagtgtcg ac 72 <210> 2 <211> 96 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: a synthetic upstream element that helps potentiate transcription <400> 2 ggatcctatg cgtatggtat gacgtgtc~tt caagatgatg acttcaaacc tacctatgac 60 gtatggtatg acgtgtgtcg actgatga.ct tagatc 96 <210> 3 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: A typical nucleotide base arrangement of a core promoter containing the consensus. sequences of TATA and INR
motifs present in plant promoters.
<220>
<221> unsure <222> (5) <223> The nucleotide at this position may be an a or t.
<220>
<221> unsure <222> (7) <223> The nucleotide at this xaosition may be an a or t.
<220>
<221> unsure <222> (10)..(11) <223> The nucleotides at these positions may be a t or c.
<220>
<221> unsure <222> (16) <223> The nucleotide at this position may be a or c.
<400> 3 tatawawaty ytcatmaa 18 <210> 4 <211> 40 <212> DNA
<213> Cauliflower mosaic virus <400> 4 cctctatata agcaagttca tttcatttgg agaggaaacg 40 <210> 5 <211> 5 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: OCS element <400> 5 tgacg 5 <210> 6 <211> 66 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 6 tcgacactgc agctctaggg atggtagcgc agggtgcgta ggtacgtatt tatagccgct 60 cgagtg 66 <210> 7 <211> 66 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 7 gatccactcg agcggctata aatacgta.cc tacgcaccct gcgctaccat ccctagagct 60 gcagtg 66 <210> 8 <211> 92 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 8 gatcctatga cgtatggtat gacgtgta~tt caagatgatg acttcaaacc tacctatgac 60 gtatggtatg acgtgtgtcg actgatga.ct to 92 <210> 9 <211> 92 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic oligonucleotide <400> 9 gatctaagtc atcagtcgac acacgtcata ccatacgtca taggtaggtt tgaagtcatc 60 atcttgaaca cacgtcatac catacgtcat ag 92 <210> 10 <211> 72 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic nucleic acid core promoter with G/C transversions <220>
<223> The nucleotides at positions 26, 27, 30, 31, 34, 35, 36, 38, 39, 40, 42, 43, 44, 45, 48, and 49, may be g or c.
<400> 10 ggatccactc gagcggctat aaatasstas stasssasss tsssstassa tcccgagcac 60 tgcagtgtcg ac 72 <210> 11 <211> 332 <212> DNA
<213> Cauliflower mosaic virus <400> 11 cgtcaacatg gtggagcacg acactctcgt ctactccaag aatatcaaag atacagtctc 60 agaagaccaa agggctattg agacttttca acaaagggta atatcgggaa acctcctcgg 120 attccattgc ccagctatct gtcacttcat caaaaggaca gtagaaaagg aaggtggcac 180 ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt caagatgcct ctgccgacag 240 tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 300 cacgtcttca aagcaagtgg attgatgtga tg 332 <210> 12 <211> 499 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: 35S UAR of cauliflower mosaic virus operably linked to the core promoter sequence <400> 12 cgtcaacatg gtggagcacg acactctcgt ctactccaag aatatcaaag atacagtctc 60 agaagaccaa agggctattg agacttttca acaaagggta atatcgggaa acctcctcgg 120 attccattgc ccagctatct gtcacttcat caaaaggaca gtagaaaagg aaggtggcac 180 ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt. caagatgcct ctgccgacag 240 tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 300 cacgtcttca aagcaagtgg attgatgtga tgatcctatg cgtatggtat gacgtgtgtt 360 caagatgatg acttcaaacc tacctatgac gtatggtatg acgtgtgtcg actgatgact 420 tagatccact cgagcggcta taaatacgta cctacgcacc ctgcgctacc atccctagag 480 ctgcatgctt atttttaca 499 <210> 13 <211> 813 <212> DNA
<213> Zea mays <400> 13 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cct 813 <210> 14 <211> 1600 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: 35S UAR of cauliflower mosaic virus operably linked to the core promoter operably linked to the oxalate oxidase coding sequence operably linked to PinII
terminator.
<400> 14 gatctgagtc tagaaatccg tcaacatggt ggagcacgac actctcgtct actccaagaa 60 tatcaaagat acagtctcag aagaccaa.a.g ggctattgag acttttcaac aaagggtaat 120 atcgggaaac ctcctcggat tccattgrcc agctatctgt cacttcatca aaaggacagt 180 agaaaaggaa ggtggcacct acaaatgcca tcattgcgat aaaggaaagg ctatcgttca 240 agatgcctct gccgacagtg gtcccaaaga tggaccccca cccacgagga gcatcgtgga 300 aaaagaagac gttccaacca cgtcttcaaa gcaagtggat tgatgtgatg atcctatgcg 360 tatggtatga cgtgtgttca agatgatgac ttcaaaccta cctatgacgt atggtatgaa 420 cgtgtgtcga ctgatgactt agatccactc gagcggctat aaatacgtac ctacgcaccc 480 tgcgctacca tccctagagc tgcagcttat ttttacaaca attaccaaca acaacaaaca 540 acaaacaaca ttacaattac tatttacaat tacagtcgac ccgggatcca tggggtactc 600 caaaacccta gtagctggcc tgttcgcaat gctgttacta gctccggccg tcttggccac 660 cgacccagac cctctccagg acttctgtgt cgccgacctc gacggcaagg cggtctcggt 720 gaacgggcac acgtgcaagc ccatgtcgga ggccggcgac gacttcctct tctcgtccaa 780 gttggccaag gccggcaaca cgtccacccc gaacggctcc gccgtgacgg agctcgacgt 840 ggccgagtgg cccggtacca acaagctggg tggtgtcatg aaccgcgtgg attttggtcc 900 cggagggacc aacccaccac acatccaccc gcgtgccacc gagatcggca tcgtgatgaa 960 aggtgagctt ctcgtgggaa tccttggcag cctcgactcc gggaacaagc tctactcgag 1020 ggtggtgcgc gccggagaga cgttcctcat cccacggggc ctcatgcact tccagttcaa 1080 cgtcggtaag accgaggcct ccatggtcgt ctccttcaac agccagaacc ccggcattgt 1140 cttcgtgccc ctcacgctct tcggctccaa cccgcccatc~ ccaacgccgg tgctcaccaa 1200 ggcactccgg gtggaggcca gggtcgtgga acttctcaag tccaagtttg ccgctgggtt 1260 ttaatttcta ggatcctcta gagtcgaacc tagacttgtc catcttctgg attggccaac 1320 ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca ctataatgtg 1380 ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga gaaagagatc 1440 atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg atgaaccaga 1500 tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa ttaatatcaa 1560 ttgggttagc aaaacaaatc tagtctaggt gtgttttgcc 1600 <210> 15 <211> 994 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: Rsyn7 promoter containing 1 copy of the zea mays ubi-1 upstream element (UbiU) <400> 15 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagaa ctagtggatc ctatgcgtat 840 ggtatgacgt gtgttcaaga tgatgacttc aaacctacct atgacgtatg gtatgacgtg 900 tgtcgactga tgacttagat ccactcgagc ggctataaat acgtacctac gcaccctgcg 960 ctaccatccc tagagctgca tgcttatttt taca 994 <210> 16 <211> 1807 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: U~P2 promoter sequence containing 2 copies of the zea mays ubil UAR operably linked with the core promoter <400> 16 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct_. cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct: cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct at.tttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gaactagtgg atcctatgcg tatggtatga cgtgtgttca agatgatgac 1680 ttcaaaccta cctatgacgt atggtatgac gtgtgtcgac tgatgactta gatccactcg 1740 agcggctata aatacgtacc tacgcaccct gcgctaccat ccctagagct gcatgcttat 1800 ttttaca 1807 <210> 17 <211> 2620 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:Rsyn7 promoter containing 3 copies of zea mays UbiU upstream element <400> 17 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagatata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct attttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gagataatga gcattgcatg tctaagttat aaaaaattac cacatatttt 1680 ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt taaactttac 1740 tctacgaata atataatcta tagtactaca ataatatcag tgttttagag aatcatataa 1800 atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg actctacagt 1860 tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc acctatataa 1920 tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt ttttatagac 1980 taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga aaactaaaac 2040 tctattttag tttttttatt taataattta gatataaaat agaataaaat aaagtgacta 2100 aaaattaaac aaataccctt taagaaatta aaaaaactaa. ggaaacattt ttcttgtttc 2160 gagtagataa tgccagcctg ttaaacgccg tcgacgagtc taacggacac caaccagcga 2220 accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc 2280 tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa 2340 attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac 2400 ggcacggcag ctacggggga ttcctttccc accgctccta ctagaactag tggatcctat 2460 gcgtatggta tgacgtgtgt tcaagatgat gacttcaaac ctacctatga cgtatggtat 2520 gacgtgtgtc gactgatgac ttagatccac tcgagcggct ataaatacgt acctacgcac 2580 cctgcgctac catccctaga gctgcatgct tatttttaca 2620 <210> 18 <211> 3433 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: UCP4 promoter sequence containing 4 copies of ubil UAR operably linked with the core promoter.
<400> 18 tctagagata atgagcattg catgtctaag ttataaaaaa ttaccacata ttttttttgt 60 cacacttgtt tgaagtgcag tttatctatc tttatacata tatttaaact ttactctacg 120 aataatataa tctatagtac tacaataata tcagtgtttt agagaatcat ataaatgaac 180 agttagacat ggtctaaagg acaattgagt attttgacaa caggactcta cagttttatc 240 tttttagtgt gcatgtgttc tccttttttt ttgcaaatag cttcacctat ataatacttc 300 atccatttta ttagtacatc catttagggt ttagggttaa tggtttttat agactaattt 360 ttttagtaca tctattttat tctattttag cctctaaatt aagaaaacta aaactctatt 420 ttagtttttt tatttaataa tttagat:ata aaatagaata aaataaagtg actaaaaatt 480 aaacaaatac cctttaagaa attaaaaaaa ctaaggaaac atttttcttg tttcgagtag 540 ataatgccag cctgttaaac gccgtcgacg agtctaacgg acaccaacca gcgaaccagc 600 agcgtcgcgt cgggccaagc gaagcagacg gcacggcatc tctgtcgctg cctctggacc 660 cctctcgaga gttccgctcc accgttggac ttgctccgct gtcggcatcc agaaattgcg 720 tggcggagcg gcagacgtga gccggcacgg caggcggcct cctcctcctc tcacggcacg 780 gcagctacgg gggattcctt tcccaccgct cctactagag ataatgagca ttgcatgtct 840 aagttataaa aaattaccac atattttttt. tgtcacactt gtttgaagtg cagtttatct 900 atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata 960 atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg 1020 agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt 1080 tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag 1140 ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt 1200 tagcctctaa attaagaaaa ctaaaactct attttagttt ttttatttaa taatttagat 1260 ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa 1320 aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg 1380 acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag 1440 acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg 1500 gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca 1560 cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc 1620 gctcctacta gagataatga gcattgcatg tctaagttat aaaaaattac cacatatttt 1680 ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt taaactttac 1740 tctacgaata atataatcta tagtactaca ataatatcag tgttttagag aatcatataa 1800 atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg actctacagt 1860 tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc acctatataa 1920 tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt ttttatagac 1980 taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga aa.actaaaac 2040 tctattttag tttttttatt taataattta gatataaaat agaataaaat aaagtgacta 2100 aaaattaaac aaataccctt taagaaatta aaaaaactaa ggaaacattt ttcttgtttc 2160 gagtagataa tgccagcctg ttaaacgccg tcgacgagtc taacggacac caaccagcga 2220 accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc 2280 tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa 2340 attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac 2400 ggcacggcag ctacggggga ttcctttccc accgctccta ctagagataa tgagcattgc 2460 atgtctaagt tataaaaaat taccacatat tttttttgtc acacttgttt gaagtgcagt 2520 ttatctatct ttatacatat atttaaactt tactctacga ataatataat ctatagtact 2580 acaataatat cagtgtttta gagaatcata taaatgaaca gttagacatg gtctaaagga 2640 caattgagta ttttgacaac aggactctac agttttatct ttttagtgtg catgtgttct 2700 cctttttttt tgcaaatagc ttcacctata taatacttca tccattttat tagtacatcc 2760 atttagggtt tagggttaat ggtttttata gactaatttt tttagtacat ctattttatt 2820 ctattttagc ctctaaatta agaaaactaa aactctattt tagttttttt atttaataat 2880 ttagatataa aatagaataa aataaagtga ctaaaaatta aacaaatacc ctttaagaaa 2940 ttaaaaaaac taaggaaaca tttttcttgt ttcgagtaga taatgccagc ctgttaaacg 3000 ccgtcgacga gtctaacgga caccaaccag cgaaccagca gcgtcgcgtc gggccaagcg 3060 aagcagacgg cacggcatct ctgtcgctgc ctctggaccc ctctcgagag ttccgctcca 3120 ccgttggact tgctccgctg tcggcatcca gaaattgcgt ggcggagcgg cagacgtgag 3180 ccggcacggc aggcggcctc ctcctcctct cacggcacgg cagctacggg ggattccttt 3240 cccaccgctc ctactagaac tagtggatcc tatgcgtatg gtatgacgtg tgttcaagat 3300 gatgacttca aacctaccta tgacgtatgg tatgacgtgt gtcgactgat gacttagatc 3360 cactcgagcg gctataaata cgtacctacg caccctgcgc taccatccct agagctgcat 3420 gcttattttt aca 3433
Claims (13)
1. ~A dicotyledonous plant cell which has been transformed with a construct comprising a synthetic DNA plant promoter sequence, said sequence comprising:
a TATA motif;
a transcription start site;
a region between said TATA motif and said start site that is at least about 64% GC-rich;
an upstream element; and one or more upstream activating regions;
wherein said promoter sequence comprises a sequence selected from the group consisting of:
(a) a nucleotide sequence set forth in SEQ ID NO:
12;
(b) a nucleotide sequence set forth in SEQ ID NO:
15;
(c) a nucleotide sequence set forth in SEQ ID NO:
16;
(d) a nucleotide sequence set forth in SEQ ID NO:
17; and, (e) a nucleotide sequence set forth in SEQ ID NO:
18.
a TATA motif;
a transcription start site;
a region between said TATA motif and said start site that is at least about 64% GC-rich;
an upstream element; and one or more upstream activating regions;
wherein said promoter sequence comprises a sequence selected from the group consisting of:
(a) a nucleotide sequence set forth in SEQ ID NO:
12;
(b) a nucleotide sequence set forth in SEQ ID NO:
15;
(c) a nucleotide sequence set forth in SEQ ID NO:
16;
(d) a nucleotide sequence set forth in SEQ ID NO:
17; and, (e) a nucleotide sequence set forth in SEQ ID NO:
18.
2. ~The plant cell of claim 1, wherein said promoter is operably linked to a structural gene.
3. ~The plant cell of claim 2, wherein said structural gene is an oxalate oxidase gene.
4. ~The plant cell of claim 3, wherein said plant cell is from a sunflower plant.
5. ~A dicotyledonous seed cell which has been transformed with a construct comprising a synthetic DNA plant promoter sequence, said sequence comprising:
a TATA motif;
a transcription start site;
a region between said TATA motif and said start site that is at least about 64% GC-rich;
an upstream element; and one or more upstream activating regions;
wherein said promoter sequence comprises a sequence selected from the group consisting of:
(a) a nucleotide sequence set forth in SEQ ID NO:
12, (b) a nucleotide sequence set forth in SEQ ID NO:
15;
(c) a nucleotide sequence set forth in SEQ ID NO:
16;
(d) a nucleotide sequence set forth in SEQ ID NO:
17; and, (e) a nucleotide sequence set forth in SEQ ID NO:
18.
a TATA motif;
a transcription start site;
a region between said TATA motif and said start site that is at least about 64% GC-rich;
an upstream element; and one or more upstream activating regions;
wherein said promoter sequence comprises a sequence selected from the group consisting of:
(a) a nucleotide sequence set forth in SEQ ID NO:
12, (b) a nucleotide sequence set forth in SEQ ID NO:
15;
(c) a nucleotide sequence set forth in SEQ ID NO:
16;
(d) a nucleotide sequence set forth in SEQ ID NO:
17; and, (e) a nucleotide sequence set forth in SEQ ID NO:
18.
6. ~The seed cell of claim 5, wherein said promoter is operably liked to a structural gene.
7. ~The seed cell of claim 6, wherein said structural gene is an oxalate oxidase gene.
8. The seed cell of claim 7, wherein said plant cell is from a sunflower seed.
9. A transgenic dicotyledonous plant cell expressing an exogenous oxalate oxidase gene at a high level wherein said oxalate oxidase gene is under transcriptional control of the promoter defined in claim 1.
10. The transgenic plant cell of Maim 27, wherein said dicotyledonous plant cell is from a sunflower plant.
11. The plant cell of claim 27, wherein said promoter is operably linked to a structural gene.
12. The plant cell of claim 11, wherein said structural gene is an oxalate oxidase gene.
13. The plant cell of claim 13, wherein said plant cell is from a sunflower plant.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/028,819 US6072050A (en) | 1996-06-11 | 1998-02-24 | Synthetic promoters |
US09/028,819 | 1998-02-24 | ||
CA002314598A CA2314598C (en) | 1998-02-24 | 1999-02-23 | Synthetic promoters |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002314598A Division CA2314598C (en) | 1998-02-24 | 1999-02-23 | Synthetic promoters |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2354228A1 true CA2354228A1 (en) | 1999-09-02 |
Family
ID=25681981
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002354228A Abandoned CA2354228A1 (en) | 1998-02-24 | 1999-02-23 | Plants and seeds containing synthetic promoters |
Country Status (1)
Country | Link |
---|---|
CA (1) | CA2354228A1 (en) |
-
1999
- 1999-02-23 CA CA002354228A patent/CA2354228A1/en not_active Abandoned
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6555673B1 (en) | Synthetic promoters | |
AU729929B2 (en) | A synthetic plant core promoter and upstream regulatory element | |
NO323171B1 (en) | Isolated nucleotide sequence comprising an acetohydroxy acid synthase promoter, use thereof, transformation vector, plant cell and plant comprising said nucleotide sequence, nucleic acid construction comprising the sequence and methods of uses thereof. | |
US6031151A (en) | Callus-specific promoters | |
WO2001094394A2 (en) | Plant ubiquitin promoter sequences and methods of use | |
CN111116725B (en) | Application of gene Os11g0682000 and its encoded protein in regulating rice bacterial blight resistance | |
AU2010267655B2 (en) | Expression of transcription regulators that provide heat tolerance | |
AU4396400A (en) | Pathogen inducible promoter | |
EP1165755B1 (en) | Banana and melon promoters for expression of transgenes in plants | |
CN114349833B (en) | Application of calmodulin binding protein COLD12 in regulation and control of plant COLD tolerance | |
AU2002249270B2 (en) | Constitutive promoter from arabidopsis | |
CN115028698A (en) | Plant virus accumulation related protein and coding gene and application thereof | |
CN117209575B (en) | Application of protein and encoding gene thereof in regulation and control of corn northern leaf blight and northern leaf blight | |
CA2354228A1 (en) | Plants and seeds containing synthetic promoters | |
CA2497799C (en) | Acetolactate synthase gene promoter | |
KR101847974B1 (en) | Constitutive promoter from Oryza sativa Os09g0553100 gene for transforming monocot plant and uses thereof | |
AU7244400A (en) | A synthetic plant core promoter and upstream regulatory element | |
CN119391755A (en) | Application of GhPSKR gene in regulation and control of salt tolerance of plants | |
AU2012361915A1 (en) | Use of auxin synthase for improving crop yield | |
CN114645019A (en) | Stress-induced expression gene promoter KT626 and application thereof | |
EP1234881A1 (en) | Dna fragment having promoter function | |
MXPA98010574A (en) | Synthetic plan promoter | |
US20080134357A1 (en) | Promoter, promoter control elements, and combinations, and uses thereof | |
JP2004534535A (en) | New plant promoter |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |