CA2324520A1 - Plant promoter sequences and methods of use thereof - Google Patents
Plant promoter sequences and methods of use thereof Download PDFInfo
- Publication number
- CA2324520A1 CA2324520A1 CA002324520A CA2324520A CA2324520A1 CA 2324520 A1 CA2324520 A1 CA 2324520A1 CA 002324520 A CA002324520 A CA 002324520A CA 2324520 A CA2324520 A CA 2324520A CA 2324520 A1 CA2324520 A1 CA 2324520A1
- Authority
- CA
- Canada
- Prior art keywords
- seq
- nucleic acid
- sequence
- acid sequence
- gus
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 78
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 197
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 176
- 125000003729 nucleotide group Chemical group 0.000 claims description 187
- 239000002773 nucleotide Substances 0.000 claims description 184
- 230000009261 transgenic effect Effects 0.000 claims description 81
- 230000000694 effects Effects 0.000 claims description 73
- 230000000295 complement effect Effects 0.000 claims description 61
- 108700019146 Transgenes Proteins 0.000 claims description 45
- 230000001172 regenerating effect Effects 0.000 claims description 4
- 241000196324 Embryophyta Species 0.000 abstract description 275
- 235000007201 Saccharum officinarum Nutrition 0.000 abstract description 91
- 230000014509 gene expression Effects 0.000 abstract description 82
- 108010068086 Polyubiquitin Proteins 0.000 abstract description 62
- 102100037935 Polyubiquitin-C Human genes 0.000 abstract description 48
- 230000001105 regulatory effect Effects 0.000 abstract description 8
- 240000000111 Saccharum officinarum Species 0.000 abstract 4
- 108090000623 proteins and genes Proteins 0.000 description 179
- 210000004027 cell Anatomy 0.000 description 169
- 210000001519 tissue Anatomy 0.000 description 93
- 241000609963 Saccharum hybrid cultivar Species 0.000 description 88
- 239000013612 plasmid Substances 0.000 description 73
- 108020004414 DNA Proteins 0.000 description 57
- 206010020649 Hyperkeratosis Diseases 0.000 description 54
- 101150070177 ubi4 gene Proteins 0.000 description 50
- 239000012634 fragment Substances 0.000 description 49
- 239000000523 sample Substances 0.000 description 44
- 241000589158 Agrobacterium Species 0.000 description 42
- 238000011144 upstream manufacturing Methods 0.000 description 41
- 102000004169 proteins and genes Human genes 0.000 description 37
- 240000008042 Zea mays Species 0.000 description 35
- 239000013598 vector Substances 0.000 description 34
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 32
- 238000009396 hybridization Methods 0.000 description 32
- 235000018102 proteins Nutrition 0.000 description 32
- 108020004707 nucleic acids Proteins 0.000 description 31
- 102000039446 nucleic acids Human genes 0.000 description 31
- 230000009466 transformation Effects 0.000 description 30
- 240000007594 Oryza sativa Species 0.000 description 29
- 235000007164 Oryza sativa Nutrition 0.000 description 29
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 29
- 235000009973 maize Nutrition 0.000 description 29
- 239000003550 marker Substances 0.000 description 29
- 235000009566 rice Nutrition 0.000 description 28
- 241000208125 Nicotiana Species 0.000 description 27
- 239000002609 medium Substances 0.000 description 27
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 26
- 108091081024 Start codon Proteins 0.000 description 26
- 239000002245 particle Substances 0.000 description 26
- 239000013615 primer Substances 0.000 description 26
- 238000013519 translation Methods 0.000 description 26
- 240000003768 Solanum lycopersicum Species 0.000 description 23
- 238000003556 assay Methods 0.000 description 23
- 108020004999 messenger RNA Proteins 0.000 description 23
- 238000003752 polymerase chain reaction Methods 0.000 description 21
- 108020003589 5' Untranslated Regions Proteins 0.000 description 20
- 102000004190 Enzymes Human genes 0.000 description 20
- 108090000790 Enzymes Proteins 0.000 description 20
- 108700008625 Reporter Genes Proteins 0.000 description 19
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 19
- 239000013604 expression vector Substances 0.000 description 19
- 235000007119 Ananas comosus Nutrition 0.000 description 18
- 244000068988 Glycine max Species 0.000 description 18
- 240000006394 Sorghum bicolor Species 0.000 description 18
- 238000013518 transcription Methods 0.000 description 18
- 230000035897 transcription Effects 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 17
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 16
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 16
- 239000002299 complementary DNA Substances 0.000 description 16
- 235000013399 edible fruits Nutrition 0.000 description 15
- 230000001744 histochemical effect Effects 0.000 description 15
- 229920001184 polypeptide Polymers 0.000 description 15
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 102000004196 processed proteins & peptides Human genes 0.000 description 15
- 230000008929 regeneration Effects 0.000 description 15
- 238000011069 regeneration method Methods 0.000 description 15
- 108020005345 3' Untranslated Regions Proteins 0.000 description 14
- 244000099147 Ananas comosus Species 0.000 description 14
- 235000010469 Glycine max Nutrition 0.000 description 14
- 230000010473 stable expression Effects 0.000 description 14
- 241000234282 Allium Species 0.000 description 13
- 241000219173 Carica Species 0.000 description 12
- 235000009467 Carica papaya Nutrition 0.000 description 12
- 241000234295 Musa Species 0.000 description 12
- 235000007238 Secale cereale Nutrition 0.000 description 12
- 241000209140 Triticum Species 0.000 description 12
- 235000021307 Triticum Nutrition 0.000 description 12
- 230000010474 transient expression Effects 0.000 description 12
- 229920001817 Agar Polymers 0.000 description 11
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 11
- 108020004705 Codon Proteins 0.000 description 11
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 11
- 241000209056 Secale Species 0.000 description 11
- 239000008272 agar Substances 0.000 description 11
- 210000002257 embryonic structure Anatomy 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 239000000725 suspension Substances 0.000 description 11
- 230000001052 transient effect Effects 0.000 description 11
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 10
- 235000007319 Avena orientalis Nutrition 0.000 description 10
- 244000075850 Avena orientalis Species 0.000 description 10
- 244000060011 Cocos nucifera Species 0.000 description 10
- 235000013162 Cocos nucifera Nutrition 0.000 description 10
- 108091026890 Coding region Proteins 0.000 description 10
- 235000007340 Hordeum vulgare Nutrition 0.000 description 10
- 240000005979 Hordeum vulgare Species 0.000 description 10
- 230000000692 anti-sense effect Effects 0.000 description 10
- 235000004879 dioscorea Nutrition 0.000 description 10
- 230000001939 inductive effect Effects 0.000 description 10
- 238000003780 insertion Methods 0.000 description 10
- 230000037431 insertion Effects 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 235000007558 Avena sp Nutrition 0.000 description 9
- 240000004270 Colocasia esculenta var. antiquorum Species 0.000 description 9
- 235000002723 Dioscorea alata Nutrition 0.000 description 9
- 235000007056 Dioscorea composita Nutrition 0.000 description 9
- 235000009723 Dioscorea convolvulacea Nutrition 0.000 description 9
- 235000005362 Dioscorea floribunda Nutrition 0.000 description 9
- 235000004868 Dioscorea macrostachya Nutrition 0.000 description 9
- 235000005361 Dioscorea nummularia Nutrition 0.000 description 9
- 235000005360 Dioscorea spiculiflora Nutrition 0.000 description 9
- 235000006350 Ipomoea batatas var. batatas Nutrition 0.000 description 9
- 241000501103 Saccharum hybrid cultivar H32-8560 Species 0.000 description 9
- 108091023045 Untranslated Region Proteins 0.000 description 9
- 150000001413 amino acids Chemical group 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 9
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 9
- 108010058731 nopaline synthase Proteins 0.000 description 9
- 108020005544 Antisense RNA Proteins 0.000 description 8
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 8
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 8
- 239000003184 complementary RNA Substances 0.000 description 8
- 210000001938 protoplast Anatomy 0.000 description 8
- 230000035939 shock Effects 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 7
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 7
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 7
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 7
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 7
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 7
- 102000044159 Ubiquitin Human genes 0.000 description 7
- 108090000848 Ubiquitin Proteins 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 230000000408 embryogenic effect Effects 0.000 description 7
- 229910052737 gold Inorganic materials 0.000 description 7
- 239000010931 gold Substances 0.000 description 7
- 230000012010 growth Effects 0.000 description 7
- 230000006801 homologous recombination Effects 0.000 description 7
- 238000002744 homologous recombination Methods 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 238000011282 treatment Methods 0.000 description 7
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 6
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 6
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 6
- 108010025815 Kanamycin Kinase Proteins 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 6
- 241001092459 Rubus Species 0.000 description 6
- 229930006000 Sucrose Natural products 0.000 description 6
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 6
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 6
- 108700041896 Zea mays Ubi-1 Proteins 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 6
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 6
- 239000004009 herbicide Substances 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 230000008520 organization Effects 0.000 description 6
- 230000036961 partial effect Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 238000010186 staining Methods 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 239000005720 sucrose Substances 0.000 description 6
- 230000008685 targeting Effects 0.000 description 6
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 5
- 241000234671 Ananas Species 0.000 description 5
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 5
- 102000030523 Catechol oxidase Human genes 0.000 description 5
- 108010031396 Catechol oxidase Proteins 0.000 description 5
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 5
- 239000005977 Ethylene Substances 0.000 description 5
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 5
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 5
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 229940024606 amino acid Drugs 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 230000027455 binding Effects 0.000 description 5
- 239000012472 biological sample Substances 0.000 description 5
- -1 dextran sulfate Substances 0.000 description 5
- 230000008642 heat stress Effects 0.000 description 5
- 230000002363 herbicidal effect Effects 0.000 description 5
- 229930027917 kanamycin Natural products 0.000 description 5
- 229960000318 kanamycin Drugs 0.000 description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 5
- 229930182823 kanamycin A Natural products 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 210000000056 organ Anatomy 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 101710194665 1-aminocyclopropane-1-carboxylate synthase Proteins 0.000 description 4
- PAJPWUMXBYXFCZ-UHFFFAOYSA-N 1-aminocyclopropanecarboxylic acid Chemical compound OC(=O)C1(N)CC1 PAJPWUMXBYXFCZ-UHFFFAOYSA-N 0.000 description 4
- SFYDWLYPIXHPML-UHFFFAOYSA-N 3-nitro-1-(2,4,6-trimethylphenyl)sulfonyl-1,2,4-triazole Chemical compound CC1=CC(C)=CC(C)=C1S(=O)(=O)N1N=C([N+]([O-])=O)N=C1 SFYDWLYPIXHPML-UHFFFAOYSA-N 0.000 description 4
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 4
- 241000701489 Cauliflower mosaic virus Species 0.000 description 4
- 108020004635 Complementary DNA Proteins 0.000 description 4
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 4
- 239000005562 Glyphosate Substances 0.000 description 4
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 4
- 235000014826 Mangifera indica Nutrition 0.000 description 4
- 240000007228 Mangifera indica Species 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 108020005120 Plant DNA Proteins 0.000 description 4
- 108700001094 Plant Genes Proteins 0.000 description 4
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 4
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 4
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- 108091036066 Three prime untranslated region Proteins 0.000 description 4
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 4
- 240000006365 Vitis vinifera Species 0.000 description 4
- 235000014787 Vitis vinifera Nutrition 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 210000004748 cultured cell Anatomy 0.000 description 4
- 238000005520 cutting process Methods 0.000 description 4
- 239000013613 expression plasmid Substances 0.000 description 4
- 239000007789 gas Substances 0.000 description 4
- 229940097068 glyphosate Drugs 0.000 description 4
- 101150054900 gus gene Proteins 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 231100001222 nononcogenic Toxicity 0.000 description 4
- 230000009871 nonspecific binding Effects 0.000 description 4
- 238000007899 nucleic acid hybridization Methods 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 239000013605 shuttle vector Substances 0.000 description 4
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 108700012359 toxins Proteins 0.000 description 4
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 235000005340 Asparagus officinalis Nutrition 0.000 description 3
- 241000219112 Cucumis Species 0.000 description 3
- 241001057636 Dracaena deremensis Species 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 229920002148 Gellan gum Polymers 0.000 description 3
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 3
- 108091052347 Glucose transporter family Proteins 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 3
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 3
- 102000008297 Nuclear Matrix-Associated Proteins Human genes 0.000 description 3
- 108010035916 Nuclear Matrix-Associated Proteins Proteins 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 244000062793 Sorghum vulgare Species 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- 108700006291 Sucrose-phosphate synthases Proteins 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 3
- 208000027418 Wounds and injury Diseases 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 108010031100 chloroplast transit peptides Proteins 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 3
- 239000004062 cytokinin Substances 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 230000004345 fruit ripening Effects 0.000 description 3
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 210000001161 mammalian embryo Anatomy 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 210000000299 nuclear matrix Anatomy 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 230000037039 plant physiology Effects 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 229960002429 proline Drugs 0.000 description 3
- 230000000644 propagated effect Effects 0.000 description 3
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- JXOHGGNKMLTUBP-HSUXUTPPSA-N shikimic acid Chemical compound O[C@@H]1CC(C(O)=O)=C[C@@H](O)[C@H]1O JXOHGGNKMLTUBP-HSUXUTPPSA-N 0.000 description 3
- JXOHGGNKMLTUBP-JKUQZMGJSA-N shikimic acid Natural products O[C@@H]1CC(C(O)=O)=C[C@H](O)[C@@H]1O JXOHGGNKMLTUBP-JKUQZMGJSA-N 0.000 description 3
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 3
- 239000002689 soil Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 229910052721 tungsten Inorganic materials 0.000 description 3
- 239000010937 tungsten Substances 0.000 description 3
- 239000011782 vitamin Substances 0.000 description 3
- 235000013343 vitamin Nutrition 0.000 description 3
- 229940088594 vitamin Drugs 0.000 description 3
- 229930003231 vitamin Natural products 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- HCWLJSDMOMMDRF-SZWOQXJISA-N 3-[(3s,6r)-2-oxo-6-[(1s,2r,3r)-1,2,3,4-tetrahydroxybutyl]morpholin-3-yl]propanamide Chemical compound NC(=O)CC[C@@H]1NC[C@H]([C@@H](O)[C@H](O)[C@H](O)CO)OC1=O HCWLJSDMOMMDRF-SZWOQXJISA-N 0.000 description 2
- JXCKZXHCJOVIAV-UHFFFAOYSA-N 6-[(5-bromo-4-chloro-1h-indol-3-yl)oxy]-3,4,5-trihydroxyoxane-2-carboxylic acid;cyclohexanamine Chemical compound [NH3+]C1CCCCC1.O1C(C([O-])=O)C(O)C(O)C(O)C1OC1=CNC2=CC=C(Br)C(Cl)=C12 JXCKZXHCJOVIAV-UHFFFAOYSA-N 0.000 description 2
- 108010000700 Acetolactate synthase Proteins 0.000 description 2
- ILQOASSPNGAIBC-UHFFFAOYSA-N Agropine Natural products OC(O)C(O)CC(O)C1CN2C(CCC2=O)C(=O)O1 ILQOASSPNGAIBC-UHFFFAOYSA-N 0.000 description 2
- 235000003840 Amygdalus nana Nutrition 0.000 description 2
- 108020004491 Antisense DNA Proteins 0.000 description 2
- 240000007087 Apium graveolens Species 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- 244000003416 Asparagus officinalis Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 235000004936 Bromus mango Nutrition 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 244000241257 Cucumis melo Species 0.000 description 2
- 235000009842 Cucumis melo Nutrition 0.000 description 2
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- IMXSCCDUAFEIOE-UHFFFAOYSA-N D-Octopin Natural products OC(=O)C(C)NC(C(O)=O)CCCN=C(N)N IMXSCCDUAFEIOE-UHFFFAOYSA-N 0.000 description 2
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 2
- IMXSCCDUAFEIOE-RITPCOANSA-N D-octopine Chemical compound [O-]C(=O)[C@@H](C)[NH2+][C@H](C([O-])=O)CCCNC(N)=[NH2+] IMXSCCDUAFEIOE-RITPCOANSA-N 0.000 description 2
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 2
- 101150074155 DHFR gene Proteins 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 244000000626 Daucus carota Species 0.000 description 2
- 235000002767 Daucus carota Nutrition 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 102000018711 Facilitative Glucose Transport Proteins Human genes 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 240000009088 Fragaria x ananassa Species 0.000 description 2
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 240000007049 Juglans regia Species 0.000 description 2
- 235000009496 Juglans regia Nutrition 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- 229930182821 L-proline Natural products 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- 240000008415 Lactuca sativa Species 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- 239000006137 Luria-Bertani broth Substances 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- 235000011430 Malus pumila Nutrition 0.000 description 2
- 244000070406 Malus silvestris Species 0.000 description 2
- 235000015103 Malus silvestris Nutrition 0.000 description 2
- 241000219823 Medicago Species 0.000 description 2
- NWBJYWHLCVSVIJ-UHFFFAOYSA-N N-benzyladenine Chemical compound N=1C=NC=2NC=NC=2C=1NCC1=CC=CC=C1 NWBJYWHLCVSVIJ-UHFFFAOYSA-N 0.000 description 2
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 238000010222 PCR analysis Methods 0.000 description 2
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 2
- 244000046052 Phaseolus vulgaris Species 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 2
- 241000220299 Prunus Species 0.000 description 2
- 235000011432 Prunus Nutrition 0.000 description 2
- 235000009827 Prunus armeniaca Nutrition 0.000 description 2
- 244000018633 Prunus armeniaca Species 0.000 description 2
- 240000001987 Pyrus communis Species 0.000 description 2
- 235000014443 Pyrus communis Nutrition 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 2
- 244000235659 Rubus idaeus Species 0.000 description 2
- 241000209051 Saccharum Species 0.000 description 2
- 239000005708 Sodium hypochlorite Substances 0.000 description 2
- 235000002597 Solanum melongena Nutrition 0.000 description 2
- 244000061458 Solanum melongena Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000009184 Spondias indica Nutrition 0.000 description 2
- 108700026226 TATA Box Proteins 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 235000009754 Vitis X bourquina Nutrition 0.000 description 2
- 235000012333 Vitis X labruscana Nutrition 0.000 description 2
- 108010046516 Wheat Germ Agglutinins Proteins 0.000 description 2
- 206010052428 Wound Diseases 0.000 description 2
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 101150067366 adh gene Proteins 0.000 description 2
- 230000009418 agronomic effect Effects 0.000 description 2
- 239000003816 antisense DNA Substances 0.000 description 2
- 238000000211 autoradiogram Methods 0.000 description 2
- 239000007640 basal medium Substances 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 235000011148 calcium chloride Nutrition 0.000 description 2
- 229960004261 cefotaxime Drugs 0.000 description 2
- AZZMGZXNTDTSME-JUZDKLSSSA-M cefotaxime sodium Chemical compound [Na+].N([C@@H]1C(N2C(=C(COC(C)=O)CS[C@@H]21)C([O-])=O)=O)C(=O)\C(=N/OC)C1=CSC(N)=N1 AZZMGZXNTDTSME-JUZDKLSSSA-M 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000012737 fresh medium Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 235000002532 grape seed extract Nutrition 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 239000001307 helium Substances 0.000 description 2
- 229910052734 helium Inorganic materials 0.000 description 2
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 239000012188 paraffin wax Substances 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 239000003415 peat Substances 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 235000014774 prunus Nutrition 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- SQGYOTSLMSWVJD-UHFFFAOYSA-N silver(1+) nitrate Chemical compound [Ag+].[O-]N(=O)=O SQGYOTSLMSWVJD-UHFFFAOYSA-N 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 229940063673 spermidine Drugs 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000004114 suspension culture Methods 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 2
- HXKWSTRRCHTUEC-UHFFFAOYSA-N 2,4-Dichlorophenoxyaceticacid Chemical compound OC(=O)C(Cl)OC1=CC=C(Cl)C=C1 HXKWSTRRCHTUEC-UHFFFAOYSA-N 0.000 description 1
- OVSKIKFHRZPJSS-DOMIDYPGSA-N 2-(2,4-dichlorophenoxy)acetic acid Chemical compound OC(=O)[14CH2]OC1=CC=C(Cl)C=C1Cl OVSKIKFHRZPJSS-DOMIDYPGSA-N 0.000 description 1
- VLEIUWBSEKKKFX-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O VLEIUWBSEKKKFX-UHFFFAOYSA-N 0.000 description 1
- XMTQQYYKAHVGBJ-UHFFFAOYSA-N 3-(3,4-DICHLOROPHENYL)-1,1-DIMETHYLUREA Chemical compound CN(C)C(=O)NC1=CC=C(Cl)C(Cl)=C1 XMTQQYYKAHVGBJ-UHFFFAOYSA-N 0.000 description 1
- QYOJSKGCWNAKGW-PBXRRBTRSA-N 3-phosphoshikimic acid Chemical compound O[C@@H]1CC(C(O)=O)=C[C@@H](OP(O)(O)=O)[C@H]1O QYOJSKGCWNAKGW-PBXRRBTRSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- 235000003934 Abelmoschus esculentus Nutrition 0.000 description 1
- 240000004507 Abelmoschus esculentus Species 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 235000011446 Amygdalus persica Nutrition 0.000 description 1
- 241000207875 Antirrhinum Species 0.000 description 1
- 235000002764 Apium graveolens Nutrition 0.000 description 1
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 description 1
- 235000010591 Appio Nutrition 0.000 description 1
- 235000003911 Arachis Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- 241001106067 Atropa Species 0.000 description 1
- 229930192334 Auxin Natural products 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 1
- 229920002799 BoPET Polymers 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000011303 Brassica alboglabra Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000011302 Brassica oleracea Nutrition 0.000 description 1
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 1
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 1
- 241000209200 Bromus Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- 101710163595 Chaperone protein DnaK Proteins 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 235000007516 Chrysanthemum Nutrition 0.000 description 1
- 240000005250 Chrysanthemum indicum Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 241000219109 Citrullus Species 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- 244000270200 Citrullus vulgaris Species 0.000 description 1
- 235000012840 Citrullus vulgaris Nutrition 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- 235000005976 Citrus sinensis Nutrition 0.000 description 1
- 240000002319 Citrus sinensis Species 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 108091027551 Cointegrate Proteins 0.000 description 1
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 241000612153 Cyclamen Species 0.000 description 1
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 1
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 238000007900 DNA-DNA hybridization Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000208296 Datura Species 0.000 description 1
- 241000208175 Daucus Species 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 240000003421 Dianthus chinensis Species 0.000 description 1
- 240000001879 Digitalis lutea Species 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000221079 Euphorbia <genus> Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 102000020897 Formins Human genes 0.000 description 1
- 108091022623 Formins Proteins 0.000 description 1
- 241000220223 Fragaria Species 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000208152 Geranium Species 0.000 description 1
- 241000735332 Gerbera Species 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- 241000233596 Glycine canescens Species 0.000 description 1
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 description 1
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 description 1
- 241000208818 Helianthus Species 0.000 description 1
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- 206010021929 Infertility male Diseases 0.000 description 1
- 206010061217 Infestation Diseases 0.000 description 1
- 235000021506 Ipomoea Nutrition 0.000 description 1
- 241000207783 Ipomoea Species 0.000 description 1
- 101150008942 J gene Proteins 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 241000208822 Lactuca Species 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 241000234435 Lilium Species 0.000 description 1
- 241000208204 Linum Species 0.000 description 1
- 241000209082 Lolium Species 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 241000227653 Lycopersicon Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- 241000121629 Majorana Species 0.000 description 1
- 208000007466 Male Infertility Diseases 0.000 description 1
- 241000220225 Malus Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 239000005041 Mylar™ Substances 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 241000234479 Narcissus Species 0.000 description 1
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 1
- 240000002853 Nelumbo nucifera Species 0.000 description 1
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 1
- 241001162910 Nemesia <spider> Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 241000088844 Nothocestrum Species 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 241000219830 Onobrychis Species 0.000 description 1
- 108090000417 Oxygenases Proteins 0.000 description 1
- 102000004020 Oxygenases Human genes 0.000 description 1
- 241000209117 Panicum Species 0.000 description 1
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 1
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 1
- 241000218996 Passiflora Species 0.000 description 1
- 241000208181 Pelargonium Species 0.000 description 1
- 241000209046 Pennisetum Species 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- 241000219833 Phaseolus Species 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- 241000219843 Pisum Species 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 208000020584 Polyploidy Diseases 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710113900 Protein SGT1 homolog Proteins 0.000 description 1
- 241001290151 Prunus avium subsp. avium Species 0.000 description 1
- 244000141353 Prunus domestica Species 0.000 description 1
- 235000011435 Prunus domestica Nutrition 0.000 description 1
- 240000005809 Prunus persica Species 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 241000218206 Ranunculus Species 0.000 description 1
- 241000220259 Raphanus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 235000011449 Rosa Nutrition 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 235000011034 Rubus glaucus Nutrition 0.000 description 1
- 235000009122 Rubus idaeus Nutrition 0.000 description 1
- 241000746444 Saccharum sp. Species 0.000 description 1
- 241001106018 Salpiglossis Species 0.000 description 1
- 241001496113 Santalum Species 0.000 description 1
- 235000008631 Santalum Nutrition 0.000 description 1
- 244000082988 Secale cereale Species 0.000 description 1
- 241000780602 Senecio Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- 102100027722 Small glutamine-rich tetratricopeptide repeat-containing protein alpha Human genes 0.000 description 1
- 235000002634 Solanum Nutrition 0.000 description 1
- 241000207763 Solanum Species 0.000 description 1
- 235000007230 Sorghum bicolor Nutrition 0.000 description 1
- 241000187435 Streptomyces griseolus Species 0.000 description 1
- 101150003725 TK gene Proteins 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- 241000219793 Trifolium Species 0.000 description 1
- 241001312519 Trigonella Species 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- 241000219977 Vigna Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 241000209149 Zea Species 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000006687 ag medium Substances 0.000 description 1
- 244000193174 agave Species 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 238000000540 analysis of variance Methods 0.000 description 1
- 210000003484 anatomy Anatomy 0.000 description 1
- 125000000129 anionic group Chemical group 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000001387 apium graveolens Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 239000002363 auxin Substances 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- 239000001390 capsicum minimum Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 108010079058 casein hydrolysate Proteins 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000024321 chromosome segregation Effects 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229930186364 cyclamen Natural products 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000003936 denaturing gel electrophoresis Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- 230000001819 effect on gene Effects 0.000 description 1
- 230000006353 environmental stress Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 239000012869 germination medium Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 235000015810 grayleaf red raspberry Nutrition 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 101150047832 hpt gene Proteins 0.000 description 1
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 1
- 229940097277 hygromycin b Drugs 0.000 description 1
- 238000011532 immunohistochemical staining Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000010150 least significant difference test Methods 0.000 description 1
- 231100000225 lethality Toxicity 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 238000002803 maceration Methods 0.000 description 1
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 239000006870 ms-medium Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 235000001968 nicotinic acid Nutrition 0.000 description 1
- 229960003512 nicotinic acid Drugs 0.000 description 1
- 239000011664 nicotinic acid Substances 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 1
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 1
- 230000003032 phytopathogenic effect Effects 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 150000008442 polyphenolic compounds Chemical class 0.000 description 1
- 235000013824 polyphenols Nutrition 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000004844 protein turnover Effects 0.000 description 1
- 238000004080 punching Methods 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 238000003345 scintillation counting Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000007226 seed germination Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 229910001961 silver nitrate Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 239000008174 sterile solution Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- 238000012090 tissue culture technique Methods 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 235000019155 vitamin A Nutrition 0.000 description 1
- 239000011719 vitamin A Substances 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
- 235000020234 walnut Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
The invention relates to nucleic acid sequences isolated from sugarcane and to methods of using them. In particular, the invention relates to nucleotide sequences which are derived from sugarcane polyubiquitin genes and which are capable of directing constitutive expression of a nucleic acid sequence of interest that is operably linked to the sugarcane polyubiquitin nucleotide sequences. The sugarcane polyubiquitin nucleotide sequences are useful in regulating expression of a nucleic acid sequence of interest in monocotyledonous and dicotyledonous plants.
Description
PLANT PROMOTER SEQUENCES AND
METHODS OF USE THEREOF
This work was made with Government support by the United States Department of S Agriculture. The United States Government has certain rights in this invention.
FIELD OF THE INVENTION
The invention relates to nucleic acid sequences isolated from sugarcane and to methods of using them. In particular, the inventions relates to nucleotide sequences which are derived from sugarcane polyubiquitin genes and which are capable of directing constitutive expression of a nucleic acid sequence of interest that is operably linked to the sugarcane polyubiquitin nucleotide sequences. The sugarcane polyubiquitin nucleotide sequences are useful in regulating expression of a nucleic acid sequence of interest in monocotyledonous and dicotyledonous plants.
BACKGROUND OF THE INVENTION
Much scientific effort has been directed at genetically engineering plants to produce agronomically relevant proteins. Recombinant genes for producing proteins in plants require a promoter sequence which is capable of directing protein expression in plant cells. Promoter sequences which direct high levels of protein expression in plant cells are particularly desirable since fewer numbers of transgenic plants need to be produced and screened to recover plants producing agronomically significant quantities of the target protein. In addition, high levels of protein expression aid the generation of plants which exhibit commercially important phenotypic properties, such as pest and disease resistance, resistance to environmental stress (e.g., water-logging, drought, heat, cold, light-intensity, day-length, chemicals, etc.), improved qualities (e.g., high yield of fruit, extended shelf life, uniform fruit shape and color, higher sugar content, higher vitamins C and A content, lower acidity, etc. ).
Some promoter sequences which are capable of driving expression of transgenes (e.g., selectable marker genes) in plants are known in the art and are derived from a variety of sources such as bacteria, plant DNA viruses, and plants. Promoters of bacterial origin include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids. Promoters of viral origin include the 35S and 19S RNA
promoters of cauliflower mosaic virus. Plant promoters include the ribulose-1.3-diphosphate carboxylase small subunit promoter. the phaseolin promoter, and the maize polyubiquitin promoter.
While some promoter sequences which function in plant cells are available, expression of more than one gene (e.g., a selectable marker gene and an agronomicaliy relevant gene) S which is operably linked to the same promoter sequence is likely to be hampered by homology dependent silencing of transgenes in plants. Thus, what is needed are additional promoter sequences which are capable of driving transgene expression. In particular, what is needed are promoter sequences which drive transgene expression in both monocotyledonous and dicotyledonous plant cells.
SUMMARY OF THE INVENTION
The present invention provides nucleic acid sequences having promoter activity. The nucleic acid sequences provided herein direct expression of operably linked nucleotide sequences in cells, tissues and organs of monocotyledonous and dicotyledonous plants. In one embodiment, the invention provides a substantially purified nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID NO:I, homologs of SEQ ID NO:l, homologs of the complement of SEQ ID
NO:1; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3. In a preferred embodiment, the nucleotide sequence is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another embodiment, nucleotide sequence is double-stranded. In yet another embodiment, the nucleotide sequence is single-stranded. In yet another alternative embodiment, the nucleic acid sequence is contained in a plant cell. In a preferred embodiment, the plant cell is derived from a monocotyledonous plant.
In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention also provides a substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
NO:I and the complement thereof. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alternative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID N0:7, from 245 to 7$7 of SEQ ID
N0:7, from 788 to 1020 of SEQ ID N0:7, from 1021 to 1084 of SEQ ID N0:7, from 1085 to 1168 of SEQ ID N0:7, from 1169 to 1173 of SEQ ID N0:7, from 1174 to 1648 of SEQ ID
N0:7, from 1649 to 180 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ ID NO: I , and from 445 to 1810 of SEQ ID NO: I . In a further alternative preferred embodiment, nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Further provided by the invention is a substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
N0:3 and the complement thereof. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alternative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides 1 to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID NO:10, from 3614 to 3691 of SEQ ID N0:3, from 1 to 2248 of SEQ ID N0:3, from :ZS 2249 to 2313 of SEQ ID N0:3, from 2314 to 3688 of SEQ ID N0:3, and from 1671 to 2248 of SEQ ID N0:3. In another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In yet a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, :30 wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
METHODS OF USE THEREOF
This work was made with Government support by the United States Department of S Agriculture. The United States Government has certain rights in this invention.
FIELD OF THE INVENTION
The invention relates to nucleic acid sequences isolated from sugarcane and to methods of using them. In particular, the inventions relates to nucleotide sequences which are derived from sugarcane polyubiquitin genes and which are capable of directing constitutive expression of a nucleic acid sequence of interest that is operably linked to the sugarcane polyubiquitin nucleotide sequences. The sugarcane polyubiquitin nucleotide sequences are useful in regulating expression of a nucleic acid sequence of interest in monocotyledonous and dicotyledonous plants.
BACKGROUND OF THE INVENTION
Much scientific effort has been directed at genetically engineering plants to produce agronomically relevant proteins. Recombinant genes for producing proteins in plants require a promoter sequence which is capable of directing protein expression in plant cells. Promoter sequences which direct high levels of protein expression in plant cells are particularly desirable since fewer numbers of transgenic plants need to be produced and screened to recover plants producing agronomically significant quantities of the target protein. In addition, high levels of protein expression aid the generation of plants which exhibit commercially important phenotypic properties, such as pest and disease resistance, resistance to environmental stress (e.g., water-logging, drought, heat, cold, light-intensity, day-length, chemicals, etc.), improved qualities (e.g., high yield of fruit, extended shelf life, uniform fruit shape and color, higher sugar content, higher vitamins C and A content, lower acidity, etc. ).
Some promoter sequences which are capable of driving expression of transgenes (e.g., selectable marker genes) in plants are known in the art and are derived from a variety of sources such as bacteria, plant DNA viruses, and plants. Promoters of bacterial origin include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids. Promoters of viral origin include the 35S and 19S RNA
promoters of cauliflower mosaic virus. Plant promoters include the ribulose-1.3-diphosphate carboxylase small subunit promoter. the phaseolin promoter, and the maize polyubiquitin promoter.
While some promoter sequences which function in plant cells are available, expression of more than one gene (e.g., a selectable marker gene and an agronomicaliy relevant gene) S which is operably linked to the same promoter sequence is likely to be hampered by homology dependent silencing of transgenes in plants. Thus, what is needed are additional promoter sequences which are capable of driving transgene expression. In particular, what is needed are promoter sequences which drive transgene expression in both monocotyledonous and dicotyledonous plant cells.
SUMMARY OF THE INVENTION
The present invention provides nucleic acid sequences having promoter activity. The nucleic acid sequences provided herein direct expression of operably linked nucleotide sequences in cells, tissues and organs of monocotyledonous and dicotyledonous plants. In one embodiment, the invention provides a substantially purified nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID NO:I, homologs of SEQ ID NO:l, homologs of the complement of SEQ ID
NO:1; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3. In a preferred embodiment, the nucleotide sequence is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another embodiment, nucleotide sequence is double-stranded. In yet another embodiment, the nucleotide sequence is single-stranded. In yet another alternative embodiment, the nucleic acid sequence is contained in a plant cell. In a preferred embodiment, the plant cell is derived from a monocotyledonous plant.
In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention also provides a substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
NO:I and the complement thereof. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alternative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID N0:7, from 245 to 7$7 of SEQ ID
N0:7, from 788 to 1020 of SEQ ID N0:7, from 1021 to 1084 of SEQ ID N0:7, from 1085 to 1168 of SEQ ID N0:7, from 1169 to 1173 of SEQ ID N0:7, from 1174 to 1648 of SEQ ID
N0:7, from 1649 to 180 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ ID NO: I , and from 445 to 1810 of SEQ ID NO: I . In a further alternative preferred embodiment, nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Further provided by the invention is a substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
N0:3 and the complement thereof. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alternative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides 1 to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID NO:10, from 3614 to 3691 of SEQ ID N0:3, from 1 to 2248 of SEQ ID N0:3, from :ZS 2249 to 2313 of SEQ ID N0:3, from 2314 to 3688 of SEQ ID N0:3, and from 1671 to 2248 of SEQ ID N0:3. In another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In yet a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, :30 wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention additionally provides a substantially purified nucleic acid sequence comprising the EcoRII~BaI fragment isolated from plasmid pubi4-GUS contained in E.schcrichiu coli cells deposited as NRRLB-3011 S, the complement of the fragment, homologs of the fragment, and homologs of the complement of the fragment. In a preferred embodiment, the nucleotide sequence is SEQ ID N0:7. In a more preferred embodiment, the nucleotide sequence is characterized by having promoter activity. In a yet more preferred embodiment. the promoter activity is constitutive. In an alternative yet more preferred embodiment. the nucleotide sequence is double-stranded. In another alternative more preferred embodiment, the nucleotide sequence is single-stranded. In yet another alternative more preferred embodiment, the nucleic acid sequence is contained in a plant cell. In another preferred embodiment, the plant cell is derived from a monocotyledonous plant.
In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana.
coconut, date. and hop. In an alternative preferred embodiment, the plant cell is derived from 1 S a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Also provided herein is a substantially purified nucleic acid sequence comprising the HinciIIIlXbaI fragment isolated from plasmid pubi9-GUS contained in Escherichia coli cells deposited as NRRLB-30116, the complement of the fragment, homologs of the fragment, and homologs of the complement of the fragment. In a preferred embodiment, the nucleotide sequence is SEQ ID NO:10. In an alternative preferred embodiment, the nucleotide sequence is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another alternative preferred embodiment, the nucleotide sequence is double-stranded. In yet another alternative preferred embodiment, the nucleotide sequence is single-stranded. In another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell.In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco.
tomato, soybean, and papaya.
In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana.
coconut, date. and hop. In an alternative preferred embodiment, the plant cell is derived from 1 S a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Also provided herein is a substantially purified nucleic acid sequence comprising the HinciIIIlXbaI fragment isolated from plasmid pubi9-GUS contained in Escherichia coli cells deposited as NRRLB-30116, the complement of the fragment, homologs of the fragment, and homologs of the complement of the fragment. In a preferred embodiment, the nucleotide sequence is SEQ ID NO:10. In an alternative preferred embodiment, the nucleotide sequence is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another alternative preferred embodiment, the nucleotide sequence is double-stranded. In yet another alternative preferred embodiment, the nucleotide sequence is single-stranded. In another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell.In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another more preferred embodiment, the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco.
tomato, soybean, and papaya.
The invention further provides a substantially purified nucleic acid sequence comprising a portion of the EcoRIlXbaI fragment isolated from plasmid pubi4-GUS contained in E.scherichia coli cells deposited as NRRLB-30115, and the complement of the fragment.
In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alterative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant.In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple. rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant.
In yet a more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Also provided by the invention is a substantially purified nucleic acid sequence comprising a portion of the HindIIIlXbaI fragment isolated from plasmid pubi9-GUS
contained in Escherichia coli cells deposited as NRRLB-30116, and the complement of the fragment. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another :ZO preferred embodiment, the portion is double-stranded. In yet another preferred embodiment, the portion is single-stranded. In yet another preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, '.',>_5 pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alterative more preferred embodiment, the plant cell is derived from a dicotyledonous plant.
In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention additionally provides a recombinant expression vector comprising a :30 nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID
NO:1;
SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3. In a preferred embodiment, the recombinant expression -S-vector is selected from the group consisting of pubi4-GUS, pubi9-GUS, 4PI-GUS
and 9PI-GUS.
The invention also provides a recombinant expression vector comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:l and the complement thereof.
Also provided herein is a recombinant expression vector comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID N0:3 and the complement thereof.
Additionally provided by the invention is a transgenic plant cell comprising a nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ
ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID NO:I; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3, wherein the nucleotide sequence is operably linked to a nucleic acid sequence of interest. In a preferred embodiment, the transgenic plant cell expresses the nucleic acid sequence of interest. In a more preferred embodiment, the expression is constitutive. In an alternative embodiment, the transgenic plant cell is derived from a monocotyledonous plant. In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another alternative embodiment, the transgenic plant cell is derived from a dicotyledonous plant. In a more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya. In yet another alternative embodiment, the nucleic acid sequence of interest is a sense sequence. In a more preferred embodiment, the sense sequence encodes a protein selected from the group consisting of ~-glucuronidase, luciferase, ~i-galactosidase, 1-aminocyclopropane-1-carboxylic acid deaminase, sucrose phosphate synthase, 5-enolpyruvyl-3- phosphoshikimate synthase, acetolactate synthase, RNase, wheat germ agglutinin, sweetness protein, and Bacillus thuringiensis crystal toxin proteins. In a further alternative embodiment, the nucleic acid sequence of interest is an antisense sequence. In a more preferred embodiment, the antisense sequence is selected from the group consisting of an antisense sequence to ACC synthase, to ethylene inducible sequences, and to polyphenol oxidase.
The invention further provides a transgenic plant cell comprising a nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:1 and the complement thereof.
Also provided herein is a transgenic plant cell comprising a nucleic acid sequence S comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
N0:3 and the complement thereof.
Additionally provided by the invention is a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a nucleotide sequence selected from the group consisting of SEQ
ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ 1D NO:1; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ 1D N0:3, and homologs of the complement of SEQ ID N0:3; b) operably linking the nucleic acid sequence of interest to the nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under 1 S conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell. In a preferred embodiment, the method further comprises d) identifying the transgenic plant cell. In another preferred embodiment, the method further comprises d) regenerating transgenic plant tissue from the transgenic plant cell. In an alternative preferred embodiment, the methods further comprises d) regenerating a transgenic plant from the transgenic plant cell. In another preferred embodiment, the plant cell is derived from a monocotyledonous plant. In yet a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another alterative more preferred embodiment the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention further provides a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ
ID NO:1 and the complement thereof; b) operably linking the nucleic acid sequence of interest to the portion of a nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell.
In a preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID N0:7, from 24~ to 787 of SEQ ID
N0:7, from 788 to 1020 of SEQ ID N0:7, from 1021 to 1084 of SEQ ID N0:7. from 1085 to 1168 of SEQ ID N0:7, from 1169 to 1173 of SEQ ID N0:7, from 1174 to 1648 of SEQ ID
N0:7.
from 1649 to 1805 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:I, from 379 to 444 of SEQ ID NO:1, and from 44~ to 1810 of SEQ ID NO:1.
Also provided by the invention is a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ
ID N0:3 and the complement thereof; b) operably linking the nucleic acid sequence of interest to the portion of a nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell.
In a preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides 1 to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID
NO:10, from 3614 to 3691 of SEQ ID N0:3, from 1 to 2248 of SEQ ID N0:3, from 2249 to 2313 of SEQ
ID N0:3, from 2314 to 3688 of SEQ ID N0:3, and from 1671 to 2248 of SEQ ID
N0:3..
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows a RNA gel blot of sugarcane polyubiquitin mRNA pools hybridized to gene-specific probes for Scubi 221, Scubi 511, Scubi 241, Scubi561, and Scubi 5121.
Figure 2 shows a "Reverse northern" blot of sugarcane polyubiquitin mRNA
pools.
Figure 3 shows the nucleotide sequence of the sugarcane polyubiquitin ubi4 gene.
Figure 3A shows the nucleotide sequence (SEQ ID NO:1 ) including the translation start codon and sequences upstream thereof. Figure 3B shows the nucleotide sequence (SEQ ID
N0:2) including the translation stop codon and sequences downstream thereof.
Boldface indicates putative TATA box, ATG translation initiation codon, TAA translation stop codon, and putative poly-A addition signal; underlining indicates the putative 5' and 3' UTRs, based on homology to scubi241 and 511 cDNAs; lowercase indicates that the base is as depicted but with some uncertainty (see also Tables 1 and 2 below).
Figure 4 is a diagrammatic representation of the initially-determined organization of ubi=l and ubi9 polyubiquitin genes. Numbered white boxes indicate polyubiquitin coding repeats, unnumbered white boxes are S' and 3' untranslated regions, black boxes are introns, _g_ and lines are flanking DNA. E: EcoRI; N: NruI; S: SaII; H: HindIII; M:
putative miniature inverted-repeat transposable element.
Figure 5 shows the nucleotide sequence (SEQ ID NO:S) (nucleotides 1 to 5512 of the 5551 nucleotide sequence deposited as GenBank accession number AF093504) (A) and S translated amino acid sequence (SEQ ID N0:6) (B) of sugarcane polyubiquitin ubi~l gene.
Figure 6 is a diagrammatic representation of the organization of ubi;t (GenBank accession number AF093504) and ubi9 (GenBank accession number AF093505) polyubiquitin genes. Numbered white boxes indicate polyubiquitin coding repeats, unnumbered white boxes are S' and 3' untranslated regions, black boxes are introns and lines are flanking DNA. E:
EcoRI; l4': NruI; S: SaII; H: HindIII; M: putative miniature inverted-repeat transposable element.
Figure 7 shows the nucleotide sequence of the sugarcane polyubiquitin ubi9 gene.
Figure 7A shows the nucleotide sequence (SEQ ID N0:3) including the translation start codon and sequences upstream thereof. Figure 7B shows the nucleotide sequence (SEQ ID
N0:4) including the translation stop codon and sequences downstream thereof.
Boldface indicates the putative TATA box, ATG translation start codon, TAA translation stop codon, and putative poly-A addition signal; underlining indicates the S' and 3' UTRs;
lowercase indicates that the base is most probably as depicted (see also Tables 4 and S
below).
Figure 8 shows the nucleotide sequence (SEQ ID N0:8) (GenBank accession number AF093505) (A) and translated amino acid sequence (SEQ ID N0:9) (B) of the sugarcane polyubiquitin ubi9 gene.
Figure 9 is a graph showing GUS activity following transient expression of GUS
under the control of the sugarcane ubi4 promoter (ubi4), sugarcane ubi9 promoter (ubi9), and maize ubi 1 promoter (mzubi 1 ) in sugarcane suspension cells (A, B) and tobacco leaves (C, D). Controls received no DNA. Letters within parentheses indicate least significant difference levels, i.e., mean GUS activity values with different letters being significantly different at P<_0.05 confidence level.
Figure 10 shows the nucleotide sequence (SEQ ID N0:7) of the portion of the ubi4 gene which was ligated to the gene encoding (3-glucuronidase (GUS) in plasmids pubi4-GUS
and 4PI-GUS. SEQ ID N0:7 corresponds to nucleotides 1-1802 of SEQ ID NO:S.
Figure 11 shows the nucleotide sequence (SEQ ID NO:10) of the portion of the ubi9 gene which was ligated to the gene encoding (3-glucuronidase (GUS) in plasmids pubi9-GUS
and 9PI-GUS. SEQ ID NO:10 corresponds to nucleotides 1-3688 of SEQ ID N0:8.
Figure 12 is a graph showing GUS activity following stable expression in sugarcane callus lines of GUS under the control of sugarcane ubi9 promoter (sc ubi9) (A, D), sugarcane ubi4 promoter (sc ubi4) (B, D), and maize ubil (C, D).
Figure 13 is a graph showing GUS reporter gene activity in stable transgenic rice callus lines expressed under the control of the sugarcane ubi9 promoter in the presence and absence of the putative nuclear matrix attachment region (MAR).
Figure 14 shows the nucleotide sequence (SEQ ID NO: I 1 ) encoding polyphenol oxidase (GenBank accession number s40S48).
Figure 1 S shows the nucleotide sequence (SEQ ID N0:2) encoding maize sucrose phosphate synthase enzyme (GenBank accession number m97SS0).
DEFINITIONS
To facilitate understanding of the invention, a number of terms are defined below.
The term "transgenic" when used in reference to a cell refers to a cell which contains I S a transgene, or whose genome has been altered by the introduction of a transgene. The term "transgenic" when used in reference to a tissue or to a plant refers to a tissue or plant, respectively, which comprises one or more cells that contain a transgene, or whose genome has been altered by the introduction of a transgene. Transgenic cells, tissues and plants may be produced by several methods including the introduction of a "transgene"
comprising nucleic acid (usually DNA) into a target cell or integration of the transgene into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
The term "transgene" as used herein refers to any nucleic acid sequence which is introduced into the genome of a cell by experimental manipulations. A
transgene may be an 2S "endogenous DNA sequence," or a "heterologous DNA sequence" (i. e. , "foreign DNA"). The term "endogenous DNA sequence" refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturaliy-occurring sequence. The term "heterologous DNA sequence" refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature.
Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell. Heterologous DNA also includes an endogenous DNA
sequence WO 99/4b976 PCT/US99/05985 which contains some modification. Generally, although not necessarily, heterologous DNA
encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins {e.g., proteins which confer drug resistance), etc.
The term "foreign gene" refers to any nucleic acid (e.g., gene sequence) which is introduced into the genome of a cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene contains some modification (e~.~~., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring gene.
The term "transformation" as used herein refers to the introduction of a transgene into a cell. Transformation of a cell may be stable or transient. The term "transient transformation" or "transiently transformed" refers to the introduction of one or more transgenes into a cell in the absence of integration of the transgene into the host cell's genome. Transient transformation may be detected by, for example, enzyme-linked immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one or more of the transgenes. Alternatively, transient transformation may be detected by detecting the activity of the protein (e.g., [3-glucuronidase) encoded by the transgene (e.g., the uid A
gene) as demonstrated herein [e.g., histochemical assay of GUS enzyme activity by staining with X-gluc which gives a blue precipitate in the presence of the GUS enzyme;
and a chemiluminescent assay of GUS enzyme activity using the GUS-Light kit (Tropix)). The term "transient transforrnant" refers to a cell which has transiently incorporated one or more transgenes. In contrast, the term "stable transformation" or "stably transformed" refers to the introduction and integration of one or more transgenes into the genome of a cell. Stable transformation of a cell may be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid sequences which are capable of binding to one or more of the transgenes.
Alternatively, stable transformation of a cell may also be detected by the polymerase chain reaction of genomic DNA of the cell to amplify transgene sequences. The term "stable transformant" refers to a cell which has stably integrated one or more transgenes into the genomic DNA. Thus, a stable transformant is distinguished from a transient transformant in that, whereas genomic DNA from the stable transformant contains one or more transgenes, genomic DNA from the transient transformant does not contain a transgene.
The term "nucleotide sequence of interest" refers to any nucleotide sequence, the manipulation of which may be deemed desirable for any reason (e.g., confer improved qualities), by one of ordinary skill in the art. Such nucleotide sequences include, but are not limited to. coding sequences of structural genes (e.g., reporter genes.
selection marker genes, oncogenes. drug resistance genes, growth factors, etc. ), and non-coding regulatory sequences which do not encode an mRNA or protein product, (e.g., promoter sequence, polyadenylation sequence. termination sequence, enhancer sequence, ete. ).
The term "isolated" when used in relation to a nucleic acid, as in "an isolated nucleic acid sequence" refers to a nucleic acid sequence that is identified and separated from at least one contaminant nucleic acid with which it is ordinarily associated in its natural source.
Isolated nucleic acid is nucleic acid present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA which are found in the state they exist in nature. For example. a given DNA
sequence (c'. g. , a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs which encode a multitude of proteins. However, an isolated nucleic acid sequence comprising SEQ ID NO:1 includes, by way of example, such nucleic acid sequences in cells which ordinarily contain SEQ ID NO:1 where the nucleic acid sequence is in a chromosomal or extrachromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid sequence may be present in single-stranded or double-stranded form. When an isolated nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will contain at a minimum at least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be single-stranded).
Alternatively, it may contain both the sense and anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).
As used herein, the term "purified" refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated. An "isolated nucleic acid sequence" is therefore a purif ed nucleic acid sequence. "Substantially purified" molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which they are naturally associated.
As used herein, the terms "complementary" or "complementarity" are used in reference to nucleotide sequences related by the base-pairing rules. For example, the sequence ~'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be "partial" or "total."
"Partial" complementarity is where one or more nucleic acid bases is not matched according to the base pairing rules. "Total" or "complete" complementarity between nucleic acids is where each and every nucleic acid base is matched with another base under the base pairing S rules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
A "complement" of a nucleic acid sequence as used herein refers to a nucleotide sequence whose nucleic acids show total complementarity to the nucleic acids of the nucleic acid sequence.
The term "homology" when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology (i.e., partial identity) or complete homology (i. e. , complete identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid and is referred to using the functional term "substantially homologous."
The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe (i.e., an oligonucleotide which is capable of hybridizing to another oIigonucleotide of interest) will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous sequence to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term "substantially homologous" refers to any probe which can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described infra.
When used in reference to a single-stranded nucleic acid sequence, the term "substantially homologous" refers to any probe which can hybridize to the single-stranded nucleic acid sequence under conditions of low stringency as described infra.
The term "hybridization" as used herein includes "any process by which a strand of nucleic acid joins with a complementary strand through base pairing." [Coombs J ( I 994) Dictionary n~~Biotechnology, Stockton Press, New York NY]. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementarity between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.
As used herein, the term "Tm" is used in reference to the "melting temperature." The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the Tm of nucleic acids is well known in the art. As indicated by standard references, a simple estimate of the T", value may be calculated by the equation: T," = 81.5 + 0_41 (% G + C), when a nucleic acid is in aqueous solution at I M NaCI [see e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization (1985)].
Other references include more sophisticated computations which take structural as well as sequence 1 S characteristics into account for the calculation of Tm.
Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of SX SSPE (43.8 g/1 NaCI, 6.9 g/1 NaH~P04~H~O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 1% SDS, SX Denhardt's reagent [SOX Denhardt's contains the following per 500 ml:
In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In an alternative preferred embodiment, the portion is double-stranded. In another alterative preferred embodiment, the portion is single-stranded. In yet another alternative preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant.In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple. rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alternative more preferred embodiment, the plant cell is derived from a dicotyledonous plant.
In yet a more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
Also provided by the invention is a substantially purified nucleic acid sequence comprising a portion of the HindIIIlXbaI fragment isolated from plasmid pubi9-GUS
contained in Escherichia coli cells deposited as NRRLB-30116, and the complement of the fragment. In a preferred embodiment, the portion is characterized by having promoter activity. In a more preferred embodiment, the promoter activity is constitutive. In another :ZO preferred embodiment, the portion is double-stranded. In yet another preferred embodiment, the portion is single-stranded. In yet another preferred embodiment, the nucleic acid sequence is contained in a plant cell. In a more preferred embodiment, the plant cell is derived from a monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, '.',>_5 pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In an alterative more preferred embodiment, the plant cell is derived from a dicotyledonous plant.
In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention additionally provides a recombinant expression vector comprising a :30 nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID
NO:1;
SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3. In a preferred embodiment, the recombinant expression -S-vector is selected from the group consisting of pubi4-GUS, pubi9-GUS, 4PI-GUS
and 9PI-GUS.
The invention also provides a recombinant expression vector comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:l and the complement thereof.
Also provided herein is a recombinant expression vector comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID N0:3 and the complement thereof.
Additionally provided by the invention is a transgenic plant cell comprising a nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ
ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID NO:I; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ ID N0:3, and homologs of the complement of SEQ ID N0:3, wherein the nucleotide sequence is operably linked to a nucleic acid sequence of interest. In a preferred embodiment, the transgenic plant cell expresses the nucleic acid sequence of interest. In a more preferred embodiment, the expression is constitutive. In an alternative embodiment, the transgenic plant cell is derived from a monocotyledonous plant. In a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another alternative embodiment, the transgenic plant cell is derived from a dicotyledonous plant. In a more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya. In yet another alternative embodiment, the nucleic acid sequence of interest is a sense sequence. In a more preferred embodiment, the sense sequence encodes a protein selected from the group consisting of ~-glucuronidase, luciferase, ~i-galactosidase, 1-aminocyclopropane-1-carboxylic acid deaminase, sucrose phosphate synthase, 5-enolpyruvyl-3- phosphoshikimate synthase, acetolactate synthase, RNase, wheat germ agglutinin, sweetness protein, and Bacillus thuringiensis crystal toxin proteins. In a further alternative embodiment, the nucleic acid sequence of interest is an antisense sequence. In a more preferred embodiment, the antisense sequence is selected from the group consisting of an antisense sequence to ACC synthase, to ethylene inducible sequences, and to polyphenol oxidase.
The invention further provides a transgenic plant cell comprising a nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:1 and the complement thereof.
Also provided herein is a transgenic plant cell comprising a nucleic acid sequence S comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
N0:3 and the complement thereof.
Additionally provided by the invention is a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a nucleotide sequence selected from the group consisting of SEQ
ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ 1D NO:1; SEQ ID N0:3, the complement of SEQ ID N0:3, homologs of SEQ 1D N0:3, and homologs of the complement of SEQ ID N0:3; b) operably linking the nucleic acid sequence of interest to the nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under 1 S conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell. In a preferred embodiment, the method further comprises d) identifying the transgenic plant cell. In another preferred embodiment, the method further comprises d) regenerating transgenic plant tissue from the transgenic plant cell. In an alternative preferred embodiment, the methods further comprises d) regenerating a transgenic plant from the transgenic plant cell. In another preferred embodiment, the plant cell is derived from a monocotyledonous plant. In yet a more preferred embodiment, the monocotyledonous plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In another alterative more preferred embodiment the plant cell is derived from a dicotyledonous plant. In a yet more preferred embodiment, the dicotyledonous plant is selected from the group consisting of tobacco, tomato, soybean, and papaya.
The invention further provides a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ
ID NO:1 and the complement thereof; b) operably linking the nucleic acid sequence of interest to the portion of a nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell.
In a preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID N0:7, from 24~ to 787 of SEQ ID
N0:7, from 788 to 1020 of SEQ ID N0:7, from 1021 to 1084 of SEQ ID N0:7. from 1085 to 1168 of SEQ ID N0:7, from 1169 to 1173 of SEQ ID N0:7, from 1174 to 1648 of SEQ ID
N0:7.
from 1649 to 1805 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:I, from 379 to 444 of SEQ ID NO:1, and from 44~ to 1810 of SEQ ID NO:1.
Also provided by the invention is a method for expressing a nucleic acid sequence of interest in a plant cell, comprising: a) providing: i) a plant cell; ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ
ID N0:3 and the complement thereof; b) operably linking the nucleic acid sequence of interest to the portion of a nucleotide sequence to produce a transgene; and c) introducing the transgene into the plant cell to produce a transgenic plant cell under conditions such that the nucleic acid sequence of interest is expressed in the transgenic plant cell.
In a preferred embodiment, the portion comprises the nucleotide sequence selected from the group consisting of the nucleotides 1 to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID
NO:10, from 3614 to 3691 of SEQ ID N0:3, from 1 to 2248 of SEQ ID N0:3, from 2249 to 2313 of SEQ
ID N0:3, from 2314 to 3688 of SEQ ID N0:3, and from 1671 to 2248 of SEQ ID
N0:3..
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows a RNA gel blot of sugarcane polyubiquitin mRNA pools hybridized to gene-specific probes for Scubi 221, Scubi 511, Scubi 241, Scubi561, and Scubi 5121.
Figure 2 shows a "Reverse northern" blot of sugarcane polyubiquitin mRNA
pools.
Figure 3 shows the nucleotide sequence of the sugarcane polyubiquitin ubi4 gene.
Figure 3A shows the nucleotide sequence (SEQ ID NO:1 ) including the translation start codon and sequences upstream thereof. Figure 3B shows the nucleotide sequence (SEQ ID
N0:2) including the translation stop codon and sequences downstream thereof.
Boldface indicates putative TATA box, ATG translation initiation codon, TAA translation stop codon, and putative poly-A addition signal; underlining indicates the putative 5' and 3' UTRs, based on homology to scubi241 and 511 cDNAs; lowercase indicates that the base is as depicted but with some uncertainty (see also Tables 1 and 2 below).
Figure 4 is a diagrammatic representation of the initially-determined organization of ubi=l and ubi9 polyubiquitin genes. Numbered white boxes indicate polyubiquitin coding repeats, unnumbered white boxes are S' and 3' untranslated regions, black boxes are introns, _g_ and lines are flanking DNA. E: EcoRI; N: NruI; S: SaII; H: HindIII; M:
putative miniature inverted-repeat transposable element.
Figure 5 shows the nucleotide sequence (SEQ ID NO:S) (nucleotides 1 to 5512 of the 5551 nucleotide sequence deposited as GenBank accession number AF093504) (A) and S translated amino acid sequence (SEQ ID N0:6) (B) of sugarcane polyubiquitin ubi~l gene.
Figure 6 is a diagrammatic representation of the organization of ubi;t (GenBank accession number AF093504) and ubi9 (GenBank accession number AF093505) polyubiquitin genes. Numbered white boxes indicate polyubiquitin coding repeats, unnumbered white boxes are S' and 3' untranslated regions, black boxes are introns and lines are flanking DNA. E:
EcoRI; l4': NruI; S: SaII; H: HindIII; M: putative miniature inverted-repeat transposable element.
Figure 7 shows the nucleotide sequence of the sugarcane polyubiquitin ubi9 gene.
Figure 7A shows the nucleotide sequence (SEQ ID N0:3) including the translation start codon and sequences upstream thereof. Figure 7B shows the nucleotide sequence (SEQ ID
N0:4) including the translation stop codon and sequences downstream thereof.
Boldface indicates the putative TATA box, ATG translation start codon, TAA translation stop codon, and putative poly-A addition signal; underlining indicates the S' and 3' UTRs;
lowercase indicates that the base is most probably as depicted (see also Tables 4 and S
below).
Figure 8 shows the nucleotide sequence (SEQ ID N0:8) (GenBank accession number AF093505) (A) and translated amino acid sequence (SEQ ID N0:9) (B) of the sugarcane polyubiquitin ubi9 gene.
Figure 9 is a graph showing GUS activity following transient expression of GUS
under the control of the sugarcane ubi4 promoter (ubi4), sugarcane ubi9 promoter (ubi9), and maize ubi 1 promoter (mzubi 1 ) in sugarcane suspension cells (A, B) and tobacco leaves (C, D). Controls received no DNA. Letters within parentheses indicate least significant difference levels, i.e., mean GUS activity values with different letters being significantly different at P<_0.05 confidence level.
Figure 10 shows the nucleotide sequence (SEQ ID N0:7) of the portion of the ubi4 gene which was ligated to the gene encoding (3-glucuronidase (GUS) in plasmids pubi4-GUS
and 4PI-GUS. SEQ ID N0:7 corresponds to nucleotides 1-1802 of SEQ ID NO:S.
Figure 11 shows the nucleotide sequence (SEQ ID NO:10) of the portion of the ubi9 gene which was ligated to the gene encoding (3-glucuronidase (GUS) in plasmids pubi9-GUS
and 9PI-GUS. SEQ ID NO:10 corresponds to nucleotides 1-3688 of SEQ ID N0:8.
Figure 12 is a graph showing GUS activity following stable expression in sugarcane callus lines of GUS under the control of sugarcane ubi9 promoter (sc ubi9) (A, D), sugarcane ubi4 promoter (sc ubi4) (B, D), and maize ubil (C, D).
Figure 13 is a graph showing GUS reporter gene activity in stable transgenic rice callus lines expressed under the control of the sugarcane ubi9 promoter in the presence and absence of the putative nuclear matrix attachment region (MAR).
Figure 14 shows the nucleotide sequence (SEQ ID NO: I 1 ) encoding polyphenol oxidase (GenBank accession number s40S48).
Figure 1 S shows the nucleotide sequence (SEQ ID N0:2) encoding maize sucrose phosphate synthase enzyme (GenBank accession number m97SS0).
DEFINITIONS
To facilitate understanding of the invention, a number of terms are defined below.
The term "transgenic" when used in reference to a cell refers to a cell which contains I S a transgene, or whose genome has been altered by the introduction of a transgene. The term "transgenic" when used in reference to a tissue or to a plant refers to a tissue or plant, respectively, which comprises one or more cells that contain a transgene, or whose genome has been altered by the introduction of a transgene. Transgenic cells, tissues and plants may be produced by several methods including the introduction of a "transgene"
comprising nucleic acid (usually DNA) into a target cell or integration of the transgene into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
The term "transgene" as used herein refers to any nucleic acid sequence which is introduced into the genome of a cell by experimental manipulations. A
transgene may be an 2S "endogenous DNA sequence," or a "heterologous DNA sequence" (i. e. , "foreign DNA"). The term "endogenous DNA sequence" refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturaliy-occurring sequence. The term "heterologous DNA sequence" refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature.
Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell. Heterologous DNA also includes an endogenous DNA
sequence WO 99/4b976 PCT/US99/05985 which contains some modification. Generally, although not necessarily, heterologous DNA
encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins {e.g., proteins which confer drug resistance), etc.
The term "foreign gene" refers to any nucleic acid (e.g., gene sequence) which is introduced into the genome of a cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene contains some modification (e~.~~., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring gene.
The term "transformation" as used herein refers to the introduction of a transgene into a cell. Transformation of a cell may be stable or transient. The term "transient transformation" or "transiently transformed" refers to the introduction of one or more transgenes into a cell in the absence of integration of the transgene into the host cell's genome. Transient transformation may be detected by, for example, enzyme-linked immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one or more of the transgenes. Alternatively, transient transformation may be detected by detecting the activity of the protein (e.g., [3-glucuronidase) encoded by the transgene (e.g., the uid A
gene) as demonstrated herein [e.g., histochemical assay of GUS enzyme activity by staining with X-gluc which gives a blue precipitate in the presence of the GUS enzyme;
and a chemiluminescent assay of GUS enzyme activity using the GUS-Light kit (Tropix)). The term "transient transforrnant" refers to a cell which has transiently incorporated one or more transgenes. In contrast, the term "stable transformation" or "stably transformed" refers to the introduction and integration of one or more transgenes into the genome of a cell. Stable transformation of a cell may be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid sequences which are capable of binding to one or more of the transgenes.
Alternatively, stable transformation of a cell may also be detected by the polymerase chain reaction of genomic DNA of the cell to amplify transgene sequences. The term "stable transformant" refers to a cell which has stably integrated one or more transgenes into the genomic DNA. Thus, a stable transformant is distinguished from a transient transformant in that, whereas genomic DNA from the stable transformant contains one or more transgenes, genomic DNA from the transient transformant does not contain a transgene.
The term "nucleotide sequence of interest" refers to any nucleotide sequence, the manipulation of which may be deemed desirable for any reason (e.g., confer improved qualities), by one of ordinary skill in the art. Such nucleotide sequences include, but are not limited to. coding sequences of structural genes (e.g., reporter genes.
selection marker genes, oncogenes. drug resistance genes, growth factors, etc. ), and non-coding regulatory sequences which do not encode an mRNA or protein product, (e.g., promoter sequence, polyadenylation sequence. termination sequence, enhancer sequence, ete. ).
The term "isolated" when used in relation to a nucleic acid, as in "an isolated nucleic acid sequence" refers to a nucleic acid sequence that is identified and separated from at least one contaminant nucleic acid with which it is ordinarily associated in its natural source.
Isolated nucleic acid is nucleic acid present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA which are found in the state they exist in nature. For example. a given DNA
sequence (c'. g. , a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs which encode a multitude of proteins. However, an isolated nucleic acid sequence comprising SEQ ID NO:1 includes, by way of example, such nucleic acid sequences in cells which ordinarily contain SEQ ID NO:1 where the nucleic acid sequence is in a chromosomal or extrachromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid sequence may be present in single-stranded or double-stranded form. When an isolated nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will contain at a minimum at least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be single-stranded).
Alternatively, it may contain both the sense and anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).
As used herein, the term "purified" refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated. An "isolated nucleic acid sequence" is therefore a purif ed nucleic acid sequence. "Substantially purified" molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which they are naturally associated.
As used herein, the terms "complementary" or "complementarity" are used in reference to nucleotide sequences related by the base-pairing rules. For example, the sequence ~'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be "partial" or "total."
"Partial" complementarity is where one or more nucleic acid bases is not matched according to the base pairing rules. "Total" or "complete" complementarity between nucleic acids is where each and every nucleic acid base is matched with another base under the base pairing S rules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
A "complement" of a nucleic acid sequence as used herein refers to a nucleotide sequence whose nucleic acids show total complementarity to the nucleic acids of the nucleic acid sequence.
The term "homology" when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology (i.e., partial identity) or complete homology (i. e. , complete identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid and is referred to using the functional term "substantially homologous."
The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe (i.e., an oligonucleotide which is capable of hybridizing to another oIigonucleotide of interest) will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous sequence to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term "substantially homologous" refers to any probe which can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described infra.
When used in reference to a single-stranded nucleic acid sequence, the term "substantially homologous" refers to any probe which can hybridize to the single-stranded nucleic acid sequence under conditions of low stringency as described infra.
The term "hybridization" as used herein includes "any process by which a strand of nucleic acid joins with a complementary strand through base pairing." [Coombs J ( I 994) Dictionary n~~Biotechnology, Stockton Press, New York NY]. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementarity between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.
As used herein, the term "Tm" is used in reference to the "melting temperature." The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the Tm of nucleic acids is well known in the art. As indicated by standard references, a simple estimate of the T", value may be calculated by the equation: T," = 81.5 + 0_41 (% G + C), when a nucleic acid is in aqueous solution at I M NaCI [see e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization (1985)].
Other references include more sophisticated computations which take structural as well as sequence 1 S characteristics into account for the calculation of Tm.
Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of SX SSPE (43.8 g/1 NaCI, 6.9 g/1 NaH~P04~H~O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 1% SDS, SX Denhardt's reagent [SOX Denhardt's contains the following per 500 ml:
5 g Ficoll (Type 400, Pharmacia), S g BSA (Fraction V; Sigma)] and 100 Pg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.2X SSPE, and 0.1%
SDS at room temperature when a DNA probe of about 100 to about 1000 nucleotides in length is employed.
High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of SX SSPE, 1 % SDS, SX Denhardt's reagent and 100 ~tg/ml denatured salmon sperm DNA
followed by washing in a solution comprising 0.1 X SSPE, and 0.1 % SDS at 68°C when a probe of about 100 to about 1000 nucleotides in length is employed.
The term "equivalent" when made in reference to a hybridization condition as it relates to a hybridization condition of interest means that the hybridization condition and the hybridization condition of interest result in hybridization of nucleic acid sequences which have the same range of percent (%) homology. For example, if a hybridization condition of interest results in hybridization of a first nucleic acid sequence with other nucleic acid sequences that have from 50% to 70% homology to the first nucleic acid sequence, then another hybridization condition is said to be equivalent to the hybridization condition of interest if this other hybridization condition also results in hybridization of the first nucleic acid sequence with the other nucleic acid sequences that have from 50% to 70%
homology to the first nucleic acid sequence.
When used in reference to nucleic acid hybridization the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions:
factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above-listed conditions.
Those skilled in the art know that whereas higher stringencies may be preferred to I S reduce or eliminate non-specific binding between the nucleotide sequence of SEQ ID NOs: l and 10 and other nucleic acid sequences, lower stringencies may be preferred to detect a larger number of nucleic acid sequences having different homologies to the nucleotide sequence of SEQ ID NOs:I and 10.
The term "promoter," "promoter element," or "promoter sequence" as used herein, refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA. A
promoter is typically, though not necessarily, located 5' (i. c., upstream) of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription.
Promoters may be tissue specific or cell specific. The term "tissue specific"
as it applies to a promoter refers to a promoter that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., petals) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., roots). Tissue specificity of a promoter may be evaluated by, for example, operably linking a reporter gene to the promoter sequence to generate a reporter construct, introducing the reporter construct into the genome of a plant such that the reporter construct is integrated into every tissue of the resulting transgenic plant, and detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues of the transgenic plant. The detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression of the reporter gene in other tissues shows that the promoter is specific for the tissues in which greater levels of S expression are detected. The term "cell type specific" as applied to a promoter refers to a promoter which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression of the same nucleotide sequence of interest in a different type of cell within the same tissue. The term "cell type specific" when applied to a promoter also means a promoter capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue. Cell type specificity of a promoter may be assessed using methods well known in the art, e.g., immunohistochemical staining. Briefly, tissue sections are embedded in paraffin. and paraffin sections are reacted with a primary antibody which is specific for the polypeptide product encoded by the nucleotide sequence of interest whose expression is controlled by the promoter. A labeled (e.g., peroxidase conjugated) secondary antibody which is specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy.
Promoters may be constitutive or regulatable. The term "constitutive" when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, light, etc. ). Typically, constitutive promoters are capable of directing expression of a transgene in substantially any cell and any tissue. In contrast, a "regulatable" promoter is one which is capable of directing a level of transcription of an operably linked nuclei acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, light, etc.) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus.
The terms "infecting" and "infection" with a bacterium refer to co-incubation of a target biological sample, (e.g., cell, tissue, etc.) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells of the target biological sample.
The term "Agrobacterium" refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall. The term "Agrobacterium"
includes, but is not limited to, the strains Agrobacterium tumefaciens, (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine etc.) by the infected cell.
Thus, Agrobacterium strains which cause production of nopaline (e.g.' strain LBA4301, C58, A208) are referred to as "nopaline-type" Agrobacteria; Agrobacterium strains which cause production of octopine (c~.~~. ~ strain LBA4404, AchS, B6) are referred to as "octopine-type"
Agrobacteria; and Agrobacterium strains which cause production of agropine (e.g., strain EHA10~, EHA101, A281 ) are referred to as "agropine-type" Agrobacteria.
The terms "bombarding, "bombardment," and "biolistic bombardment" refer to the process of accelerating particles towards a target biological sample (e.g., cell, tissue, etc.) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample. Methods for biolistic bombardment are known in the art (e.g., U.S. Patent No. 5,584,807, the contents of which are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile I S accelerator (PDS-1000/He) (BioRad).
The term "microwounding" when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle bombardment as described herein.
The term "plant" as used herein refers to a plurality of plant cells which are largely differentiated into a structure that is present at any stage of a plant's development. Such structures include, but are not limited to, a fruit, shoot, stem, leaf, flower petal, etc. The term "plant tissue" includes differentiated and undifferentiated tissues of plants including, but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various types of cells in culture (e. g., single cells, protoplasts, embryos, callus, protocorm-like bodies, etc.). Plant tissue may be in plants, in organ culture, tissue culture, or cell culture.
DESCRIPTION OF THE INVENTION
The present invention provides nucleic acid sequences having promoter activity. The nucleic acid sequences provided herein direct constitutive expression of operably linked nucleotide sequences in cells, tissues and organs of monocotyledonous and dicotyledonous plants. The promoter sequences provided herein were discovered by screening highly expressed sugarcane polyubiquitin genes. The sequences of the invention are capable of driving expression of operably linked nucleotide sequences in plant cells at a level which is comparable to or greater than those levels expressed under the control of the prior art's maize polyubiquitin promoter sequences. Also provided by the invention are methods for constitutive expression of a nucleic acid sequence of interest in plants. The nucleic acid sequences and methods of the invention allow generation of transgenic plants which exhibit S agronomically desirable characteristics.
The invention is further described under (A) Sugarcane Polyubiquitin Promoter Sequences, (B) Using Probes To Identify And Isolate Homologs of The Sugarcane Promoter Sequences, (C) Using Primers to Amplify Nucleotide Sequences, and (D) Generating Transgenic Plants.
A. Sugarcane Polyubiquitin Promoter Sequences Ubiquitin involvement has been shown in protein turnover, heat shock response, and many other important cellular processes [Hershko et al., Annual Review of Biochemistry 61, 761-808 (1992)]. Consistent with its essential biological roles, the structure of the protein is I S very highly conserved in all eukaryotes [Callis et al., Genetics 139, 921-39 (1995); Callis c~t al., Oxford Surv Plant Mol Cell Biol 6, 1-30 (1989); Sun et al., Plant J 11, 1017-27 (1997)].
Many genes encoding ubiquitin contain various numbers of tandem repeats of the entire protein coding region and hence are called polyubiquitin genes. The primary translation product is a polyprotein, which is processed to form ubiquitin monomers post-translationally.
Ubiquitin protein is abundant throughout the plant body, and several polyubiquitin genes have been shown to be expressed in most or all cell types under most or all environmental conditions [Kawalleck et al., Plant Mol Biol 21, 673-84 (1993)].
Numerous other polyubiquitin genes, however, are expressed in a tissue-specific manner [Callis et al., Proc Natl Acad Sci U S A 91, 6074-7 (1994); Plesse et al., Mol Gen Genet 254, ( 1997)), or in response to environmental signals such as heat stress [Christensen et al., Plant Molecular Biology 12, 619-632 ( 1989); Liu et al., Biochem Cell Biol 73, 19-30 ( 1995)], or both [Almoguera et al., Plant Physiology 107, 765-773 (1995); Binet et al., Plant Mol Biol 17, 395-407 ( 1991 ); Burke et al., Mol Gen Genet 213, 43 S-43 ( 1988);
Garbarino et al., Plant Mol Biol 2U, 235-44 ( 1992); Genschik et al., Gene 148, 195-202 (1994); Sun et al. ( 1997) supra; Takimoto et ul., Plant Mol Biol 26, 1007-12 (1994)). Several polyubiquitin promoters have been isolated and used to drive transgene expression [Christensen et al., Plant Mol Biol 18, 675-89 (1992); Garbarino et al., Plant Physiology 109, 1371-1378 (1995)), including some which have been widely used for constitutive expression of genes in plant transformation [Christensen et al., Transgenic Research 5, 213-218 ( 1996):
Gallo-Meagher et ul., Plant Cell Reporter 12, 666-670 (1993); Garbarino et al. (1995) supra:
Taylor et al., Plant Cell Reports 12, 491-495 (1993); Quail et al., U.S. Patent Numbers 5.614,399 and 5,510,474].
S All Saccharum species are polyploid and most are at least octoploid (2N=40-128 or more). Commercial sugarcane cultivars are all Saccharum species hybrids derived from 2N+N chromosome transmission [Sreenivasan et al.: Cytogenics. In: Heinz DJ
(ed) Sugarcane Improvement Through Breeding, pp. 211-253. Elsevier, Amsterdam (1987)]. This suggests that sugarcane hybrids may have at least twelve alleles for each of the multiple genes that make up the polyubiquitin gene family.
The nucleic acid sequences of the invention were discovered during a search by the inventors for promoters which are suitable for high-level constitutive transgene expression in monocotyledonous and dicotyledonous plants. The inventors isolated five polyubiquitin cDNA clones (scubi221, 241, 511, 561, and 5121) from sugarcane stem tissue [Albert et al., Plant Physiology 109, 337 (1995)]. Based on comparison of their 3' untranslated sequences, the inventors then grouped these five genes into four "sub-families." The inventors' investigation of the expression of two members of these four sub-families and isolated genomic clones, led to isolation of clones which contained promoters for two members of the most highly expressed sub-family.
The invention provides the nucleic acid sequence of two members of the sugarcane polyubiquitin family, the ubi4 gene and the ubi9 gene. Referring to the ubi4 gene, the initially determined nucleic acid sequence (SEQ ID NO:1 ) of the ubi4 gene including the translation start codon (ATG) and the sequence upstream of the translation start codon is shown in Figure 3A, and the nucleic acid sequence (SEQ ID N0:2) of the ubi4 gene including the translation stop codon and sequences downstram of the translation stop codon is shown in Figrue 3B. The subsequently determined nucleic acid sequence (SEQ ID
NO:S) of the entire ubi~ gene is shown in Figure SA with the subsequently determined nucleic acid sequence (SEQ ID N0:7) located upstream of the translation start codon of the ubi4 gene being shown in Figure 10. The nucleotide sequence of Figure SA represents nucleotides 1 to 5512 of the 5551 nucleotide sequence deposited as GenBank accession number AF093504.
Fragments of the ubi4 gene sequence which were identical in the initially determined and the subsequently determined nucleic acid sequences were as follows (the nucleotide numbers refer to the nucleotide number in SEQ ID NO:S): Fragment A: 1-242;
fragment B:
245-787; fragment C: 788-1020; fragment D: 1021-1084; fragment E: 1085-1168;
fragment F:
1169-1173; fragment G: 1174-1648; and fragment H: 1649-1805. The nucleotide sequence upstream of the transcription start codon of the ubi4 gene (i.e., nucleotides I-1810 of SEQ ID
NO:1, and nucleotides 1-1802 of SEQ ID NOs:S and 7) contained three regions:
(a) an upstream of the 5' UTR sequence (i.e., nucleotides 1-378 of SEQ ID NO:l, and nucleotides 1-377 of SEQ ID NOs:~ and 7), (b) a 5' UTR sequence (i.e.. nucleotides 379-444 of SEQ ID
NO:1. and nucleotides 378-442 of SEQ ID NOs:S and 7); and (c) an intron sequence (i.e., nucleotides 445-1810 of SEQ ID NO:I, and nucleotides 443-1802 of SEQ ID NOs:~
and 7).
With respect of the ubi9 gene, the initially determined nucleic acid sequence (SEQ ID
N0:3) of the ubi9 gene including the translation start codon and sequences upstream thereof is shown in Figure 7A and the initially determined nucleic acid sequence (SEQ
ID N0:4) of the ubi9 gene including the translation stop codon and sequences downstream thereof is shown in Figure 7B. The subsequently determined nucleic acid sequence (SEQ ID
N0:8) of the entire a~bi9 gene is shown in Figure 8A, with the nucleic acid sequence (SEQ ID NO:10) I S of the ubi9 gene upstream of the translation start codon being shown in Figure 11.
It is noted that while the 5' end of the ubi9 gene was obtained by cleavage with HindIII which recognizes the sequence 5'-AAGCTT-3', repeated sequencing of the 5'-end of the ubi9 gene showed instead the sequence 5'-AAGTTT-3' (Figures 8 and 11 ).
Thus, it is the inventors' view that the sequence of the full-length ubi9 gene (a) is as shown in Figure 8A (SEQ ID N0:8) (i.e., total number of nucleotides being 5174, with the ten nucleotides at the 5'-end being S'-AAGTTTTGnT-3'), (b) is as shown in Figure 8A (SEQ ID N0:8) with the exception that it has a total number of nucleotides of 5174, with the ten nucleotides at the 5'-end being 5'-AAGCTTTGnT-3', assuming substitution of the "T" at position 4 with a "C"), or (c) is as shown in Figure 8A (SEQ ID N0:8) with the exception that it has a total number of nucleotides of 5175, with the ten nucleotides at the 5'-end being 5'-AAGCTTTTGn-3', assuming insertion of a "C" at position 4. Accordingly, any reference herein to SEQ ID N0:8 is intended to mean each and every one of the three nucleotide sequences described in the preceding sentence.
Similarly, it is the inventors' view that the sequence upstream of the translation start codon of the ubi9 gene (a) is as shown in Figure 11 (SEQ ID NO: I 0) (i. e. , total number of nucleotides being 3688, with the ten nucleotides at the 5'-end being 5'-AAGTTTTGnT-3'), (b) is as shown in Figure 1 I (SEQ ID NO:10) with the exception that it has a total number of nucleotides of 3688, with the ten nucleotides at the S'-end being S'-AAGCTTTGnT-3', assuming substitution of the "T" at position 4 with a "C"). or (c) is as shown in Figure 11 (SEQ ID NO:10) with the exception that it has a total number of nucleotides of 3689, with the ten nucleotides at the 5'-end being 5'-AAGCTTTTGn-3', assuming insertion of a "C" at position 4. Accordingly, any reference herein to SEQ ID NO:10 is intended to mean each and every one of the three nucleotide sequences described in the preceding sentence.
Fragments of the ubi9 gene sequence which were identical in the initially determined and the subsequently determined nucleic acid sequences were as follows (the nucleotide number refers to the nucleotide number in SEQ ID N0:8): Fragment A: 1-3600:
fragment B:
3602-361?: and fragment C: 3614-3691. The nucleotide sequence upstream of the translation start codon of the ubi9 gene (i.e., nucleotides I-3691 of SEQ ID N0:3, and nucleotides 1-3691 of SEQ ID N0:8) contained three regions: (a) an upstream of the 5' UTR
sequence (i.e., nucleotides I-2248 of SEQ ID N0:3, and nucleotides 1-2248 of SEQ ID
NOs:8 and 10), (b) a S' UTR sequence (i.e., nucleotides 2249-2313 of SEQ ID N0:3, and nucleotides 2249-2313 of SEQ ID NOs:8 and 10), and (c) an intron sequence (i.e., nucleotides 2314-3688 of SEQ ID N0:3. and nucleotides 2314-3688 of SEQ ID NOs:B and 10). A BLAST search of the GenBank database showed 100% homology to only a 20-22 by region of nucleotides 1-3688 of SEQ ID NOs:8 and 10 (i. e, the region of the ubi9 gene which contained the sequence upstream of the 5' UTR, the 5' UTR sequence, and the intron sequence), 100%
homology to only a 20 by region of nucleotides 1-2248 of SEQ ID NOs:8 and 10 (i.e., the sequence upstream of the 5' UTR), and 87% homology between a 135 by fragment of the sequence upstream of the 5' UTR of SEQ ID NOs:8 and 10 and a 118 by fragment of Saccharum sp.
glucose transporter mRNA, 3' end (GenBank accession number L21752).
Data presented herein demonstrates that plasmids which contain the uid A gene encoding ~3-glucuronidase (GUS) under the control of SEQ ID N0:7 (which is equivalent to the initially determined SEQ ID NO:I) of the ubi4 gene or under the control of SEQ ID
NO:10 (which is equivalent to the initially determined SEQ ID N0:3) of the ubi9 gene successfully drive transient expression of GUS in monocotyledonous sugarcane suspension cultured cells (Example 3), monocotyledonous sorghum callus (Example 5), monocotyledonous pineapple leaves, protocorm-like bodies, roots and fruit (Example 6), and in dicotyledonous tobacco leaves (Example 4), as well as stable expression in monocotyledonous sugar cane callus (Example 7), monocotyledonous rice callus (Example 8), and dicotyledonous tobacco leaves (Example 9).
The present invention is not limited to SEQ ID NO:1 but specifically contemplates portions thereof. As used herein the term "portion" when made in reference to a nucleic acid sequence refers to a fragment of that sequence. The fragment may range in size from ten ( 10) contiguous nucleotide residues to the entire nucleic acid sequence minus one nucleic acid S residue. Thus. a nucleic acid sequence comprising "at least a portion of a nucleotide sequence comprises from ten ( 10) contiguous nucleotide residues of the nucleotide sequence to the entire nucleotide sequence.
In a preferred embodiment, portions contemplated to be within the scope of the invention include, but are not limited to, portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases; the sequence upstream of the ~' UTR sequence (i.c~.. nucleotide sequence from position 1 to 377 of SEQ ID N0:7), the S' UTR
sequence (i.e.. nucleotides sequence from position 378 to 442 of SEQ ID N0:7), and the intron sequence (i.e., nucleotide sequence from position 443 to 1802 of SEQ ID N0:7).
In an alternative preferred embodiment, portions within the scope of the invention include portions 1 S larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, those sequences which are upstream of the translation start codon and which are identical in the initially determined and subsequently determined sequence of the ubi=J gene, and are exemplified by the nucleotide sequence from position 1 to 242, from position 24S to 787, from position 788 to 1020, from position 1021 to 1084, from position 1085 to 1168, from position 1169 to 1173, from position 1174 to 1648, and from position 1649 to 1802 of SEQ
ID N0:7. In yet another alternative preferred embodiment, the portion contains the 377 by sequence which is upstream of the S'UTR and which is highly homologous (>90%
identity) in both the ubi4 and ubi9 gene sequences, i.e., the nucleotide sequence from position 1 to 377 of SEQ ID N0:7.
2S It is contemplated that the present invention is not limited to SEQ ID N0:3 but specifically includes portions thereof. In a preferred embodiment, portions contemplated to be within the scope of the invention include, but are not limited to, portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, the sequence upstream of the S' UTR sequence (i. e., nucleotide sequence from position 1 to 2248 of SEQ
ID NO:10), the S' UTR sequence (i. e. , nucleotides sequence from position 2249 to 2313 of SEQ ID
NO:10), and the intron sequence (i.c~., nucleotide sequence from position 2314 to 3688 of SEQ ID NO:10). In an alternative preferred embodiment, portions within the scope of the invention include portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, those sequences which are upstream of the translation start codon and which are identical in the initially determined and subsequently determined sequence of the ubi9 gene, and are exemplified by the nucleotide sequence from position 1 to 3600, from position 3602 to 3612, and from position 3614 to 3688 of SEQ ID NO:10. In yet another alternative preferred embodiment, the portion contains the sequence which is upstream of the transcription start codon and which is highly homologous (>90% identity) in both the ubi.~
and ubi9 gene sequences. i.c~., the nucleotide sequence from position 1671 to 2248 of SEQ ID
NO:10. This sequence includes the MITE (from position 1706 to 1906) which is not homologous to sequences in the polyubiquitin ubi4 promoter.
The sequences of the present invention are not limited to SEQ ID NOs:I and 10 and portions thereof, but also include homologs of SEQ ID NOs:I and 10, as well as portions of these homologs. A nucleotide sequence which is a "homolog" of SEQ ID NOs:I and 10 is defined herein as a nucleotide sequence which exhibits greater than 61 %
identity (but not 100% identity) to the sequence of SEQ ID NOs:I and 10, respectively.
The present invention also contemplates functioning or functional homologs of SEQ
ID NOs:I and 10. A "functional homolog" of SEQ ID NOs:I and 10 is defined as a nucleotide sequence having less than 100% homology with SEQ ID NOs:I and 10, respectively, and which has promoter activity having some or all the characteristics (e.g., constitutive promoter activity) of the promoter activity of SEQ ID NOs:I and 10, respectively. Homologs of SEQ ID NOs:I and 10, and of portions thereof, include, but are not limited to, nucleotide sequences having deletions, insertions or substitutions of different nucleotides or nucleotide analogs as compared to SEQ ID NOs:I and 10, respectively. Such homologs may be produced using methods well known in the art.
The invention also contemplates at least a portion of SEQ ID NOs:I and 10, and homologs thereof having promoter activity. The term "promoter activity" when made in reference to a nucleic acid sequence refers to the ability of the nucleic acid sequence to initiate transcription of an operably linked nucleotide sequence into mRNA.
The terms "operably linked," "in operable combination," and "in operable order" as used herein refer to the linkage of nucleic acid sequences in a manner such that a nucleic acid molecule is capable of directing the transcription of nucleic acid sequence of interest and/or the synthesis of a polypeptide sequence of interest.
Promoter activity may be determined using methods known in the art. For example, a candidate nucleotide sequence whose promoter activity is to be determined is ligated in-frame to a nucleic acid sequence of interest (c~.g., a reporter gene sequence, a selectable marker gene sequence) to generate a reporter vector, introducing the reporter vector into plant tissue using methods described herein, and detecting the expression of the reporter gene (e.g., detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by S the reporter gene). The reporter gene may express a visible markers.
Reporter gene systems which express visible markers include (3-glucuronidase and its substrate (X-Gluc), luciferase and its substrate (luciferin), and (3-galactosidase and its substrate (X-Gal) which are widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system [Rhodes CA et ul.
( 1995) Methods Mol Biol 55:121-131]. In a preferred embodiment, the reporter gene is a GUS
gene. The selectable marker gene may confer antibiotic or herbicide resistance. Examples of reporter genes include, but are not limited to, dhfr which confers resistance to methotrexate [Wigler M
et al., (1980) Proc Natl Acad Sci 77:3567-70J; npt, which confers resistance to the aminoglycosides neomycin and G-418 [Colbere-Garapin F et al., (1981) J. Mol.
Biol.
150:1-14] and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, respectively. Detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by the reporter gene or the selectable marker gene indicates that the candidate nucleotide sequence has promoter activity.
Sequences within a promoter which affect promoter activity may be determined by using deletion constructs such as those described by Sherri et al. for the determination of HSP70 intron alterations which impact transcription of genes operably linked thereto [U.S.
patent number 5,593,874, hereby incorporated by reference]. Briefly, several expression plasmids are constructed to contain a reporter gene under the regulatory control of different candidate nucleotide sequences which are obtained either by restriction enzyme deletion of internal sequences in SEQ ID NOs:I and 10, restriction enzyme truncation of sequences at the S' andlor 3' end of SEQ ID NOs:l and 10, or by the introduction of single nucleic acid base changes by PCR into SEQ ID NOs:I and 10. Expression of the reporter gene by the deletion constructs is detected. Detection of expression of the reporter gene in a given deletion construct indicates that the candidate nucleotide sequence in that deletion construct has promoter activity.
At the 3' end of the nucleic acid sequence of interest, other DNA sequences may also be included, e.g., a 3' untranslated region containing a polyadenylation site and transcription termination sites.
The present invention is not limited to sense molecules of SEQ ID NOs:I and 10 but contemplates within its scope antisense molecules comprising a nucleic acid sequence complementary to at least a portion (e.g., a portion greater than 100 nucletide bases in length and more preferably greater than 200 nucleotide bases in length) of the nucleotide sequence of SEQ ID NOs:I and 10. These antisense molecules find use in, for example, reducing or preventing expression of a gene whose expression is controlled by SEQ ID NOs:I
and 10.
The nucleotide sequence of SEQ ID NOs:I and 10, portions, homologs and antisense sequences thereof may be synthesized by synthetic chemistry techniques which are commercially available and well known in the art [see Caruthers MH et al., ( 1980) Nuc.
Acids Res. Symp. Ser. 215-223; Horn T. et al., ( 1980) Nuc. Acids Res. Symp.
Ser. 225-232].
Additionally. fragments of SEQ ID NOs:I and 10 can be made by treatment of SEQ
ID
NOs:I and 10 with restriction enzymes followed by purification of the fragments by gel electrophoresis. Alternatively, sequences may also produced using the polymerase chain reaction (PCR) as described by Mullis [U.S. Patent Nos. 4,683,195, 4,683,202 and 4,965,188, all of which are hereby incorporated by reference]. SEQ ID NOs:I and 10, portions, homologs and antisense sequences thereof may be ligated to each other or to heterologous nucleic acid sequences using methods well known in the art.
The nucleotide sequence of synthesized sequences may be confirmed using commercially available kits as well as using methods well known in the art which utilize enzymes such as the Klenow fragment of DNA polymerase I, Sequenase~', Tag DNA
polymerase, or thermostable T7 polymerase. Capillary electrophoresis may also be used to analyze the size and confirm the nucleotide sequence of the products of nucleic acid synthesis, restriction enzyme digestion or PCR amplification.
It is readily appreciated by those in the art that the sequences of the present invention may be used in a variety of ways. For example, these sequences are useful in directing the expression of polypeptide sequences in vitro and in vivo. In plants, this is useful in determining the role of the polypeptide in disease development as well an in producing transgenic plants with desirable agronomic characteristics as described below.
In addition, portions of the sequences of the invention can be used as probes for the detection and isolation of complementary DNA sequences, and for the amplification of nucleotide sequences as described below.
B. Using Probes To Identify And Isolate Homologs of The Sugarcane Promoter Sequences The invention provided herein is not limited to SEQ ID NO:I and 10, homologs thereof. and portions thereof. having promoter activity, but includes sequences having no promoter activity (i.e., non-functional homologs and non-functional portions of homologs).
This may be desirable, for example, where a portion of SEQ ID NOs:I and 10 is used as a probe to detect the presence of SEQ ID NOs:I and 10, respectively, or of portions thereof in a sample.
As used herein, the term "probe" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to a nucleotide sequence of interest. A
probe may be single-stranded or double-stranded. It is contemplated that any probe used in the present invention will be labelled with any "reporter molecule," so that it is detectable in any detection system including, but not limited to enzyme (e.g.. ELISA, as well as enzyme-I ~ based histochemical assays), fluorescent, radioactive, calorimetric, gravimetric, magnetic, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.
The probes provided herein are useful in the detection, identification and isolation of, for example, sequences such as those listed as SEQ ID NOs:I and 10 as well as of homologs thereof. Preferred probes are of sufficient length (e.g., from about 9 nucleotides to about 20 nucleotides or more in length) such that high stringency hybridization may be employed. In one embodiment, probes from 20 to 50 nucleotide bases in length are employed.
C. Using Primers to Amplify Nucleotide Sequences The invention provided herein is not limited to SEQ ID NO:1 and 10, homologs thereof, and portions thereof, having promoter activity, but includes sequences having no promoter activity. This may be desirable, for example, where a portion of the nucleic acid sequences set forth as SEQ ID NOs:I and 10 is used as a primer for the amplification of nucleic acid sequences by, for example, polymerise chain reactions (PCR) or reverse transcription-polymerise chain reactions (RT-PCR). The term "amplification" is defined as the production of additional copies of a nucleic acid sequence and is generally carried out using polymerise chain reaction technologies well known in the art [Dieffenbach CW and GS
Dveksler ( 1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview NY]. As used herein, the term "polymerise chain reaction" ("PCR") refers to the method of K.B. Mullis disclosed in U.S. Patent Nos. 4,683,19, 4,683,202 and 4,96.188.
all of which are hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA
without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerise. The two primers are complementary to their respective strands of the double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule.
Following annealing, the primers are extended with a polymerise so as to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerise extension can be repeated many times (i.e., denaturation, annealing and extension constitute one "cycle": there can be numerous "cycles") to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the "polymerise chain reaction"
(hereinafter "PCR"). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are the to be "PCR
amplified."
With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; and/or incorporation of'ZP-labeled deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any nucleotide sequence can be amplified with the appropriate set of primer molecules. In particular, the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications. Amplified target sequences may be used to obtain segments of DNA (e.g., genes) for the construction of targeting vectors, transgenes, etc.
As used herein, the term "primer" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i. e. , in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products.
Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long (e.g., from about 9 nucleotides to about 20 nucleotides or more in length) to prime the synthesis of extension products in the presence of the inducing agent.
Suitable lengths of the primers may be empirically determined and depend on factors such as temperature, source of primer and the use of the method. In one embodiment, the present invention employs probes from 20 to SO nucleotide bases in length.
The primers contemplated by the invention are useful in, for example, identifying sequences which are homologous to the sugarcane ubi4 and ubi9 gene sequences in plants and in other organisms.
D. Generating Transgenic Plants The present invention provides methods for constitutively expressing a nucleotide sequence of interest in a cell, tissue, organ, and/or organism. In one embodiment, the methods provided herein direct constitutive expression of a nucleotide sequence of interest in monocotyledonous and dicotyledonous plant cells. In one embodiment, this is accomplished by introducing into a plant cell a vector that contains a nucleotide sequence of interest operably linked to sequences provided herein which have promoter activity. The transformed plant cell is allowed to develop into a transgenic plant in which the nucleotide sequence of interest is preferably, though not necessarily, expressed in substantially every tissue. These steps are further described below for specific embodiments.
1. Expression Vectors For Plants In one embodiment, the methods of the invention involve transformation of monocotyledonous tissue (sugarcane suspension cultured cells, sugarcane callus, rice callus, maize embryos, pineapple leaves, protocorm-like bodies, roots and fruit, and sorghum callus) and dicotyledonous tissue (tobacco leaves, tomato plants, and soybean excised embryonic meristems) with expression vectors in which the ~i-glucuronidase (GUS) gene is under the transcriptional control of the exemplary sugarcane ubi-! gene promoter sequence (SEQ ID
NO:1) or of the exemplary sugarcane ubi9 gene promoter sequence (SEQ ID N0:3).
As used herein, the terms "vector" and "vehicle" are used interchangeably in reference to nucleic acid molecules that transfer DNA segments) from one cell to another. The term "expression vector" as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism.
The methods of the invention are not limited to the expression vectors disclosed herein. Any expression vector which is capable of introducing a nucleic acid sequence of interest into a plant cell is contemplated to be within the scope of this invention. Typically, expression vectors comprise the nucleic acid sequence of interest as well as companion sequences which allow the transcription of this sequence, and which allow cloning of the vector into a bacterial or phage host. The vector preferably, though not necessarily. contains an origin of replication which is functional in a broad range of prokaryotic hosts. A
1 S selectable marker is generally, but not necessarily, included to allow selection of cells bearing the desired vector.
In a preferred embodiment, the promoter sequence is SEQ ID NO:1 which is derived from the sugarcane ubi4 gene. In an alternative preferred embodiment, the promoter sequence is SEQ ID N0:3 which is derived from the sugarcane ubi9 gene.
However, the invention is not limited to the promoter sequences used herein. Any sequence which is a portion, homolog, or a homolog of a portion of SEQ ID NOs:I and 10 and which has promoter activity is contemplated to be within the scope of the invention.
In addition to a promoter sequence, the expression vector preferably contains a transcription termination sequence downstream of the nucleic acid sequence of interest to provide for efficient termination. Exemplary termination sequences include the nopaline synthase (NOS) termination sequence, and different fragments of the sugarcane ribulose-1,5-biphosphate carboxylase/oxygenase (rubisco) small subunit (scrbcs) gene. The termination sequences of the expression vectors are not critical to the invention. The termination sequence may be obtained from the same gene as the promoter sequence or may be obtained from different genes.
If the mRNA encoded by the nucleic acid sequence of interest is to be efficiently translated, polyadenylation sequences are also commonly added to the expression vector.
Examples of the polyadenylation sequences include, but are not limited to, the Agrobacterium WO 99/469?6 PCT/US99/05985 octopine synthase signal, or the nopaline synthase signal. Where it is preferred that the nucleic acid sequence of interest is not translated into a polypeptide (c~.g..
where the nucleic acid sequence of interest encodes an antisense RNA), polyadenylation signals are not necessary.
Vectors for the transformation of plant cells are not limited to the type or nature of the expressed genes disclosed herein. Any nucleic acid sequence of interest may be used to create transgenic plant cells, tissues, organs, and plants. Nucleic acid sequences of interest include sequences which encode a protein of interest. The terms "protein of interest" and "polypeptide of interest" refer to any protein or polypeptide, respectively.
the manipulation of which may be deemed desirable for any reason, by one of ordinary skill in the art.
For example, it may be desirable to express a nucleic acid sequence which encodes a polypeptide sequence having, for example, enzyme activity. One example of such an enzyme is the 1-aminocyclopropane-1-carboxylic acid (ACC) deaminase enzyme which metabolizes ACC in plant tissue thereby lowering the level of ethylene which is responsible for fruit ripening (U.S. Patent No. 5,512,466, the contents of which are hereby incorporated by reference).
Another enzyme which may be desirably expressed in a plant is the sucrose phosphate synthase enzyme which increases the level of sucrose in the fruit. The nucleic acid sequence of the gene encoding this enzyme is known [e. g., in maize; Worrell et al.
(1991) Plant Cell 3:1121-1130] (Figure 15) (SEQ ID N0:12) and has been assigned GenBank accession number m97550.
Yet another example of a suitable enzyme for use in this invention is EPSP
synthase (S-enolpyruvyl-3- phosphoshikimate synthase; EC:25.1.19) which is an enzyme involved in the shikimic acid pathway of plants. The shikimic acid pathway provides a precursor for the synthesis of aromatic amino acids essential to the plant. Specifically, EPSP
synthase catalyzes the conversion of phosphoenol pyruvate and 3-phosphoshikimic acid to 5-enolpyruvyl-3-phosphoshikimate acid. A herbicide containing N-phosphonomethylglycine inhibits the EPSP
synthase enzyme and thereby inhibits the shikimic acid pathway of the plant.
The term "glyphosate" is usually used to refer to the N-phosphonomethylglycine herbicide in its acidic or anionic forms. Novel EPSP synthase enzymes have been discovered that exhibit an increased tolerance to glyphosate containing herbicides. In particular, an EPSP synthase enzyme having a single glycine to alanine substitution in the highly conserved region having the sequence: -L-G-N-A-G-T-A- located between positions 80 and 120 in the mature wild-type EPSP synthase amino add sequence has been shown to exhibit an increased tolerance to glyphosate and is described in U.S. Patent No. 4.971,908, the teachings of which are hereby incorporated b}~ reference. Methods for transforming plants to exhibit glyphosate tolerance are discussed in U.S. Patent No. 4,940,835, incorporated herein by reference. A
glyphosate-tolerant EPSP synthase plant gene encodes a polypeptide which contains a chloroplast transit peptide (CTP) which enables the EPSP synthase polypeptide (or an active portion thereto) to be transported into a chloroplast inside the plant cell.
The EPSP synthase gene is transcribed into mRNA in the nucleus and the mRNA is translated into a precursor polypeptide (CTP/mature EPSP synthase) in the cytoplasm. The precursor polypeptide is transported into the chloroplast.
Additional examples of enzymes suitable for use in this invention are acetolactate synthase, RNase to impart male sterility [Mariani et al. ( 1990) Nature 347:
737-741 ], and wheat germ agglutinin.
Yet other examples of desirable nucleic acid sequence are those which encode the sweetness protein. The nucleic acid sequence for the gene encoding the sweetness protein is known in the art (see, e.g., U.S. Patent Application No. 08/670,186, the contents of which are herein incorporated by reference). Transformation of plants with the sweetness protein is useful in, for example, providing a base level of. sweetness in the fruit, thus reducing the effects of differences in fruit maturity by providing more uniform sweetness in different parts of the fruit.
Further examples of suitable proteins for use in this invention are Bacillus thuringiena~is (B.t.) crystal toxin proteins which when expressed in plants protect the plants from insect infestation because the insect, upon eating the plant containing the B.t. toxin protein either dies or stops feeding. B.t. toxin proteins which are toxic to either Lepidopteran or Coleopteran insects may be used. Examples of particularly suitable DNA
sequences encoding B.t. toxin protein are described in the EP patent application 385,962 entitled "Synthetic Plant Genes and Method for Preparation," published Sep. 5, 1990.
Alternatively, it may be desirable to express a nucleic acid sequence which encodes an antisense RNA that hybridizes with a genomic plant DNA sequence. For example, it may be of advantage to express antisense RNA which is specific for genomic plant DNA
sequences that encode an enzyme whose activity is sought to be decreased. Examples of DNA
sequences whose reduced expression may be desirable are known in the art including, but not limited to, the ethylene inducible sequences in fruit (U.S. Patent No.
5,545,815, the entire WO 99/46976 PCT/lJS99/05985 contents of which are herein incorporated by reference). Expression of antisense RNA which is homologous with these ethylene inducible sequences is useful in delaying fruit ripening and in increasing fruit firmness. Other DNA sequences whose expression may be desirably reduced include the ACC synthase gene which encodes the ACC synthase enzyme that is the first and rate limiting step in ethylene biosynthesis. Nucleic acid sequences for this gene have been described from a number of plant sources (e.g., Picton et al. (1993) The Plant J. 3:469-481: U.S. Patent Nos. 5,365,015 and 5,723,766, the contents of both of which are herein incorporated by reference). Expression of antisense RNA which hybridizes with ACC
synthase genomic sequences in plants may be desirable to delay fruit ripening.
Yet another sequence whose expression may be advantageously reduced is the genomic sequence encoding the enzyme polyphenol oxidase. This enzyme is involved in the browning reaction that occurs during chilling injury. Nucleic acid sequences encoding this enzyme have been previously described in the art (e.g., Shahar et al. (1992) Plant Cell 4:135-147), as shown in Figure 14 (SEQ ID NO:11 ) {GenBank accession number s40548).
The use of antisense polyphenol antisense sequences has been reported to inhibit polyphenol oxidase (PPO) gene expression and to inhibit browning [Bachem et al. ( 1994) Bio/Technology 12:1 I01-1105].
One of skill in the art knows that the antisense DNA segment to be introduced into the plant may include the full length coding region of the targeted gene or a portion thereof.
Complete homology between the nucleotide sequences of the antisense RNA and the targeted genomic DNA is not required. Rather, antisense DNA sequences which encode antisense RNA sequences that are partially homologous to a targeted genomic DNA sequence are contemplated to be within the scope of the invention so long as the antisense RNA sequences are capable of repressing expression of the target genomic DNA sequence.
The invention is not limited to vectors which express a single nucleic acid sequence of interest. Vectors which contain a plurality of (i.e., two or more) nucleic acid sequences under the transcriptional control of the same promoter sequence are expressly contemplated to be within the scope of the invention. Such vectors may be desirable, for example.
where the expression products of the plurality of nucleic acid sequences contained within the vector provide protection against different pathogens, and where simultaneous protection against these different pathogens is deemed advantageous.
Also included within the scope of this invention are vectors which contain the same or different nucleic acid sequences under the transcriptional control of different promoter sequences derived from SEQ ID NO:I, SEQ ID NO:10, and other sequences. Such vectors may be desirable to, for example, to control different levels of expression of different nucleic acid sequences of interest in plant tissues.
2. Transformation of Plant Cells Once an expression vector is prepared, transgenic plants and plant cells are obtained by introducing the expression vectors into plants and plant cells using methods known in the art. The present invention is suitable for any member of the monocotyledonous (monocot) plant family including, but not limited to, maize, rice, barley, oats, wheat, sorghum, rye, sugarcane, pineapple, yams, onion, banana, coconut, dates and hops. The present invention is also suitable for any member of the dicotyledonous (dicot) plant family including, but not limited to, tobacco, tomato, soybean, and papaya.
In one embodiment, the expression vectors are introduced into plant cells by particle mediated gene transfer. Particle mediated gene transfer methods are known in the art, are commercially available, and include, but are not limited to, the gas driven gene delivery instrument descried in McCabe, U.S. Patent No. 5,584,807, the entire contents of which are herein incorporated by reference. This method involves coating the nucleic acid sequence of interest onto heavy metal particles, and accelerating the coated particles under the pressure of compressed gas for delivery to the target tissue.
Other particle bombardment methods are also available for the introduction of heterologous nucleic acid sequences into plant cells. Generally, these methods involve depositing the nucleic acid sequence of interest upon the surface of small, dense particles of a material such as gold, platinum, or tungsten. The coated particles are themselves then coated onto either a rigid surface, such as a metal plate, or onto a carrier sheet made of a fragile material such as mylar. The coated sheet is then accelerated toward the target biological tissue. The use of the flat sheet generates a uniform spread of accelerated particles which maximizes the number of cells receiving particles under uniform conditions, resulting in the introduction of the nucleic acid sample into the target tissue.
Alternatively, an expression vector may be inserted into the genome of plant cells by infecting the cells with a bacterium, including but not limited to an Agrobacterium strain previously transformed with the nucleic acid sequence of interest. Since most dicotyledonous plant are natural hosts for Agrobacterium, almost every dicotyledonous plant may be transformed by Agrobacterium in vitro. Although monocotyledonous plants, and in particular, cereals and grasses, are not natural hosts to Agrobacterium, work to transform them using Agrobacterium has also been carried out (Hooykas-Van Slogteren et al., (1984) Nature 311:763-764). Plant genera that may be transformed by Agrobacterium include Chrysanthemum, Dianthus, Gerbera, Euphorbia. Pelaronium, Ipomoea, Passiflora.
Cyclamen, Malus, Prunus. Rosa, Rubus, Populus, Santalum, Allium, Lilium, Narcissus, Ananas. Arachis, Phaseolus and Pisum.
For transformation with Agrobacterium, disarmed Agrobacterium cells are transformed with recombinant Ti plasmids of Agrobacterium tumefaciens or Ri plasmids of Agrobucterium rl7izogenes (such as those described in U.S. Patent No. 4,940,838, the entire contents of which are herein incorporated by reference) which are constructed to contain the nucleic acid sequence of interest using methods well known in the art [J. Sambrook et al. ( 1989) Molecular Clonin~~: A Laboratory Manual, Cold Spring Harbor Press, NY]. The nucleic acid sequence of interest is then stably integrated into the plant genome by infection with the transformed Agrobucterium strain. For example, heterologous nucleic acid sequences have been introduced into plant tissues using the natural DNA transfer system of Agrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et al.
( 1987) Ann.
Rev. Plant Phys. 38:467-48b).
Construction of recombinant Ti and Ri plasmids in general follows methods typically used with the more common bacterial vectors, such as pBR322. Additional use can be made of accessory genetic elements sometimes found with the native plasmids and sometimes constructed from foreign sequences. These may include but are not limited to structural genes for antibiotic resistance as selection genes.
There are two systems of recombinant Ti and Ri plasmid vector systems now in use.
The first system is called the "cointegrate" system. In this system, the shuttle vector containing the gene of interest is inserted by genetic recombination into a non-oncogenic Ti plasmid that contains both the cis-acting and traps-acting elements required for plant transformation as, for example, in the pMLJI shuttle vector and the non-oncogenic Ti plasmid pGV3850. The second system is called the "binary" system in which two plasmids are used;
the gene of interest is inserted into a shuttle vector containing the cis-acting elements required for plant transformation. The other necessary functions are provided in traps by the non-oncogenic Ti plasmid as exemplified by the pBINl9 shuttle vector and the non-oncogenic Ti plasmid PAL4404. Some of these vectors are commercially available.
There are three common methods to transform plant cells with Agrobacterium:
The first method is by co-cultivation of Agrobacterium with cultured isolated protoplasts. This method requires an established culture system that allows culturing protoplasts and plant - regeneration from cultured protoplasts. The second method is by transformation of cells or tissues with Agrobacterium. This method requires (a) that the plant cells or tissues can be transformed by Agrobacterium and (b) that the transformed cells or tissues can be induced to regenerate into whole plants. The third method is by transformation of seeds, apices or meristems with Agrobacterium. This method requires micropropagation.
One of skill in the art knows that the efficiency of transformation by Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency with Agrobacterium tumefaciens [Shahla et al. (1987) Plant Molec. Biol. 8:291-298].
Alternatively. transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. [see, e.g., Bidney et al.
(1992) Plant Molec. Biol. 18:301-313J.
It may be desirable to target the nucleic acid sequence of interest to a particular locus on the plant genome. Site-directed integration of the nucleic acid sequence of interest into ?0 the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium-derived sequences. Generally, plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA
(T-DNA) sequences, as previously described (Offxinga et al., (1996), U.S. Patent No.
5,501,967, the ?5 entire contents of which are herein incorporated by reference). One of skill in the art knows that homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part of the targeted plant gene, whether belonging to the regulatory elements of the gene. or the coding regions of the gene.
Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid ,0 sequence of regions flanking the site to be targeted is known.
Where homologous recombination is desired, the targeting vector used may be of the replacement-or insertion-type (Offringa et al. ( 1996), supra). Replacement-type vectors generally contain two regions which are homologous with the targeted genomic sequence and which flank a heterologous nucleic acid sequence, e.g., a selectable marker gene sequence.
Replacement-type vectors result in the insertion of the selectable marker gene which thereby disrupts the targeted gene. Insertion-type vectors contain a single region of homology with the targeted gene and result in the insertion of the entire targeting vector into the targeted gene.
Other methods are also available for the introduction of expression vectors into plant tissue, e.g., electroinjection (Nan et al. (1995) In "Biotechnology in Agriculture and Forestry."
Ed. Y.P.S. Bajaj. Springer-Verlag Berlin Heidelberg, Vol 34:145-155; Griesbach (1992) HortScience 27:620): fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies (Fraley er al. (1982) Proc. Natl. Acad. Sci. USA 79:1859-1863); polyethylene glycol (Krens c~/ crl. ( I 982) nature 296:72-74); chemicals that increase free DNA uptake;
transformation using virus, and the like.
3. Selection of Transgenic Plant Cells Plants, plant cells and tissues transformed with a heterologous nucleic acid sequence of interest are readily detected using methods known in the art including, but not limited to, restriction mapping of the genomic DNA, PCR-analysis, DNA-DNA hybridization, DNA-RNA hybridization, DNA sequence analysis and the like.
Additionally. selection of transformed plant cells may be accomplished using a selection marker gene. It is preferred, though not necessary, that a selection marker gene be used to select transformed plant cells. A selection marker gene may confer positive or negative selection.
A positive selection marker gene may be used in constructs for random integration and site-directed integration. Positive selection marker genes include antibiotic resistance genes, and herbicide resistance genes and the like. In one embodiment, the positive selection marker gene is the NPTII gene which confers resistance to geneticin (G418) or kanamycin. In another embodiment the positive selection marker gene is the HPT gene which confers resistance to hygromycin. The choice of the positive selection marker gene is not critical to the invention as long as it encodes a functional polypeptide product. Positive selection genes known in the art include, but are not limited to, the ALS gene (chlorsulphuron resistance), and the DHFR-gene (methothrexate resistance).
A negative selection marker gene may also be included in the constructs. The use of one or more negative selection marker genes in combination with a positive selection marker gene is preferred in constructs used for homologous recombination. Negative selection marker genes are generally placed outside the regions involved in the homologous recombination event. The negative selection marker gene serves to provide a disadvantage (preferably lethality) to cells that have integrated these genes into their genome in an expressible manner. Cells in which the targeting vectors for homologous recombination are S randomly integrated in the genome will be harmed or killed due to the presence of the negative selection marker gene. Where a positive selection marker gene is included in the construct. only those cells having the positive selection marker gene integrated in their genome will survive.
The choice of the negative selection marker gene is not critical to the invention as long as it encodes a functional polypeptide in the transformed plant cell. The negative selection gene may for instance be chosen from the aux-2 gene from the Ti-plasmid of Agrobacterium, the tk-gene from SV40, cytochrome P450 from Streptomyces griseolus, the Adh-gene from Maize or Arabidopsis, etc. Any gene encoding an enzyme capable of converting a substance which is otherwise harmless to plant cells into a substance which is harmful to plant cells may be used.
4. Regeneration of Transgenic Plants The present invention provides transgenic plants. The transgenic plants of the invention are not limited to plants in which each and every cell expresses the nucleic acid sequence of interest under the control of the sequences provided herein.
Included within the scope of this invention is any plant which contains at least one cell which expresses the nucleic acid sequence of interest (e.g., chimeric plants). It is preferred, though not necessary, that the transgenic plant express the nucleic acid sequence of interest in more than one cell, and more preferably in one or more tissue.
Once transgenic plant tissue which contains an expression vector has been obtained, transgenic plants may be regenerated from this transgenic plant tissue using methods known in the art. The term "regeneration" as used herein, means growing a whole plant from a plant cell. a group of plant cells, a plant part or a plant piece (e.g., from a protoplast, callus, protocorm-like body, or tissue part).
Species from the following examples of genera of plants may be regenerated from transformed protoplasts: Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, WO 99/46976 PCT/US99/059$5 Majorana, Ciohorium, Helianthus, Lactuca, Bromus, Asparagus. Antirrhinum, Hererocallis, Nemesia, Pelargonium. Panicum. Pennisetum, Ranunculus. Senecio. Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea. Triticum, Sorghum, and Datura.
For regeneration of transgenic plants from transgenic protoplasts, a suspension of transformed protoplasts or a petri plate containing transformed explants is first provided.
Callus tissue is formed and shoots may be induced from callus and subsequently rooted.
Alternatively, somatic embryo formation can be induced in the callus tissue.
These somatic embryos germinate as natural embryos to form plants. The culture media will generally contain various amino acids and plant hormones, such as auxin and cytokinins.
It is also advantageous to add glutamic acid and proline to the medium, especially for such species as corn and alfalfa. Efficient regeneration will depend on the medium, on the genotype, and on the history of the culture. These three variables may be empirically controlled to result in reproducible regeneration.
Plants may also be regenerated from cultured cells or tissues. Dicotyledonous plants which have been shown capable of regeneration from transformed individual cells to obtain transgenic whole plants include, for example, apple (Malus pumila), blackberry (Rubus), Blackberry/raspberry hybrid (Rubus), red raspberry (Rubus), carrot (Daucus carota), cauliflower (Brassica oleracea), celery (Apium graveolens), cucumber (Cucumis sativus), eggplant (Solanum melongena), lettuce (Lactuca sativa), potato (Solanum tuberosum), rape (Brassica napus), wild soybean (Glycine canescens), strawberry (Fragaria x ananassa), tomato (Lycopersicon esculentum), walnut (Juglans regia), melon (Cucumis melo), grape (Vitis vinifera), and mango (Mangifera indica). Monocotyledonous plants which have been shown capable of regeneration from transformed individual cells to obtain transgenic whole plants include, for example, rice (Oryza sativa), rye (Secale cereale), and maize.
In addition, regeneration of whole plants from cells (not necessarily transformed) has also been observed in: apricot (Prunus armeniaca), asparagus (Asparagus officinalis), banana (hybrid Musa), bean (Phaseolus vulgaris), cherry (hybrid Prunus), grape (Vitis vinifera), mango (Mangifera indica), melon (Cucumis melo), ochra (Abelmoschus esculentus), onion (hybrid Allium), orange (Citrus sinensis), papaya (Carrica papaya), peach (Prunus persica), plum (Prunus domestica), pear (Pyrus communis), pineapple (Ananas comosus), watermelon (Citrullus vulgaris), and wheat (Triticum aestivum).
The regenerated plants are transferred to standard soil conditions and cultivated in a conventional manner. After the expression vector is stably incorporated into regenerated transgenic plants, it can be transferred to other plants by vegetative propagation or by sexual crossing. For example, in vegetatively propagated crops, the mature transgenic plants are propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants. In seed propagated crops, the mature transgenic plants are self crossed to produce a homozygous inbred plant which is capable of passing the transgene to its progeny by Mendelian inheritance. The inbred plant produces seed containing the nucleic acid sequence of interest. These seeds can be grown to produce plants that would produce the selected phenotype. The inbred plants can also be used to develop new hybrids by crossing the inbred plant with another inbred plant to produce a hybrid.
Confirmation of the transgenic nature of the cells, tissues, and plants may be performed by PCR analysis, antibiotic or herbicide resistance, enzymatic analysis and/or Southern blots to verify transformation. Progeny of the regenerated plants may be obtained and analyzed to verify whether the transgenes are heritable. Heritability of the transgene is further confirmation of the stable transformation of the transgene in the plant.
EXPERIMENTAL
The following examples serve to illustrate certain preferred embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.
Isolation and Characterization of Sugarcane Polyubiquitin ubi4 And ubi9 Genes A. Plant materials The plant materials used for the cDNA library and for expression studies were sugarcane hybrid H65-7052 plants grown in the greenhouse of the Hawaii Agriculture Research Center, Aiea, Hawaii. The genomic library used to isolate the genomic clones was made from the related sugarcane hybrid H32-8560 [Albert et al., Plant Mol Biol 20, 663-71 (1992)]. Internodes were numbered consecutively down the culm, with number one defined as that internode subtending the youngest fully expanded leaf, as previously described [Moore P. H.: Anatomy and Morphology. In: Heinz DJ (ed) Sugarcane Improvement Through Breeding, pp. 85-142. Elsevier, Amsterdam (1987)].
Sequence comparisons of the 3' untranslated regions (UTR) of five polyubiquitin cDNA clones revealed significant differences, except for clones scubi241 and 511, which were identical in both S' and 3' UTR sequences. Despite these identical regions, these clones do not represent transcripts from the same gene, as scubi241 contained four copies of the polyubiquitin coding repeat, whereas scubi5l 1 contained five of these repeats (data not shown).
B. Plasmid DNA gel blots Polvubiquitin cDNA plasmid clones were digested with appropriate restriction enzymes and size fractionated by gel electrophoresis. DNA was transferred to Hvbond-N+
(Amersham) membranes by capillary transfer. Identical DNA gel blots of the five clones were hybridized to 3'UTR probes at high stringency. High stringency hybridization and stringency washes at 65°C were by the method of [Church et al., Proc.
Natl. Acad. Sci. USA
81, 1991-1995 (1984)). Probe templates were prepared by PCR amplification of the 3' UTR
of each cDNA clone. Probes were 3zP labeled by the method of [Feinberg et al., Anal.
Biochem. l3?. 6-13 (1983)). Blots were exposed to Kodak X-Omat RP XRP-S film for five to 10 min at room temperature.
C. RNA extraction Total RNA was extracted from 10 g of each tissue sample by the method of [Bugos et al., Biotechniques 19, 734-737 (1995)]. RNA concentration was determined by spectrophotometer. Poly-A+ RNA was isolated from total RNA using the PolyATract System (Promega).
D. RNA gel blot analysis Fifteen ~.g of each RNA sample was separated by denaturing gel electrophoresis as described by [Fourney et al., Focus 10, 5-7 (1988)). Hybridization and stringency washes at 65°C were by the simplified northern blot method of [Virca et al., BioTechniques 8, 370-371 (1990)). 3'-P incorporation into probes was monitored by two methods:
scintillation counting following Sephadex G-50 chromatography and TCA precipitation. Equal activity ( 1.7x 10' cpm/ml) of ''-P-labeled gene-specific probe was used for each hybridization.
Autoradiography was with Kodak X-Omat RP XRP-5 film that was preflashed to an OD54~ of 0.15 to maximize linearity of response, exposed at -80°C.
Four identical RNA gel blots, each containing 1 S p.g total RNA from mature leaves, immature leaves, stem apices, internode 1, internode 2, internode 3, roots, and callus cultures at control (26°C) and heat shock (37°C) temperatures, were hybridized to each of the four gene specific 3' UTR probes, respectively, and exposed to film for 16 h as shown in Figure 1.
In Figure l, 15 pg total RNA from each indicated tissue was hybridized to equal activities of each gene-specific probe. Ethidium bromide stained gel indicates equal loading of RNA from each tissue. M.L., mature leaves; LL., immature leaves; S.A., shoot apex; I1, internode 1; I2, internode 2; I3, internode 3; R, roots; C, callus 26°C; C-HS, callus 37°C.
Autoradiogram for scubi221 probe was exposed for 72 h, all others for 16 h.
With this exposure time of 16 h, transcripts homologous to scubi221 were not detected except in callus tissue under 37°C heat-shock, where a single band was barely detectable. After 72 h exposure, the single band in heat-shock callus was clear, but no clear signal was evident for any other tissue. The scubi241/511 probe detected transcripts of two size classes (on the 16 h autoradiogram exposure shown in Figure 1 the two bands overlap, however on shorter exposures two bands can be resolved), with both transcripts present in approximately equal amounts in all tested tissues. Transcripts for this sub-family were present at the highest levels of all tested polyubiquitin genes; heat shock at 37°C did not induce a significant change in transcript accumulation. The scubi561 probe also detected two mRNA size classes; however, the levels of these transcripts were considerably lower than those detected with the 241/511 probe. Transcript pools hybridizing to the 561 probe were elevated significantly by 37°C
heat shock. The scubi5121 probe hybridized to a single size class that was moderately abundant, but significantly lower than the scubi241/S11 pool. The 5121 levels were very similar in all tissues except callus. Callus at control temperature (26°C) contained less scubi5121 mRNA than did other tissues; 37°C heat shock elevated transcript levels in callus to levels approximately equal to those in other tissues at 26°C. This pattern of reduced transcript levels in control temperature callus tissue could be seen to some extent for all probes except scubiS6l.
To confirm the northern analysis results, a "reverse northern" blot containing the 3' UTR DNA of all five polyubiquitin cDNA clones was hybridized to a 3zP-labeled, first strand cDNA probe made from approximately O.S ug mRNA extracted from immature leaves.
Fifty ng of each 3' UTR target DNA was loaded for the blot, ensuring that the target DNA would be present in excess and that hybridization signals should reflect relative abundance of the different mRNAs in the template mRNA population. Results of this experiment are shown in Figure 2.
WO 99!46976 PCT/US99/05985 In Figure 2, SO ng DNA from the gene-specific 3' UTR of each cDNA clone was hybridized to a total first strand cDNA probe from immature leaves. Results of this experiment confirmed the northern results, with scubi241/511 transcripts most abundant, scubi221 least abundant, and scubi561 and 5121 at intermediate levels (Figure 2). Because these two independent methods of estimating the relative mRNA abundance for the different polyubiquitin gene sub-families both indicated the scubi241/511 sub-family as highest, and because the difference between scubi241 /511 and the other sub-families was so large by both measures, this ranking of expression levels is accurate.
While expression of these sugarcane polyubiquitin genes was little affected by cell-type, several of them responded dramatically to an environmental stimulus:
heat stress.
mRNA pools homologous to scubi221, 561 and 5121 in sugarcane callus tissue subjected to 37°C heat shock all rose substantially, whereas the pool homologous to scubi241/511 did not show a substantial increase. Overall, scubi241 and scubi5l 1 most nearly fit the stereotype of "constitutive'' genes, with uniformly high levels of mRNA accumulation in all tested tissues and little induction of expression by heat stress.
Approximately 1x106 pfu from a sugarcane genomic library in the vector ~,EMBL4 were screened with a polyubiquitin coding sequence probe. Fifty polyubiquitin positive plaques were screened again and purified with the scubi241/S11 specific probe.
Five scubi241/511 positive plaques were isolated, and two of these, ~,ubi4 and ~,ubi9. were subcloned into plasmid vectors for further analysis and sequencing.
Subclone pubi4 sequence and organization were initially determined to be as shown in Figures 3A, 3B and 4. Tables 1-2 provide the meaning of sequence symbols other than the bases G, A, T, and C which indicates ambiguities where two or more bases are equally possible in the sequences shown in Figures 3A and 3B.
Table 1 Ambiguity codes for sugarcane polyubiquitin ubi-l gene sequences upstream of the translation start codon S Position Ambiguity Code Shown Actual Base in Figure 3A
243 n G, A, T, or C
245 n G, A, T, or C
790 n G, A, T, or C
1023 n G, A, T, or C
1089 n G, A, T, or C
1174 n G, A, T, or C
1656 n G, A, T, or C
Table 2 Ambiguity codes for sugarcane poiyubiquitin ubi=t gene downstream of the translation stop codon Position Ambiguity Code Shown Actual Base in Figure 3B
17 n G, A, T,orC
885 n G, A, T, or C
1055 s C or G
1059 y C or T
1463 n G, A, T, or C
1712 n G, A, T, or C
1769 r A or G
1810 w A or T
1964 y C or T
2495 n G, A, T, or C
On subsequent sequencing of subclone pubi4, the nucleotide sequence (SEQ ID
NO:S) and translated amino acid sequence (SEQ ID NO:b) were determined to be as shown in Figure ~. The letter "X" in Figure SB refers to any amino acid. The organization of the ubi,~
gene was determined as shown in Figure 6.
SEQ ID NO:S was cloned as two fragments into the plasmid vector pBluescript II
KS+ {Stratagene} to generate plasmids pubi4a and pubi4b. Plasmids pubi4a and pubi4b were introduced into the host Escherichia coli DHSalpha and the transformed Escherichia coli cells were deposited at the Agricultural Research Service Culture Collection (NRRL) under the terms of the Budapest Treaty on March 8, 1999 as NRRLB-30112 (containing pubi4a} and NRRLB-30114 (containing pubi4b). NRRLB-30112 contains the 2227 by EcoRIlSaII
fragment of SEQ ID NO:S, i.e., including the 1802 by 1802 by SEQ ID N0:7, plus the first coding repeat and part of the second repeat. NRRLB-30114 contains the 3329 by SaIIlEcoRI
fragment which includes the remainder of the coding region, the 3' UTR, and further downstream sequences.
An XbaI restriction site was added at the 3' end of the ubi4 promoter shown in SEQ
ID N0:7 by way of PCR amplification with an XbaI adapter primer. The polyubiquitin ubi4 promoter so modified was ligated upstream of a GUS coding sequence and a NOS
3' terminator in the vector plasmid pUCl9 to form pubi4-GUS. This plant expression plasmid was transformed into E. coli DHSa host cells and deposited with the NRRL under the terms of the Budapest Treaty on March 15, 1999 as NRRLB-30115.
Table 3 shows the ambiguity codes and the bases they represent in the ubi4 sequences shown in Figures 5 and 10.
Table 3 Ambiguity codes for sugarcane polyubiquitin ubi=t and ubi9 genes Ambiguity Code Actual Base a A
C
g G
t/u T
m AorC
r A or G
w A or T
s Core y C or T
k GorT
IS v A or C or G
h A or C or T
d AorGorT
b CorGorT
x/n GorAorTorC
not G or A or T or C
Subclone pubi4 (Figures 3, 4, 5 and 6) contained four copies of the polyubiquitin coding repeat, 238 by of 3' UTR, which is approximately 95% identical to the corresponding region of the scubi241/51 I cDNAs, a possible poly-A addition signal 21 S by 3' of the TAA
stop colon, and approximately 2.6 kb of further downstream sequence.
Immediately S' of the initiation colon was an introzZ of 1360 by within SEQ ID NO:S (intron of 1382 by within SEQ ID NO: I ) which was preceded by 65 by in SEQ ID NO:S (67 by in SEQ ID
NO:1 ) that were 98% identical to the 5' UTR sequence of scubi241/51 I. The transcription start site has not yet been determined, and it is not known if the scubi24l and scubi5l I
cDNA clones contain the entire 5' UTR; however, since both scubi241 and scubi5l I cDNA
clones started at the same nucleotide, this site can serve as a putative transcription start site for discussion purposes. The pubi4 subclone contained an additional 377 by upstream of the transcription start codon, with a TATA consensus at -30 by relative to the beginning of the cDNAs.
Approximately 320 by upstream of the transcription start codon were two 10 by sequences that showed homology to the heat stress promoter element (HSE) consensus sequence (aGAAnnTTCt) [Scharf et al.: Heat stress promoters and transcription factors.
In: Nover L
(ed) Plant Promoters and Transcription Factors, pp. 125-162. Springer-Verlag, Berlin (1994)].
The second of these 10 by sequences, however, lacked the G residue at position two that has been found to be invariant in HSEs [Scharf et al. (1994) supra].
Subclone pubi9 sequence and organization were initially determined to be as shown in Figures 7A, 7B and 4. Tables 4-5 provide the meaning of sequence symbols other than the bases G, A, T, and C which indicates ambiguities where two or more bases are equally possible in the sequences shown in Figures 7A and 7B.
Table 4 Ambiguity codes for sugarcane polyubiquitin ubi9 gene upstream of the translation codon Position Ambiguity Code Shown Actual Base in Figure 7A
9 n G, A, T, or C
3613 n G, A, T, or C
Table 5 Ambiguity codes for sugarcane polyubiquitin ubi9 gene downstream of the translation stop codon Position Ambiguity Code Shown Actual Base in Figure 7B
59 n G, A, T, or C
286 n G, A, T, or C
294 n G, A, T, or C
319 n G, A, T, or C
On subsequent sequencing of subclone pubi9, the nucleotide sequence (SEQ ID
N0:8) and translated amino acid sequence (SEQ ID N0:9) were determined to be as shown in Figure 8. The letter "X" in Figure 8B refers to any amino acid. The organization of the ubi9 gene was determined as shown in Figure 6. Table 3, .supra, shows the ambiguity codes and the bases they represent in the ubi9 gene sequences shown in Figures 8 and 11.
An approximately 7.2 kb HindIlI-EcoRI fragment, which contains SEQ ID N0:8 plus approximately 2kb additional downstream sequence, was cloned in the plasmid vector pBluescript II KS+ (Stratagene) to form plasmid pubi9. This plasmid was transformed into E.
coli DHSa host cells and deposited with the Agriculture Research Service culture collection (NRRL). under the terms of the Budapest Treaty on March 8, 1999 as accession number NRRLB-30113.
An Xbal restriction site was added at the 3' end of the ubi9 promoter shown in SEQ
ID NO:10 by w-ay of PCR amplification with an XbaI adapter primer. The ubi9 promoter so modified was ligated upstream of a GUS coding sequence and a NOS 3' terminator in the vector plasmid pUC 19 to form pubi9-GUS. This plant expression plasmid was transformed into E. coli DHSa host cells and deposited with the NRRL under the terms of the Budapest Treaty on March 15, 1999 as accession number NRRLB-30116.
Subclone pubi9 (Figures 4, 6, 7 and 8) contained five copies of the polyubiquitin coding repeat. 244 by of 3' UTR, which were 98% identical to the corresponding region of scubi241/51 1 and 95% identical to the corresponding region of pubi4. A
possible poly-A
addition signal was present 221 by down stream of the TAA stop codon, and there was approximately 2 kb additional downstream sequence. As with the uhi4 gene, an intron was located immediately 5' of the initiation codon; this intron was 1374 bp. 5' of this intron were 65 by (in SEQ ID N0:8) and 67 by (in SEQ ID N0:3) that was 97% identical to both the 5' UTR of the scubi241/511 cDNA clones and the corresponding region of the ubi4 gene.
The subclone contained an additional 2247 by of upstream sequence, including a TATA
consensus sequence at -30 by relative to the beginning of the cDNA clones.
Upstream of the 5'UTR, a 577 by region of ubi9 from position 1671 to 2248 of SEQ ID NO:10 was highly homologous (>90% identity) to the corresponding region of ubi4 (positions 1 to 377 of SEQ
ID N0:5). Partial sequence from an additional subclone of ~,ubi4 indicated that this high degree of homology continues at least as far as 2kb upstream of the transcribed region of the genes (unpublished data). Within this highly homologous region, approximately 344 by upstream of transcription start codon, is an apparent insertion of approximately 200 by not present in the sugarcane ubi4 promoter. This 200 by region was delimited by 17 by imperfect inverted repeats. A 202 by region 82% identical to this insertion is also found in the 3' UTR of a sugarcane glucose transporter cDNA clone SGT1 [Bugos et al., Plant Physiol. 1 U3, 1469-1470 ( 1993)]. The nature of this possible insertion event has not been investigated; however, it has features of miniature inverted-repeat transposable elements (MITEs) [Wessler et ul., Curr Opin Genet Dev 5, 814-21 ( 1995)]. Without limiting the invention to any particular mechanism. this insertion is not believed to have a functional role in the promoter activity of the polyubiquitin ubi9 promoter since this insertion is inserted in the 3'UTR (not the promoter) of the glucose transporter gene, and since it is not present in the polyubiquitin ubi=t gene. Like the ubi4 gene, the ubi9 gene also contained two HSE-like sequences about 320 by upstream of the transcription start codon; however, both of these HSE-like sequences lacked the invariant G residue.
A comparison of the sequences upstream of the translation start codon of the sugarcane ubi.~ gene sequence with the maize polyubiquitin promoter (GenBank accession number 594464; Quail et al., U.S. Patent Numbers 5,614,399 and 5,510,474) showed only 51 % homology when comparing a fragment containing the sequence upstream of the 5' UTR, the 5' UTR sequence and the intron sequence, only 64% homology when comparing a fragment containing the sequence upstream of the transcription start codon, only 65%
homology when comparing a fragment containing the 5'UTR sequence, and only 58%
homology when comparing a fragment containing the intron sequence.
A comparison of the sequences upstream of the translation start codon of the sugarcane ubi9 gene sequence with the maize polyubiquitin promoter (GenBank accession number 594464; Quail et al., U.S. Patent Numbers 5,614,399 and 5,510,474) showed only 61 % homology when comparing a fragment containing the sequence upstream of the 5' UTR, the 5' UTR sequence and the intron sequence, only 63% homology when comparing a fragment containing the sequence upstream of the transcription start codon, only 66%
homology when comparing a fragment containing the S'UTR sequence, and only 59%
homology when comparing a fragment containing the intron sequence.
Some heat-shock-inducible polyubiquitin promoters have been found to contain HSEs (Binet, et al., 1991, supra; Christensen et al. ( 1992) supra]. Both the ubi4 and ubi9 genes contained two short sequence elements that have some homology to HSEs;
however, three out of four of these elements lacked the G residue that has been found to be invariant at position 2 of the HSE (aGAAnnTTCt) (Scharf et al. (1994) supra]. Given the very marginal induction (approximately 2-fold or less) of the ubi~ and ubi9 genes when compared to other sugarcane polyubiquitin genes, it is doubtful that these HSE-like elements play a role in regulating gene expression. Similar observations have been made in connection with the tobacco polyubiqutin promoter, Ubi.U4; while this promoter also contains "...two degenerated heat shock-like elements..., " removal of these elements had no significant effect on gene expression (Plesse, et al. (1997) supra].
Both the ubi=l and ubi9 genes contained a large intron immediately upstream of the protein coding region preceded by an approximately 65 by fragment which was highly homologous to the 5' UTR of scubi241/511. An intron at this position (i.e., immediately upstream of the initiation codon) has been found in many other plant polyubiquitin genes (Binet, et al., 1991, supra; Christensen et al. ( 1992) supra; Garbarino et al. ( 1995) supra;
Morris et al., Plant Mol Biol 21, 895-906 (1993)], and in some cases been shown important for high levels of expression (Morris et al. (1993) supra; Garbarino et al.
(1995) supra]. By analogy, it is the inventors' belief that the intron of the ubi4 and ubi9 genes may also play a role in regulating gene expression.
The eDNA clones scubi241 and scubi5l 1 contain three and five copies of the polyubiquitin coding sequence, respectively. Genomic clones pubi4 and pubi9 contain four and five copies of the polyubiquitin coding sequence, respectively. Without limiting the invention to any particular mechanism, this may mean that the scubi241/S11 sub-family :ZO contains at least three different genes, containing three (i.e., scubi 241/511), four (i.e., pubi4) and five (i.e., pubi9) ubiquitin repeats. Alternatively, it is possible that the difference in the number of copies of the polyubiquitin coding sequence reflects a difference between the cultivars, with the scubi241/511 sub-family in H65-7052 containing genes with three and five ubiquitin repeats, while the same sub-family in H32-8560 contains genes with four and five ubiquitin repeats.
Construction of Plasmids pubi4-GUS, pubi9-GUS, 4PI-GUS and 9PI-GUS Comprising Sugarcane Polyubiquitin Promoters And an Exemplary Structural Gene Reporter plasmids were made placing the uid A gene encoding (3-glucuronidase (GUS) [Jefferson et al., Proc. Natl. Acad. Sci. USA 83, 8447-8451 ( 1986)] and the nopaline synthase (NOS) terminator under the control of the sugarcane polyubiquitin promoter disclosed herein.
The promoter sequence in plasmid pubi4-GUS contained nucleotides 1-1810 of SEQ
ID NO:1 (i.c~.. nucleotides 1-1802 of SEQ ID NO:S) and and XbaI site (TCTAGA) added immediately after by 1802, by way of an adapter on a PCR primer. The promoter sequence in plasmid pubi9-GUS contained nucleotides 1-3688 of SEQ ID N0:3 (i.e.. nucleotides 1-3688 of SEQ
ID N0:8) and and XbaI site (TCTAGA) added immediately after by 3688, by way of an adapter on a PCR primer. The Expand~~"' PCR system (Boehringer Mannheim) was used to amplify part of the 5' UTR and the intron including the 3' splice site; this PCR product was used to join the sequence upstream of the 5' UTR, the 5' UTR, and intron to the GUS gene using a unique NruI site in the S' UTR and an XbaI site added as an adapter to the 3' PCR
primer.
pHA9 contained the maize ubi 1 promoter driving a neomycin phosphotransferase II
(NPTII) gene and NOS terminator. It was made by removing the luc gene from pAHCl8 described in [Christensen et al., Transgenic Research S, 213-218 ( 1996)] as BamHI fragment and replacing it with an 844 by BamHI fragment containing the NPTII gene.
35S-GUS contained the uid A gene encoding GUS under the control of the cauliflower mosaic virus (CaMV) 35S RNA promoter sequence (Clonfech).
Binary plasmids 9PI-GUS, 4PI-GUS and MPI-GUS for Agrobacterium transformation were made by ligating the promoter-intron-GUS-NOS cassettes from pubi9-GUS, pubi4-GUS, and pAHC27 [Christensen et al. ( 1996) supra] respectively, as HindIII - EcoRI
fragments, into the HindIII - EcoRI sites of pCAMBIA1300 [Roberts et al., Rockefeller Foundation Meeting of the International Program on Rice Biotechnology, Malacca, Malaysia (1997)].
Binary plasmid pHW537 contained a putative 5' nuclear matrix attachment region (MAR) from ~,ubi4, ubi9 promoter and intron, GUS, and 3' terminator and putative 3' MAR from ~,ubi4 as HindiII - EcoRI fragment in the HindIII - EcoRI sites of pCAMBIA1300.
Transient Expression of pubi4-GUS and pubi9-GUS in Sugarcane Suspension cultured Cells Sugarcane suspension cell cultures (variety H50-7209) were maintained as described by [Nickell et al., Physiol. Plant. 22, 117-125 (1969)]. DNA reporter plasmids (pubi4-GUS, pubi9-GUS.or pAHC27) were introduced into sugarcane suspension culture cells by particle bombardment as previously described [Klein et al., Nature 327, 70-73 ( 1987)]
using a PDS 1000 Biolistic particle accelerator (BioRad) at 1100 psi. Controls were not bombarded -SO-with DNA.. Transient assays were carried out in a randomized complete block design with four treatments (promoters) and six replications. Each replication consisted of bombardment of five samples. Two days after bombardment, plant material was assayed for GUS
expression. Each sample was divided into two equal parts, one for histochemical analysis and one for chemiluminescent measure of GUS enzyme activity using the GUS-Light kit (Tropix) and an MLX plate reader luminometer (DYNEX). GUS enzyme activity assays were performed according to the manufacturer's protocol, with 60 minutes incubation in GUS
reaction buffer before chemiIuminescence was measured. GUS activity was expressed as relative light units (RLU) per nanogram total protein [Bradford, Anal.
Biochem. 72:248-254 (1976)]. From each experiment, the highest and lowest values were discarded for each plasmid. Analysis of variance was performed on the data from each set of experiments; those experiments which showed a significant effect (P50.05) from promoter treatments were further analyzed by a least significant difference test {P<_0.05) to identify which promoters produced significantly different results.
The results of histochemical staining and chemiluminescent GUS activity assays of sugarcane suspension culture cells which had been bombarded with the reporter plasmids are shown in Figure 9. Figure 9A shows that the average number of blue foci detected after bombardment with GUS expression plasmids containing the ubi9 promoter was higher than observed for either the ubi4 or maize polyubiquitin ubi 1 [Christensen et al.
( 1996), supra]
promoters. Because of high levels of variability, it could not be determined from histochemical staining of these transient expression experiments whether the results seen in sugarcane callus are in fact significantly different. However, using a chemiluminescent assay to measure GUS activity again indicated the average level of expression was higher for the ubi9 promoter than for the sugarcane ubi4 or maize ubi 1 promoters (Figure 9B). Statistical analysis indicated that the difference, as measured by this assay, was significant at P<_0.05.
These data demonstrated that both the ubi4 and ubi9 promoters in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct transient expression in monocotyledonous sugarcane suspension cells.
Transient Expression of pubi4-GUS and pubi9-GUS in Tobacco leaves Tobacco cultivar Wisconsin 38 was grown in Magenta boxes on MSNT medium [ 1 X
S Murashige and Skoog salts (GIBCO BRL # 11117-074), 1 X minimal organics (GIBCO BRL
# I I I 18-023 ). 30g/1 sucrose, 0.8% agar] at 26°C under a I b h light regime. DNA reporter plasmids (pubi4-GUS, pubi9-GUS, or pAHC27) were introduced into tobacco leaves by particle bombardment as previously described [Klein et al., (1987) supra]
using a PDSI000 Biolistic particle accelerator (BioRad) at 650 psi. Controls were not bombarded with DNA.
Transient assays were carried out in a randomized complete block design with four treatments (promoters) and six replications. Each replication consisted of bombardment of five samples.
Two days after bombardment, plant material was assayed for GUS expression.
Each sample was divided into two equal parts, one for histochemical analysis and one for chemiluminescent measure of GUS enzyme activity as described supra (Example 3).
The results of histochemical staining and chemiluminescent GUS activity assays of tobacco leaves which had been bombarded with the reporter plasmids are shown in Figure 9.
The results show that average GUS expression was higher for the sugarcane ubi4 promoter than for either the maize polyubiquitin promoter or the sugarcane ubi9 promoter when using either the histochemical (Figure 9C) or chemiluminescent (Figure 9D) assays.
Analysis of the :ZO chemiluminescent data indicates that the difference between the sugarcane ubi4 and maize polyubiquitin promoters was statistically significant at P<_0.05.
These data demonstrated that both the ubi4 and ubi9 promoters in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct transient expression in dicotyledonous tobacco leaves.
:ZS
Transient Expression of pubi4-GUS and pubi9-GUS in Sorghum Callus :30 Sorghum immature embryo derived callus was cultured and bombarded as previously described with the reporter plasmid ubi4-GUS or ubi9-GUS (prepared as described in Example 2) and with several reporter plasmids in which the uid A gene encoding GUS was placed under the control of each of several promoters including maize adh, 355, 35S:35S, rice actin, and maize ubi 1 as described below.
A. Culture media N6 maintenance medium [Macro elements (mg/1 final concentration), 2830 KN03, 1650 (NH4),SO~, 166 CaCI,-2H,0, 185 MgS04-7H,0, 400 KH,POa; Micro elements (mg/1 final concentration), 37.3 Na, EDTA, 27.8 FeSO4-7H~0, 1.6 H;BO; 0.76 Kl, 3.3 MnS04, 1.5 ZnS04-7H,0; Carbohydrates (g/1 final concentration), 20 Sucrose; Hormones (mg/1 final concentration), 1.0 2,4-Dichlorophenoxyacetic acid; Vitamins (mg/1 final concentration), 0.5 Thiamine-HCI, 0.25 Pyridoxine-HCI, 0.25 Nicotinic Acid; Amino Acids (mg/1 final concentration), 2875 L-Proline, 2.0 Glycine, 100 Casamino Acids; and Agar (g/1 final concentration), 2.5 Phytagel] was used.
B. Plant material Highly embryogenic callus tissue derived from sorghum plants (Sorghum bicolor L.
Moench, cv BWheatland 399) was used. To establish callus cultures, caryopses 10 to 18 d post-anthesis were surface-sterilized with 70% ethanol for 5 min and 20%
Clorox bleach for 15 min, followed by two changes of sterile distilled water. Immature embryos, 1.0 to 1.5 mm long, were aseptically removed using a sterilized 11 cm forceps in a laminar flow hood under a stereo dissecting microscope. The embryos were placed with the scutella exposed on N6 medium modified for sorghum cell culture and solidified with 2.5 gll Phytagel.
C. Microprojectile bombardment preparation Prior to bombardment, 1 mm gold particles were coated with transforming DNA by the procedure of Daines ( 1990). A stock suspension of gold particles was suspended at 60 mg/ml in absolute ethanol. Thirty-five microliters of the suspension was transferred into a 1.5 ml microcentrifuge tube, centrifuged at 14,000 g for 3 min, and the pellet was suspended in 200 pl of sterile distilled water. Following a second centrifugation, the pellet was suspended in 25 ml of Tris-EDTA containing 25 mg of the transforming plasmid DNA. The following chilled sterile solutions were added in order: 200 ml of water, 250 ml of 2.5 M
CaCI,, and 50 ml of 0.1 M Spermidine (0.2 pm filter-sterilized). The microcentrifuge tubes were shaken with a Tomy microtube shaker at 4°C for 1 S min and centrifuged at 16,000 g for min. The supernatant was removed, the pellet washed with 200 ml of ethanol and the DNA-coated gold particles suspended in 36 ml of ethanol.
D. Target tissue establishment and bombardment 5 Immature embryos were removed from sorghum caryopses and cultured on N6 maintenance medium for 7 d. If the immature embryos are less than 0.5 mm they may die in culture and if they are larger than 1.5 mm they may precociously germinate instead of initiating into callus tissue. Four hours prior to bombardment, approximately 50 embryo-derived calli were placed in a circle (4 cm diameter) in the center of a Petri dish ( 1 ~ x 100 mm) containing 0.2 M mannitol and 0.2 M sorbitol in N6 maintenance medium solidified with 2.~ g Phytagel. The Petri dish containing the target callus tissue was placed in the biolistic device and 10 ml of the DNA-gold suspension pipetted onto the center of a macroproiectile. The distance between the stopping plate and the target callus tissue was adjusted to 13 cm. The tissue was bombarded under vacuum with the rupture disk strength at 1100 p.s.i.
Callus tissue was sampled one day post bombardment using the GUS histochemical assay. Approximately twenty (20) replicates were performed for each promoter.
The average number of blue foci per bombarded plate was determined for each reporter plasmid using a stereo microscope.
The ubi4 nucleotide sequence was sufficient to direct transient expression in monocotyledonous sorghum. Indeed, the ubi4 nucleotide sequences resulted in significantly higher levels of expression of GUS as compared to the levels of expression driven by each of the other tested promoters (data not shown). These results demonstrate that the ubi4 promoter in pubi4-GUS was sufficient to direct transient expression in monocotyledonous sorghum callus.
Transient Expression of pubi9-GUS in Pineapple Leaves, Protocorm-Like Bodies, Roots and Fruit Pineapple cultivar F153 leaves, protocorm-like bodies (plbs), roots and fruit were bombarded with a reporter plasmid [pAHC27, pubi9-GUS or 35S-GUS] described supra (Example 2). Target tissue (leaves, plbs, roots and fruit) was plated in the center (2.5 cm diameter) of petri plates in modified MS medium supplemented with 0.8% Difco Bacto agar and 3% sucrose. Bombardments were performed with a Bio-Rad helium gas-driven microprojectile accelerator (PDS-1000/He, Bio-Rad, Hercules, CA) with I 100 psi rupture discs. Gold microcarriers ( 1.6-um-diameter, Bio-Rad) were coated with DNA
using the CaCl2 S precipitation method following the manufacturer's directions. Two ug of each DNA construct were used for each shot.
Histochemical GUS staining was performed 48 hours following bombardment to determine transient transformion. Five (5) or three (3) replicates were performed for each promoter in each type of bombarded tissue. The number of blue foci/plate of each bombarded tissue is shown in Table 4.
Table 4 Number of Blue Foci Per Plate In F153 Pineapple Tissues Plasmid pAHC27 ubi9-GUS 35S-GUS
Leaves ll.Ot7.0~ 244.022.0 -Plbs 142.045.2 150.352.3 -Roots 3.210.9 7.42.2 1.60.8 Fruit 8.76.4 I 1.03.7 14.28.1 '' Standard error.
The above data demonstrate successful transient expression of GUS under the control of the sugarcane polyubiquitin ubi9 promoter in pubi9-GUS in each of the four tissues of monocotyledonous pineapple.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Sugarcane Callus Sugarcane callus cultures were initiated from stem apices (variety H62-4671 ) by surface sterilizing the plant material with 70% ethanol, cutting two mm transverse slices and growing on MS2 plates [ 1 X Murashige and Skoog salts (GIBCO BRL # 1111 ?-074)], 1 x WO 99!46976 PCT/US99/05985 minimal organics (GIBCO BRL#11118-023), 2 mg.L 2,4-D, 0.7% agar] under a 16 h light regime. After one to two months, callus was transferred to MS 1 plates [ 1 X
Murashige and Skoog salts (GIBCO BRL #11117-074), 1X minimal organics (GIBCO BRL #11118-023). 1 mg/L 2,4-D. 0.7% agar] and subcultured monthly.
Sugarcane callus was co-bombarded with a reporter plasmid (pubi4-GUS, pubi9-GUS.
or pAHC27) and the selection plasmid pHA9 which contained the maize ubi 1 promoter driving a neomycin phosphotransferase II (NPTiI) gene (prepared as described supra in Example 2). Bombardment was with one micron gold particles and 1 SSO psi rupture discs.
After bombardment. callus was kept on MS1 plates without selection for two weeks. After his recovery period, calli were transferred to MS 1 plates with SO mg/L 6418 (Agri-bio) for one month, then transferred to MS1 plates with 100 mg/L 6418 for 2-3 months.
Calli which survived this selection were transferred to MS1 plates with 60 mg/L 6418 for multiplication.
Small calli totaling about 50 were randomly chosen from selected lines. These calli were assayed for GUS activity using the GUS-Light kit (Tropix) according tot the manufacturer's I S protocol, with 30 minutes incubation in GUS reaction buffer before chemiluminescence was measured. Two to ten 50 mg samples were assayed from each line, with three chemiluminescence assays for each sample.
The results of the GUS chemiluminescent activity assays are shown in Figure 12. In ten selected independent transgenic sugarcane callus lines, GUS expression from the sugarcane ubi9 promoter averaged 535.4 RLU/ng protein/30 min (Figures 12A, 12D). Seven selected stable transgenic lines expressing GUS under the control of the sugarcane ubi4 promoter averaged 209.3 RLU/ng protein/30 min (Figures 7B, & 7D), and seven lines expression GUS under the control of the maize ubi I promoter averaged 34$.2 RLU/ng protein/30 mon (Figures 7C, &D).
These results demonstrate that both the ubi4 and ubi9 nucleotide sequences in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct stable expression in monocotyledonous sugarcane callus.
Stable Expression of 9PI-GUS in Transgenic Rice Callus Agrobacteriunr strain EHA105 [Hood et al. J. Bacteriol. 168:1291-1301 (1986)]
was used to transform rice callus. Reporter plasmids (9PI-GUS, 4PI-GUS, MPI-GUS, or WO 99/46976 PCT/US99t05985 pHW537. which were prepared as described in Example 2, supra) were introduced into Agrobacteriurn strain EHA105 by a standard procedure.
Rice callus was induced from scutellum tissue of rice (cv. Taipei 309) and transformed by A~,>robcrcterium co-cultivation as previously described [Hiei et al., ( 1994) supra]. After 2-3 months on selection medium containing 100 mg/1 hygromycin B (CalBiochem), small calli from selected lines were assayed for GUS activity by the chemiluminescence method described supra (Example 3).
PCR using one primer within the T-DNA and one outside of the T-DNA right border was used to confirm the absence of Agrobacterium contamination in tested callus lines using methods known in the art. Because rice transformation was by Agrobacterium.
there is the possibility that the Agrobacterium were not killed after transformation, and thus that the GUS expression seen is from the Agrobacterium, not from transgenic rice cells.
To test for this possibility, two PCRs were performed: one to test for the presence of the GUS gene, the second to test for the presence of vector plasmid sequences outside of the T-DNA. The Agrobacterium harbor a binary plasmid, one section of which contains the GUS
encoding gene under the control of sugarcane promoter sequences. This section of the vector plasmid is called the T-DNA, and only this part is ordinarily transferred to the plant genome. A
positive PCR for GUS indicates that the isolate plant DNA may be successfully amplified by PCR and the GUS gene is pressent (i.e., confirming the observation of GUS
activity). A
negative PCR for the vector plasmid outside of the T-DNA confirms that the entire plasmid, which is what is present in the Agrobacterium, is no longer present, i.e., that the T-DNA
(which contains the GUS sequences under the control of the sugarcane promoter sequences) is successfully integrated into the plant genome.
One, seven, four, and two stably transformed transgenic rice callus lines were selected following co-cultivation of rice callus with Agrobacterium which had been transformed with the 4PI-GUS, 9PI-GUS, pHW537, and MPI-GUS reporter plasmids, respectively. GUS
expression in six transgenic lines transformed with the ubi9 promoter sequence is shown in Figure 13. GUS expression in six transgenic lines transformed with the ubi9 promoter sequence averaged 681.0 RLU/ng protein/30 min.
Four stably transformed transgenic rice callus lines were selected following co-cultivation of rice callus with Agrobacterium which had been transformed with the pHW537 reporter plasmid. GUS expression in these transgenic lines averaged 567.7 RLU/ng protein/30 min. Since the pHW537 reporter plasmid differed from the 9PI-GUS
plasmid in additionally containing the putative 5' and 3' flanking nuclear matrix attachment regions (MARs). and since both the 9PI-GUS and pHW537 reporter plasmids successfully resulted in expression of comparable levels of GUS in rice callus, these results demonstrate that the putative 5' and 3' flanking nuclear MARS are not necessary for promoter activity of the pubi9 sequences.
The results also suggest that either the polyubiquitin ubi4 or ubi9 promoter (with or without putative MARs) drives GUS expression at levels comparable to the maize ubi 1 promoter in stable transgenic rice callus.
The above results demonstrate that the ubi9 promoter in 9PI-GUS, and pHW537 was sufficient to direct stable expression in monocotyledonous rice callus.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Tobacco leaves Agrobacterium mmefacien.s was used to transform tobacco leaves. Reporter plasmids 1 S (9PI-GUS, 4PI-GUS, MPI-GUS, or 355-GUS, which were prepared as described in Example 2, supra), were introduced into leaf discs by an Agrobacterium mediated transformation procedure adapted from Horsch et al Science 227:1229-1231 (1985). Controls were untransformed.
Briefly, two ml overnight cultures of Agrobacterium tumefaciens were pelleted by brief centrifugation, decanted and resuspended in two ml of MSNTS liquid media (per liter of medium: one package MS salt mix [GIBCO BRL #11117-066], 30 g sucrose, 1.0 ml BS vitamins, 50 pl 2 mg/ml a-naphthaleneacetic acetic acid [GIBCO BRL #21570-015], and 50 pl 20 mg/1 benzyladenine [GIBCO BRL #16105-017]).
Aseptically grown tobacco leaves were harvested and placed in petri plates containing 20 ml MSNTS with 0.6 ml of the resuspended Agrobacterium and cut into approx. 1 cm squares with a sterile scalpel. After one to five minutes the leaf pieces were removed from the liquid, gently blotted on dry sterile paper, and placed on 0.8% agar MSNT
plates (per liter of medium: one package MS salt mix [GIBCO BRL #11117-066], 30 g sucrose, 1.0 ml 1000X BS vitamins, 8 g Bacto-Agar) with 500 pg/ml Cefotaxime 3U (PhytoTechnology Laboratories). After two days, the leaf pieces were transferred to 0.8%
agar MSNTS plates with 500 pg/ml Cefotaxime and 100 pg/ml Kanamycin (PhytoTechnology Laboratories). Shoots appeared after two to three weeks, at which time the shoots were transferred to Magenta boxes containing MSNT media with no antibiotics. To confirm Kanamycin resistance and to reduce the likelihood of obtaining chimeric plants, a "shooting assay" was used: leaves from putative transgenic plants were placed on 0.8% agar MSNTS plates containing 100 pg/ml Kanamycin. Shoots arising from these "shooting assays" were transferred to non-selective rooting media (MSNT) and grown in Magenta boxes.
Expression of GUS was determined as described in Example 3, supra. The results demonstrate that the both the ubi4 and ubi9 nucleotide sequences in 4PI-GUS
and 9PI-GUS, respectively. were sufficient to direct stable expression in dicotyledonous tobacco leaves.
Furthermore. The levels of GUS expression under the control of either the ubi4 and ubi9 nucleotide sequences in 4PI-GUS and 9PI-GUS were equal to or greater than the levels of GUS expression under the control of the maize ubil promoter. On the other hand. expression of GUS under the control of the CaMV 35 S promoter was greater than that under the control of either the sugarcane ubi4 or ubi9 promoters (data not shown).
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Maize Embryos and Regeneration of Transgenic Maize Plants Embryogenic corn cultures are initiated from immature maize embryos, bombarded simultaneously with a reporter plasmid (pubi4-GUS, pubi9-GUS, or mzubil-GUS) and a selection plasmid (pHA9), and stably transformed corn plants regenerated as previously described by Brown et al., U.S. Patent Number 5,593,874, incorporated by reference.
Briefly, embryogenic corn cultures are initiated from immature maize embryos of the "Hi-Ir" genotype which had been cultured 18-33 days on N6 2-100-25-Ag medium modified to contain 2 mg/L 2,4-dichlorophenoxyacetic acid, 180 mg/L casein hydrolysate, 25 mm L-proline, 10 pM silver nitrate, pH5.8, solidified with 0.2% PhytagelTM
(Sigma). These embryogenic cultures are used as target tissue for transformation by particle gun bombardment.
A I:1 mixture of the reporter vector (pubi-4-GUS, pubi-GUS or mzubil-GUS) and selection plasmid (pHA9) is precipitated onto tungsten M10 particles by adding 12.5 pl of particles (25 mg/ml in SO% glycerol), 2.5 pl plasmid DNA (1 pgl~l), 12.5 pl 1M
calcium chloride, and 5 pl 0.1 M spermidine, and vortexing briefly. The particles are allowed to settle for 20 minutes, after which 12.5 pl of supernatant is removed and discarded.
Each sample of DNA-tungsten is sonicated briefly and 2.5 p.l is bombarded into the embryogenic cultures using a PDS-1000 Biolisitics particle gun (DuPont).
The bombarded tissue is transferred to fresh, nonselective medium the day after bombardment. Six days post-bombardment, the material is transferred to selective media containing 50 mg/L 6418 (Agri-bio). After 2-3 weeks, the cultures are transferred to fresh media which contains 200 mg/L 6418. The cultures are maintained on the 200 mg/L 6418 media, transferred at 2-3 week intervals,,until 6418-resistant calli could be distinguished.
6418-resistant calli are recovered from the embryogenic material. 6418-resistant lines are bulked up and assayed for GUS expression using a histochemical or chemiluminescent assay as described supra (Example 3).
Plants are regenerated from the 6418-resistant calli which express GUS
activity in a three step regeneration protocol. All regeneration is performed on 200 mg/L
6418. The first two steps are carded out in the dark at 28°C., and the final step under a 16:8 hour photoperiod, at about 25°C. Small green shoots that formed on Regeneration Medium 3 in 100X 25 mm Petri plates are transferred to Regeneration Medium 3 in 200X 25 mm PyrexTM or PhytatraysTM to permit further plantlet development and root formation. Upon formation of a sufficient root system, the plants are carefully removed from the medium, the root system washed under running water, and the plants placed into 2.5" pots containing Metromix 350 growing medium. The plants are maintained for several days in a high humidity environment, and then the humidity is gradually reduced to harden off the plants.
The plants are transplanted from the 2.5" pots to 6" pots and finally to 10"
pots during growth.
Corn plants regenerated from 6418-resistant embryogenic calli which express GUS
activity are tested for GUS expression using histochemical of chemiluminescent assays as described supra (Example 3). GUS expression (e.g., as determined by blue staining in the histochemical assay or by luminescence in the chemiluminescent assay) by one or more tissues of corn plants which are generated from calli that had been bombarded with pubi4 GUS or pubi9-GUS demonstrates that the ubi4 and ubi9 nucleotide sequences in pubi4-GUS
and pubi9-GUS, respectively, are sufficient to direct stable expression in regenerated monocotyledonous maize plants.
Stable Expression of 4PI-GUS and 9PI-GUS in Transgenic Tomato Plants A reporter plasmid (4PI-GUS or 9PI-GUS, prepared as described in Example 2) or a control plasmid (i.e~.. a plasmid which contains the uid A gene encoding GUS
and which lacks a promoter sequence) are transformed into Agrobacterium, and the transformed Agrobacter-ium is used to infect tomato plants as previously described in Theologis e~t al..
U.S. Patent Number 5,723,766, incorporated by reference.
Briefly, a reporter plasmid or control plasmid is introduced into Agrobacteriarm strain LBA4404 as follows: Agrobacterium tumefaciens LBA-4404 (2 ml) is grown overnight at 28°C. in LB broth, and this used to inoculate SO ml of LB broth to obtain the desired culture.
The inoculated medium is grown at 28°C. until the OD.boo is 0.5-1Ø
The cells are collected by centrifugation and the pellet is resuspended in 1 ml, 20 mM ice cold CaCI,.
To l 00 p l of the cell suspension. 1 ug of the plasmid is added, and the mixture is incubated on ice for 30 min before snap-freezing in liquid nitrogen. The cells are then thawed at 37°C. for 5 min and used to inoculate 1 ml LB. After 2 h growth at 28°C. with agitation, 100 ~1 of the culture are plated on LB+Kanamycinso medium; colonies are expected to appear in 2-3 days at 28°C. The cells are recultured by picking several colonies and streaking on LB+Kanamycin5o medium;
again, 3-4 colonies are picked from independent streaks and 5 ml cultures in LB+Kanamycinso medium are grown. Stationary phase cultures are used for transformion of tomato plants which are grown as described infra. Transformed Agrobacterium cells are frozen using 15% glycerol at -80°C. for later use.
To prepare host tomato plants, tomato seeds are sterilized using a protocol which consists of treatment with 70% ethanol for 2 min with mixing; followed by treatment with 10% sodium hypochlorite and 0.1 % SDS for 10 min with mixing, followed by treatment with 1 % sodium hypochlorite, 0.1 % SDS for 30 min with mixing, and washing with sterile water 3 times for 2 min per wash. For germination of the sterilized seeds, 0.8 g of the sterilized seeds are placed in a Seed Germination Medium and grown for 2 weeks at low light in a growth room. After two weeks, when the seeds had germinated, cotyledons are dissected from the seedlings by cutting off the cotyledon tips and then cutting off the stem. This process is conducted in a large petri dish containing S-10 ml of MSO medium.
Feeder plates are prepared from a tobacco cell suspension in liquid medium at 25°C.
prepared with shaking at 130-150 rpm. The suspension is transferred to fresh medium at 1:10 dilution every 3-5 days. 1 ml of rapidly dividing culture is placed on the feeder plate, overlaid with filter paper and placed in low light in a growth room. The feeder plates are supplemented with 10 ml Feeder Medium Transformed Agrobacteriunr cultures which contain the reporter or control plasmids are inoculated into 50 ml LB
containing kanamycin with a single colony of the strain. The culture is grown by shaking vigorously at 30°C. to saturation (OD>2.0 at 600 nm). The strain is chosen to come to full growth in less than 24 h.
The culture is then diluted 5 times and split into 50 mi portions in plastic tubes.
Cotyledons from two of the feeder plates are scraped into each tube and rocked gently for 10-30 min. The cotyledons are then removed from the bacterial culture onto sterile filter paper (abaxial side up) on a tobacco feeder plate and incubated for 48 h in low Iight in a growth room. The cotyledons are then transferred axial side up to callus inducing medium.
In the Callus inducing Medium, approximately four plates are used per magenta box, and the explants are crowded. The box is placed in a growth room for three weeks, and small masses of callus are expected to form at the surface of the cotyledons. The explants are transferred to fresh plates containing the callus inducing medium every three weeks. When the calli exceed 2 ml, they are transferred to plates containing shoot inducing medium. When the stem structure is evident, the shoots are dissected from the calli and the shoots are transferred to plates containing root inducing medium. After a vigorous root system is formed on the plants, the plantlets are transferred to soil by taking the plantlets from the plates, removing as much agar as possible and placing the plantlets in a high peat content soil in a small peat pot which fits into a magenta box with cover. When the seedling leaves reach the top of the box, the lid is loosened to uncover the box slowly over a period of 4-5 days. The plants are then transferred to a light cart and larger pots, and kept moist. Flowers of these regenerated plants are pollinated and tomatoes are developed.
Expression of GUS is measured in regenerated tomato plant tissues transformed with the reporter or control plasmids using a histochemical or chemiluminescent assays as described supra (Example 3). GUS expression by one or more tissues of tomato plants which are generated from tissue that had been transformed with 4PI-GUS or 9PI-GUS
demonstrates that the ubi4 and ubi9 nucleotide sequences in 4PI-GUS and 9PI-GUS, respectively, are sufficient to direct stable expression in regenerated dicotyledonous tomato plants.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Soybean Excised Embryonic Meristems And Regeneration Of Transgenic Soybean Plants Soybean explants are derived from excised meristems, bombarded simultaneously with a reporter plasmid (pubi4-GUS, pubi9-GUS, or mzubil-GUS) and a selection plasmid (pHA9), and stably transformed soybean plants are regenerated as previously described by Christou et al., U.S. Patent Number 5,015,580, incorporated by reference.
Briefly, soybean explants of cultivar Williams 82 are derived from meristems excised from the embryonic axes of immature seeds. Primary leaves are removed and the explant plated on a target plate containing 1 % water agar. The explants are transformed with a reporter plasmid or a selection plasmid loaded at 1.0-0.001 p.g/ml of beads.
The particle accelerator is charged at 13-16 kV. The carrier is loaded with 0.05-0.40 mg of loaded beads per square centimeter, with a preferred level of loading of 0.2 mg/em'-.
The bombarded explants are then plated in the dark on modified MS basal medium which has a high level (i.e., 13.3 ~M) of the cytokinin benzylaminopurine.
Following incubation of 1 to 2 weeks in the dark, the tissues are transferred onto the same basal medium at a lower (1.7 pM) level of cytokinin to promote shoot elongation. Shoots are harvested at 0.5 to 1 cm in height.
The success of the transformation protocol is verified by fixing transformed explants at each stage to assay for GUS activity as described supra (Example 3). Two days after DNA
particle injection, dozens of GUS active cells are expected to be detected in each explant. At 6 to 8 weeks, the plants are assayed for GUS activity in the shoot. Most plants are expected to be chimeric, having streaks of blue (i.e., GUS-expression cells) when assayed using the histochemical assay.
The transformed excised meristem tissue is used to regenerate fully mature, and sexually mature chimeric plants using methods known in the art such as those described in U.S. Patent No. 5,015,580 to Christou et al. (incorporated by reference). GUS
expression by one or more tissues of soybean plants which are regenerated from excised embryonic meristems that had been transformed with pubi4-GUS or pubi9-GUS demonstrates that the ubi4 and ubi9 nucleotide sequences in pubi4-GUS and pubi9-GUS, respectively, are sufficient to direct stable expression in regenerated dicotyledonous soybean plants.
From the above, it is clear that the invention provides promoter sequences which are capable of driving transgene expression in both monocotyledonous and dicotyledonous plant cells, and which are useful for generating transgenic plants with desirable agronomic characteristics.
WO 99/46976 PC'f/US99/05985 SEQUENCE LISTING
<110> Albert, Henrik H.
Wei, Hairong <120> PLANT PROMOTER SEQUENCES AND METHODS OF USE THEREOF
<130> UH-03648 <140> XX/XXX,XXX
<141> 1999-03-17 <160> 12 <170> PatentIn Ver. 2.0 <210> 1 <211> 1813 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 1 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acnanaccgc gacgcaaccg cctctcgccg ctgggcccca caccgctcgg tgccgtagcc 300 tcacgggact ctttctccct cctcccccgt tataaattgg cttcatcccc tccttgcctc 360 atccatccaa atcccagtcc ccaatcccat cccttcgtcg gagaaattca tcgaagcgaa 420 gcgaatcctc gcgatcctct caaggtactg cgagttttcg atccccctct cgacccctcg 480 tatgtttgtg tttgtcgtac gtttgattag gtatgctttc cctgtttgtg ttcgtcgtag 540 cgtttgatta ggtatgcttt ccctgttcgt gttcatcgta gtgtttgatt aggtcgtgtg 600 aggcgatggc ctgctcgcgt ccttcgatct gtagtcgatt tgcgggtcgt ggtgtagatc 660 tgcgggctgt gatgaagtta tttggtgtga tctgctcgcc tgattctgcg ggttggctcg 720 agtagatatg gatggttgga ccggttggtt cgtttaccgc gctagggttg ggctgggatg 780 atgttgcatn gcgccgttgc gcgtgatccc gcagcaggac ttgcgtttga ttgccagatc 840 tcgttacgat tatgtgattt ggtttggact tattagatct gtagcttctg cttatgttgc 900 cagatgcgcc tactgctcca tatgcctgat gataatccat aaatggcagt ggaaatcaac 960 tagttgattg cggagtcatg tatcagctac aggtgtaggg actagctaca ggtgtaggga 1020 ctngcgtcta attgtttggt ccttaactca tgtgcaatta tgcaatttag tttagatgtt 1080 tgttccaant catctaggct gtaaaaggga cactggttag attgctgttt aatcttttta 1140 gtagattata ttatattggt aacttattaa cccntattaa catgccataa cgtggattct 1200 gctcatgcct gatgataatc atagatcact gtggaattaa ttagttgatt gttgaatcat 1260 gtttcatgta cataccacgg cacaattgct tagttcctta acaaatgcaa attttactga 1320 tccatgtatg atttgcgtgg ttctctaatg tgaaatacta tagctacttg ttagtaagaa 1380 tcaggttcgt atgcttaatg ctgtatgtgc cttctgctca tgcctgatga taatcatata 1440 tcactggaat taattagttg atcgtttaat catatatcaa gtacatacca tggcacaatt 1500 tttagtcact taacccatgc agattgaact ggtccctgca tgttttgcta aattgttcta 1560 ttctgattag accatatatc aggtattttt ttttggtaat ggttctctta ttttaaatgc 1620 tatatagttc tggtacttgt tagaaagatc tggttncata gtttagttgc ctatccttcg 1680 aattaggatg ctgagcagct gatcctatag ctttgtttca tgtatcaatt cttttgtgtt 1740 caacagtcag tttttgttag attcattgta acttatgttc gcttactctt ctggtcctca 1800 atgcttgcag atg 1813 <210> 2 <211> 2804 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 2 taatcctggg ccatgancag ctgtccttcc aggttcacaa gtctggtgcc ttcttctgtc 60 cctccgatgg agattatctg catgtcgtgg tcgtgtcctg atcgaatcct cgttgaatcc 120 ctatgttttt cttcaagaaa tgtgagtcct atgtcagtct ggttgcgttt gtgaacattt 180 ctgctgctga gcagcacttt ggctggaact gtgcaatgaa ataaatggaa ccctggtttc 240 tggttatgtg tgtgttagct aatgtttttg aagtggaagc tctaatcttc tatcgcgttg 300 ctactacaat tctgcttgtg ttttgatggt tcttggtttc tgttagttgg ttcagaggaa 360 gttttgcttc cacagactaa gatgcagttg aactttggtt gccctggttt ctagatttca 420 tttgtgctgg ttgagtgata gtaagaaaca accggtgttc acatataatc aggttttgtg 480 ctgctcgagt gatcgtcaaa aaccaccggt gttcacatct aaaaaggttt cgatccccag 540 gtttagatct cccgtttaat tccaaaaaaa aagttctgtg tacttgcatt tagttgggtg 600 gttgatgctg gaaagagtaa ctttcaagag taataatctt tggtgactac tctgtttcaa 660 ctgatcaatc cctaggaaag gtacaccttt acttagggaa gaaattctta gaaccttgca 720 ctttgtttca actgataata gtatacttta ttagataaaa aatattcaga tatattagac 780 accggatgtc atccactcat ccttacaaac ctctgtcatg gtcctgcaga aatgtttgcc 840 agctccagtg gcttcctgat aaatctgtgg agtgcctgtt aatcngctgc caatttttgc 900 tgagcactgt atatatgtta gtaagtacta ttgggccacc aattccattt tgacacagca 960 ctattggtcc accaattcga tcctgacaca gcactgcata atttgaaacg tttttgctcc 1020 cattttgcaa ggctacaaat ttagatcatg tttascatyc tgtgggatac aatatatgga 1080 tatcgaacaa acttggtatg tcagagaaaa aatagtttat tttcaaaact aacattttta 1140 aagccttcta tgaactttaa accttcagca tttgggatca agatgagtgc tcgaacaaga 1200 gtgcactttt tctccaaaat aatctactac agagttcttt tttatatata aaaaaactta 1260 tacttaacag ataaatcaga cctcttctgc tccatatcac cttgacaaat caaagaagca 1320 gcaccagcga agggtattat tattgaggta aatataagat ctcgtttact gaaaaagacc 1380 gcgtgtttac ctaaactacc attttgcttt gatagcagca tacatgtgat agaattgcgg 1440 atcctaccgt gctgactgtg aangtggtaa gggtgagaga ttggtgggcg aggtctgaac 1500 gagcgaaaac agtactgcat ttactgttca caaggaggcg gcttaggttt tggtctccca 1560 gctctctaag ggaagctgag aattatgatt ctcttgctta attatttctt aaccaaagtt 1620 ataaatatat agcctatgag atcctaattt atggaaataa ctaaactatt ttaaggaaat 1680 atataaatag ataatcagcc cactaacggg cntagcgccc actaacaggc ctggtgctga 1740 gcccgacata acatctctcc ccgcctggrg aaacagctcg tcctcgagct gaaatctggt 1800 agaagcatcw tcaaccaaca ccggggtcat gctggaacac tgcatcaggc gctaccgcag 1860 ctggtacgtc gtcgtcgagg aagtcagccg actccaagta gaacagtcgc ttacaactga 1920 tgtccgcgga cgtagggctc atcacaattg taaacaaagc cctygacgac ggcactccaa 1980 acagctcttc cggtgtgaga cgacgaaacg agggagctag agcgggtagt ggcgcgggga 2040 acagccagtg tagcgcctgt agtcaccgag ggatggggcg gtcgggcgcc gcgaggctgc 2100 ggtgccaggt ggaggttcaa cattcttcaa acgcccgtgc caagtacatg gcggactgga 2160 ggtcgggcgg tgcgcggagc tgaacctgct tacggaggtg gtccggcagc ccacccacgt 2220 acaactccgc cttttggcga gcggagaggt tgtgggcatg gcacaggacg gcgttgtaac 2280 gctccgagta atcctgaacg gaagaaccaa aaggaaggcg ggcaagctcc gccaaccgag 2340 tgcccaaaac aggaggcccg aagcgaagcg agcataattc gcggaagcgc tcccaaggag 2400 gcataccctc gtcttgctcc agggcgtagt accatgtctg ggcaacaccc cgaagatggt 2460 aggacgcgag ccatgtgcga gcggaggcga gcgtntgctg gccgcggaag aactgctcgc 2520 actggttcaa ccaattcagg ggatcggtcg aaccgtcgta cgtagggaac tccagtttgt 2580 agaatttggg ccccgcctgg gcgcctgcga gggcagcagc aagggctggg tcgagccccc 2640 cctgcggctg gccgcccgag ggagagggcg cccgaagaac agcggcccgt ccaccccccc 2700 gaaaagagtg ctggcggggg gtagggagga catcgttgtc gccgccgccg tcgtgtaggc 2760 tgtcggggag ggcgacgtgg ccatcgagta gatccgggga attc 2804 <210> 3 <211> 3691 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 3 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacecaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 ggtatcaatt ctngtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcagat g 3691 <210> 4 <211> 343 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 4 taagtcctgg gccatgagca gctgtccttc cagggttcac aagtagtggt gccttcttnc 60 tgtccctccg atggagatta tctgcatgtc gtggtcgtgt cctgatcgag tcgtcgttga 120 gtccctatgt tttttcttca agaaatgtga gtcctatgtc agtctggttg cgtttgtgaa 180 cattttctgc tgctgcgcag cagtttggtt ggaactgtgc aatgaaataa attgaaccct 240 ggtttctggt tatgtgtgtt agctaatgtt tttgaagtgg aagctntaat cttntatcgc 300 gttgctacta caattctgnt tgtgttttga tgttcttgtt tct 343 <210> 5 <211> 5512 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 5 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acaaccgcga cgcaaccgcc tctcgccgct gggccccaca ccgctcggtg ccgtagcctc 300 acgggactct ttctccctcc tcccccgtta taaattggct tcatcccctc cttgcctcat 360 ccatccaaat cccagtcccc aatcccatcc cttcgtcgga gaaattcatc gaagcgaagc 420 gaatcctcgc gatcctctca aggtactgcg agttttcgat ccccctctcg acccctcgta 480 tgtttgtgtt tgtcgtacgt ttgattaggt atgctttccc tgtttgtgtt cgtcgtagcg 540 tttgattagg tatgctttcc ctgttcgtgt tcatcgtagt gtttgattag gtcgtgtgag 600 gcgatggcct gctcgcgtcc ttcgatctgt agtcgatttg cgggtcgtgg tgtagatctg 660 cgggctgtga tgaagttatt tggtgtgatc tgctcgcctg attctgcggg ttggctcgag 720 tagatatgga tggttggacc ggttggttcg tttaccgcgc tagggttggg ctgggatgat 780 gttgcatgcg ccgttgcgcg tgatcccgca gcaggacttg cgtttgattg ccagatctcg 840 ttacgattat gtgatttggt ttggacttat tagatctgta gcttctgctt atgttgccag 900 atgcgcctac tgctccatat gcctgatgat aatccataaa tggcagtgga aatcaactag 960 ttgattgcgg agtcatgtat cagctacagg tgtagggact agctacaggt gtagggactg 1020 cgtctaattg tttggtcctt aactcatgtg caattatgca atttagttta gatgtttgtt 1080 ccaatcatct aggctgtaaa agggacactg gttagattgc tgtttaatct ttttagtaga 1140 ttatattata ttggtaactt attaacccta ttacatgcca taacgtggat tctgctcatg 1200 cctgatgata atcatagatc actgtggaat taattagttg attgttgaat catgtttcat 1260 gtacatacca cggcacaatt gcttagttcc ttaacaaatg caaattttac tgatccatgt 1320 atgatttgcg tggttctcta atgtgaaata ctatagctac ttgttagtaa gaatcaggtt 1380 cgtatgctta atgctgtatg tgccttctgc tcatgcctga tgataatcat atatcactgg 1440 aattaattag ttgatcgttt aatcatatat caagtacata ccatggcaca atttttagtc 1500 acttaaccca tgcagattga actggtccct gcatgttttg ctaaattgtt ctattctgat 1560 tagaccatat atcaggtatt tttttttggt aatggttctc ttattttaaa tgctatatag 1620 ttctggtact tgttagaaag atctggttca tagtttagtt gcctatcctt cgaattagga 1680 tgctgagcag ctgatcctat agctttgttt catgtatcaa ttcttttgtg ttcaacagtc 1740 agtttttgtt agattcattg taacttatgt tcgcttactc ttctggtcct caatgcttgc 1800 agatgcagat cttcgttaag accctcactg gcaagaccat cacccttgag gttgagtctt 1860 cagacamtat tgacmatgtc maggctaaga tacaggacaa ggaaggcatt cctccggatc 1920 agcagaggct gatctttgct ggcaagcagc tcgaggatgg ccgtacccta gytgactaca 1980 acatccagaa ggagtccacc stccacctgg tgctcaggct caggggaggc atgcaaatct 2040 tcgtcaagac cctcactggc aagactatca cgcttgaggt cgagtcttct gacacgatcg 2100 acaacgtgaa ggccaagatc caggacaagg agggaatccc cccggaccag cagcgtctca 2160 tcttcgctgg caagcagctc gaggatggcc gcaccctcgc tgactacaac atccagangg 2220 agtcgantnt ccaccttgtg ctcaggttna ggggtggcat gcagattttt gtcaagacct 2280 tnactggcaa gaccatcacc ttggaggtgg agtcttcgga caccatngac aatgtgaagg 2340 ngaagatcca ggacaaggaa ggaatccccc cagaccagca gcgtcttatt tttgctggca 2400 agcagcttga ggatggccgc accctagcag actacaacat ccagaaggag tccacccttc 2460 acctggtgct ccgcttncgc ggtggtatgc agatcttcgt caagaccctc accggcaaga 2520 ccatcaccct ggaggtggag tcctctgaca ccatcgacaa tgtgaaggcg aagatccagg 2580 acaaggaggg catccccccg gaccagcagc gtctcatctt cgccggcaag cagctggagg 2640 atggccgcac cctggcagac tacaacatcc agaaggagtc cactctccac ctggtgctcc 2700 gtctccgtgg tggccagtaa tcctgggcca tgaagctgtc cttccaggtt cacaagtctg 2760 gtgccttctt ctgtccctcc gatggagatt atctgcatgt cgtggtcgtg tcctgatcga 2820 atcctcgttg aatccctatg tttttcttca agaaatgtga gtcctatgtc agtctggttg 2880 cgtttgtgaa catttctgct gctgagcagc actttggctg gaactgtgca atgaaataaa 2940 tggaaccctg gtttctggtt atgtgtgtgt tagctaatgt ttttgaagtg gaagctctaa 3000 tcttctatcg cgttgctact acaattctgc ttgtgttttg atgttcttgg tttctgttag 3060 ttggttcaga ggaagttttg cttccacaga ctaagatgca gttgaacttt ggttgccctg 3120 gtttctagat ttcatttgtg ctggttgagt gatagtaaga aacaaccggt gttcacatat 3180 aatcaggttt tgtgctgctc gagtgatcgt caaaaaccac cggtgttcac atctaaaaag 3240 gtttcgatcc ccaggtttag atctcccgtt taattccaaa aaaaaagttc tgtgtacttg 3300 catttagttg ggtggttgat gctggaaaga gtaactttca agagtaataa tctttggtga 3360 ctactctgtt tcaactgatc aatccctagg aaaggtacac ctttacttag ggaagaaatt 3420 cttagaacct tgcactttgt ttcaactgat aatagtatac tttattagat aaaaaatatt 3480 cagatatatt agacaccgga tgtcatccac tcatccttac aaacctctgt catggtcctg 3540 cagaaatgtt tgccagctcc agtggcttcc tgataaatct gtggagtgcc tgttaatcgg 3600 ctgccaattt ttgctgagca ctgtatatat gttagtaagt actattgggc caccaattcg 3660 attttgacac agcactattg gtccaccaat tcgattctga cacagcactg cataatttga 3720 aacgtgttgc tccattttgc aaggctacaa atttagatca tgtttagcat tctgtgggat 3780 acaatatatg gatatcgaac aaacttggta tgtcagagaa aaaatagttt attttcaaaa 3840 ctaacatttt taaagccttc tatgaacttt aaaccttcag catttgggat caagatgagt 3900 gctcgaacaa gagtgcactt tttctccaaa ataatctact acagagttct tttttatata 3960 taaaaaaact tatacttaac agataaatca gactttttct gctccatatc accttgacaa 4020 atcaaagaag cagcaccagc gaagggtatt attattgagg taaatataag atctcgttta 4080 ctgaaaaaga ccgcgtgttt acctaaacta ccattttgct ttgatagcag catacatgtg 4140 atagaattgc ggatcctacc gtgctgactg tgaaggtggt aggggtgaga gattggtggg 4200 cgaggtctga acgagcgaga acagtactgc atttactgtt cacaaggagg cggcttaggt 4260 tttgggtctc ccagctctct aagggaagct gagaattatg attctcttgc ttaattattt 4320 cttaaccaaa gttataaata tatagcctat gagatcctaa tttatggaaa taactaaact 4380 attttaagga aatatataaa tagataatca gcccactaac gggcctagcg cccactaaca 4440 ggcctggtgc tgagcccgac ataacatctc tccccgcctg gagaaacagc tcgtcctcga 4500 gctgaaatct ggtagaagca tcatcaacca acaccggggt catgctggaa cactgcatca 4560 ggcgctaccg cagctggtac gtcgtcgtcg aggaagtcag ccgactccaa gtagaacagt 4620 cgcttacact gatgtccgcg gacgtagggc tcatcacaat tgtaacaaag cccttgacga 4680 cggcactcca acagctcctc cggtgtgaga cgacgaaacg agggagctag agcgggtagt 4740 ggcgcgggaa cagccagtgt agcgcctgta gtcaccgagg gatggggcgg tcgggcgccg 4800 cgaggctgcg gtgcaggtgg aggtttcaca ttcctcaaac gcccgtgcca agtacatggc 4860 ggactggagg tcgggcggtg cgcggagctg aacctgctta cggaggtggt ccggcagccc 4920 acccacgtac aactccgcct tttggcgagc ggagaggttg tgggcatggc acaggacggc 4980 gttgtaacgc tccgagtaat cctgaacgga agaaccaaaa ggaaggcggg caagctccgc 5040 caaccgagtg cccaaaacag gaggcccgaa gcgaagcgag cataattcgc ggaagcgctc 5100 ccaaggaggc ataccctcgt cttgctccag ggcgtagtac catgtctggg caacaccccg 5160 aagatggtag gacgcgagcc atgtgcgagc ggaggcgagc gtctgctggc cgcggaagaa 5220 ctgctcgcac tggttcaacc aattcagggg atcggtcgaa ccgtcgtacg tagggaactc 5280 cagtttgtag aatttgggcc ccgcctgggc gcctgcgagg gcagcagcaa gggctgggtc 5340 gagccccccc tgcggctggc cgcccgaggg agagggcgcc cgaagaacag cggcccgtcc 5400 acccccccga aaagagtgct ggcggggggt agggaagaca tcgttgtcgc cgccgccgtc 5460 gtgtaggctg tcggggaagg cgacgtggcc atcgagtaga tccggggaat tc 5512 <210> 6 <211> 305 <212> PRT
<213> Saccharum Hybrid Cultivar H32-8560 <400> 6 Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Xaa Ile Asp Xaa Val Xaa Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Xaa Asp Tyr Asn Ile Gln Lys Glu Ser Thr Xaa His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Xaa Glu Ser Xaa Xaa His Leu Val Leu Arg Xaa Arg Gly Gly Met Gln Ile Phe Val Lys Thr Xaa Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Xaa Asp Asn Val Lys Xaa Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Xaa Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gln <210> 7 <211> 1802 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 7 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acaaccgcga cgcaaccgcc tctcgccgct gggccccaca ccgctcggtg ccgtagcctc 300 acgggactct ttctccctcc tcccccgtta taaattggct tcatcccctc cttgcctcat 360 ccatccaaat cccagtcccc aatcccatcc cttcgtcgga gaaattcatc gaagcgaagc 420 gaatcctcgc gatcctctca aggtactgcg agttttcgat ccccctctcg acccctcgta 480 tgtttgtgtt tgtcgtacgt ttgattaggt atgctttccc tgtttgtgtt cgtcgtagcg 540 tttgattagg tatgctttcc ctgttcgtgt tcatcgtagt gtttgattag gtcgtgtgag 600 gcgatggcct gctcgcgtcc ttcgatctgt agtcgatttg cgggtcgtgg tgtagatctg 660 cgggctgtga tgaagttatt tggtgtgatc tgctcgcctg attctgcggg ttggctcgag 720 tagatatgga tggttggacc ggttggttcg tttaccgcgc tagggttggg ctgggatgat 780 gttgcatgcg ccgttgcgcg tgatcccgca gcaggacttg cgtttgattg ccagatctcg 840 ttacgattat gtgatttggt ttggacttat tagatctgta gcttctgctt atgttgccag 900 atgcgcctac tgctccatat gcctgatgat aatccataaa tggcagtgga aatcaactag 960 ttgattgcgg agtcatgtat cagctacagg tgtagggact agctacaggt gtagggactg 1020 cgtctaattg tttggtcctt aactcatgtg caattatgca atttagttta gatgtttgtt 1080 ccaatcatct aggctgtaaa agggacactg gttagattgc tgtttaatct ttttagtaga 1140 ttatattata ttggtaactt attaacccta ttacatgcca taacgtggat tctgctcatg 1200 cctgatgata atcatagatc actgtggaat taattagttg attgttgaat catgtttcat 1260 gtacatacca cggcacaatt gcttagttcc ttaacaaatg caaattttac tgatccatgt 1320 atgatttgcg tggttctcta atgtgaaata ctatagctac ttgttagtaa gaatcaggtt 1380 cgtatgctta atgctgtatg tgccttctgc tcatgcctga tgataatcat atatcactgg 1440 aattaattag ttgatcgttt aatcatatat caagtacata ccatggcaca atttttagtc 1500 acttaaccca tgcagattga actggtccct gcatgttttg ctaaattgtt ctattctgat 1560 tagaccatat atcaggtatt tttttttggt aatggttctc ttattttaaa tgctatatag 1620 ttctggtact tgttagaaag atctggttca tagtttagtt gcctatcctt cgaattagga 1680 tgctgagcag ctgatcctat agctttgttt catgtatcaa ttcttttgtg ttcaacagtc 1740 agtttttgtt agattcattg taacttatgt tcgcttactc ttctggtcct caatgcttgc 1800 ag 1802 <210> 8 <211> 5174 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 8 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacccaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 kgtatcaatt cttgtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcagat gcagattttc gttaagaccc tcactggcaa 3720 gaccatcacc cttgaggttg agtcctcaga cactattgac aatgtcaagg ctaagatcca 3780 ggacaaggaa ggcattcctc cagatcagca gaggctgaty tttgctggca agcagctcga 3840 ggatggccgt accctagctg actacaacat ccagaaggag tccaccctcc acctggtgct 3900 caggcttagg ggaggcatgc agattttcgt caagaccctc actggcaaga ctatcacgct 3960 tgaggtcgag tcttctgaca cgatcgacaa cgtgaaggcc aagatccagg acaaggaggg 4020 aatccccccg gaccagcagc gtytcatttt cgctggcaag cagctcgagg atggccgcac 4080 cctcgctgac tacaacatcc agaaggagtc gactctccac cttgtgctca ggctcagggg 4140, tggcatgcag atcttcgtca agaccctcac tggcaagacc atcaccttgg aggtggagtc 4200 ctcggacacc attgacaatg tgaaggcgaa gatccaggac aaggagggca tccccccgga 4260 ccagcagcgt ctcatyttcg ccggcaagca rcttgaggat ggccgcaccc ttgcgganta 4320 caacatccag aargagtcca cccttcacct ggtgctccgc cttcgtggtg gtatgcagat 4380 tttcgtcaag accctcaccg gcaagaccat caccctggag gtggagtcct ctgacaccat 4440 tgacaatgtg aaggcgaaga tccaggataa ggagggcatc cccccggacc agcagcgtyt 4500 tatctttgct ggcaagcagc ttgaggatgg ccgcaccctg gcagantaca acatccagaa 4560 ggagtccacc cttcacctgg tgctccgcct tcgcggtggt atgcagatyt tcgtcaagac 4620 cctcaccggc aagaccatca ccctggaggt ggagtcctct gacaccatcg acaatgtgaa 4680 ggcgaagatc caggacaagg agggcatccc cccggaccag cagcgtctca tcttcgccgg 4740 caagcagctg gaggatggcc gcaccctggc agactacaac atccagaagg agtccactct 4800 ccacctggtg ctccgtctcc gtggtggcca gtaagtcctg ggccatgagc agctgtcctt 4860 ccagggttca caagtagtgg tgccttcttn ctgtccctcc gatggagatt atctgcatgt 4920 cgtggtcgtg tcctgatcga gtcgtcgttg agtccctatg ttttttcttc aagaaatgtg 4980 agtcctatgt cagtctggtt gcgtttgtga acattttctg ctgctgcgca gcagtttggt 5040 tggaactgtg caatgaaata aattgaaccc tggtttctgg ttatgtgtgt tagctaatgt 5100 ttttgaagtg gaagctntaa tcttntatcg cgttgctact acaattctgn ttgtgttttg 5160 atgttcttgt ttct 5174 <210> 9 <211> 381 <212> PRT
<213> Saccharum Hybrid Cultivar H32-8560 <400> 9 Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Giy Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe 65 70 75 g0 Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Xaa Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Xaa Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Xaa Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Xaa Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gln <210> 10 <211> 3688 <212> DNA
<213> Saccharum Hybrid Cultivar 32-8560 <400> 10 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacccaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 kgtatcaatt cttgtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcag 3688 <210> 11 <211> 2146 <212> DNA
<213> Lycopersicon esculentum <400> 11 agatctacaa ttatcngcaa cgtgttacac attttgtgct acaatatacc ttcaccattt 60 tgtgtatata taaaggttgc atctcttcaa acaaaaatca ctccatcaca acacaatgtc 120 ttcttcttct tctattacta ctactcttcc tttatgcacc aacaaatccc tctcttcttc 180 is i m cttcaccacc accaactcat ccttgttatc aaaaccctct caacttttcc tccacggaag 240 gcgtaatcaa agtttcaagg tttcatgcaa cgcaaacaac gttgacaaaa accctgacgc 300 tgttgataga cgaaacgttc ttttagggtt aggaggtctt tatggtgcag ctaatcttgc 360 accattagcg actgctgcac ctataccacc tcctgatctc aagtcttgtg gtactgccca 420 tgtaaaagaa ggtgttgatg taatatacag ttgttgccct cctgtacccg atgatatcga 480 tagtgttccg tactacaagt tcccttctat gactaaactc cgcatccgcc cccctgctca 540 tgcggcggat gaggagtacg tagccaagta tcaattggct acgagtcgaa tgagggaact 600 tgataaagac ccctttgacc ctcttggctt taaacaacaa gctaatattc attgtgctta 660 ttgcaacggt gcttacaaag ttggtggcaa agaattgcaa gttcatttct cgtggctttt 720 ctttcccttt catagatggt acttgtactt ttacgaaaga attttgggat cacttattaa 780 tgatccaact tttgctttac cttactggaa ttgggatcat ccaaaaggca tgcgtatacc 840 tcccatgttt gatcgtgagg gatcatctct ttacgatgag aaacgtaacc aaaatcatcg 900 caatggaact attattgatc ttggtcattt tggtaaggaa gttgacacac ctcagctaca 960 gataatgact aataatttaa ccctaatgta ccgtcaaatg gttactaatg ctccttgccc 1020 ttcccaattc ttcggtgctg cttacctctg ggttctgaac ccaagtccgg gtcagggtac 1080 tattgaaaac atccctcata ctccggttca catctggacc ggtgacaaac ctcgtcaaaa 1140 aaacggtgaa gacatgggta atttctactc agccggttta gatccgattt tttactgcca 1200 ccatgccaat gtggacagga tgtggaatga atggaaatta attggcggga aaagaaggga 1260 tttaacagat aaagattggt tgaactctga attctttttc tacgatgaaa atcgtaaccc 1320 ttaccgtgtg aaagtccgtg atgttttgga cagtaaaaaa atgggattcg attacgcgcc 1380 aatgcccact ccatggcgta attttaaacc aatcagaaag tcatcatcag gaaaagtgaa 1440 tacagcgtca attgcaccag ttagcaaggt gttcccattg gcgaagctgg accgtgcgat 1500 ttcgttctct atcacgcggc cagcctcgtc aaggacaaca caagagaaaa atgagcagga 1560 ggagattctg acattcaata aaatatcgta tgatgatagg aactatgtaa ggttcgatgt 1620 gtttctgaac gtggacaaga ctgtgaatgc agatgagctt gataaggcgg agtttgcagg 1680 gagttatact agcttgccgc atgttcatgg aagtaatact aatcatgtta ccagtgttac 1740 tttcaagctg gcgataactg aactgttgga ggatattgga ttggaagatg aagatactat 1800 cgcggtgact ttaattccaa aagctggcgg tgaaggtgta tccattgaaa gtgtggagat 1860 caagcttgag gattgttaaa gtctgcatga gttggtggct atggagccaa atttatgttt 1920 aattagtata attatgtgtg gtttgagtta tgttttatgt taaaatgtat cagctcgatc 1980 gatagctgat tgctagttgt gttaatgcta tgtatgaaat aaataaatgg ttgtcttcca 2040 ttcagtttat cattttttgt cattctaatt aacggttaac ttttttttct actatttata 2100 cgaagctact atactatgta tatcatttgg aaaattatat attatt 2146 <210> 12 <211> 3509 <?,12 > DNA
<213> Zea mays <400> 12 gaattccggc gtgggcgctg ggctagtgct cccgcagcga gcgatctgag agaacggtag 60 agttccggcc gggcgcgcgg gagaggagga gggtcgggcg gggaggatcc gatggccggg 120 aacgagtgga tcaatgggta cctggaggcg atcctcgaca gccacacctc gtcgcggggt 180 gccggcggcg gcggcggcgg gggggacccc aggtcgccga cgaaggcggc gagcccccgc 240 ggcgcgcaca tgaacttcaa cccctcgcac tacttcgtcg aggaggtggt caagggcgtc 300 gacgagagcg acctccaccg gacgtggatc aaggtcgtcg ccacccgcaa cgcccgcgag 360 cgcagcacca ggctcgagaa catgtgctgg cggatctggc acctcgcgcg caagaagaag 420 cagctggagc tggagggcat ccagagaatc tcggcaagaa ggaaggaaca ggagcaggtg 480 cgtcgtgagg cgacggagga cctggccgag gatctgtcag aaggcgagaa gggagacacc 540 atcggcgagc ttgcgccggt tgagacgacc aagaagaagt tccagaggaa cttctctgac 600 cttaccgtct ggtctgacga caataaggag aagaagcttt acattgtgct catcagcgtg 660 catggtcttg ttcgtggaga aaacatggaa ctaggtcgtg attctgatac aggtggccag 720 gtgaaatatg tggtcgaact tgcaagagcg atgtcaatga tgcctggagt gtacagggtg 780 gacctcttca ctcgtcaagt gtcatctcct gacgtggact ggagctacgg tgagccaacc 840 gagatgttat gcgccggttc caatgatgga gaggggatgg gtgagagtgg cggagcctac 900 attgtgcgca taccgtgtgg gccgcgggat aaatacctca agaaggaagc gttgtggcct 960 tacctccaag agtttgtcga tggagccctt gcgcatatcc tgaacatgtc caaggctctg 1020 ggagagcagg ttggaaatgg gaggccagta ctgccttacg tgatacatgg gcactatgcc 1080 gatgctggag atgttgctgc tctcctttct ggtgcgctga atgtgccaat ggtgctcact 1140 ggccactcac ttgggaggaa caagctggaa caactgctga agcaagggcg catgtccaag 1200 gaggagatcg attcgacata caagatcatg aggcgtatcg agggtgagga gctggccctg 1260 gatgcgtcag agcttgtaat cacgagcaca aggcaggaga ttgatgagca gtggggattg 1320 tacgatggat ttgatgtcaa gcttgagaaa gtgctgaggg cacgggcgag gcgcggggtt 1380 agctgccatg gtcgttacat gcctaggatg gtggtgattc ctccgggaat ggatttcagc 1440 aatgttgtag ttcatgaaga cattgatggg gatggtgacg tcaaagatga tatcgttggt 1500 ttggagggtg cctcacccaa gtcaatgccc ccaatttggg ccgaagtgat gcggttcctg 1560 accaaccctc acaagccgat gatcctggcg ttatcaagac cagacccgaa gaagaacatc 1620 actaccctcg tcaaagcgtt tggagagtgt cgtccactca gggaacttgc aaaccttact 1680 ctgatcatgg gtaacagaga tgacatcgac gacatgtctg ctggcaatgc cagtgtcctc 1740 accacagttc tgaagctgat tgacaagtat gatctgtacg gaagcgtggc gttccctaag 1800 catcacaatc aggctgacgt cccggagatc tatcgcctcg cggccaaaat gaagggcgtc 1860 ttcatcaacc ctgctctcgt tgagccgttt ggtctcaccc tgatcgaggc tgcggcacac 1920 ggactcccga tagtcgctac caagaatggt ggtccggtcg acattacaaa tgcattaaac 1980 aacggactgc tcgttgaccc acacgaccag aacgccatcg ctgatgcact gctgaagctt 2040 gtggcagaca agaacctgtg gcaggaatgc cggagaaacg ggctgcgcaa catccacctc 2100 tactcatggc cggagcactg ccgcacttac ctcaccaggg tggccgggtg ccggttaagg 2160 aacccgaggt ggctgaagga cacaccagca gatgccggag ccgatgagga ggagttcctg 2220 gaggattcca tggacgctca ggacctgtca ctccgtctgt ccatcgacgg tgagaagagc 2280 tcgctgaaca ctaacgatcc actgtggttc gacccccagg atcaagtgca gaagatcatg 2340 aacaacatca agcagtcgtc agcgcttcct ccgtccatgt cctcagtcgc agccgagggc 2400 acaggcagca ccatgaacaa atacccactc ctgcgccggc gccggcgctt gttcgtcata 2460 gctgtggact gctaccagga cgatggccgt gctagcaaga agatgctgca ggtgatccag 2520 gaagttttca gagcagtccg atcggactcc cagatgttca agatctcagg gttcacgctg 2580 tcgactgcca tgccgttgtc cgagacactc cagcttctgc agctcggcaa gatcccagcg 2640 accgacttcg acgccctcat ctgtggcagc ggcagcgagg tgtactatcc tggcacggcg 2700 aactgcatgg acgctgaagg aaagctgcgc ccagatcagg actatctgat gcacatcagc 2760 caccgctggt cccatgacgg cgcgaggcag accatagcga agctcatggg cgctcaggac 2820 ggttcaggcg acgctgtcga gcaggacgtg gcgtccagta atgcacactg tgtcgcgttc 2880 ctcatcaaag acccccaaaa ggtgaaaacg gtcgatgaga tgagggagcg gctgaggatg 2940 cgtggtctcc gctgccacat catgtactgc aggaactcga caaggcttca ggttgtccct 3000 ctgctagcat caaggtcaca ggcactcagg tatctttccg tgcgctgggg cgtatctgtg 3060 gggaacatgt atctgatcac cggggaacat ggcgacaccg atctagagga gatgctatcc 3120 gggctacaca agaccgtgat cgtccgtggc gtcaccgaga agggttcgga agcactggtg 3180 aggagcccag gaagctacaa gagggacgat gtcgtcccgt ctgagacccc cttggctgcg 3240 tacacgactg gtgagctgaa ggccgacgag atcatgcggg ctctgaagca agtctccaag 3300 acttccagcg gcatgtgaat ttgatgcttc ttttacattt tgtccttttc ttcactgcta 3360 tataaaataa gttgtgaaca gtaccgcggg tgtgtatata tatattgcag tgacaaataa 3420 aacaggacac tgctaactat actggtgaat atacgactgt caagattgta tgctaagtac 3480 tccatttctc aatgtatcaa tcggaattc 3509
SDS at room temperature when a DNA probe of about 100 to about 1000 nucleotides in length is employed.
High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of SX SSPE, 1 % SDS, SX Denhardt's reagent and 100 ~tg/ml denatured salmon sperm DNA
followed by washing in a solution comprising 0.1 X SSPE, and 0.1 % SDS at 68°C when a probe of about 100 to about 1000 nucleotides in length is employed.
The term "equivalent" when made in reference to a hybridization condition as it relates to a hybridization condition of interest means that the hybridization condition and the hybridization condition of interest result in hybridization of nucleic acid sequences which have the same range of percent (%) homology. For example, if a hybridization condition of interest results in hybridization of a first nucleic acid sequence with other nucleic acid sequences that have from 50% to 70% homology to the first nucleic acid sequence, then another hybridization condition is said to be equivalent to the hybridization condition of interest if this other hybridization condition also results in hybridization of the first nucleic acid sequence with the other nucleic acid sequences that have from 50% to 70%
homology to the first nucleic acid sequence.
When used in reference to nucleic acid hybridization the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions:
factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above-listed conditions.
Those skilled in the art know that whereas higher stringencies may be preferred to I S reduce or eliminate non-specific binding between the nucleotide sequence of SEQ ID NOs: l and 10 and other nucleic acid sequences, lower stringencies may be preferred to detect a larger number of nucleic acid sequences having different homologies to the nucleotide sequence of SEQ ID NOs:I and 10.
The term "promoter," "promoter element," or "promoter sequence" as used herein, refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA. A
promoter is typically, though not necessarily, located 5' (i. c., upstream) of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription.
Promoters may be tissue specific or cell specific. The term "tissue specific"
as it applies to a promoter refers to a promoter that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., petals) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., roots). Tissue specificity of a promoter may be evaluated by, for example, operably linking a reporter gene to the promoter sequence to generate a reporter construct, introducing the reporter construct into the genome of a plant such that the reporter construct is integrated into every tissue of the resulting transgenic plant, and detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues of the transgenic plant. The detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression of the reporter gene in other tissues shows that the promoter is specific for the tissues in which greater levels of S expression are detected. The term "cell type specific" as applied to a promoter refers to a promoter which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression of the same nucleotide sequence of interest in a different type of cell within the same tissue. The term "cell type specific" when applied to a promoter also means a promoter capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue. Cell type specificity of a promoter may be assessed using methods well known in the art, e.g., immunohistochemical staining. Briefly, tissue sections are embedded in paraffin. and paraffin sections are reacted with a primary antibody which is specific for the polypeptide product encoded by the nucleotide sequence of interest whose expression is controlled by the promoter. A labeled (e.g., peroxidase conjugated) secondary antibody which is specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy.
Promoters may be constitutive or regulatable. The term "constitutive" when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, light, etc. ). Typically, constitutive promoters are capable of directing expression of a transgene in substantially any cell and any tissue. In contrast, a "regulatable" promoter is one which is capable of directing a level of transcription of an operably linked nuclei acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, light, etc.) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus.
The terms "infecting" and "infection" with a bacterium refer to co-incubation of a target biological sample, (e.g., cell, tissue, etc.) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells of the target biological sample.
The term "Agrobacterium" refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall. The term "Agrobacterium"
includes, but is not limited to, the strains Agrobacterium tumefaciens, (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine etc.) by the infected cell.
Thus, Agrobacterium strains which cause production of nopaline (e.g.' strain LBA4301, C58, A208) are referred to as "nopaline-type" Agrobacteria; Agrobacterium strains which cause production of octopine (c~.~~. ~ strain LBA4404, AchS, B6) are referred to as "octopine-type"
Agrobacteria; and Agrobacterium strains which cause production of agropine (e.g., strain EHA10~, EHA101, A281 ) are referred to as "agropine-type" Agrobacteria.
The terms "bombarding, "bombardment," and "biolistic bombardment" refer to the process of accelerating particles towards a target biological sample (e.g., cell, tissue, etc.) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample. Methods for biolistic bombardment are known in the art (e.g., U.S. Patent No. 5,584,807, the contents of which are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile I S accelerator (PDS-1000/He) (BioRad).
The term "microwounding" when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle bombardment as described herein.
The term "plant" as used herein refers to a plurality of plant cells which are largely differentiated into a structure that is present at any stage of a plant's development. Such structures include, but are not limited to, a fruit, shoot, stem, leaf, flower petal, etc. The term "plant tissue" includes differentiated and undifferentiated tissues of plants including, but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various types of cells in culture (e. g., single cells, protoplasts, embryos, callus, protocorm-like bodies, etc.). Plant tissue may be in plants, in organ culture, tissue culture, or cell culture.
DESCRIPTION OF THE INVENTION
The present invention provides nucleic acid sequences having promoter activity. The nucleic acid sequences provided herein direct constitutive expression of operably linked nucleotide sequences in cells, tissues and organs of monocotyledonous and dicotyledonous plants. The promoter sequences provided herein were discovered by screening highly expressed sugarcane polyubiquitin genes. The sequences of the invention are capable of driving expression of operably linked nucleotide sequences in plant cells at a level which is comparable to or greater than those levels expressed under the control of the prior art's maize polyubiquitin promoter sequences. Also provided by the invention are methods for constitutive expression of a nucleic acid sequence of interest in plants. The nucleic acid sequences and methods of the invention allow generation of transgenic plants which exhibit S agronomically desirable characteristics.
The invention is further described under (A) Sugarcane Polyubiquitin Promoter Sequences, (B) Using Probes To Identify And Isolate Homologs of The Sugarcane Promoter Sequences, (C) Using Primers to Amplify Nucleotide Sequences, and (D) Generating Transgenic Plants.
A. Sugarcane Polyubiquitin Promoter Sequences Ubiquitin involvement has been shown in protein turnover, heat shock response, and many other important cellular processes [Hershko et al., Annual Review of Biochemistry 61, 761-808 (1992)]. Consistent with its essential biological roles, the structure of the protein is I S very highly conserved in all eukaryotes [Callis et al., Genetics 139, 921-39 (1995); Callis c~t al., Oxford Surv Plant Mol Cell Biol 6, 1-30 (1989); Sun et al., Plant J 11, 1017-27 (1997)].
Many genes encoding ubiquitin contain various numbers of tandem repeats of the entire protein coding region and hence are called polyubiquitin genes. The primary translation product is a polyprotein, which is processed to form ubiquitin monomers post-translationally.
Ubiquitin protein is abundant throughout the plant body, and several polyubiquitin genes have been shown to be expressed in most or all cell types under most or all environmental conditions [Kawalleck et al., Plant Mol Biol 21, 673-84 (1993)].
Numerous other polyubiquitin genes, however, are expressed in a tissue-specific manner [Callis et al., Proc Natl Acad Sci U S A 91, 6074-7 (1994); Plesse et al., Mol Gen Genet 254, ( 1997)), or in response to environmental signals such as heat stress [Christensen et al., Plant Molecular Biology 12, 619-632 ( 1989); Liu et al., Biochem Cell Biol 73, 19-30 ( 1995)], or both [Almoguera et al., Plant Physiology 107, 765-773 (1995); Binet et al., Plant Mol Biol 17, 395-407 ( 1991 ); Burke et al., Mol Gen Genet 213, 43 S-43 ( 1988);
Garbarino et al., Plant Mol Biol 2U, 235-44 ( 1992); Genschik et al., Gene 148, 195-202 (1994); Sun et al. ( 1997) supra; Takimoto et ul., Plant Mol Biol 26, 1007-12 (1994)). Several polyubiquitin promoters have been isolated and used to drive transgene expression [Christensen et al., Plant Mol Biol 18, 675-89 (1992); Garbarino et al., Plant Physiology 109, 1371-1378 (1995)), including some which have been widely used for constitutive expression of genes in plant transformation [Christensen et al., Transgenic Research 5, 213-218 ( 1996):
Gallo-Meagher et ul., Plant Cell Reporter 12, 666-670 (1993); Garbarino et al. (1995) supra:
Taylor et al., Plant Cell Reports 12, 491-495 (1993); Quail et al., U.S. Patent Numbers 5.614,399 and 5,510,474].
S All Saccharum species are polyploid and most are at least octoploid (2N=40-128 or more). Commercial sugarcane cultivars are all Saccharum species hybrids derived from 2N+N chromosome transmission [Sreenivasan et al.: Cytogenics. In: Heinz DJ
(ed) Sugarcane Improvement Through Breeding, pp. 211-253. Elsevier, Amsterdam (1987)]. This suggests that sugarcane hybrids may have at least twelve alleles for each of the multiple genes that make up the polyubiquitin gene family.
The nucleic acid sequences of the invention were discovered during a search by the inventors for promoters which are suitable for high-level constitutive transgene expression in monocotyledonous and dicotyledonous plants. The inventors isolated five polyubiquitin cDNA clones (scubi221, 241, 511, 561, and 5121) from sugarcane stem tissue [Albert et al., Plant Physiology 109, 337 (1995)]. Based on comparison of their 3' untranslated sequences, the inventors then grouped these five genes into four "sub-families." The inventors' investigation of the expression of two members of these four sub-families and isolated genomic clones, led to isolation of clones which contained promoters for two members of the most highly expressed sub-family.
The invention provides the nucleic acid sequence of two members of the sugarcane polyubiquitin family, the ubi4 gene and the ubi9 gene. Referring to the ubi4 gene, the initially determined nucleic acid sequence (SEQ ID NO:1 ) of the ubi4 gene including the translation start codon (ATG) and the sequence upstream of the translation start codon is shown in Figure 3A, and the nucleic acid sequence (SEQ ID N0:2) of the ubi4 gene including the translation stop codon and sequences downstram of the translation stop codon is shown in Figrue 3B. The subsequently determined nucleic acid sequence (SEQ ID
NO:S) of the entire ubi~ gene is shown in Figure SA with the subsequently determined nucleic acid sequence (SEQ ID N0:7) located upstream of the translation start codon of the ubi4 gene being shown in Figure 10. The nucleotide sequence of Figure SA represents nucleotides 1 to 5512 of the 5551 nucleotide sequence deposited as GenBank accession number AF093504.
Fragments of the ubi4 gene sequence which were identical in the initially determined and the subsequently determined nucleic acid sequences were as follows (the nucleotide numbers refer to the nucleotide number in SEQ ID NO:S): Fragment A: 1-242;
fragment B:
245-787; fragment C: 788-1020; fragment D: 1021-1084; fragment E: 1085-1168;
fragment F:
1169-1173; fragment G: 1174-1648; and fragment H: 1649-1805. The nucleotide sequence upstream of the transcription start codon of the ubi4 gene (i.e., nucleotides I-1810 of SEQ ID
NO:1, and nucleotides 1-1802 of SEQ ID NOs:S and 7) contained three regions:
(a) an upstream of the 5' UTR sequence (i.e., nucleotides 1-378 of SEQ ID NO:l, and nucleotides 1-377 of SEQ ID NOs:~ and 7), (b) a 5' UTR sequence (i.e.. nucleotides 379-444 of SEQ ID
NO:1. and nucleotides 378-442 of SEQ ID NOs:S and 7); and (c) an intron sequence (i.e., nucleotides 445-1810 of SEQ ID NO:I, and nucleotides 443-1802 of SEQ ID NOs:~
and 7).
With respect of the ubi9 gene, the initially determined nucleic acid sequence (SEQ ID
N0:3) of the ubi9 gene including the translation start codon and sequences upstream thereof is shown in Figure 7A and the initially determined nucleic acid sequence (SEQ
ID N0:4) of the ubi9 gene including the translation stop codon and sequences downstream thereof is shown in Figure 7B. The subsequently determined nucleic acid sequence (SEQ ID
N0:8) of the entire a~bi9 gene is shown in Figure 8A, with the nucleic acid sequence (SEQ ID NO:10) I S of the ubi9 gene upstream of the translation start codon being shown in Figure 11.
It is noted that while the 5' end of the ubi9 gene was obtained by cleavage with HindIII which recognizes the sequence 5'-AAGCTT-3', repeated sequencing of the 5'-end of the ubi9 gene showed instead the sequence 5'-AAGTTT-3' (Figures 8 and 11 ).
Thus, it is the inventors' view that the sequence of the full-length ubi9 gene (a) is as shown in Figure 8A (SEQ ID N0:8) (i.e., total number of nucleotides being 5174, with the ten nucleotides at the 5'-end being S'-AAGTTTTGnT-3'), (b) is as shown in Figure 8A (SEQ ID N0:8) with the exception that it has a total number of nucleotides of 5174, with the ten nucleotides at the 5'-end being 5'-AAGCTTTGnT-3', assuming substitution of the "T" at position 4 with a "C"), or (c) is as shown in Figure 8A (SEQ ID N0:8) with the exception that it has a total number of nucleotides of 5175, with the ten nucleotides at the 5'-end being 5'-AAGCTTTTGn-3', assuming insertion of a "C" at position 4. Accordingly, any reference herein to SEQ ID N0:8 is intended to mean each and every one of the three nucleotide sequences described in the preceding sentence.
Similarly, it is the inventors' view that the sequence upstream of the translation start codon of the ubi9 gene (a) is as shown in Figure 11 (SEQ ID NO: I 0) (i. e. , total number of nucleotides being 3688, with the ten nucleotides at the 5'-end being 5'-AAGTTTTGnT-3'), (b) is as shown in Figure 1 I (SEQ ID NO:10) with the exception that it has a total number of nucleotides of 3688, with the ten nucleotides at the S'-end being S'-AAGCTTTGnT-3', assuming substitution of the "T" at position 4 with a "C"). or (c) is as shown in Figure 11 (SEQ ID NO:10) with the exception that it has a total number of nucleotides of 3689, with the ten nucleotides at the 5'-end being 5'-AAGCTTTTGn-3', assuming insertion of a "C" at position 4. Accordingly, any reference herein to SEQ ID NO:10 is intended to mean each and every one of the three nucleotide sequences described in the preceding sentence.
Fragments of the ubi9 gene sequence which were identical in the initially determined and the subsequently determined nucleic acid sequences were as follows (the nucleotide number refers to the nucleotide number in SEQ ID N0:8): Fragment A: 1-3600:
fragment B:
3602-361?: and fragment C: 3614-3691. The nucleotide sequence upstream of the translation start codon of the ubi9 gene (i.e., nucleotides I-3691 of SEQ ID N0:3, and nucleotides 1-3691 of SEQ ID N0:8) contained three regions: (a) an upstream of the 5' UTR
sequence (i.e., nucleotides I-2248 of SEQ ID N0:3, and nucleotides 1-2248 of SEQ ID
NOs:8 and 10), (b) a S' UTR sequence (i.e., nucleotides 2249-2313 of SEQ ID N0:3, and nucleotides 2249-2313 of SEQ ID NOs:8 and 10), and (c) an intron sequence (i.e., nucleotides 2314-3688 of SEQ ID N0:3. and nucleotides 2314-3688 of SEQ ID NOs:B and 10). A BLAST search of the GenBank database showed 100% homology to only a 20-22 by region of nucleotides 1-3688 of SEQ ID NOs:8 and 10 (i. e, the region of the ubi9 gene which contained the sequence upstream of the 5' UTR, the 5' UTR sequence, and the intron sequence), 100%
homology to only a 20 by region of nucleotides 1-2248 of SEQ ID NOs:8 and 10 (i.e., the sequence upstream of the 5' UTR), and 87% homology between a 135 by fragment of the sequence upstream of the 5' UTR of SEQ ID NOs:8 and 10 and a 118 by fragment of Saccharum sp.
glucose transporter mRNA, 3' end (GenBank accession number L21752).
Data presented herein demonstrates that plasmids which contain the uid A gene encoding ~3-glucuronidase (GUS) under the control of SEQ ID N0:7 (which is equivalent to the initially determined SEQ ID NO:I) of the ubi4 gene or under the control of SEQ ID
NO:10 (which is equivalent to the initially determined SEQ ID N0:3) of the ubi9 gene successfully drive transient expression of GUS in monocotyledonous sugarcane suspension cultured cells (Example 3), monocotyledonous sorghum callus (Example 5), monocotyledonous pineapple leaves, protocorm-like bodies, roots and fruit (Example 6), and in dicotyledonous tobacco leaves (Example 4), as well as stable expression in monocotyledonous sugar cane callus (Example 7), monocotyledonous rice callus (Example 8), and dicotyledonous tobacco leaves (Example 9).
The present invention is not limited to SEQ ID NO:1 but specifically contemplates portions thereof. As used herein the term "portion" when made in reference to a nucleic acid sequence refers to a fragment of that sequence. The fragment may range in size from ten ( 10) contiguous nucleotide residues to the entire nucleic acid sequence minus one nucleic acid S residue. Thus. a nucleic acid sequence comprising "at least a portion of a nucleotide sequence comprises from ten ( 10) contiguous nucleotide residues of the nucleotide sequence to the entire nucleotide sequence.
In a preferred embodiment, portions contemplated to be within the scope of the invention include, but are not limited to, portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases; the sequence upstream of the ~' UTR sequence (i.c~.. nucleotide sequence from position 1 to 377 of SEQ ID N0:7), the S' UTR
sequence (i.e.. nucleotides sequence from position 378 to 442 of SEQ ID N0:7), and the intron sequence (i.e., nucleotide sequence from position 443 to 1802 of SEQ ID N0:7).
In an alternative preferred embodiment, portions within the scope of the invention include portions 1 S larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, those sequences which are upstream of the translation start codon and which are identical in the initially determined and subsequently determined sequence of the ubi=J gene, and are exemplified by the nucleotide sequence from position 1 to 242, from position 24S to 787, from position 788 to 1020, from position 1021 to 1084, from position 1085 to 1168, from position 1169 to 1173, from position 1174 to 1648, and from position 1649 to 1802 of SEQ
ID N0:7. In yet another alternative preferred embodiment, the portion contains the 377 by sequence which is upstream of the S'UTR and which is highly homologous (>90%
identity) in both the ubi4 and ubi9 gene sequences, i.e., the nucleotide sequence from position 1 to 377 of SEQ ID N0:7.
2S It is contemplated that the present invention is not limited to SEQ ID N0:3 but specifically includes portions thereof. In a preferred embodiment, portions contemplated to be within the scope of the invention include, but are not limited to, portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, the sequence upstream of the S' UTR sequence (i. e., nucleotide sequence from position 1 to 2248 of SEQ
ID NO:10), the S' UTR sequence (i. e. , nucleotides sequence from position 2249 to 2313 of SEQ ID
NO:10), and the intron sequence (i.c~., nucleotide sequence from position 2314 to 3688 of SEQ ID NO:10). In an alternative preferred embodiment, portions within the scope of the invention include portions larger than 20 nucleotide bases, more preferably larger than 100 nucleotide bases, those sequences which are upstream of the translation start codon and which are identical in the initially determined and subsequently determined sequence of the ubi9 gene, and are exemplified by the nucleotide sequence from position 1 to 3600, from position 3602 to 3612, and from position 3614 to 3688 of SEQ ID NO:10. In yet another alternative preferred embodiment, the portion contains the sequence which is upstream of the transcription start codon and which is highly homologous (>90% identity) in both the ubi.~
and ubi9 gene sequences. i.c~., the nucleotide sequence from position 1671 to 2248 of SEQ ID
NO:10. This sequence includes the MITE (from position 1706 to 1906) which is not homologous to sequences in the polyubiquitin ubi4 promoter.
The sequences of the present invention are not limited to SEQ ID NOs:I and 10 and portions thereof, but also include homologs of SEQ ID NOs:I and 10, as well as portions of these homologs. A nucleotide sequence which is a "homolog" of SEQ ID NOs:I and 10 is defined herein as a nucleotide sequence which exhibits greater than 61 %
identity (but not 100% identity) to the sequence of SEQ ID NOs:I and 10, respectively.
The present invention also contemplates functioning or functional homologs of SEQ
ID NOs:I and 10. A "functional homolog" of SEQ ID NOs:I and 10 is defined as a nucleotide sequence having less than 100% homology with SEQ ID NOs:I and 10, respectively, and which has promoter activity having some or all the characteristics (e.g., constitutive promoter activity) of the promoter activity of SEQ ID NOs:I and 10, respectively. Homologs of SEQ ID NOs:I and 10, and of portions thereof, include, but are not limited to, nucleotide sequences having deletions, insertions or substitutions of different nucleotides or nucleotide analogs as compared to SEQ ID NOs:I and 10, respectively. Such homologs may be produced using methods well known in the art.
The invention also contemplates at least a portion of SEQ ID NOs:I and 10, and homologs thereof having promoter activity. The term "promoter activity" when made in reference to a nucleic acid sequence refers to the ability of the nucleic acid sequence to initiate transcription of an operably linked nucleotide sequence into mRNA.
The terms "operably linked," "in operable combination," and "in operable order" as used herein refer to the linkage of nucleic acid sequences in a manner such that a nucleic acid molecule is capable of directing the transcription of nucleic acid sequence of interest and/or the synthesis of a polypeptide sequence of interest.
Promoter activity may be determined using methods known in the art. For example, a candidate nucleotide sequence whose promoter activity is to be determined is ligated in-frame to a nucleic acid sequence of interest (c~.g., a reporter gene sequence, a selectable marker gene sequence) to generate a reporter vector, introducing the reporter vector into plant tissue using methods described herein, and detecting the expression of the reporter gene (e.g., detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by S the reporter gene). The reporter gene may express a visible markers.
Reporter gene systems which express visible markers include (3-glucuronidase and its substrate (X-Gluc), luciferase and its substrate (luciferin), and (3-galactosidase and its substrate (X-Gal) which are widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system [Rhodes CA et ul.
( 1995) Methods Mol Biol 55:121-131]. In a preferred embodiment, the reporter gene is a GUS
gene. The selectable marker gene may confer antibiotic or herbicide resistance. Examples of reporter genes include, but are not limited to, dhfr which confers resistance to methotrexate [Wigler M
et al., (1980) Proc Natl Acad Sci 77:3567-70J; npt, which confers resistance to the aminoglycosides neomycin and G-418 [Colbere-Garapin F et al., (1981) J. Mol.
Biol.
150:1-14] and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, respectively. Detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by the reporter gene or the selectable marker gene indicates that the candidate nucleotide sequence has promoter activity.
Sequences within a promoter which affect promoter activity may be determined by using deletion constructs such as those described by Sherri et al. for the determination of HSP70 intron alterations which impact transcription of genes operably linked thereto [U.S.
patent number 5,593,874, hereby incorporated by reference]. Briefly, several expression plasmids are constructed to contain a reporter gene under the regulatory control of different candidate nucleotide sequences which are obtained either by restriction enzyme deletion of internal sequences in SEQ ID NOs:I and 10, restriction enzyme truncation of sequences at the S' andlor 3' end of SEQ ID NOs:l and 10, or by the introduction of single nucleic acid base changes by PCR into SEQ ID NOs:I and 10. Expression of the reporter gene by the deletion constructs is detected. Detection of expression of the reporter gene in a given deletion construct indicates that the candidate nucleotide sequence in that deletion construct has promoter activity.
At the 3' end of the nucleic acid sequence of interest, other DNA sequences may also be included, e.g., a 3' untranslated region containing a polyadenylation site and transcription termination sites.
The present invention is not limited to sense molecules of SEQ ID NOs:I and 10 but contemplates within its scope antisense molecules comprising a nucleic acid sequence complementary to at least a portion (e.g., a portion greater than 100 nucletide bases in length and more preferably greater than 200 nucleotide bases in length) of the nucleotide sequence of SEQ ID NOs:I and 10. These antisense molecules find use in, for example, reducing or preventing expression of a gene whose expression is controlled by SEQ ID NOs:I
and 10.
The nucleotide sequence of SEQ ID NOs:I and 10, portions, homologs and antisense sequences thereof may be synthesized by synthetic chemistry techniques which are commercially available and well known in the art [see Caruthers MH et al., ( 1980) Nuc.
Acids Res. Symp. Ser. 215-223; Horn T. et al., ( 1980) Nuc. Acids Res. Symp.
Ser. 225-232].
Additionally. fragments of SEQ ID NOs:I and 10 can be made by treatment of SEQ
ID
NOs:I and 10 with restriction enzymes followed by purification of the fragments by gel electrophoresis. Alternatively, sequences may also produced using the polymerase chain reaction (PCR) as described by Mullis [U.S. Patent Nos. 4,683,195, 4,683,202 and 4,965,188, all of which are hereby incorporated by reference]. SEQ ID NOs:I and 10, portions, homologs and antisense sequences thereof may be ligated to each other or to heterologous nucleic acid sequences using methods well known in the art.
The nucleotide sequence of synthesized sequences may be confirmed using commercially available kits as well as using methods well known in the art which utilize enzymes such as the Klenow fragment of DNA polymerase I, Sequenase~', Tag DNA
polymerase, or thermostable T7 polymerase. Capillary electrophoresis may also be used to analyze the size and confirm the nucleotide sequence of the products of nucleic acid synthesis, restriction enzyme digestion or PCR amplification.
It is readily appreciated by those in the art that the sequences of the present invention may be used in a variety of ways. For example, these sequences are useful in directing the expression of polypeptide sequences in vitro and in vivo. In plants, this is useful in determining the role of the polypeptide in disease development as well an in producing transgenic plants with desirable agronomic characteristics as described below.
In addition, portions of the sequences of the invention can be used as probes for the detection and isolation of complementary DNA sequences, and for the amplification of nucleotide sequences as described below.
B. Using Probes To Identify And Isolate Homologs of The Sugarcane Promoter Sequences The invention provided herein is not limited to SEQ ID NO:I and 10, homologs thereof. and portions thereof. having promoter activity, but includes sequences having no promoter activity (i.e., non-functional homologs and non-functional portions of homologs).
This may be desirable, for example, where a portion of SEQ ID NOs:I and 10 is used as a probe to detect the presence of SEQ ID NOs:I and 10, respectively, or of portions thereof in a sample.
As used herein, the term "probe" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to a nucleotide sequence of interest. A
probe may be single-stranded or double-stranded. It is contemplated that any probe used in the present invention will be labelled with any "reporter molecule," so that it is detectable in any detection system including, but not limited to enzyme (e.g.. ELISA, as well as enzyme-I ~ based histochemical assays), fluorescent, radioactive, calorimetric, gravimetric, magnetic, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.
The probes provided herein are useful in the detection, identification and isolation of, for example, sequences such as those listed as SEQ ID NOs:I and 10 as well as of homologs thereof. Preferred probes are of sufficient length (e.g., from about 9 nucleotides to about 20 nucleotides or more in length) such that high stringency hybridization may be employed. In one embodiment, probes from 20 to 50 nucleotide bases in length are employed.
C. Using Primers to Amplify Nucleotide Sequences The invention provided herein is not limited to SEQ ID NO:1 and 10, homologs thereof, and portions thereof, having promoter activity, but includes sequences having no promoter activity. This may be desirable, for example, where a portion of the nucleic acid sequences set forth as SEQ ID NOs:I and 10 is used as a primer for the amplification of nucleic acid sequences by, for example, polymerise chain reactions (PCR) or reverse transcription-polymerise chain reactions (RT-PCR). The term "amplification" is defined as the production of additional copies of a nucleic acid sequence and is generally carried out using polymerise chain reaction technologies well known in the art [Dieffenbach CW and GS
Dveksler ( 1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview NY]. As used herein, the term "polymerise chain reaction" ("PCR") refers to the method of K.B. Mullis disclosed in U.S. Patent Nos. 4,683,19, 4,683,202 and 4,96.188.
all of which are hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA
without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerise. The two primers are complementary to their respective strands of the double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule.
Following annealing, the primers are extended with a polymerise so as to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerise extension can be repeated many times (i.e., denaturation, annealing and extension constitute one "cycle": there can be numerous "cycles") to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the "polymerise chain reaction"
(hereinafter "PCR"). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are the to be "PCR
amplified."
With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; and/or incorporation of'ZP-labeled deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any nucleotide sequence can be amplified with the appropriate set of primer molecules. In particular, the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications. Amplified target sequences may be used to obtain segments of DNA (e.g., genes) for the construction of targeting vectors, transgenes, etc.
As used herein, the term "primer" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i. e. , in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products.
Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long (e.g., from about 9 nucleotides to about 20 nucleotides or more in length) to prime the synthesis of extension products in the presence of the inducing agent.
Suitable lengths of the primers may be empirically determined and depend on factors such as temperature, source of primer and the use of the method. In one embodiment, the present invention employs probes from 20 to SO nucleotide bases in length.
The primers contemplated by the invention are useful in, for example, identifying sequences which are homologous to the sugarcane ubi4 and ubi9 gene sequences in plants and in other organisms.
D. Generating Transgenic Plants The present invention provides methods for constitutively expressing a nucleotide sequence of interest in a cell, tissue, organ, and/or organism. In one embodiment, the methods provided herein direct constitutive expression of a nucleotide sequence of interest in monocotyledonous and dicotyledonous plant cells. In one embodiment, this is accomplished by introducing into a plant cell a vector that contains a nucleotide sequence of interest operably linked to sequences provided herein which have promoter activity. The transformed plant cell is allowed to develop into a transgenic plant in which the nucleotide sequence of interest is preferably, though not necessarily, expressed in substantially every tissue. These steps are further described below for specific embodiments.
1. Expression Vectors For Plants In one embodiment, the methods of the invention involve transformation of monocotyledonous tissue (sugarcane suspension cultured cells, sugarcane callus, rice callus, maize embryos, pineapple leaves, protocorm-like bodies, roots and fruit, and sorghum callus) and dicotyledonous tissue (tobacco leaves, tomato plants, and soybean excised embryonic meristems) with expression vectors in which the ~i-glucuronidase (GUS) gene is under the transcriptional control of the exemplary sugarcane ubi-! gene promoter sequence (SEQ ID
NO:1) or of the exemplary sugarcane ubi9 gene promoter sequence (SEQ ID N0:3).
As used herein, the terms "vector" and "vehicle" are used interchangeably in reference to nucleic acid molecules that transfer DNA segments) from one cell to another. The term "expression vector" as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism.
The methods of the invention are not limited to the expression vectors disclosed herein. Any expression vector which is capable of introducing a nucleic acid sequence of interest into a plant cell is contemplated to be within the scope of this invention. Typically, expression vectors comprise the nucleic acid sequence of interest as well as companion sequences which allow the transcription of this sequence, and which allow cloning of the vector into a bacterial or phage host. The vector preferably, though not necessarily. contains an origin of replication which is functional in a broad range of prokaryotic hosts. A
1 S selectable marker is generally, but not necessarily, included to allow selection of cells bearing the desired vector.
In a preferred embodiment, the promoter sequence is SEQ ID NO:1 which is derived from the sugarcane ubi4 gene. In an alternative preferred embodiment, the promoter sequence is SEQ ID N0:3 which is derived from the sugarcane ubi9 gene.
However, the invention is not limited to the promoter sequences used herein. Any sequence which is a portion, homolog, or a homolog of a portion of SEQ ID NOs:I and 10 and which has promoter activity is contemplated to be within the scope of the invention.
In addition to a promoter sequence, the expression vector preferably contains a transcription termination sequence downstream of the nucleic acid sequence of interest to provide for efficient termination. Exemplary termination sequences include the nopaline synthase (NOS) termination sequence, and different fragments of the sugarcane ribulose-1,5-biphosphate carboxylase/oxygenase (rubisco) small subunit (scrbcs) gene. The termination sequences of the expression vectors are not critical to the invention. The termination sequence may be obtained from the same gene as the promoter sequence or may be obtained from different genes.
If the mRNA encoded by the nucleic acid sequence of interest is to be efficiently translated, polyadenylation sequences are also commonly added to the expression vector.
Examples of the polyadenylation sequences include, but are not limited to, the Agrobacterium WO 99/469?6 PCT/US99/05985 octopine synthase signal, or the nopaline synthase signal. Where it is preferred that the nucleic acid sequence of interest is not translated into a polypeptide (c~.g..
where the nucleic acid sequence of interest encodes an antisense RNA), polyadenylation signals are not necessary.
Vectors for the transformation of plant cells are not limited to the type or nature of the expressed genes disclosed herein. Any nucleic acid sequence of interest may be used to create transgenic plant cells, tissues, organs, and plants. Nucleic acid sequences of interest include sequences which encode a protein of interest. The terms "protein of interest" and "polypeptide of interest" refer to any protein or polypeptide, respectively.
the manipulation of which may be deemed desirable for any reason, by one of ordinary skill in the art.
For example, it may be desirable to express a nucleic acid sequence which encodes a polypeptide sequence having, for example, enzyme activity. One example of such an enzyme is the 1-aminocyclopropane-1-carboxylic acid (ACC) deaminase enzyme which metabolizes ACC in plant tissue thereby lowering the level of ethylene which is responsible for fruit ripening (U.S. Patent No. 5,512,466, the contents of which are hereby incorporated by reference).
Another enzyme which may be desirably expressed in a plant is the sucrose phosphate synthase enzyme which increases the level of sucrose in the fruit. The nucleic acid sequence of the gene encoding this enzyme is known [e. g., in maize; Worrell et al.
(1991) Plant Cell 3:1121-1130] (Figure 15) (SEQ ID N0:12) and has been assigned GenBank accession number m97550.
Yet another example of a suitable enzyme for use in this invention is EPSP
synthase (S-enolpyruvyl-3- phosphoshikimate synthase; EC:25.1.19) which is an enzyme involved in the shikimic acid pathway of plants. The shikimic acid pathway provides a precursor for the synthesis of aromatic amino acids essential to the plant. Specifically, EPSP
synthase catalyzes the conversion of phosphoenol pyruvate and 3-phosphoshikimic acid to 5-enolpyruvyl-3-phosphoshikimate acid. A herbicide containing N-phosphonomethylglycine inhibits the EPSP
synthase enzyme and thereby inhibits the shikimic acid pathway of the plant.
The term "glyphosate" is usually used to refer to the N-phosphonomethylglycine herbicide in its acidic or anionic forms. Novel EPSP synthase enzymes have been discovered that exhibit an increased tolerance to glyphosate containing herbicides. In particular, an EPSP synthase enzyme having a single glycine to alanine substitution in the highly conserved region having the sequence: -L-G-N-A-G-T-A- located between positions 80 and 120 in the mature wild-type EPSP synthase amino add sequence has been shown to exhibit an increased tolerance to glyphosate and is described in U.S. Patent No. 4.971,908, the teachings of which are hereby incorporated b}~ reference. Methods for transforming plants to exhibit glyphosate tolerance are discussed in U.S. Patent No. 4,940,835, incorporated herein by reference. A
glyphosate-tolerant EPSP synthase plant gene encodes a polypeptide which contains a chloroplast transit peptide (CTP) which enables the EPSP synthase polypeptide (or an active portion thereto) to be transported into a chloroplast inside the plant cell.
The EPSP synthase gene is transcribed into mRNA in the nucleus and the mRNA is translated into a precursor polypeptide (CTP/mature EPSP synthase) in the cytoplasm. The precursor polypeptide is transported into the chloroplast.
Additional examples of enzymes suitable for use in this invention are acetolactate synthase, RNase to impart male sterility [Mariani et al. ( 1990) Nature 347:
737-741 ], and wheat germ agglutinin.
Yet other examples of desirable nucleic acid sequence are those which encode the sweetness protein. The nucleic acid sequence for the gene encoding the sweetness protein is known in the art (see, e.g., U.S. Patent Application No. 08/670,186, the contents of which are herein incorporated by reference). Transformation of plants with the sweetness protein is useful in, for example, providing a base level of. sweetness in the fruit, thus reducing the effects of differences in fruit maturity by providing more uniform sweetness in different parts of the fruit.
Further examples of suitable proteins for use in this invention are Bacillus thuringiena~is (B.t.) crystal toxin proteins which when expressed in plants protect the plants from insect infestation because the insect, upon eating the plant containing the B.t. toxin protein either dies or stops feeding. B.t. toxin proteins which are toxic to either Lepidopteran or Coleopteran insects may be used. Examples of particularly suitable DNA
sequences encoding B.t. toxin protein are described in the EP patent application 385,962 entitled "Synthetic Plant Genes and Method for Preparation," published Sep. 5, 1990.
Alternatively, it may be desirable to express a nucleic acid sequence which encodes an antisense RNA that hybridizes with a genomic plant DNA sequence. For example, it may be of advantage to express antisense RNA which is specific for genomic plant DNA
sequences that encode an enzyme whose activity is sought to be decreased. Examples of DNA
sequences whose reduced expression may be desirable are known in the art including, but not limited to, the ethylene inducible sequences in fruit (U.S. Patent No.
5,545,815, the entire WO 99/46976 PCT/lJS99/05985 contents of which are herein incorporated by reference). Expression of antisense RNA which is homologous with these ethylene inducible sequences is useful in delaying fruit ripening and in increasing fruit firmness. Other DNA sequences whose expression may be desirably reduced include the ACC synthase gene which encodes the ACC synthase enzyme that is the first and rate limiting step in ethylene biosynthesis. Nucleic acid sequences for this gene have been described from a number of plant sources (e.g., Picton et al. (1993) The Plant J. 3:469-481: U.S. Patent Nos. 5,365,015 and 5,723,766, the contents of both of which are herein incorporated by reference). Expression of antisense RNA which hybridizes with ACC
synthase genomic sequences in plants may be desirable to delay fruit ripening.
Yet another sequence whose expression may be advantageously reduced is the genomic sequence encoding the enzyme polyphenol oxidase. This enzyme is involved in the browning reaction that occurs during chilling injury. Nucleic acid sequences encoding this enzyme have been previously described in the art (e.g., Shahar et al. (1992) Plant Cell 4:135-147), as shown in Figure 14 (SEQ ID NO:11 ) {GenBank accession number s40548).
The use of antisense polyphenol antisense sequences has been reported to inhibit polyphenol oxidase (PPO) gene expression and to inhibit browning [Bachem et al. ( 1994) Bio/Technology 12:1 I01-1105].
One of skill in the art knows that the antisense DNA segment to be introduced into the plant may include the full length coding region of the targeted gene or a portion thereof.
Complete homology between the nucleotide sequences of the antisense RNA and the targeted genomic DNA is not required. Rather, antisense DNA sequences which encode antisense RNA sequences that are partially homologous to a targeted genomic DNA sequence are contemplated to be within the scope of the invention so long as the antisense RNA sequences are capable of repressing expression of the target genomic DNA sequence.
The invention is not limited to vectors which express a single nucleic acid sequence of interest. Vectors which contain a plurality of (i.e., two or more) nucleic acid sequences under the transcriptional control of the same promoter sequence are expressly contemplated to be within the scope of the invention. Such vectors may be desirable, for example.
where the expression products of the plurality of nucleic acid sequences contained within the vector provide protection against different pathogens, and where simultaneous protection against these different pathogens is deemed advantageous.
Also included within the scope of this invention are vectors which contain the same or different nucleic acid sequences under the transcriptional control of different promoter sequences derived from SEQ ID NO:I, SEQ ID NO:10, and other sequences. Such vectors may be desirable to, for example, to control different levels of expression of different nucleic acid sequences of interest in plant tissues.
2. Transformation of Plant Cells Once an expression vector is prepared, transgenic plants and plant cells are obtained by introducing the expression vectors into plants and plant cells using methods known in the art. The present invention is suitable for any member of the monocotyledonous (monocot) plant family including, but not limited to, maize, rice, barley, oats, wheat, sorghum, rye, sugarcane, pineapple, yams, onion, banana, coconut, dates and hops. The present invention is also suitable for any member of the dicotyledonous (dicot) plant family including, but not limited to, tobacco, tomato, soybean, and papaya.
In one embodiment, the expression vectors are introduced into plant cells by particle mediated gene transfer. Particle mediated gene transfer methods are known in the art, are commercially available, and include, but are not limited to, the gas driven gene delivery instrument descried in McCabe, U.S. Patent No. 5,584,807, the entire contents of which are herein incorporated by reference. This method involves coating the nucleic acid sequence of interest onto heavy metal particles, and accelerating the coated particles under the pressure of compressed gas for delivery to the target tissue.
Other particle bombardment methods are also available for the introduction of heterologous nucleic acid sequences into plant cells. Generally, these methods involve depositing the nucleic acid sequence of interest upon the surface of small, dense particles of a material such as gold, platinum, or tungsten. The coated particles are themselves then coated onto either a rigid surface, such as a metal plate, or onto a carrier sheet made of a fragile material such as mylar. The coated sheet is then accelerated toward the target biological tissue. The use of the flat sheet generates a uniform spread of accelerated particles which maximizes the number of cells receiving particles under uniform conditions, resulting in the introduction of the nucleic acid sample into the target tissue.
Alternatively, an expression vector may be inserted into the genome of plant cells by infecting the cells with a bacterium, including but not limited to an Agrobacterium strain previously transformed with the nucleic acid sequence of interest. Since most dicotyledonous plant are natural hosts for Agrobacterium, almost every dicotyledonous plant may be transformed by Agrobacterium in vitro. Although monocotyledonous plants, and in particular, cereals and grasses, are not natural hosts to Agrobacterium, work to transform them using Agrobacterium has also been carried out (Hooykas-Van Slogteren et al., (1984) Nature 311:763-764). Plant genera that may be transformed by Agrobacterium include Chrysanthemum, Dianthus, Gerbera, Euphorbia. Pelaronium, Ipomoea, Passiflora.
Cyclamen, Malus, Prunus. Rosa, Rubus, Populus, Santalum, Allium, Lilium, Narcissus, Ananas. Arachis, Phaseolus and Pisum.
For transformation with Agrobacterium, disarmed Agrobacterium cells are transformed with recombinant Ti plasmids of Agrobacterium tumefaciens or Ri plasmids of Agrobucterium rl7izogenes (such as those described in U.S. Patent No. 4,940,838, the entire contents of which are herein incorporated by reference) which are constructed to contain the nucleic acid sequence of interest using methods well known in the art [J. Sambrook et al. ( 1989) Molecular Clonin~~: A Laboratory Manual, Cold Spring Harbor Press, NY]. The nucleic acid sequence of interest is then stably integrated into the plant genome by infection with the transformed Agrobucterium strain. For example, heterologous nucleic acid sequences have been introduced into plant tissues using the natural DNA transfer system of Agrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et al.
( 1987) Ann.
Rev. Plant Phys. 38:467-48b).
Construction of recombinant Ti and Ri plasmids in general follows methods typically used with the more common bacterial vectors, such as pBR322. Additional use can be made of accessory genetic elements sometimes found with the native plasmids and sometimes constructed from foreign sequences. These may include but are not limited to structural genes for antibiotic resistance as selection genes.
There are two systems of recombinant Ti and Ri plasmid vector systems now in use.
The first system is called the "cointegrate" system. In this system, the shuttle vector containing the gene of interest is inserted by genetic recombination into a non-oncogenic Ti plasmid that contains both the cis-acting and traps-acting elements required for plant transformation as, for example, in the pMLJI shuttle vector and the non-oncogenic Ti plasmid pGV3850. The second system is called the "binary" system in which two plasmids are used;
the gene of interest is inserted into a shuttle vector containing the cis-acting elements required for plant transformation. The other necessary functions are provided in traps by the non-oncogenic Ti plasmid as exemplified by the pBINl9 shuttle vector and the non-oncogenic Ti plasmid PAL4404. Some of these vectors are commercially available.
There are three common methods to transform plant cells with Agrobacterium:
The first method is by co-cultivation of Agrobacterium with cultured isolated protoplasts. This method requires an established culture system that allows culturing protoplasts and plant - regeneration from cultured protoplasts. The second method is by transformation of cells or tissues with Agrobacterium. This method requires (a) that the plant cells or tissues can be transformed by Agrobacterium and (b) that the transformed cells or tissues can be induced to regenerate into whole plants. The third method is by transformation of seeds, apices or meristems with Agrobacterium. This method requires micropropagation.
One of skill in the art knows that the efficiency of transformation by Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency with Agrobacterium tumefaciens [Shahla et al. (1987) Plant Molec. Biol. 8:291-298].
Alternatively. transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. [see, e.g., Bidney et al.
(1992) Plant Molec. Biol. 18:301-313J.
It may be desirable to target the nucleic acid sequence of interest to a particular locus on the plant genome. Site-directed integration of the nucleic acid sequence of interest into ?0 the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium-derived sequences. Generally, plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA
(T-DNA) sequences, as previously described (Offxinga et al., (1996), U.S. Patent No.
5,501,967, the ?5 entire contents of which are herein incorporated by reference). One of skill in the art knows that homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part of the targeted plant gene, whether belonging to the regulatory elements of the gene. or the coding regions of the gene.
Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid ,0 sequence of regions flanking the site to be targeted is known.
Where homologous recombination is desired, the targeting vector used may be of the replacement-or insertion-type (Offringa et al. ( 1996), supra). Replacement-type vectors generally contain two regions which are homologous with the targeted genomic sequence and which flank a heterologous nucleic acid sequence, e.g., a selectable marker gene sequence.
Replacement-type vectors result in the insertion of the selectable marker gene which thereby disrupts the targeted gene. Insertion-type vectors contain a single region of homology with the targeted gene and result in the insertion of the entire targeting vector into the targeted gene.
Other methods are also available for the introduction of expression vectors into plant tissue, e.g., electroinjection (Nan et al. (1995) In "Biotechnology in Agriculture and Forestry."
Ed. Y.P.S. Bajaj. Springer-Verlag Berlin Heidelberg, Vol 34:145-155; Griesbach (1992) HortScience 27:620): fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies (Fraley er al. (1982) Proc. Natl. Acad. Sci. USA 79:1859-1863); polyethylene glycol (Krens c~/ crl. ( I 982) nature 296:72-74); chemicals that increase free DNA uptake;
transformation using virus, and the like.
3. Selection of Transgenic Plant Cells Plants, plant cells and tissues transformed with a heterologous nucleic acid sequence of interest are readily detected using methods known in the art including, but not limited to, restriction mapping of the genomic DNA, PCR-analysis, DNA-DNA hybridization, DNA-RNA hybridization, DNA sequence analysis and the like.
Additionally. selection of transformed plant cells may be accomplished using a selection marker gene. It is preferred, though not necessary, that a selection marker gene be used to select transformed plant cells. A selection marker gene may confer positive or negative selection.
A positive selection marker gene may be used in constructs for random integration and site-directed integration. Positive selection marker genes include antibiotic resistance genes, and herbicide resistance genes and the like. In one embodiment, the positive selection marker gene is the NPTII gene which confers resistance to geneticin (G418) or kanamycin. In another embodiment the positive selection marker gene is the HPT gene which confers resistance to hygromycin. The choice of the positive selection marker gene is not critical to the invention as long as it encodes a functional polypeptide product. Positive selection genes known in the art include, but are not limited to, the ALS gene (chlorsulphuron resistance), and the DHFR-gene (methothrexate resistance).
A negative selection marker gene may also be included in the constructs. The use of one or more negative selection marker genes in combination with a positive selection marker gene is preferred in constructs used for homologous recombination. Negative selection marker genes are generally placed outside the regions involved in the homologous recombination event. The negative selection marker gene serves to provide a disadvantage (preferably lethality) to cells that have integrated these genes into their genome in an expressible manner. Cells in which the targeting vectors for homologous recombination are S randomly integrated in the genome will be harmed or killed due to the presence of the negative selection marker gene. Where a positive selection marker gene is included in the construct. only those cells having the positive selection marker gene integrated in their genome will survive.
The choice of the negative selection marker gene is not critical to the invention as long as it encodes a functional polypeptide in the transformed plant cell. The negative selection gene may for instance be chosen from the aux-2 gene from the Ti-plasmid of Agrobacterium, the tk-gene from SV40, cytochrome P450 from Streptomyces griseolus, the Adh-gene from Maize or Arabidopsis, etc. Any gene encoding an enzyme capable of converting a substance which is otherwise harmless to plant cells into a substance which is harmful to plant cells may be used.
4. Regeneration of Transgenic Plants The present invention provides transgenic plants. The transgenic plants of the invention are not limited to plants in which each and every cell expresses the nucleic acid sequence of interest under the control of the sequences provided herein.
Included within the scope of this invention is any plant which contains at least one cell which expresses the nucleic acid sequence of interest (e.g., chimeric plants). It is preferred, though not necessary, that the transgenic plant express the nucleic acid sequence of interest in more than one cell, and more preferably in one or more tissue.
Once transgenic plant tissue which contains an expression vector has been obtained, transgenic plants may be regenerated from this transgenic plant tissue using methods known in the art. The term "regeneration" as used herein, means growing a whole plant from a plant cell. a group of plant cells, a plant part or a plant piece (e.g., from a protoplast, callus, protocorm-like body, or tissue part).
Species from the following examples of genera of plants may be regenerated from transformed protoplasts: Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, WO 99/46976 PCT/US99/059$5 Majorana, Ciohorium, Helianthus, Lactuca, Bromus, Asparagus. Antirrhinum, Hererocallis, Nemesia, Pelargonium. Panicum. Pennisetum, Ranunculus. Senecio. Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea. Triticum, Sorghum, and Datura.
For regeneration of transgenic plants from transgenic protoplasts, a suspension of transformed protoplasts or a petri plate containing transformed explants is first provided.
Callus tissue is formed and shoots may be induced from callus and subsequently rooted.
Alternatively, somatic embryo formation can be induced in the callus tissue.
These somatic embryos germinate as natural embryos to form plants. The culture media will generally contain various amino acids and plant hormones, such as auxin and cytokinins.
It is also advantageous to add glutamic acid and proline to the medium, especially for such species as corn and alfalfa. Efficient regeneration will depend on the medium, on the genotype, and on the history of the culture. These three variables may be empirically controlled to result in reproducible regeneration.
Plants may also be regenerated from cultured cells or tissues. Dicotyledonous plants which have been shown capable of regeneration from transformed individual cells to obtain transgenic whole plants include, for example, apple (Malus pumila), blackberry (Rubus), Blackberry/raspberry hybrid (Rubus), red raspberry (Rubus), carrot (Daucus carota), cauliflower (Brassica oleracea), celery (Apium graveolens), cucumber (Cucumis sativus), eggplant (Solanum melongena), lettuce (Lactuca sativa), potato (Solanum tuberosum), rape (Brassica napus), wild soybean (Glycine canescens), strawberry (Fragaria x ananassa), tomato (Lycopersicon esculentum), walnut (Juglans regia), melon (Cucumis melo), grape (Vitis vinifera), and mango (Mangifera indica). Monocotyledonous plants which have been shown capable of regeneration from transformed individual cells to obtain transgenic whole plants include, for example, rice (Oryza sativa), rye (Secale cereale), and maize.
In addition, regeneration of whole plants from cells (not necessarily transformed) has also been observed in: apricot (Prunus armeniaca), asparagus (Asparagus officinalis), banana (hybrid Musa), bean (Phaseolus vulgaris), cherry (hybrid Prunus), grape (Vitis vinifera), mango (Mangifera indica), melon (Cucumis melo), ochra (Abelmoschus esculentus), onion (hybrid Allium), orange (Citrus sinensis), papaya (Carrica papaya), peach (Prunus persica), plum (Prunus domestica), pear (Pyrus communis), pineapple (Ananas comosus), watermelon (Citrullus vulgaris), and wheat (Triticum aestivum).
The regenerated plants are transferred to standard soil conditions and cultivated in a conventional manner. After the expression vector is stably incorporated into regenerated transgenic plants, it can be transferred to other plants by vegetative propagation or by sexual crossing. For example, in vegetatively propagated crops, the mature transgenic plants are propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants. In seed propagated crops, the mature transgenic plants are self crossed to produce a homozygous inbred plant which is capable of passing the transgene to its progeny by Mendelian inheritance. The inbred plant produces seed containing the nucleic acid sequence of interest. These seeds can be grown to produce plants that would produce the selected phenotype. The inbred plants can also be used to develop new hybrids by crossing the inbred plant with another inbred plant to produce a hybrid.
Confirmation of the transgenic nature of the cells, tissues, and plants may be performed by PCR analysis, antibiotic or herbicide resistance, enzymatic analysis and/or Southern blots to verify transformation. Progeny of the regenerated plants may be obtained and analyzed to verify whether the transgenes are heritable. Heritability of the transgene is further confirmation of the stable transformation of the transgene in the plant.
EXPERIMENTAL
The following examples serve to illustrate certain preferred embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.
Isolation and Characterization of Sugarcane Polyubiquitin ubi4 And ubi9 Genes A. Plant materials The plant materials used for the cDNA library and for expression studies were sugarcane hybrid H65-7052 plants grown in the greenhouse of the Hawaii Agriculture Research Center, Aiea, Hawaii. The genomic library used to isolate the genomic clones was made from the related sugarcane hybrid H32-8560 [Albert et al., Plant Mol Biol 20, 663-71 (1992)]. Internodes were numbered consecutively down the culm, with number one defined as that internode subtending the youngest fully expanded leaf, as previously described [Moore P. H.: Anatomy and Morphology. In: Heinz DJ (ed) Sugarcane Improvement Through Breeding, pp. 85-142. Elsevier, Amsterdam (1987)].
Sequence comparisons of the 3' untranslated regions (UTR) of five polyubiquitin cDNA clones revealed significant differences, except for clones scubi241 and 511, which were identical in both S' and 3' UTR sequences. Despite these identical regions, these clones do not represent transcripts from the same gene, as scubi241 contained four copies of the polyubiquitin coding repeat, whereas scubi5l 1 contained five of these repeats (data not shown).
B. Plasmid DNA gel blots Polvubiquitin cDNA plasmid clones were digested with appropriate restriction enzymes and size fractionated by gel electrophoresis. DNA was transferred to Hvbond-N+
(Amersham) membranes by capillary transfer. Identical DNA gel blots of the five clones were hybridized to 3'UTR probes at high stringency. High stringency hybridization and stringency washes at 65°C were by the method of [Church et al., Proc.
Natl. Acad. Sci. USA
81, 1991-1995 (1984)). Probe templates were prepared by PCR amplification of the 3' UTR
of each cDNA clone. Probes were 3zP labeled by the method of [Feinberg et al., Anal.
Biochem. l3?. 6-13 (1983)). Blots were exposed to Kodak X-Omat RP XRP-S film for five to 10 min at room temperature.
C. RNA extraction Total RNA was extracted from 10 g of each tissue sample by the method of [Bugos et al., Biotechniques 19, 734-737 (1995)]. RNA concentration was determined by spectrophotometer. Poly-A+ RNA was isolated from total RNA using the PolyATract System (Promega).
D. RNA gel blot analysis Fifteen ~.g of each RNA sample was separated by denaturing gel electrophoresis as described by [Fourney et al., Focus 10, 5-7 (1988)). Hybridization and stringency washes at 65°C were by the simplified northern blot method of [Virca et al., BioTechniques 8, 370-371 (1990)). 3'-P incorporation into probes was monitored by two methods:
scintillation counting following Sephadex G-50 chromatography and TCA precipitation. Equal activity ( 1.7x 10' cpm/ml) of ''-P-labeled gene-specific probe was used for each hybridization.
Autoradiography was with Kodak X-Omat RP XRP-5 film that was preflashed to an OD54~ of 0.15 to maximize linearity of response, exposed at -80°C.
Four identical RNA gel blots, each containing 1 S p.g total RNA from mature leaves, immature leaves, stem apices, internode 1, internode 2, internode 3, roots, and callus cultures at control (26°C) and heat shock (37°C) temperatures, were hybridized to each of the four gene specific 3' UTR probes, respectively, and exposed to film for 16 h as shown in Figure 1.
In Figure l, 15 pg total RNA from each indicated tissue was hybridized to equal activities of each gene-specific probe. Ethidium bromide stained gel indicates equal loading of RNA from each tissue. M.L., mature leaves; LL., immature leaves; S.A., shoot apex; I1, internode 1; I2, internode 2; I3, internode 3; R, roots; C, callus 26°C; C-HS, callus 37°C.
Autoradiogram for scubi221 probe was exposed for 72 h, all others for 16 h.
With this exposure time of 16 h, transcripts homologous to scubi221 were not detected except in callus tissue under 37°C heat-shock, where a single band was barely detectable. After 72 h exposure, the single band in heat-shock callus was clear, but no clear signal was evident for any other tissue. The scubi241/511 probe detected transcripts of two size classes (on the 16 h autoradiogram exposure shown in Figure 1 the two bands overlap, however on shorter exposures two bands can be resolved), with both transcripts present in approximately equal amounts in all tested tissues. Transcripts for this sub-family were present at the highest levels of all tested polyubiquitin genes; heat shock at 37°C did not induce a significant change in transcript accumulation. The scubi561 probe also detected two mRNA size classes; however, the levels of these transcripts were considerably lower than those detected with the 241/511 probe. Transcript pools hybridizing to the 561 probe were elevated significantly by 37°C
heat shock. The scubi5121 probe hybridized to a single size class that was moderately abundant, but significantly lower than the scubi241/S11 pool. The 5121 levels were very similar in all tissues except callus. Callus at control temperature (26°C) contained less scubi5121 mRNA than did other tissues; 37°C heat shock elevated transcript levels in callus to levels approximately equal to those in other tissues at 26°C. This pattern of reduced transcript levels in control temperature callus tissue could be seen to some extent for all probes except scubiS6l.
To confirm the northern analysis results, a "reverse northern" blot containing the 3' UTR DNA of all five polyubiquitin cDNA clones was hybridized to a 3zP-labeled, first strand cDNA probe made from approximately O.S ug mRNA extracted from immature leaves.
Fifty ng of each 3' UTR target DNA was loaded for the blot, ensuring that the target DNA would be present in excess and that hybridization signals should reflect relative abundance of the different mRNAs in the template mRNA population. Results of this experiment are shown in Figure 2.
WO 99!46976 PCT/US99/05985 In Figure 2, SO ng DNA from the gene-specific 3' UTR of each cDNA clone was hybridized to a total first strand cDNA probe from immature leaves. Results of this experiment confirmed the northern results, with scubi241/511 transcripts most abundant, scubi221 least abundant, and scubi561 and 5121 at intermediate levels (Figure 2). Because these two independent methods of estimating the relative mRNA abundance for the different polyubiquitin gene sub-families both indicated the scubi241/511 sub-family as highest, and because the difference between scubi241 /511 and the other sub-families was so large by both measures, this ranking of expression levels is accurate.
While expression of these sugarcane polyubiquitin genes was little affected by cell-type, several of them responded dramatically to an environmental stimulus:
heat stress.
mRNA pools homologous to scubi221, 561 and 5121 in sugarcane callus tissue subjected to 37°C heat shock all rose substantially, whereas the pool homologous to scubi241/511 did not show a substantial increase. Overall, scubi241 and scubi5l 1 most nearly fit the stereotype of "constitutive'' genes, with uniformly high levels of mRNA accumulation in all tested tissues and little induction of expression by heat stress.
Approximately 1x106 pfu from a sugarcane genomic library in the vector ~,EMBL4 were screened with a polyubiquitin coding sequence probe. Fifty polyubiquitin positive plaques were screened again and purified with the scubi241/S11 specific probe.
Five scubi241/511 positive plaques were isolated, and two of these, ~,ubi4 and ~,ubi9. were subcloned into plasmid vectors for further analysis and sequencing.
Subclone pubi4 sequence and organization were initially determined to be as shown in Figures 3A, 3B and 4. Tables 1-2 provide the meaning of sequence symbols other than the bases G, A, T, and C which indicates ambiguities where two or more bases are equally possible in the sequences shown in Figures 3A and 3B.
Table 1 Ambiguity codes for sugarcane polyubiquitin ubi-l gene sequences upstream of the translation start codon S Position Ambiguity Code Shown Actual Base in Figure 3A
243 n G, A, T, or C
245 n G, A, T, or C
790 n G, A, T, or C
1023 n G, A, T, or C
1089 n G, A, T, or C
1174 n G, A, T, or C
1656 n G, A, T, or C
Table 2 Ambiguity codes for sugarcane poiyubiquitin ubi=t gene downstream of the translation stop codon Position Ambiguity Code Shown Actual Base in Figure 3B
17 n G, A, T,orC
885 n G, A, T, or C
1055 s C or G
1059 y C or T
1463 n G, A, T, or C
1712 n G, A, T, or C
1769 r A or G
1810 w A or T
1964 y C or T
2495 n G, A, T, or C
On subsequent sequencing of subclone pubi4, the nucleotide sequence (SEQ ID
NO:S) and translated amino acid sequence (SEQ ID NO:b) were determined to be as shown in Figure ~. The letter "X" in Figure SB refers to any amino acid. The organization of the ubi,~
gene was determined as shown in Figure 6.
SEQ ID NO:S was cloned as two fragments into the plasmid vector pBluescript II
KS+ {Stratagene} to generate plasmids pubi4a and pubi4b. Plasmids pubi4a and pubi4b were introduced into the host Escherichia coli DHSalpha and the transformed Escherichia coli cells were deposited at the Agricultural Research Service Culture Collection (NRRL) under the terms of the Budapest Treaty on March 8, 1999 as NRRLB-30112 (containing pubi4a} and NRRLB-30114 (containing pubi4b). NRRLB-30112 contains the 2227 by EcoRIlSaII
fragment of SEQ ID NO:S, i.e., including the 1802 by 1802 by SEQ ID N0:7, plus the first coding repeat and part of the second repeat. NRRLB-30114 contains the 3329 by SaIIlEcoRI
fragment which includes the remainder of the coding region, the 3' UTR, and further downstream sequences.
An XbaI restriction site was added at the 3' end of the ubi4 promoter shown in SEQ
ID N0:7 by way of PCR amplification with an XbaI adapter primer. The polyubiquitin ubi4 promoter so modified was ligated upstream of a GUS coding sequence and a NOS
3' terminator in the vector plasmid pUCl9 to form pubi4-GUS. This plant expression plasmid was transformed into E. coli DHSa host cells and deposited with the NRRL under the terms of the Budapest Treaty on March 15, 1999 as NRRLB-30115.
Table 3 shows the ambiguity codes and the bases they represent in the ubi4 sequences shown in Figures 5 and 10.
Table 3 Ambiguity codes for sugarcane polyubiquitin ubi=t and ubi9 genes Ambiguity Code Actual Base a A
C
g G
t/u T
m AorC
r A or G
w A or T
s Core y C or T
k GorT
IS v A or C or G
h A or C or T
d AorGorT
b CorGorT
x/n GorAorTorC
not G or A or T or C
Subclone pubi4 (Figures 3, 4, 5 and 6) contained four copies of the polyubiquitin coding repeat, 238 by of 3' UTR, which is approximately 95% identical to the corresponding region of the scubi241/51 I cDNAs, a possible poly-A addition signal 21 S by 3' of the TAA
stop colon, and approximately 2.6 kb of further downstream sequence.
Immediately S' of the initiation colon was an introzZ of 1360 by within SEQ ID NO:S (intron of 1382 by within SEQ ID NO: I ) which was preceded by 65 by in SEQ ID NO:S (67 by in SEQ ID
NO:1 ) that were 98% identical to the 5' UTR sequence of scubi241/51 I. The transcription start site has not yet been determined, and it is not known if the scubi24l and scubi5l I
cDNA clones contain the entire 5' UTR; however, since both scubi241 and scubi5l I cDNA
clones started at the same nucleotide, this site can serve as a putative transcription start site for discussion purposes. The pubi4 subclone contained an additional 377 by upstream of the transcription start codon, with a TATA consensus at -30 by relative to the beginning of the cDNAs.
Approximately 320 by upstream of the transcription start codon were two 10 by sequences that showed homology to the heat stress promoter element (HSE) consensus sequence (aGAAnnTTCt) [Scharf et al.: Heat stress promoters and transcription factors.
In: Nover L
(ed) Plant Promoters and Transcription Factors, pp. 125-162. Springer-Verlag, Berlin (1994)].
The second of these 10 by sequences, however, lacked the G residue at position two that has been found to be invariant in HSEs [Scharf et al. (1994) supra].
Subclone pubi9 sequence and organization were initially determined to be as shown in Figures 7A, 7B and 4. Tables 4-5 provide the meaning of sequence symbols other than the bases G, A, T, and C which indicates ambiguities where two or more bases are equally possible in the sequences shown in Figures 7A and 7B.
Table 4 Ambiguity codes for sugarcane polyubiquitin ubi9 gene upstream of the translation codon Position Ambiguity Code Shown Actual Base in Figure 7A
9 n G, A, T, or C
3613 n G, A, T, or C
Table 5 Ambiguity codes for sugarcane polyubiquitin ubi9 gene downstream of the translation stop codon Position Ambiguity Code Shown Actual Base in Figure 7B
59 n G, A, T, or C
286 n G, A, T, or C
294 n G, A, T, or C
319 n G, A, T, or C
On subsequent sequencing of subclone pubi9, the nucleotide sequence (SEQ ID
N0:8) and translated amino acid sequence (SEQ ID N0:9) were determined to be as shown in Figure 8. The letter "X" in Figure 8B refers to any amino acid. The organization of the ubi9 gene was determined as shown in Figure 6. Table 3, .supra, shows the ambiguity codes and the bases they represent in the ubi9 gene sequences shown in Figures 8 and 11.
An approximately 7.2 kb HindIlI-EcoRI fragment, which contains SEQ ID N0:8 plus approximately 2kb additional downstream sequence, was cloned in the plasmid vector pBluescript II KS+ (Stratagene) to form plasmid pubi9. This plasmid was transformed into E.
coli DHSa host cells and deposited with the Agriculture Research Service culture collection (NRRL). under the terms of the Budapest Treaty on March 8, 1999 as accession number NRRLB-30113.
An Xbal restriction site was added at the 3' end of the ubi9 promoter shown in SEQ
ID NO:10 by w-ay of PCR amplification with an XbaI adapter primer. The ubi9 promoter so modified was ligated upstream of a GUS coding sequence and a NOS 3' terminator in the vector plasmid pUC 19 to form pubi9-GUS. This plant expression plasmid was transformed into E. coli DHSa host cells and deposited with the NRRL under the terms of the Budapest Treaty on March 15, 1999 as accession number NRRLB-30116.
Subclone pubi9 (Figures 4, 6, 7 and 8) contained five copies of the polyubiquitin coding repeat. 244 by of 3' UTR, which were 98% identical to the corresponding region of scubi241/51 1 and 95% identical to the corresponding region of pubi4. A
possible poly-A
addition signal was present 221 by down stream of the TAA stop codon, and there was approximately 2 kb additional downstream sequence. As with the uhi4 gene, an intron was located immediately 5' of the initiation codon; this intron was 1374 bp. 5' of this intron were 65 by (in SEQ ID N0:8) and 67 by (in SEQ ID N0:3) that was 97% identical to both the 5' UTR of the scubi241/511 cDNA clones and the corresponding region of the ubi4 gene.
The subclone contained an additional 2247 by of upstream sequence, including a TATA
consensus sequence at -30 by relative to the beginning of the cDNA clones.
Upstream of the 5'UTR, a 577 by region of ubi9 from position 1671 to 2248 of SEQ ID NO:10 was highly homologous (>90% identity) to the corresponding region of ubi4 (positions 1 to 377 of SEQ
ID N0:5). Partial sequence from an additional subclone of ~,ubi4 indicated that this high degree of homology continues at least as far as 2kb upstream of the transcribed region of the genes (unpublished data). Within this highly homologous region, approximately 344 by upstream of transcription start codon, is an apparent insertion of approximately 200 by not present in the sugarcane ubi4 promoter. This 200 by region was delimited by 17 by imperfect inverted repeats. A 202 by region 82% identical to this insertion is also found in the 3' UTR of a sugarcane glucose transporter cDNA clone SGT1 [Bugos et al., Plant Physiol. 1 U3, 1469-1470 ( 1993)]. The nature of this possible insertion event has not been investigated; however, it has features of miniature inverted-repeat transposable elements (MITEs) [Wessler et ul., Curr Opin Genet Dev 5, 814-21 ( 1995)]. Without limiting the invention to any particular mechanism. this insertion is not believed to have a functional role in the promoter activity of the polyubiquitin ubi9 promoter since this insertion is inserted in the 3'UTR (not the promoter) of the glucose transporter gene, and since it is not present in the polyubiquitin ubi=t gene. Like the ubi4 gene, the ubi9 gene also contained two HSE-like sequences about 320 by upstream of the transcription start codon; however, both of these HSE-like sequences lacked the invariant G residue.
A comparison of the sequences upstream of the translation start codon of the sugarcane ubi.~ gene sequence with the maize polyubiquitin promoter (GenBank accession number 594464; Quail et al., U.S. Patent Numbers 5,614,399 and 5,510,474) showed only 51 % homology when comparing a fragment containing the sequence upstream of the 5' UTR, the 5' UTR sequence and the intron sequence, only 64% homology when comparing a fragment containing the sequence upstream of the transcription start codon, only 65%
homology when comparing a fragment containing the 5'UTR sequence, and only 58%
homology when comparing a fragment containing the intron sequence.
A comparison of the sequences upstream of the translation start codon of the sugarcane ubi9 gene sequence with the maize polyubiquitin promoter (GenBank accession number 594464; Quail et al., U.S. Patent Numbers 5,614,399 and 5,510,474) showed only 61 % homology when comparing a fragment containing the sequence upstream of the 5' UTR, the 5' UTR sequence and the intron sequence, only 63% homology when comparing a fragment containing the sequence upstream of the transcription start codon, only 66%
homology when comparing a fragment containing the S'UTR sequence, and only 59%
homology when comparing a fragment containing the intron sequence.
Some heat-shock-inducible polyubiquitin promoters have been found to contain HSEs (Binet, et al., 1991, supra; Christensen et al. ( 1992) supra]. Both the ubi4 and ubi9 genes contained two short sequence elements that have some homology to HSEs;
however, three out of four of these elements lacked the G residue that has been found to be invariant at position 2 of the HSE (aGAAnnTTCt) (Scharf et al. (1994) supra]. Given the very marginal induction (approximately 2-fold or less) of the ubi~ and ubi9 genes when compared to other sugarcane polyubiquitin genes, it is doubtful that these HSE-like elements play a role in regulating gene expression. Similar observations have been made in connection with the tobacco polyubiqutin promoter, Ubi.U4; while this promoter also contains "...two degenerated heat shock-like elements..., " removal of these elements had no significant effect on gene expression (Plesse, et al. (1997) supra].
Both the ubi=l and ubi9 genes contained a large intron immediately upstream of the protein coding region preceded by an approximately 65 by fragment which was highly homologous to the 5' UTR of scubi241/511. An intron at this position (i.e., immediately upstream of the initiation codon) has been found in many other plant polyubiquitin genes (Binet, et al., 1991, supra; Christensen et al. ( 1992) supra; Garbarino et al. ( 1995) supra;
Morris et al., Plant Mol Biol 21, 895-906 (1993)], and in some cases been shown important for high levels of expression (Morris et al. (1993) supra; Garbarino et al.
(1995) supra]. By analogy, it is the inventors' belief that the intron of the ubi4 and ubi9 genes may also play a role in regulating gene expression.
The eDNA clones scubi241 and scubi5l 1 contain three and five copies of the polyubiquitin coding sequence, respectively. Genomic clones pubi4 and pubi9 contain four and five copies of the polyubiquitin coding sequence, respectively. Without limiting the invention to any particular mechanism, this may mean that the scubi241/S11 sub-family :ZO contains at least three different genes, containing three (i.e., scubi 241/511), four (i.e., pubi4) and five (i.e., pubi9) ubiquitin repeats. Alternatively, it is possible that the difference in the number of copies of the polyubiquitin coding sequence reflects a difference between the cultivars, with the scubi241/511 sub-family in H65-7052 containing genes with three and five ubiquitin repeats, while the same sub-family in H32-8560 contains genes with four and five ubiquitin repeats.
Construction of Plasmids pubi4-GUS, pubi9-GUS, 4PI-GUS and 9PI-GUS Comprising Sugarcane Polyubiquitin Promoters And an Exemplary Structural Gene Reporter plasmids were made placing the uid A gene encoding (3-glucuronidase (GUS) [Jefferson et al., Proc. Natl. Acad. Sci. USA 83, 8447-8451 ( 1986)] and the nopaline synthase (NOS) terminator under the control of the sugarcane polyubiquitin promoter disclosed herein.
The promoter sequence in plasmid pubi4-GUS contained nucleotides 1-1810 of SEQ
ID NO:1 (i.c~.. nucleotides 1-1802 of SEQ ID NO:S) and and XbaI site (TCTAGA) added immediately after by 1802, by way of an adapter on a PCR primer. The promoter sequence in plasmid pubi9-GUS contained nucleotides 1-3688 of SEQ ID N0:3 (i.e.. nucleotides 1-3688 of SEQ
ID N0:8) and and XbaI site (TCTAGA) added immediately after by 3688, by way of an adapter on a PCR primer. The Expand~~"' PCR system (Boehringer Mannheim) was used to amplify part of the 5' UTR and the intron including the 3' splice site; this PCR product was used to join the sequence upstream of the 5' UTR, the 5' UTR, and intron to the GUS gene using a unique NruI site in the S' UTR and an XbaI site added as an adapter to the 3' PCR
primer.
pHA9 contained the maize ubi 1 promoter driving a neomycin phosphotransferase II
(NPTII) gene and NOS terminator. It was made by removing the luc gene from pAHCl8 described in [Christensen et al., Transgenic Research S, 213-218 ( 1996)] as BamHI fragment and replacing it with an 844 by BamHI fragment containing the NPTII gene.
35S-GUS contained the uid A gene encoding GUS under the control of the cauliflower mosaic virus (CaMV) 35S RNA promoter sequence (Clonfech).
Binary plasmids 9PI-GUS, 4PI-GUS and MPI-GUS for Agrobacterium transformation were made by ligating the promoter-intron-GUS-NOS cassettes from pubi9-GUS, pubi4-GUS, and pAHC27 [Christensen et al. ( 1996) supra] respectively, as HindIII - EcoRI
fragments, into the HindIII - EcoRI sites of pCAMBIA1300 [Roberts et al., Rockefeller Foundation Meeting of the International Program on Rice Biotechnology, Malacca, Malaysia (1997)].
Binary plasmid pHW537 contained a putative 5' nuclear matrix attachment region (MAR) from ~,ubi4, ubi9 promoter and intron, GUS, and 3' terminator and putative 3' MAR from ~,ubi4 as HindiII - EcoRI fragment in the HindIII - EcoRI sites of pCAMBIA1300.
Transient Expression of pubi4-GUS and pubi9-GUS in Sugarcane Suspension cultured Cells Sugarcane suspension cell cultures (variety H50-7209) were maintained as described by [Nickell et al., Physiol. Plant. 22, 117-125 (1969)]. DNA reporter plasmids (pubi4-GUS, pubi9-GUS.or pAHC27) were introduced into sugarcane suspension culture cells by particle bombardment as previously described [Klein et al., Nature 327, 70-73 ( 1987)]
using a PDS 1000 Biolistic particle accelerator (BioRad) at 1100 psi. Controls were not bombarded -SO-with DNA.. Transient assays were carried out in a randomized complete block design with four treatments (promoters) and six replications. Each replication consisted of bombardment of five samples. Two days after bombardment, plant material was assayed for GUS
expression. Each sample was divided into two equal parts, one for histochemical analysis and one for chemiluminescent measure of GUS enzyme activity using the GUS-Light kit (Tropix) and an MLX plate reader luminometer (DYNEX). GUS enzyme activity assays were performed according to the manufacturer's protocol, with 60 minutes incubation in GUS
reaction buffer before chemiIuminescence was measured. GUS activity was expressed as relative light units (RLU) per nanogram total protein [Bradford, Anal.
Biochem. 72:248-254 (1976)]. From each experiment, the highest and lowest values were discarded for each plasmid. Analysis of variance was performed on the data from each set of experiments; those experiments which showed a significant effect (P50.05) from promoter treatments were further analyzed by a least significant difference test {P<_0.05) to identify which promoters produced significantly different results.
The results of histochemical staining and chemiluminescent GUS activity assays of sugarcane suspension culture cells which had been bombarded with the reporter plasmids are shown in Figure 9. Figure 9A shows that the average number of blue foci detected after bombardment with GUS expression plasmids containing the ubi9 promoter was higher than observed for either the ubi4 or maize polyubiquitin ubi 1 [Christensen et al.
( 1996), supra]
promoters. Because of high levels of variability, it could not be determined from histochemical staining of these transient expression experiments whether the results seen in sugarcane callus are in fact significantly different. However, using a chemiluminescent assay to measure GUS activity again indicated the average level of expression was higher for the ubi9 promoter than for the sugarcane ubi4 or maize ubi 1 promoters (Figure 9B). Statistical analysis indicated that the difference, as measured by this assay, was significant at P<_0.05.
These data demonstrated that both the ubi4 and ubi9 promoters in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct transient expression in monocotyledonous sugarcane suspension cells.
Transient Expression of pubi4-GUS and pubi9-GUS in Tobacco leaves Tobacco cultivar Wisconsin 38 was grown in Magenta boxes on MSNT medium [ 1 X
S Murashige and Skoog salts (GIBCO BRL # 11117-074), 1 X minimal organics (GIBCO BRL
# I I I 18-023 ). 30g/1 sucrose, 0.8% agar] at 26°C under a I b h light regime. DNA reporter plasmids (pubi4-GUS, pubi9-GUS, or pAHC27) were introduced into tobacco leaves by particle bombardment as previously described [Klein et al., (1987) supra]
using a PDSI000 Biolistic particle accelerator (BioRad) at 650 psi. Controls were not bombarded with DNA.
Transient assays were carried out in a randomized complete block design with four treatments (promoters) and six replications. Each replication consisted of bombardment of five samples.
Two days after bombardment, plant material was assayed for GUS expression.
Each sample was divided into two equal parts, one for histochemical analysis and one for chemiluminescent measure of GUS enzyme activity as described supra (Example 3).
The results of histochemical staining and chemiluminescent GUS activity assays of tobacco leaves which had been bombarded with the reporter plasmids are shown in Figure 9.
The results show that average GUS expression was higher for the sugarcane ubi4 promoter than for either the maize polyubiquitin promoter or the sugarcane ubi9 promoter when using either the histochemical (Figure 9C) or chemiluminescent (Figure 9D) assays.
Analysis of the :ZO chemiluminescent data indicates that the difference between the sugarcane ubi4 and maize polyubiquitin promoters was statistically significant at P<_0.05.
These data demonstrated that both the ubi4 and ubi9 promoters in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct transient expression in dicotyledonous tobacco leaves.
:ZS
Transient Expression of pubi4-GUS and pubi9-GUS in Sorghum Callus :30 Sorghum immature embryo derived callus was cultured and bombarded as previously described with the reporter plasmid ubi4-GUS or ubi9-GUS (prepared as described in Example 2) and with several reporter plasmids in which the uid A gene encoding GUS was placed under the control of each of several promoters including maize adh, 355, 35S:35S, rice actin, and maize ubi 1 as described below.
A. Culture media N6 maintenance medium [Macro elements (mg/1 final concentration), 2830 KN03, 1650 (NH4),SO~, 166 CaCI,-2H,0, 185 MgS04-7H,0, 400 KH,POa; Micro elements (mg/1 final concentration), 37.3 Na, EDTA, 27.8 FeSO4-7H~0, 1.6 H;BO; 0.76 Kl, 3.3 MnS04, 1.5 ZnS04-7H,0; Carbohydrates (g/1 final concentration), 20 Sucrose; Hormones (mg/1 final concentration), 1.0 2,4-Dichlorophenoxyacetic acid; Vitamins (mg/1 final concentration), 0.5 Thiamine-HCI, 0.25 Pyridoxine-HCI, 0.25 Nicotinic Acid; Amino Acids (mg/1 final concentration), 2875 L-Proline, 2.0 Glycine, 100 Casamino Acids; and Agar (g/1 final concentration), 2.5 Phytagel] was used.
B. Plant material Highly embryogenic callus tissue derived from sorghum plants (Sorghum bicolor L.
Moench, cv BWheatland 399) was used. To establish callus cultures, caryopses 10 to 18 d post-anthesis were surface-sterilized with 70% ethanol for 5 min and 20%
Clorox bleach for 15 min, followed by two changes of sterile distilled water. Immature embryos, 1.0 to 1.5 mm long, were aseptically removed using a sterilized 11 cm forceps in a laminar flow hood under a stereo dissecting microscope. The embryos were placed with the scutella exposed on N6 medium modified for sorghum cell culture and solidified with 2.5 gll Phytagel.
C. Microprojectile bombardment preparation Prior to bombardment, 1 mm gold particles were coated with transforming DNA by the procedure of Daines ( 1990). A stock suspension of gold particles was suspended at 60 mg/ml in absolute ethanol. Thirty-five microliters of the suspension was transferred into a 1.5 ml microcentrifuge tube, centrifuged at 14,000 g for 3 min, and the pellet was suspended in 200 pl of sterile distilled water. Following a second centrifugation, the pellet was suspended in 25 ml of Tris-EDTA containing 25 mg of the transforming plasmid DNA. The following chilled sterile solutions were added in order: 200 ml of water, 250 ml of 2.5 M
CaCI,, and 50 ml of 0.1 M Spermidine (0.2 pm filter-sterilized). The microcentrifuge tubes were shaken with a Tomy microtube shaker at 4°C for 1 S min and centrifuged at 16,000 g for min. The supernatant was removed, the pellet washed with 200 ml of ethanol and the DNA-coated gold particles suspended in 36 ml of ethanol.
D. Target tissue establishment and bombardment 5 Immature embryos were removed from sorghum caryopses and cultured on N6 maintenance medium for 7 d. If the immature embryos are less than 0.5 mm they may die in culture and if they are larger than 1.5 mm they may precociously germinate instead of initiating into callus tissue. Four hours prior to bombardment, approximately 50 embryo-derived calli were placed in a circle (4 cm diameter) in the center of a Petri dish ( 1 ~ x 100 mm) containing 0.2 M mannitol and 0.2 M sorbitol in N6 maintenance medium solidified with 2.~ g Phytagel. The Petri dish containing the target callus tissue was placed in the biolistic device and 10 ml of the DNA-gold suspension pipetted onto the center of a macroproiectile. The distance between the stopping plate and the target callus tissue was adjusted to 13 cm. The tissue was bombarded under vacuum with the rupture disk strength at 1100 p.s.i.
Callus tissue was sampled one day post bombardment using the GUS histochemical assay. Approximately twenty (20) replicates were performed for each promoter.
The average number of blue foci per bombarded plate was determined for each reporter plasmid using a stereo microscope.
The ubi4 nucleotide sequence was sufficient to direct transient expression in monocotyledonous sorghum. Indeed, the ubi4 nucleotide sequences resulted in significantly higher levels of expression of GUS as compared to the levels of expression driven by each of the other tested promoters (data not shown). These results demonstrate that the ubi4 promoter in pubi4-GUS was sufficient to direct transient expression in monocotyledonous sorghum callus.
Transient Expression of pubi9-GUS in Pineapple Leaves, Protocorm-Like Bodies, Roots and Fruit Pineapple cultivar F153 leaves, protocorm-like bodies (plbs), roots and fruit were bombarded with a reporter plasmid [pAHC27, pubi9-GUS or 35S-GUS] described supra (Example 2). Target tissue (leaves, plbs, roots and fruit) was plated in the center (2.5 cm diameter) of petri plates in modified MS medium supplemented with 0.8% Difco Bacto agar and 3% sucrose. Bombardments were performed with a Bio-Rad helium gas-driven microprojectile accelerator (PDS-1000/He, Bio-Rad, Hercules, CA) with I 100 psi rupture discs. Gold microcarriers ( 1.6-um-diameter, Bio-Rad) were coated with DNA
using the CaCl2 S precipitation method following the manufacturer's directions. Two ug of each DNA construct were used for each shot.
Histochemical GUS staining was performed 48 hours following bombardment to determine transient transformion. Five (5) or three (3) replicates were performed for each promoter in each type of bombarded tissue. The number of blue foci/plate of each bombarded tissue is shown in Table 4.
Table 4 Number of Blue Foci Per Plate In F153 Pineapple Tissues Plasmid pAHC27 ubi9-GUS 35S-GUS
Leaves ll.Ot7.0~ 244.022.0 -Plbs 142.045.2 150.352.3 -Roots 3.210.9 7.42.2 1.60.8 Fruit 8.76.4 I 1.03.7 14.28.1 '' Standard error.
The above data demonstrate successful transient expression of GUS under the control of the sugarcane polyubiquitin ubi9 promoter in pubi9-GUS in each of the four tissues of monocotyledonous pineapple.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Sugarcane Callus Sugarcane callus cultures were initiated from stem apices (variety H62-4671 ) by surface sterilizing the plant material with 70% ethanol, cutting two mm transverse slices and growing on MS2 plates [ 1 X Murashige and Skoog salts (GIBCO BRL # 1111 ?-074)], 1 x WO 99!46976 PCT/US99/05985 minimal organics (GIBCO BRL#11118-023), 2 mg.L 2,4-D, 0.7% agar] under a 16 h light regime. After one to two months, callus was transferred to MS 1 plates [ 1 X
Murashige and Skoog salts (GIBCO BRL #11117-074), 1X minimal organics (GIBCO BRL #11118-023). 1 mg/L 2,4-D. 0.7% agar] and subcultured monthly.
Sugarcane callus was co-bombarded with a reporter plasmid (pubi4-GUS, pubi9-GUS.
or pAHC27) and the selection plasmid pHA9 which contained the maize ubi 1 promoter driving a neomycin phosphotransferase II (NPTiI) gene (prepared as described supra in Example 2). Bombardment was with one micron gold particles and 1 SSO psi rupture discs.
After bombardment. callus was kept on MS1 plates without selection for two weeks. After his recovery period, calli were transferred to MS 1 plates with SO mg/L 6418 (Agri-bio) for one month, then transferred to MS1 plates with 100 mg/L 6418 for 2-3 months.
Calli which survived this selection were transferred to MS1 plates with 60 mg/L 6418 for multiplication.
Small calli totaling about 50 were randomly chosen from selected lines. These calli were assayed for GUS activity using the GUS-Light kit (Tropix) according tot the manufacturer's I S protocol, with 30 minutes incubation in GUS reaction buffer before chemiluminescence was measured. Two to ten 50 mg samples were assayed from each line, with three chemiluminescence assays for each sample.
The results of the GUS chemiluminescent activity assays are shown in Figure 12. In ten selected independent transgenic sugarcane callus lines, GUS expression from the sugarcane ubi9 promoter averaged 535.4 RLU/ng protein/30 min (Figures 12A, 12D). Seven selected stable transgenic lines expressing GUS under the control of the sugarcane ubi4 promoter averaged 209.3 RLU/ng protein/30 min (Figures 7B, & 7D), and seven lines expression GUS under the control of the maize ubi I promoter averaged 34$.2 RLU/ng protein/30 mon (Figures 7C, &D).
These results demonstrate that both the ubi4 and ubi9 nucleotide sequences in pubi4-GUS and pubi9-GUS, respectively, were sufficient to direct stable expression in monocotyledonous sugarcane callus.
Stable Expression of 9PI-GUS in Transgenic Rice Callus Agrobacteriunr strain EHA105 [Hood et al. J. Bacteriol. 168:1291-1301 (1986)]
was used to transform rice callus. Reporter plasmids (9PI-GUS, 4PI-GUS, MPI-GUS, or WO 99/46976 PCT/US99t05985 pHW537. which were prepared as described in Example 2, supra) were introduced into Agrobacteriurn strain EHA105 by a standard procedure.
Rice callus was induced from scutellum tissue of rice (cv. Taipei 309) and transformed by A~,>robcrcterium co-cultivation as previously described [Hiei et al., ( 1994) supra]. After 2-3 months on selection medium containing 100 mg/1 hygromycin B (CalBiochem), small calli from selected lines were assayed for GUS activity by the chemiluminescence method described supra (Example 3).
PCR using one primer within the T-DNA and one outside of the T-DNA right border was used to confirm the absence of Agrobacterium contamination in tested callus lines using methods known in the art. Because rice transformation was by Agrobacterium.
there is the possibility that the Agrobacterium were not killed after transformation, and thus that the GUS expression seen is from the Agrobacterium, not from transgenic rice cells.
To test for this possibility, two PCRs were performed: one to test for the presence of the GUS gene, the second to test for the presence of vector plasmid sequences outside of the T-DNA. The Agrobacterium harbor a binary plasmid, one section of which contains the GUS
encoding gene under the control of sugarcane promoter sequences. This section of the vector plasmid is called the T-DNA, and only this part is ordinarily transferred to the plant genome. A
positive PCR for GUS indicates that the isolate plant DNA may be successfully amplified by PCR and the GUS gene is pressent (i.e., confirming the observation of GUS
activity). A
negative PCR for the vector plasmid outside of the T-DNA confirms that the entire plasmid, which is what is present in the Agrobacterium, is no longer present, i.e., that the T-DNA
(which contains the GUS sequences under the control of the sugarcane promoter sequences) is successfully integrated into the plant genome.
One, seven, four, and two stably transformed transgenic rice callus lines were selected following co-cultivation of rice callus with Agrobacterium which had been transformed with the 4PI-GUS, 9PI-GUS, pHW537, and MPI-GUS reporter plasmids, respectively. GUS
expression in six transgenic lines transformed with the ubi9 promoter sequence is shown in Figure 13. GUS expression in six transgenic lines transformed with the ubi9 promoter sequence averaged 681.0 RLU/ng protein/30 min.
Four stably transformed transgenic rice callus lines were selected following co-cultivation of rice callus with Agrobacterium which had been transformed with the pHW537 reporter plasmid. GUS expression in these transgenic lines averaged 567.7 RLU/ng protein/30 min. Since the pHW537 reporter plasmid differed from the 9PI-GUS
plasmid in additionally containing the putative 5' and 3' flanking nuclear matrix attachment regions (MARs). and since both the 9PI-GUS and pHW537 reporter plasmids successfully resulted in expression of comparable levels of GUS in rice callus, these results demonstrate that the putative 5' and 3' flanking nuclear MARS are not necessary for promoter activity of the pubi9 sequences.
The results also suggest that either the polyubiquitin ubi4 or ubi9 promoter (with or without putative MARs) drives GUS expression at levels comparable to the maize ubi 1 promoter in stable transgenic rice callus.
The above results demonstrate that the ubi9 promoter in 9PI-GUS, and pHW537 was sufficient to direct stable expression in monocotyledonous rice callus.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Tobacco leaves Agrobacterium mmefacien.s was used to transform tobacco leaves. Reporter plasmids 1 S (9PI-GUS, 4PI-GUS, MPI-GUS, or 355-GUS, which were prepared as described in Example 2, supra), were introduced into leaf discs by an Agrobacterium mediated transformation procedure adapted from Horsch et al Science 227:1229-1231 (1985). Controls were untransformed.
Briefly, two ml overnight cultures of Agrobacterium tumefaciens were pelleted by brief centrifugation, decanted and resuspended in two ml of MSNTS liquid media (per liter of medium: one package MS salt mix [GIBCO BRL #11117-066], 30 g sucrose, 1.0 ml BS vitamins, 50 pl 2 mg/ml a-naphthaleneacetic acetic acid [GIBCO BRL #21570-015], and 50 pl 20 mg/1 benzyladenine [GIBCO BRL #16105-017]).
Aseptically grown tobacco leaves were harvested and placed in petri plates containing 20 ml MSNTS with 0.6 ml of the resuspended Agrobacterium and cut into approx. 1 cm squares with a sterile scalpel. After one to five minutes the leaf pieces were removed from the liquid, gently blotted on dry sterile paper, and placed on 0.8% agar MSNT
plates (per liter of medium: one package MS salt mix [GIBCO BRL #11117-066], 30 g sucrose, 1.0 ml 1000X BS vitamins, 8 g Bacto-Agar) with 500 pg/ml Cefotaxime 3U (PhytoTechnology Laboratories). After two days, the leaf pieces were transferred to 0.8%
agar MSNTS plates with 500 pg/ml Cefotaxime and 100 pg/ml Kanamycin (PhytoTechnology Laboratories). Shoots appeared after two to three weeks, at which time the shoots were transferred to Magenta boxes containing MSNT media with no antibiotics. To confirm Kanamycin resistance and to reduce the likelihood of obtaining chimeric plants, a "shooting assay" was used: leaves from putative transgenic plants were placed on 0.8% agar MSNTS plates containing 100 pg/ml Kanamycin. Shoots arising from these "shooting assays" were transferred to non-selective rooting media (MSNT) and grown in Magenta boxes.
Expression of GUS was determined as described in Example 3, supra. The results demonstrate that the both the ubi4 and ubi9 nucleotide sequences in 4PI-GUS
and 9PI-GUS, respectively. were sufficient to direct stable expression in dicotyledonous tobacco leaves.
Furthermore. The levels of GUS expression under the control of either the ubi4 and ubi9 nucleotide sequences in 4PI-GUS and 9PI-GUS were equal to or greater than the levels of GUS expression under the control of the maize ubil promoter. On the other hand. expression of GUS under the control of the CaMV 35 S promoter was greater than that under the control of either the sugarcane ubi4 or ubi9 promoters (data not shown).
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Maize Embryos and Regeneration of Transgenic Maize Plants Embryogenic corn cultures are initiated from immature maize embryos, bombarded simultaneously with a reporter plasmid (pubi4-GUS, pubi9-GUS, or mzubil-GUS) and a selection plasmid (pHA9), and stably transformed corn plants regenerated as previously described by Brown et al., U.S. Patent Number 5,593,874, incorporated by reference.
Briefly, embryogenic corn cultures are initiated from immature maize embryos of the "Hi-Ir" genotype which had been cultured 18-33 days on N6 2-100-25-Ag medium modified to contain 2 mg/L 2,4-dichlorophenoxyacetic acid, 180 mg/L casein hydrolysate, 25 mm L-proline, 10 pM silver nitrate, pH5.8, solidified with 0.2% PhytagelTM
(Sigma). These embryogenic cultures are used as target tissue for transformation by particle gun bombardment.
A I:1 mixture of the reporter vector (pubi-4-GUS, pubi-GUS or mzubil-GUS) and selection plasmid (pHA9) is precipitated onto tungsten M10 particles by adding 12.5 pl of particles (25 mg/ml in SO% glycerol), 2.5 pl plasmid DNA (1 pgl~l), 12.5 pl 1M
calcium chloride, and 5 pl 0.1 M spermidine, and vortexing briefly. The particles are allowed to settle for 20 minutes, after which 12.5 pl of supernatant is removed and discarded.
Each sample of DNA-tungsten is sonicated briefly and 2.5 p.l is bombarded into the embryogenic cultures using a PDS-1000 Biolisitics particle gun (DuPont).
The bombarded tissue is transferred to fresh, nonselective medium the day after bombardment. Six days post-bombardment, the material is transferred to selective media containing 50 mg/L 6418 (Agri-bio). After 2-3 weeks, the cultures are transferred to fresh media which contains 200 mg/L 6418. The cultures are maintained on the 200 mg/L 6418 media, transferred at 2-3 week intervals,,until 6418-resistant calli could be distinguished.
6418-resistant calli are recovered from the embryogenic material. 6418-resistant lines are bulked up and assayed for GUS expression using a histochemical or chemiluminescent assay as described supra (Example 3).
Plants are regenerated from the 6418-resistant calli which express GUS
activity in a three step regeneration protocol. All regeneration is performed on 200 mg/L
6418. The first two steps are carded out in the dark at 28°C., and the final step under a 16:8 hour photoperiod, at about 25°C. Small green shoots that formed on Regeneration Medium 3 in 100X 25 mm Petri plates are transferred to Regeneration Medium 3 in 200X 25 mm PyrexTM or PhytatraysTM to permit further plantlet development and root formation. Upon formation of a sufficient root system, the plants are carefully removed from the medium, the root system washed under running water, and the plants placed into 2.5" pots containing Metromix 350 growing medium. The plants are maintained for several days in a high humidity environment, and then the humidity is gradually reduced to harden off the plants.
The plants are transplanted from the 2.5" pots to 6" pots and finally to 10"
pots during growth.
Corn plants regenerated from 6418-resistant embryogenic calli which express GUS
activity are tested for GUS expression using histochemical of chemiluminescent assays as described supra (Example 3). GUS expression (e.g., as determined by blue staining in the histochemical assay or by luminescence in the chemiluminescent assay) by one or more tissues of corn plants which are generated from calli that had been bombarded with pubi4 GUS or pubi9-GUS demonstrates that the ubi4 and ubi9 nucleotide sequences in pubi4-GUS
and pubi9-GUS, respectively, are sufficient to direct stable expression in regenerated monocotyledonous maize plants.
Stable Expression of 4PI-GUS and 9PI-GUS in Transgenic Tomato Plants A reporter plasmid (4PI-GUS or 9PI-GUS, prepared as described in Example 2) or a control plasmid (i.e~.. a plasmid which contains the uid A gene encoding GUS
and which lacks a promoter sequence) are transformed into Agrobacterium, and the transformed Agrobacter-ium is used to infect tomato plants as previously described in Theologis e~t al..
U.S. Patent Number 5,723,766, incorporated by reference.
Briefly, a reporter plasmid or control plasmid is introduced into Agrobacteriarm strain LBA4404 as follows: Agrobacterium tumefaciens LBA-4404 (2 ml) is grown overnight at 28°C. in LB broth, and this used to inoculate SO ml of LB broth to obtain the desired culture.
The inoculated medium is grown at 28°C. until the OD.boo is 0.5-1Ø
The cells are collected by centrifugation and the pellet is resuspended in 1 ml, 20 mM ice cold CaCI,.
To l 00 p l of the cell suspension. 1 ug of the plasmid is added, and the mixture is incubated on ice for 30 min before snap-freezing in liquid nitrogen. The cells are then thawed at 37°C. for 5 min and used to inoculate 1 ml LB. After 2 h growth at 28°C. with agitation, 100 ~1 of the culture are plated on LB+Kanamycinso medium; colonies are expected to appear in 2-3 days at 28°C. The cells are recultured by picking several colonies and streaking on LB+Kanamycin5o medium;
again, 3-4 colonies are picked from independent streaks and 5 ml cultures in LB+Kanamycinso medium are grown. Stationary phase cultures are used for transformion of tomato plants which are grown as described infra. Transformed Agrobacterium cells are frozen using 15% glycerol at -80°C. for later use.
To prepare host tomato plants, tomato seeds are sterilized using a protocol which consists of treatment with 70% ethanol for 2 min with mixing; followed by treatment with 10% sodium hypochlorite and 0.1 % SDS for 10 min with mixing, followed by treatment with 1 % sodium hypochlorite, 0.1 % SDS for 30 min with mixing, and washing with sterile water 3 times for 2 min per wash. For germination of the sterilized seeds, 0.8 g of the sterilized seeds are placed in a Seed Germination Medium and grown for 2 weeks at low light in a growth room. After two weeks, when the seeds had germinated, cotyledons are dissected from the seedlings by cutting off the cotyledon tips and then cutting off the stem. This process is conducted in a large petri dish containing S-10 ml of MSO medium.
Feeder plates are prepared from a tobacco cell suspension in liquid medium at 25°C.
prepared with shaking at 130-150 rpm. The suspension is transferred to fresh medium at 1:10 dilution every 3-5 days. 1 ml of rapidly dividing culture is placed on the feeder plate, overlaid with filter paper and placed in low light in a growth room. The feeder plates are supplemented with 10 ml Feeder Medium Transformed Agrobacteriunr cultures which contain the reporter or control plasmids are inoculated into 50 ml LB
containing kanamycin with a single colony of the strain. The culture is grown by shaking vigorously at 30°C. to saturation (OD>2.0 at 600 nm). The strain is chosen to come to full growth in less than 24 h.
The culture is then diluted 5 times and split into 50 mi portions in plastic tubes.
Cotyledons from two of the feeder plates are scraped into each tube and rocked gently for 10-30 min. The cotyledons are then removed from the bacterial culture onto sterile filter paper (abaxial side up) on a tobacco feeder plate and incubated for 48 h in low Iight in a growth room. The cotyledons are then transferred axial side up to callus inducing medium.
In the Callus inducing Medium, approximately four plates are used per magenta box, and the explants are crowded. The box is placed in a growth room for three weeks, and small masses of callus are expected to form at the surface of the cotyledons. The explants are transferred to fresh plates containing the callus inducing medium every three weeks. When the calli exceed 2 ml, they are transferred to plates containing shoot inducing medium. When the stem structure is evident, the shoots are dissected from the calli and the shoots are transferred to plates containing root inducing medium. After a vigorous root system is formed on the plants, the plantlets are transferred to soil by taking the plantlets from the plates, removing as much agar as possible and placing the plantlets in a high peat content soil in a small peat pot which fits into a magenta box with cover. When the seedling leaves reach the top of the box, the lid is loosened to uncover the box slowly over a period of 4-5 days. The plants are then transferred to a light cart and larger pots, and kept moist. Flowers of these regenerated plants are pollinated and tomatoes are developed.
Expression of GUS is measured in regenerated tomato plant tissues transformed with the reporter or control plasmids using a histochemical or chemiluminescent assays as described supra (Example 3). GUS expression by one or more tissues of tomato plants which are generated from tissue that had been transformed with 4PI-GUS or 9PI-GUS
demonstrates that the ubi4 and ubi9 nucleotide sequences in 4PI-GUS and 9PI-GUS, respectively, are sufficient to direct stable expression in regenerated dicotyledonous tomato plants.
Stable Expression of pubi4-GUS and pubi9-GUS in Transgenic Soybean Excised Embryonic Meristems And Regeneration Of Transgenic Soybean Plants Soybean explants are derived from excised meristems, bombarded simultaneously with a reporter plasmid (pubi4-GUS, pubi9-GUS, or mzubil-GUS) and a selection plasmid (pHA9), and stably transformed soybean plants are regenerated as previously described by Christou et al., U.S. Patent Number 5,015,580, incorporated by reference.
Briefly, soybean explants of cultivar Williams 82 are derived from meristems excised from the embryonic axes of immature seeds. Primary leaves are removed and the explant plated on a target plate containing 1 % water agar. The explants are transformed with a reporter plasmid or a selection plasmid loaded at 1.0-0.001 p.g/ml of beads.
The particle accelerator is charged at 13-16 kV. The carrier is loaded with 0.05-0.40 mg of loaded beads per square centimeter, with a preferred level of loading of 0.2 mg/em'-.
The bombarded explants are then plated in the dark on modified MS basal medium which has a high level (i.e., 13.3 ~M) of the cytokinin benzylaminopurine.
Following incubation of 1 to 2 weeks in the dark, the tissues are transferred onto the same basal medium at a lower (1.7 pM) level of cytokinin to promote shoot elongation. Shoots are harvested at 0.5 to 1 cm in height.
The success of the transformation protocol is verified by fixing transformed explants at each stage to assay for GUS activity as described supra (Example 3). Two days after DNA
particle injection, dozens of GUS active cells are expected to be detected in each explant. At 6 to 8 weeks, the plants are assayed for GUS activity in the shoot. Most plants are expected to be chimeric, having streaks of blue (i.e., GUS-expression cells) when assayed using the histochemical assay.
The transformed excised meristem tissue is used to regenerate fully mature, and sexually mature chimeric plants using methods known in the art such as those described in U.S. Patent No. 5,015,580 to Christou et al. (incorporated by reference). GUS
expression by one or more tissues of soybean plants which are regenerated from excised embryonic meristems that had been transformed with pubi4-GUS or pubi9-GUS demonstrates that the ubi4 and ubi9 nucleotide sequences in pubi4-GUS and pubi9-GUS, respectively, are sufficient to direct stable expression in regenerated dicotyledonous soybean plants.
From the above, it is clear that the invention provides promoter sequences which are capable of driving transgene expression in both monocotyledonous and dicotyledonous plant cells, and which are useful for generating transgenic plants with desirable agronomic characteristics.
WO 99/46976 PC'f/US99/05985 SEQUENCE LISTING
<110> Albert, Henrik H.
Wei, Hairong <120> PLANT PROMOTER SEQUENCES AND METHODS OF USE THEREOF
<130> UH-03648 <140> XX/XXX,XXX
<141> 1999-03-17 <160> 12 <170> PatentIn Ver. 2.0 <210> 1 <211> 1813 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 1 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acnanaccgc gacgcaaccg cctctcgccg ctgggcccca caccgctcgg tgccgtagcc 300 tcacgggact ctttctccct cctcccccgt tataaattgg cttcatcccc tccttgcctc 360 atccatccaa atcccagtcc ccaatcccat cccttcgtcg gagaaattca tcgaagcgaa 420 gcgaatcctc gcgatcctct caaggtactg cgagttttcg atccccctct cgacccctcg 480 tatgtttgtg tttgtcgtac gtttgattag gtatgctttc cctgtttgtg ttcgtcgtag 540 cgtttgatta ggtatgcttt ccctgttcgt gttcatcgta gtgtttgatt aggtcgtgtg 600 aggcgatggc ctgctcgcgt ccttcgatct gtagtcgatt tgcgggtcgt ggtgtagatc 660 tgcgggctgt gatgaagtta tttggtgtga tctgctcgcc tgattctgcg ggttggctcg 720 agtagatatg gatggttgga ccggttggtt cgtttaccgc gctagggttg ggctgggatg 780 atgttgcatn gcgccgttgc gcgtgatccc gcagcaggac ttgcgtttga ttgccagatc 840 tcgttacgat tatgtgattt ggtttggact tattagatct gtagcttctg cttatgttgc 900 cagatgcgcc tactgctcca tatgcctgat gataatccat aaatggcagt ggaaatcaac 960 tagttgattg cggagtcatg tatcagctac aggtgtaggg actagctaca ggtgtaggga 1020 ctngcgtcta attgtttggt ccttaactca tgtgcaatta tgcaatttag tttagatgtt 1080 tgttccaant catctaggct gtaaaaggga cactggttag attgctgttt aatcttttta 1140 gtagattata ttatattggt aacttattaa cccntattaa catgccataa cgtggattct 1200 gctcatgcct gatgataatc atagatcact gtggaattaa ttagttgatt gttgaatcat 1260 gtttcatgta cataccacgg cacaattgct tagttcctta acaaatgcaa attttactga 1320 tccatgtatg atttgcgtgg ttctctaatg tgaaatacta tagctacttg ttagtaagaa 1380 tcaggttcgt atgcttaatg ctgtatgtgc cttctgctca tgcctgatga taatcatata 1440 tcactggaat taattagttg atcgtttaat catatatcaa gtacatacca tggcacaatt 1500 tttagtcact taacccatgc agattgaact ggtccctgca tgttttgcta aattgttcta 1560 ttctgattag accatatatc aggtattttt ttttggtaat ggttctctta ttttaaatgc 1620 tatatagttc tggtacttgt tagaaagatc tggttncata gtttagttgc ctatccttcg 1680 aattaggatg ctgagcagct gatcctatag ctttgtttca tgtatcaatt cttttgtgtt 1740 caacagtcag tttttgttag attcattgta acttatgttc gcttactctt ctggtcctca 1800 atgcttgcag atg 1813 <210> 2 <211> 2804 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 2 taatcctggg ccatgancag ctgtccttcc aggttcacaa gtctggtgcc ttcttctgtc 60 cctccgatgg agattatctg catgtcgtgg tcgtgtcctg atcgaatcct cgttgaatcc 120 ctatgttttt cttcaagaaa tgtgagtcct atgtcagtct ggttgcgttt gtgaacattt 180 ctgctgctga gcagcacttt ggctggaact gtgcaatgaa ataaatggaa ccctggtttc 240 tggttatgtg tgtgttagct aatgtttttg aagtggaagc tctaatcttc tatcgcgttg 300 ctactacaat tctgcttgtg ttttgatggt tcttggtttc tgttagttgg ttcagaggaa 360 gttttgcttc cacagactaa gatgcagttg aactttggtt gccctggttt ctagatttca 420 tttgtgctgg ttgagtgata gtaagaaaca accggtgttc acatataatc aggttttgtg 480 ctgctcgagt gatcgtcaaa aaccaccggt gttcacatct aaaaaggttt cgatccccag 540 gtttagatct cccgtttaat tccaaaaaaa aagttctgtg tacttgcatt tagttgggtg 600 gttgatgctg gaaagagtaa ctttcaagag taataatctt tggtgactac tctgtttcaa 660 ctgatcaatc cctaggaaag gtacaccttt acttagggaa gaaattctta gaaccttgca 720 ctttgtttca actgataata gtatacttta ttagataaaa aatattcaga tatattagac 780 accggatgtc atccactcat ccttacaaac ctctgtcatg gtcctgcaga aatgtttgcc 840 agctccagtg gcttcctgat aaatctgtgg agtgcctgtt aatcngctgc caatttttgc 900 tgagcactgt atatatgtta gtaagtacta ttgggccacc aattccattt tgacacagca 960 ctattggtcc accaattcga tcctgacaca gcactgcata atttgaaacg tttttgctcc 1020 cattttgcaa ggctacaaat ttagatcatg tttascatyc tgtgggatac aatatatgga 1080 tatcgaacaa acttggtatg tcagagaaaa aatagtttat tttcaaaact aacattttta 1140 aagccttcta tgaactttaa accttcagca tttgggatca agatgagtgc tcgaacaaga 1200 gtgcactttt tctccaaaat aatctactac agagttcttt tttatatata aaaaaactta 1260 tacttaacag ataaatcaga cctcttctgc tccatatcac cttgacaaat caaagaagca 1320 gcaccagcga agggtattat tattgaggta aatataagat ctcgtttact gaaaaagacc 1380 gcgtgtttac ctaaactacc attttgcttt gatagcagca tacatgtgat agaattgcgg 1440 atcctaccgt gctgactgtg aangtggtaa gggtgagaga ttggtgggcg aggtctgaac 1500 gagcgaaaac agtactgcat ttactgttca caaggaggcg gcttaggttt tggtctccca 1560 gctctctaag ggaagctgag aattatgatt ctcttgctta attatttctt aaccaaagtt 1620 ataaatatat agcctatgag atcctaattt atggaaataa ctaaactatt ttaaggaaat 1680 atataaatag ataatcagcc cactaacggg cntagcgccc actaacaggc ctggtgctga 1740 gcccgacata acatctctcc ccgcctggrg aaacagctcg tcctcgagct gaaatctggt 1800 agaagcatcw tcaaccaaca ccggggtcat gctggaacac tgcatcaggc gctaccgcag 1860 ctggtacgtc gtcgtcgagg aagtcagccg actccaagta gaacagtcgc ttacaactga 1920 tgtccgcgga cgtagggctc atcacaattg taaacaaagc cctygacgac ggcactccaa 1980 acagctcttc cggtgtgaga cgacgaaacg agggagctag agcgggtagt ggcgcgggga 2040 acagccagtg tagcgcctgt agtcaccgag ggatggggcg gtcgggcgcc gcgaggctgc 2100 ggtgccaggt ggaggttcaa cattcttcaa acgcccgtgc caagtacatg gcggactgga 2160 ggtcgggcgg tgcgcggagc tgaacctgct tacggaggtg gtccggcagc ccacccacgt 2220 acaactccgc cttttggcga gcggagaggt tgtgggcatg gcacaggacg gcgttgtaac 2280 gctccgagta atcctgaacg gaagaaccaa aaggaaggcg ggcaagctcc gccaaccgag 2340 tgcccaaaac aggaggcccg aagcgaagcg agcataattc gcggaagcgc tcccaaggag 2400 gcataccctc gtcttgctcc agggcgtagt accatgtctg ggcaacaccc cgaagatggt 2460 aggacgcgag ccatgtgcga gcggaggcga gcgtntgctg gccgcggaag aactgctcgc 2520 actggttcaa ccaattcagg ggatcggtcg aaccgtcgta cgtagggaac tccagtttgt 2580 agaatttggg ccccgcctgg gcgcctgcga gggcagcagc aagggctggg tcgagccccc 2640 cctgcggctg gccgcccgag ggagagggcg cccgaagaac agcggcccgt ccaccccccc 2700 gaaaagagtg ctggcggggg gtagggagga catcgttgtc gccgccgccg tcgtgtaggc 2760 tgtcggggag ggcgacgtgg ccatcgagta gatccgggga attc 2804 <210> 3 <211> 3691 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 3 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacecaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 ggtatcaatt ctngtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcagat g 3691 <210> 4 <211> 343 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 4 taagtcctgg gccatgagca gctgtccttc cagggttcac aagtagtggt gccttcttnc 60 tgtccctccg atggagatta tctgcatgtc gtggtcgtgt cctgatcgag tcgtcgttga 120 gtccctatgt tttttcttca agaaatgtga gtcctatgtc agtctggttg cgtttgtgaa 180 cattttctgc tgctgcgcag cagtttggtt ggaactgtgc aatgaaataa attgaaccct 240 ggtttctggt tatgtgtgtt agctaatgtt tttgaagtgg aagctntaat cttntatcgc 300 gttgctacta caattctgnt tgtgttttga tgttcttgtt tct 343 <210> 5 <211> 5512 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 5 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acaaccgcga cgcaaccgcc tctcgccgct gggccccaca ccgctcggtg ccgtagcctc 300 acgggactct ttctccctcc tcccccgtta taaattggct tcatcccctc cttgcctcat 360 ccatccaaat cccagtcccc aatcccatcc cttcgtcgga gaaattcatc gaagcgaagc 420 gaatcctcgc gatcctctca aggtactgcg agttttcgat ccccctctcg acccctcgta 480 tgtttgtgtt tgtcgtacgt ttgattaggt atgctttccc tgtttgtgtt cgtcgtagcg 540 tttgattagg tatgctttcc ctgttcgtgt tcatcgtagt gtttgattag gtcgtgtgag 600 gcgatggcct gctcgcgtcc ttcgatctgt agtcgatttg cgggtcgtgg tgtagatctg 660 cgggctgtga tgaagttatt tggtgtgatc tgctcgcctg attctgcggg ttggctcgag 720 tagatatgga tggttggacc ggttggttcg tttaccgcgc tagggttggg ctgggatgat 780 gttgcatgcg ccgttgcgcg tgatcccgca gcaggacttg cgtttgattg ccagatctcg 840 ttacgattat gtgatttggt ttggacttat tagatctgta gcttctgctt atgttgccag 900 atgcgcctac tgctccatat gcctgatgat aatccataaa tggcagtgga aatcaactag 960 ttgattgcgg agtcatgtat cagctacagg tgtagggact agctacaggt gtagggactg 1020 cgtctaattg tttggtcctt aactcatgtg caattatgca atttagttta gatgtttgtt 1080 ccaatcatct aggctgtaaa agggacactg gttagattgc tgtttaatct ttttagtaga 1140 ttatattata ttggtaactt attaacccta ttacatgcca taacgtggat tctgctcatg 1200 cctgatgata atcatagatc actgtggaat taattagttg attgttgaat catgtttcat 1260 gtacatacca cggcacaatt gcttagttcc ttaacaaatg caaattttac tgatccatgt 1320 atgatttgcg tggttctcta atgtgaaata ctatagctac ttgttagtaa gaatcaggtt 1380 cgtatgctta atgctgtatg tgccttctgc tcatgcctga tgataatcat atatcactgg 1440 aattaattag ttgatcgttt aatcatatat caagtacata ccatggcaca atttttagtc 1500 acttaaccca tgcagattga actggtccct gcatgttttg ctaaattgtt ctattctgat 1560 tagaccatat atcaggtatt tttttttggt aatggttctc ttattttaaa tgctatatag 1620 ttctggtact tgttagaaag atctggttca tagtttagtt gcctatcctt cgaattagga 1680 tgctgagcag ctgatcctat agctttgttt catgtatcaa ttcttttgtg ttcaacagtc 1740 agtttttgtt agattcattg taacttatgt tcgcttactc ttctggtcct caatgcttgc 1800 agatgcagat cttcgttaag accctcactg gcaagaccat cacccttgag gttgagtctt 1860 cagacamtat tgacmatgtc maggctaaga tacaggacaa ggaaggcatt cctccggatc 1920 agcagaggct gatctttgct ggcaagcagc tcgaggatgg ccgtacccta gytgactaca 1980 acatccagaa ggagtccacc stccacctgg tgctcaggct caggggaggc atgcaaatct 2040 tcgtcaagac cctcactggc aagactatca cgcttgaggt cgagtcttct gacacgatcg 2100 acaacgtgaa ggccaagatc caggacaagg agggaatccc cccggaccag cagcgtctca 2160 tcttcgctgg caagcagctc gaggatggcc gcaccctcgc tgactacaac atccagangg 2220 agtcgantnt ccaccttgtg ctcaggttna ggggtggcat gcagattttt gtcaagacct 2280 tnactggcaa gaccatcacc ttggaggtgg agtcttcgga caccatngac aatgtgaagg 2340 ngaagatcca ggacaaggaa ggaatccccc cagaccagca gcgtcttatt tttgctggca 2400 agcagcttga ggatggccgc accctagcag actacaacat ccagaaggag tccacccttc 2460 acctggtgct ccgcttncgc ggtggtatgc agatcttcgt caagaccctc accggcaaga 2520 ccatcaccct ggaggtggag tcctctgaca ccatcgacaa tgtgaaggcg aagatccagg 2580 acaaggaggg catccccccg gaccagcagc gtctcatctt cgccggcaag cagctggagg 2640 atggccgcac cctggcagac tacaacatcc agaaggagtc cactctccac ctggtgctcc 2700 gtctccgtgg tggccagtaa tcctgggcca tgaagctgtc cttccaggtt cacaagtctg 2760 gtgccttctt ctgtccctcc gatggagatt atctgcatgt cgtggtcgtg tcctgatcga 2820 atcctcgttg aatccctatg tttttcttca agaaatgtga gtcctatgtc agtctggttg 2880 cgtttgtgaa catttctgct gctgagcagc actttggctg gaactgtgca atgaaataaa 2940 tggaaccctg gtttctggtt atgtgtgtgt tagctaatgt ttttgaagtg gaagctctaa 3000 tcttctatcg cgttgctact acaattctgc ttgtgttttg atgttcttgg tttctgttag 3060 ttggttcaga ggaagttttg cttccacaga ctaagatgca gttgaacttt ggttgccctg 3120 gtttctagat ttcatttgtg ctggttgagt gatagtaaga aacaaccggt gttcacatat 3180 aatcaggttt tgtgctgctc gagtgatcgt caaaaaccac cggtgttcac atctaaaaag 3240 gtttcgatcc ccaggtttag atctcccgtt taattccaaa aaaaaagttc tgtgtacttg 3300 catttagttg ggtggttgat gctggaaaga gtaactttca agagtaataa tctttggtga 3360 ctactctgtt tcaactgatc aatccctagg aaaggtacac ctttacttag ggaagaaatt 3420 cttagaacct tgcactttgt ttcaactgat aatagtatac tttattagat aaaaaatatt 3480 cagatatatt agacaccgga tgtcatccac tcatccttac aaacctctgt catggtcctg 3540 cagaaatgtt tgccagctcc agtggcttcc tgataaatct gtggagtgcc tgttaatcgg 3600 ctgccaattt ttgctgagca ctgtatatat gttagtaagt actattgggc caccaattcg 3660 attttgacac agcactattg gtccaccaat tcgattctga cacagcactg cataatttga 3720 aacgtgttgc tccattttgc aaggctacaa atttagatca tgtttagcat tctgtgggat 3780 acaatatatg gatatcgaac aaacttggta tgtcagagaa aaaatagttt attttcaaaa 3840 ctaacatttt taaagccttc tatgaacttt aaaccttcag catttgggat caagatgagt 3900 gctcgaacaa gagtgcactt tttctccaaa ataatctact acagagttct tttttatata 3960 taaaaaaact tatacttaac agataaatca gactttttct gctccatatc accttgacaa 4020 atcaaagaag cagcaccagc gaagggtatt attattgagg taaatataag atctcgttta 4080 ctgaaaaaga ccgcgtgttt acctaaacta ccattttgct ttgatagcag catacatgtg 4140 atagaattgc ggatcctacc gtgctgactg tgaaggtggt aggggtgaga gattggtggg 4200 cgaggtctga acgagcgaga acagtactgc atttactgtt cacaaggagg cggcttaggt 4260 tttgggtctc ccagctctct aagggaagct gagaattatg attctcttgc ttaattattt 4320 cttaaccaaa gttataaata tatagcctat gagatcctaa tttatggaaa taactaaact 4380 attttaagga aatatataaa tagataatca gcccactaac gggcctagcg cccactaaca 4440 ggcctggtgc tgagcccgac ataacatctc tccccgcctg gagaaacagc tcgtcctcga 4500 gctgaaatct ggtagaagca tcatcaacca acaccggggt catgctggaa cactgcatca 4560 ggcgctaccg cagctggtac gtcgtcgtcg aggaagtcag ccgactccaa gtagaacagt 4620 cgcttacact gatgtccgcg gacgtagggc tcatcacaat tgtaacaaag cccttgacga 4680 cggcactcca acagctcctc cggtgtgaga cgacgaaacg agggagctag agcgggtagt 4740 ggcgcgggaa cagccagtgt agcgcctgta gtcaccgagg gatggggcgg tcgggcgccg 4800 cgaggctgcg gtgcaggtgg aggtttcaca ttcctcaaac gcccgtgcca agtacatggc 4860 ggactggagg tcgggcggtg cgcggagctg aacctgctta cggaggtggt ccggcagccc 4920 acccacgtac aactccgcct tttggcgagc ggagaggttg tgggcatggc acaggacggc 4980 gttgtaacgc tccgagtaat cctgaacgga agaaccaaaa ggaaggcggg caagctccgc 5040 caaccgagtg cccaaaacag gaggcccgaa gcgaagcgag cataattcgc ggaagcgctc 5100 ccaaggaggc ataccctcgt cttgctccag ggcgtagtac catgtctggg caacaccccg 5160 aagatggtag gacgcgagcc atgtgcgagc ggaggcgagc gtctgctggc cgcggaagaa 5220 ctgctcgcac tggttcaacc aattcagggg atcggtcgaa ccgtcgtacg tagggaactc 5280 cagtttgtag aatttgggcc ccgcctgggc gcctgcgagg gcagcagcaa gggctgggtc 5340 gagccccccc tgcggctggc cgcccgaggg agagggcgcc cgaagaacag cggcccgtcc 5400 acccccccga aaagagtgct ggcggggggt agggaagaca tcgttgtcgc cgccgccgtc 5460 gtgtaggctg tcggggaagg cgacgtggcc atcgagtaga tccggggaat tc 5512 <210> 6 <211> 305 <212> PRT
<213> Saccharum Hybrid Cultivar H32-8560 <400> 6 Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Xaa Ile Asp Xaa Val Xaa Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Xaa Asp Tyr Asn Ile Gln Lys Glu Ser Thr Xaa His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Xaa Glu Ser Xaa Xaa His Leu Val Leu Arg Xaa Arg Gly Gly Met Gln Ile Phe Val Lys Thr Xaa Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Xaa Asp Asn Val Lys Xaa Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Xaa Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gln <210> 7 <211> 1802 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 7 gaattcatta tgtggtctag gtaggttcta tatataagaa aacttgaaat gttctaaaaa 60 aaaattcaag cccatgcatg attgaagcaa acggtatagc aacggtgtta acctgatcta 120 gtgatctctt gcaatcctta acggccacct accgcaggta gcaaacggcg tccccctcct 180 cgatatctcc gcggcgacct ctggcttttt ccgcggaatt gcgcggtggg gacggattcc 240 acaaccgcga cgcaaccgcc tctcgccgct gggccccaca ccgctcggtg ccgtagcctc 300 acgggactct ttctccctcc tcccccgtta taaattggct tcatcccctc cttgcctcat 360 ccatccaaat cccagtcccc aatcccatcc cttcgtcgga gaaattcatc gaagcgaagc 420 gaatcctcgc gatcctctca aggtactgcg agttttcgat ccccctctcg acccctcgta 480 tgtttgtgtt tgtcgtacgt ttgattaggt atgctttccc tgtttgtgtt cgtcgtagcg 540 tttgattagg tatgctttcc ctgttcgtgt tcatcgtagt gtttgattag gtcgtgtgag 600 gcgatggcct gctcgcgtcc ttcgatctgt agtcgatttg cgggtcgtgg tgtagatctg 660 cgggctgtga tgaagttatt tggtgtgatc tgctcgcctg attctgcggg ttggctcgag 720 tagatatgga tggttggacc ggttggttcg tttaccgcgc tagggttggg ctgggatgat 780 gttgcatgcg ccgttgcgcg tgatcccgca gcaggacttg cgtttgattg ccagatctcg 840 ttacgattat gtgatttggt ttggacttat tagatctgta gcttctgctt atgttgccag 900 atgcgcctac tgctccatat gcctgatgat aatccataaa tggcagtgga aatcaactag 960 ttgattgcgg agtcatgtat cagctacagg tgtagggact agctacaggt gtagggactg 1020 cgtctaattg tttggtcctt aactcatgtg caattatgca atttagttta gatgtttgtt 1080 ccaatcatct aggctgtaaa agggacactg gttagattgc tgtttaatct ttttagtaga 1140 ttatattata ttggtaactt attaacccta ttacatgcca taacgtggat tctgctcatg 1200 cctgatgata atcatagatc actgtggaat taattagttg attgttgaat catgtttcat 1260 gtacatacca cggcacaatt gcttagttcc ttaacaaatg caaattttac tgatccatgt 1320 atgatttgcg tggttctcta atgtgaaata ctatagctac ttgttagtaa gaatcaggtt 1380 cgtatgctta atgctgtatg tgccttctgc tcatgcctga tgataatcat atatcactgg 1440 aattaattag ttgatcgttt aatcatatat caagtacata ccatggcaca atttttagtc 1500 acttaaccca tgcagattga actggtccct gcatgttttg ctaaattgtt ctattctgat 1560 tagaccatat atcaggtatt tttttttggt aatggttctc ttattttaaa tgctatatag 1620 ttctggtact tgttagaaag atctggttca tagtttagtt gcctatcctt cgaattagga 1680 tgctgagcag ctgatcctat agctttgttt catgtatcaa ttcttttgtg ttcaacagtc 1740 agtttttgtt agattcattg taacttatgt tcgcttactc ttctggtcct caatgcttgc 1800 ag 1802 <210> 8 <211> 5174 <212> DNA
<213> Saccharum Hybrid Cultivar H32-8560 <400> 8 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacccaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 kgtatcaatt cttgtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcagat gcagattttc gttaagaccc tcactggcaa 3720 gaccatcacc cttgaggttg agtcctcaga cactattgac aatgtcaagg ctaagatcca 3780 ggacaaggaa ggcattcctc cagatcagca gaggctgaty tttgctggca agcagctcga 3840 ggatggccgt accctagctg actacaacat ccagaaggag tccaccctcc acctggtgct 3900 caggcttagg ggaggcatgc agattttcgt caagaccctc actggcaaga ctatcacgct 3960 tgaggtcgag tcttctgaca cgatcgacaa cgtgaaggcc aagatccagg acaaggaggg 4020 aatccccccg gaccagcagc gtytcatttt cgctggcaag cagctcgagg atggccgcac 4080 cctcgctgac tacaacatcc agaaggagtc gactctccac cttgtgctca ggctcagggg 4140, tggcatgcag atcttcgtca agaccctcac tggcaagacc atcaccttgg aggtggagtc 4200 ctcggacacc attgacaatg tgaaggcgaa gatccaggac aaggagggca tccccccgga 4260 ccagcagcgt ctcatyttcg ccggcaagca rcttgaggat ggccgcaccc ttgcgganta 4320 caacatccag aargagtcca cccttcacct ggtgctccgc cttcgtggtg gtatgcagat 4380 tttcgtcaag accctcaccg gcaagaccat caccctggag gtggagtcct ctgacaccat 4440 tgacaatgtg aaggcgaaga tccaggataa ggagggcatc cccccggacc agcagcgtyt 4500 tatctttgct ggcaagcagc ttgaggatgg ccgcaccctg gcagantaca acatccagaa 4560 ggagtccacc cttcacctgg tgctccgcct tcgcggtggt atgcagatyt tcgtcaagac 4620 cctcaccggc aagaccatca ccctggaggt ggagtcctct gacaccatcg acaatgtgaa 4680 ggcgaagatc caggacaagg agggcatccc cccggaccag cagcgtctca tcttcgccgg 4740 caagcagctg gaggatggcc gcaccctggc agactacaac atccagaagg agtccactct 4800 ccacctggtg ctccgtctcc gtggtggcca gtaagtcctg ggccatgagc agctgtcctt 4860 ccagggttca caagtagtgg tgccttcttn ctgtccctcc gatggagatt atctgcatgt 4920 cgtggtcgtg tcctgatcga gtcgtcgttg agtccctatg ttttttcttc aagaaatgtg 4980 agtcctatgt cagtctggtt gcgtttgtga acattttctg ctgctgcgca gcagtttggt 5040 tggaactgtg caatgaaata aattgaaccc tggtttctgg ttatgtgtgt tagctaatgt 5100 ttttgaagtg gaagctntaa tcttntatcg cgttgctact acaattctgn ttgtgttttg 5160 atgttcttgt ttct 5174 <210> 9 <211> 381 <212> PRT
<213> Saccharum Hybrid Cultivar H32-8560 <400> 9 Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Giy Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe 65 70 75 g0 Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Xaa Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Xaa Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Xaa Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Xaa Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gln <210> 10 <211> 3688 <212> DNA
<213> Saccharum Hybrid Cultivar 32-8560 <400> 10 aagttttgnt aaaatgaaca aagaattggg gaaactatag ccaaagtggg tggggaatgg 60 tgccaaacaa aacttcgtaa accaacccaa aaagatccgg aaaacaaatg gatacgtgca 120 gggcatgcat gcaatagccc agccataaaa agcggcgagc caatgcccgg gtgtcaaaca 180 aaatggcgcc tgtgccggct ctggctgctt ccggctcagc tttcggaacg atccgccgca 240 gtttggcctc gcatatgatg acgatgatgg tctcctcttc tcgatttgta gctccggcat 300 gggagccacc tcctgtcggc tcacacatag cacgcgcctt agcccgtgct cgctctcccc 360 tagatgcttc acctgcgcca atcagtgtga gcccatcgtg tcagatggta ctcgtacgta 420 tggagtaacg tgataccaca acacgtacac tggtcagaat tgatagtata tgatcctgtc 480 gacccgatgt gttttagtac cttgcagtgg ccggagagga gtggccgcgc gcatgcggcg 540 caggggttct ccgcgctcgc tgatcgcttc ctcactgtgc gctcgtttag gaacaccacc 600 tcgtggtcgc tcaccatgtg tgactgcatg caacgctacg aatcaggacc cagatggaaa 660 cgaagcgcct ctcgaccacc tctgcctcgg tgatggttgg tgtgcagtgc gtacgcatgc 720 acgctaccaa tatcatacct ggatgccggt gcaatcgaac agcttcaggt tgtcgacgcg 780 gacggcgaag caggacgcgt acttccatat ctttgggttc cattacgtac cgtcaatcga 840 ataaataaag agaagagttt gagatcagct tgttgggagc aggtgaccgc ccgacatgca 900 tgccgattgt cgacggcacg gaaataaaca acacatttgt gagggagcca gggaggcagt 960 ggcggcacag cgtcgcggca cagtcgatgc agaagtggtt cttgtcgttc ttgcgctccc 1020 cccgggtgtg cagcgcacgc ctttgaaaaa ctccgatagc aggccacaca gccattgcgg 1080 ggcgccgcgc acggccgcca gctgcatccc cgtttgttcg cacatgcgct aggtggtcct 1140 gcggccgttc cttgcaccgc ggagacgcgg ggtggaccag tgggggaatg gatgaactgc 1200 tggtaggttt ggttggattg gcgagtgcgt agagggggca tgggcaacga tagactcgat 1260 tcaattcaaa gactgaaaat agtggagttc taacaccatt ctgtgcggcg ctaattctcg 1320 acatggcagg cgtaagcata ataccgacat ggcatgcaac gatgttcgtg aacagtggtg 1380 acacatggat atggtggccg tccaggggat tcgttccatt caattcaaag accgaaaatc 1440 gcggggttcc gtagcatttt gtgcggtgct aattctcgaa catgcgagac gtaagcctaa 1500 taccgagatg gcatgcaaca atgttcgtga acaacagtga cacgtggatg cggtggccgt 1560 ctagggattc gcgttctaag ctggtatatg tgcggtgtta attcttgaca tgcggggcgt 1620 aagtgtaata ccaagatgaa cggtgacacg tggacgcggg ggtcgtcaaa caattcattc 1680 cgtggtctag ggtaggttat atataaaggc cagtcttagt gggggatttt atggccatgt 1740 tattaatgca acccatattt ggaaaacagt gcaggaagag tttcatcttc gtaaaactct 1800 ctctaattcc atgaaactct tatcatctct ctcttcatca atacggtgcc acatcagcct 1860 atttaatgtc catgaaactc tgatgaaatc cactgagacg ggcctcagaa aacttgaaat 1920 cttctaaaaa aaattcaagt ccatgcatga ttgaagcaaa cggtatagca acggtgttaa 1980 cctgatctag tgatctcttg taatccttaa cggccaccta ccacaggtag caaacggcgt 2040 ccccctcctc gatatctccg cggcggcctc tggctttttc cgcggaattg cgcggtgggg 2100 acggattcct cgagaccgcg acacaaccgc ctttcgccgc tgggccccac accgctcggt 2160 gccgtagcct cacgggactc tttctccctc ctcccccgct ataaattggc ttcatcccct 2220 ccttgcctca tccatccaaa tcccagtccc caatcccagc ccatcgtcgg agaaattcat 2280 agaagcgaag cgaatcctcg cgatcctctc aaggtagtgc gagttttcga ttcccctctc 2340 gacccctcgt atgctttccc tgtttgtgtt tcgtcgtagc gtttgattag gtatgctttc 2400 cctgtttgtg ttcgtcgtag cgtttgattt ggtatgcttt ccccgttcgt gttcctcgta 2460 gtgtttgatt aggtcgtgtg aggcgatggc ctgctcgcat ccttcgatct gtagtcgatt 2520 tgcgggtcgt ggtgtagatc tgcgggctgt gatgaagtta tttggtgtga tcgtgctcgc 2580 ctgattctgc gggttggctc gagtagatat gatggttgga ccggttggtt tgtttaccgc 2640 gctagggttg ggctgggatg atgttgcatg cgccgttgcg cgtgatcccg cagcaggact 2700 tgcgtttgat tgccagatct cgttacgatt atgtgatttg gtttggactt tttagatctg 2760 tagcttctgc ttatgtgcca gatgcgccta ctgctcatat gcctgatgat aatcataaat 2820 ggctgtggaa ctaactagtt gattgcggag tcatgtatca gctacaggtg tagggactag 2880 ctacaggtgt agggacttgc gtctaaattg tttggtcctg tactcatgtt gcaattatgc 2940 aatttagttt agattgtttg ttccactcat ctaggctgta aaagggacac tgcttagatt 3000 gctgtttaat ctttttagta gattatatat tatattggta acttattacc cttattacat 3060 gccatacgtg acttctgctc atgcctgatg ataatcatag atcactgtgg aattaattag 3120 ttgattgttg aatcatgttt catgtacata ccacggcaca attgcttagt tccttaacaa 3180 atgcaaattt tactgatcca tgtatgattt gcgtggttct ctaatgtgaa atactatagc 3240 tacttgttag taagaatcag gttcgtatgc ttaatgctgt atgtgccttc tgctcatgcc 3300 tgatgataat catatatcac tggaattaat tagttgatcg tttaatcata tatcaagtac 3360 ataccatggc acaattttta gtcacttaac ccatgcagat tgaactggtc cctgcatgtt 3420 ttgctaaatt gttctatttc tgattagacc atatatcatg taattttttt tttgggtaat 3480 ggttctccta ttttaaatgc tatatagttc tggtacttgt tagaaaaatc tgcttccata 3540 gtttagttgc ttatccctcg aattatgatg ctgagcagct gatcctatag ctttgtttca 3600 kgtatcaatt cttgtgttca acagtcagtt tttgttagat tcattgtaac ttatggtcgc 3660 ttactcttct ggtcctcaat gcttgcag 3688 <210> 11 <211> 2146 <212> DNA
<213> Lycopersicon esculentum <400> 11 agatctacaa ttatcngcaa cgtgttacac attttgtgct acaatatacc ttcaccattt 60 tgtgtatata taaaggttgc atctcttcaa acaaaaatca ctccatcaca acacaatgtc 120 ttcttcttct tctattacta ctactcttcc tttatgcacc aacaaatccc tctcttcttc 180 is i m cttcaccacc accaactcat ccttgttatc aaaaccctct caacttttcc tccacggaag 240 gcgtaatcaa agtttcaagg tttcatgcaa cgcaaacaac gttgacaaaa accctgacgc 300 tgttgataga cgaaacgttc ttttagggtt aggaggtctt tatggtgcag ctaatcttgc 360 accattagcg actgctgcac ctataccacc tcctgatctc aagtcttgtg gtactgccca 420 tgtaaaagaa ggtgttgatg taatatacag ttgttgccct cctgtacccg atgatatcga 480 tagtgttccg tactacaagt tcccttctat gactaaactc cgcatccgcc cccctgctca 540 tgcggcggat gaggagtacg tagccaagta tcaattggct acgagtcgaa tgagggaact 600 tgataaagac ccctttgacc ctcttggctt taaacaacaa gctaatattc attgtgctta 660 ttgcaacggt gcttacaaag ttggtggcaa agaattgcaa gttcatttct cgtggctttt 720 ctttcccttt catagatggt acttgtactt ttacgaaaga attttgggat cacttattaa 780 tgatccaact tttgctttac cttactggaa ttgggatcat ccaaaaggca tgcgtatacc 840 tcccatgttt gatcgtgagg gatcatctct ttacgatgag aaacgtaacc aaaatcatcg 900 caatggaact attattgatc ttggtcattt tggtaaggaa gttgacacac ctcagctaca 960 gataatgact aataatttaa ccctaatgta ccgtcaaatg gttactaatg ctccttgccc 1020 ttcccaattc ttcggtgctg cttacctctg ggttctgaac ccaagtccgg gtcagggtac 1080 tattgaaaac atccctcata ctccggttca catctggacc ggtgacaaac ctcgtcaaaa 1140 aaacggtgaa gacatgggta atttctactc agccggttta gatccgattt tttactgcca 1200 ccatgccaat gtggacagga tgtggaatga atggaaatta attggcggga aaagaaggga 1260 tttaacagat aaagattggt tgaactctga attctttttc tacgatgaaa atcgtaaccc 1320 ttaccgtgtg aaagtccgtg atgttttgga cagtaaaaaa atgggattcg attacgcgcc 1380 aatgcccact ccatggcgta attttaaacc aatcagaaag tcatcatcag gaaaagtgaa 1440 tacagcgtca attgcaccag ttagcaaggt gttcccattg gcgaagctgg accgtgcgat 1500 ttcgttctct atcacgcggc cagcctcgtc aaggacaaca caagagaaaa atgagcagga 1560 ggagattctg acattcaata aaatatcgta tgatgatagg aactatgtaa ggttcgatgt 1620 gtttctgaac gtggacaaga ctgtgaatgc agatgagctt gataaggcgg agtttgcagg 1680 gagttatact agcttgccgc atgttcatgg aagtaatact aatcatgtta ccagtgttac 1740 tttcaagctg gcgataactg aactgttgga ggatattgga ttggaagatg aagatactat 1800 cgcggtgact ttaattccaa aagctggcgg tgaaggtgta tccattgaaa gtgtggagat 1860 caagcttgag gattgttaaa gtctgcatga gttggtggct atggagccaa atttatgttt 1920 aattagtata attatgtgtg gtttgagtta tgttttatgt taaaatgtat cagctcgatc 1980 gatagctgat tgctagttgt gttaatgcta tgtatgaaat aaataaatgg ttgtcttcca 2040 ttcagtttat cattttttgt cattctaatt aacggttaac ttttttttct actatttata 2100 cgaagctact atactatgta tatcatttgg aaaattatat attatt 2146 <210> 12 <211> 3509 <?,12 > DNA
<213> Zea mays <400> 12 gaattccggc gtgggcgctg ggctagtgct cccgcagcga gcgatctgag agaacggtag 60 agttccggcc gggcgcgcgg gagaggagga gggtcgggcg gggaggatcc gatggccggg 120 aacgagtgga tcaatgggta cctggaggcg atcctcgaca gccacacctc gtcgcggggt 180 gccggcggcg gcggcggcgg gggggacccc aggtcgccga cgaaggcggc gagcccccgc 240 ggcgcgcaca tgaacttcaa cccctcgcac tacttcgtcg aggaggtggt caagggcgtc 300 gacgagagcg acctccaccg gacgtggatc aaggtcgtcg ccacccgcaa cgcccgcgag 360 cgcagcacca ggctcgagaa catgtgctgg cggatctggc acctcgcgcg caagaagaag 420 cagctggagc tggagggcat ccagagaatc tcggcaagaa ggaaggaaca ggagcaggtg 480 cgtcgtgagg cgacggagga cctggccgag gatctgtcag aaggcgagaa gggagacacc 540 atcggcgagc ttgcgccggt tgagacgacc aagaagaagt tccagaggaa cttctctgac 600 cttaccgtct ggtctgacga caataaggag aagaagcttt acattgtgct catcagcgtg 660 catggtcttg ttcgtggaga aaacatggaa ctaggtcgtg attctgatac aggtggccag 720 gtgaaatatg tggtcgaact tgcaagagcg atgtcaatga tgcctggagt gtacagggtg 780 gacctcttca ctcgtcaagt gtcatctcct gacgtggact ggagctacgg tgagccaacc 840 gagatgttat gcgccggttc caatgatgga gaggggatgg gtgagagtgg cggagcctac 900 attgtgcgca taccgtgtgg gccgcgggat aaatacctca agaaggaagc gttgtggcct 960 tacctccaag agtttgtcga tggagccctt gcgcatatcc tgaacatgtc caaggctctg 1020 ggagagcagg ttggaaatgg gaggccagta ctgccttacg tgatacatgg gcactatgcc 1080 gatgctggag atgttgctgc tctcctttct ggtgcgctga atgtgccaat ggtgctcact 1140 ggccactcac ttgggaggaa caagctggaa caactgctga agcaagggcg catgtccaag 1200 gaggagatcg attcgacata caagatcatg aggcgtatcg agggtgagga gctggccctg 1260 gatgcgtcag agcttgtaat cacgagcaca aggcaggaga ttgatgagca gtggggattg 1320 tacgatggat ttgatgtcaa gcttgagaaa gtgctgaggg cacgggcgag gcgcggggtt 1380 agctgccatg gtcgttacat gcctaggatg gtggtgattc ctccgggaat ggatttcagc 1440 aatgttgtag ttcatgaaga cattgatggg gatggtgacg tcaaagatga tatcgttggt 1500 ttggagggtg cctcacccaa gtcaatgccc ccaatttggg ccgaagtgat gcggttcctg 1560 accaaccctc acaagccgat gatcctggcg ttatcaagac cagacccgaa gaagaacatc 1620 actaccctcg tcaaagcgtt tggagagtgt cgtccactca gggaacttgc aaaccttact 1680 ctgatcatgg gtaacagaga tgacatcgac gacatgtctg ctggcaatgc cagtgtcctc 1740 accacagttc tgaagctgat tgacaagtat gatctgtacg gaagcgtggc gttccctaag 1800 catcacaatc aggctgacgt cccggagatc tatcgcctcg cggccaaaat gaagggcgtc 1860 ttcatcaacc ctgctctcgt tgagccgttt ggtctcaccc tgatcgaggc tgcggcacac 1920 ggactcccga tagtcgctac caagaatggt ggtccggtcg acattacaaa tgcattaaac 1980 aacggactgc tcgttgaccc acacgaccag aacgccatcg ctgatgcact gctgaagctt 2040 gtggcagaca agaacctgtg gcaggaatgc cggagaaacg ggctgcgcaa catccacctc 2100 tactcatggc cggagcactg ccgcacttac ctcaccaggg tggccgggtg ccggttaagg 2160 aacccgaggt ggctgaagga cacaccagca gatgccggag ccgatgagga ggagttcctg 2220 gaggattcca tggacgctca ggacctgtca ctccgtctgt ccatcgacgg tgagaagagc 2280 tcgctgaaca ctaacgatcc actgtggttc gacccccagg atcaagtgca gaagatcatg 2340 aacaacatca agcagtcgtc agcgcttcct ccgtccatgt cctcagtcgc agccgagggc 2400 acaggcagca ccatgaacaa atacccactc ctgcgccggc gccggcgctt gttcgtcata 2460 gctgtggact gctaccagga cgatggccgt gctagcaaga agatgctgca ggtgatccag 2520 gaagttttca gagcagtccg atcggactcc cagatgttca agatctcagg gttcacgctg 2580 tcgactgcca tgccgttgtc cgagacactc cagcttctgc agctcggcaa gatcccagcg 2640 accgacttcg acgccctcat ctgtggcagc ggcagcgagg tgtactatcc tggcacggcg 2700 aactgcatgg acgctgaagg aaagctgcgc ccagatcagg actatctgat gcacatcagc 2760 caccgctggt cccatgacgg cgcgaggcag accatagcga agctcatggg cgctcaggac 2820 ggttcaggcg acgctgtcga gcaggacgtg gcgtccagta atgcacactg tgtcgcgttc 2880 ctcatcaaag acccccaaaa ggtgaaaacg gtcgatgaga tgagggagcg gctgaggatg 2940 cgtggtctcc gctgccacat catgtactgc aggaactcga caaggcttca ggttgtccct 3000 ctgctagcat caaggtcaca ggcactcagg tatctttccg tgcgctgggg cgtatctgtg 3060 gggaacatgt atctgatcac cggggaacat ggcgacaccg atctagagga gatgctatcc 3120 gggctacaca agaccgtgat cgtccgtggc gtcaccgaga agggttcgga agcactggtg 3180 aggagcccag gaagctacaa gagggacgat gtcgtcccgt ctgagacccc cttggctgcg 3240 tacacgactg gtgagctgaa ggccgacgag atcatgcggg ctctgaagca agtctccaag 3300 acttccagcg gcatgtgaat ttgatgcttc ttttacattt tgtccttttc ttcactgcta 3360 tataaaataa gttgtgaaca gtaccgcggg tgtgtatata tatattgcag tgacaaataa 3420 aacaggacac tgctaactat actggtgaat atacgactgt caagattgta tgctaagtac 3480 tccatttctc aatgtatcaa tcggaattc 3509
Claims (21)
1. A substantially purified nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID
NO:1, homologs of SEQ ID NO: 1. homologs of the complement of SEQ ID NO:1; SEQ ID
NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3.
NO:1, homologs of SEQ ID NO: 1. homologs of the complement of SEQ ID NO:1; SEQ ID
NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3.
2. The substantially purified nucleic acid sequence of Claim 1, wherein said nucleotide sequence is characterized by having promoter activity.
3. The substantially purified nucleic acid sequence of Claim 2, wherein said promoter activity is constitutive.
4. A substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:1 and the complement thereof.
5. The substantially purified nucleic acid sequence of Claim 4, wherein said portion is characterized by having promoter activity.
6. The substantially purified nucleic acid sequence of Claim 5, wherein said promoter activity is constitutive.
7. The substantially purified nucleic acid sequence of Claim 4, wherein said portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID NO:7, from 245 to 787 of SEQ ID NO:7, from 788 to 1020 of SEQ ID NO:7, from 1021 to 1084 of SEQ ID NO:7, from 1085 to 1168 of SEQ ID
NO:7, from 1169 to 1173 of SEQ ID NO:7, from 1174 to 1648 of SEQ ID NO:7, from to 1805 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ
ID
NO:1, and from 445 to 1810 of SEQ ID NO:1.
NO:7, from 1169 to 1173 of SEQ ID NO:7, from 1174 to 1648 of SEQ ID NO:7, from to 1805 of SEQ ID NO:1, from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ
ID
NO:1, and from 445 to 1810 of SEQ ID NO:1.
8. A substantially purified nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:3 and the complement thereof.
9. The substantially purified nucleic acid sequence of Claim 8, wherein said portion is characterized by having promoter activity.
10. The substantially purified nucleic acid sequence of Claim 9, wherein said promoter activity is constitutive.
11. The substantially purified nucleic acid sequence of Claim 8, wherein said portion comprises the nucleotide sequence selected from the group consisting of the nucleotides I to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID NO:10, from to 3691 of SEQ ID NO:3, from 1 to 2248 of SEQ ID NO:3, from 2249 to 2313 of SEQ ID
NO:3, from 2314 to 3688 of SEQ ID NO:3, and from 1671 to 2248 of SEQ ID NO:3.
NO:3, from 2314 to 3688 of SEQ ID NO:3, and from 1671 to 2248 of SEQ ID NO:3.
12. A transgenic plant cell comprising a nucleic acid sequence comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:1, the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID
NO:1;
SEQ ID NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3, wherein said nucleotide sequence is operably linked to a nucleic acid sequence of interest.
NO:1;
SEQ ID NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3, wherein said nucleotide sequence is operably linked to a nucleic acid sequence of interest.
13. A transgenic plant cell comprising a nucleic acid sequence comprising a portion of a nucleotide sequence selected from the group consisting of SEQ ID
NO:1, the complement of SEQ ID NO:1, SEQ ID NO:3, and the complement of SEQ ID NO:3.
NO:1, the complement of SEQ ID NO:1, SEQ ID NO:3, and the complement of SEQ ID NO:3.
14. A method for expressing a nucleic acid sequence of interest in a plant cell, comprising:
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a nucleotide sequence selected from the group consisting of SEQ ID
NO:1 , the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID NO:1; SEQ ID NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3;
b) operably linking said nucleic acid sequence of interest to said nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a nucleotide sequence selected from the group consisting of SEQ ID
NO:1 , the complement of SEQ ID NO:1, homologs of SEQ ID NO:1, homologs of the complement of SEQ ID NO:1; SEQ ID NO:3, the complement of SEQ ID NO:3, homologs of SEQ ID NO:3, and homologs of the complement of SEQ ID NO:3;
b) operably linking said nucleic acid sequence of interest to said nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
15. The method of Claim 14, further comprising d) identifying said transgenic plant cell.
16. The method of Claim 14, further comprising d) regenerating transgenic plant tissue from said transgenic plant cell.
17. The method of Claim 14, further comprising d) regenerating a transgenic plant from said transgenic plant cell.
18. A method for expressing a nucleic acid sequence of interest in a plant cell, comprising:
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:1 and the complement thereof;
b) operably linking said nucleic acid sequence of interest to said portion of a nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:1 and the complement thereof;
b) operably linking said nucleic acid sequence of interest to said portion of a nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
19. The method of Claim 18, wherein said portion comprises the nucleotide sequence selected from the group consisting of the nucleotides from 1 to 242 of SEQ ID
NO:7, from 245 to 787 of SEQ ID NO:7, from 788 to 1020 of SEQ ID NO:7, from 1021 to 1084 of SEQ ID NO:7, from 1085 to 1168 of SEQ ID NO:7, from 1169 to 1173 of SEQ ID
NO:7, from 1174 to 1648 of SEQ ID NO:7, from 1649 to 1805 of SEQ ID NO: 1 from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ ID NO:1, and from 445 to 1810 of SEQ ID
NO:1.
NO:7, from 245 to 787 of SEQ ID NO:7, from 788 to 1020 of SEQ ID NO:7, from 1021 to 1084 of SEQ ID NO:7, from 1085 to 1168 of SEQ ID NO:7, from 1169 to 1173 of SEQ ID
NO:7, from 1174 to 1648 of SEQ ID NO:7, from 1649 to 1805 of SEQ ID NO: 1 from 1 to 378 of SEQ ID NO:1, from 379 to 444 of SEQ ID NO:1, and from 445 to 1810 of SEQ ID
NO:1.
20. A method for expressing a nucleic acid sequence of interest in a plant cell.
comprising:
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:3 and the complement thereof;
b) operably linking said nucleic acid sequence of interest to said portion of a nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
comprising:
a) providing:
i) a plant cell;
ii) a nucleic acid sequence of interest; and iii) a portion of a nucleotide sequence selected from the group consisting of SEQ ID NO:3 and the complement thereof;
b) operably linking said nucleic acid sequence of interest to said portion of a nucleotide sequence to produce a transgene; and c) introducing said transgene into said plant cell to produce a transgenic plant cell under conditions such that said nucleic acid sequence of interest is expressed in said transgenic plant cell.
21. The method of Claim 20, wherein said portion comprises the nucleotide sequence selected from the group consisting of the nucleotides 1 to 3600 of SEQ ID NO:10, from 3602 to 3612 of SEQ ID NO:10, from 3614 to 3691 of SEQ ID NO:3, from 1 to of SEQ ID NO:3, from 2249 to 2313 of SEQ ID NO:3, from 2314 to 3688 of SEQ ID
NO:3, and from 1671 to 2248 of SEQ ID NO:3.
NO:3, and from 1671 to 2248 of SEQ ID NO:3.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US7876898P | 1998-03-19 | 1998-03-19 | |
US60/078,768 | 1998-03-19 | ||
PCT/US1999/005985 WO1999046976A1 (en) | 1998-03-19 | 1999-03-18 | Plant promoter sequences and methods of use thereof |
USUNKNOWN | 2005-05-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2324520A1 true CA2324520A1 (en) | 1999-09-23 |
Family
ID=22146099
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002324520A Abandoned CA2324520A1 (en) | 1998-03-19 | 1999-03-18 | Plant promoter sequences and methods of use thereof |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1063880A1 (en) |
AU (1) | AU3192799A (en) |
CA (1) | CA2324520A1 (en) |
WO (1) | WO1999046976A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001032897A2 (en) * | 1999-11-05 | 2001-05-10 | South African Sugar Association | A high level, stable, constitutive promoter element for plants |
EP1589813B1 (en) * | 2003-01-03 | 2015-02-25 | The Texas A & M University System | Stem-regulated, plant defense promoter and uses thereof in tissue-specific expression in monocots |
CA2681661C (en) | 2007-03-23 | 2015-11-24 | New York University | Methods of increasing nitrogen-assimilation capacity in transgenic plants expressing cca1 and glk1 |
RU2644205C2 (en) | 2011-03-25 | 2018-02-08 | Монсанто Текнолоджи Ллс | Plant regulatory elements and their application |
-
1999
- 1999-03-18 CA CA002324520A patent/CA2324520A1/en not_active Abandoned
- 1999-03-18 WO PCT/US1999/005985 patent/WO1999046976A1/en not_active Application Discontinuation
- 1999-03-18 AU AU31927/99A patent/AU3192799A/en not_active Abandoned
- 1999-03-18 EP EP99913971A patent/EP1063880A1/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
EP1063880A1 (en) | 2001-01-03 |
AU3192799A (en) | 1999-10-11 |
WO1999046976A1 (en) | 1999-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3565562B2 (en) | Gene promoter specific to root bark layer | |
CN110139872A (en) | Plant seed character-related protein, gene, promoter and SNP and haplotype | |
US6706948B1 (en) | Sugarcane UBI9 gene promoter and methods of use thereof | |
JPH10505233A (en) | Circovirus-derived plant transcriptional regulator | |
US7943753B2 (en) | Auxin transport proteins | |
CA2617876A1 (en) | Nitrate transport components | |
US6452069B1 (en) | SF3 promoter and methods of use | |
US20130125256A1 (en) | Nitrate transport components | |
US7560612B2 (en) | Early-inflorescence-preferred regulatory elements and uses thereof | |
US6686513B1 (en) | Sugarcane ubi9 gene promoter sequence and methods of use thereof | |
US7276596B2 (en) | Promoter from maize invertase inhibitor gene | |
CA2324520A1 (en) | Plant promoter sequences and methods of use thereof | |
US20040191912A1 (en) | New constitutive plant promoter | |
CN114539373A (en) | IbPIF1 related to sweet potato stem nematode resistance as well as encoding gene and application thereof | |
CA2341780A1 (en) | Regulatory sequences for root specific or root abundant gene expression in plants | |
WO2000008189A2 (en) | Plant resistance gene | |
EP1761634B1 (en) | Cell number polynucleotides and polypeptides and methods of use thereof | |
EP1078086A2 (en) | Plant-derived resistance gene | |
MXPA00009202A (en) | Plant promoter sequences and methods of use thereof | |
WO2001012799A2 (en) | Regulatory sequences for pollen specific or pollen abundant gene expression in plants | |
EP1228216A2 (en) | Plants with a modified flower and seed development | |
US20020059657A1 (en) | Homeobox binding sites and their uses | |
US7199235B2 (en) | Plant promoters | |
US20020124282A1 (en) | Plant reproduction polynucleotides and methods of use | |
EP1268829B1 (en) | Atrsp gene promoters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |