CA2584934A1 - Nitrogen-regulated sugar sensing gene and protein and modulation thereof - Google Patents
Nitrogen-regulated sugar sensing gene and protein and modulation thereof Download PDFInfo
- Publication number
- CA2584934A1 CA2584934A1 CA 2584934 CA2584934A CA2584934A1 CA 2584934 A1 CA2584934 A1 CA 2584934A1 CA 2584934 CA2584934 CA 2584934 CA 2584934 A CA2584934 A CA 2584934A CA 2584934 A1 CA2584934 A1 CA 2584934A1
- Authority
- CA
- Canada
- Prior art keywords
- sequence
- plant
- nucleotide sequence
- gene
- expression
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 302
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 title claims abstract description 146
- 229910052757 nitrogen Inorganic materials 0.000 title claims abstract description 73
- 230000001105 regulatory effect Effects 0.000 title abstract description 31
- 235000000346 sugar Nutrition 0.000 title abstract description 22
- 102000004169 proteins and genes Human genes 0.000 title description 98
- 230000014509 gene expression Effects 0.000 claims abstract description 221
- 108010088742 GATA Transcription Factors Proteins 0.000 claims abstract description 40
- 230000014075 nitrogen utilization Effects 0.000 claims abstract description 20
- 102000009041 GATA Transcription Factors Human genes 0.000 claims abstract description 16
- 241000196324 Embryophyta Species 0.000 claims description 414
- 239000002773 nucleotide Substances 0.000 claims description 266
- 125000003729 nucleotide group Chemical group 0.000 claims description 266
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 201
- 150000007523 nucleic acids Chemical class 0.000 claims description 198
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 192
- 229920001184 polypeptide Polymers 0.000 claims description 191
- 210000004027 cell Anatomy 0.000 claims description 161
- 102000039446 nucleic acids Human genes 0.000 claims description 155
- 108020004707 nucleic acids Proteins 0.000 claims description 155
- 238000000034 method Methods 0.000 claims description 150
- 239000012634 fragment Substances 0.000 claims description 110
- 230000000295 complement effect Effects 0.000 claims description 73
- 210000001519 tissue Anatomy 0.000 claims description 61
- 230000009466 transformation Effects 0.000 claims description 61
- 230000009261 transgenic effect Effects 0.000 claims description 59
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 57
- 150000001413 amino acids Chemical class 0.000 claims description 46
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 41
- 240000008042 Zea mays Species 0.000 claims description 38
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 37
- 235000009973 maize Nutrition 0.000 claims description 34
- 241000209510 Liliopsida Species 0.000 claims description 33
- 230000004048 modification Effects 0.000 claims description 31
- 238000012986 modification Methods 0.000 claims description 31
- 239000000203 mixture Substances 0.000 claims description 30
- 241000589158 Agrobacterium Species 0.000 claims description 29
- 239000003550 marker Substances 0.000 claims description 29
- 230000033228 biological regulation Effects 0.000 claims description 21
- 241000209056 Secale Species 0.000 claims description 20
- 238000009395 breeding Methods 0.000 claims description 20
- 241001233957 eudicotyledons Species 0.000 claims description 20
- 230000029553 photosynthesis Effects 0.000 claims description 20
- 238000010672 photosynthesis Methods 0.000 claims description 20
- 230000002441 reversible effect Effects 0.000 claims description 20
- 241000209140 Triticum Species 0.000 claims description 19
- 235000021307 Triticum Nutrition 0.000 claims description 19
- 230000001488 breeding effect Effects 0.000 claims description 19
- 235000016709 nutrition Nutrition 0.000 claims description 19
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 claims description 18
- 240000006394 Sorghum bicolor Species 0.000 claims description 17
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 17
- 239000003795 chemical substances by application Substances 0.000 claims description 17
- 229930002875 chlorophyll Natural products 0.000 claims description 17
- 235000019804 chlorophyll Nutrition 0.000 claims description 17
- 230000004060 metabolic process Effects 0.000 claims description 17
- 102000040430 polynucleotide Human genes 0.000 claims description 17
- 108091033319 polynucleotide Proteins 0.000 claims description 17
- 239000002157 polynucleotide Substances 0.000 claims description 17
- 230000019491 signal transduction Effects 0.000 claims description 17
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 16
- 108020004705 Codon Proteins 0.000 claims description 16
- 208000035240 Disease Resistance Diseases 0.000 claims description 16
- 229910052799 carbon Inorganic materials 0.000 claims description 16
- 230000036579 abiotic stress Effects 0.000 claims description 15
- 230000035882 stress Effects 0.000 claims description 13
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 12
- 230000010261 cell growth Effects 0.000 claims description 12
- 230000004069 differentiation Effects 0.000 claims description 12
- 230000033458 reproduction Effects 0.000 claims description 12
- 244000075850 Avena orientalis Species 0.000 claims description 11
- 230000015572 biosynthetic process Effects 0.000 claims description 11
- 230000002759 chromosomal effect Effects 0.000 claims description 11
- 210000002615 epidermis Anatomy 0.000 claims description 11
- 230000002792 vascular Effects 0.000 claims description 11
- 235000007319 Avena orientalis Nutrition 0.000 claims description 10
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 10
- 235000007238 Secale cereale Nutrition 0.000 claims description 10
- 230000006872 improvement Effects 0.000 claims description 10
- 244000025254 Cannabis sativa Species 0.000 claims description 9
- 244000140063 Eragrostis abyssinica Species 0.000 claims description 9
- 235000014966 Eragrostis abyssinica Nutrition 0.000 claims description 9
- 241000448472 Gramma Species 0.000 claims description 9
- 244000062793 Sorghum vulgare Species 0.000 claims description 9
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 claims description 9
- 241000209138 Tripsacum Species 0.000 claims description 9
- 235000019714 Triticale Nutrition 0.000 claims description 9
- 240000000359 Triticum dicoccon Species 0.000 claims description 9
- 235000001468 Triticum dicoccon Nutrition 0.000 claims description 9
- 240000000581 Triticum monococcum Species 0.000 claims description 9
- 240000003834 Triticum spelta Species 0.000 claims description 9
- 235000004240 Triticum spelta Nutrition 0.000 claims description 9
- 235000019713 millet Nutrition 0.000 claims description 9
- 229910052717 sulfur Inorganic materials 0.000 claims description 9
- 239000011593 sulfur Substances 0.000 claims description 9
- 241000228158 x Triticosecale Species 0.000 claims description 9
- 241000218631 Coniferophyta Species 0.000 claims description 8
- 235000004431 Linum usitatissimum Nutrition 0.000 claims description 8
- 235000009430 Thespesia populnea Nutrition 0.000 claims description 8
- 238000003205 genotyping method Methods 0.000 claims description 8
- 238000003018 immunoassay Methods 0.000 claims description 8
- 230000001404 mediated effect Effects 0.000 claims description 8
- 230000009418 agronomic effect Effects 0.000 claims description 7
- 238000003786 synthesis reaction Methods 0.000 claims description 7
- 150000002632 lipids Chemical class 0.000 claims description 6
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 claims description 6
- 229920000742 Cotton Polymers 0.000 claims description 4
- 244000068988 Glycine max Species 0.000 claims description 4
- 235000010469 Glycine max Nutrition 0.000 claims description 4
- 244000299507 Gossypium hirsutum Species 0.000 claims description 4
- 108700024394 Exon Proteins 0.000 claims description 3
- 238000012252 genetic analysis Methods 0.000 claims description 3
- 239000012528 membrane Substances 0.000 claims description 3
- 230000032258 transport Effects 0.000 claims description 3
- 238000010396 two-hybrid screening Methods 0.000 claims description 3
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 230000006978 adaptation Effects 0.000 claims description 2
- 210000004507 artificial chromosome Anatomy 0.000 claims description 2
- 230000018514 detection of nutrient Effects 0.000 claims description 2
- 238000004134 energy conservation Methods 0.000 claims description 2
- 240000005979 Hordeum vulgare Species 0.000 claims 1
- 240000006240 Linum usitatissimum Species 0.000 claims 1
- 239000011859 microparticle Substances 0.000 claims 1
- 230000001965 increasing effect Effects 0.000 abstract description 26
- 230000001976 improved effect Effects 0.000 abstract description 12
- 235000018102 proteins Nutrition 0.000 description 95
- 108020004414 DNA Proteins 0.000 description 69
- 239000013598 vector Substances 0.000 description 62
- 239000000047 product Substances 0.000 description 48
- 240000007594 Oryza sativa Species 0.000 description 38
- 230000000694 effects Effects 0.000 description 38
- 102000053602 DNA Human genes 0.000 description 34
- 235000007164 Oryza sativa Nutrition 0.000 description 34
- 229940024606 amino acid Drugs 0.000 description 34
- 230000006870 function Effects 0.000 description 29
- 239000002609 medium Substances 0.000 description 29
- 235000009566 rice Nutrition 0.000 description 29
- 102000004190 Enzymes Human genes 0.000 description 27
- 108090000790 Enzymes Proteins 0.000 description 27
- 238000012217 deletion Methods 0.000 description 26
- 230000037430 deletion Effects 0.000 description 26
- 239000013604 expression vector Substances 0.000 description 25
- 230000012010 growth Effects 0.000 description 23
- 238000003780 insertion Methods 0.000 description 23
- 230000037431 insertion Effects 0.000 description 23
- 238000009739 binding Methods 0.000 description 21
- 235000013339 cereals Nutrition 0.000 description 21
- 230000001939 inductive effect Effects 0.000 description 21
- 229910002651 NO3 Inorganic materials 0.000 description 20
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 20
- 230000027455 binding Effects 0.000 description 20
- 238000009396 hybridization Methods 0.000 description 20
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 19
- 238000010276 construction Methods 0.000 description 19
- 241000219194 Arabidopsis Species 0.000 description 18
- 239000002245 particle Substances 0.000 description 18
- 239000002253 acid Substances 0.000 description 17
- 238000002474 experimental method Methods 0.000 description 17
- 238000013518 transcription Methods 0.000 description 17
- 230000035897 transcription Effects 0.000 description 17
- 230000002068 genetic effect Effects 0.000 description 16
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 15
- 238000013459 approach Methods 0.000 description 15
- 230000002829 reductive effect Effects 0.000 description 15
- 230000004044 response Effects 0.000 description 15
- 238000006467 substitution reaction Methods 0.000 description 15
- 230000008685 targeting Effects 0.000 description 15
- 230000004075 alteration Effects 0.000 description 13
- 238000010367 cloning Methods 0.000 description 13
- 210000002257 embryonic structure Anatomy 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 241000894007 species Species 0.000 description 13
- 241000209219 Hordeum Species 0.000 description 12
- 210000003763 chloroplast Anatomy 0.000 description 12
- 230000018109 developmental process Effects 0.000 description 12
- 239000004009 herbicide Substances 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 230000008569 process Effects 0.000 description 12
- 238000013519 translation Methods 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 11
- 108700019146 Transgenes Proteins 0.000 description 11
- 238000011161 development Methods 0.000 description 11
- 230000004927 fusion Effects 0.000 description 11
- 210000001938 protoplast Anatomy 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 108090000848 Ubiquitin Proteins 0.000 description 10
- 102000044159 Ubiquitin Human genes 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 230000008901 benefit Effects 0.000 description 10
- 239000003623 enhancer Substances 0.000 description 10
- 230000002363 herbicidal effect Effects 0.000 description 10
- 230000000977 initiatory effect Effects 0.000 description 10
- 230000007246 mechanism Effects 0.000 description 10
- 230000037361 pathway Effects 0.000 description 10
- 239000011347 resin Substances 0.000 description 10
- 229920005989 resin Polymers 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 239000000758 substrate Substances 0.000 description 10
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 9
- 241000208125 Nicotiana Species 0.000 description 9
- 239000002299 complementary DNA Substances 0.000 description 9
- 230000001276 controlling effect Effects 0.000 description 9
- 230000000670 limiting effect Effects 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 230000002018 overexpression Effects 0.000 description 9
- 238000012360 testing method Methods 0.000 description 9
- 230000002103 transcriptional effect Effects 0.000 description 9
- 238000012546 transfer Methods 0.000 description 9
- 108091005461 Nucleic proteins Chemical group 0.000 description 8
- 230000003115 biocidal effect Effects 0.000 description 8
- 230000000875 corresponding effect Effects 0.000 description 8
- 230000009368 gene silencing by RNA Effects 0.000 description 8
- 230000006698 induction Effects 0.000 description 8
- 230000000813 microbial effect Effects 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 235000015097 nutrients Nutrition 0.000 description 8
- 238000011282 treatment Methods 0.000 description 8
- 108090000994 Catalytic RNA Proteins 0.000 description 7
- 102000053642 Catalytic RNA Human genes 0.000 description 7
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 7
- 241000238631 Hexapoda Species 0.000 description 7
- 241000208202 Linaceae Species 0.000 description 7
- 108091022912 Mannose-6-Phosphate Isomerase Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- 244000083398 Zea diploperennis Species 0.000 description 7
- 235000007241 Zea diploperennis Nutrition 0.000 description 7
- 235000017556 Zea mays subsp parviglumis Nutrition 0.000 description 7
- 230000000692 anti-sense effect Effects 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000001413 cellular effect Effects 0.000 description 7
- 238000003776 cleavage reaction Methods 0.000 description 7
- 201000010099 disease Diseases 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 238000004520 electroporation Methods 0.000 description 7
- 210000000056 organ Anatomy 0.000 description 7
- 210000002706 plastid Anatomy 0.000 description 7
- 230000008929 regeneration Effects 0.000 description 7
- 238000011069 regeneration method Methods 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 108091092562 ribozyme Proteins 0.000 description 7
- 230000007017 scission Effects 0.000 description 7
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 6
- 206010020649 Hyperkeratosis Diseases 0.000 description 6
- 241000710118 Maize chlorotic mottle virus Species 0.000 description 6
- 102100025022 Mannose-6-phosphate isomerase Human genes 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- 229930006000 Sucrose Natural products 0.000 description 6
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 6
- 108091023040 Transcription factor Proteins 0.000 description 6
- 101710185494 Zinc finger protein Proteins 0.000 description 6
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 244000038559 crop plants Species 0.000 description 6
- 230000003247 decreasing effect Effects 0.000 description 6
- 239000003112 inhibitor Substances 0.000 description 6
- 239000000618 nitrogen fertilizer Substances 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 239000005720 sucrose Substances 0.000 description 6
- 230000001629 suppression Effects 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 101150084750 1 gene Proteins 0.000 description 5
- 108020005544 Antisense RNA Proteins 0.000 description 5
- 101100121136 Arabidopsis thaliana GATA21 gene Proteins 0.000 description 5
- 101100121137 Arabidopsis thaliana GATA22 gene Proteins 0.000 description 5
- 229920002101 Chitin Polymers 0.000 description 5
- 108091033380 Coding strand Proteins 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- -1 FAD and FMN) Chemical compound 0.000 description 5
- 108010070675 Glutathione transferase Proteins 0.000 description 5
- 102000005720 Glutathione transferase Human genes 0.000 description 5
- 150000007513 acids Chemical class 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000000576 coating method Methods 0.000 description 5
- 239000003184 complementary RNA Substances 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- 230000007613 environmental effect Effects 0.000 description 5
- 238000003306 harvesting Methods 0.000 description 5
- 230000006801 homologous recombination Effects 0.000 description 5
- 238000002744 homologous recombination Methods 0.000 description 5
- 239000005556 hormone Substances 0.000 description 5
- 229940088597 hormone Drugs 0.000 description 5
- 238000002955 isolation Methods 0.000 description 5
- 210000003463 organelle Anatomy 0.000 description 5
- 244000052769 pathogen Species 0.000 description 5
- 230000001717 pathogenic effect Effects 0.000 description 5
- 229920002704 polyhistidine Polymers 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 239000011541 reaction mixture Substances 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- 241000724328 Alfalfa mosaic virus Species 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 102100036646 Glutamyl-tRNA(Gln) amidotransferase subunit A, mitochondrial Human genes 0.000 description 4
- 101001072655 Homo sapiens Glutamyl-tRNA(Gln) amidotransferase subunit A, mitochondrial Proteins 0.000 description 4
- 108090000913 Nitrate Reductases Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 108700001094 Plant Genes Proteins 0.000 description 4
- 108091030071 RNAI Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- 241000723873 Tobacco mosaic virus Species 0.000 description 4
- 102000040945 Transcription factor Human genes 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 230000002411 adverse Effects 0.000 description 4
- 101150069317 alcA gene Proteins 0.000 description 4
- 239000003242 anti bacterial agent Substances 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 4
- 238000010195 expression analysis Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 238000000520 microinjection Methods 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 230000000243 photosynthetic effect Effects 0.000 description 4
- 238000001742 protein purification Methods 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 239000002689 soil Substances 0.000 description 4
- 125000006850 spacer group Chemical group 0.000 description 4
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 4
- 229960000268 spectinomycin Drugs 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 230000009452 underexpressoin Effects 0.000 description 4
- 239000011782 vitamin Substances 0.000 description 4
- 235000013343 vitamin Nutrition 0.000 description 4
- 229940088594 vitamin Drugs 0.000 description 4
- 229930003231 vitamin Natural products 0.000 description 4
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 3
- 102000007469 Actins Human genes 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 3
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- 108091023037 Aptamer Proteins 0.000 description 3
- 241000219195 Arabidopsis thaliana Species 0.000 description 3
- 241000351920 Aspergillus nidulans Species 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 101710154606 Hemagglutinin Proteins 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 108010025915 Nitrite Reductases Proteins 0.000 description 3
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 3
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 101710163504 Phaseolin Proteins 0.000 description 3
- 101710176177 Protein A56 Proteins 0.000 description 3
- 108020005067 RNA Splice Sites Proteins 0.000 description 3
- 101710172711 Structural protein Proteins 0.000 description 3
- 241000607479 Yersinia pestis Species 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 229930013930 alkaloid Natural products 0.000 description 3
- 150000003797 alkaloid derivatives Chemical class 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 101150103518 bar gene Proteins 0.000 description 3
- 235000021466 carotenoid Nutrition 0.000 description 3
- 150000001747 carotenoids Chemical class 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 239000011248 coating agent Substances 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 235000014113 dietary fatty acids Nutrition 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 229930195729 fatty acid Natural products 0.000 description 3
- 239000000194 fatty acid Substances 0.000 description 3
- 150000004665 fatty acids Chemical class 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 230000030279 gene silencing Effects 0.000 description 3
- 239000003862 glucocorticoid Substances 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 150000004676 glycans Chemical class 0.000 description 3
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 239000000185 hemagglutinin Substances 0.000 description 3
- 230000008676 import Effects 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 238000002743 insertional mutagenesis Methods 0.000 description 3
- 230000017730 intein-mediated protein splicing Effects 0.000 description 3
- 239000003446 ligand Substances 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 210000003470 mitochondria Anatomy 0.000 description 3
- 108010058731 nopaline synthase Proteins 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 3
- 239000000049 pigment Substances 0.000 description 3
- 238000003976 plant breeding Methods 0.000 description 3
- 239000003375 plant hormone Substances 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- 150000003431 steroids Chemical class 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 150000003722 vitamin derivatives Chemical class 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 2
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 2
- 108020005029 5' Flanking Region Proteins 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 2
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- 101150074155 DHFR gene Proteins 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 102100039556 Galectin-4 Human genes 0.000 description 2
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 102000005548 Hexokinase Human genes 0.000 description 2
- 108700040460 Hexokinases Proteins 0.000 description 2
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 241000723994 Maize dwarf mosaic virus Species 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- DRBBFCLWYRJSJZ-UHFFFAOYSA-N N-phosphocreatine Chemical compound OC(=O)CN(C)C(=N)NP(O)(O)=O DRBBFCLWYRJSJZ-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 2
- 241000219843 Pisum Species 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108020001991 Protoporphyrinogen Oxidase Proteins 0.000 description 2
- 102000005135 Protoporphyrinogen oxidase Human genes 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 2
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- 241000723792 Tobacco etch virus Species 0.000 description 2
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 108010025267 calcium-dependent protein kinase Proteins 0.000 description 2
- 150000001721 carbon Chemical class 0.000 description 2
- 230000006860 carbon metabolism Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000000408 embryogenic effect Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 244000037666 field crops Species 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 239000013505 freshwater Substances 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 239000000417 fungicide Substances 0.000 description 2
- 230000035784 germination Effects 0.000 description 2
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 2
- LXJXRIRHZLFYRP-UHFFFAOYSA-N glyceraldehyde 3-phosphate Chemical compound O=CC(O)COP(O)(O)=O LXJXRIRHZLFYRP-UHFFFAOYSA-N 0.000 description 2
- 101150054900 gus gene Proteins 0.000 description 2
- 239000003999 initiator Substances 0.000 description 2
- 239000002917 insecticide Substances 0.000 description 2
- TWBYWOBDOCUKOW-UHFFFAOYSA-N isonicotinic acid Chemical compound OC(=O)C1=CC=NC=C1 TWBYWOBDOCUKOW-UHFFFAOYSA-N 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000000691 measurement method Methods 0.000 description 2
- 229960000485 methotrexate Drugs 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000003147 molecular marker Substances 0.000 description 2
- 101150057066 nahG gene Proteins 0.000 description 2
- 239000006225 natural substrate Substances 0.000 description 2
- 239000005645 nematicide Substances 0.000 description 2
- 150000002829 nitrogen Chemical class 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 210000002824 peroxisome Anatomy 0.000 description 2
- QHOQHJPRIBSPCY-UHFFFAOYSA-N pirimiphos-methyl Chemical group CCN(CC)C1=NC(C)=CC(OP(=S)(OC)OC)=N1 QHOQHJPRIBSPCY-UHFFFAOYSA-N 0.000 description 2
- 230000008635 plant growth Effects 0.000 description 2
- 230000010152 pollination Effects 0.000 description 2
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 108020001580 protein domains Proteins 0.000 description 2
- 230000004850 protein–protein interaction Effects 0.000 description 2
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000003938 response to stress Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000005204 segregation Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 238000009331 sowing Methods 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- JZRWCGZRTZMZEH-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 2
- 229960004659 ticarcillin Drugs 0.000 description 2
- OHKOGUYZJXTSFX-KZFFXBSXSA-N ticarcillin Chemical compound C=1([C@@H](C(O)=O)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)C=CSC=1 OHKOGUYZJXTSFX-KZFFXBSXSA-N 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 230000035899 viability Effects 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- LDVVMCZRFWMZSG-OLQVQODUSA-N (3ar,7as)-2-(trichloromethylsulfanyl)-3a,4,7,7a-tetrahydroisoindole-1,3-dione Chemical compound C1C=CC[C@H]2C(=O)N(SC(Cl)(Cl)Cl)C(=O)[C@H]21 LDVVMCZRFWMZSG-OLQVQODUSA-N 0.000 description 1
- FNQJDLTXOVEEFB-UHFFFAOYSA-N 1,2,3-benzothiadiazole Chemical compound C1=CC=C2SN=NC2=C1 FNQJDLTXOVEEFB-UHFFFAOYSA-N 0.000 description 1
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical group C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 1
- LMSDCGXQALIMLM-UHFFFAOYSA-N 2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid;iron Chemical compound [Fe].OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O LMSDCGXQALIMLM-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- HYPYXGZDOYTYDR-HAJWAVTHSA-N 2-methyl-3-[(2e,6e,10e,14e)-3,7,11,15,19-pentamethylicosa-2,6,10,14,18-pentaenyl]naphthalene-1,4-dione Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 HYPYXGZDOYTYDR-HAJWAVTHSA-N 0.000 description 1
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 description 1
- 108091006112 ATPases Proteins 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 239000005964 Acibenzolar-S-methyl Substances 0.000 description 1
- 102000057290 Adenosine Triphosphatases Human genes 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 1
- 101150021974 Adh1 gene Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 241000234282 Allium Species 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 241000207875 Antirrhinum Species 0.000 description 1
- 108700018853 Arabidopsis PR-1 Proteins 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- 244000003416 Asparagus officinalis Species 0.000 description 1
- 235000005340 Asparagus officinalis Nutrition 0.000 description 1
- 102000004625 Aspartate Aminotransferases Human genes 0.000 description 1
- 108010003415 Aspartate Aminotransferases Proteins 0.000 description 1
- 108700023262 Aspergillus nidulans AreA Proteins 0.000 description 1
- 241001106067 Atropa Species 0.000 description 1
- 235000005781 Avena Nutrition 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 241000209200 Bromus Species 0.000 description 1
- 101000583080 Bunodosoma granuliferum Delta-actitoxin-Bgr2a Proteins 0.000 description 1
- 101100520142 Caenorhabditis elegans pin-2 gene Proteins 0.000 description 1
- 101100284219 Candida albicans (strain SC5314 / ATCC MYA-2876) GZF3 gene Proteins 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 239000005745 Captan Substances 0.000 description 1
- 239000005746 Carboxin Substances 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- GHOKWGTUZJEAQD-UHFFFAOYSA-N Chick antidermatitis factor Natural products OCC(C)(C)C(O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-UHFFFAOYSA-N 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- ACTIUHUUMQJHFO-UHFFFAOYSA-N Coenzym Q10 Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UHFFFAOYSA-N 0.000 description 1
- 244000024469 Cucumis prophetarum Species 0.000 description 1
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 1
- 241000219122 Cucurbita Species 0.000 description 1
- 241000256113 Culicidae Species 0.000 description 1
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000209210 Dactylis Species 0.000 description 1
- 241000208296 Datura Species 0.000 description 1
- 241000208175 Daucus Species 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 240000001879 Digitalis lutea Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 101100127285 Drosophila melanogaster unc-104 gene Proteins 0.000 description 1
- 101150111720 EPSPS gene Proteins 0.000 description 1
- 241000710188 Encephalomyocarditis virus Species 0.000 description 1
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101150002687 GS-2 gene Proteins 0.000 description 1
- 229920002148 Gellan gum Polymers 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 241000208152 Geranium Species 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 101001015612 Halomonas elongata (strain ATCC 33173 / DSM 2581 / NBRC 15536 / NCIMB 2198 / 1H9) Glutamate synthase [NADPH] large chain Proteins 0.000 description 1
- 101001040070 Halomonas elongata (strain ATCC 33173 / DSM 2581 / NBRC 15536 / NCIMB 2198 / 1H9) Glutamate synthase [NADPH] small chain Proteins 0.000 description 1
- 241000208818 Helianthus Species 0.000 description 1
- 244000061944 Helianthus giganteus Species 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 241000175212 Herpesvirales Species 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- QAMFAYSMNZBNCA-UWVGGRQHSA-N His-Gly-Met Chemical compound CSCC[C@H](NC(=O)CNC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O QAMFAYSMNZBNCA-UWVGGRQHSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- 101000899240 Homo sapiens Endoplasmic reticulum chaperone BiP Proteins 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- HODVZHLJUUWPKY-STECZYCISA-N Ile-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=C(O)C=C1 HODVZHLJUUWPKY-STECZYCISA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 108010061833 Integrases Proteins 0.000 description 1
- 235000013757 Juglans Nutrition 0.000 description 1
- 241000758789 Juglans Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241000208822 Lactuca Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 241000208204 Linum Species 0.000 description 1
- 241000209082 Lolium Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 241000227653 Lycopersicon Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- KQAREVUPVXMNNP-WDSOQIARSA-N Lys-Trp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O KQAREVUPVXMNNP-WDSOQIARSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- 241001082241 Lythrum hyssopifolia Species 0.000 description 1
- 241001344131 Magnaporthe grisea Species 0.000 description 1
- 241001330975 Magnaporthe oryzae Species 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241000121629 Majorana Species 0.000 description 1
- 208000002720 Malnutrition Diseases 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 241000219823 Medicago Species 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- 229930195061 Micheline Natural products 0.000 description 1
- 108700005443 Microbial Genes Proteins 0.000 description 1
- 101150109579 Mrps7 gene Proteins 0.000 description 1
- 241001477931 Mythimna unipuncta Species 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 240000002853 Nelumbo nucifera Species 0.000 description 1
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 1
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 241001282315 Nemesis Species 0.000 description 1
- 101100363725 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) crp-15 gene Proteins 0.000 description 1
- 108700019658 Neurospora crassa NIT2 Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108020004485 Nonsense Codon Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 241000219830 Onobrychis Species 0.000 description 1
- 241000209094 Oryza Species 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 101150034459 Parpbp gene Proteins 0.000 description 1
- 241000208181 Pelargonium Species 0.000 description 1
- 241000209046 Pennisetum Species 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 241000219833 Phaseolus Species 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 1
- 108091000041 Phosphoenolpyruvate Carboxylase Proteins 0.000 description 1
- 102000011755 Phosphoglycerate Kinase Human genes 0.000 description 1
- 241000709664 Picornaviridae Species 0.000 description 1
- 239000005924 Pirimiphos-methyl Substances 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 241001330029 Pooideae Species 0.000 description 1
- 241000710078 Potyvirus Species 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 238000012356 Product development Methods 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 101150020647 RPS7 gene Proteins 0.000 description 1
- 241000218206 Ranunculus Species 0.000 description 1
- 241000220259 Raphanus Species 0.000 description 1
- 101001023863 Rattus norvegicus Glucocorticoid receptor Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 235000011449 Rosa Nutrition 0.000 description 1
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 1
- 101100442138 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DAL80 gene Proteins 0.000 description 1
- 101100175800 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GLN3 gene Proteins 0.000 description 1
- 241001106018 Salpiglossis Species 0.000 description 1
- 101000888131 Schizosaccharomyces pombe (strain 972 / ATCC 24843) Glutamate synthase [NADH] Proteins 0.000 description 1
- 241000780602 Senecio Species 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 241000207763 Solanum Species 0.000 description 1
- 235000002634 Solanum Nutrition 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 1
- 241000187191 Streptomyces viridochromogenes Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 101001099217 Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) Triosephosphate isomerase Proteins 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- 241001312519 Trigonella Species 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- 241000219977 Vigna Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 235000009392 Vitis Nutrition 0.000 description 1
- 241000219095 Vitis Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- KRWTWSSMURUMDE-UHFFFAOYSA-N [1-(2-methoxynaphthalen-1-yl)naphthalen-2-yl]-diphenylphosphane Chemical compound COC1=CC=C2C=CC=CC2=C1C(C1=CC=CC=C1C=C1)=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 KRWTWSSMURUMDE-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- LIPOUNRJVLNBCD-UHFFFAOYSA-N acetyl dihydrogen phosphate Chemical compound CC(=O)OP(O)(O)=O LIPOUNRJVLNBCD-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 229960001570 ademetionine Drugs 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 244000193174 agave Species 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 239000003905 agrochemical Substances 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 108010050181 aleurone Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000003322 aneuploid effect Effects 0.000 description 1
- 208000036878 aneuploidy Diseases 0.000 description 1
- 235000019728 animal nutrition Nutrition 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 239000003899 bactericide agent Substances 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 238000007622 bioinformatic analysis Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000001390 capsicum minimum Substances 0.000 description 1
- 229940117949 captan Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- GYSSRZJIHXQEHQ-UHFFFAOYSA-N carboxin Chemical compound S1CCOC(C)=C1C(=O)NC1=CC=CC=C1 GYSSRZJIHXQEHQ-UHFFFAOYSA-N 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 108010079058 casein hydrolysate Proteins 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 239000004464 cereal grain Substances 0.000 description 1
- 238000009614 chemical analysis method Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229930002868 chlorophyll a Natural products 0.000 description 1
- 229930002869 chlorophyll b Natural products 0.000 description 1
- NSMUHPMZFPKNMZ-VBYMZDBQSA-M chlorophyll b Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C=O)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 NSMUHPMZFPKNMZ-VBYMZDBQSA-M 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- ACTIUHUUMQJHFO-UPTCCGCDSA-N coenzyme Q10 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UPTCCGCDSA-N 0.000 description 1
- 235000017471 coenzyme Q10 Nutrition 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 238000002967 competitive immunoassay Methods 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 229960003624 creatine Drugs 0.000 description 1
- 239000006046 creatine Substances 0.000 description 1
- 238000012272 crop production Methods 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 230000010154 cross-pollination Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 108010056535 dihydrofolate reductase type II Proteins 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 210000001339 epidermal cell Anatomy 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- FVTCRASFADXXNN-SCRDCRAPSA-N flavin mononucleotide Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-SCRDCRAPSA-N 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 101150068680 gatA gene Proteins 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000003208 gene overexpression Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 230000004153 glucose metabolism Effects 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108020002326 glutamine synthetase Proteins 0.000 description 1
- 102000005396 glutamine synthetase Human genes 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 150000002402 hexoses Chemical class 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000003898 horticulture Methods 0.000 description 1
- 101150029559 hph gene Proteins 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000003501 hydroponics Substances 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 238000009399 inbreeding Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000012994 industrial processing Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 230000000749 insecticidal effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000009403 interspecific hybridization Methods 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- AGBQKNBQESQNJD-UHFFFAOYSA-M lipoate Chemical compound [O-]C(=O)CCCCC1CCSS1 AGBQKNBQESQNJD-UHFFFAOYSA-M 0.000 description 1
- 235000019136 lipoic acid Nutrition 0.000 description 1
- 239000012669 liquid formulation Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 230000001071 malnutrition Effects 0.000 description 1
- 235000000824 malnutrition Nutrition 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 210000000473 mesophyll cell Anatomy 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 239000011490 mineral wool Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000003750 molluscacide Substances 0.000 description 1
- 230000002013 molluscicidal effect Effects 0.000 description 1
- 108010046778 molybdenum cofactor Proteins 0.000 description 1
- HPEUEJRPDGMIMY-IFQPEPLCSA-N molybdopterin Chemical compound O([C@H]1N2)[C@H](COP(O)(O)=O)C(S)=C(S)[C@@H]1NC1=C2N=C(N)NC1=O HPEUEJRPDGMIMY-IFQPEPLCSA-N 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 239000006870 ms-medium Substances 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 208000015380 nutritional deficiency disease Diseases 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 229940055726 pantothenic acid Drugs 0.000 description 1
- 235000019161 pantothenic acid Nutrition 0.000 description 1
- 239000011713 pantothenic acid Substances 0.000 description 1
- 239000003415 peat Substances 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- NONJJLVGHLVQQM-JHXYUMNGSA-N phenethicillin Chemical compound N([C@@H]1C(N2[C@H](C(C)(C)S[C@@H]21)C(O)=O)=O)C(=O)C(C)OC1=CC=CC=C1 NONJJLVGHLVQQM-JHXYUMNGSA-N 0.000 description 1
- 229950007002 phosphocreatine Drugs 0.000 description 1
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 1
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000013081 phylogenetic analysis Methods 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 238000000575 proteomic method Methods 0.000 description 1
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 description 1
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 description 1
- 229960001327 pyridoxal phosphate Drugs 0.000 description 1
- NHDHVHZZCFYRSB-UHFFFAOYSA-N pyriproxyfen Chemical compound C=1C=CC=NC=1OC(C)COC(C=C1)=CC=C1OC1=CC=CC=C1 NHDHVHZZCFYRSB-UHFFFAOYSA-N 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- NPCOQXAVBJJZBQ-UHFFFAOYSA-N reduced coenzyme Q9 Natural products COC1=C(O)C(C)=C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)C(O)=C1OC NPCOQXAVBJJZBQ-UHFFFAOYSA-N 0.000 description 1
- 238000011946 reduction process Methods 0.000 description 1
- 230000023276 regulation of development, heterochronic Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 230000029054 response to nutrient Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012021 retail method of payment Methods 0.000 description 1
- 229960002477 riboflavin Drugs 0.000 description 1
- 235000019192 riboflavin Nutrition 0.000 description 1
- 239000002151 riboflavin Substances 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 230000005070 ripening Effects 0.000 description 1
- 101150036132 rpsG gene Proteins 0.000 description 1
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical class OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 1
- 230000024053 secondary metabolic process Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000014639 sexual reproduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 108010089087 soymetide-4 Proteins 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000003270 steroid hormone Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- 229960002663 thioctic acid Drugs 0.000 description 1
- KUAZQDVKQLNFPE-UHFFFAOYSA-N thiram Chemical compound CN(C)C(=S)SSC(=S)N(C)C KUAZQDVKQLNFPE-UHFFFAOYSA-N 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000003971 tillage Methods 0.000 description 1
- 229940027257 timentin Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- UZKQTCBAMSWPJD-UQCOIBPSSA-N trans-Zeatin Natural products OCC(/C)=C\CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-UQCOIBPSSA-N 0.000 description 1
- UZKQTCBAMSWPJD-FARCUNLSSA-N trans-zeatin Chemical compound OCC(/C)=C/CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-FARCUNLSSA-N 0.000 description 1
- 108091008023 transcriptional regulators Proteins 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000012250 transgenic expression Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- 101150003560 trfA gene Proteins 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010036387 trimethionine Proteins 0.000 description 1
- 101150019416 trpA gene Proteins 0.000 description 1
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 239000010937 tungsten Substances 0.000 description 1
- 229940035936 ubiquinone Drugs 0.000 description 1
- 101150101900 uidA gene Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 235000019143 vitamin K2 Nutrition 0.000 description 1
- 239000011728 vitamin K2 Substances 0.000 description 1
- 229940041603 vitamin k 3 Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 238000003158 yeast two-hybrid assay Methods 0.000 description 1
- 229940023877 zeatin Drugs 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8237—Externally regulated expression systems
- C12N15/8238—Externally regulated expression systems chemically inducible, e.g. tetracycline
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/5097—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving plant cells
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6872—Intracellular protein regulatory factors and their receptors, e.g. including ion channels
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Hematology (AREA)
- Urology & Nephrology (AREA)
- Cell Biology (AREA)
- Microbiology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Food Science & Technology (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Medicinal Chemistry (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Botany (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to a nitrogen-regulated GATA
transcription factor gene required for sugar sensing and the modulation of the expression of this gene to modulate a characteristic in a plant. The GATA
transcription factor of the present invention is involved in regulating sugar sensing in plants and its expression is influenced by nitrogen status.
Increased expression of this or substantially similar genes can produce plants with improved nitrogen utilization and increased yield.
transcription factor gene required for sugar sensing and the modulation of the expression of this gene to modulate a characteristic in a plant. The GATA
transcription factor of the present invention is involved in regulating sugar sensing in plants and its expression is influenced by nitrogen status.
Increased expression of this or substantially similar genes can produce plants with improved nitrogen utilization and increased yield.
Description
B&P File No: 6580-346 Title: NITROGEN-REGULATED SUGAR SENSING GENE AND PROTEIN
AND MODULATION THEREOF
FIELD OF THE INVENTION
The present invention relates to methods of modulating agronomic traits in plants by modulating the expression of a GATA transcription factor in the plant cells. In particular the present invention relates to methods of improving nitrogen utilization in plants. The present invention also pertains to nucleic acid molecules isolated from Oryza sativa comprising nucleotide sequences that encode proteins that are important in chlorophyll synthesis and sugar sensing and, ultimately, can modulate nitrogen uptake and overall carbon metabolism.
BACKGROUND OF THE INVENTION
Improvement of the agronomic characteristics of crop plants has been ongoing since the beginning of agriculture. Most of the land suitable for crop production is currently being used. As human populations continue to increase, improved crop varieties will be required to adequately provide our food and feed (Trewavas (2001) Plant Physiol. 125: 174-179). To avoid catastrophic famines and malnutrition, future crop cultivars will need to have improved yields with equivalent farm inputs. These cultivars will need to more effectively withstand adverse conditions such as drought, soil salinity or disease, which will be especially important as marginal lands are brought into cultivation. Finally, we will need cultivars with altered nutrient composition to enhance human and animal nutrition, and to enable more efficient food and feed processing. For all these traits, identification of the genes controlling phenotypic expression of traits of interest will be crucial in accelerating development of superior crop germplasm by conventional or transgenic means.
A number of highly-efficient approaches are available to assist identification of genes playing key roles in expression of agronomically-important traits. These include genetics, genomics, bioinformatics, and functional genomics. Genetics is the scientific study of the mechanisms of inheritance. By identifying mutations that alter the pathway or response of interest, classical (or forward) genetics can help to identify the genes involved in these pathways or responses. For example, a mutant with enhanced susceptibility to disease may identify an important component of the plant signal transduction pathway leading from pathogen recognition to disease resistance. Genetics is also the central component in improvement of germplasm by breeding. Through molecular and phenotypic analysis of genetic crosses, loci controlling traits of interest can be mapped and followed in subsequent generations. Knowledge of the genes underlying phenotypic variation between crop accessions can enable development of markers that greatly increase efficiency of the germplasm improvement process, as well as open avenues for discovery of additional superior alieles.
Genomics is the system-level study of an organism's genome, including genes and corresponding gene products - RNA and proteins. At a first level, genomic approaches have provided large datasets of sequence information from diverse plant species, including full-length and partial cDNA
sequences, and the complete genomic sequence of a model plant species, Arabidopsis thaliana. Recently, the first draft sequence of a crop plant's genome, that of rice (Oryza sativa), has also become available. Availability of a whole genome sequence makes possible the development of tools for system-level study of other molecular complements, such as arrays and chips for use in determining the complement of expressed genes in an organism under specific conditions. Such data can be used as a first indication of the potential for certain genes to play key roles in expression of different plant phenotypes.
Bioinformatics approaches interface directly with first-level genomic datasets in allowing for processing to uncover sequences of interest by annotative or other means. Using, for example, similarity searches, alignments and phylogenetic analyses, bioinformatics can often identify homologs of a gene product of interest. Very similar homologs (eg. >--90%
AND MODULATION THEREOF
FIELD OF THE INVENTION
The present invention relates to methods of modulating agronomic traits in plants by modulating the expression of a GATA transcription factor in the plant cells. In particular the present invention relates to methods of improving nitrogen utilization in plants. The present invention also pertains to nucleic acid molecules isolated from Oryza sativa comprising nucleotide sequences that encode proteins that are important in chlorophyll synthesis and sugar sensing and, ultimately, can modulate nitrogen uptake and overall carbon metabolism.
BACKGROUND OF THE INVENTION
Improvement of the agronomic characteristics of crop plants has been ongoing since the beginning of agriculture. Most of the land suitable for crop production is currently being used. As human populations continue to increase, improved crop varieties will be required to adequately provide our food and feed (Trewavas (2001) Plant Physiol. 125: 174-179). To avoid catastrophic famines and malnutrition, future crop cultivars will need to have improved yields with equivalent farm inputs. These cultivars will need to more effectively withstand adverse conditions such as drought, soil salinity or disease, which will be especially important as marginal lands are brought into cultivation. Finally, we will need cultivars with altered nutrient composition to enhance human and animal nutrition, and to enable more efficient food and feed processing. For all these traits, identification of the genes controlling phenotypic expression of traits of interest will be crucial in accelerating development of superior crop germplasm by conventional or transgenic means.
A number of highly-efficient approaches are available to assist identification of genes playing key roles in expression of agronomically-important traits. These include genetics, genomics, bioinformatics, and functional genomics. Genetics is the scientific study of the mechanisms of inheritance. By identifying mutations that alter the pathway or response of interest, classical (or forward) genetics can help to identify the genes involved in these pathways or responses. For example, a mutant with enhanced susceptibility to disease may identify an important component of the plant signal transduction pathway leading from pathogen recognition to disease resistance. Genetics is also the central component in improvement of germplasm by breeding. Through molecular and phenotypic analysis of genetic crosses, loci controlling traits of interest can be mapped and followed in subsequent generations. Knowledge of the genes underlying phenotypic variation between crop accessions can enable development of markers that greatly increase efficiency of the germplasm improvement process, as well as open avenues for discovery of additional superior alieles.
Genomics is the system-level study of an organism's genome, including genes and corresponding gene products - RNA and proteins. At a first level, genomic approaches have provided large datasets of sequence information from diverse plant species, including full-length and partial cDNA
sequences, and the complete genomic sequence of a model plant species, Arabidopsis thaliana. Recently, the first draft sequence of a crop plant's genome, that of rice (Oryza sativa), has also become available. Availability of a whole genome sequence makes possible the development of tools for system-level study of other molecular complements, such as arrays and chips for use in determining the complement of expressed genes in an organism under specific conditions. Such data can be used as a first indication of the potential for certain genes to play key roles in expression of different plant phenotypes.
Bioinformatics approaches interface directly with first-level genomic datasets in allowing for processing to uncover sequences of interest by annotative or other means. Using, for example, similarity searches, alignments and phylogenetic analyses, bioinformatics can often identify homologs of a gene product of interest. Very similar homologs (eg. >--90%
amino acid identity over the entire length of the protein) are very likely orthologs, i.e. share the same function in different organisms.
Functional genomics can be defined as the assignment of function to genes and their products. Functional genomics draws from genetics, genomics and bioinformatics to derive a path toward identifying genes important in a particular pathway or response of interest. Expression analysis, for example, uses high density DNA microarrays (often derived from genomic-scale organismal sequencing) to monitor the mRNA expression of thousands of genes in a single experiment. Experimental treatments can include those eliciting a response of interest, such as the disease resistance response in plants infected with a pathogen. To give additional examples of the use of microarrays, mRNA expression levels can be monitored in distinct tissues over a developmental time course, or in mutants affected in a response of interest. Proteomics can also help to assign function, by assaying the expression and post-translational modifications of hundreds of proteins in a single experiment.
Proteomics approaches are in many cases analogous to the approaches taken for monitoring mRNA expression in microarray experiments. Protein-protein interactions can also help to assign proteins to a given pathway or response, by identifying proteins that interact with known components of the pathway or response. For functional genomics, protein-protein interactions are often studied using large-scale yeast two-hybrid assays. Another approach to assigning gene function is to express the corresponding protein in a heterologous host, for example the bacterium Escherichia coli, followed by purification and enzymatic assays.
Demonstration of the ability of a gene-of-interest to control a given trait may be derived, for example, from experimental testing in plant species of interest. The generation and analysis of plants transgenic for a gene of interest can be used for plant functional genomics, with several advantages.
The gene can often be both overexpressed and underexpressed ("knocked out"), thereby increasing the chances of observing a phenotype linking the gene to a pathway or response of interest. Two aspects of transgenic functional genomics help lend a high level of confidence to functional assignment by this approach. First, phenotypic observations are carried out in the context of the living plant. Second, the range of phenotypes observed can be checked and correlated with observed expression levels of the introduced transgene. Transgenic functional genomics is especially valuable in improved cultivar development. Only genes that function in a pathway or response of interest, and that in addition are able to confer a desired trait-based phenotype, are promoted as candidate genes for crop improvement efforts. In some cases, transgenic lines developed for functional genomics studies can be directly utilized in initial stages of product development.
Another approach towards plant functional genomics involves first identifying plant lines with mutations in specific genes of interest, followed by phenotypic evaluation of the consequences of such gene knockouts on the trait under study. Such an approach reveals genes essential for expression of specific traits.
Genes identified through functional genomics can be directly employed in efforts towards germplasm improvement by transgenic means, as described above, or used to develop markers for identification of tracking of alieles-of-interest in mapping and breeding populations. Knowledge of such genes may also enable construction of superior alleles non-existent in nature, by any of a number of molecular methods.
Rapid increases in yield over the last 80 years in row crops have been due in roughly equal measure to improved genetics and improved agronomic practices. In particular, in a crop like maize, the combination of high yielding hybrids and the use of large amounts of nitrogen fertilizer have under ideal conditions allowed for yields of greater than 440bu/acre. However, the use of large amounts of nitrogen fertilizer has negative side-effects primarily around increasing cost of this input to the farmer and cost to the environment since nitrate pollution is a major problem in many agricultural areas contributing significantly to the degradation of both fresh water and marine environments.
Developing crop genetics that use nitrogen more efficiently through an understanding of the role of genotype on nitrogen use would be highly advantageous in reducing producer input costs as well as environmental load.
This is particularly important for a crop like corn which is grown using a high level of nitrogen fertilizer.
Nitrogen use efficiency can be defined in several ways, although the simplest is yield/N supplied. There are two stages in this process: first, the amount of available nitrogen that is taken up, stored and assimilated into amino acids and other important nitrogenous compounds; second, the proportion of nitrogen that is partitioned to the seed, resulting in final yield. A
variety of field studies have been performed on various agriculturally important crops to study this problem (Lawlor DW et al 2001 in Lea PJ, Morot-Gaudry JF, eds. Plant Nitrogen. Berlin: Springer-Verlag 343-367; Lafitte HR
and Edmeades GO 1994 Field Crops Res 39, 15-25; Lawlor DW 2002 J Exp Bot. 53, 773-87; Moll RH et al 1982 Agron J 74, 562-564). These experiments have demonstrated that there is a genetic component to nitrogen use efficiency, but have not proved satisfactory in determining which genes are important for this process. In addition, corn breeders have generally not targeted the maintenance of yield under limiting nitrogen fertilizer. These types of field experiments on nitrogen use are difficult for a variety of reasons including a lack of uniformity of accessible nitrogen in a test field or between field sites under any treatment regime and the interplay of other environmental factors that make experiments difficult to interpret.
Therefore, although there is experimental evidence for genetic variation for this trait, it is difficult to make any conclusions from these experiments on what causes this variation. It should be feasible and is certainly important to develop methods to study this trait under field conditions in crop plants.
However, significant progress toward identifying, understanding and manipulating important traits can be made through the use of a model system like Arabidopsis. At the very least, these experiments will give important clues about potential target genes to evaluate in important field crops. In addition, there are also considerable genetic and genomic resources available to study rice and this species will also be used for some of the proposed experiments as a species more similar to corn than is Arabidopsis.
Functional genomics can be defined as the assignment of function to genes and their products. Functional genomics draws from genetics, genomics and bioinformatics to derive a path toward identifying genes important in a particular pathway or response of interest. Expression analysis, for example, uses high density DNA microarrays (often derived from genomic-scale organismal sequencing) to monitor the mRNA expression of thousands of genes in a single experiment. Experimental treatments can include those eliciting a response of interest, such as the disease resistance response in plants infected with a pathogen. To give additional examples of the use of microarrays, mRNA expression levels can be monitored in distinct tissues over a developmental time course, or in mutants affected in a response of interest. Proteomics can also help to assign function, by assaying the expression and post-translational modifications of hundreds of proteins in a single experiment.
Proteomics approaches are in many cases analogous to the approaches taken for monitoring mRNA expression in microarray experiments. Protein-protein interactions can also help to assign proteins to a given pathway or response, by identifying proteins that interact with known components of the pathway or response. For functional genomics, protein-protein interactions are often studied using large-scale yeast two-hybrid assays. Another approach to assigning gene function is to express the corresponding protein in a heterologous host, for example the bacterium Escherichia coli, followed by purification and enzymatic assays.
Demonstration of the ability of a gene-of-interest to control a given trait may be derived, for example, from experimental testing in plant species of interest. The generation and analysis of plants transgenic for a gene of interest can be used for plant functional genomics, with several advantages.
The gene can often be both overexpressed and underexpressed ("knocked out"), thereby increasing the chances of observing a phenotype linking the gene to a pathway or response of interest. Two aspects of transgenic functional genomics help lend a high level of confidence to functional assignment by this approach. First, phenotypic observations are carried out in the context of the living plant. Second, the range of phenotypes observed can be checked and correlated with observed expression levels of the introduced transgene. Transgenic functional genomics is especially valuable in improved cultivar development. Only genes that function in a pathway or response of interest, and that in addition are able to confer a desired trait-based phenotype, are promoted as candidate genes for crop improvement efforts. In some cases, transgenic lines developed for functional genomics studies can be directly utilized in initial stages of product development.
Another approach towards plant functional genomics involves first identifying plant lines with mutations in specific genes of interest, followed by phenotypic evaluation of the consequences of such gene knockouts on the trait under study. Such an approach reveals genes essential for expression of specific traits.
Genes identified through functional genomics can be directly employed in efforts towards germplasm improvement by transgenic means, as described above, or used to develop markers for identification of tracking of alieles-of-interest in mapping and breeding populations. Knowledge of such genes may also enable construction of superior alleles non-existent in nature, by any of a number of molecular methods.
Rapid increases in yield over the last 80 years in row crops have been due in roughly equal measure to improved genetics and improved agronomic practices. In particular, in a crop like maize, the combination of high yielding hybrids and the use of large amounts of nitrogen fertilizer have under ideal conditions allowed for yields of greater than 440bu/acre. However, the use of large amounts of nitrogen fertilizer has negative side-effects primarily around increasing cost of this input to the farmer and cost to the environment since nitrate pollution is a major problem in many agricultural areas contributing significantly to the degradation of both fresh water and marine environments.
Developing crop genetics that use nitrogen more efficiently through an understanding of the role of genotype on nitrogen use would be highly advantageous in reducing producer input costs as well as environmental load.
This is particularly important for a crop like corn which is grown using a high level of nitrogen fertilizer.
Nitrogen use efficiency can be defined in several ways, although the simplest is yield/N supplied. There are two stages in this process: first, the amount of available nitrogen that is taken up, stored and assimilated into amino acids and other important nitrogenous compounds; second, the proportion of nitrogen that is partitioned to the seed, resulting in final yield. A
variety of field studies have been performed on various agriculturally important crops to study this problem (Lawlor DW et al 2001 in Lea PJ, Morot-Gaudry JF, eds. Plant Nitrogen. Berlin: Springer-Verlag 343-367; Lafitte HR
and Edmeades GO 1994 Field Crops Res 39, 15-25; Lawlor DW 2002 J Exp Bot. 53, 773-87; Moll RH et al 1982 Agron J 74, 562-564). These experiments have demonstrated that there is a genetic component to nitrogen use efficiency, but have not proved satisfactory in determining which genes are important for this process. In addition, corn breeders have generally not targeted the maintenance of yield under limiting nitrogen fertilizer. These types of field experiments on nitrogen use are difficult for a variety of reasons including a lack of uniformity of accessible nitrogen in a test field or between field sites under any treatment regime and the interplay of other environmental factors that make experiments difficult to interpret.
Therefore, although there is experimental evidence for genetic variation for this trait, it is difficult to make any conclusions from these experiments on what causes this variation. It should be feasible and is certainly important to develop methods to study this trait under field conditions in crop plants.
However, significant progress toward identifying, understanding and manipulating important traits can be made through the use of a model system like Arabidopsis. At the very least, these experiments will give important clues about potential target genes to evaluate in important field crops. In addition, there are also considerable genetic and genomic resources available to study rice and this species will also be used for some of the proposed experiments as a species more similar to corn than is Arabidopsis.
Nitrate is the major form of available nitrogen in the field and there is an extensive body of literature on genes involved in nitrate uptake and reduction (Forde BG 2000 Biochimica et Biophysica Acta 1465, 219-235;
Howitt SM and Udvardi MK 2000 Biochimica et Biophysica Acta 1465, 152-170; Stitt M et al 2002 J Exp Bot. 53, 959-70) as well as on genes involved in other aspects of nitrogen metabolism (Lea PJ, Morot-Gaudry JF, eds. 2001 Plant Nitrogen. Berlin: Springer-Verlag; Morot-Gaudry JF 2001 Nitrogen assimilation by plants Science Publishers Inc. NH, US). Also, it is clear that the availability of carbon metabolites is crucial for the efficient use of field nitrate and there is good experimental evidence for a linkage between carbon and nitrogen metabolism (Coruzzi GM and Zhou L 2001 Curr Opin Plant Biol.
4, 247-53). In addition, some experiments suggest that GS and GOGAT are involved in remobilizing N from senescing organs to the sink organ (Brouquisse R et al 2001 in Lea PJ, Morot-Gaudry JF, eds. Plant Nitrogen.
Berlin: Springer-Verlag 275-293; Yamaya T et al 2002 J Exp Bot. 53, 917-925). However, most aspects of the regulation of these genes are still unclear and there is still no notion of how this regulation affects nitrogen use efficiency.
Plants can sense levels of carbon and nitrogen metabolites and accordingly adjust growth and development. The perception mechanisms are complex regulatory networks that control gene expression to accommodate constant changes of nutrient-dependent cellular activities. Possession of a sugar-sensing mechanism enables plants to turn off photosynthesis when C-skeletons are abundant. The N-sensing mechanism enables plants to turn off nitrate uptake and reduction when levels of reduced or organic N are high (Coruzzi, G.M. & Zhou, L. (2001) Curr Opin Plant Biol. 4, 247-53).
Multiple sugar signal transduction pathways exist in plants. Glucose has emerged as a key regulator of many vital processes in photosynthetic plants such as in photosynthesis and in carbon and nitrogen metabolism (Rolland, F., Moore, B. & Sheen, J. (2002) Plant Cell S185-S205).
Hexokinases (HXK) are an important control point for glucose metabolism.
They not only catalyze the phosphorylation of glucose but also function as a glucose sensor to interrelate nutrient, light and hormone signaling networks for controlling growth and development in response to the changing environment (Jang, J., Leon, P, Zhou, L. & Sheen, J. (1997) Plant Cell 9, 5-19; Dai, N., Schaffer, A., Petreikov, M., Shahak, Y., Giller, Y., Ratner, K., Levine, A. & Granot, D. (1999) Plant Cell 11, 1253-1266; Moore, B., Zhou, L., Rolland, F., Hall, Q., Cheng, W., Liu, Y., Hwang, I., Jones, T. & Sheen, J.
(2003) Science 300, 332-336). In other organisms it has been shown that hexose transport molecules also serve as sugar sensors.
Multiple N signals and sensing pathways exist as well in plants. Plants have mechanisms to sense nitrate, the major form of nitrogen fertilizer, as a signal for inorganic N status as well as to sense metabolites derived from nitrate as signals for reduced or organic N status. Nitrate reductase (NR) and nitrite reductase (NiR) are the first two enzymes in the nitrate reduction process and their expression can be stimulated by the presence of nitrate and modulated by other physiological factors including some nitrogenous compounds, sucrose, light and hormone (Forde, B.G. (2000) Biochimica et Biophysica Acta 1465, 219-235; Howitt, S.M. & Udvardi, M.K. (2000) Biochimica et Biophysica Acta 1465, 152-170; Stitt, M., Muller, M., Matt, M., Gibon, Y., Carillo, P., Morcuende, R., Scheible, W. & Krapp, A. (2002) J Exp Bot. 53, 959-970; Lea, P.J. & Morot-Gaudry, J.F. eds. 2001 Plant Nitrogen.
Berlin: Springer-Veriag; Morot-Gaudry JF 2001 Nitrogen assimilation by plants Science Publishers Inc. NH, US).
It is clear that carbon and nitrogen metabolism is closely linked and tightly regulated (Coruzzi, G. & Bush, D.R. (2001) Plant Physiol 125, 61-64).
The availability of carbon metabolites is crucial for efficient nitrate utilization and the nitrogen status is very sensitive to photosynthesis. Despite increased knowledge of structural genes involved in carbon and nitrogen metabolism, trans-acting factors involved in transcriptional regulation of C/N gene expression have not been characterized.
GATA transcription factors are a group of transcriptional regulators broadly distributed in eukaryotes. The GATA DNA binding domain normally recognizes the consensus sequence WGATAR (W = T or A; R = G or A) (Lowry, J. & Atchley, W. (2000) J Mol Evol 50, 103-115). GATA motifs have been identified in the regulatory regions of many light responsive genes (Arguello-Astorga, G. & Herrera-Estrella, L. (1998) Annu Rev Plant Physiol Plant Mol Biol 49, 525-555), including many genes involved in or relating to photosynthesis such as the RBCS, CAB (chlorophyll A/B binding protein) and GAP (glyceraldehyde-3-phosphate dehydrogenase) (Terzaghi, W.B. &
Cashmore, A.R. (1995) Annu Rev Plant Physiol Plant Mol Biol 46, 445-474;
Koch, K.E. (1996) Carbohydrate-modulated gene expression in plants. Annu Rev Plant Physiol Plant Mol Biol 47, 509-540; Jeong, M.J. & Shih, M.C.
(2003) Biochem Biophys Res Commun 300, 555-562) as well as genes involved in nitrate assimilation such as nitrate reductase, nitrite reductase, and Gln synthetase (Jarai, G., Truong, H., Daniel-Vedele, F. & Marzluf, G.
(1992) Curr Genet 21, 37-41; Rastogi, R., Bate, N., Sivasankar, S &
Rothstein, S. (1997) Plant Mol Biol. 34, 465-76; Oliveira, I.C. & Coruzzi, G.M.
(1999) Plant Physiol 121, 301-309). Some known trans-acting regulatory proteins that globally regulate genes in N metabolism are GATA transcription factor genes. In yeast, four global nitrogen regulatory factors GLN3, NIL1, NIL2 and DAL80 are DNA-binding proteins that contain a single GATA zinc finger, recognizing the consensus motif GATA (Hofman-Bang, J. (1999) Mol Biotech 12, 35-73). In fungi, Neurospora crassa NIT2 (Tao Y and Marzluf GA
1999 Curr Genet 36, 153-158) and Aspergillus nidulans AREA (Caddick MX
Arst HN Jr Taylor LH Johnson RI Brownlee AG 1986 Cloning of the regulatory gene areA mediating nitrogen metabolite repression in Aspergillus nidulans.
EMBO J 5, 1087-1090) are GATA transcription factor genes.
In plants, the in vivo function of GATA factors remains very poorly defined, with the Arabidopsis genome having 30 GATA members (Riechmann, J.L., Heard, J., Martin, G., Reuber, L., Jiang, C., Keddie, J., Adam, L., Pineda, 0., Ratcliffe, O.J., Samaha, R.R., Creelman, R., Pilgrim, M., Broun, P., Zhang, J.Z., Ghandehari, D., Sherman, B.K. & Yu, G. (2000) Science 290, 2105-2110; Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J.
(2004) Plant Physiol. 134, 1718-1732). Applicant identified the Arabidopsis GATA transcription factor gene GNC (At5g56860) important in chlorophyll synthesis and sugar sensitivity previously (WO 2006/074547). In the rice (Oryza sativa) genome, there are 28 GATA transcription factor genes, with one gene OsGATA16 sharing similarity with the Arabidopsis GATA gene At4g26150 (Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J. (2004) Plant Physiol. 134, 1718-1732 and WO 2006/074547).
SUMMARY OF THE INVENTION
The inventors have isolated a new GATA transcription factor from rice, termed OsGATAl 1, which is an ortholog of the At4g26150 gene from Arabidopsis. The At4g26150 gene is a GNC paralog in the phylogenetic tree of the 30 Arabidopsis GATA transcription factor genes (Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J. (2004) Plant Physiol. 134, 1718-1732) and was found to have overlapping function with GNC. The inventors have determined that the expression of the OsGATAl 1 gene regulates chlorophyll synthesis, seed yield and stress response to low nitrogen levels. Loss-of-function mutant plants in the OsGATAl 1 gene resulted in reduced chlorophyll levels.
In particular, transgenic rice plants silencing the OsGATAl 1 gene via RNAi, as well as transgenic plants over-expressing the rice gene, were created. The plants transformed with the OsGATAl 1 gene had increased chlorophyll levels and increased seed yield and had an improved stress response to low nitrogen levels. Plants grown under high N experienced stress after being transferred from the growth room to the greenhouse and the transgenic plants over-expressing OsGATAl 1 responded much better to the stress.
Sugars are central regulators of many vital processes in photosynthetic plants, such as photosynthesis and carbon and nitrogen metabolism. This regulation is achieved by regulating gene expression to either activate or repress genes involved. The mechanisms by which sugars control gene expression are not understood well. The GATA transcription factor disclosed here is involved in regulating sugar sensing and the expression of the factor itself is influenced by the change of the N status. Increased expression of this gene can produce plants with increased yield, particularly as the manipulation of sugar signaling pathways can lead to increased photosynthesis and increased nitrogen assimilation and alter source-sink relationships in seeds, tubes, roots and other storage organs.
Accordingly, the present invention relates to a method of modulating a characteristic in a plant or plant cell comprising modulating expression of a GATA transcription factor gene in the plant or plant cell. In an embodiment of the invention, the expression of the GATA transcription factor gene is modulated by administering, to the cell, an effective amount of an agent that can modulate the expression levels of a GATA transcription factor gene in the plant cell. In a further embodiment of the invention, the agent enhances the expression levels of a GATA transcription factor gene in the plant cell.
The characteristic to be modulated in the plant may be any agronomic trait of interest. In an embodiment of the invention, the characteristic is any that is affected by nitrogen, carbon and/or sulfur metabolism, biosynthesis of lipids, perception of nutrients, nutritional adaptation, electron transport and/or membrane associated energy conservation. In a further embodiment of the invention, the characteristic is selected from one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation, abiotic stress tolerance and nutritional composition. In a still further embodiment of the invention the modulated characteristic is an increase or improvement in one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation abiotic stress tolerance and nutritional composition.
In a particular embodiment, the present invention relates to a method of improving nitrogen utilization in a plant or plant cell comprising enhancing expression of a GATA transcription factor gene in the plant or plant cell.
Improving nitrogen utilization in a plant will allow for reduce amounts of nitrogen fertilizer to applied to the plant with a concomitant reduction in costs to the farmer and cost to the environment since nitrate pollution is a major problem in many agricultural areas contributing significantly to the degradation of both fresh water and marine environments.
Howitt SM and Udvardi MK 2000 Biochimica et Biophysica Acta 1465, 152-170; Stitt M et al 2002 J Exp Bot. 53, 959-70) as well as on genes involved in other aspects of nitrogen metabolism (Lea PJ, Morot-Gaudry JF, eds. 2001 Plant Nitrogen. Berlin: Springer-Verlag; Morot-Gaudry JF 2001 Nitrogen assimilation by plants Science Publishers Inc. NH, US). Also, it is clear that the availability of carbon metabolites is crucial for the efficient use of field nitrate and there is good experimental evidence for a linkage between carbon and nitrogen metabolism (Coruzzi GM and Zhou L 2001 Curr Opin Plant Biol.
4, 247-53). In addition, some experiments suggest that GS and GOGAT are involved in remobilizing N from senescing organs to the sink organ (Brouquisse R et al 2001 in Lea PJ, Morot-Gaudry JF, eds. Plant Nitrogen.
Berlin: Springer-Verlag 275-293; Yamaya T et al 2002 J Exp Bot. 53, 917-925). However, most aspects of the regulation of these genes are still unclear and there is still no notion of how this regulation affects nitrogen use efficiency.
Plants can sense levels of carbon and nitrogen metabolites and accordingly adjust growth and development. The perception mechanisms are complex regulatory networks that control gene expression to accommodate constant changes of nutrient-dependent cellular activities. Possession of a sugar-sensing mechanism enables plants to turn off photosynthesis when C-skeletons are abundant. The N-sensing mechanism enables plants to turn off nitrate uptake and reduction when levels of reduced or organic N are high (Coruzzi, G.M. & Zhou, L. (2001) Curr Opin Plant Biol. 4, 247-53).
Multiple sugar signal transduction pathways exist in plants. Glucose has emerged as a key regulator of many vital processes in photosynthetic plants such as in photosynthesis and in carbon and nitrogen metabolism (Rolland, F., Moore, B. & Sheen, J. (2002) Plant Cell S185-S205).
Hexokinases (HXK) are an important control point for glucose metabolism.
They not only catalyze the phosphorylation of glucose but also function as a glucose sensor to interrelate nutrient, light and hormone signaling networks for controlling growth and development in response to the changing environment (Jang, J., Leon, P, Zhou, L. & Sheen, J. (1997) Plant Cell 9, 5-19; Dai, N., Schaffer, A., Petreikov, M., Shahak, Y., Giller, Y., Ratner, K., Levine, A. & Granot, D. (1999) Plant Cell 11, 1253-1266; Moore, B., Zhou, L., Rolland, F., Hall, Q., Cheng, W., Liu, Y., Hwang, I., Jones, T. & Sheen, J.
(2003) Science 300, 332-336). In other organisms it has been shown that hexose transport molecules also serve as sugar sensors.
Multiple N signals and sensing pathways exist as well in plants. Plants have mechanisms to sense nitrate, the major form of nitrogen fertilizer, as a signal for inorganic N status as well as to sense metabolites derived from nitrate as signals for reduced or organic N status. Nitrate reductase (NR) and nitrite reductase (NiR) are the first two enzymes in the nitrate reduction process and their expression can be stimulated by the presence of nitrate and modulated by other physiological factors including some nitrogenous compounds, sucrose, light and hormone (Forde, B.G. (2000) Biochimica et Biophysica Acta 1465, 219-235; Howitt, S.M. & Udvardi, M.K. (2000) Biochimica et Biophysica Acta 1465, 152-170; Stitt, M., Muller, M., Matt, M., Gibon, Y., Carillo, P., Morcuende, R., Scheible, W. & Krapp, A. (2002) J Exp Bot. 53, 959-970; Lea, P.J. & Morot-Gaudry, J.F. eds. 2001 Plant Nitrogen.
Berlin: Springer-Veriag; Morot-Gaudry JF 2001 Nitrogen assimilation by plants Science Publishers Inc. NH, US).
It is clear that carbon and nitrogen metabolism is closely linked and tightly regulated (Coruzzi, G. & Bush, D.R. (2001) Plant Physiol 125, 61-64).
The availability of carbon metabolites is crucial for efficient nitrate utilization and the nitrogen status is very sensitive to photosynthesis. Despite increased knowledge of structural genes involved in carbon and nitrogen metabolism, trans-acting factors involved in transcriptional regulation of C/N gene expression have not been characterized.
GATA transcription factors are a group of transcriptional regulators broadly distributed in eukaryotes. The GATA DNA binding domain normally recognizes the consensus sequence WGATAR (W = T or A; R = G or A) (Lowry, J. & Atchley, W. (2000) J Mol Evol 50, 103-115). GATA motifs have been identified in the regulatory regions of many light responsive genes (Arguello-Astorga, G. & Herrera-Estrella, L. (1998) Annu Rev Plant Physiol Plant Mol Biol 49, 525-555), including many genes involved in or relating to photosynthesis such as the RBCS, CAB (chlorophyll A/B binding protein) and GAP (glyceraldehyde-3-phosphate dehydrogenase) (Terzaghi, W.B. &
Cashmore, A.R. (1995) Annu Rev Plant Physiol Plant Mol Biol 46, 445-474;
Koch, K.E. (1996) Carbohydrate-modulated gene expression in plants. Annu Rev Plant Physiol Plant Mol Biol 47, 509-540; Jeong, M.J. & Shih, M.C.
(2003) Biochem Biophys Res Commun 300, 555-562) as well as genes involved in nitrate assimilation such as nitrate reductase, nitrite reductase, and Gln synthetase (Jarai, G., Truong, H., Daniel-Vedele, F. & Marzluf, G.
(1992) Curr Genet 21, 37-41; Rastogi, R., Bate, N., Sivasankar, S &
Rothstein, S. (1997) Plant Mol Biol. 34, 465-76; Oliveira, I.C. & Coruzzi, G.M.
(1999) Plant Physiol 121, 301-309). Some known trans-acting regulatory proteins that globally regulate genes in N metabolism are GATA transcription factor genes. In yeast, four global nitrogen regulatory factors GLN3, NIL1, NIL2 and DAL80 are DNA-binding proteins that contain a single GATA zinc finger, recognizing the consensus motif GATA (Hofman-Bang, J. (1999) Mol Biotech 12, 35-73). In fungi, Neurospora crassa NIT2 (Tao Y and Marzluf GA
1999 Curr Genet 36, 153-158) and Aspergillus nidulans AREA (Caddick MX
Arst HN Jr Taylor LH Johnson RI Brownlee AG 1986 Cloning of the regulatory gene areA mediating nitrogen metabolite repression in Aspergillus nidulans.
EMBO J 5, 1087-1090) are GATA transcription factor genes.
In plants, the in vivo function of GATA factors remains very poorly defined, with the Arabidopsis genome having 30 GATA members (Riechmann, J.L., Heard, J., Martin, G., Reuber, L., Jiang, C., Keddie, J., Adam, L., Pineda, 0., Ratcliffe, O.J., Samaha, R.R., Creelman, R., Pilgrim, M., Broun, P., Zhang, J.Z., Ghandehari, D., Sherman, B.K. & Yu, G. (2000) Science 290, 2105-2110; Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J.
(2004) Plant Physiol. 134, 1718-1732). Applicant identified the Arabidopsis GATA transcription factor gene GNC (At5g56860) important in chlorophyll synthesis and sugar sensitivity previously (WO 2006/074547). In the rice (Oryza sativa) genome, there are 28 GATA transcription factor genes, with one gene OsGATA16 sharing similarity with the Arabidopsis GATA gene At4g26150 (Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J. (2004) Plant Physiol. 134, 1718-1732 and WO 2006/074547).
SUMMARY OF THE INVENTION
The inventors have isolated a new GATA transcription factor from rice, termed OsGATAl 1, which is an ortholog of the At4g26150 gene from Arabidopsis. The At4g26150 gene is a GNC paralog in the phylogenetic tree of the 30 Arabidopsis GATA transcription factor genes (Reyes, J.C., Muro-Pastor, M.I. & Florencio, F.J. (2004) Plant Physiol. 134, 1718-1732) and was found to have overlapping function with GNC. The inventors have determined that the expression of the OsGATAl 1 gene regulates chlorophyll synthesis, seed yield and stress response to low nitrogen levels. Loss-of-function mutant plants in the OsGATAl 1 gene resulted in reduced chlorophyll levels.
In particular, transgenic rice plants silencing the OsGATAl 1 gene via RNAi, as well as transgenic plants over-expressing the rice gene, were created. The plants transformed with the OsGATAl 1 gene had increased chlorophyll levels and increased seed yield and had an improved stress response to low nitrogen levels. Plants grown under high N experienced stress after being transferred from the growth room to the greenhouse and the transgenic plants over-expressing OsGATAl 1 responded much better to the stress.
Sugars are central regulators of many vital processes in photosynthetic plants, such as photosynthesis and carbon and nitrogen metabolism. This regulation is achieved by regulating gene expression to either activate or repress genes involved. The mechanisms by which sugars control gene expression are not understood well. The GATA transcription factor disclosed here is involved in regulating sugar sensing and the expression of the factor itself is influenced by the change of the N status. Increased expression of this gene can produce plants with increased yield, particularly as the manipulation of sugar signaling pathways can lead to increased photosynthesis and increased nitrogen assimilation and alter source-sink relationships in seeds, tubes, roots and other storage organs.
Accordingly, the present invention relates to a method of modulating a characteristic in a plant or plant cell comprising modulating expression of a GATA transcription factor gene in the plant or plant cell. In an embodiment of the invention, the expression of the GATA transcription factor gene is modulated by administering, to the cell, an effective amount of an agent that can modulate the expression levels of a GATA transcription factor gene in the plant cell. In a further embodiment of the invention, the agent enhances the expression levels of a GATA transcription factor gene in the plant cell.
The characteristic to be modulated in the plant may be any agronomic trait of interest. In an embodiment of the invention, the characteristic is any that is affected by nitrogen, carbon and/or sulfur metabolism, biosynthesis of lipids, perception of nutrients, nutritional adaptation, electron transport and/or membrane associated energy conservation. In a further embodiment of the invention, the characteristic is selected from one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation, abiotic stress tolerance and nutritional composition. In a still further embodiment of the invention the modulated characteristic is an increase or improvement in one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation abiotic stress tolerance and nutritional composition.
In a particular embodiment, the present invention relates to a method of improving nitrogen utilization in a plant or plant cell comprising enhancing expression of a GATA transcription factor gene in the plant or plant cell.
Improving nitrogen utilization in a plant will allow for reduce amounts of nitrogen fertilizer to applied to the plant with a concomitant reduction in costs to the farmer and cost to the environment since nitrate pollution is a major problem in many agricultural areas contributing significantly to the degradation of both fresh water and marine environments.
The plant or plant cell may be from any plant wherein one wishes to modulate a characteristic. In an embodiment of the invention, the plant cell is a dicot, a gymnosperm or a monocot. In one embodiment, the dicot is selected from the group consisting of soybean, tobacco or cotton. In a further embodiment of the invention, the monocot is selected from maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp. and teosite.
In an embodiment of the invention, the agent that enhances the expression levels of a GATA transcription factor gene in the plant cell comprises a nucleic acid molecule encoding a GATA transcription factor.
In an embodiment of the invention, the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
In a further embodiment of the invention, the nucleic acid molecule comprises the sequence of the OsGATAl1 gene of SEQ ID NO:1 or a functional fragment thereof. In a still further embodiment of the invention, the nucleic acid molecule comprises a sequence that hybridizes under medium stringency conditions to the OsGATAl 1 gene of SEQ ID NO:1 or a functional fragment thereof. In another embodiment of the present invention, the nucleic acid molecule is derived from the nucleotide sequence of the At5g56860 gene of SEQ ID NO:1 and has a nucleotide sequence comprising codons specific for expression in plants.
In a further embodiment of the invention, the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a functional fragment, domain, repeat, or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or a functional fragment or domain thereof, or a sequence complementary thereto; or (d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto.
In an embodiment of the present invention, when the agent is a nucleic acid sequence, the nucleic acid sequence is expressed in a specific location or tissue of the plant. The location or tissue is for example, but not limited to, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf and/or flower. In an alternative embodiment, the location or tissue is a seed.
Embodiments of the present invention also relate to use of a shuffled nucleic acid molecule for modulating a characteristic in a plant cell, said shuffled nucleic acid molecule containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments encodes a GATA
transcription factor and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid. In a specific embodiment, all of the fragments in a shuffled nucleic acid molecule containing a plurality of nucleotide sequence fragments are from a single gene. In a more specific embodiment, the plurality of fragments originate from at least two different genes. In a more specific embodiment, the shuffled nucleic acid is operably linked to a promoter sequence. Another more specific embodiment is a use of a chimeric polynucleotide for modulating a characteristic in a plant cell, said chimeric polynucleotide including a promoter sequence operably linked to the shuffled nucleic acid. In a more specific embodiment, the shuffled nucleic acid is contained within a host cell. In a further specific embodiment of the invention the fragment encoding a GATA
transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Embodiments of the present invention also contemplate a use of an expression cassette for modulating a characteristic in a plant cell including a promoter sequence operably linked to an isolated nucleic acid encoding a GATA transcription factor. In embodiments of the invention the isolated nucleic acid encoding a GATA transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
In an embodiment of the invention, the agent that enhances the expression levels of a GATA transcription factor gene in the plant cell comprises a nucleic acid molecule encoding a GATA transcription factor.
In an embodiment of the invention, the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
In a further embodiment of the invention, the nucleic acid molecule comprises the sequence of the OsGATAl1 gene of SEQ ID NO:1 or a functional fragment thereof. In a still further embodiment of the invention, the nucleic acid molecule comprises a sequence that hybridizes under medium stringency conditions to the OsGATAl 1 gene of SEQ ID NO:1 or a functional fragment thereof. In another embodiment of the present invention, the nucleic acid molecule is derived from the nucleotide sequence of the At5g56860 gene of SEQ ID NO:1 and has a nucleotide sequence comprising codons specific for expression in plants.
In a further embodiment of the invention, the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a functional fragment, domain, repeat, or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or a functional fragment or domain thereof, or a sequence complementary thereto; or (d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto.
In an embodiment of the present invention, when the agent is a nucleic acid sequence, the nucleic acid sequence is expressed in a specific location or tissue of the plant. The location or tissue is for example, but not limited to, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf and/or flower. In an alternative embodiment, the location or tissue is a seed.
Embodiments of the present invention also relate to use of a shuffled nucleic acid molecule for modulating a characteristic in a plant cell, said shuffled nucleic acid molecule containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments encodes a GATA
transcription factor and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid. In a specific embodiment, all of the fragments in a shuffled nucleic acid molecule containing a plurality of nucleotide sequence fragments are from a single gene. In a more specific embodiment, the plurality of fragments originate from at least two different genes. In a more specific embodiment, the shuffled nucleic acid is operably linked to a promoter sequence. Another more specific embodiment is a use of a chimeric polynucleotide for modulating a characteristic in a plant cell, said chimeric polynucleotide including a promoter sequence operably linked to the shuffled nucleic acid. In a more specific embodiment, the shuffled nucleic acid is contained within a host cell. In a further specific embodiment of the invention the fragment encoding a GATA
transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Embodiments of the present invention also contemplate a use of an expression cassette for modulating a characteristic in a plant cell including a promoter sequence operably linked to an isolated nucleic acid encoding a GATA transcription factor. In embodiments of the invention the isolated nucleic acid encoding a GATA transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Further encompassed within the invention is use of a recombinant vector for modulating a characteristic in a plant cell comprising an expression cassette including a promoter sequence operably linked to an isolated nucleic acid encoding a GATA transcription factor. In embodiments of the invention the isolated nucleic acid encoding a GATA transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ; (e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Also encompassed are uses of plant cells, which contain expression cassettes, according to the present disclosure, and uses of plants, containing these plant cells.
In one embodiment, the expression cassette is expressed throughout the plant. In another embodiment, the expression cassette is expressed in a specific location or tissue of a plant. In a specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternative specific embodiment, the location or tissue is a seed.
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Further encompassed within the invention is use of a recombinant vector for modulating a characteristic in a plant cell comprising an expression cassette including a promoter sequence operably linked to an isolated nucleic acid encoding a GATA transcription factor. In embodiments of the invention the isolated nucleic acid encoding a GATA transcription factor consists of or comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ; (e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Also encompassed are uses of plant cells, which contain expression cassettes, according to the present disclosure, and uses of plants, containing these plant cells.
In one embodiment, the expression cassette is expressed throughout the plant. In another embodiment, the expression cassette is expressed in a specific location or tissue of a plant. In a specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternative specific embodiment, the location or tissue is a seed.
Embodiments of the present invention also provide the use of seed and isolated product from plants for modulating a characteristic in a plant cell, which contain an expression cassette including a promoter sequence operably linked to an isolated nucleic acid encoding a GATA transcription factor gene according to the present invention.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-1a promoter, ubiquitin and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or glutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
In a specific embodiment, the expression vector is a eukaryotic expression vector or a prokaryotic expression vector. In a more specific embodiment, the eukaryotic expression vector includes a tissue-specific promoter. More specifically, the expression vector is operable in plants.
Embodiments of the present invention also relate to a plant modified by a method that includes introducing into a plant a nucleic acid where the nucleic acid is expressible in the plant in an amount effective to effect the modification. The modification can be an increase or decrease in the one or more traits of interest. The modification may include overexpression, underexpression, antisense modulation, sense suppression, inducible expression, inducible repression, or inducible modulation of a gene. In an embodiment of the invention the modification involved an increase or improvement in the trait of interest, for example, nitrogen utilization.
Embodiments of the present invention provide nucleotide and amino acid sequences isolated from Arabidopsis thaliana. Particularly, the present invention relates to a nitrogen-regulated GATA transcription factor gene required for sugar sensing.
Embodiments of the present invention relate to an isolated nucleic acid comprising or consisting of a nucleotide sequence comprising:
(a) a nucleotide sequence listed in SEQ ID NO:1, or a fragment or domain, thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c).
In a specific embodiment, the substantial similarity is at least about 65% identity, specifically about 80% identity, specifically 90%, and more specifically at least about 95% sequence identity to the nucleotide sequence listed as SEQ ID NO:1, a fragment or domain thereof.
In a one embodiment, the sequence having substantial similarity to the nucleotide sequence of SEQ ID NO:1, a fragment or domain thereof, is from a plant. In a specific embodiment, the plant is a dicot. In a more specific embodiment, the dicot is selected from the group consisting of soybean, tobacco or cotton. In another specific embodiment, the plant is a gymnosperm. In another specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp., or teosinte.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-1a promoter, ubiquitin and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or glutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
In a specific embodiment, the expression vector is a eukaryotic expression vector or a prokaryotic expression vector. In a more specific embodiment, the eukaryotic expression vector includes a tissue-specific promoter. More specifically, the expression vector is operable in plants.
Embodiments of the present invention also relate to a plant modified by a method that includes introducing into a plant a nucleic acid where the nucleic acid is expressible in the plant in an amount effective to effect the modification. The modification can be an increase or decrease in the one or more traits of interest. The modification may include overexpression, underexpression, antisense modulation, sense suppression, inducible expression, inducible repression, or inducible modulation of a gene. In an embodiment of the invention the modification involved an increase or improvement in the trait of interest, for example, nitrogen utilization.
Embodiments of the present invention provide nucleotide and amino acid sequences isolated from Arabidopsis thaliana. Particularly, the present invention relates to a nitrogen-regulated GATA transcription factor gene required for sugar sensing.
Embodiments of the present invention relate to an isolated nucleic acid comprising or consisting of a nucleotide sequence comprising:
(a) a nucleotide sequence listed in SEQ ID NO:1, or a fragment or domain, thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c).
In a specific embodiment, the substantial similarity is at least about 65% identity, specifically about 80% identity, specifically 90%, and more specifically at least about 95% sequence identity to the nucleotide sequence listed as SEQ ID NO:1, a fragment or domain thereof.
In a one embodiment, the sequence having substantial similarity to the nucleotide sequence of SEQ ID NO:1, a fragment or domain thereof, is from a plant. In a specific embodiment, the plant is a dicot. In a more specific embodiment, the dicot is selected from the group consisting of soybean, tobacco or cotton. In another specific embodiment, the plant is a gymnosperm. In another specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp., or teosinte.
In one embodiment the nucleic acid is expressed in a specific location or tissue of a plant. The location or tissue is for example, but not limited to, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternative embodiment, the location or tissue is a seed. In another embodiment, the nucleic acid encodes a polypeptide involved in a function such as, for example, but not limited to, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In a specific embodiment, the isolated nucleic acid comprises or consists of a nucleotide sequence capable of hybridizing to a nucleotide sequence listed in SEQ ID NO:1 or a fragment or domain thereof. In a specific embodiment, hybridization allows the sequence to form a duplex at medium or high stringency. Embodiments of the present invention also encompass a nucleotide sequence complementary to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof. Embodiments of the present invention further encompass a nucleotide sequence complementary to a nucleotide sequence that has substantial similarity or is capable of hybridizing to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof.
In a specific embodiment, the nucleotide sequence having substantial similarity is an allelic variant of the nucleotide sequence of SEQ ID NO:1 a fragment or domain thereof. In an alternate embodiment, the sequence having substantial similarity is a naturally occurring variant. In another alternate embodiment, the sequence having substantial similarity is a polymorphic variant of the nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof.
In a specific embodiment, the isolated nucleic acid contains a plurality of regions having the nucleotide sequence of SEQ ID NO:1 or exon or domain thereof.
In a specific embodiment, the isolated nucleic acid contains a polypeptide-encoding sequence. In a more specific embodiment, the polypeptide-encoding sequence contains a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of a nucleic acid sequence of SEQ ID NO:1. In a more specific embodiment, the polypeptide contains a polypeptide sequence of SEQ ID NO:2, or a fragment thereof. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, miloflax, gramma grass, Tripsacum, and teosinte.
In one embodiment, the polypeptide is expressed throughout the plant.
In a more specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In a most specific embodiment, the location or tissue is a seed.
In a specific embodiment, the sequence of the isolated nucleic acid encodes a polypeptide useful for generating an antibody having immunoreactivity against a polypeptide encoded by a nucleotide sequence of SEQ ID NO:2, or fragment or domain thereof.
In a specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides.
In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the sequence of the isolated nucleic acid having substantial similarity comprises or consists of a substitution in at least one codon. In a specific embodiment, the substitution is conservative.
Embodiments of the present invention also relate to an isolated nucleic acid molecule comprising or consisting of a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including:
In a specific embodiment, the isolated nucleic acid comprises or consists of a nucleotide sequence capable of hybridizing to a nucleotide sequence listed in SEQ ID NO:1 or a fragment or domain thereof. In a specific embodiment, hybridization allows the sequence to form a duplex at medium or high stringency. Embodiments of the present invention also encompass a nucleotide sequence complementary to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof. Embodiments of the present invention further encompass a nucleotide sequence complementary to a nucleotide sequence that has substantial similarity or is capable of hybridizing to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof.
In a specific embodiment, the nucleotide sequence having substantial similarity is an allelic variant of the nucleotide sequence of SEQ ID NO:1 a fragment or domain thereof. In an alternate embodiment, the sequence having substantial similarity is a naturally occurring variant. In another alternate embodiment, the sequence having substantial similarity is a polymorphic variant of the nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof.
In a specific embodiment, the isolated nucleic acid contains a plurality of regions having the nucleotide sequence of SEQ ID NO:1 or exon or domain thereof.
In a specific embodiment, the isolated nucleic acid contains a polypeptide-encoding sequence. In a more specific embodiment, the polypeptide-encoding sequence contains a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of a nucleic acid sequence of SEQ ID NO:1. In a more specific embodiment, the polypeptide contains a polypeptide sequence of SEQ ID NO:2, or a fragment thereof. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, miloflax, gramma grass, Tripsacum, and teosinte.
In one embodiment, the polypeptide is expressed throughout the plant.
In a more specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In a most specific embodiment, the location or tissue is a seed.
In a specific embodiment, the sequence of the isolated nucleic acid encodes a polypeptide useful for generating an antibody having immunoreactivity against a polypeptide encoded by a nucleotide sequence of SEQ ID NO:2, or fragment or domain thereof.
In a specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides.
In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the sequence of the isolated nucleic acid having substantial similarity comprises or consists of a substitution in at least one codon. In a specific embodiment, the substitution is conservative.
Embodiments of the present invention also relate to an isolated nucleic acid molecule comprising or consisting of a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including:
(a) a polypeptide sequence of SEQ ID NO:2, or a fragment, domain, repeat, or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or a fragment or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence of SEQ ID NO:1 or a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In another specific embodiment, the polypeptide having substantial similarity is an allelic variant of a polypeptide sequence of SEQ ID NO:2, or a fragment, domain, repeat or chimera thereof. In another specific embodiment, the isolated nucleic acid includes a plurality of regions from the polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or fragment or domain thereof, or a sequence complementary thereto.
In another specific embodiment, the polypeptide is a polypeptide sequence of SEQ ID NO:2. In another specific embodiment, the polypeptide is a functional fragment or domain. In yet another specific embodiment, the polypeptide is a chimera, where the chimera may include functional protein domains, including domains, repeats, post-translational modification sites, or other features. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum, and teosinte.
_20_ In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In another specific embodiment, the location or tissue is a seed.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof or a sequence complementary thereto, includes a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides. In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or a fragment or domain thereof or a sequence complementary thereto, includes a substitution of at least one codon. In a more specific embodiment, the substitution is conservative.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence of SEQ ID NO:2 or a fragment, domain, repeat, or chimeras thereof includes a deletion or insertion of at least one amino acid.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence of SEQ ID NO:2 or a fragment, domain, repeat, or chimeras thereof includes a substitution of at least one amino acid.
Embodiments of the present invention also relate to a shuffled nucleic acid containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments corresponds to a region of a nucleotide sequence of SEQ ID NO:1 and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid. In a more specific embodiment, all of the fragments in a shuffled nucleic acid containing a plurality of nucleotide sequence fragments are from a single gene. In a more specific embodiment, the plurality of fragments originates from at least two different genes. In a more specific embodiment, the shuffled nucleic acid is operably linked to a promoter sequence. Another more specific embodiment is a chimeric polynucleotide including a promoter sequence operably linked to the shuffled nucleic acid. In a more specific embodiment, the shuffled nucleic acid is contained within a host cell.
Embodiments of the present invention also contemplate an expression cassette including a promoter sequence operably linked to an isolated nucleic acid containing a nucleotide sequence including:
a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Further encompassed within the invention is a recombinant vector comprising an expression cassette according to embodiments of the present invention. Also encompassed are plant cells, which contain expression cassettes, according to the present disclosure, and plants, containing these plant cells. In a specific embodiment, the plant is a dicot. In a more specific embodiment, the dicot is selected from the group consisting of soybean, tobacco or cotton. In another specific embodiment, the plant is a gymnosperm. In another specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, _22_ millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum and teosinte.
In one embodiment, the expression cassette is expressed throughout the plant. In another embodiment, the expression cassette is expressed in a specific location or tissue of a plant. In a specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternative specific embodiment, the location or tissue is a seed.
In one embodiment, the expression cassette is involved in a function such as, for example, but not limited to, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In a more specific embodiment, the chimeric polypeptide is involved in a function such as, nitrogen utilization, abiotic stress tolerance, enhanced yield, disease resistance and/or nutritional composition.
In one embodiment, the plant contains a modification to a phenotype or measurable characteristic of the plant, the modification being attributable to the expression of at least one gene contained in the expression cassette. In a specific embodiment, the modification may be, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
Embodiments of the present invention also provide seed and isolated product from plants which contain an expression cassette including a promoter sequence operably linked to an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence of SEQ ID NO:1or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d) according to the present disclosure.
In a specific embodiment the isolated product includes an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
Embodiments of the present invention also relate to isolated products produced by expression of an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence of SEQ ID NO:1, or fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, or a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a) or (b);
(e) a nucleotide sequence complementary to (a), (b), (c) or (d);
or (f) a nucleotide sequence that is the reverse complement of (a), (b) (c) or (d) according to the present disclosure.
In a specific embodiment, the product is produced in a plant. In another specific embodiment, the product is produced in cell culture. In another specific embodiment, the product is produced in a cell-free system.
In another specific embodiment, the product includes an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
In a specific embodiment, the product is a polypeptide containing an amino acid sequence of SEQ ID NO:2. In a more specific embodiment, the protein is an transcription factor.
Embodiments of the present invention further relate to an isolated polynucleotide including a nucleotide sequence of at least 10 bases, which sequence is identical, complementary, or substantially similar to a region of any sequence of SEQ ID NO:1, and wherein the polynucleotide is adapted for any of numerous uses.
In a specific embodiment, the polynucleotide is used as a chromosomal marker. In another specific embodiment, the polynucleotide is used as a marker for RFLP analysis. In another specific embodiment, the polynucleotide is used as a marker for quantitative trait linked breeding. In another specific embodiment, the polynucleotide is used as a marker for marker-assisted breeding. In another specific embodiment, the polynucleotide is used as a bait sequence in a two-hybrid system to identify sequence- encoding polypeptides interacting with the polypeptide encoded by the bait sequence. In another specific embodiment, the polynucleotide is used as a diagnostic indicator for genotyping or identifying an individual or population of individuals. In another specific embodiment, the polynucleotide is used for genetic analysis to identify boundaries of genes or exons.
Embodiments of the present invention also relate to an expression vector comprising or consisting of a nucleic acid molecule including:
(a) a nucleic acid encoding a polypeptide as listed in SEQ ID
NO:2 (b) a fragment, one or more domains, or featured regions of SEQ ID NO:1; or (c) a complete nucleic acid sequence listed in SEQ ID NO:1, or a fragment thereof, in combination with a heterologous sequence.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-1a promoter, ubiquitin and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or gIutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
In a specific embodiment, the expression vector is a eukaryotic expression vector or a prokaryotic expression vector. In a more specific embodiment, the eukaryotic expression vector includes a tissue-specific promoter. More specifically, the expression vector is operable in plants.
Embodiments of the present invention also relate to a cell comprising or consisting of a nucleic acid construct comprising an expression vector and a nucleic acid including a nucleic acid encoding a polypeptide as listed in SEQ
ID NO:2, or a nucleic acid sequence listed in SEQ ID NO:1, or a segment thereof, in combination with a heterologous sequence.
In a specific embodiment, the cell is a bacterial cell, a fungal cell, a plant cell, or an animal cell. In a specific embodiment, the cell is a plant cell. In a more specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a most specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternate most specific embodiment, the location or tissue is a seed. In a specific embodiment, the polypeptide is involved in a function such as, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
Embodiments of the present invention also relate to polypeptides encoded by the isolated nucleic acid molecules of the present disclosure including a polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c);
(f) or a functional fragment thereof.
A polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including a polypeptide sequence including:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a domain, repeat, or chimeras thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1 or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d);
(f) or a functional fragment thereof.
Embodiments of the present invention contemplate a polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid which includes a shuffled nucleic acid containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments corresponds to a region of a nucleotide sequence listed SEQ ID NO:1, and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid, or functional fragment thereof.
Embodiments of the present invention contemplate a polypeptide containing a polypeptide sequence encoded by an isolated polynucleotide containing a nucleotide sequence of at least 10 bases, which sequence is identical, complementary, or substantially similar to a region of any of sequences of SEQ ID NO:1, or functional fragment thereof and wherein the polynucleotide is adapted for a use including:
(a) use as a chromosomal marker to identify the location of the corresponding or complementary polynucleotide on a native or artificial chromosome;
(b) use as a marker for RFLP analysis;
(c) use as a marker for quantitative trait linked breeding;
(d) use as a marker for marker-assisted breeding;
(e) use as a bait sequence in a two-hybrid system to identify sequence encoding polypeptides interacting with the polypeptide encoded by the bait sequence;
(f) use as a diagnostic indicator for genotyping or identifying an individual or population of individuals; or (g) use for genetic analysis to identify boundaries of genes or exons.
_28-Embodiments of the present invention also contemplate an isolated polypeptide containing a polypeptide sequence including:
(a) a polypeptide sequence listed SEQ ID NO:2, or exon or domain thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In a specific embodiment, the substantial similarity is at least about 65% identity. In a more specific embodiment, the substantial similarity is at least about 80% identity. In a most specific embodiment, the substantial similarity is at least about 95% identity. In a specific embodiment, the substantial similarity is at least three percent greater than the percent identity to the closest homologous sequence listed in any of the Sequence Listings.
In a specific embodiment, the sequence having substantial similarity is from a plant. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum and teosinte.
In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In another specific embodiment, the location or tissue is a seed. In a specific embodiment, the polypeptide is involved in a function such as, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In a specific embodiment, hybridization of a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto, or a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed SEQ ID NO:1, or to a sequence complementary thereto, allows the sequence to form a duplex at medium or high stringency.
In a specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is an allelic variant of the polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a naturally occurring variant of the polypeptide sequence listed in SEQ ID NO:2.
In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a polymorphic variant of the polypeptide sequence listed in SEQ ID NO:2.
In an alternate specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one amino acid. In a more specific embodiment, the deletion or insertion is of less than about ten amino acids. In a most specific embodiment, the deletion or insertion is of less than about three amino acids.
In a specific embodiment, the sequence having substantial similarity encodes a substitution in at least one amino acid.
Also contemplated is a method of producing a plant comprising a modification thereto, including the steps of: (1) providing a nucleic acid which is an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed SEQ ID NO:1, or exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c);
and (2) introducing the nucleic acid into the plant, wherein the nucleic acid is expressible in the plant in an amount effective to effect the modification. In one embodiment, the modification comprises an altered characteristic in the plant, wherein the characteristic corresponds to the nucleic acid introduced into the plant. In other specific embodiments the characteristic corresponds to carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In another embodiment, the modification includes an increased or decreased expression or accumulation of a product of the plant. Specifically, the product is a natural product of the plant. Equally specifically, the product is a new or altered product of the plant. Specifically, the product comprises a GATA transcription factor.
Also encompassed within the presently disclosed invention is a method of producing a recombinant protein, comprising the steps of:
(a) growing recombinant cells comprising a nucleic acid construct under suitable growth conditions, the construct comprising an expression vector and a nucleic acid including: a nucleic acid encoding a protein as listed in SEQ ID NO:2, or a nucleic acid sequence listed in SEQ ID NO:1, or segments thereof; and (b) isolating from the recombinant cells the recombinant protein expressed thereby.
Embodiments of the present invention provide a method of producing a recombinant protein in which the expression vector includes one or more elements including a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, and an affinity purification-tag encoding sequence. In one specific embodiment, the nucleic acid construct includes an epitope-tag encoding sequence and the isolating step includes use of an antibody specific for the epitope-tag. In another specific embodiment, the nucleic acid construct contains a polyamino acid encoding sequence and the isolating step includes use of a resin comprising a polyamino acid binding substance, specifically where the polyamino acid is polyhistidine and the polyamino binding resin is nickel-charged agarose resin. In yet another specific embodiment, the nucleic acid construct contains a polypeptide encoding sequence and the isolating step includes the use of a resin containing a polypeptide binding substance, specifically where the polypeptide is a chitin binding domain and the resin contains chitin-sepharose.
Embodiments of the present invention also relate to a plant modified by a method that includes introducing into a plant a nucleic acid where the nucleic acid is expressible in the plant in an amount effective to effect the modification. The modification can be, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation. In one embodiment, the modified plant has increased or decreased resistance to an herbicide, a stress, or a pathogen. In another embodiment, the modified plant has enhanced or diminished requirement for light, water, nitrogen, or trace elements. In yet another embodiment, the modified plant is enriched for an essential amino acid as a proportion of a protein fraction of the plant. The protein fraction may be, for example, total seed protein, soluble protein, insoluble protein, water-extractable protein, and lipid-associated protein. The modification may include overexpression, underexpression, antisense modulation, sense suppression, inducible expression, inducible repression, or inducible modulation of a gene.
The invention further relates to a seed from a modified plant or an isolated product of a modified plant, where the product may be an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
The above Summary of Invention lists several embodiments of the invention, and in many cases lists variations and permutations of these embodiments. The Summary is merely exemplary of the numerous and varied embodiments. Mention of one or more specific features of a given embodiment is likewise exemplary. Such embodiment can typically exist with or without the feature(s) mentioned; likewise, those features can be applied to other embodiments of the invention, whether listed in this Summary or not.
To avoid excessive repetition, this Summary does not list or suggest all possible combinations of such features.
For purposes of summarizing the invention and the advantages achieved over the prior art, certain objects and advantages of the invention have been described above. Of course, it is to be understood that not necessarily all such objects or advantages may be achieved in accordance with any particular embodiment of the invention. Thus, for example, those skilled in the art will recognize that the invention may be embodied or carried out in a manner that achieves or optimizes one advantage or group of advantages as taught herein without necessarily achieving other objects or advantages as may be taught or suggested herein.
Further aspects, features and advantages of this invention will become apparent from the detailed description of the specific embodiments that follow.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 and SEQ ID NO:1 shows the nucleic acid sequence of full length OsGATAl 1.
Figure 2 and SEQ ID NO:2 shows the amino acid sequence of OsGATAl 1.
Figure 3 shows the alignment of the amino acid sequence of At4g26150 (SEQ ID NO:7) and its rice ortholog OsGATAl 1 (SEQ ID NO:2).
Figure 4A and B shows the phenotypes of the OsGATAl 1 over-expressing plants.
Figure 5A and B shows the chlorophyll level affected by the expression of OsGATA11 gene.
Figure 6A and B shows the seed yield of OsGATAl 1 over-expressing plants.
Figure 7 are pictures showing more resistant to stress in the OsGATAl1 over-expressing plants.
DEFINITIONS
For clarity, certain terms used in the specification are defined and presented as follows:
"Associated with / operatively linked" refer to two nucleic acid sequences that are related physically or functionally. For example, a promoter or regulatory DNA sequence is said to be "associated with" a DNA
sequence that codes for an RNA or a protein if the two sequences are operatively linked, or situated such that the regulator DNA sequence will affect the expression level of the coding or structural DNA sequence.
A "chimeric construct" is a recombinant nucleic acid sequence in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA or which is expressed as a protein, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid sequence. The regulatory nucleic acid sequence of the chimeric construct is not normally operatively linked to the associated nucleic acid sequence as found in nature.
A "co-factor" is a natural reactant, such as an organic molecule or a metal ion, required in an enzyme-catalyzed reaction. A co-factor is e.g.
NAD(P), riboflavin (including FAD and FMN), folate, molybdopterin, thiamin, biotin, lipoic acid, pantothenic acid and coenzyme A, S-adenosylmethionine, pyridoxal phosphate, ubiquinone, menaquinone. Optionally, a co-factor can be regenerated and reused.
A"coding sequence" is a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA.
Specifically the RNA is then translated in an organism to produce a protein.
Complementary: "complementary" refers to two nucleotide sequences that comprise antiparallel nucleotide sequences capable of pairing with one another upon formation of hydrogen bonds between the complementary base residues in the antiparaliel nucleotide sequences.
Enzyme activity: means herein the ability of an enzyme to catalyze the conversion of a substrate into a product. A substrate for the enzyme comprises the natural substrate of the enzyme but also comprises analogues of the natural substrate, which can also be converted, by the enzyme into a product or into an analogue of a product. The activity of the enzyme is measured for example by determining the amount of product in the reaction after a certain period of time, or by determining the amount of substrate remaining in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of an unused co-factor of the reaction remaining in the reaction mixture after a certain period of time or by determining the amount of used co-factor in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of a donor of free energy or energy-rich molecule (e.g. ATP, phosphoenolpyruvate, acetyl phosphate or phosphocreatine) remaining in the reaction mixture after a certain period of time or by determining the amount of a used donor of free energy or energy-rich molecule (e.g. ADP, pyruvate, acetate or creatine) in the reaction mixture after a certain period of time.
Expression Cassette: "Expression cassette" as used herein means a nucleic acid molecule capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operatively linked to the nucleotide sequence of interest which is operatively linked to termination signals. It also typically comprises sequences required for proper translation of the nucleotide sequence. The coding region usually codes for a protein of interest but may also code for a functional RNA of interest, for example antisense RNA or a nontransiated RNA, in the sense or antisense direction. The expression cassette comprising the nucleotide sequence of interest may be chimeric, meaning that at least one of its components is heterologous with respect to at least one of its other components. The expression cassette may also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. Typically, however, the expression cassette is heterologous with respect to the host, i.e., the particular DNA sequence of the expression cassette does not occur naturally in the host cell and must have been introduced into the host cell or an ancestor of the host cell by a transformation event. The expression of the nucleotide sequence in the expression cassette may be under the control of a constitutive promoter or of an inducible promoter that initiates transcription only when the host cell is exposed to some particular external stimulus. In the case of a multicellular organism, such as a plant, the promoter can also be specific to a particular tissue or organ or stage of development.
The term "functional fragment" as used herein in relation to a nucleic acid or protein sequence means a fragment or portion of the sequence that retains the function of the full length sequence.
Gene: the term "gene" is used broadly to refer to any segment of DNA
associated with a biological function. Thus, genes include coding sequences and/or the regulatory sequences required for their expression. Genes also include nonexpressed DNA segments that, for example, form recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.
Heterologous/exogenous: The terms "heterologous" and "exogenous"
when used herein to refer to a nucleic acid sequence (e.g. a DNA sequence) or a gene, refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling. The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides.
A "homologous" nucleic acid (e.g. DNA) sequence is a nucleic acid (e.g. DNA) sequence naturally associated with a host cell into which it is introduced.
Hybridization: The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially"
refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.
Inhibitor: a chemical substance that inactivates the enzymatic activity of a protein such as a biosynthetic enzyme, receptor, signal transduction protein, structural gene product, or transport protein. The term "herbicide" (or "herbicidal compound") is used herein to define an inhibitor applied to a plant at any stage of development, whereby the herbicide inhibits the growth of the plant or kills the plant.
Interaction: quality or state of mutual action such that the effectiveness or toxicity of one protein or compound on another protein is inhibitory (antagonists) or enhancing (agonists).
A nucleic acid sequence is "isocoding with" a reference nucleic acid sequence when the nucleic acid sequence encodes a polypeptide having the same amino acid sequence as the polypeptide encoded by the reference nucleic acid sequence.
Isogenic: plants that are genetically identical, except that they may differ by the presence or absence of a heterologous DNA sequence.
Isolated: in the context of the present invention, an isolated DNA
molecule or an isolated enzyme is a DNA molecule or enzyme that, by human intervention, exists apart from its native environment and is therefore not a product of nature. An isolated DNA molecule or enzyme may exist in a purified form or may exist in a non-native environment such as, for example, in a transgenic host cell.
Mature protein: protein from which the transit peptide, signal peptide, and/or propeptide portions have been removed.
Minimal Promoter: the smallest piece of a promoter, such as a TATA
element, that can support any transcription. A minimal promoter typically has greatly reduced promoter activity in the absence of upstream activation. In the presence of a suitable transcription factor, the minimal promoter functions to permit transcription.
Modified Enzyme Activity: enzyme activity different from that which naturally occurs in a plant (i.e. enzyme activity that occurs naturally in the absence of direct or indirect manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally occurring enzyme activity.
Native: refers to a gene that is present in the genome of an untransformed plant cell.
Naturally occurring: the term "naturally occurring" is used to describe an object that can be found in nature as distinct from being artificially produced by man. For example, a protein or nucleotide sequence present in an organism (including a virus), which can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory, is naturally occurring.
Nucleic acid: the term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and complementary sequences and as well as the sequence explicitly indicated.
Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19: 5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:
2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8: 91-98 (1994)). The terms "nucleic acid" or "nucleic acid sequence" may also be used interchangeably with gene, cDNA, and mRNA encoded by a gene.
"ORF" means open reading frame.
Percent identity: the phrases "percent identityl" or "percent identical," in the context of two nucleic acid or protein sequences, refers to two or more sequences or subsequences that have for example 60%, specifically 70%, more specifically 80%, still more specifically 90%, even more specifically 95%, and most specifically at least 99% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. Specifically, the percent identity exists over a region of the sequences that is at least about 50 residues in length, more specifically over a region of at least about 100 residues, and most specifically the percent identity exists over at least about 150 residues. In an especially specific embodiment, the percent identity exists over the entire length of the coding regions.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appi. Math.
2: 482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of Pearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual inspection (see generally, Ausubel et al., infra).
One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215: 403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., 1990). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when the cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below due to the accumulation of one or more negative-scoring residue alignments, or the end of either sequence is reached. The BLAST
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP
program uses as defaults a wordiength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA 89: 10915 (1989)).
In addition to calculating percent sequence identity, the BLAST
algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:
5873-5787 (1993)). One measure of similarity provided by the BLAST
algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic acid sequence is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid sequence to the reference nucleic acid sequence is less than about 0.1, more specifically less than about 0.01, and most specifically less than about 0.001.
Pre-protein: protein that is normally targeted to a cellular organelle, such as a chloroplast, and still comprises its native transit peptide.
Purified: the term "purified," when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It is specifically in a homogeneous state although it can be in either a dry or aqueous solution.
Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel.
Particularly, it means that the nucleic acid or protein is at least about 50%
pure, more specifically at least about 85% pure, and most specifically at least about 99% pure.
Two nucleic acids are "recombined" when sequences from each of the two nucleic acids are combined in a progeny nucleic acid. Two sequences are "directly" recombined when both of the nucleic acids are substrates for recombination. Two sequences are "indirectly recombined" when the sequences are recombined using an intermediate such as a cross-over oligonucleotide. For indirect recombination, no more than one of the sequences is an actual substrate for recombination, and in some cases, neither sequence is a substrate for recombination.
"Regulatory elements" refer to sequences involved in controlling the expression of a nucleotide sequence. Regulatory elements comprise a promoter operatively linked to the nucleotide sequence of interest and termination signals. They also typically encompass sequences required for proper translation of the nucleotide sequence.
Significant Increase: an increase in enzymatic activity that is larger than the margin of error inherent in the measurement technique, specifically an increase by about 2-fold or greater of the activity of the wild-type enzyme in the presence of the inhibitor, more specifically an increase by about 5-fold or greater, and most specifically an increase by about 10-fold or greater.
Significantly less: means that the amount of a product of an enzymatic reaction is reduced by more than the margin of error inherent in the measurement technique, specifically a decrease by about 2-fold or greater of the activity of the wild-type enzyme in the absence of the inhibitor, more specifically an decrease by about 5-fold or greater, and most specifically an decrease by about 10-fold or greater.
Specific Binding/Immunological Cross-Reactivity: An indication that two nucleic acid sequences or proteins are substantially identical is that the protein encoded by the first nucleic acid is immunologically cross reactive with, or specifically binds to, the protein encoded by the second nucleic acid.
Thus, a protein is typically substantially identical to a second protein, for example, where the two proteins differ only by conservative substitutions.
The phrase "specifically (or selectively) binds to an antibody," or "specifically (or selectively) immunoreactive with," when referring to a protein or peptide, refers to a binding reaction which is determinative of the presence of the protein in the presence of a heterogeneous population of proteins and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. For example, antibodies raised to the protein with the amino acid sequence encoded by any of the nucleic acid sequences of the invention can be selected to obtain antibodies specifically immunoreactive with that protein and not with other proteins except for polymorphic variants. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein. For example, solid-phase ELISA
immunoassays, Western blots, or immunohistochemistry are routinely used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York "Harlow and Lane"), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity. Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.
"Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern hybridizations are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays" Elsevier, New York. Generally, highly stringent hybridization and wash conditions are selected to be about 5 C
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Typically, under "stringent conditions" a probe will hybridize to its target subsequence, but to no other sequences.
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe.
Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42 C, with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.1 5M NaCI at 72 C for about minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65 C for 15 minutes (see, Sambrook, infra, for a description of SSC buffer).
Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a 15 duplex of, e.g., more than 100 nucleotides, is lx SSC at 45 C for 15 minutes.
An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40 C for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30 C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
The following are examples of sets of hybridization/wash conditions that may be used to clone nucleotide sequences that are homologues of reference nucleotide sequences of the present invention: a reference nucleotide sequence specifically hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 2X SSC, 0.1% SDS at 50 C, more desirably in 7%
sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 1X SSC, 0.1% SDS at 50 C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 0.5X SSC, 0.1% SDS at 50 C, specifically in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 0.1X SSC, 0.1%
SDS at 50 C, more specifically in 7% sodium dodecyl sulfate (SDS), 0.5 M
NaPO4, 1 mM EDTA at 50 C with washing in 0.1X SSC, 0.1% SDS at 65 C.
A "subsequence" refers to a sequence of nucleic acids or amino acids that comprise a part of a longer sequence of nucleic acids or amino acids (e.g., protein) respectively.
Substantial similarity: The term "substantial similarity" in the context of two nucleic acid or protein sequences, refers to two or more sequences or subsequences that are substantially similar, for example that have 50%, specifically 60%, more specifically 70%, even more specifically 80%, still more specifically 90%, further more specifically 95%, and most specifically 99%
sequence identity.
Substrate: a substrate is the molecule that an enzyme naturally recognizes and converts to a product in the biochemical pathway in which the enzyme naturally carries out its function, or is a modified version of the molecule, which is also recognized by the enzyme and is converted by the enzyme to a product in an enzymatic reaction similar to the naturally-occurring reaction.
Transformation: a process for introducing heterologous DNA into a plant cell, plant tissue, or plant. Transformed plant cells, plant tissue, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof.
"Transformed," "transgenic," and "recombinant" refer to a host organism such as a bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome of the host or the nucleic acid molecule can also be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof. A "non-transformed," "non-transgenic," or "non-recombinant" host refers to a wild-type organism, e.g., a bacterium or plant, which does not contain the heterologous nucleic acid molecule.
Viability: "viability" as used herein refers to a fitness parameter of a plant. Plants are assayed for their homozygous performance of plant development, indicating which proteins are essential for plant growth.
DETAILED DESCRIPTION OF THE INVENTION
1. General Description of Trait Functional Genomics The goal of functional genomics is to identify genes controlling expression of organismal phenotypes, and employs a variety of methodologies, including but not limited to bioinformatics, gene expression studies, gene and gene product interactions, genetics, biochemistry and molecular genetics. For example, bioinformatics can assign function to a given gene by identifying genes in heterologous organisms with a high degree of similarity (homology) at the amino acid or nucleotide level. Expression of a gene at the mRNA or protein levels can assign function by linking expression of a gene to an environmental response, a developmental process or a genetic (mutational) or molecular genetic (gene overexpression or underexpression) perturbation. Expression of a gene at the mRNA level can be ascertained either alone (Northern analysis) or in concert with other genes (microarray analysis), whereas expression of a gene at the protein level can be ascertained either alone (native or denatured protein gel or immunoblot analysis) or in concert with other genes (proteomic analysis). Knowledge of protein/protein and protein/DNA interactions can assign function by identifying proteins and nucleic acid sequences acting together in the same biological process. Genetics can assign function to a gene by demonstrating that DNA
lesions (mutations) in the gene have a quantifiable effect on the organism, including but not limited to: its development; hormone biosynthesis and response; growth and growth habit (plant architecture); mRNA expression profiles; protein expression profiles; ability to resist diseases; tolerance of abiotic stresses; ability to acquire nutrients; photosynthetic efficiency;
altered primary and secondary metabolism; and the composition of various plant organs. Biochemistry can assign function by demonstrating that the protein encoded by the gene, typically when expressed in a heterologous organism, possesses a certain enzymatic activity, alone or in combination with other proteins. Molecular genetics can assign function by overexpressing or underexpressing the gene in the native plant or in heterologous organisms, and observing quantifiable effects as described in functional assignment by genetics above. In functional genomics, any or all of these approaches are utilized, often in concert, to assign genes to functions across any of a number of organismal phenotypes.
It is recognized by those skilled in the art that these different methodologies can each provide data as evidence for the function of a particular gene, and that such evidence is stronger with increasing amounts of data used for functional assignment: specifically from a single methodology, more specifically from two methodologies, and even more specifically from more than two methodologies. In addition, those skilled in the art are aware that different methodologies can differ in the strength of the evidence for the assignment of gene function. Typically, but not always, a datum of biochemical, genetic and molecular genetic evidence is considered stronger than a datum of bioinformatic or gene expression evidence. Finally, those skilled in the art recognize that, for different genes, a single datum from a single methodology can differ in terms of the strength of the evidence provided by each distinct datum for the assignment of the function of these different genes.
The objective of crop trait functional genomics is to identify crop trait genes, i.e. genes capable of conferring useful agronomic traits in crop plants.
Such agronomic traits include, but are not limited to: enhanced yield, whether in quantity or quality; enhanced nutrient acquisition and enhanced metabolic efficiency; enhanced or altered nutrient composition of plant tissues used for food, feed, fiber or processing; enhanced utility for agricultural or industrial processing; enhanced resistance to plant diseases; enhanced tolerance of adverse environmental conditions (abiotic stresses) including but not limited to drought, excessive cold, excessive heat, or excessive soil salinity or extreme acidity or alkalinity; and alterations in plant architecture or development, including changes in developmental timing. The deployment of such identified trait genes by either transgenic or non-transgenic means could materially improve crop plants for the benefit of agriculture.
Cereals are the most important crop plants on the planet, in terms of both human and animal consumption. Genomic synteny (conservation of gene order within large chromosomal segments) is observed in rice, maize, wheat, barley, rye, oats and other agriculturally important monocots, which facilitates the mapping and isolation of orthologous genes from diverse cereal species based on the sequence of a single cereal gene. Rice has the smallest (- 420 Mb) genome among the cereal grains, and has recently been a major focus of public and private genomic and EST sequencing efforts.
To identify crop trait genes in the rice [wheat] genome controlling [trait], genes from the rice draft genome sequence [wheat EST databases] were prioritized based on one or more functional genomic methodologies. For example, genome-wide expression studies of rice plants infected with rice blast fungus (Magnaporthe grisea) were used to prioritize candidate genes controlling disease resistance. Full-length and partial cDNAs of rice trait gene candidates could then be predicted based on analysis of the rice whole-genome sequence, and isolated by designing and using primers for PCR
amplification using a commercially available PCR primer-picking program.
Primers were used for PCR amplification of full-length or partial cDNAs from rice cDNA libraries or first-strand cDNA. cDNA clones resulting from either approach were used for the construction of vectors designed for altering expression of these genes in transgenic plants using plant molecular genetic methodologies, which are described in detail below. Alteration of plant phenotype through overexpression or underexpression of key trait genes in transgenic plants is a robust and established method for assigning functions to plant genes. Assays to identify transgenic plants with alterations in traits of interest are to be used to unambiguously assign the utility of these genes for the improvement of rice, and by extension, other cereals, either by transgenic or classical breeding methods.
II. Identifying, Cloning and Sequencing cDNAs The cloning and sequencing of the cDNAs of the present invention are described in Example 1.
The isolated nucleic acids and proteins of the present invention are usable over a range of plants, monocots and dicots, in particular monocots such as rice, wheat, barley and maize. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp., or teosinte. In a most specific embodiment, the cereal is rice. Other plants genera include, but are not limited to, Cucurbita, Rosa, Vitis, Juglans, Gragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, Lolium, Oryza, Avena, Hordeum, Secale, Allium, and Triticum.
The present invention also provides a method of genotyping a plant or plant part comprising a nucleic acid molecule of the present invention.
Optionally, the plant is a monocot such as, but not limited rice or wheat.
Genotyping provides a means of distinguishing homologs of a chromosome pari and can be used to differentiate segregants in a plant population.
Molecular marker methods can be used in phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomeal segments affecting mongenic traits, map based cloning, and the study of quantitative inheritance (see Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark ed., Springer-Verlag, Berlin 1997; Paterson, A.H., "The DNA Revolution", chapter 2 in Genome Mapping in Plants, Paterson, A.H. ed., Academic Press/R.G. Lands Co., Austin, Texas 1996).
The method of genotyping may employ any number of molecular marker analytical techniques such as, but not limited to, restriction length polymorphisms (RFLPs). As is well known in the art, RFLPs are produced by differences in the DNA restriction fragment lengths resulting from nucleotide differences between alleles of the same gene. Thus, the present invention provides a method of following segregation of a gene or nucleic acid of the present invention or chromosomal sequences genetically linked by using RFLP analysis. Linked chromosomal sequences are within 50 centiMorgans (50 cM), within 40 or 30 cM, specifically within 20 or 10 cM, more specifically within 5, 3, 2, or 1 cM of the nucleic acid of the invention.
Ill. Traits of Interest The present invention encompasses the identification and isolation of polynucleotides encoding proteins involved in sugar sensing and, ultimately, in nitrogen uptake and carbon metabolism. Altering the expression of genes related to these traits can be used to improve or modify plants and/or grain, as desired. Examples describe the isolated genes of interest and methods of analyzing the alteration of expression and their effects on the plant characteristics.
One aspect of the present invention provides compositions and methods for altering (i.e. increasing or decreasing) the level of nucleic acid molecules and polypeptides of the present invention in plants. In particular, the nucleic acid molecules and polypeptides of the invention are expressed constitutively, temporally or spatially, e.g. at developmental stages, in certain tissues, and/or quantities, which are uncharacteristic of non-recombinantly engineered plants.
Therefore, the present invention provides utility in such exemplary applications as altering the specified characteristics identified above.
VI. Controlling Gene Expression in Transgenic Plants The invention further relates to transformed cells comprising the nucleic acid molecules, transformed plants, seeds, and plant parts, and methods of modifying phenotypic traits of interest by altering the expression of the genes of the invention.
A. Modification of Coding Sequences and Adjacent Sequences The transgenic expression in plants of genes derived from heterologous sources may involve the modification of those genes to achieve and optimize their expression in plants. In particular, bacterial ORFs which encode separate enzymes but which are encoded by the same transcript in the native microbe are best expressed in plants on separate transcripts. To achieve this, each microbial ORF is isolated individually and cloned within a cassette which provides a plant promoter sequence at the 5' end of the ORF
and a plant transcriptional terminator at the 3' end of the ORF. The isolated ORF sequence specifically includes the initiating ATG codon and the terminating STOP codon but may include additional sequence beyond the initiating ATG and the STOP codon. In addition, the ORF may be truncated, but still retain the required activity; for particularly long ORFs, truncated versions which retain activity may be preferable for expression in transgenic organisms. By "plant promoter" and "plant transcriptional terminator" it is intended to mean promoters and transcriptional terminators that operate within plant cells. This includes promoters and transcription terminators that may be derived from non-plant sources such as viruses (an example is the Cauliflower Mosaic Virus).
In some cases, modification to the ORF coding sequences and adjacent sequence is not required. It is sufficient to isolate a fragment containing the ORF of interest and to insert it downstream of a plant promoter.
For example, Gaffney et al. (Science 261: 754-756 (1993)) have expressed the Pseudomonas nahG gene in transgenic plants under the control of the CaMV 35S promoter and the CaMV tml terminator successfully without modification of the coding sequence and with nucleotides of the Pseudomonas gene upstream of the ATG still attached, and nucleotides downstream of the STOP codon still attached to the nahG ORF. Specifically, as little adjacent microbial sequence as possible should be left attached upstream of the ATG and downstream of the STOP codon. In practice, such construction may depend on the availability of restriction sites.
In other cases, the expression of genes derived from microbial sources may provide problems in expression. These problems have been well characterized in the art and are particularly common with genes derived from certain sources such as Bacillus. These problems may apply to the nucleotide sequence of this invention and the modification of these genes can be undertaken using techniques now well known in the art. The following problems may be encountered:
1. Codon Usage.
The specific codon usage in plants differs from the specific codon usage in certain microorganisms. Comparison of the usage of codons within a cloned microbial ORF to usage in plant genes (and in particular genes from the target plant) will enable an identification of the codons within the ORF
that should specifically be changed. Typically plant evolution has tended towards a strong preference of the nucleotides C and G in the third base position of monocotyledons, whereas dicotyledons often use the nucleotides A or T at this position. By modifying a gene to incorporate specific codon usage for a particular target transgenic species, many of the problems described below for GC/AT content and illegitimate splicing will be overcome.
2. GC/AT Content.
Plant genes typically have a GC content of more than 35%. ORF
sequences which are rich in A and T nucleotides can cause several problems in plants. Firstly, motifs of ATTTA are believed to cause destabilization of messages and are found at the 3' end of many short-lived mRNAs. Secondly, the occurrence of polyadenylation signals such as AATAAA at inappropriate positions within the message is believed to cause premature truncation of transcription. In addition, monocotyledons may recognize AT-rich sequences as splice sites (see below).
3. Sequences Adjacent to the Initiating Methionine.
Plants differ from microorganisms in that their messages do not possess a defined ribosome-binding site. Rather, it is believed that ribosomes attach to the 5' end of the message and scan for the first available ATG at which to start translation. Nevertheless, it is believed that there is a preference for certain nucleotides adjacent to the ATG and that expression of microbial genes can be enhanced by the inclusion of a eukaryotic consensus translation initiator at the ATG. Clontech (1993/1994 catalog, page 210, incorporated herein by reference) have suggested one sequence as a consensus translation initiator for the expression of the E. coli uidA gene in plants. Further, Joshi (N.A.R. 15: 6643-6653 (1987), incorporated herein by reference) has compared many plant sequences adjacent to the ATG and suggests another consensus sequence. In situations where difficulties are encountered in the expression of microbial ORFs in plants, inclusion of one of these sequences at the initiating ATG may improve translation. In such cases the last three nucleotides of the consensus may not be appropriate for inclusion in the modified sequence due to their modification of the second AA
residue. Specific sequences adjacent to the initiating methionine may differ between different plant species. A survey of 14 maize genes located in the GenBank database provided the following results:
Position Before the Initiating ATG in 14 Maize Genes:
This analysis can be done for the desired plant species into which the nucleotide sequence is being incorporated, and the sequence adjacent to the ATG modified to incorporate the specific nucleotides.
4. Removal of Illegitimate Splice Sites.
Genes cloned from non-plant sources and not optimized for expression in plants may also contain motifs which may be recognized in plants as 5' or 3' splice sites, and be cleaved, thus generating truncated or deleted messages. These sites can be removed using the techniques well known in the art.
Techniques for the modification of coding sequences and adjacent sequences are well known in the art. In cases where the initial expression of a microbial ORF is low and it is deemed appropriate to make alterations to the sequence as described above, then the construction of synthetic genes can be accomplished according to methods well known in the art. These are, for example, described in the published patent disclosures EP 0 385 962 (to Monsanto), EP 0 359 472 (to Lubrizol) and WO 93/07278 (to Ciba-Geigy), all of which are incorporated herein by reference. In most cases it is preferable to assay the expression of gene constructions using transient assay protocols (which are well known in the art) prior to their transfer to transgenic plants.
B. Construction of Plant Expression Cassettes Coding sequences intended for expression in transgenic plants are first assembled in expression cassettes behind a suitable promoter expressible in plants. The expression cassettes may also comprise any further sequences required or selected for the expression of the transgene. Such sequences include, but are not restricted to, transcription terminators, extraneous sequences to enhance expression such as introns, vital sequences, and sequences intended for the targeting of the gene product to specific organelles and cell compartments. These expression cassettes can then be easily transferred to the plant transformation vectors described below. The following is a description of various components of typical expression cassettes.
1. Promoters The selection of the promoter used in expression cassettes will determine the spatial and temporal expression pattern of the transgene in the transgenic plant. Selected promoters will express transgenes in specific cell types (such as leaf epidermal cells, mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, for example) and the selection will reflect the desired location of accumulation of the gene product.
Alternatively, the selected promoter may drive expression of the gene under various inducing conditions. Promoters vary in their strength, i.e., ability to promote transcription. Depending upon the host cell system utilized, any one of a number of suitable promoters can be used, including the gene's native promoter. The following are non-limiting examples of promoters that may be used in expression cassettes.
a. Constitutive Expression, the Ubiquitin Promoter:
Ubiquitin is a gene product known to accumulate in many cell types and its promoter has been cloned from several species for use in transgenic plants (e.g. sunflower - Binet et al. Plant Science 79: 87-94 (1991); maize -Christensen et al. Plant Molec. Biol. 12: 619-632 (1989); and Arabidopsis -Callis et al., J. Biol. Chem. 265:12486-12493 (1990) and Norris et al., Plant Mol. Biol. 21:895-906 (1993)). The maize ubiquitin promoter has been developed in transgenic monocot systems and its sequence and vectors constructed for monocot transformation are disclosed in the patent publication EP 0 342 926 (to Lubrizol) which is herein incorporated by reference. Taylor et al. (Plant Cell Rep. 12: 491-495 (1993)) describe a vector (pAHC25) that comprises the maize ubiquitin promoter and first intron and its high activity in cell suspensions of numerous monocotyledons when introduced via microprojectile bombardment. The Arabidopsis ubiquitin promoter is ideal for use with the nucleotide sequences of the present invention. The ubiquitin promoter is suitable for gene expression in transgenic plants, both monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of the transformation vectors described in this application, modified by the introduction of the appropriate ubiquitin promoter and/or intron sequences.
b. Constitutive Expression, the CaMV 35S Promoter:
Construction of the plasmid pCGN1761 is described in the published patent application EP 0 392 225 (Example 23), which is hereby incorporated by reference. pCGN1761 contains the "double" CaMV 35S promoter and the tml transcriptional terminator with a unique EcoRl site between the promoter and the terminator and has a pUC-type backbone. A derivative of pCGN1761 is constructed which has a modified polylinker which includes Notl and Xhol sites in addition to the existing EcoRl site. This derivative is designated pCGN 1761 ENX. pCGN 1761 ENX is useful for the cloning of cDNA sequences or coding sequences (including microbial ORF sequences) within its polylinker for the purpose of their expression under the control of the 35S promoter in transgenic plants. The entire 35S promoter-coding sequence-tml terminator cassette of such a construction can be excised by Hindlil, Sphl, Sall, and Xbal sites 5' to the promoter and Xbal, BamHl and BgII sites 3' to the terminator for transfer to transformation vectors such as those described below. Furthermore, the double 35S promoter fragment can be removed by 5' excision with Hindlll, Sphl, Sall, Xbal, or Pstl, and 3' excision with any of the polylinker restriction sites (EcoRl, Notl or Xhol) for replacement with another promoter. If desired, modifications around the cloning sites can be made by the introduction of sequences that may enhance translation. This is particularly useful when overexpression is desired. For example, pCGN1761ENX may be modified by optimization of the translational initiation site as described in Example 37 of U.S. Patent No. 5,639,949, incorporated herein by reference.
c. Constitutive Expression, the Actin Promoter:
Several isoforms of actin are known to be expressed in most cell types and consequently the actin promoter is a good choice for a constitutive promoter. In particular, the promoter from the rice Actl gene has been cloned and characterized (McElroy et al. Plant Cell 2: 163-171 (1990)). A 1.3kb fragment of the promoter was found to contain all the regulatory elements required for expression in rice protoplasts. Furthermore, numerous expression vectors based on the Actl promoter have been constructed specifically for use in monocotyledons (McElroy et al. Mol. Gen. Genet. 231:
150-160 (1991)). These incorporate the Actl-intron 1, Adhl 5' flanking sequence and Adhi-intron 1(from the maize alcohol dehydrogenase gene) and sequence from the CaMV 35S promoter. Vectors showing highest expression were fusions of 35S and Actl intron or the Actl 5' flanking sequence and the Actl intron. Optimization of sequences around the initiating ATG (of the GUS reporter gene) also enhanced expression. The promoter expression cassettes described by McElroy et al. (Mol. Gen. Genet. 231: 150-160 (1991)) can be easily modified for gene expression and are particularly suitable for use in monocotyledonous hosts. For example, promoter-containing fragments is removed from the McElroy constructions and used to replace the double 35S promoter in pCGN1761ENX, which is then available for the insertion of specific gene sequences. The fusion genes thus constructed can then be transferred to appropriate transformation vectors. In a separate report, the rice Actl promoter with its first intron has also been found to direct high expression in cultured barley cells (Chibbar et al. Plant Cell Rep. 12: 506-509 (1993)).
d. Inducible Expression, PR-1 Promoters:
The double 35S promoter in pCGN1761 ENX may be replaced with any other promoter of choice that will result in suitably high expression levels.
By way of example, one of the chemically regulatable promoters described in U.S. Patent No. 5,614,395, such as the tobacco PR-1a promoter, may replace the double 35S promoter. Alternately, the Arabidopsis PR-1 promoter described in Lebel et al., Plant J. 16:223-233 (1998) may be used. The promoter of choice is specifically excised from its source by restriction enzymes, but can alternatively be PCR-amplified using primers that carry appropriate terminal restriction sites. Should PCR-amplification be undertaken, the promoter should be re-sequenced to check for amplification errors after the cloning of the amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR-la promoter is cleaved from plasmid pCIB1004 (for construction, see example 21 of EP 0 332 104, which is hereby incorporated by reference) and transferred to plasmid pCGN1761ENX (Uknes et al., Plant Cell 4: 645-656 (1992)). pCIB1004 is cleaved with Ncol and the resultant 3' overhang of the linearized fragment is rendered blunt by treatment with T4 DNA polymerase. The fragment is then cleaved with Hindlll and the resultant PR-la promoter-containing fragment is gel purified and cloned into pCGN1761ENX from which the double 35S
promoter has been removed. This is accomplished by cleavage with Xhol and blunting with T4 polymerase, followed by cleavage with Hindlll, and isolation of the larger vector-terminator containing fragment into which the pCIB1004 promoter fragment is cloned. This generates a pCGN1761ENX
derivative with the PR-la promoter and the tml terminator and an intervening polylinker with unique EcoRl and Notl sites. The selected coding sequence can be inserted into this vector, and the fusion products (i.e. promoter-gene-terminator) can subsequently be transferred to any selected transformation vector, including those described infra. Various chemical regulators may be employed to induce expression of the selected coding sequence in the plants transformed according to the present invention, including the benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in U.S. Patent Nos. 5,523,311 and 5,614,395.
e. Inducible Expression, an Ethanol-Inducible Promoter:
A promoter inducible by certain alcohols or ketones, such as ethanol, may also be used to confer inducible expression of a coding sequence of the present invention. Such a promoter is for example the alcA gene promoter from Aspergillus nidulans (Caddick et al. (1998) Nat. Biotechnol 16:177-180).
In A. nidulans, the alcA gene encodes alcohol dehydrogenase I, the expression of which is regulated by the AIcR transcription factors in presence of the chemical inducer. For the purposes of the present invention, the CAT
coding sequences in plasmid palcA:CAT comprising a alcA gene promoter sequence fused to a minimal 35S promoter (Caddick et al. (1998) Nat.
Biotechnol 16:177-180) are replaced by a coding sequence of the present invention to form an expression cassette having the coding sequence under the control of the alcA gene promoter. This is carried out using methods well known in the art.
f. Inducible Expression, a Glucocorticoid-Inducible Promoter:
Induction of expression of a nucleic acid sequence of the present invention using systems based on steroid hormones is also contemplated.
For example, a glucocorticoid-mediated induction system is used (Aoyama and Chua (1997) The Plant Journal 11: 605-612) and gene expression is induced by application of a glucocorticoid, for example a synthetic glucocorticoid, specifically dexamethasone, specifically at a concentration ranging from 0.1 mM to 1mM, more specifically from 10mM to 100mM. For the purposes of the present invention, the luciferase gene sequences are replaced by a nucleic acid sequence of the invention to form an expression cassette having a nucleic acid sequence of the invention under the control of six copies of the GAL4 upstream activating sequences fused to the 35S
minimal promoter. This is carried out using methods well known in the art.
The trans-acting factor comprises the GAL4 DNA-binding domain (Keegan et al. (1986) Science 231: 699-704) fused to the transactivating domain of the herpes viral protein VP16 (Triezenberg et al. (1988) Genes Devel. 2: 718-729) fused to the hormone-binding domain of the rat glucocorticoid receptor (Picard et al. (1988) Cell 54: 1073-1080). The expression of the fusion protein is controlled either by a promoter known in the art or described here. This expression cassette is also comprised in the plant comprising a nucleic acid sequence of the invention fused to the 6xGAL4/minimal promoter. Thus, tissue- or organ-specificity of the fusion protein is achieved leading to inducible tissue- or organ-specificity of the insecticidal toxin.
g. Root Specific Expression:
Another pattern of gene expression is root expression. A suitable root promoter is the promoter of the maize metallothionein-like (MTL) gene described by de Framond (FEBS 290: 103-106 (1991)) and also in U.S.
Patent No. 5,466,785, incorporated herein by reference. This "MTL" promoter is transferred to a suitable vector such as pCGN1761 ENX for the insertion of a selected gene and subsequent transfer of the entire promoter-gene-terminator cassette to a transformation vector of interest.
h. Wound-Inducible Promoters:
Wound-inducible promoters may also be suitable for gene expression.
Numerous such promoters have been described (e.g. Xu et al. Plant Molec.
Biol. 22: 573-588 (1993), Logemann et al. Plant Cell 1: 151-158 (1989), Rohrmeier & Lehle, Plant Molec. Biol. 22: 783-792 (1993), Firek et al. Plant Molec. Biol. 22: 129-142 (1993), Warner et al. Plant J. 3: 191-201 (1993)) and all are suitable for use with the instant invention. Logemann et al. describe the 5' upstream sequences of the dicotyledonous potato wunl gene. Xu et al.
show that a wound-inducible promoter from the dicotyledon potato (pin2) is active in the monocotyledon rice. Further, Rohrmeier & Lehle describe the cloning of the maize Wipl cDNA which is wound induced and which can be used to isolate the cognate promoter using standard techniques. Similar, Firek et al. and Warner et al. have described a wound-induced gene from the monocotyledon Asparagus officinalis, which is expressed at local wound and pathogen invasion sites. Using cloning techniques well known in the art, these promoters can be transferred to suitable vectors, fused to the genes pertaining to this invention, and used to express these genes at the sites of plant wounding.
i. Pith-Specific Expression:
Patent Application WO 93/07278, which is herein incorporated by reference, describes the isolation of the maize trpA gene, which is preferentially expressed in pith cells. The gene sequence and promoter extending up to -1726 bp from the start of transcription are presented. Using standard molecular biological techniques, this promoter, or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 35S
promoter and be used to drive the expression of a foreign gene in a pith-specific manner. In fact, fragments containing the pith-specific promoter or parts thereof can be transferred to any vector and modified for utility in transgenic plants.
j. Leaf-Specific Expression:
A maize gene encoding phosphoenol carboxylase (PEPC) has been described by Hudspeth & Grula (Plant Molec Biol 12: 579-589 (1989)). Using standard molecular biological techniques the promoter for this gene can be used to drive the expression of any gene in a leaf-specific manner in transgenic plants.
k. Pollen-Specific Expression:
WO 93/07278 describes the isolation of the maize calcium-dependent protein kinase (CDPK) gene which is expressed in pollen cells. The gene sequence and promoter extend up to 1400 bp from the start of transcription.
Using standard molecular biological techniques, this promoter or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a nucleic acid sequence of the invention in a pollen-specific manner.
2. Transcriptional Terminators A variety of transcriptional terminators are available for use in expression cassettes. These are responsible for the termination of transcription beyond the transgene and correct mRNA polyadenylation.
Appropriate transcriptional terminators are those that are known to function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline synthase terminator and the pea rbcS E9 terminator. These can be used in both monocotyledons and dicotyledons. In addition, a gene's native transcription terminator may be used.
3. Sequences for the Enhancement or Regulation of Expression Numerous sequences have been found to enhance gene expression from within the transcriptional unit and these sequences can be used in conjunction with the genes of this invention to increase their expression in transgenic plants.
Various intron sequences have been shown to enhance expression, particularly in monocotyledonous cells. For example, the introns of the maize Adhl gene have been found to significantly enhance the expression of the wild-type gene under its cognate promoter when introduced into maize cells.
Intron 1 was found to be particularly effective and enhanced expression in fusion constructs with the chloramphenicol acetyltransferase gene (Callis et al., Genes Develop. 1: 1183-1200 (1987)). In the same experimental system, the intron from the maize bronzel gene had a similar effect in enhancing expression. Intron sequences have been routinely incorporated into plant transformation vectors, typically within the non-translated leader.
A number of non-translated leader sequences derived from viruses are also known to enhance expression, and these are particularly effective in dicotyledonous cells. Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the "W-sequence"), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing expression (e.g. Gallie et al. Nucl. Acids Res. 15: 8693-8711 (1987); Skuzeski et al. Plant Molec. Biol. 15: 65-79 (1990)). Other leader sequences known in the art include but are not limited to: picornavirus leaders, for example, EMCV
leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein, 0., Fuerst, T.
R., and Moss, B. PNAS USA 86:6126-6130 (1989)); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al., 1986); MDMV
leader (Maize Dwarf Mosaic Virus); Virology 154:9-20); human immunoglobulin heavy-chain binding protein (BiP) leader, (Macejak, D. G., and Sarnow, P., Nature 353: 90-94 (1991); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4), (Jobling, S. A., and Gehrke, L., Nature 325:622-625 (1987); tobacco mosaic virus leader (TMV), (Gallie, D. R. et al., Molecular Biology of RNA, pages 237-256 (1989); and Maize Chlorotic Mottle Virus leader (MCMV) (Lommel, S. A. et al., Virology 81:382-385 (1991). See also, Della-Cioppa et al., Plant Physiology 84:965-968 (1987).
In addition to incorporating one or more of the aforementioned elements into the 5' regulatory region of a target expression cassette of the invention, other elements peculiar to the target expression cassette may also be incorporated. Such elements include but are not limited to a minimal promoter. By minimal promoter it is intended that the basal promoter elements are inactive or nearly so without upstream activation. Such a promoter has low background activity in plants when there is no transactivator present or when enhancer or response element binding sites are absent. One minimal promoter that is particularly useful for target genes in plants is the Bzl minimal promoter, which is obtained from the bronzel gene of maize. The Bzl core promoter is obtained from the "myc" mutant Bzl-luciferase construct pBzlLucR98 via cleavage at the Nhel site located at -53 to -58. Roth et al., Plant Cell 3: 317 (1991). The derived Bzl core promoter fragment thus extends from -53 to +227 and includes the Bzl intron-1 in the 5' untranslated region. Also useful for the invention is a minimal promoter created by use of a synthetic TATA element. The TATA element allows recognition of the promoter by RNA polymerase factors and confers a basal level of gene expression in the absence of activation (see generally, Mukumoto (1993) Plant Mol Biol 23: 995-1003; Green (2000) Trends Biochem Sci 25: 59-63) 4. Targeting of the Gene Product Within the Cell Various mechanisms for targeting gene products are known to exist in plants and the sequences controlling the functioning of these mechanisms have been characterized in some detail. For example, the targeting of gene products to the chloroplast is controlled by a signal sequence found at the amino terminal end of various proteins which is cleaved during chloroplast import to yield the mature protein (e.g. Comai et al. J. Biol. Chem. 263:
15104-15109 (1988)). These signal sequences can be fused to heterologous gene products to effect the import of heterologous products into the chioroplast (van den Broeck, et al. Nature 313: 358-363 (1985)). DNA
encoding for appropriate signal sequences can be isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB protein, the EPSP
synthase enzyme, the GS2 protein and many other proteins which are known to be chloroplast localized. See also, the section entitled "Expression With Chloroplast Targeting" in Example 37 of U.S. Patent No. 5,639,949.
Other gene products are localized to other organelles such as the mitochondrion and the peroxisome (e.g. Unger et al. Plant Molec. Biol. 13:
411-418 (1989)). The cDNAs encoding these products can also be manipulated to effect the targeting of heterologous gene products to these organelles. Examples of such sequences are the nuclear-encoded ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting cellular protein bodies has been described by Rogers et al. (Proc. Natl. Acad.
Sci. USA 82: 6512-6516 (1985)).
In addition, sequences have been characterized which cause the targeting of gene products to other cell compartments. Amino terminal sequences are responsible for targeting to the ER, the apoplast, and extracellular secretion from aleurone cells (Koehler & Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences in conjunction with carboxy terminal sequences are responsible for vacuolar targeting of gene products (Shinshi et al. Plant Molec. Biol. 14: 357-368 (1990)).
By the fusion of the appropriate targeting sequences described above to transgene sequences of interest it is possible to direct the transgene product to any organelle or cell compartment. For chloroplast targeting, for example, the chloroplast signal sequence from the RUBISCO gene, the CAB
gene, the EPSP synthase gene, or the GS2 gene is fused in frame to the amino terminal ATG of the transgene. The signal sequence selected should include the known cleavage site, and the fusion constructed should take into account any amino acids after the cleavage site which are required for cleavage. In some cases this requirement may be fulfilled by the addition of a small number of amino acids between the cleavage site and the transgene ATG or, alternatively, replacement of some amino acids within the transgene sequence. Fusions constructed for chloroplast import can be tested for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions followed by in vitro chloroplast uptake using techniques described by Bartlett et al. In: Edelmann et al. (Eds.) Methods in Chloroplast Molecular Biology, Elsevier pp 1081-1091 (1982) and Wasmann et al. Mol.
Gen. Genet. 205: 446-453 (1986). These construction techniques are well known in the art and are equally applicable to mitochondria and peroxisomes.
The above-described mechanisms for cellular targeting can be utilized not only in conjunction with their cognate promoters, but also in conjunction with heterologous promoters so as to effect a specific cell-targeting goal under the transcriptional regulation of a promoter that has an expression pattern different to that of the promoter from which the targeting signal derives.
C. Construction of Plant Transformation Vectors Numerous transformation vectors available for plant transformation are known to those of ordinary skill in the plant transformation arts, and the genes pertinent to this invention can be used in conjunction with any such vectors.
The selection of vector will depend upon the specific transformation technique and the target species for transformation. For certain target species, different antibiotic or herbicide selection markers may be specific. Selection markers used routinely in transformation include the nptll gene, which confers resistance to kanamycin and related antibiotics (Messing & Vierra. Gene 19:
259-268 (1982); Bevan et al., Nature 304:184-187 (1983)), the bar gene, which confers resistance to the herbicide phosphinothricin (White et al., Nucl.
Acids Res 18: 1062 (1990), Spencer et al. Theor. Appl. Genet 79: 625-631 (1990)), the hph gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929-2931), and the dhfr gene, which confers resistance to methatrexate (Bourouis et al., EMBO J. 2(7):
1099-1104 (1983)), the EPSPS gene, which confers resistance to glyphosate (U.S. Patent Nos. 4,940,935 and 5,188,642), and the mannose-6-phosphate isomerase gene, which provides the ability to metabolize mannose (U.S.
Patent Nos. 5,767,378 and 5,994,629).
1. Vectors Suitable for Agrobacterium Transformation Many vectors are available for transformation using Agrobacterium tumefaciens. These typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, Nucl. Acids Res. (1984)). Below, the construction of two typical vectors suitable for Agrobacterium transformation is described.
a. pCIB200 and pCIB2001:
The binary vectors pCIB200 and pCIB2001 are used for the construction of recombinant vectors for use with Agrobacterium and are constructed in the following manner. pTJS75kan is created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. Bacteriol. 164: 446-455 (1985)) allowing excision of the tetracycline-resistance gene, followed by insertion of an Accl fragment from pUC4K carrying an NPTII (Messing & Vierra, Gene 19:
259-268 (1982): Bevan et al., Nature 304: 184-187 (1983): McBride et al., Plant Molecular Biology 14: 266-276 (1990)). Xhol linkers are ligated to the EcoRV fragment of PCIB7 which contains the left and right T-DNA borders, a plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et al., Gene 53: 153-161 (1987)), and the Xhol-digested fragment are cloned into Sall-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19). pCIB200 contains the following unique polylinker restriction sites:
EcoRl, Sstl, Kpnl, BgIlI, Xbal, and Sall. pCIB2001 is a derivative of pCIB200 created by the insertion into the polylinker of additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRl, Sstl, Kpnl, Bglll, Xbal, Sall, Mlul, Bcll, Avrll, Apal, Hpal, and Stul. pCIB2001, in addition to containing these unique restriction sites also has plant and bacterial kanamycin selection, left and right T-DNA borders for Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization between E. coli and other hosts, and the OriT and OriV functions also from RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing their own regulatory signals.
b. pCIB10 and Hygromycin Selection Derivatives thereof:
The binary vector pCIB10 contains a gene encoding kanamycin resistance for selection in plants and T-DNA right and left border sequences and incorporates sequences from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its construction is described by Rothstein et al. (Gene 53: 153-161 (1987)). Various derivatives of pCIB10 are constructed which incorporate the gene for hygromycin B
phosphotransferase described by Gritz et al. (Gene 25: 179-188 (1983)).
These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin (pCIB715, pCIB717).
2. Vectors Suitable for non-Agrobacterium Transformation Transformation without the use of Agrobacterium tumefaciens circumvents the requirement for T-DNA sequences in the chosen transformation vector and consequently vectors lacking these sequences can be utilized in addition to vectors such as the ones described above which contain T-DNA sequences. Transformation techniques that do not rely on Agrobacterium include transformation via particle bombardment, protoplast uptake (e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on the specific selection for the species being transformed. Below, the construction of typical vectors suitable for non-Agrobacterium transformation is described.
a. pCIB3064:
pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in combination with selection by the herbicide basta (or phosphinothricin). The plasmid pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV 35S
transcriptional terminator and is described in the PCT published application WO 93/07278. The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites are mutated using standard PCR techniques in such a way as to remove the ATGs and generate the restriction sites Sspl and Pvull. The new restriction sites are 96 and 37 bp away from the unique Sall site and 101 and 42 bp away from the actual start site. The resultant derivative of pCIB246 is designated pCIB3025. The GUS gene is then excised from pCIB3025 by digestion with Sall and Sacl, the termini rendered blunt and religated to generate plasmid pCIB3060. The plasmid pJIT82 is obtained from the John Innes Centre, Norwich and the a 400 bp Smal fragment containing the bar gene from Streptomyces viridochromogenes is excised and inserted into the Hpal site of pCIB3060 (Thompson et al. EMBO J
6: 2519-2523 (1987)). This generated pCIB3064, which comprises the bar gene under the control of the CaMV 35S promoter and terminator for herbicide selection, a gene for ampicillin resistance (for selection in E.
coli) and a polylinker with the unique sites Sphl, Pstl, Hindill, and BamHl. This vector is suitable for the cloning of plant expression cassettes containing their own regulatory signals.
b. pSOG19 and pSOG35:
pSOG35 is a transformation vector that utilizes the E. coli gene dihydrofolate reductase (DFR) as a selectable marker conferring resistance to methotrexate. PCR is used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adh1 gene (-550 bp) and 18 bp of the GUS untransiated leader sequence from pSOG10. A 250-bp fragment encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and these two PCR fragments are assembled with a Sacl-Pstl fragment from pB1221 (Clontech) which comprises the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these fragments generates pSOG19 which contains the 35S promoter in fusion with the intron 6 sequence, the GUS
leader, the DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generates the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have Hindill, Sphl, Pstl and EcoRl sites available for the cloning of foreign substances.
3. Vector Suitable for Chloroplast Transformation For expression of a nucleotide sequence of the present invention in plant plastids, plastid transformation vector pPH143 (WO 97/32011, example 36) is used. The nucleotide sequence is inserted into pPH143 thereby replacing the PROTOX coding sequence. This vector is then used for plastid transformation and selection of transformants for spectinomycin resistance.
Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. In this case, transformants are selected for resistance to PROTOX inhibitors.
D. Transformation Once a nucleic acid sequence of the invention has been cloned into an expression system, it is transformed into a plant cell. The receptor and target expression cassettes of the present invention can be introduced into the plant cell in a number of art-recognized ways. Methods for regeneration of plants are also well known in the art. For example, Ti plasmid vectors have been utilized for the delivery of foreign DNA, as well as direct DNA uptake, liposomes, electroporation, microinjection, and microprojectiles. In addition, bacteria from the genus Agrobacterium can be utilized to transform plant cells.
Below are descriptions of representative techniques for transforming both dicotyledonous and monocotyledonous plants, as well as a representative plastid transformation technique.
1. Transformation of Dicotyledons Transformation techniques for dicotyledons are well known in the art and include Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non-Agrobacterium techniques involve the uptake of exogenous genetic material directly by protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, particle bombardment-mediated delivery, or microinjection. Examples of these techniques are described by Paszkowski et al., EMBO J 3: 2717-2722 (1984), Potrykus et al., Mol. Gen. Genet. 199: 169-177 (1985), Reich et al., Biotechnology 4: 1001-1004 (1986), and Klein et al., Nature 327: 70-73 (1987). In each case the transformed cells are regenerated to whole plants using standard techniques known in the art.
Agrobacterium-mediated transformation is a specific technique for transformation of dicotyledons because of its high efficiency of transformation and its broad utility with many different species. Agrobacterium transformation typically involves the transfer of the binary vector carrying the foreign DNA of interest (e.g. pCIB200 or pCIB2001) to an appropriate Agrobacterium strain which may depend of the complement of vir genes carried by the host Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g. strain CIB542 for pCIB200 and pCIB2001 (Uknes et al.
Plant Cell 5: 159-169 (1993)). The transfer of the recombinant binary vector to Agrobacterium is accomplished by a triparental mating procedure using E.
coli carrying the recombinant binary vector, a helper E. coli strain which carries a plasmid such as pRK2013 and which is able to mobilize the recombinant binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary vector can be transferred to Agrobacterium by DNA
transformation (Hofgen & Willmitzer, Nucl. Acids Res. 16: 9877 (1988)).
Transformation of the target plant species by recombinant Agrobacterium usually involves co-cultivation of the Agrobacterium with explants from the plant and follows protocols well known in the art.
Transformed tissue is regenerated on selectable medium carrying the antibiotic or herbicide resistance marker present between the binary plasmid T-DNA borders.
Another approach to transforming plant cells with a gene involves propelling inert or biologically active particles at plant tissues and cells.
This technique is disclosed in U.S. Patent Nos. 4,945,050, 5,036,006, and 5,100,792 all to Sanford et al. Generally, this procedure involves propelling inert or biologically active particles at the cells under conditions effective to penetrate the outer surface of the cell and afford incorporation within the interior thereof. When inert particles are utilized, the vector can be introduced into the cell by coating the particles with the vector containing the desired gene. Alternatively, the target cell can be surrounded by the vector so that the vector is carried into the cell by the wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium or a bacteriophage, each containing DNA sought to be introduced) can also be propelled into plant cell tissue.
2. Transformation of Monocotyledons Transformation of most monocotyledon species has now also become routine. Specific techniques include direct gene transfer into protoplasts using PEG or electroporation techniques, and particle bombardment into callus tissue. Transformations can be undertaken with a single DNA species or multiple DNA species (i.e. co-transformation) and both these techniques are suitable for use with this invention. Co-transformation may have the advantage of avoiding complete vector construction and of generating transgenic plants with unlinked loci for the gene of interest and the selectable marker, enabling the removal of the selectable marker in subsequent generations, should this be regarded desirable. However, a disadvantage of the use of co-transformation is the less than 100% frequency with which separate DNA species are integrated into the genome (Schocher et al.
Biotechnology 4: 1093-1096 (1986)).
Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques for the preparation of callus and protoplasts from an elite inbred line of maize, transformation of protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed protoplasts. Gordon-Kamm et al. (Plant Cell 2: 603-618 (1990)) and Fromm et al. (Biotechnology 8: 833-839 (1990)) have published techniques for transformation of A188-derived maize line using particle bombardment.
Furthermore, WO 93/07278 and Koziel et al. (Biotechnology 11: 194-200 (1993)) describe techniques for the transformation of elite inbred lines of maize by particle bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistics device for bombardment.
Transformation of rice can also be undertaken by direct gene transfer techniques utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been described for Japonica-types and Indica-types (Zhang et al. Plant Cell Rep 7: 379-384 (1988); Shimamoto et al. Nature 338:
274-277 (1989); Datta et al. Biotechnology 8: 736-740 (1990)). Both types are also routinely transformable using particle bombardment (Christou et al.
Biotechnology 9: 957-962 (1991)). Furthermore, WO 93/21335 describes techniques for the transformation of rice via electroporation.
Patent Application EP 0 332 581 describes techniques for the generation, transformation and regeneration of Pooideae protoplasts. These techniques allow the transformation of Dactylis and wheat. Furthermore, wheat transformation has been described by Vasil et al. (Biotechnology 10:
667-674 (1992)) using particle bombardment into cells of type C long-term regenerable callus, and also by Vasil et al. (Biotechnology 11: 1553-1558 (1993)) and Weeks et al. (Plant Physiol. 102: 1077-1084 (1993)) using particle bombardment of immature embryos and immature embryo-derived callus. A
specific technique for wheat transformation, however, involves the transformation of wheat by particle bombardment of immature embryos and includes either a high sucrose or a high maltose step prior to gene delivery.
Prior to bombardment, any number of embryos (0.75-1 mm in length) are plated onto MS medium with 3% sucrose (Murashiga & Skoog, Physiologia Plantarum 15: 473-497 (1962)) and 3 mg/I 2,4-D for induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day of bombardment, embryos are removed from the induction medium and placed onto the osmoticum (i.e. induction medium with sucrose or maltose added at the desired concentration, typically 15%). The embryos are allowed to plasmolyze for 2-3 hours and are then bombarded. Twenty embryos per target plate is typical, although not critical. An appropriate gene-carrying plasmid (such as pCIB3064 or pSG35) is precipitated onto micrometer size gold particles using standard procedures. Each plate of embryos is shot with the DuPont Biolistics helium device using a burst pressure of -1000 psi using a standard 80 mesh screen. After bombardment, the embryos are placed back into the dark to recover for about 24 hours (still on osmoticum).
After 24 hrs, the embryos are removed from the osmoticum and placed back onto induction medium where they stay for about a month before regeneration. Approximately one month later the embryo explants with developing embryogenic callus are transferred to regeneration medium (MS +
1 mg/liter NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 mg/I basta in the case of pCIB3064 and 2 mg/I methotrexate in the case of pSOG35). After approximately one month, developed shoots are transferred to larger sterile containers known as "GA7s" which contain half-strength MS, 2% sucrose, and the same concentration of selection agent.
Tranformation of monocotyledons using Agrobacterium has also been described. See, WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein by reference. See also, Negrotto et al., Plant Cell Reports 19: 798-803 (2000), incorporated herein by reference. For this example, rice (Oryza sativa) is used for generating transgenic plants. Various rice cultivars can be used (Hiei et al., 1994, Plant Journal 6:271-282; Dong et al., 1996, Molecular Breeding 2:267-276; Hiei et al., 1997, Plant Molecular Biology, 35:205-218). Also, the various media constituents described below may be either varied in quantity or substituted. Embryogenic responses are initiated and/or cultures are established from mature embryos by culturing on MS-CIM medium (MS basal salts, 4.3 g/liter; B5 vitamins (200 x), 5 mI/liter;
Sucrose, 30 g/liter; proline, 500 mg/liter; glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2,4-D (1 mg/mI), 2 mI/liter; adjust pH to 5.8 with KOH; Phytagel, 3 g/Iiter). Either mature embryos at the initial stages of culture response or established culture lines are inoculated and co-cultivated with the Agrobacterium tumefaciens strain LBA4404 (Agrobacterium) containing the desired vector construction. Agrobacterium is cultured from glycerol stocks on solid YPC medium (100 mg/L spectinomycin and any other appropriate antibiotic) for -2 days at 28 oC. Agrobacterium is re-suspended in liquid MS-CIM medium. The Agrobacterium culture is diluted to an OD600 of 0.2-0.3 and acetosyringone is added to a final concentration of 200 uM.
Acetosyringone is added before mixing the solution with the rice cultures to induce Agrobacterium for DNA transfer to the plant cells. For inoculation, the plant cultures are immersed in the bacterial suspension. The liquid bacterial suspension is removed and the inoculated cultures are placed on co-cultivation medium and incubated at 22 C for two days. The cultures are then transferred to MS-CIM medium with Ticarcillin (400 mg/liter) to inhibit the growth of Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., In Vitro Cell. Dev. Biol.-Plant 37:127-132), cultures are transferred to selection medium containing Mannose as a carbohydrate source (MS with 2%Mannose, 300 mg/liter Ticarcillin) after 7 days, and cultured for 3-4 weeks in the dark. Resistant colonies are then transferred to regeneration induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 1 mg/liter zeatin, 200 mg/liter timentin 2% Mannose and 3% Sorbitol) and grown in the dark for 14 days. Proliferating colonies are then transferred to another round of regeneration induction media and moved to the light growth room.
Regenerated shoots are transferred to GA7 containers with GA7-1 medium (MS with no hormones and 2% Sorbitol) for 2 weeks and then moved to the greenhouse when they are large enough and have adequate roots. Plants are transplanted to soil in the greenhouse (TO generation) grown to maturity, and the T1 seed is harvested.
3. Transformation of Plastids Seeds of Nicotiana tabacum c.v. 'Xanthi nc' are germinated seven per plate in a 1" circular array on T agar medium and bombarded 12-14 days after sowing with 1 pm tungsten particles (M10, Biorad, Hercules, CA) coated with DNA from plasmids pPH143 and pPH145 essentially as described (Svab, Z.
and Maliga, P. (1993) PNAS 90, 913-917). Bombarded seedlings are incubated on T medium for two days after which leaves are excised and placed abaxial side up in bright light (350-500 pmol photons/m2/s) on plates of RMOP medium (Svab, Z., Hajdukiewicz, P. and Maliga, P. (1990) PNAS
87, 8526-8530) containing 500 pg/mi spectinomycin dihydrochloride (Sigma, St. Louis, MO). Resistant shoots appearing underneath the bleached leaves three to eight weeks after bombardment are subcloned onto the same selective medium, allowed to form callus, and secondary shoots isolated and subcloned. Complete segregation of transformed plastid genome copies (homoplasmicity) in independent subclones is assessed by standard techniques of Southern blotting (Sambrook et al., (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor).
BamHI/EcoRI-digested total cellular DNA (Mettler, I. J. (1987) Plant Mol Biol Reporter 5, 346-349) is separated on 1% Tris-borate (TBE) agarose gels, transferred to nylon membranes (Amersham) and probed with 32P-labeled random primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA fragment from pC8 containing a portion of the rps7/12 plastid targeting sequence. Homoplasmic shoots are rooted aseptically on spectinomycin-containing MS/IBA medium (McBride, K. E. et al. (1994) PNAS 91, 7301-7305) and transferred to the greenhouse.
V. Breeding and Seed Production A. Breeding The plants obtained via tranformation with a nucleic acid sequence of the present invention can be any of a wide variety of plant species, including those of monocots and dicots; however, the plants used in the method of the invention are specifically selected from the list of agronomically important target crops set forth supra. The expression of a gene of the present invention in combination with other characteristics important for production and quality can be incorporated into plant lines through breeding. Breeding approaches and techniques are known in the art. See, for example, Welsh J. R., Fundamentals of Plant Genetics and Breeding, John Wiley & Sons, NY
(1981); Crop Breeding, Wood D. R. (Ed.) American Society of Agronomy Madison, Wisconsin (1983); Mayo 0., The Theory of Plant Breeding, Second Edition, Clarendon Press, Oxford (1987); Singh, D.P., Breeding for Resistance to Diseases and Insect Pests, Springer-Verlag, NY (1986); and Wricke and Weber, Quantitative Genetics and Selection Plant Breeding, Walter de Gruyter and Co., Berlin (1986).
The genetic properties engineered into the transgenic seeds and plants described above are passed on by sexual reproduction or vegetative growth and can thus be maintained and propagated in progeny plants. Generally said maintenance and propagation make use of known agricultural methods developed to fit specific purposes such as tilling, sowing or harvesting.
Specialized processes such as hydroponics or greenhouse technologies can also be applied. As the growing crop is vulnerable to attack and damages caused by insects or infections as well as to competition by weed plants, measures are undertaken to control weeds, plant diseases, insects, nematodes, and other adverse conditions to improve yield. These include mechanical measures such a tillage of the soil or removal of weeds and infected plants, as well as the application of agrochemicals such as herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and insecticides.
Use of the advantageous genetic properties of the transgenic plants and seeds according to the invention can further be made in plant breeding, which aims at the development of plants with improved properties such as tolerance of pests, herbicides, or stress, improved nutritional value, increased yield, or improved structure causing less loss from lodging or shattering. The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate progeny plants. Depending on the desired properties, different breeding measures are taken. The relevant techniques are well known in the art and include but are not limited to hybridization, inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization techniques also include the sterilization of plants to yield male or female sterile plants by mechanical, chemical, or biochemical means. Cross pollination of a male sterile plant with pollen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines. Thus, the transgenic seeds and plants according to the invention can be used for the breeding of improved plant lines, that for example, increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow one to dispense with said methods due to their modified genetic properties. Alternatively new crops with improved stress tolerance can be obtained, which, due to their optimized genetic "equipment", yield harvested product of better quality than products that were not able to tolerate comparable adverse developmental conditions.
B. Seed Production In seed production, germination quality and uniformity of seeds are essential product characteristics. As it is difficult to keep a crop free from other crop and weed seeds, to control seedborne diseases, and to produce seed with good germination, fairly extensive and well-defined seed production practices have been developed by seed producers, who are experienced in the art of growing, conditioning and marketing of pure seed. Thus, it is common practice for the farmer to buy certified seed meeting specific quality standards instead of using seed harvested from his own crop. Propagation material to be used as seeds is customarily treated with a protectant coating comprising herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides, or mixtures thereof. Customarily used protectant coatings comprise compounds such as captan, carboxin, thiram (TMTD ), methalaxyl (Apron ), and pirimiphos-methyl (Actellic ). If desired, these compounds are formulated together with further carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation to provide protection against damage caused by bacterial, fungal or animal pests. The protectant coatings may be applied by impregnating propagation material with a liquid formulation or by coating with a combined wet or dry formulation. Other methods of application are also possible such as treatment directed at the buds or the fruit.
VI. Alteration of Expression of Nucleic Acid Molecules The alteration in expression of the nucleic acid molecules of the present invention is achieved in one of the following ways:
A. "Sense" Suppression Alteration of the expression of a nucleotide sequence of the present invention, specifically reduction of its expression, is obtained by "sense"
suppression (referenced in e.g. Jorgensen et al. (1996) Plant Mol. Biol. 31, 957-973). In this case, the entirety or a portion of a nucleotide sequence of the present invention is comprised in a DNA molecule. The DNA molecule is specifically operatively linked to a promoter functional in a cell comprising the target gene, specifically a plant cell, and introduced into the cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA molecule in the "sense orientation", meaning that the coding strand of the nucleotide sequence can be transcribed. In a specific embodiment, the nucleotide sequence is fully translatable and all the genetic information comprised in the nucleotide sequence, or portion thereof, is translated into a polypeptide. In another specific embodiment, the nucleotide sequence is partially translatable and a short peptide is translated. In a specific embodiment, this is achieved by inserting at least one premature stop codon in the nucleotide sequence, which bring translation to a halt. In another more specific embodiment, the nucleotide sequence is transcribed but no translation product is being made. This is usually achieved by removing the start codon, e.g. the "ATG", of the polypeptide encoded by the nucleotide sequence. In a further specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule.
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
B. "Anti-sense" Suppression In another specific embodiment, the alteration of the expression of a nucleotide sequence of the present invention, specifically the reduction of its expression is obtained by "anti-sense" suppression. The entirety or a portion of a nucleotide sequence of the present invention is comprised in a DNA
molecule. The DNA molecule is specifically operatively linked to a promoter functional in a plant cell, and introduced in a plant cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA
molecule in the "anti-sense orientation", meaning that the reverse complement (also called sometimes non-coding strand) of the nucleotide sequence can be transcribed. In a specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Green, P. J. et al., Ann. Rev.
Biochem. 55:569-597 (1986); van der Krol, A. R. et al, Antisense Nuc. Acids &
Proteins, pp. 125-141 (1991); Abel, P. P. et al., PNASroc. Natl. Acad. Sci.
USA 86:6949-6952 (1989); Ecker, J. R. et al., Proc. Natl. Acad. Sci. USANAS
83:5372-5376 (Aug. 1986)).
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
C. Homologous Recombination In another specific embodiment, at least one genomic copy corresponding to a nucleotide sequence of the present invention is modified in the genome of the plant by homologous recombination as further illustrated in Paszkowski et al., EMBO Journal 7:4021-26 (1988). This technique uses the property of homologous sequences to recognize each other and to exchange nucleotide sequences between each by a process known in the art as homologous recombination. Homologous recombination can occur between the chromosomal copy of a nucleotide sequence in a cell and an incoming copy of the nucleotide sequence introduced in the cell by transformation.
Specific modifications are thus accurately introduced in the chromosomal copy of the nucleotide sequence. In one embodiment, the regulatory elements of the nucleotide sequence of the present invention are modified. Such regulatory elements are easily obtainable by screening a genomic library using the nucleotide sequence of the present invention, or a portion thereof, as a probe. The existing regulatory elements are replaced by different regulatory elements, thus altering expression of the nucleotide sequence, or they are mutated or deleted, thus abolishing the expression of the nucleotide sequence. In another embodiment, the nucleotide sequence is modified by deletion of a part of the nucleotide sequence or the entire nucleotide sequence, or by mutation. Expression of a mutated polypeptide in a plant cell is also contemplated in the present invention. More recent refinements of this technique to disrupt endogenous plant genes have been described (Kempin et al., Nature 389:802-803 (1997) and Miao and Lam, Plant J., 7:359-365 (1995).
In another specific embodiment, a mutation in the chromosomal copy of a nucleotide sequence is introduced by transforming a cell with a chimeric oligonucleotide composed of a contiguous stretch of RNA and DNA residues in a duplex conformation with double hairpin caps on the ends. An additional feature of the oligonucleotide is for example the presence of 2'-O-methylation at the RNA residues. The RNA/DNA sequence is designed to align with the sequence of a chromosomal copy of a nucleotide sequence of the present invention and to contain the desired nucleotide change. For example, this technique is further illustrated in US patent 5,501,967 and Zhu et al. (1999) Proc. Natl. Acad. Sci. USA 96: 8768-8773.
D. Ribozymes In a further embodiment, the RNA coding for a polypeptide of the present invention is cleaved by a catalytic RNA, or ribozyme, specific for such RNA. The ribozyme is expressed in transgenic plants and results in reduced amounts of RNA coding for the polypeptide of the present invention in plant cells, thus leading to reduced amounts of polypeptide accumulated in the cells. This method is further illustrated in US patent 4,987,071.
E. Dominant-Negative Mutants In another specific embodiment, the activity of the polypeptide encoded by the nucleotide sequences of this invention is changed. This is achieved by expression of dominant negative mutants of the proteins in transgenic plants, leading to the loss of activity of the endogenous protein.
F. Aptamers In a further embodiment, the activity of polypeptide of the present invention is inhibited by expressing in transgenic plants nucleic acid ligands, so-called aptamers, which specifically bind to the protein. Aptamers are preferentially obtained by the SELEX (Systematic Evolution of Ligands by EXponential Enrichment) method. In the SELEX method, a candidate mixture of single stranded nucleic acids having regions of randomized sequence is contacted with the protein and those nucleic acids having an increased affinity to the target are partitioned from the remainder of the candidate mixture. The partitioned nucleic acids are amplified to yield a ligand enriched mixture.
After several iterations a nucleic acid with optimal affinity to the polypeptide is obtained and is used for expression in transgenic plants. This method is further illustrated in US patent 5,270,163.
G. Zinc finger proteins A zinc finger protein that binds a nucleotide sequence of the present invention or to its regulatory region is also used to alter expression of the nucleotide sequence. Specifically, transcription of the nucleotide sequence is reduced or increased. Zinc finger proteins are for example described in Beerli et al. (1998) PNAS 95:14628-14633., or in WO 95/19431, WO 98/54311, or WO 96/06166, all incorporated herein by reference in their entirety.
H. dsRNA
Alteration of the expression of a nucleotide sequence of the present invention is also obtained by dsRNA interference as described for example in WO 99/32619, WO 99/53050 or WO 99/61631, all incorporated herein by reference in their entirety. In another specific embodiment, the alteration of the expression of a nucleotide sequence of the present invention, specifically the reduction of its expression, is obtained by double-stranded RNA (dsRNA) interference. The entirety or, specifically a portion of a nucleotide sequence of the present invention is comprised in a DNA molecule. The size of the DNA molecule is specifically from 100 to 1000 nucleotides or more; the optimal size to be determined empirically. Two copies of the identical DNA
molecule are linked, separated by a spacer DNA molecule, such that the first and second copies are in opposite orientations. In the specific embodiment, the first copy of the DNA molecule is in the reverse complement (also known as the non-coding strand) and the second copy is the coding strand; in the most specific embodiment, the first copy is the coding strand, and the second copy is the reverse complement. The size of the spacer DNA molecule is specifically 200 to 10,000 nucleotides, more specifically 400 to 5000 nucleotides and most specifically 600 to 1500 nucleotides in length. The spacer is specifically a random piece of DNA, more specifically a random piece of DNA without homology to the target organism for dsRNA
interference, and most specifically a functional intron which is effectively spliced by the target organism. The two copies of the DNA molecule separated by the spacer are operatively linked to a promoter functional in a plant cell, and introduced in a plant cell, in which the nucleotide sequence is expressible. In a specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Waterhouse et al. (1998) PNAS
95:13959-13964; Chuang and Meyerowitz (2000) PNAS 97:4985-4990; Smith et al. (2000) Nature 407:319-320). Alteration of the expression of a nucleotide sequence by dsRNA interference is also described in, for example WO
99/32619, WO 99/53050 or WO 99/61631, all incorporated herein by reference in their entirety.
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
1. Insertion of a DNA molecule (Insertional mutagenesis) In another specific embodiment, a DNA molecule is inserted into a chromosomal copy of a nucleotide sequence of the present invention, or into a regulatory region thereof. Specifically, such DNA molecule comprises a transposable element capable of transposition in a plant cell, such as e.g.
Ac/Ds, Em/Spm, mutator. Alternatively, the DNA molecule comprises a T-DNA border of an Agrobacterium T-DNA. The DNA molecule may also comprise a recombinase or integrase recognition site which can be used to remove part of the DNA molecule from the chromosome of the plant cell.
Methods of insertional mutagenesis using T-DNA, transposons, oligonucleotides or other methods known to those skilled in the art are also encompassed. Methods of using T-DNA and transposon for insertional mutagenesis are described in Winkler et al. (1989) Methods Mol. Biol. 82:129-136 and Martienssen (1998) PNAS 95:2021-2026, incorporated herein by reference in their entireties.
J. Deletion mutagenesis In yet another embodiment, a mutation of a nucleic acid molecule of the present invention is created in the genomic copy of the sequence in the cell or plant by deletion of a portion of the nucleotide sequence or regulator sequence. Methods of deletion mutagenesis are known to those skilled in the art. See, for example, Miao et al, (1995) Plant J. 7:359.
In yet another embodiment, this deletion is created at random in a large population of plants by chemical mutagenesis or irradiation and a plant with a deletion in a gene of the present invention is isolated by forward or reverse genetics. Irradiation with fast neutrons or gamma rays is known to cause deletion mutations in plants (Silverstone et al, (1998) Plant Cell, 10:155-169;
Bruggemann et al., (1996) Plant J., 10:755-760; Redei and Koncz in Methods in Arabidopsis Research, World Scientific Press (1992), pp. 16-82). Deletion mutations in a gene of the present invention can be recovered in a reverse genetics strategy using PCR with pooled sets of genomic DNAs as has been shown in C. elegans (Liu et al., (1999), Genome Research, 9:859-867.). A
forward genetics strategy would involve mutagenesis of a line displaying PTGS followed by screening the M2 progeny for the absence of PTGS.
Among these mutants would be expected to be some that disrupt a gene of the present invention. This could be assessed by Southern blot or PCR for a gene of the present invention with genomic DNA from these mutants.
K. Overexpression in a plant cell In yet another specific embodiment, a nucleotide sequence of the present invention encoding a polypeptide is over-expressed. Examples of nucleic acid molecules and expression cassettes for over-expression of a nucleic acid molecule of the present invention are described above. Methods known to those skilled in the art of over-expression of nucleic acid molecules are also encompassed by the present invention.
In a specific embodiment, the expression of the nucleotide sequence of the present invention is altered in every cell of a plant. This is for example obtained though homologous recombination or by insertion in the chromosome. This is also for example obtained by expressing a sense or antisense RNA, zinc finger protein or ribozyme under the control of a promoter capable of expressing the sense or antisense RNA, zinc finger protein or ribozyme in every cell of a plant. Constitutive expression, inducible, tissue-specific or developmentally-regulated expression are also within the scope of the present invention and result in a constitutive, inducible, tissue-specific or developmentally-regulated alteration of the expression of a nucleotide sequence of the present invention in the plant cell. Constructs for expression of the sense or antisense RNA, zinc finger protein or ribozyme, or for over-expression of a nucleotide sequence of the present invention, are prepared and transformed into a plant cell according to the teachings of the present invention, e.g. as described infra.
VII. Polypeptides The present invention further relates to isolated polypeptides comprising the amino acid sequence of SEQ ID NO:2. In particular, isolated polypeptides comprising the amino acid sequence of SEQ ID NO:2, and variants having conservative amino acid modifications. One skilled in the art will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide or protein sequence which alters, adds or deletes a single amino acid or a small percent of amino acids in the encoded sequence is a "conservative modification" where the modification results in the substitution of an amino acid with a chemically similar amino acid.
Conservative modified variants provide similar biological activity as the unmodified polypeptide. Conservative substitution tables listing functionally similar amino acids are known in the art. See Crighton (1984) Proteins, W.H.
Freeman and Company.
In a specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence of SEQ ID NO:2, or exon or domain thereof, is an allelic variant of the polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a naturally occurring variant of the polypeptide sequence listed SEQ ID NO:2.
In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed SEQ ID NO:2, or exon or domain thereof, is a polymorphic variant of the polypeptide sequence listed in SEQ ID NO:2.
In an alternate specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one amino acid. In a more specific embodiment, the deletion or insertion is of less than about ten amino acids. In a most specific embodiment, the deletion or insertion is of less than about three amino acids.
In a specific embodiment, the sequence having substantial similarity encodes a substitution in at least one amino acid.
Embodiments of the present invention also contemplate an isolated polypeptide containing a polypeptide sequence including (a) a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In another specific embodiment, the polypeptide having substantial similarity is an allelic variant of a polypeptide sequence listed in SEQ ID
NO:2, or a fragment, domain, repeat or chimeras thereof. In another specific embodiment, the isolated nucleic acid includes a plurality of regions from the polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or fragment or domain thereof, or a sequence complementary thereto.
In another specific embodiment, the polypeptide is a polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, the polypeptide is a functional fragment or domain. In yet another specific embodiment, the polypeptide is a chimera, where the chimera may include functional protein domains, including domains, repeats, post-translational modification sites, or other features. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum, and teosinte. In another specific embodiment, the cereal is rice.
In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue is for example, but not limited to, epidermis, vascular tissue, meristem, cambium, cortex or pith. In a most specific embodiment, the location or tissue is leaf or sheath, root, flower, and developing ovule or seed. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In a more specific embodiment, the location or tissue is a seed.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1 or a fragment or domain thereof or a sequence complementary thereto, includes a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides. In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or fragment or domain thereof or a sequence complementary thereto, includes a substitution of at least one codon. In a more specific embodiment, the substitution is conservative.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence listed in SEQ ID NO:2, or a fragment, domain, repeat or chimeras thereof includes a deletion or insertion of at least one amino acid.
The polypeptides of the invention, fragments thereof or variants thereof can comprise any number of contiguous amino acid residues from a polypeptide of the invention, wherein the number of residues is selected from the group of integers consisting of from 10 to the number of residues in a full-length polypeptide of the invention. Specifically, the portion or fragment of the polypeptide is a functional protein. The present invention includes active polypeptides having specific activity of at least 20%, 30%, or 40%, and specifically at least 505, 60%, or 70%, and most specifically at least 805, 90%
or 95% that of the native (non-synthetic) endogenous polypeptide. Further, the substrate specificity (kcat/Km) is optionally substantially similar to the native (non-synthetic), endogenous polypeptide. Typically the Km will be at least 30%, 40%, or 50% of the native, endogenous polypeptide; and more specifically at least 605, 70%, 80%, or 90%. Methods of assaying and quantifying measures of activity and substrate specificity are well known to those of skill in the art.
The isolated polypeptides of the present invention will elicit production of an antibody specifically reactive to a polypeptide of the present invention when presented as an immunogen. Therefore, the polypeptides of the present invention can be employed as immunogens for constructing antibodies immunoreactive to a protein of the present invention for such purposes, but not limited to, immunoassays or protein purification techniques.
Immunoassays for determining binding are well known to those of skill in the art such as, but not limited to, ELISAs or competitive immunoassays.
Embodiments of the present invention also relate to chimeric polypeptides encoded by the isolated nucleic acid molecules of the present disclosure including a chimeric polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); and (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c); or (f) a functional fragment thereof.
A polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including a polypeptide sequence including:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a domain, repeat or chimeras thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; and a functional fragment of (a), (b), (c) or (d);
or (e) a functional fragment thereof.
The isolated nucleic acid molecules of the present invention are useful for expressing a polypeptide of the present invention in a recombinantly engineered cell such as a bacteria, yeast, insect, mammalian or plant cell.
The cells produce the polypeptide in a non-natural condition (e.g. in quantity, composition, location and/or time) because they have been genetically altered to do so. Those skilled in the art are knowledgeable in the numerous expression systems available for expression of nucleic acids encoding a protein of the present invention, and will not be described in detail below.
Briefly, the expression of isolated nucleic acids encoding a polypeptide of the invention will typically be achieved, for example, by operably linking the nucleic acid or cDNA to a promoter (constitutive or regulatable) followed by incorporation into an expression vector. The vectors are suitable for replication and/or integration in either prokaryotes or eukaryotes. Commonly used expression vectors comprise transcription and translation terminators, initiation sequences and promoters for regulation of the expression of the nucleic acid molecule encoding the polypeptide. To obtain high levels of expression of the cloned nucleic acid molecule, it is desirable to use expression vectors comprising a strong promoter to direct transcription, a ribosome binding site for translation initiation, and a transcription/translation terminator. One skilled in the art will recognize that modifications may be made to the polypeptide of the present invention without diminishing its biological activity. Some modifications may be made to facilitate the cloning, expression or incorporation of the polypeptide of the invention into a fusion protein. Such modification are well known in the art and include, but are not limited to, a methionine added at the amino terminus to provide an initiation site, or additional amino acids (e.g. poly Histadine) placed on either terminus to create conveniently located purification sequences. Restriction sites or termination codons can also be introduced into the vector.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-la promoter, the ubiquitin promoter, and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or glutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
Prokaryotic cells may be used a host cells, for example, but not limited to, Escherichia coli, and other microbial strains known to those in the art.
Methods for expressing proteins in prokaryotic cells are well known to those in the art and can be found in many laboratory manuals such as Molecular Cloning: A Laboratory Manual, by J. Sambrook et al. (1989, Cold Spring Harbor Laboratory Press). A variety of promoters, ribosome binding sites, and operators to control expression are available to those skilled in the art, as are selectable markers such as antibiotic resistance genes. The type of vector chosen is to allow for optimal growth and expression in the selected cell type.
A variety of eukaryotic expression systems are available such as, but not limited to, yeast, insect cell lines, plant cells and mammalian cells.
Expression and synthesis of heterologous proteins in yeast is well known (see Sherman et al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory Press, 1982). Commonly used yeast strains widely used for production of eukaryotic proteins are Saccharomyces cerevisiae and Pichia pastoris, and vectors, strains and protocols for expression are available from commercial suppliers (e.g., Invitrogen).
Mammalian cell systems may be transfected with expression vectors for production of proteins. Many suitable host cell lines are available to those in the art, such as, but not limited to the HEK293, BHK21 and CHO cells lines.
Expression vectors for these cells can include expression control sequences such as an origin of replication, a promoter, (e.g., the CMV promoter, a HSV
tk promoter or phosphoglycerate kinase (pgk) promoter), an enhancer, and protein processing sites such as ribosome binding sites, RNA splice sites, polyadenylation sites, and transcription terminator sequences. Other animal cell lines useful for the production of proteins are available commercially or from depositories such as the American Type Culture Collection.
Expression vectors for expressing proteins in insect cells are usually derived from the SF9 baculovirus or other viruses known in the art. A number of suitable insect cell lines are available including but not limited to, mosquito larvae, silkworm, armyworm, moth and Drosophila cell lines.
Methods of transfecting animal and lower eukaryotic cells are known.
Numerous methods are used to make eukaryotic cells competent to introduce DNA such as but not limited to: calcium phosphate precipitation, fusion of the recipient cell with bacterial protoplasts containing the DNA, treatment of the recipient cells with liposomes containing the DNA, DEAE dextrin, electroporation, biolistics, and microinjection of the DNA directly into the cells.
Transfected cells are cultured using means well known in the art (see, Kuchler, R.J., Biochemical Methods in Cell Culture and Virology, Dowden, Hutchinson and Ross, Inc. 1997).
Once a polypeptide of the present invention is expressed it may be isolated and purified from the cells using methods known to those skilled in the art. The purification process may be monitored using Western blot techniques or radioimmunoassay or other standard immunoassay techniques.
Protein purification techniques are commonly known and used by those in the art (see R. Scopes, Protein Purification: Principles and Practice, Springer-Verlag, New York 1982: Deutscher, Guide to Protein Purification, Academic Press (1990). Embodiments of the present invention provide a method of producing a recombinant protein in which the expression vector includes one or more elements including a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, and an affinity purification-tag encoding sequence. In one specific embodiment, the nucleic acid construct includes an epitope-tag encoding sequence and the isolating step includes use of an antibody specific for the epitope-tag. In another specific embodiment, the nucleic acid construct contains a polyamino acid encoding sequence and the isolating step includes use of a resin comprising a polyamino acid binding substance, specifically where the polyamino acid is polyhistidine and the polyamino binding resin is nickel-charged agarose resin. In yet another specific embodiment, the nucleic acid construct contains a polypeptide encoding sequence and the isolating step includes the use of a resin containing a polypeptide binding substance, specifically where the polypeptide is a chitin binding domain and the resin contains chitin-sepharose.
The polypeptides of the present invention cam be synthesized using non-cellular synthetic methods known to those in the art. Techniques for solid phase synthesis are described by Barany and Mayfield, Solid-Phase Peptide Synthesis, pp. 3-284 in the Peptides: Analysis, Synthesis, Biology, Vol.2, Special Methods in Peptide Synthesis, Part A; Merrifield, et al., J. Am. Chem.
Soc. 85:2149-56 (1963) and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, IL (1984).
The present invention further provides a method for modifying (i.e.
increasing or decreasing) the concentration or composition of the polypeptides of the invention in a plant or part thereof. Modification can be effected by increasing or decreasing the concentration and/or the composition (i.e. the ratio of the polypeptides of the present invention) in a plant. The method comprised introducing into a plant cell with an expression cassette comprising a nucleic acid molecule of the present invention, or an nucleic acid encoding a OsGATAl 1 sequence as described above to obtain a transformed plant cell or tissue, culturing the transformed plant cell or tissue. The nucleic acid molecule can be under the regulation of a constitutive or inducible promoter. The method can further comprise inducing or repressing expression of a nucleic acid molecule of a sequence in the plant for a time sufficient to modify the concentration and/or composition in the plant or plant part.
A plant or plant part having modified expression of a nucleic acid molecule of the invention can be analyzed and selected using methods known to those skilled in the art such as, but not limited to, Southern blot, DNA
sequencing, or PCR analysis using primers specific to the nucleic acid molecule and detecting amplicons produced therefrom.
In general, concentration or composition in increased or decreased by at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90% relative to a native control plant, plant part or cell lacking the expression cassette.
Sugars are central regulators of many vital processes in photosynthetic plants, such as photosynthesis, carbon and nitrogen metabolism and this regulation is achieved by regulating gene expression, either activate or repress genes involved. The mechanisms by which sugars control gene expression are not understood well. This GATA transcription factor disclosed here is involved in regulating sugar sensing and the expression of the factor itself is influenced by the change of the N status. Increased expression of this gene can produce plants with increased yield, particularly as the manipulation of sugar signaling pathways can lead to increased photosynthesis and increased nitrogen utilization and alter source-sink relationships in seeds, tubes, roots and other storage organs.
The invention will be further described by reference to the following detailed examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified.
EXAMPLES
Standard recombinant DNA and molecular cloning techniques used here are well known in the art and are described by J. Sambrook, et al., Molecular Cloning: A Laboratory Manual, 3d Ed., Cold Spring Harbor, NY:
Cold Spring Harbor Laboratory Press (2001); by T.J. Silhavy, M.L. Berman, and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984) and by Ausubel, F.M. et al., Current Protocols in Molecular Biology, New York, John Wiley and Sons Inc., (1988), Reiter, et al., Methods in Arabidopsis Research, World Scientific Press (1992), and Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publishers (1998).
EXPERIMENTAL BACKGROUND AND PROCEDURES
A. Determining rice and maize growth conditions under limiting nitrogen conditions In past experiments to study genes involved in nitrate uptake and assimilation, the present inventors and others have utilized growth conditions in which nitrate was generally either present in excess or absent in its entirety.
In the latter case, nitrate is typically added to plants grown in its absence in order to understand nitrate regulation of these and other genes. While this type of extreme treatment is useful in defining some aspects of gene regulation, it is not suitable to gain a better understanding of the effect of nitrogen limitation. The inventors have defined conditions for Arabidopsis in which nitrogen limits growth. This involved developing a system using Rockwool (Hirai et al., 1995 Plant Cell Physiol 36, 1331-1339) and defining three conditions: one where growth is maximal; one where nitrogen limits growth to 70-75% maximal growth levels; one where there is a more severe limitation to 30-35% maximal growth levels. The nitrogen limitation acts as a 'stress' with the amount of 'stress' easily varied by altering the concentration of nitrate. The inventors assay the physiological "nitrogen status" by measuring nitrate, chlorophyll (which is often used as a reflection of nitrogen status under field conditions- see, e.g., Fox RH et al 2001 Agron J. 93, 590-597; Minotti PL et al 1994 Hort Science 29, 1497-1550), amino acid levels, and nitrate reductase and glutamine synthetase activities in order to give a baseline in which to assess studies on mutant lines.
B. Expression profiling experiments on Arabidopsis plants under nitrogen limitation Transcript expression profiling can be used to test RNA levels of large numbers of genes at the same time. Large numbers of these types of experiments have been done in the past, and if the experimental system is amenable, these can be used to pinpoint the "expression status" of an organism under different conditions and to use this information to make hypotheses on what genes and pathways are involved in various processes.
The inventors found that the more profound the difference in growth conditions, the larger the differences in transcript profiles between the plants grown under these conditions and the more difficult it was to decipher which changes were most important. The only published whole genome profiling experiment in this area is one in Arabidopsis where an extreme change in nitrate levels was studied (Wang R et al 2003 Plant Physiol. 132, 556-67). In the case of nitrogen limitations, the inventors studied the effect of growing plants under chronic nitrogen stress as well as changes in the level of available nitrogen. The inventors have already determined the impact on growth of different nitrogen levels in Arabidopsis.
The effect of different nitrogen levels on the transcript profiles was studied: where nitrogen does not limit growth. For Arabidopsis the inventors collected 4-week old shoots grown under the different nitrogen regimes.
Three different samples were collected (biological triplicates) in order to get statistically significant results. The transcript profiling was done using Arabidopsis GeneChip whole genome array (Affymetrix) to study the transcript levels in Arabidopsis. The bioinformatic analysis necessary to study the considerable data produced by these experiments was performed. By studying the effect of nitrogen limitation on the expression patterns, the inventors can pinpoint which pathways are involved in their response to nutrient stress Materials and methods Plant growth conditions Peat moss and vermiculate (1:4) (SunGro Horticulture Canada Ltd. BC, Canada) was used to grow Oryza sativa Kaybonnet plants, adding nutrient solution with different amount of nitrate once a week till harvest. The nutrient solution contains 4 mM MgSO4, 5 mM KCI, 5 mM CaC12, 1 mM KH2P04, 0.1 mM Fe-EDTA, 0.5 mM MES (pH6.0), 9 p M MnSO4, 0.7 pM Zn SO4, 0.3 pM
CuSOa, 46 pM NaB4O7 and 0.2 pM (NH4) 6Mo7O2. For limiting N condition, 3mM N solution was used once a week till harvest. For sufficient N condition, 10mM N solution was used once a week for the first six week, changed to 5mM for another 6 weeks, and the changed to 3mM N solution till harvest.
Plants were grown in a growth room with 16 hr light (-400 pmolm-zs"') at 28-30 C and 8 hr dark at 22-24 C for the first four weeks and then had one week short-day treatment (10 hr light/14 hr dark). After that, plants were moved to greenhouse to grow till harvest.
Generating transgenic rice plants The constructs for over-expressing or silencing OsGATAII were made. T1 transgenic seeds over-expressing OsGATAl1, and silencing OsGATAl 1 (RNAi) were analyzed.
Genotyping transgenic plants Leaf samples were grounded in 300 NI buffer (Strategic Diagnostics Inc. Part # 7000006). One dipstick (Strategic Diagnostics Inc. Part # 7000052) was inserted into the tube and left for -15 minutes by which time the lines on the sticks were clear. The appearance of one red line (control) on the strip indicates a negative result. The appearance of two red lines (control and test) on the strip indicates a positive result.
Expression analysis by semi-quantitative RT-PCR
One pg total RNA extracted was used to make cDNA. Primers for OsGATAl1 are 5'- CGTCGAGCACCAAGGGCAAATC-3' (SEQ ID NO:3) and 5'- GGATAGGGTCATGAGCAGCATGG-3' (SEQ ID NO:4). Primers for OsTubulin are: 5'- AGGAGGATGCCGCTAACAACTTTG-3' (SEQ ID NO:5) and 5'- AAACAGCATTGGTGATTTCAGGC-3' (SEQ ID NO:6).
Chlorophyll measurement Total chlorophyll was measured either using the Minolta SPAD 502DL
chlorophyll meter (Tokyo, Japan), or extracted by ethanol and measured by spectrophotometer according to Kirk (1968).
Results Strategy to phenotype transgenic plants The strategy for initial genetic and phenotypic analysis involved growing 5 transgenic events from each construct under mainly limiting nitrogen (N) condition (- 18 plants). Also some plants were grown under sufficient N condition (- 10 plants). PMI sticks were used for genotyping to detect the selectable marker PMI. Transgene expression levels were tested by semi-quantitative RT-PCR. Chlorophyll level, culm length, tiller number, panicle number, flowering time, seed yield and shoot biomass was recorded.
Phenotypes of the OsGATAl1 over-expression plants The OsGATAl1 gene shares - 34% similarity at protein level with the AtGATA gene (At4g26150, Figure 3). Total chlorophyll levels were measured when the transgenic plants were about 4-wk-old under limiting N condition. At least two transgenic events (event 5 and 6) had significant higher chlorophyll content from the average of PMI positive plants (3-6 plants) compared to wild type control plants (6 plants) (Figure 4A). Those transgenic plants did have elevated expression of the OsGATAII gene (Figure 4B). To ensure that chlorophyll level can be affected by the expression levels of the OsGATAl1 gene, the transgenic RNAi OsGATAl 1 plants were analyzed. The expression level of the OsGATA11 gene was significantly reduced in the transgenic RNAi OsGATAl 1 plants (Figure 5A), and indeed, chlorophyll level was significantly lower in those plants (Figure 5B). One event (event 6) had - 20% higher seed yield from the average of 10 PMI positive plants compared to the average of 11 wild type control plants under limiting N condition (Figure 6A). This same event had almost doubled seed yield from the average of 4 PMI positive plants compared to the average of 6 wild type control plants under sufficient N
condition (Figure 6B). Also, plants grown under high N experienced stress after being transferred from the growth room to the greenhouse and the transgenic plants responded much better to the stress (Figure 7).
Having now described particular embodiments of the invention by way of the foregoing examples, which are not intended to be limiting, the invention will now be further set forth in the following claims. Those skilled in the art will recognize that the claims also permit for the inclusion of equivalents beyond the claims' literal scope.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: UNIVERSITY OF GUELPH
(ii) TITLE OF INVENTION: NITROGEN-REGULATED SUGAR SENSING GENE
AND PROTEIN MODULATION THEREOF
(iii) NUMBER OF SEQUENCES: 7 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: BERESKIN & PARR
(B) STREET: 40 King Street West (C) CITY: Toronto (D) STATE: Ontario (E) COUNTRY: Canada (F) ZIP: M5H 3Y2 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: iMAC - Using Virtual PC
(C) OPERATING SYSTEM: Windows 198 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: GRAVELLE, MICHELINE
(C) REFERENCE/DOCKET NUMBER: 6580-346 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (416) 364-7311 (B) TELEFAX: (416) 361-1398 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1343 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Rice (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
gaacttctct cccatctctt tcctcctcct cctctctgat atgtctacta tctacatgag 60 ccagctacct gctactctcc ctctaatgga gggggatcag gatcaggggc tctacccagc 120 cttccataga gcaaaggacc ctcctatctt gttccctttc atgatcgaca gcgccgtcga 180 gcaccaaggg caaatctatg gagatcaggg cttgaggagg cagcaggttt tgggtgaatc 240 caatcaacag ttcaatgatc acatgatgat gggcggatca gatgtcttcc tcacaccgtc 300 tccgttccga ccaaccatcc aaagcatcgg cagcgacatg atccagcgat catcttatga 360 tccatacgat atcgagagta acaacaagca gcatgccaat ggatcaacca gcaagtggat 420 gtcgacgccg ccaatgaaga tgaggatcat aaggaagggg gcggcaaccg atcctgaggg 480 cggggcggtg agaaagccaa ggagaagagc acaagcgcac caggatgaga gccagcaaca 540 actgcagcaa gctttgggtg tcgttagagt gtgctcggac tgcaacacca ccaagacccc 600 cttgtggaga agtggtcctt gtggccccaa gtccctttgc aacgcgtgtg gcatcaggca 660 aaggaaggcg cggcgggcga tggccgctgc tgccaacggc ggagcggcgg tggcgccggc 720 aaagagcgtg gccgcggcgc cggtgaacaa taagccggcg gcgaagaagg agaagagggc 780 ggcggacgtc gaccggtcgc tgccgttcaa gaaacggtgc aagatggtcg atcacgttgc 840 tgctgccgtc gctgccacca agcccacggc tgctggagaa gtagtggccg ccgctccgaa 900 ggaccaagat cacgtcatcg tcgtcggtgg cgagaacgcc gccgccacct ccatgccggc 960 acagaacccg atatccaagg cggcggcgac cgccgctgcc gccgccgcct ctccggcgtt 1020 cttccacggc ctccctcgcg acgagatcac cgacgccgcc atgctgctca tgaccctatc 1080 ctgtggcctc gtccacagct agctagctag ctgatcaaaa ctagctagct actagtaccg 1140 ttaatttgat gagggcaaca accagagtac tatgtaccac tactagcaat attttgtgtg 1200 tgccttgtga tcttttgttg ttttgtgttg ttgaggagat cactagatca ggatgaagga 1260 gagatagtga tcacatgtct aaggacgaaa taaacgagaa caaactcgct agctagctac 1320 tagccgggat caggattata ttt 1343 (2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 353 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Rice (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Ser Thr Ile Tyr Met Ser Gln Leu Pro Ala Thr Leu Pro Leu Met Glu Gly Asp Gln Asp Gln Gly Leu Tyr Pro Ala Phe His Arg Ala Lys Asp Pro Pro Ile Leu Phe Pro Phe Met Ile Asp Ser Ala Val Glu His Gln Gly Gln Ile Tyr Gly Asp Gln Gly Leu Arg Arg Gln Gln Val Leu Gly Glu Ser Asn Gln Gln Phe Asn Asp His Met Met Met Gly Gly Ser Asp Val Phe Leu Thr Pro Ser Pro Phe Arg Pro Thr Ile Gln Ser Ile Gly Ser Asp Met Ile Gln Arg Ser Ser Tyr Asp Pro Tyr Asp Ile Glu Ser Asn Asn Lys Gln His Ala Asn Gly Ser Thr Ser Lys Trp Met Ser Thr Pro Pro Met Lys Met Arg Ile Ile Arg Lys Gly Ala Ala Thr Asp Pro Glu Gly Gly Ala Val Arg Lys Pro Arg Arg Arg Ala Gln Ala His Gln Asp Glu Ser Gln Gln Gln Leu Gln Gln Ala Leu Gly Val Val Arg Val Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Met Ala Ala Ala Ala Asn Gly Gly Ala Ala Val Ala Pro Ala Lys Ser Val Ala Ala Ala Pro Val Asn Asn Lys Pro Ala Ala Lys Lys Glu Lys Arg Ala Ala Asp Val Asp Arg Ser Leu Pro Phe Lys Lys Arg Cys Lys Met Val Asp His Val Ala Ala Ala Val Ala Ala Thr Lys Pro Thr Ala Ala Gly Glu Val Val Ala Ala Ala Pro Lys Asp Gln Asp His Val Ile Val Val Gly Gly Glu Asn Ala Ala Ala Thr Ser Met Pro Ala Gln Asn Pro Ile Ser Lys Ala Ala Ala Thr Ala Ala Ala Ala Ala Ala Ser Pro Ala Phe Phe His Gly Leu Pro Arg Asp Glu Ile Thr Asp Ala Ala Met Leu Leu Met Thr Leu Ser Cys Gly Leu Val His Ser (2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
cgtcgagcac caagggcaaa tc 22 (2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
ggatagggtc atgagcagca tgg 23 (2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
aggaggatgc cgctaacaac tttg 24 (2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
aaacagcatt ggtgatttca ggc 23 (2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 352 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Arabidopsis thaliana (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
Met Gly Ser Asn Phe His Tyr Thr Ile Asp Leu Asn Glu Asp Gln Asn His Gln Pro Phe Phe Ala Ser Leu Gly Ser Ser Leu His His His Leu Gln Gin Gln Gln Gln Gln Gln Gln His Phe His His Gln Ala Ser Ser Asn Pro Ser Ser Leu Met Ser Pro Ser Leu Ser Tyr Phe Pro Phe Leu Ile Asn Ser Arg Gln Asp Gln Val Tyr Val Gly Tyr Asn Asn Asn Thr Phe His Asp Val Leu Asp Thr His Ile Ser Gln Pro Leu Glu Thr Lys Asn Phe Val Ser Asp Gly Gly Ser Ser Ser Ser Asp Gln Met Val Pro Lys Lys Glu Thr Arg Leu Lys Leu Thr Ile Lys Lys Lys Asp Asn His Gln Asp Gln Thr Asp Leu Pro Gln Ser Pro Ile Lys Asp Met Thr Gly Thr Asn Ser Leu Lys Trp Ile Ser Ser Lys Val Arg Leu Met Lys Lys Lys Lys Ala Ile Ile Thr Thr Ser Asp Ser Ser Lys Gln His Thr Asn Asn Asp Gln Ser Ser Asn Leu Ser Asn Ser Glu Arg Gln Asn Gly Tyr Asn Asn Asp Cys Val Ile Arg Ile Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Arg Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Ala Met Ala Thr Ala Thr Ala Thr Ala Val Ser Gly Val Ser Pro Pro Val Met Lys Lys Lys Met Gln Asn Lys Asn Lys Ile Ser Asn Gly Val Tyr Lys Ile Leu Ser Pro Leu Pro Leu Lys Val Asn Thr Cys Lys Arg Met Ile Thr Leu Glu Glu Thr Ala Leu Ala Glu Asp Leu Glu Thr Gln Ser Asn Ser Thr Met Leu Ser Ser Ser Asp Asn Ile Tyr Phe Asp Asp Leu Ala Leu Leu Leu Ser Lys Ser Ser Ala Tyr Gln Gln Val Phe Pro Gln Asp Glu Lys Glu Ala Ala Ile Leu Leu Met Ala Leu Ser His Gly Met Val His Gly
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or a fragment or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence of SEQ ID NO:1 or a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In another specific embodiment, the polypeptide having substantial similarity is an allelic variant of a polypeptide sequence of SEQ ID NO:2, or a fragment, domain, repeat or chimera thereof. In another specific embodiment, the isolated nucleic acid includes a plurality of regions from the polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or fragment or domain thereof, or a sequence complementary thereto.
In another specific embodiment, the polypeptide is a polypeptide sequence of SEQ ID NO:2. In another specific embodiment, the polypeptide is a functional fragment or domain. In yet another specific embodiment, the polypeptide is a chimera, where the chimera may include functional protein domains, including domains, repeats, post-translational modification sites, or other features. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum, and teosinte.
_20_ In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In another specific embodiment, the location or tissue is a seed.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof or a sequence complementary thereto, includes a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides. In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence of SEQ ID NO:1, or a fragment or domain thereof or a sequence complementary thereto, includes a substitution of at least one codon. In a more specific embodiment, the substitution is conservative.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence of SEQ ID NO:2 or a fragment, domain, repeat, or chimeras thereof includes a deletion or insertion of at least one amino acid.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence of SEQ ID NO:2 or a fragment, domain, repeat, or chimeras thereof includes a substitution of at least one amino acid.
Embodiments of the present invention also relate to a shuffled nucleic acid containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments corresponds to a region of a nucleotide sequence of SEQ ID NO:1 and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid. In a more specific embodiment, all of the fragments in a shuffled nucleic acid containing a plurality of nucleotide sequence fragments are from a single gene. In a more specific embodiment, the plurality of fragments originates from at least two different genes. In a more specific embodiment, the shuffled nucleic acid is operably linked to a promoter sequence. Another more specific embodiment is a chimeric polynucleotide including a promoter sequence operably linked to the shuffled nucleic acid. In a more specific embodiment, the shuffled nucleic acid is contained within a host cell.
Embodiments of the present invention also contemplate an expression cassette including a promoter sequence operably linked to an isolated nucleic acid containing a nucleotide sequence including:
a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
Further encompassed within the invention is a recombinant vector comprising an expression cassette according to embodiments of the present invention. Also encompassed are plant cells, which contain expression cassettes, according to the present disclosure, and plants, containing these plant cells. In a specific embodiment, the plant is a dicot. In a more specific embodiment, the dicot is selected from the group consisting of soybean, tobacco or cotton. In another specific embodiment, the plant is a gymnosperm. In another specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, _22_ millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum and teosinte.
In one embodiment, the expression cassette is expressed throughout the plant. In another embodiment, the expression cassette is expressed in a specific location or tissue of a plant. In a specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternative specific embodiment, the location or tissue is a seed.
In one embodiment, the expression cassette is involved in a function such as, for example, but not limited to, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In a more specific embodiment, the chimeric polypeptide is involved in a function such as, nitrogen utilization, abiotic stress tolerance, enhanced yield, disease resistance and/or nutritional composition.
In one embodiment, the plant contains a modification to a phenotype or measurable characteristic of the plant, the modification being attributable to the expression of at least one gene contained in the expression cassette. In a specific embodiment, the modification may be, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
Embodiments of the present invention also provide seed and isolated product from plants which contain an expression cassette including a promoter sequence operably linked to an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence of SEQ ID NO:1or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c) ;
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d) according to the present disclosure.
In a specific embodiment the isolated product includes an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
Embodiments of the present invention also relate to isolated products produced by expression of an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence of SEQ ID NO:1, or fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID
NO:2, or a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a) or (b);
(e) a nucleotide sequence complementary to (a), (b), (c) or (d);
or (f) a nucleotide sequence that is the reverse complement of (a), (b) (c) or (d) according to the present disclosure.
In a specific embodiment, the product is produced in a plant. In another specific embodiment, the product is produced in cell culture. In another specific embodiment, the product is produced in a cell-free system.
In another specific embodiment, the product includes an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
In a specific embodiment, the product is a polypeptide containing an amino acid sequence of SEQ ID NO:2. In a more specific embodiment, the protein is an transcription factor.
Embodiments of the present invention further relate to an isolated polynucleotide including a nucleotide sequence of at least 10 bases, which sequence is identical, complementary, or substantially similar to a region of any sequence of SEQ ID NO:1, and wherein the polynucleotide is adapted for any of numerous uses.
In a specific embodiment, the polynucleotide is used as a chromosomal marker. In another specific embodiment, the polynucleotide is used as a marker for RFLP analysis. In another specific embodiment, the polynucleotide is used as a marker for quantitative trait linked breeding. In another specific embodiment, the polynucleotide is used as a marker for marker-assisted breeding. In another specific embodiment, the polynucleotide is used as a bait sequence in a two-hybrid system to identify sequence- encoding polypeptides interacting with the polypeptide encoded by the bait sequence. In another specific embodiment, the polynucleotide is used as a diagnostic indicator for genotyping or identifying an individual or population of individuals. In another specific embodiment, the polynucleotide is used for genetic analysis to identify boundaries of genes or exons.
Embodiments of the present invention also relate to an expression vector comprising or consisting of a nucleic acid molecule including:
(a) a nucleic acid encoding a polypeptide as listed in SEQ ID
NO:2 (b) a fragment, one or more domains, or featured regions of SEQ ID NO:1; or (c) a complete nucleic acid sequence listed in SEQ ID NO:1, or a fragment thereof, in combination with a heterologous sequence.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-1a promoter, ubiquitin and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or gIutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
In a specific embodiment, the expression vector is a eukaryotic expression vector or a prokaryotic expression vector. In a more specific embodiment, the eukaryotic expression vector includes a tissue-specific promoter. More specifically, the expression vector is operable in plants.
Embodiments of the present invention also relate to a cell comprising or consisting of a nucleic acid construct comprising an expression vector and a nucleic acid including a nucleic acid encoding a polypeptide as listed in SEQ
ID NO:2, or a nucleic acid sequence listed in SEQ ID NO:1, or a segment thereof, in combination with a heterologous sequence.
In a specific embodiment, the cell is a bacterial cell, a fungal cell, a plant cell, or an animal cell. In a specific embodiment, the cell is a plant cell. In a more specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a most specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In an alternate most specific embodiment, the location or tissue is a seed. In a specific embodiment, the polypeptide is involved in a function such as, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
Embodiments of the present invention also relate to polypeptides encoded by the isolated nucleic acid molecules of the present disclosure including a polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c);
(f) or a functional fragment thereof.
A polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including a polypeptide sequence including:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a domain, repeat, or chimeras thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1 or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d);
(f) or a functional fragment thereof.
Embodiments of the present invention contemplate a polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid which includes a shuffled nucleic acid containing a plurality of nucleotide sequence fragments, wherein at least one of the fragments corresponds to a region of a nucleotide sequence listed SEQ ID NO:1, and wherein at least two of the plurality of sequence fragments are in an order, from 5' to 3' which is not an order in which the plurality of fragments naturally occur in a nucleic acid, or functional fragment thereof.
Embodiments of the present invention contemplate a polypeptide containing a polypeptide sequence encoded by an isolated polynucleotide containing a nucleotide sequence of at least 10 bases, which sequence is identical, complementary, or substantially similar to a region of any of sequences of SEQ ID NO:1, or functional fragment thereof and wherein the polynucleotide is adapted for a use including:
(a) use as a chromosomal marker to identify the location of the corresponding or complementary polynucleotide on a native or artificial chromosome;
(b) use as a marker for RFLP analysis;
(c) use as a marker for quantitative trait linked breeding;
(d) use as a marker for marker-assisted breeding;
(e) use as a bait sequence in a two-hybrid system to identify sequence encoding polypeptides interacting with the polypeptide encoded by the bait sequence;
(f) use as a diagnostic indicator for genotyping or identifying an individual or population of individuals; or (g) use for genetic analysis to identify boundaries of genes or exons.
_28-Embodiments of the present invention also contemplate an isolated polypeptide containing a polypeptide sequence including:
(a) a polypeptide sequence listed SEQ ID NO:2, or exon or domain thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In a specific embodiment, the substantial similarity is at least about 65% identity. In a more specific embodiment, the substantial similarity is at least about 80% identity. In a most specific embodiment, the substantial similarity is at least about 95% identity. In a specific embodiment, the substantial similarity is at least three percent greater than the percent identity to the closest homologous sequence listed in any of the Sequence Listings.
In a specific embodiment, the sequence having substantial similarity is from a plant. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum and teosinte.
In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In another specific embodiment, the location or tissue is a seed. In a specific embodiment, the polypeptide is involved in a function such as, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In a specific embodiment, hybridization of a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto, or a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed SEQ ID NO:1, or to a sequence complementary thereto, allows the sequence to form a duplex at medium or high stringency.
In a specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is an allelic variant of the polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a naturally occurring variant of the polypeptide sequence listed in SEQ ID NO:2.
In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a polymorphic variant of the polypeptide sequence listed in SEQ ID NO:2.
In an alternate specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one amino acid. In a more specific embodiment, the deletion or insertion is of less than about ten amino acids. In a most specific embodiment, the deletion or insertion is of less than about three amino acids.
In a specific embodiment, the sequence having substantial similarity encodes a substitution in at least one amino acid.
Also contemplated is a method of producing a plant comprising a modification thereto, including the steps of: (1) providing a nucleic acid which is an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed SEQ ID NO:1, or exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); or (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c);
and (2) introducing the nucleic acid into the plant, wherein the nucleic acid is expressible in the plant in an amount effective to effect the modification. In one embodiment, the modification comprises an altered characteristic in the plant, wherein the characteristic corresponds to the nucleic acid introduced into the plant. In other specific embodiments the characteristic corresponds to carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation.
In another embodiment, the modification includes an increased or decreased expression or accumulation of a product of the plant. Specifically, the product is a natural product of the plant. Equally specifically, the product is a new or altered product of the plant. Specifically, the product comprises a GATA transcription factor.
Also encompassed within the presently disclosed invention is a method of producing a recombinant protein, comprising the steps of:
(a) growing recombinant cells comprising a nucleic acid construct under suitable growth conditions, the construct comprising an expression vector and a nucleic acid including: a nucleic acid encoding a protein as listed in SEQ ID NO:2, or a nucleic acid sequence listed in SEQ ID NO:1, or segments thereof; and (b) isolating from the recombinant cells the recombinant protein expressed thereby.
Embodiments of the present invention provide a method of producing a recombinant protein in which the expression vector includes one or more elements including a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, and an affinity purification-tag encoding sequence. In one specific embodiment, the nucleic acid construct includes an epitope-tag encoding sequence and the isolating step includes use of an antibody specific for the epitope-tag. In another specific embodiment, the nucleic acid construct contains a polyamino acid encoding sequence and the isolating step includes use of a resin comprising a polyamino acid binding substance, specifically where the polyamino acid is polyhistidine and the polyamino binding resin is nickel-charged agarose resin. In yet another specific embodiment, the nucleic acid construct contains a polypeptide encoding sequence and the isolating step includes the use of a resin containing a polypeptide binding substance, specifically where the polypeptide is a chitin binding domain and the resin contains chitin-sepharose.
Embodiments of the present invention also relate to a plant modified by a method that includes introducing into a plant a nucleic acid where the nucleic acid is expressible in the plant in an amount effective to effect the modification. The modification can be, for example, carbon, nitrogen and/or sulfur metabolism, nitrogen utilization, nitrogen assimilation, photosynthesis, signal transduction, cell growth, reproduction, disease resistance, abiotic stress tolerance, nutritional composition, gene regulation, and/or differentiation. In one embodiment, the modified plant has increased or decreased resistance to an herbicide, a stress, or a pathogen. In another embodiment, the modified plant has enhanced or diminished requirement for light, water, nitrogen, or trace elements. In yet another embodiment, the modified plant is enriched for an essential amino acid as a proportion of a protein fraction of the plant. The protein fraction may be, for example, total seed protein, soluble protein, insoluble protein, water-extractable protein, and lipid-associated protein. The modification may include overexpression, underexpression, antisense modulation, sense suppression, inducible expression, inducible repression, or inducible modulation of a gene.
The invention further relates to a seed from a modified plant or an isolated product of a modified plant, where the product may be an enzyme, a nutritional protein, a structural protein, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin and a plant hormone.
The above Summary of Invention lists several embodiments of the invention, and in many cases lists variations and permutations of these embodiments. The Summary is merely exemplary of the numerous and varied embodiments. Mention of one or more specific features of a given embodiment is likewise exemplary. Such embodiment can typically exist with or without the feature(s) mentioned; likewise, those features can be applied to other embodiments of the invention, whether listed in this Summary or not.
To avoid excessive repetition, this Summary does not list or suggest all possible combinations of such features.
For purposes of summarizing the invention and the advantages achieved over the prior art, certain objects and advantages of the invention have been described above. Of course, it is to be understood that not necessarily all such objects or advantages may be achieved in accordance with any particular embodiment of the invention. Thus, for example, those skilled in the art will recognize that the invention may be embodied or carried out in a manner that achieves or optimizes one advantage or group of advantages as taught herein without necessarily achieving other objects or advantages as may be taught or suggested herein.
Further aspects, features and advantages of this invention will become apparent from the detailed description of the specific embodiments that follow.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 and SEQ ID NO:1 shows the nucleic acid sequence of full length OsGATAl 1.
Figure 2 and SEQ ID NO:2 shows the amino acid sequence of OsGATAl 1.
Figure 3 shows the alignment of the amino acid sequence of At4g26150 (SEQ ID NO:7) and its rice ortholog OsGATAl 1 (SEQ ID NO:2).
Figure 4A and B shows the phenotypes of the OsGATAl 1 over-expressing plants.
Figure 5A and B shows the chlorophyll level affected by the expression of OsGATA11 gene.
Figure 6A and B shows the seed yield of OsGATAl 1 over-expressing plants.
Figure 7 are pictures showing more resistant to stress in the OsGATAl1 over-expressing plants.
DEFINITIONS
For clarity, certain terms used in the specification are defined and presented as follows:
"Associated with / operatively linked" refer to two nucleic acid sequences that are related physically or functionally. For example, a promoter or regulatory DNA sequence is said to be "associated with" a DNA
sequence that codes for an RNA or a protein if the two sequences are operatively linked, or situated such that the regulator DNA sequence will affect the expression level of the coding or structural DNA sequence.
A "chimeric construct" is a recombinant nucleic acid sequence in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA or which is expressed as a protein, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid sequence. The regulatory nucleic acid sequence of the chimeric construct is not normally operatively linked to the associated nucleic acid sequence as found in nature.
A "co-factor" is a natural reactant, such as an organic molecule or a metal ion, required in an enzyme-catalyzed reaction. A co-factor is e.g.
NAD(P), riboflavin (including FAD and FMN), folate, molybdopterin, thiamin, biotin, lipoic acid, pantothenic acid and coenzyme A, S-adenosylmethionine, pyridoxal phosphate, ubiquinone, menaquinone. Optionally, a co-factor can be regenerated and reused.
A"coding sequence" is a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA.
Specifically the RNA is then translated in an organism to produce a protein.
Complementary: "complementary" refers to two nucleotide sequences that comprise antiparallel nucleotide sequences capable of pairing with one another upon formation of hydrogen bonds between the complementary base residues in the antiparaliel nucleotide sequences.
Enzyme activity: means herein the ability of an enzyme to catalyze the conversion of a substrate into a product. A substrate for the enzyme comprises the natural substrate of the enzyme but also comprises analogues of the natural substrate, which can also be converted, by the enzyme into a product or into an analogue of a product. The activity of the enzyme is measured for example by determining the amount of product in the reaction after a certain period of time, or by determining the amount of substrate remaining in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of an unused co-factor of the reaction remaining in the reaction mixture after a certain period of time or by determining the amount of used co-factor in the reaction mixture after a certain period of time. The activity of the enzyme is also measured by determining the amount of a donor of free energy or energy-rich molecule (e.g. ATP, phosphoenolpyruvate, acetyl phosphate or phosphocreatine) remaining in the reaction mixture after a certain period of time or by determining the amount of a used donor of free energy or energy-rich molecule (e.g. ADP, pyruvate, acetate or creatine) in the reaction mixture after a certain period of time.
Expression Cassette: "Expression cassette" as used herein means a nucleic acid molecule capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operatively linked to the nucleotide sequence of interest which is operatively linked to termination signals. It also typically comprises sequences required for proper translation of the nucleotide sequence. The coding region usually codes for a protein of interest but may also code for a functional RNA of interest, for example antisense RNA or a nontransiated RNA, in the sense or antisense direction. The expression cassette comprising the nucleotide sequence of interest may be chimeric, meaning that at least one of its components is heterologous with respect to at least one of its other components. The expression cassette may also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. Typically, however, the expression cassette is heterologous with respect to the host, i.e., the particular DNA sequence of the expression cassette does not occur naturally in the host cell and must have been introduced into the host cell or an ancestor of the host cell by a transformation event. The expression of the nucleotide sequence in the expression cassette may be under the control of a constitutive promoter or of an inducible promoter that initiates transcription only when the host cell is exposed to some particular external stimulus. In the case of a multicellular organism, such as a plant, the promoter can also be specific to a particular tissue or organ or stage of development.
The term "functional fragment" as used herein in relation to a nucleic acid or protein sequence means a fragment or portion of the sequence that retains the function of the full length sequence.
Gene: the term "gene" is used broadly to refer to any segment of DNA
associated with a biological function. Thus, genes include coding sequences and/or the regulatory sequences required for their expression. Genes also include nonexpressed DNA segments that, for example, form recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.
Heterologous/exogenous: The terms "heterologous" and "exogenous"
when used herein to refer to a nucleic acid sequence (e.g. a DNA sequence) or a gene, refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling. The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides.
A "homologous" nucleic acid (e.g. DNA) sequence is a nucleic acid (e.g. DNA) sequence naturally associated with a host cell into which it is introduced.
Hybridization: The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially"
refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.
Inhibitor: a chemical substance that inactivates the enzymatic activity of a protein such as a biosynthetic enzyme, receptor, signal transduction protein, structural gene product, or transport protein. The term "herbicide" (or "herbicidal compound") is used herein to define an inhibitor applied to a plant at any stage of development, whereby the herbicide inhibits the growth of the plant or kills the plant.
Interaction: quality or state of mutual action such that the effectiveness or toxicity of one protein or compound on another protein is inhibitory (antagonists) or enhancing (agonists).
A nucleic acid sequence is "isocoding with" a reference nucleic acid sequence when the nucleic acid sequence encodes a polypeptide having the same amino acid sequence as the polypeptide encoded by the reference nucleic acid sequence.
Isogenic: plants that are genetically identical, except that they may differ by the presence or absence of a heterologous DNA sequence.
Isolated: in the context of the present invention, an isolated DNA
molecule or an isolated enzyme is a DNA molecule or enzyme that, by human intervention, exists apart from its native environment and is therefore not a product of nature. An isolated DNA molecule or enzyme may exist in a purified form or may exist in a non-native environment such as, for example, in a transgenic host cell.
Mature protein: protein from which the transit peptide, signal peptide, and/or propeptide portions have been removed.
Minimal Promoter: the smallest piece of a promoter, such as a TATA
element, that can support any transcription. A minimal promoter typically has greatly reduced promoter activity in the absence of upstream activation. In the presence of a suitable transcription factor, the minimal promoter functions to permit transcription.
Modified Enzyme Activity: enzyme activity different from that which naturally occurs in a plant (i.e. enzyme activity that occurs naturally in the absence of direct or indirect manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally occurring enzyme activity.
Native: refers to a gene that is present in the genome of an untransformed plant cell.
Naturally occurring: the term "naturally occurring" is used to describe an object that can be found in nature as distinct from being artificially produced by man. For example, a protein or nucleotide sequence present in an organism (including a virus), which can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory, is naturally occurring.
Nucleic acid: the term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and complementary sequences and as well as the sequence explicitly indicated.
Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19: 5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:
2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8: 91-98 (1994)). The terms "nucleic acid" or "nucleic acid sequence" may also be used interchangeably with gene, cDNA, and mRNA encoded by a gene.
"ORF" means open reading frame.
Percent identity: the phrases "percent identityl" or "percent identical," in the context of two nucleic acid or protein sequences, refers to two or more sequences or subsequences that have for example 60%, specifically 70%, more specifically 80%, still more specifically 90%, even more specifically 95%, and most specifically at least 99% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. Specifically, the percent identity exists over a region of the sequences that is at least about 50 residues in length, more specifically over a region of at least about 100 residues, and most specifically the percent identity exists over at least about 150 residues. In an especially specific embodiment, the percent identity exists over the entire length of the coding regions.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appi. Math.
2: 482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of Pearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual inspection (see generally, Ausubel et al., infra).
One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215: 403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., 1990). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when the cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below due to the accumulation of one or more negative-scoring residue alignments, or the end of either sequence is reached. The BLAST
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP
program uses as defaults a wordiength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA 89: 10915 (1989)).
In addition to calculating percent sequence identity, the BLAST
algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:
5873-5787 (1993)). One measure of similarity provided by the BLAST
algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic acid sequence is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid sequence to the reference nucleic acid sequence is less than about 0.1, more specifically less than about 0.01, and most specifically less than about 0.001.
Pre-protein: protein that is normally targeted to a cellular organelle, such as a chloroplast, and still comprises its native transit peptide.
Purified: the term "purified," when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It is specifically in a homogeneous state although it can be in either a dry or aqueous solution.
Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel.
Particularly, it means that the nucleic acid or protein is at least about 50%
pure, more specifically at least about 85% pure, and most specifically at least about 99% pure.
Two nucleic acids are "recombined" when sequences from each of the two nucleic acids are combined in a progeny nucleic acid. Two sequences are "directly" recombined when both of the nucleic acids are substrates for recombination. Two sequences are "indirectly recombined" when the sequences are recombined using an intermediate such as a cross-over oligonucleotide. For indirect recombination, no more than one of the sequences is an actual substrate for recombination, and in some cases, neither sequence is a substrate for recombination.
"Regulatory elements" refer to sequences involved in controlling the expression of a nucleotide sequence. Regulatory elements comprise a promoter operatively linked to the nucleotide sequence of interest and termination signals. They also typically encompass sequences required for proper translation of the nucleotide sequence.
Significant Increase: an increase in enzymatic activity that is larger than the margin of error inherent in the measurement technique, specifically an increase by about 2-fold or greater of the activity of the wild-type enzyme in the presence of the inhibitor, more specifically an increase by about 5-fold or greater, and most specifically an increase by about 10-fold or greater.
Significantly less: means that the amount of a product of an enzymatic reaction is reduced by more than the margin of error inherent in the measurement technique, specifically a decrease by about 2-fold or greater of the activity of the wild-type enzyme in the absence of the inhibitor, more specifically an decrease by about 5-fold or greater, and most specifically an decrease by about 10-fold or greater.
Specific Binding/Immunological Cross-Reactivity: An indication that two nucleic acid sequences or proteins are substantially identical is that the protein encoded by the first nucleic acid is immunologically cross reactive with, or specifically binds to, the protein encoded by the second nucleic acid.
Thus, a protein is typically substantially identical to a second protein, for example, where the two proteins differ only by conservative substitutions.
The phrase "specifically (or selectively) binds to an antibody," or "specifically (or selectively) immunoreactive with," when referring to a protein or peptide, refers to a binding reaction which is determinative of the presence of the protein in the presence of a heterogeneous population of proteins and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. For example, antibodies raised to the protein with the amino acid sequence encoded by any of the nucleic acid sequences of the invention can be selected to obtain antibodies specifically immunoreactive with that protein and not with other proteins except for polymorphic variants. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein. For example, solid-phase ELISA
immunoassays, Western blots, or immunohistochemistry are routinely used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York "Harlow and Lane"), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity. Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.
"Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern hybridizations are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays" Elsevier, New York. Generally, highly stringent hybridization and wash conditions are selected to be about 5 C
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Typically, under "stringent conditions" a probe will hybridize to its target subsequence, but to no other sequences.
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe.
Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42 C, with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.1 5M NaCI at 72 C for about minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65 C for 15 minutes (see, Sambrook, infra, for a description of SSC buffer).
Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a 15 duplex of, e.g., more than 100 nucleotides, is lx SSC at 45 C for 15 minutes.
An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40 C for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30 C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
The following are examples of sets of hybridization/wash conditions that may be used to clone nucleotide sequences that are homologues of reference nucleotide sequences of the present invention: a reference nucleotide sequence specifically hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 2X SSC, 0.1% SDS at 50 C, more desirably in 7%
sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 1X SSC, 0.1% SDS at 50 C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 0.5X SSC, 0.1% SDS at 50 C, specifically in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50 C with washing in 0.1X SSC, 0.1%
SDS at 50 C, more specifically in 7% sodium dodecyl sulfate (SDS), 0.5 M
NaPO4, 1 mM EDTA at 50 C with washing in 0.1X SSC, 0.1% SDS at 65 C.
A "subsequence" refers to a sequence of nucleic acids or amino acids that comprise a part of a longer sequence of nucleic acids or amino acids (e.g., protein) respectively.
Substantial similarity: The term "substantial similarity" in the context of two nucleic acid or protein sequences, refers to two or more sequences or subsequences that are substantially similar, for example that have 50%, specifically 60%, more specifically 70%, even more specifically 80%, still more specifically 90%, further more specifically 95%, and most specifically 99%
sequence identity.
Substrate: a substrate is the molecule that an enzyme naturally recognizes and converts to a product in the biochemical pathway in which the enzyme naturally carries out its function, or is a modified version of the molecule, which is also recognized by the enzyme and is converted by the enzyme to a product in an enzymatic reaction similar to the naturally-occurring reaction.
Transformation: a process for introducing heterologous DNA into a plant cell, plant tissue, or plant. Transformed plant cells, plant tissue, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof.
"Transformed," "transgenic," and "recombinant" refer to a host organism such as a bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome of the host or the nucleic acid molecule can also be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof. A "non-transformed," "non-transgenic," or "non-recombinant" host refers to a wild-type organism, e.g., a bacterium or plant, which does not contain the heterologous nucleic acid molecule.
Viability: "viability" as used herein refers to a fitness parameter of a plant. Plants are assayed for their homozygous performance of plant development, indicating which proteins are essential for plant growth.
DETAILED DESCRIPTION OF THE INVENTION
1. General Description of Trait Functional Genomics The goal of functional genomics is to identify genes controlling expression of organismal phenotypes, and employs a variety of methodologies, including but not limited to bioinformatics, gene expression studies, gene and gene product interactions, genetics, biochemistry and molecular genetics. For example, bioinformatics can assign function to a given gene by identifying genes in heterologous organisms with a high degree of similarity (homology) at the amino acid or nucleotide level. Expression of a gene at the mRNA or protein levels can assign function by linking expression of a gene to an environmental response, a developmental process or a genetic (mutational) or molecular genetic (gene overexpression or underexpression) perturbation. Expression of a gene at the mRNA level can be ascertained either alone (Northern analysis) or in concert with other genes (microarray analysis), whereas expression of a gene at the protein level can be ascertained either alone (native or denatured protein gel or immunoblot analysis) or in concert with other genes (proteomic analysis). Knowledge of protein/protein and protein/DNA interactions can assign function by identifying proteins and nucleic acid sequences acting together in the same biological process. Genetics can assign function to a gene by demonstrating that DNA
lesions (mutations) in the gene have a quantifiable effect on the organism, including but not limited to: its development; hormone biosynthesis and response; growth and growth habit (plant architecture); mRNA expression profiles; protein expression profiles; ability to resist diseases; tolerance of abiotic stresses; ability to acquire nutrients; photosynthetic efficiency;
altered primary and secondary metabolism; and the composition of various plant organs. Biochemistry can assign function by demonstrating that the protein encoded by the gene, typically when expressed in a heterologous organism, possesses a certain enzymatic activity, alone or in combination with other proteins. Molecular genetics can assign function by overexpressing or underexpressing the gene in the native plant or in heterologous organisms, and observing quantifiable effects as described in functional assignment by genetics above. In functional genomics, any or all of these approaches are utilized, often in concert, to assign genes to functions across any of a number of organismal phenotypes.
It is recognized by those skilled in the art that these different methodologies can each provide data as evidence for the function of a particular gene, and that such evidence is stronger with increasing amounts of data used for functional assignment: specifically from a single methodology, more specifically from two methodologies, and even more specifically from more than two methodologies. In addition, those skilled in the art are aware that different methodologies can differ in the strength of the evidence for the assignment of gene function. Typically, but not always, a datum of biochemical, genetic and molecular genetic evidence is considered stronger than a datum of bioinformatic or gene expression evidence. Finally, those skilled in the art recognize that, for different genes, a single datum from a single methodology can differ in terms of the strength of the evidence provided by each distinct datum for the assignment of the function of these different genes.
The objective of crop trait functional genomics is to identify crop trait genes, i.e. genes capable of conferring useful agronomic traits in crop plants.
Such agronomic traits include, but are not limited to: enhanced yield, whether in quantity or quality; enhanced nutrient acquisition and enhanced metabolic efficiency; enhanced or altered nutrient composition of plant tissues used for food, feed, fiber or processing; enhanced utility for agricultural or industrial processing; enhanced resistance to plant diseases; enhanced tolerance of adverse environmental conditions (abiotic stresses) including but not limited to drought, excessive cold, excessive heat, or excessive soil salinity or extreme acidity or alkalinity; and alterations in plant architecture or development, including changes in developmental timing. The deployment of such identified trait genes by either transgenic or non-transgenic means could materially improve crop plants for the benefit of agriculture.
Cereals are the most important crop plants on the planet, in terms of both human and animal consumption. Genomic synteny (conservation of gene order within large chromosomal segments) is observed in rice, maize, wheat, barley, rye, oats and other agriculturally important monocots, which facilitates the mapping and isolation of orthologous genes from diverse cereal species based on the sequence of a single cereal gene. Rice has the smallest (- 420 Mb) genome among the cereal grains, and has recently been a major focus of public and private genomic and EST sequencing efforts.
To identify crop trait genes in the rice [wheat] genome controlling [trait], genes from the rice draft genome sequence [wheat EST databases] were prioritized based on one or more functional genomic methodologies. For example, genome-wide expression studies of rice plants infected with rice blast fungus (Magnaporthe grisea) were used to prioritize candidate genes controlling disease resistance. Full-length and partial cDNAs of rice trait gene candidates could then be predicted based on analysis of the rice whole-genome sequence, and isolated by designing and using primers for PCR
amplification using a commercially available PCR primer-picking program.
Primers were used for PCR amplification of full-length or partial cDNAs from rice cDNA libraries or first-strand cDNA. cDNA clones resulting from either approach were used for the construction of vectors designed for altering expression of these genes in transgenic plants using plant molecular genetic methodologies, which are described in detail below. Alteration of plant phenotype through overexpression or underexpression of key trait genes in transgenic plants is a robust and established method for assigning functions to plant genes. Assays to identify transgenic plants with alterations in traits of interest are to be used to unambiguously assign the utility of these genes for the improvement of rice, and by extension, other cereals, either by transgenic or classical breeding methods.
II. Identifying, Cloning and Sequencing cDNAs The cloning and sequencing of the cDNAs of the present invention are described in Example 1.
The isolated nucleic acids and proteins of the present invention are usable over a range of plants, monocots and dicots, in particular monocots such as rice, wheat, barley and maize. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp., or teosinte. In a most specific embodiment, the cereal is rice. Other plants genera include, but are not limited to, Cucurbita, Rosa, Vitis, Juglans, Gragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, Lolium, Oryza, Avena, Hordeum, Secale, Allium, and Triticum.
The present invention also provides a method of genotyping a plant or plant part comprising a nucleic acid molecule of the present invention.
Optionally, the plant is a monocot such as, but not limited rice or wheat.
Genotyping provides a means of distinguishing homologs of a chromosome pari and can be used to differentiate segregants in a plant population.
Molecular marker methods can be used in phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomeal segments affecting mongenic traits, map based cloning, and the study of quantitative inheritance (see Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark ed., Springer-Verlag, Berlin 1997; Paterson, A.H., "The DNA Revolution", chapter 2 in Genome Mapping in Plants, Paterson, A.H. ed., Academic Press/R.G. Lands Co., Austin, Texas 1996).
The method of genotyping may employ any number of molecular marker analytical techniques such as, but not limited to, restriction length polymorphisms (RFLPs). As is well known in the art, RFLPs are produced by differences in the DNA restriction fragment lengths resulting from nucleotide differences between alleles of the same gene. Thus, the present invention provides a method of following segregation of a gene or nucleic acid of the present invention or chromosomal sequences genetically linked by using RFLP analysis. Linked chromosomal sequences are within 50 centiMorgans (50 cM), within 40 or 30 cM, specifically within 20 or 10 cM, more specifically within 5, 3, 2, or 1 cM of the nucleic acid of the invention.
Ill. Traits of Interest The present invention encompasses the identification and isolation of polynucleotides encoding proteins involved in sugar sensing and, ultimately, in nitrogen uptake and carbon metabolism. Altering the expression of genes related to these traits can be used to improve or modify plants and/or grain, as desired. Examples describe the isolated genes of interest and methods of analyzing the alteration of expression and their effects on the plant characteristics.
One aspect of the present invention provides compositions and methods for altering (i.e. increasing or decreasing) the level of nucleic acid molecules and polypeptides of the present invention in plants. In particular, the nucleic acid molecules and polypeptides of the invention are expressed constitutively, temporally or spatially, e.g. at developmental stages, in certain tissues, and/or quantities, which are uncharacteristic of non-recombinantly engineered plants.
Therefore, the present invention provides utility in such exemplary applications as altering the specified characteristics identified above.
VI. Controlling Gene Expression in Transgenic Plants The invention further relates to transformed cells comprising the nucleic acid molecules, transformed plants, seeds, and plant parts, and methods of modifying phenotypic traits of interest by altering the expression of the genes of the invention.
A. Modification of Coding Sequences and Adjacent Sequences The transgenic expression in plants of genes derived from heterologous sources may involve the modification of those genes to achieve and optimize their expression in plants. In particular, bacterial ORFs which encode separate enzymes but which are encoded by the same transcript in the native microbe are best expressed in plants on separate transcripts. To achieve this, each microbial ORF is isolated individually and cloned within a cassette which provides a plant promoter sequence at the 5' end of the ORF
and a plant transcriptional terminator at the 3' end of the ORF. The isolated ORF sequence specifically includes the initiating ATG codon and the terminating STOP codon but may include additional sequence beyond the initiating ATG and the STOP codon. In addition, the ORF may be truncated, but still retain the required activity; for particularly long ORFs, truncated versions which retain activity may be preferable for expression in transgenic organisms. By "plant promoter" and "plant transcriptional terminator" it is intended to mean promoters and transcriptional terminators that operate within plant cells. This includes promoters and transcription terminators that may be derived from non-plant sources such as viruses (an example is the Cauliflower Mosaic Virus).
In some cases, modification to the ORF coding sequences and adjacent sequence is not required. It is sufficient to isolate a fragment containing the ORF of interest and to insert it downstream of a plant promoter.
For example, Gaffney et al. (Science 261: 754-756 (1993)) have expressed the Pseudomonas nahG gene in transgenic plants under the control of the CaMV 35S promoter and the CaMV tml terminator successfully without modification of the coding sequence and with nucleotides of the Pseudomonas gene upstream of the ATG still attached, and nucleotides downstream of the STOP codon still attached to the nahG ORF. Specifically, as little adjacent microbial sequence as possible should be left attached upstream of the ATG and downstream of the STOP codon. In practice, such construction may depend on the availability of restriction sites.
In other cases, the expression of genes derived from microbial sources may provide problems in expression. These problems have been well characterized in the art and are particularly common with genes derived from certain sources such as Bacillus. These problems may apply to the nucleotide sequence of this invention and the modification of these genes can be undertaken using techniques now well known in the art. The following problems may be encountered:
1. Codon Usage.
The specific codon usage in plants differs from the specific codon usage in certain microorganisms. Comparison of the usage of codons within a cloned microbial ORF to usage in plant genes (and in particular genes from the target plant) will enable an identification of the codons within the ORF
that should specifically be changed. Typically plant evolution has tended towards a strong preference of the nucleotides C and G in the third base position of monocotyledons, whereas dicotyledons often use the nucleotides A or T at this position. By modifying a gene to incorporate specific codon usage for a particular target transgenic species, many of the problems described below for GC/AT content and illegitimate splicing will be overcome.
2. GC/AT Content.
Plant genes typically have a GC content of more than 35%. ORF
sequences which are rich in A and T nucleotides can cause several problems in plants. Firstly, motifs of ATTTA are believed to cause destabilization of messages and are found at the 3' end of many short-lived mRNAs. Secondly, the occurrence of polyadenylation signals such as AATAAA at inappropriate positions within the message is believed to cause premature truncation of transcription. In addition, monocotyledons may recognize AT-rich sequences as splice sites (see below).
3. Sequences Adjacent to the Initiating Methionine.
Plants differ from microorganisms in that their messages do not possess a defined ribosome-binding site. Rather, it is believed that ribosomes attach to the 5' end of the message and scan for the first available ATG at which to start translation. Nevertheless, it is believed that there is a preference for certain nucleotides adjacent to the ATG and that expression of microbial genes can be enhanced by the inclusion of a eukaryotic consensus translation initiator at the ATG. Clontech (1993/1994 catalog, page 210, incorporated herein by reference) have suggested one sequence as a consensus translation initiator for the expression of the E. coli uidA gene in plants. Further, Joshi (N.A.R. 15: 6643-6653 (1987), incorporated herein by reference) has compared many plant sequences adjacent to the ATG and suggests another consensus sequence. In situations where difficulties are encountered in the expression of microbial ORFs in plants, inclusion of one of these sequences at the initiating ATG may improve translation. In such cases the last three nucleotides of the consensus may not be appropriate for inclusion in the modified sequence due to their modification of the second AA
residue. Specific sequences adjacent to the initiating methionine may differ between different plant species. A survey of 14 maize genes located in the GenBank database provided the following results:
Position Before the Initiating ATG in 14 Maize Genes:
This analysis can be done for the desired plant species into which the nucleotide sequence is being incorporated, and the sequence adjacent to the ATG modified to incorporate the specific nucleotides.
4. Removal of Illegitimate Splice Sites.
Genes cloned from non-plant sources and not optimized for expression in plants may also contain motifs which may be recognized in plants as 5' or 3' splice sites, and be cleaved, thus generating truncated or deleted messages. These sites can be removed using the techniques well known in the art.
Techniques for the modification of coding sequences and adjacent sequences are well known in the art. In cases where the initial expression of a microbial ORF is low and it is deemed appropriate to make alterations to the sequence as described above, then the construction of synthetic genes can be accomplished according to methods well known in the art. These are, for example, described in the published patent disclosures EP 0 385 962 (to Monsanto), EP 0 359 472 (to Lubrizol) and WO 93/07278 (to Ciba-Geigy), all of which are incorporated herein by reference. In most cases it is preferable to assay the expression of gene constructions using transient assay protocols (which are well known in the art) prior to their transfer to transgenic plants.
B. Construction of Plant Expression Cassettes Coding sequences intended for expression in transgenic plants are first assembled in expression cassettes behind a suitable promoter expressible in plants. The expression cassettes may also comprise any further sequences required or selected for the expression of the transgene. Such sequences include, but are not restricted to, transcription terminators, extraneous sequences to enhance expression such as introns, vital sequences, and sequences intended for the targeting of the gene product to specific organelles and cell compartments. These expression cassettes can then be easily transferred to the plant transformation vectors described below. The following is a description of various components of typical expression cassettes.
1. Promoters The selection of the promoter used in expression cassettes will determine the spatial and temporal expression pattern of the transgene in the transgenic plant. Selected promoters will express transgenes in specific cell types (such as leaf epidermal cells, mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, for example) and the selection will reflect the desired location of accumulation of the gene product.
Alternatively, the selected promoter may drive expression of the gene under various inducing conditions. Promoters vary in their strength, i.e., ability to promote transcription. Depending upon the host cell system utilized, any one of a number of suitable promoters can be used, including the gene's native promoter. The following are non-limiting examples of promoters that may be used in expression cassettes.
a. Constitutive Expression, the Ubiquitin Promoter:
Ubiquitin is a gene product known to accumulate in many cell types and its promoter has been cloned from several species for use in transgenic plants (e.g. sunflower - Binet et al. Plant Science 79: 87-94 (1991); maize -Christensen et al. Plant Molec. Biol. 12: 619-632 (1989); and Arabidopsis -Callis et al., J. Biol. Chem. 265:12486-12493 (1990) and Norris et al., Plant Mol. Biol. 21:895-906 (1993)). The maize ubiquitin promoter has been developed in transgenic monocot systems and its sequence and vectors constructed for monocot transformation are disclosed in the patent publication EP 0 342 926 (to Lubrizol) which is herein incorporated by reference. Taylor et al. (Plant Cell Rep. 12: 491-495 (1993)) describe a vector (pAHC25) that comprises the maize ubiquitin promoter and first intron and its high activity in cell suspensions of numerous monocotyledons when introduced via microprojectile bombardment. The Arabidopsis ubiquitin promoter is ideal for use with the nucleotide sequences of the present invention. The ubiquitin promoter is suitable for gene expression in transgenic plants, both monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of the transformation vectors described in this application, modified by the introduction of the appropriate ubiquitin promoter and/or intron sequences.
b. Constitutive Expression, the CaMV 35S Promoter:
Construction of the plasmid pCGN1761 is described in the published patent application EP 0 392 225 (Example 23), which is hereby incorporated by reference. pCGN1761 contains the "double" CaMV 35S promoter and the tml transcriptional terminator with a unique EcoRl site between the promoter and the terminator and has a pUC-type backbone. A derivative of pCGN1761 is constructed which has a modified polylinker which includes Notl and Xhol sites in addition to the existing EcoRl site. This derivative is designated pCGN 1761 ENX. pCGN 1761 ENX is useful for the cloning of cDNA sequences or coding sequences (including microbial ORF sequences) within its polylinker for the purpose of their expression under the control of the 35S promoter in transgenic plants. The entire 35S promoter-coding sequence-tml terminator cassette of such a construction can be excised by Hindlil, Sphl, Sall, and Xbal sites 5' to the promoter and Xbal, BamHl and BgII sites 3' to the terminator for transfer to transformation vectors such as those described below. Furthermore, the double 35S promoter fragment can be removed by 5' excision with Hindlll, Sphl, Sall, Xbal, or Pstl, and 3' excision with any of the polylinker restriction sites (EcoRl, Notl or Xhol) for replacement with another promoter. If desired, modifications around the cloning sites can be made by the introduction of sequences that may enhance translation. This is particularly useful when overexpression is desired. For example, pCGN1761ENX may be modified by optimization of the translational initiation site as described in Example 37 of U.S. Patent No. 5,639,949, incorporated herein by reference.
c. Constitutive Expression, the Actin Promoter:
Several isoforms of actin are known to be expressed in most cell types and consequently the actin promoter is a good choice for a constitutive promoter. In particular, the promoter from the rice Actl gene has been cloned and characterized (McElroy et al. Plant Cell 2: 163-171 (1990)). A 1.3kb fragment of the promoter was found to contain all the regulatory elements required for expression in rice protoplasts. Furthermore, numerous expression vectors based on the Actl promoter have been constructed specifically for use in monocotyledons (McElroy et al. Mol. Gen. Genet. 231:
150-160 (1991)). These incorporate the Actl-intron 1, Adhl 5' flanking sequence and Adhi-intron 1(from the maize alcohol dehydrogenase gene) and sequence from the CaMV 35S promoter. Vectors showing highest expression were fusions of 35S and Actl intron or the Actl 5' flanking sequence and the Actl intron. Optimization of sequences around the initiating ATG (of the GUS reporter gene) also enhanced expression. The promoter expression cassettes described by McElroy et al. (Mol. Gen. Genet. 231: 150-160 (1991)) can be easily modified for gene expression and are particularly suitable for use in monocotyledonous hosts. For example, promoter-containing fragments is removed from the McElroy constructions and used to replace the double 35S promoter in pCGN1761ENX, which is then available for the insertion of specific gene sequences. The fusion genes thus constructed can then be transferred to appropriate transformation vectors. In a separate report, the rice Actl promoter with its first intron has also been found to direct high expression in cultured barley cells (Chibbar et al. Plant Cell Rep. 12: 506-509 (1993)).
d. Inducible Expression, PR-1 Promoters:
The double 35S promoter in pCGN1761 ENX may be replaced with any other promoter of choice that will result in suitably high expression levels.
By way of example, one of the chemically regulatable promoters described in U.S. Patent No. 5,614,395, such as the tobacco PR-1a promoter, may replace the double 35S promoter. Alternately, the Arabidopsis PR-1 promoter described in Lebel et al., Plant J. 16:223-233 (1998) may be used. The promoter of choice is specifically excised from its source by restriction enzymes, but can alternatively be PCR-amplified using primers that carry appropriate terminal restriction sites. Should PCR-amplification be undertaken, the promoter should be re-sequenced to check for amplification errors after the cloning of the amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR-la promoter is cleaved from plasmid pCIB1004 (for construction, see example 21 of EP 0 332 104, which is hereby incorporated by reference) and transferred to plasmid pCGN1761ENX (Uknes et al., Plant Cell 4: 645-656 (1992)). pCIB1004 is cleaved with Ncol and the resultant 3' overhang of the linearized fragment is rendered blunt by treatment with T4 DNA polymerase. The fragment is then cleaved with Hindlll and the resultant PR-la promoter-containing fragment is gel purified and cloned into pCGN1761ENX from which the double 35S
promoter has been removed. This is accomplished by cleavage with Xhol and blunting with T4 polymerase, followed by cleavage with Hindlll, and isolation of the larger vector-terminator containing fragment into which the pCIB1004 promoter fragment is cloned. This generates a pCGN1761ENX
derivative with the PR-la promoter and the tml terminator and an intervening polylinker with unique EcoRl and Notl sites. The selected coding sequence can be inserted into this vector, and the fusion products (i.e. promoter-gene-terminator) can subsequently be transferred to any selected transformation vector, including those described infra. Various chemical regulators may be employed to induce expression of the selected coding sequence in the plants transformed according to the present invention, including the benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in U.S. Patent Nos. 5,523,311 and 5,614,395.
e. Inducible Expression, an Ethanol-Inducible Promoter:
A promoter inducible by certain alcohols or ketones, such as ethanol, may also be used to confer inducible expression of a coding sequence of the present invention. Such a promoter is for example the alcA gene promoter from Aspergillus nidulans (Caddick et al. (1998) Nat. Biotechnol 16:177-180).
In A. nidulans, the alcA gene encodes alcohol dehydrogenase I, the expression of which is regulated by the AIcR transcription factors in presence of the chemical inducer. For the purposes of the present invention, the CAT
coding sequences in plasmid palcA:CAT comprising a alcA gene promoter sequence fused to a minimal 35S promoter (Caddick et al. (1998) Nat.
Biotechnol 16:177-180) are replaced by a coding sequence of the present invention to form an expression cassette having the coding sequence under the control of the alcA gene promoter. This is carried out using methods well known in the art.
f. Inducible Expression, a Glucocorticoid-Inducible Promoter:
Induction of expression of a nucleic acid sequence of the present invention using systems based on steroid hormones is also contemplated.
For example, a glucocorticoid-mediated induction system is used (Aoyama and Chua (1997) The Plant Journal 11: 605-612) and gene expression is induced by application of a glucocorticoid, for example a synthetic glucocorticoid, specifically dexamethasone, specifically at a concentration ranging from 0.1 mM to 1mM, more specifically from 10mM to 100mM. For the purposes of the present invention, the luciferase gene sequences are replaced by a nucleic acid sequence of the invention to form an expression cassette having a nucleic acid sequence of the invention under the control of six copies of the GAL4 upstream activating sequences fused to the 35S
minimal promoter. This is carried out using methods well known in the art.
The trans-acting factor comprises the GAL4 DNA-binding domain (Keegan et al. (1986) Science 231: 699-704) fused to the transactivating domain of the herpes viral protein VP16 (Triezenberg et al. (1988) Genes Devel. 2: 718-729) fused to the hormone-binding domain of the rat glucocorticoid receptor (Picard et al. (1988) Cell 54: 1073-1080). The expression of the fusion protein is controlled either by a promoter known in the art or described here. This expression cassette is also comprised in the plant comprising a nucleic acid sequence of the invention fused to the 6xGAL4/minimal promoter. Thus, tissue- or organ-specificity of the fusion protein is achieved leading to inducible tissue- or organ-specificity of the insecticidal toxin.
g. Root Specific Expression:
Another pattern of gene expression is root expression. A suitable root promoter is the promoter of the maize metallothionein-like (MTL) gene described by de Framond (FEBS 290: 103-106 (1991)) and also in U.S.
Patent No. 5,466,785, incorporated herein by reference. This "MTL" promoter is transferred to a suitable vector such as pCGN1761 ENX for the insertion of a selected gene and subsequent transfer of the entire promoter-gene-terminator cassette to a transformation vector of interest.
h. Wound-Inducible Promoters:
Wound-inducible promoters may also be suitable for gene expression.
Numerous such promoters have been described (e.g. Xu et al. Plant Molec.
Biol. 22: 573-588 (1993), Logemann et al. Plant Cell 1: 151-158 (1989), Rohrmeier & Lehle, Plant Molec. Biol. 22: 783-792 (1993), Firek et al. Plant Molec. Biol. 22: 129-142 (1993), Warner et al. Plant J. 3: 191-201 (1993)) and all are suitable for use with the instant invention. Logemann et al. describe the 5' upstream sequences of the dicotyledonous potato wunl gene. Xu et al.
show that a wound-inducible promoter from the dicotyledon potato (pin2) is active in the monocotyledon rice. Further, Rohrmeier & Lehle describe the cloning of the maize Wipl cDNA which is wound induced and which can be used to isolate the cognate promoter using standard techniques. Similar, Firek et al. and Warner et al. have described a wound-induced gene from the monocotyledon Asparagus officinalis, which is expressed at local wound and pathogen invasion sites. Using cloning techniques well known in the art, these promoters can be transferred to suitable vectors, fused to the genes pertaining to this invention, and used to express these genes at the sites of plant wounding.
i. Pith-Specific Expression:
Patent Application WO 93/07278, which is herein incorporated by reference, describes the isolation of the maize trpA gene, which is preferentially expressed in pith cells. The gene sequence and promoter extending up to -1726 bp from the start of transcription are presented. Using standard molecular biological techniques, this promoter, or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 35S
promoter and be used to drive the expression of a foreign gene in a pith-specific manner. In fact, fragments containing the pith-specific promoter or parts thereof can be transferred to any vector and modified for utility in transgenic plants.
j. Leaf-Specific Expression:
A maize gene encoding phosphoenol carboxylase (PEPC) has been described by Hudspeth & Grula (Plant Molec Biol 12: 579-589 (1989)). Using standard molecular biological techniques the promoter for this gene can be used to drive the expression of any gene in a leaf-specific manner in transgenic plants.
k. Pollen-Specific Expression:
WO 93/07278 describes the isolation of the maize calcium-dependent protein kinase (CDPK) gene which is expressed in pollen cells. The gene sequence and promoter extend up to 1400 bp from the start of transcription.
Using standard molecular biological techniques, this promoter or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a nucleic acid sequence of the invention in a pollen-specific manner.
2. Transcriptional Terminators A variety of transcriptional terminators are available for use in expression cassettes. These are responsible for the termination of transcription beyond the transgene and correct mRNA polyadenylation.
Appropriate transcriptional terminators are those that are known to function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline synthase terminator and the pea rbcS E9 terminator. These can be used in both monocotyledons and dicotyledons. In addition, a gene's native transcription terminator may be used.
3. Sequences for the Enhancement or Regulation of Expression Numerous sequences have been found to enhance gene expression from within the transcriptional unit and these sequences can be used in conjunction with the genes of this invention to increase their expression in transgenic plants.
Various intron sequences have been shown to enhance expression, particularly in monocotyledonous cells. For example, the introns of the maize Adhl gene have been found to significantly enhance the expression of the wild-type gene under its cognate promoter when introduced into maize cells.
Intron 1 was found to be particularly effective and enhanced expression in fusion constructs with the chloramphenicol acetyltransferase gene (Callis et al., Genes Develop. 1: 1183-1200 (1987)). In the same experimental system, the intron from the maize bronzel gene had a similar effect in enhancing expression. Intron sequences have been routinely incorporated into plant transformation vectors, typically within the non-translated leader.
A number of non-translated leader sequences derived from viruses are also known to enhance expression, and these are particularly effective in dicotyledonous cells. Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the "W-sequence"), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing expression (e.g. Gallie et al. Nucl. Acids Res. 15: 8693-8711 (1987); Skuzeski et al. Plant Molec. Biol. 15: 65-79 (1990)). Other leader sequences known in the art include but are not limited to: picornavirus leaders, for example, EMCV
leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein, 0., Fuerst, T.
R., and Moss, B. PNAS USA 86:6126-6130 (1989)); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al., 1986); MDMV
leader (Maize Dwarf Mosaic Virus); Virology 154:9-20); human immunoglobulin heavy-chain binding protein (BiP) leader, (Macejak, D. G., and Sarnow, P., Nature 353: 90-94 (1991); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4), (Jobling, S. A., and Gehrke, L., Nature 325:622-625 (1987); tobacco mosaic virus leader (TMV), (Gallie, D. R. et al., Molecular Biology of RNA, pages 237-256 (1989); and Maize Chlorotic Mottle Virus leader (MCMV) (Lommel, S. A. et al., Virology 81:382-385 (1991). See also, Della-Cioppa et al., Plant Physiology 84:965-968 (1987).
In addition to incorporating one or more of the aforementioned elements into the 5' regulatory region of a target expression cassette of the invention, other elements peculiar to the target expression cassette may also be incorporated. Such elements include but are not limited to a minimal promoter. By minimal promoter it is intended that the basal promoter elements are inactive or nearly so without upstream activation. Such a promoter has low background activity in plants when there is no transactivator present or when enhancer or response element binding sites are absent. One minimal promoter that is particularly useful for target genes in plants is the Bzl minimal promoter, which is obtained from the bronzel gene of maize. The Bzl core promoter is obtained from the "myc" mutant Bzl-luciferase construct pBzlLucR98 via cleavage at the Nhel site located at -53 to -58. Roth et al., Plant Cell 3: 317 (1991). The derived Bzl core promoter fragment thus extends from -53 to +227 and includes the Bzl intron-1 in the 5' untranslated region. Also useful for the invention is a minimal promoter created by use of a synthetic TATA element. The TATA element allows recognition of the promoter by RNA polymerase factors and confers a basal level of gene expression in the absence of activation (see generally, Mukumoto (1993) Plant Mol Biol 23: 995-1003; Green (2000) Trends Biochem Sci 25: 59-63) 4. Targeting of the Gene Product Within the Cell Various mechanisms for targeting gene products are known to exist in plants and the sequences controlling the functioning of these mechanisms have been characterized in some detail. For example, the targeting of gene products to the chloroplast is controlled by a signal sequence found at the amino terminal end of various proteins which is cleaved during chloroplast import to yield the mature protein (e.g. Comai et al. J. Biol. Chem. 263:
15104-15109 (1988)). These signal sequences can be fused to heterologous gene products to effect the import of heterologous products into the chioroplast (van den Broeck, et al. Nature 313: 358-363 (1985)). DNA
encoding for appropriate signal sequences can be isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB protein, the EPSP
synthase enzyme, the GS2 protein and many other proteins which are known to be chloroplast localized. See also, the section entitled "Expression With Chloroplast Targeting" in Example 37 of U.S. Patent No. 5,639,949.
Other gene products are localized to other organelles such as the mitochondrion and the peroxisome (e.g. Unger et al. Plant Molec. Biol. 13:
411-418 (1989)). The cDNAs encoding these products can also be manipulated to effect the targeting of heterologous gene products to these organelles. Examples of such sequences are the nuclear-encoded ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting cellular protein bodies has been described by Rogers et al. (Proc. Natl. Acad.
Sci. USA 82: 6512-6516 (1985)).
In addition, sequences have been characterized which cause the targeting of gene products to other cell compartments. Amino terminal sequences are responsible for targeting to the ER, the apoplast, and extracellular secretion from aleurone cells (Koehler & Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences in conjunction with carboxy terminal sequences are responsible for vacuolar targeting of gene products (Shinshi et al. Plant Molec. Biol. 14: 357-368 (1990)).
By the fusion of the appropriate targeting sequences described above to transgene sequences of interest it is possible to direct the transgene product to any organelle or cell compartment. For chloroplast targeting, for example, the chloroplast signal sequence from the RUBISCO gene, the CAB
gene, the EPSP synthase gene, or the GS2 gene is fused in frame to the amino terminal ATG of the transgene. The signal sequence selected should include the known cleavage site, and the fusion constructed should take into account any amino acids after the cleavage site which are required for cleavage. In some cases this requirement may be fulfilled by the addition of a small number of amino acids between the cleavage site and the transgene ATG or, alternatively, replacement of some amino acids within the transgene sequence. Fusions constructed for chloroplast import can be tested for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions followed by in vitro chloroplast uptake using techniques described by Bartlett et al. In: Edelmann et al. (Eds.) Methods in Chloroplast Molecular Biology, Elsevier pp 1081-1091 (1982) and Wasmann et al. Mol.
Gen. Genet. 205: 446-453 (1986). These construction techniques are well known in the art and are equally applicable to mitochondria and peroxisomes.
The above-described mechanisms for cellular targeting can be utilized not only in conjunction with their cognate promoters, but also in conjunction with heterologous promoters so as to effect a specific cell-targeting goal under the transcriptional regulation of a promoter that has an expression pattern different to that of the promoter from which the targeting signal derives.
C. Construction of Plant Transformation Vectors Numerous transformation vectors available for plant transformation are known to those of ordinary skill in the plant transformation arts, and the genes pertinent to this invention can be used in conjunction with any such vectors.
The selection of vector will depend upon the specific transformation technique and the target species for transformation. For certain target species, different antibiotic or herbicide selection markers may be specific. Selection markers used routinely in transformation include the nptll gene, which confers resistance to kanamycin and related antibiotics (Messing & Vierra. Gene 19:
259-268 (1982); Bevan et al., Nature 304:184-187 (1983)), the bar gene, which confers resistance to the herbicide phosphinothricin (White et al., Nucl.
Acids Res 18: 1062 (1990), Spencer et al. Theor. Appl. Genet 79: 625-631 (1990)), the hph gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929-2931), and the dhfr gene, which confers resistance to methatrexate (Bourouis et al., EMBO J. 2(7):
1099-1104 (1983)), the EPSPS gene, which confers resistance to glyphosate (U.S. Patent Nos. 4,940,935 and 5,188,642), and the mannose-6-phosphate isomerase gene, which provides the ability to metabolize mannose (U.S.
Patent Nos. 5,767,378 and 5,994,629).
1. Vectors Suitable for Agrobacterium Transformation Many vectors are available for transformation using Agrobacterium tumefaciens. These typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, Nucl. Acids Res. (1984)). Below, the construction of two typical vectors suitable for Agrobacterium transformation is described.
a. pCIB200 and pCIB2001:
The binary vectors pCIB200 and pCIB2001 are used for the construction of recombinant vectors for use with Agrobacterium and are constructed in the following manner. pTJS75kan is created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. Bacteriol. 164: 446-455 (1985)) allowing excision of the tetracycline-resistance gene, followed by insertion of an Accl fragment from pUC4K carrying an NPTII (Messing & Vierra, Gene 19:
259-268 (1982): Bevan et al., Nature 304: 184-187 (1983): McBride et al., Plant Molecular Biology 14: 266-276 (1990)). Xhol linkers are ligated to the EcoRV fragment of PCIB7 which contains the left and right T-DNA borders, a plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et al., Gene 53: 153-161 (1987)), and the Xhol-digested fragment are cloned into Sall-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19). pCIB200 contains the following unique polylinker restriction sites:
EcoRl, Sstl, Kpnl, BgIlI, Xbal, and Sall. pCIB2001 is a derivative of pCIB200 created by the insertion into the polylinker of additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRl, Sstl, Kpnl, Bglll, Xbal, Sall, Mlul, Bcll, Avrll, Apal, Hpal, and Stul. pCIB2001, in addition to containing these unique restriction sites also has plant and bacterial kanamycin selection, left and right T-DNA borders for Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization between E. coli and other hosts, and the OriT and OriV functions also from RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing their own regulatory signals.
b. pCIB10 and Hygromycin Selection Derivatives thereof:
The binary vector pCIB10 contains a gene encoding kanamycin resistance for selection in plants and T-DNA right and left border sequences and incorporates sequences from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its construction is described by Rothstein et al. (Gene 53: 153-161 (1987)). Various derivatives of pCIB10 are constructed which incorporate the gene for hygromycin B
phosphotransferase described by Gritz et al. (Gene 25: 179-188 (1983)).
These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin (pCIB715, pCIB717).
2. Vectors Suitable for non-Agrobacterium Transformation Transformation without the use of Agrobacterium tumefaciens circumvents the requirement for T-DNA sequences in the chosen transformation vector and consequently vectors lacking these sequences can be utilized in addition to vectors such as the ones described above which contain T-DNA sequences. Transformation techniques that do not rely on Agrobacterium include transformation via particle bombardment, protoplast uptake (e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on the specific selection for the species being transformed. Below, the construction of typical vectors suitable for non-Agrobacterium transformation is described.
a. pCIB3064:
pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in combination with selection by the herbicide basta (or phosphinothricin). The plasmid pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV 35S
transcriptional terminator and is described in the PCT published application WO 93/07278. The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites are mutated using standard PCR techniques in such a way as to remove the ATGs and generate the restriction sites Sspl and Pvull. The new restriction sites are 96 and 37 bp away from the unique Sall site and 101 and 42 bp away from the actual start site. The resultant derivative of pCIB246 is designated pCIB3025. The GUS gene is then excised from pCIB3025 by digestion with Sall and Sacl, the termini rendered blunt and religated to generate plasmid pCIB3060. The plasmid pJIT82 is obtained from the John Innes Centre, Norwich and the a 400 bp Smal fragment containing the bar gene from Streptomyces viridochromogenes is excised and inserted into the Hpal site of pCIB3060 (Thompson et al. EMBO J
6: 2519-2523 (1987)). This generated pCIB3064, which comprises the bar gene under the control of the CaMV 35S promoter and terminator for herbicide selection, a gene for ampicillin resistance (for selection in E.
coli) and a polylinker with the unique sites Sphl, Pstl, Hindill, and BamHl. This vector is suitable for the cloning of plant expression cassettes containing their own regulatory signals.
b. pSOG19 and pSOG35:
pSOG35 is a transformation vector that utilizes the E. coli gene dihydrofolate reductase (DFR) as a selectable marker conferring resistance to methotrexate. PCR is used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adh1 gene (-550 bp) and 18 bp of the GUS untransiated leader sequence from pSOG10. A 250-bp fragment encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and these two PCR fragments are assembled with a Sacl-Pstl fragment from pB1221 (Clontech) which comprises the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these fragments generates pSOG19 which contains the 35S promoter in fusion with the intron 6 sequence, the GUS
leader, the DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generates the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have Hindill, Sphl, Pstl and EcoRl sites available for the cloning of foreign substances.
3. Vector Suitable for Chloroplast Transformation For expression of a nucleotide sequence of the present invention in plant plastids, plastid transformation vector pPH143 (WO 97/32011, example 36) is used. The nucleotide sequence is inserted into pPH143 thereby replacing the PROTOX coding sequence. This vector is then used for plastid transformation and selection of transformants for spectinomycin resistance.
Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. In this case, transformants are selected for resistance to PROTOX inhibitors.
D. Transformation Once a nucleic acid sequence of the invention has been cloned into an expression system, it is transformed into a plant cell. The receptor and target expression cassettes of the present invention can be introduced into the plant cell in a number of art-recognized ways. Methods for regeneration of plants are also well known in the art. For example, Ti plasmid vectors have been utilized for the delivery of foreign DNA, as well as direct DNA uptake, liposomes, electroporation, microinjection, and microprojectiles. In addition, bacteria from the genus Agrobacterium can be utilized to transform plant cells.
Below are descriptions of representative techniques for transforming both dicotyledonous and monocotyledonous plants, as well as a representative plastid transformation technique.
1. Transformation of Dicotyledons Transformation techniques for dicotyledons are well known in the art and include Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non-Agrobacterium techniques involve the uptake of exogenous genetic material directly by protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, particle bombardment-mediated delivery, or microinjection. Examples of these techniques are described by Paszkowski et al., EMBO J 3: 2717-2722 (1984), Potrykus et al., Mol. Gen. Genet. 199: 169-177 (1985), Reich et al., Biotechnology 4: 1001-1004 (1986), and Klein et al., Nature 327: 70-73 (1987). In each case the transformed cells are regenerated to whole plants using standard techniques known in the art.
Agrobacterium-mediated transformation is a specific technique for transformation of dicotyledons because of its high efficiency of transformation and its broad utility with many different species. Agrobacterium transformation typically involves the transfer of the binary vector carrying the foreign DNA of interest (e.g. pCIB200 or pCIB2001) to an appropriate Agrobacterium strain which may depend of the complement of vir genes carried by the host Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g. strain CIB542 for pCIB200 and pCIB2001 (Uknes et al.
Plant Cell 5: 159-169 (1993)). The transfer of the recombinant binary vector to Agrobacterium is accomplished by a triparental mating procedure using E.
coli carrying the recombinant binary vector, a helper E. coli strain which carries a plasmid such as pRK2013 and which is able to mobilize the recombinant binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary vector can be transferred to Agrobacterium by DNA
transformation (Hofgen & Willmitzer, Nucl. Acids Res. 16: 9877 (1988)).
Transformation of the target plant species by recombinant Agrobacterium usually involves co-cultivation of the Agrobacterium with explants from the plant and follows protocols well known in the art.
Transformed tissue is regenerated on selectable medium carrying the antibiotic or herbicide resistance marker present between the binary plasmid T-DNA borders.
Another approach to transforming plant cells with a gene involves propelling inert or biologically active particles at plant tissues and cells.
This technique is disclosed in U.S. Patent Nos. 4,945,050, 5,036,006, and 5,100,792 all to Sanford et al. Generally, this procedure involves propelling inert or biologically active particles at the cells under conditions effective to penetrate the outer surface of the cell and afford incorporation within the interior thereof. When inert particles are utilized, the vector can be introduced into the cell by coating the particles with the vector containing the desired gene. Alternatively, the target cell can be surrounded by the vector so that the vector is carried into the cell by the wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium or a bacteriophage, each containing DNA sought to be introduced) can also be propelled into plant cell tissue.
2. Transformation of Monocotyledons Transformation of most monocotyledon species has now also become routine. Specific techniques include direct gene transfer into protoplasts using PEG or electroporation techniques, and particle bombardment into callus tissue. Transformations can be undertaken with a single DNA species or multiple DNA species (i.e. co-transformation) and both these techniques are suitable for use with this invention. Co-transformation may have the advantage of avoiding complete vector construction and of generating transgenic plants with unlinked loci for the gene of interest and the selectable marker, enabling the removal of the selectable marker in subsequent generations, should this be regarded desirable. However, a disadvantage of the use of co-transformation is the less than 100% frequency with which separate DNA species are integrated into the genome (Schocher et al.
Biotechnology 4: 1093-1096 (1986)).
Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques for the preparation of callus and protoplasts from an elite inbred line of maize, transformation of protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed protoplasts. Gordon-Kamm et al. (Plant Cell 2: 603-618 (1990)) and Fromm et al. (Biotechnology 8: 833-839 (1990)) have published techniques for transformation of A188-derived maize line using particle bombardment.
Furthermore, WO 93/07278 and Koziel et al. (Biotechnology 11: 194-200 (1993)) describe techniques for the transformation of elite inbred lines of maize by particle bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistics device for bombardment.
Transformation of rice can also be undertaken by direct gene transfer techniques utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been described for Japonica-types and Indica-types (Zhang et al. Plant Cell Rep 7: 379-384 (1988); Shimamoto et al. Nature 338:
274-277 (1989); Datta et al. Biotechnology 8: 736-740 (1990)). Both types are also routinely transformable using particle bombardment (Christou et al.
Biotechnology 9: 957-962 (1991)). Furthermore, WO 93/21335 describes techniques for the transformation of rice via electroporation.
Patent Application EP 0 332 581 describes techniques for the generation, transformation and regeneration of Pooideae protoplasts. These techniques allow the transformation of Dactylis and wheat. Furthermore, wheat transformation has been described by Vasil et al. (Biotechnology 10:
667-674 (1992)) using particle bombardment into cells of type C long-term regenerable callus, and also by Vasil et al. (Biotechnology 11: 1553-1558 (1993)) and Weeks et al. (Plant Physiol. 102: 1077-1084 (1993)) using particle bombardment of immature embryos and immature embryo-derived callus. A
specific technique for wheat transformation, however, involves the transformation of wheat by particle bombardment of immature embryos and includes either a high sucrose or a high maltose step prior to gene delivery.
Prior to bombardment, any number of embryos (0.75-1 mm in length) are plated onto MS medium with 3% sucrose (Murashiga & Skoog, Physiologia Plantarum 15: 473-497 (1962)) and 3 mg/I 2,4-D for induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day of bombardment, embryos are removed from the induction medium and placed onto the osmoticum (i.e. induction medium with sucrose or maltose added at the desired concentration, typically 15%). The embryos are allowed to plasmolyze for 2-3 hours and are then bombarded. Twenty embryos per target plate is typical, although not critical. An appropriate gene-carrying plasmid (such as pCIB3064 or pSG35) is precipitated onto micrometer size gold particles using standard procedures. Each plate of embryos is shot with the DuPont Biolistics helium device using a burst pressure of -1000 psi using a standard 80 mesh screen. After bombardment, the embryos are placed back into the dark to recover for about 24 hours (still on osmoticum).
After 24 hrs, the embryos are removed from the osmoticum and placed back onto induction medium where they stay for about a month before regeneration. Approximately one month later the embryo explants with developing embryogenic callus are transferred to regeneration medium (MS +
1 mg/liter NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 mg/I basta in the case of pCIB3064 and 2 mg/I methotrexate in the case of pSOG35). After approximately one month, developed shoots are transferred to larger sterile containers known as "GA7s" which contain half-strength MS, 2% sucrose, and the same concentration of selection agent.
Tranformation of monocotyledons using Agrobacterium has also been described. See, WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein by reference. See also, Negrotto et al., Plant Cell Reports 19: 798-803 (2000), incorporated herein by reference. For this example, rice (Oryza sativa) is used for generating transgenic plants. Various rice cultivars can be used (Hiei et al., 1994, Plant Journal 6:271-282; Dong et al., 1996, Molecular Breeding 2:267-276; Hiei et al., 1997, Plant Molecular Biology, 35:205-218). Also, the various media constituents described below may be either varied in quantity or substituted. Embryogenic responses are initiated and/or cultures are established from mature embryos by culturing on MS-CIM medium (MS basal salts, 4.3 g/liter; B5 vitamins (200 x), 5 mI/liter;
Sucrose, 30 g/liter; proline, 500 mg/liter; glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2,4-D (1 mg/mI), 2 mI/liter; adjust pH to 5.8 with KOH; Phytagel, 3 g/Iiter). Either mature embryos at the initial stages of culture response or established culture lines are inoculated and co-cultivated with the Agrobacterium tumefaciens strain LBA4404 (Agrobacterium) containing the desired vector construction. Agrobacterium is cultured from glycerol stocks on solid YPC medium (100 mg/L spectinomycin and any other appropriate antibiotic) for -2 days at 28 oC. Agrobacterium is re-suspended in liquid MS-CIM medium. The Agrobacterium culture is diluted to an OD600 of 0.2-0.3 and acetosyringone is added to a final concentration of 200 uM.
Acetosyringone is added before mixing the solution with the rice cultures to induce Agrobacterium for DNA transfer to the plant cells. For inoculation, the plant cultures are immersed in the bacterial suspension. The liquid bacterial suspension is removed and the inoculated cultures are placed on co-cultivation medium and incubated at 22 C for two days. The cultures are then transferred to MS-CIM medium with Ticarcillin (400 mg/liter) to inhibit the growth of Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., In Vitro Cell. Dev. Biol.-Plant 37:127-132), cultures are transferred to selection medium containing Mannose as a carbohydrate source (MS with 2%Mannose, 300 mg/liter Ticarcillin) after 7 days, and cultured for 3-4 weeks in the dark. Resistant colonies are then transferred to regeneration induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 1 mg/liter zeatin, 200 mg/liter timentin 2% Mannose and 3% Sorbitol) and grown in the dark for 14 days. Proliferating colonies are then transferred to another round of regeneration induction media and moved to the light growth room.
Regenerated shoots are transferred to GA7 containers with GA7-1 medium (MS with no hormones and 2% Sorbitol) for 2 weeks and then moved to the greenhouse when they are large enough and have adequate roots. Plants are transplanted to soil in the greenhouse (TO generation) grown to maturity, and the T1 seed is harvested.
3. Transformation of Plastids Seeds of Nicotiana tabacum c.v. 'Xanthi nc' are germinated seven per plate in a 1" circular array on T agar medium and bombarded 12-14 days after sowing with 1 pm tungsten particles (M10, Biorad, Hercules, CA) coated with DNA from plasmids pPH143 and pPH145 essentially as described (Svab, Z.
and Maliga, P. (1993) PNAS 90, 913-917). Bombarded seedlings are incubated on T medium for two days after which leaves are excised and placed abaxial side up in bright light (350-500 pmol photons/m2/s) on plates of RMOP medium (Svab, Z., Hajdukiewicz, P. and Maliga, P. (1990) PNAS
87, 8526-8530) containing 500 pg/mi spectinomycin dihydrochloride (Sigma, St. Louis, MO). Resistant shoots appearing underneath the bleached leaves three to eight weeks after bombardment are subcloned onto the same selective medium, allowed to form callus, and secondary shoots isolated and subcloned. Complete segregation of transformed plastid genome copies (homoplasmicity) in independent subclones is assessed by standard techniques of Southern blotting (Sambrook et al., (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor).
BamHI/EcoRI-digested total cellular DNA (Mettler, I. J. (1987) Plant Mol Biol Reporter 5, 346-349) is separated on 1% Tris-borate (TBE) agarose gels, transferred to nylon membranes (Amersham) and probed with 32P-labeled random primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA fragment from pC8 containing a portion of the rps7/12 plastid targeting sequence. Homoplasmic shoots are rooted aseptically on spectinomycin-containing MS/IBA medium (McBride, K. E. et al. (1994) PNAS 91, 7301-7305) and transferred to the greenhouse.
V. Breeding and Seed Production A. Breeding The plants obtained via tranformation with a nucleic acid sequence of the present invention can be any of a wide variety of plant species, including those of monocots and dicots; however, the plants used in the method of the invention are specifically selected from the list of agronomically important target crops set forth supra. The expression of a gene of the present invention in combination with other characteristics important for production and quality can be incorporated into plant lines through breeding. Breeding approaches and techniques are known in the art. See, for example, Welsh J. R., Fundamentals of Plant Genetics and Breeding, John Wiley & Sons, NY
(1981); Crop Breeding, Wood D. R. (Ed.) American Society of Agronomy Madison, Wisconsin (1983); Mayo 0., The Theory of Plant Breeding, Second Edition, Clarendon Press, Oxford (1987); Singh, D.P., Breeding for Resistance to Diseases and Insect Pests, Springer-Verlag, NY (1986); and Wricke and Weber, Quantitative Genetics and Selection Plant Breeding, Walter de Gruyter and Co., Berlin (1986).
The genetic properties engineered into the transgenic seeds and plants described above are passed on by sexual reproduction or vegetative growth and can thus be maintained and propagated in progeny plants. Generally said maintenance and propagation make use of known agricultural methods developed to fit specific purposes such as tilling, sowing or harvesting.
Specialized processes such as hydroponics or greenhouse technologies can also be applied. As the growing crop is vulnerable to attack and damages caused by insects or infections as well as to competition by weed plants, measures are undertaken to control weeds, plant diseases, insects, nematodes, and other adverse conditions to improve yield. These include mechanical measures such a tillage of the soil or removal of weeds and infected plants, as well as the application of agrochemicals such as herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and insecticides.
Use of the advantageous genetic properties of the transgenic plants and seeds according to the invention can further be made in plant breeding, which aims at the development of plants with improved properties such as tolerance of pests, herbicides, or stress, improved nutritional value, increased yield, or improved structure causing less loss from lodging or shattering. The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate progeny plants. Depending on the desired properties, different breeding measures are taken. The relevant techniques are well known in the art and include but are not limited to hybridization, inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization techniques also include the sterilization of plants to yield male or female sterile plants by mechanical, chemical, or biochemical means. Cross pollination of a male sterile plant with pollen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines. Thus, the transgenic seeds and plants according to the invention can be used for the breeding of improved plant lines, that for example, increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow one to dispense with said methods due to their modified genetic properties. Alternatively new crops with improved stress tolerance can be obtained, which, due to their optimized genetic "equipment", yield harvested product of better quality than products that were not able to tolerate comparable adverse developmental conditions.
B. Seed Production In seed production, germination quality and uniformity of seeds are essential product characteristics. As it is difficult to keep a crop free from other crop and weed seeds, to control seedborne diseases, and to produce seed with good germination, fairly extensive and well-defined seed production practices have been developed by seed producers, who are experienced in the art of growing, conditioning and marketing of pure seed. Thus, it is common practice for the farmer to buy certified seed meeting specific quality standards instead of using seed harvested from his own crop. Propagation material to be used as seeds is customarily treated with a protectant coating comprising herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides, or mixtures thereof. Customarily used protectant coatings comprise compounds such as captan, carboxin, thiram (TMTD ), methalaxyl (Apron ), and pirimiphos-methyl (Actellic ). If desired, these compounds are formulated together with further carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation to provide protection against damage caused by bacterial, fungal or animal pests. The protectant coatings may be applied by impregnating propagation material with a liquid formulation or by coating with a combined wet or dry formulation. Other methods of application are also possible such as treatment directed at the buds or the fruit.
VI. Alteration of Expression of Nucleic Acid Molecules The alteration in expression of the nucleic acid molecules of the present invention is achieved in one of the following ways:
A. "Sense" Suppression Alteration of the expression of a nucleotide sequence of the present invention, specifically reduction of its expression, is obtained by "sense"
suppression (referenced in e.g. Jorgensen et al. (1996) Plant Mol. Biol. 31, 957-973). In this case, the entirety or a portion of a nucleotide sequence of the present invention is comprised in a DNA molecule. The DNA molecule is specifically operatively linked to a promoter functional in a cell comprising the target gene, specifically a plant cell, and introduced into the cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA molecule in the "sense orientation", meaning that the coding strand of the nucleotide sequence can be transcribed. In a specific embodiment, the nucleotide sequence is fully translatable and all the genetic information comprised in the nucleotide sequence, or portion thereof, is translated into a polypeptide. In another specific embodiment, the nucleotide sequence is partially translatable and a short peptide is translated. In a specific embodiment, this is achieved by inserting at least one premature stop codon in the nucleotide sequence, which bring translation to a halt. In another more specific embodiment, the nucleotide sequence is transcribed but no translation product is being made. This is usually achieved by removing the start codon, e.g. the "ATG", of the polypeptide encoded by the nucleotide sequence. In a further specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule.
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
B. "Anti-sense" Suppression In another specific embodiment, the alteration of the expression of a nucleotide sequence of the present invention, specifically the reduction of its expression is obtained by "anti-sense" suppression. The entirety or a portion of a nucleotide sequence of the present invention is comprised in a DNA
molecule. The DNA molecule is specifically operatively linked to a promoter functional in a plant cell, and introduced in a plant cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA
molecule in the "anti-sense orientation", meaning that the reverse complement (also called sometimes non-coding strand) of the nucleotide sequence can be transcribed. In a specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Green, P. J. et al., Ann. Rev.
Biochem. 55:569-597 (1986); van der Krol, A. R. et al, Antisense Nuc. Acids &
Proteins, pp. 125-141 (1991); Abel, P. P. et al., PNASroc. Natl. Acad. Sci.
USA 86:6949-6952 (1989); Ecker, J. R. et al., Proc. Natl. Acad. Sci. USANAS
83:5372-5376 (Aug. 1986)).
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
C. Homologous Recombination In another specific embodiment, at least one genomic copy corresponding to a nucleotide sequence of the present invention is modified in the genome of the plant by homologous recombination as further illustrated in Paszkowski et al., EMBO Journal 7:4021-26 (1988). This technique uses the property of homologous sequences to recognize each other and to exchange nucleotide sequences between each by a process known in the art as homologous recombination. Homologous recombination can occur between the chromosomal copy of a nucleotide sequence in a cell and an incoming copy of the nucleotide sequence introduced in the cell by transformation.
Specific modifications are thus accurately introduced in the chromosomal copy of the nucleotide sequence. In one embodiment, the regulatory elements of the nucleotide sequence of the present invention are modified. Such regulatory elements are easily obtainable by screening a genomic library using the nucleotide sequence of the present invention, or a portion thereof, as a probe. The existing regulatory elements are replaced by different regulatory elements, thus altering expression of the nucleotide sequence, or they are mutated or deleted, thus abolishing the expression of the nucleotide sequence. In another embodiment, the nucleotide sequence is modified by deletion of a part of the nucleotide sequence or the entire nucleotide sequence, or by mutation. Expression of a mutated polypeptide in a plant cell is also contemplated in the present invention. More recent refinements of this technique to disrupt endogenous plant genes have been described (Kempin et al., Nature 389:802-803 (1997) and Miao and Lam, Plant J., 7:359-365 (1995).
In another specific embodiment, a mutation in the chromosomal copy of a nucleotide sequence is introduced by transforming a cell with a chimeric oligonucleotide composed of a contiguous stretch of RNA and DNA residues in a duplex conformation with double hairpin caps on the ends. An additional feature of the oligonucleotide is for example the presence of 2'-O-methylation at the RNA residues. The RNA/DNA sequence is designed to align with the sequence of a chromosomal copy of a nucleotide sequence of the present invention and to contain the desired nucleotide change. For example, this technique is further illustrated in US patent 5,501,967 and Zhu et al. (1999) Proc. Natl. Acad. Sci. USA 96: 8768-8773.
D. Ribozymes In a further embodiment, the RNA coding for a polypeptide of the present invention is cleaved by a catalytic RNA, or ribozyme, specific for such RNA. The ribozyme is expressed in transgenic plants and results in reduced amounts of RNA coding for the polypeptide of the present invention in plant cells, thus leading to reduced amounts of polypeptide accumulated in the cells. This method is further illustrated in US patent 4,987,071.
E. Dominant-Negative Mutants In another specific embodiment, the activity of the polypeptide encoded by the nucleotide sequences of this invention is changed. This is achieved by expression of dominant negative mutants of the proteins in transgenic plants, leading to the loss of activity of the endogenous protein.
F. Aptamers In a further embodiment, the activity of polypeptide of the present invention is inhibited by expressing in transgenic plants nucleic acid ligands, so-called aptamers, which specifically bind to the protein. Aptamers are preferentially obtained by the SELEX (Systematic Evolution of Ligands by EXponential Enrichment) method. In the SELEX method, a candidate mixture of single stranded nucleic acids having regions of randomized sequence is contacted with the protein and those nucleic acids having an increased affinity to the target are partitioned from the remainder of the candidate mixture. The partitioned nucleic acids are amplified to yield a ligand enriched mixture.
After several iterations a nucleic acid with optimal affinity to the polypeptide is obtained and is used for expression in transgenic plants. This method is further illustrated in US patent 5,270,163.
G. Zinc finger proteins A zinc finger protein that binds a nucleotide sequence of the present invention or to its regulatory region is also used to alter expression of the nucleotide sequence. Specifically, transcription of the nucleotide sequence is reduced or increased. Zinc finger proteins are for example described in Beerli et al. (1998) PNAS 95:14628-14633., or in WO 95/19431, WO 98/54311, or WO 96/06166, all incorporated herein by reference in their entirety.
H. dsRNA
Alteration of the expression of a nucleotide sequence of the present invention is also obtained by dsRNA interference as described for example in WO 99/32619, WO 99/53050 or WO 99/61631, all incorporated herein by reference in their entirety. In another specific embodiment, the alteration of the expression of a nucleotide sequence of the present invention, specifically the reduction of its expression, is obtained by double-stranded RNA (dsRNA) interference. The entirety or, specifically a portion of a nucleotide sequence of the present invention is comprised in a DNA molecule. The size of the DNA molecule is specifically from 100 to 1000 nucleotides or more; the optimal size to be determined empirically. Two copies of the identical DNA
molecule are linked, separated by a spacer DNA molecule, such that the first and second copies are in opposite orientations. In the specific embodiment, the first copy of the DNA molecule is in the reverse complement (also known as the non-coding strand) and the second copy is the coding strand; in the most specific embodiment, the first copy is the coding strand, and the second copy is the reverse complement. The size of the spacer DNA molecule is specifically 200 to 10,000 nucleotides, more specifically 400 to 5000 nucleotides and most specifically 600 to 1500 nucleotides in length. The spacer is specifically a random piece of DNA, more specifically a random piece of DNA without homology to the target organism for dsRNA
interference, and most specifically a functional intron which is effectively spliced by the target organism. The two copies of the DNA molecule separated by the spacer are operatively linked to a promoter functional in a plant cell, and introduced in a plant cell, in which the nucleotide sequence is expressible. In a specific embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another specific embodiment the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Waterhouse et al. (1998) PNAS
95:13959-13964; Chuang and Meyerowitz (2000) PNAS 97:4985-4990; Smith et al. (2000) Nature 407:319-320). Alteration of the expression of a nucleotide sequence by dsRNA interference is also described in, for example WO
99/32619, WO 99/53050 or WO 99/61631, all incorporated herein by reference in their entirety.
In transgenic plants containing one of the DNA molecules described immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is specifically reduced. Specifically, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, more specifically it is at least 80% identical, yet more specifically at least 90% identical, yet more specifically at least 95% identical, yet more specifically at least 99% identical.
1. Insertion of a DNA molecule (Insertional mutagenesis) In another specific embodiment, a DNA molecule is inserted into a chromosomal copy of a nucleotide sequence of the present invention, or into a regulatory region thereof. Specifically, such DNA molecule comprises a transposable element capable of transposition in a plant cell, such as e.g.
Ac/Ds, Em/Spm, mutator. Alternatively, the DNA molecule comprises a T-DNA border of an Agrobacterium T-DNA. The DNA molecule may also comprise a recombinase or integrase recognition site which can be used to remove part of the DNA molecule from the chromosome of the plant cell.
Methods of insertional mutagenesis using T-DNA, transposons, oligonucleotides or other methods known to those skilled in the art are also encompassed. Methods of using T-DNA and transposon for insertional mutagenesis are described in Winkler et al. (1989) Methods Mol. Biol. 82:129-136 and Martienssen (1998) PNAS 95:2021-2026, incorporated herein by reference in their entireties.
J. Deletion mutagenesis In yet another embodiment, a mutation of a nucleic acid molecule of the present invention is created in the genomic copy of the sequence in the cell or plant by deletion of a portion of the nucleotide sequence or regulator sequence. Methods of deletion mutagenesis are known to those skilled in the art. See, for example, Miao et al, (1995) Plant J. 7:359.
In yet another embodiment, this deletion is created at random in a large population of plants by chemical mutagenesis or irradiation and a plant with a deletion in a gene of the present invention is isolated by forward or reverse genetics. Irradiation with fast neutrons or gamma rays is known to cause deletion mutations in plants (Silverstone et al, (1998) Plant Cell, 10:155-169;
Bruggemann et al., (1996) Plant J., 10:755-760; Redei and Koncz in Methods in Arabidopsis Research, World Scientific Press (1992), pp. 16-82). Deletion mutations in a gene of the present invention can be recovered in a reverse genetics strategy using PCR with pooled sets of genomic DNAs as has been shown in C. elegans (Liu et al., (1999), Genome Research, 9:859-867.). A
forward genetics strategy would involve mutagenesis of a line displaying PTGS followed by screening the M2 progeny for the absence of PTGS.
Among these mutants would be expected to be some that disrupt a gene of the present invention. This could be assessed by Southern blot or PCR for a gene of the present invention with genomic DNA from these mutants.
K. Overexpression in a plant cell In yet another specific embodiment, a nucleotide sequence of the present invention encoding a polypeptide is over-expressed. Examples of nucleic acid molecules and expression cassettes for over-expression of a nucleic acid molecule of the present invention are described above. Methods known to those skilled in the art of over-expression of nucleic acid molecules are also encompassed by the present invention.
In a specific embodiment, the expression of the nucleotide sequence of the present invention is altered in every cell of a plant. This is for example obtained though homologous recombination or by insertion in the chromosome. This is also for example obtained by expressing a sense or antisense RNA, zinc finger protein or ribozyme under the control of a promoter capable of expressing the sense or antisense RNA, zinc finger protein or ribozyme in every cell of a plant. Constitutive expression, inducible, tissue-specific or developmentally-regulated expression are also within the scope of the present invention and result in a constitutive, inducible, tissue-specific or developmentally-regulated alteration of the expression of a nucleotide sequence of the present invention in the plant cell. Constructs for expression of the sense or antisense RNA, zinc finger protein or ribozyme, or for over-expression of a nucleotide sequence of the present invention, are prepared and transformed into a plant cell according to the teachings of the present invention, e.g. as described infra.
VII. Polypeptides The present invention further relates to isolated polypeptides comprising the amino acid sequence of SEQ ID NO:2. In particular, isolated polypeptides comprising the amino acid sequence of SEQ ID NO:2, and variants having conservative amino acid modifications. One skilled in the art will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide or protein sequence which alters, adds or deletes a single amino acid or a small percent of amino acids in the encoded sequence is a "conservative modification" where the modification results in the substitution of an amino acid with a chemically similar amino acid.
Conservative modified variants provide similar biological activity as the unmodified polypeptide. Conservative substitution tables listing functionally similar amino acids are known in the art. See Crighton (1984) Proteins, W.H.
Freeman and Company.
In a specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence of SEQ ID NO:2, or exon or domain thereof, is an allelic variant of the polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof, is a naturally occurring variant of the polypeptide sequence listed SEQ ID NO:2.
In another specific embodiment, a polypeptide having substantial similarity to a polypeptide sequence listed SEQ ID NO:2, or exon or domain thereof, is a polymorphic variant of the polypeptide sequence listed in SEQ ID NO:2.
In an alternate specific embodiment, the sequence having substantial similarity contains a deletion or insertion of at least one amino acid. In a more specific embodiment, the deletion or insertion is of less than about ten amino acids. In a most specific embodiment, the deletion or insertion is of less than about three amino acids.
In a specific embodiment, the sequence having substantial similarity encodes a substitution in at least one amino acid.
Embodiments of the present invention also contemplate an isolated polypeptide containing a polypeptide sequence including (a) a polypeptide sequence listed in SEQ ID NO:2, or exon or domain thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
In another specific embodiment, the polypeptide having substantial similarity is an allelic variant of a polypeptide sequence listed in SEQ ID
NO:2, or a fragment, domain, repeat or chimeras thereof. In another specific embodiment, the isolated nucleic acid includes a plurality of regions from the polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or fragment or domain thereof, or a sequence complementary thereto.
In another specific embodiment, the polypeptide is a polypeptide sequence listed in SEQ ID NO:2. In another specific embodiment, the polypeptide is a functional fragment or domain. In yet another specific embodiment, the polypeptide is a chimera, where the chimera may include functional protein domains, including domains, repeats, post-translational modification sites, or other features. In a more specific embodiment, the polypeptide is a plant polypeptide. In a more specific embodiment, the plant is a dicot. In a more specific embodiment, the plant is a gymnosperm. In a more specific embodiment, the plant is a monocot. In a more specific embodiment, the monocot is a cereal. In a more specific embodiment, the cereal may be, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum, and teosinte. In another specific embodiment, the cereal is rice.
In a specific embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In a more specific embodiment, the location or tissue is for example, but not limited to, epidermis, vascular tissue, meristem, cambium, cortex or pith. In a most specific embodiment, the location or tissue is leaf or sheath, root, flower, and developing ovule or seed. In a more specific embodiment, the location or tissue may be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, and flower. In a more specific embodiment, the location or tissue is a seed.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1 or a fragment or domain thereof or a sequence complementary thereto, includes a deletion or insertion of at least one nucleotide. In a more specific embodiment, the deletion or insertion is of less than about thirty nucleotides. In a most specific embodiment, the deletion or insertion is of less than about five nucleotides.
In a specific embodiment, the polypeptide sequence encoded by a nucleotide sequence having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or fragment or domain thereof or a sequence complementary thereto, includes a substitution of at least one codon. In a more specific embodiment, the substitution is conservative.
In a specific embodiment, the polypeptide sequences having substantial similarity to the polypeptide sequence listed in SEQ ID NO:2, or a fragment, domain, repeat or chimeras thereof includes a deletion or insertion of at least one amino acid.
The polypeptides of the invention, fragments thereof or variants thereof can comprise any number of contiguous amino acid residues from a polypeptide of the invention, wherein the number of residues is selected from the group of integers consisting of from 10 to the number of residues in a full-length polypeptide of the invention. Specifically, the portion or fragment of the polypeptide is a functional protein. The present invention includes active polypeptides having specific activity of at least 20%, 30%, or 40%, and specifically at least 505, 60%, or 70%, and most specifically at least 805, 90%
or 95% that of the native (non-synthetic) endogenous polypeptide. Further, the substrate specificity (kcat/Km) is optionally substantially similar to the native (non-synthetic), endogenous polypeptide. Typically the Km will be at least 30%, 40%, or 50% of the native, endogenous polypeptide; and more specifically at least 605, 70%, 80%, or 90%. Methods of assaying and quantifying measures of activity and substrate specificity are well known to those of skill in the art.
The isolated polypeptides of the present invention will elicit production of an antibody specifically reactive to a polypeptide of the present invention when presented as an immunogen. Therefore, the polypeptides of the present invention can be employed as immunogens for constructing antibodies immunoreactive to a protein of the present invention for such purposes, but not limited to, immunoassays or protein purification techniques.
Immunoassays for determining binding are well known to those of skill in the art such as, but not limited to, ELISAs or competitive immunoassays.
Embodiments of the present invention also relate to chimeric polypeptides encoded by the isolated nucleic acid molecules of the present disclosure including a chimeric polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence including:
(a) a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof;
(b) a nucleotide sequence having substantial similarity to (a);
(c) a nucleotide sequence capable of hybridizing to (a);
(d) a nucleotide sequence complementary to (a), (b) or (c); and (e) a nucleotide sequence which is the reverse complement of (a), (b) or (c); or (f) a functional fragment thereof.
A polypeptide containing a polypeptide sequence encoded by an isolated nucleic acid containing a nucleotide sequence, its complement, or its reverse complement, encoding a polypeptide including a polypeptide sequence including:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a domain, repeat or chimeras thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or an exon or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; and a functional fragment of (a), (b), (c) or (d);
or (e) a functional fragment thereof.
The isolated nucleic acid molecules of the present invention are useful for expressing a polypeptide of the present invention in a recombinantly engineered cell such as a bacteria, yeast, insect, mammalian or plant cell.
The cells produce the polypeptide in a non-natural condition (e.g. in quantity, composition, location and/or time) because they have been genetically altered to do so. Those skilled in the art are knowledgeable in the numerous expression systems available for expression of nucleic acids encoding a protein of the present invention, and will not be described in detail below.
Briefly, the expression of isolated nucleic acids encoding a polypeptide of the invention will typically be achieved, for example, by operably linking the nucleic acid or cDNA to a promoter (constitutive or regulatable) followed by incorporation into an expression vector. The vectors are suitable for replication and/or integration in either prokaryotes or eukaryotes. Commonly used expression vectors comprise transcription and translation terminators, initiation sequences and promoters for regulation of the expression of the nucleic acid molecule encoding the polypeptide. To obtain high levels of expression of the cloned nucleic acid molecule, it is desirable to use expression vectors comprising a strong promoter to direct transcription, a ribosome binding site for translation initiation, and a transcription/translation terminator. One skilled in the art will recognize that modifications may be made to the polypeptide of the present invention without diminishing its biological activity. Some modifications may be made to facilitate the cloning, expression or incorporation of the polypeptide of the invention into a fusion protein. Such modification are well known in the art and include, but are not limited to, a methionine added at the amino terminus to provide an initiation site, or additional amino acids (e.g. poly Histadine) placed on either terminus to create conveniently located purification sequences. Restriction sites or termination codons can also be introduced into the vector.
In a specific embodiment, the expression vector includes one or more elements such as, for example, but not limited to, a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, or an affinity purification-tag encoding sequence. In a more specific embodiment, the promoter-enhancer sequence may be, for example, the CaMV 35S promoter, the CaMV 19S promoter, the tobacco PR-la promoter, the ubiquitin promoter, and the phaseolin promoter. In another embodiment, the promoter is operable in plants, and more specifically, a constitutive or inducible promoter. In another specific embodiment, the selection marker sequence encodes an antibiotic resistance gene. In another specific embodiment, the epitope-tag sequence encodes V5, the peptide Phe-His-His-Thr-Thr, hemagglutinin, or glutathione-S-transferase. In another specific embodiment the affinity purification-tag sequence encodes a polyamino acid sequence or a polypeptide. In a more specific embodiment, the polyamino acid sequence is polyhistidine. In a more specific embodiment, the polypeptide is chitin binding domain or glutathione-S-transferase. In a more specific embodiment, the affinity purification-tag sequence comprises an intein encoding sequence.
Prokaryotic cells may be used a host cells, for example, but not limited to, Escherichia coli, and other microbial strains known to those in the art.
Methods for expressing proteins in prokaryotic cells are well known to those in the art and can be found in many laboratory manuals such as Molecular Cloning: A Laboratory Manual, by J. Sambrook et al. (1989, Cold Spring Harbor Laboratory Press). A variety of promoters, ribosome binding sites, and operators to control expression are available to those skilled in the art, as are selectable markers such as antibiotic resistance genes. The type of vector chosen is to allow for optimal growth and expression in the selected cell type.
A variety of eukaryotic expression systems are available such as, but not limited to, yeast, insect cell lines, plant cells and mammalian cells.
Expression and synthesis of heterologous proteins in yeast is well known (see Sherman et al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory Press, 1982). Commonly used yeast strains widely used for production of eukaryotic proteins are Saccharomyces cerevisiae and Pichia pastoris, and vectors, strains and protocols for expression are available from commercial suppliers (e.g., Invitrogen).
Mammalian cell systems may be transfected with expression vectors for production of proteins. Many suitable host cell lines are available to those in the art, such as, but not limited to the HEK293, BHK21 and CHO cells lines.
Expression vectors for these cells can include expression control sequences such as an origin of replication, a promoter, (e.g., the CMV promoter, a HSV
tk promoter or phosphoglycerate kinase (pgk) promoter), an enhancer, and protein processing sites such as ribosome binding sites, RNA splice sites, polyadenylation sites, and transcription terminator sequences. Other animal cell lines useful for the production of proteins are available commercially or from depositories such as the American Type Culture Collection.
Expression vectors for expressing proteins in insect cells are usually derived from the SF9 baculovirus or other viruses known in the art. A number of suitable insect cell lines are available including but not limited to, mosquito larvae, silkworm, armyworm, moth and Drosophila cell lines.
Methods of transfecting animal and lower eukaryotic cells are known.
Numerous methods are used to make eukaryotic cells competent to introduce DNA such as but not limited to: calcium phosphate precipitation, fusion of the recipient cell with bacterial protoplasts containing the DNA, treatment of the recipient cells with liposomes containing the DNA, DEAE dextrin, electroporation, biolistics, and microinjection of the DNA directly into the cells.
Transfected cells are cultured using means well known in the art (see, Kuchler, R.J., Biochemical Methods in Cell Culture and Virology, Dowden, Hutchinson and Ross, Inc. 1997).
Once a polypeptide of the present invention is expressed it may be isolated and purified from the cells using methods known to those skilled in the art. The purification process may be monitored using Western blot techniques or radioimmunoassay or other standard immunoassay techniques.
Protein purification techniques are commonly known and used by those in the art (see R. Scopes, Protein Purification: Principles and Practice, Springer-Verlag, New York 1982: Deutscher, Guide to Protein Purification, Academic Press (1990). Embodiments of the present invention provide a method of producing a recombinant protein in which the expression vector includes one or more elements including a promoter-enhancer sequence, a selection marker sequence, an origin of replication, an epitope-tag encoding sequence, and an affinity purification-tag encoding sequence. In one specific embodiment, the nucleic acid construct includes an epitope-tag encoding sequence and the isolating step includes use of an antibody specific for the epitope-tag. In another specific embodiment, the nucleic acid construct contains a polyamino acid encoding sequence and the isolating step includes use of a resin comprising a polyamino acid binding substance, specifically where the polyamino acid is polyhistidine and the polyamino binding resin is nickel-charged agarose resin. In yet another specific embodiment, the nucleic acid construct contains a polypeptide encoding sequence and the isolating step includes the use of a resin containing a polypeptide binding substance, specifically where the polypeptide is a chitin binding domain and the resin contains chitin-sepharose.
The polypeptides of the present invention cam be synthesized using non-cellular synthetic methods known to those in the art. Techniques for solid phase synthesis are described by Barany and Mayfield, Solid-Phase Peptide Synthesis, pp. 3-284 in the Peptides: Analysis, Synthesis, Biology, Vol.2, Special Methods in Peptide Synthesis, Part A; Merrifield, et al., J. Am. Chem.
Soc. 85:2149-56 (1963) and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, IL (1984).
The present invention further provides a method for modifying (i.e.
increasing or decreasing) the concentration or composition of the polypeptides of the invention in a plant or part thereof. Modification can be effected by increasing or decreasing the concentration and/or the composition (i.e. the ratio of the polypeptides of the present invention) in a plant. The method comprised introducing into a plant cell with an expression cassette comprising a nucleic acid molecule of the present invention, or an nucleic acid encoding a OsGATAl 1 sequence as described above to obtain a transformed plant cell or tissue, culturing the transformed plant cell or tissue. The nucleic acid molecule can be under the regulation of a constitutive or inducible promoter. The method can further comprise inducing or repressing expression of a nucleic acid molecule of a sequence in the plant for a time sufficient to modify the concentration and/or composition in the plant or plant part.
A plant or plant part having modified expression of a nucleic acid molecule of the invention can be analyzed and selected using methods known to those skilled in the art such as, but not limited to, Southern blot, DNA
sequencing, or PCR analysis using primers specific to the nucleic acid molecule and detecting amplicons produced therefrom.
In general, concentration or composition in increased or decreased by at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90% relative to a native control plant, plant part or cell lacking the expression cassette.
Sugars are central regulators of many vital processes in photosynthetic plants, such as photosynthesis, carbon and nitrogen metabolism and this regulation is achieved by regulating gene expression, either activate or repress genes involved. The mechanisms by which sugars control gene expression are not understood well. This GATA transcription factor disclosed here is involved in regulating sugar sensing and the expression of the factor itself is influenced by the change of the N status. Increased expression of this gene can produce plants with increased yield, particularly as the manipulation of sugar signaling pathways can lead to increased photosynthesis and increased nitrogen utilization and alter source-sink relationships in seeds, tubes, roots and other storage organs.
The invention will be further described by reference to the following detailed examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified.
EXAMPLES
Standard recombinant DNA and molecular cloning techniques used here are well known in the art and are described by J. Sambrook, et al., Molecular Cloning: A Laboratory Manual, 3d Ed., Cold Spring Harbor, NY:
Cold Spring Harbor Laboratory Press (2001); by T.J. Silhavy, M.L. Berman, and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984) and by Ausubel, F.M. et al., Current Protocols in Molecular Biology, New York, John Wiley and Sons Inc., (1988), Reiter, et al., Methods in Arabidopsis Research, World Scientific Press (1992), and Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publishers (1998).
EXPERIMENTAL BACKGROUND AND PROCEDURES
A. Determining rice and maize growth conditions under limiting nitrogen conditions In past experiments to study genes involved in nitrate uptake and assimilation, the present inventors and others have utilized growth conditions in which nitrate was generally either present in excess or absent in its entirety.
In the latter case, nitrate is typically added to plants grown in its absence in order to understand nitrate regulation of these and other genes. While this type of extreme treatment is useful in defining some aspects of gene regulation, it is not suitable to gain a better understanding of the effect of nitrogen limitation. The inventors have defined conditions for Arabidopsis in which nitrogen limits growth. This involved developing a system using Rockwool (Hirai et al., 1995 Plant Cell Physiol 36, 1331-1339) and defining three conditions: one where growth is maximal; one where nitrogen limits growth to 70-75% maximal growth levels; one where there is a more severe limitation to 30-35% maximal growth levels. The nitrogen limitation acts as a 'stress' with the amount of 'stress' easily varied by altering the concentration of nitrate. The inventors assay the physiological "nitrogen status" by measuring nitrate, chlorophyll (which is often used as a reflection of nitrogen status under field conditions- see, e.g., Fox RH et al 2001 Agron J. 93, 590-597; Minotti PL et al 1994 Hort Science 29, 1497-1550), amino acid levels, and nitrate reductase and glutamine synthetase activities in order to give a baseline in which to assess studies on mutant lines.
B. Expression profiling experiments on Arabidopsis plants under nitrogen limitation Transcript expression profiling can be used to test RNA levels of large numbers of genes at the same time. Large numbers of these types of experiments have been done in the past, and if the experimental system is amenable, these can be used to pinpoint the "expression status" of an organism under different conditions and to use this information to make hypotheses on what genes and pathways are involved in various processes.
The inventors found that the more profound the difference in growth conditions, the larger the differences in transcript profiles between the plants grown under these conditions and the more difficult it was to decipher which changes were most important. The only published whole genome profiling experiment in this area is one in Arabidopsis where an extreme change in nitrate levels was studied (Wang R et al 2003 Plant Physiol. 132, 556-67). In the case of nitrogen limitations, the inventors studied the effect of growing plants under chronic nitrogen stress as well as changes in the level of available nitrogen. The inventors have already determined the impact on growth of different nitrogen levels in Arabidopsis.
The effect of different nitrogen levels on the transcript profiles was studied: where nitrogen does not limit growth. For Arabidopsis the inventors collected 4-week old shoots grown under the different nitrogen regimes.
Three different samples were collected (biological triplicates) in order to get statistically significant results. The transcript profiling was done using Arabidopsis GeneChip whole genome array (Affymetrix) to study the transcript levels in Arabidopsis. The bioinformatic analysis necessary to study the considerable data produced by these experiments was performed. By studying the effect of nitrogen limitation on the expression patterns, the inventors can pinpoint which pathways are involved in their response to nutrient stress Materials and methods Plant growth conditions Peat moss and vermiculate (1:4) (SunGro Horticulture Canada Ltd. BC, Canada) was used to grow Oryza sativa Kaybonnet plants, adding nutrient solution with different amount of nitrate once a week till harvest. The nutrient solution contains 4 mM MgSO4, 5 mM KCI, 5 mM CaC12, 1 mM KH2P04, 0.1 mM Fe-EDTA, 0.5 mM MES (pH6.0), 9 p M MnSO4, 0.7 pM Zn SO4, 0.3 pM
CuSOa, 46 pM NaB4O7 and 0.2 pM (NH4) 6Mo7O2. For limiting N condition, 3mM N solution was used once a week till harvest. For sufficient N condition, 10mM N solution was used once a week for the first six week, changed to 5mM for another 6 weeks, and the changed to 3mM N solution till harvest.
Plants were grown in a growth room with 16 hr light (-400 pmolm-zs"') at 28-30 C and 8 hr dark at 22-24 C for the first four weeks and then had one week short-day treatment (10 hr light/14 hr dark). After that, plants were moved to greenhouse to grow till harvest.
Generating transgenic rice plants The constructs for over-expressing or silencing OsGATAII were made. T1 transgenic seeds over-expressing OsGATAl1, and silencing OsGATAl 1 (RNAi) were analyzed.
Genotyping transgenic plants Leaf samples were grounded in 300 NI buffer (Strategic Diagnostics Inc. Part # 7000006). One dipstick (Strategic Diagnostics Inc. Part # 7000052) was inserted into the tube and left for -15 minutes by which time the lines on the sticks were clear. The appearance of one red line (control) on the strip indicates a negative result. The appearance of two red lines (control and test) on the strip indicates a positive result.
Expression analysis by semi-quantitative RT-PCR
One pg total RNA extracted was used to make cDNA. Primers for OsGATAl1 are 5'- CGTCGAGCACCAAGGGCAAATC-3' (SEQ ID NO:3) and 5'- GGATAGGGTCATGAGCAGCATGG-3' (SEQ ID NO:4). Primers for OsTubulin are: 5'- AGGAGGATGCCGCTAACAACTTTG-3' (SEQ ID NO:5) and 5'- AAACAGCATTGGTGATTTCAGGC-3' (SEQ ID NO:6).
Chlorophyll measurement Total chlorophyll was measured either using the Minolta SPAD 502DL
chlorophyll meter (Tokyo, Japan), or extracted by ethanol and measured by spectrophotometer according to Kirk (1968).
Results Strategy to phenotype transgenic plants The strategy for initial genetic and phenotypic analysis involved growing 5 transgenic events from each construct under mainly limiting nitrogen (N) condition (- 18 plants). Also some plants were grown under sufficient N condition (- 10 plants). PMI sticks were used for genotyping to detect the selectable marker PMI. Transgene expression levels were tested by semi-quantitative RT-PCR. Chlorophyll level, culm length, tiller number, panicle number, flowering time, seed yield and shoot biomass was recorded.
Phenotypes of the OsGATAl1 over-expression plants The OsGATAl1 gene shares - 34% similarity at protein level with the AtGATA gene (At4g26150, Figure 3). Total chlorophyll levels were measured when the transgenic plants were about 4-wk-old under limiting N condition. At least two transgenic events (event 5 and 6) had significant higher chlorophyll content from the average of PMI positive plants (3-6 plants) compared to wild type control plants (6 plants) (Figure 4A). Those transgenic plants did have elevated expression of the OsGATAII gene (Figure 4B). To ensure that chlorophyll level can be affected by the expression levels of the OsGATAl1 gene, the transgenic RNAi OsGATAl 1 plants were analyzed. The expression level of the OsGATA11 gene was significantly reduced in the transgenic RNAi OsGATAl 1 plants (Figure 5A), and indeed, chlorophyll level was significantly lower in those plants (Figure 5B). One event (event 6) had - 20% higher seed yield from the average of 10 PMI positive plants compared to the average of 11 wild type control plants under limiting N condition (Figure 6A). This same event had almost doubled seed yield from the average of 4 PMI positive plants compared to the average of 6 wild type control plants under sufficient N
condition (Figure 6B). Also, plants grown under high N experienced stress after being transferred from the growth room to the greenhouse and the transgenic plants responded much better to the stress (Figure 7).
Having now described particular embodiments of the invention by way of the foregoing examples, which are not intended to be limiting, the invention will now be further set forth in the following claims. Those skilled in the art will recognize that the claims also permit for the inclusion of equivalents beyond the claims' literal scope.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: UNIVERSITY OF GUELPH
(ii) TITLE OF INVENTION: NITROGEN-REGULATED SUGAR SENSING GENE
AND PROTEIN MODULATION THEREOF
(iii) NUMBER OF SEQUENCES: 7 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: BERESKIN & PARR
(B) STREET: 40 King Street West (C) CITY: Toronto (D) STATE: Ontario (E) COUNTRY: Canada (F) ZIP: M5H 3Y2 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: iMAC - Using Virtual PC
(C) OPERATING SYSTEM: Windows 198 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: GRAVELLE, MICHELINE
(C) REFERENCE/DOCKET NUMBER: 6580-346 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (416) 364-7311 (B) TELEFAX: (416) 361-1398 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1343 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Rice (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
gaacttctct cccatctctt tcctcctcct cctctctgat atgtctacta tctacatgag 60 ccagctacct gctactctcc ctctaatgga gggggatcag gatcaggggc tctacccagc 120 cttccataga gcaaaggacc ctcctatctt gttccctttc atgatcgaca gcgccgtcga 180 gcaccaaggg caaatctatg gagatcaggg cttgaggagg cagcaggttt tgggtgaatc 240 caatcaacag ttcaatgatc acatgatgat gggcggatca gatgtcttcc tcacaccgtc 300 tccgttccga ccaaccatcc aaagcatcgg cagcgacatg atccagcgat catcttatga 360 tccatacgat atcgagagta acaacaagca gcatgccaat ggatcaacca gcaagtggat 420 gtcgacgccg ccaatgaaga tgaggatcat aaggaagggg gcggcaaccg atcctgaggg 480 cggggcggtg agaaagccaa ggagaagagc acaagcgcac caggatgaga gccagcaaca 540 actgcagcaa gctttgggtg tcgttagagt gtgctcggac tgcaacacca ccaagacccc 600 cttgtggaga agtggtcctt gtggccccaa gtccctttgc aacgcgtgtg gcatcaggca 660 aaggaaggcg cggcgggcga tggccgctgc tgccaacggc ggagcggcgg tggcgccggc 720 aaagagcgtg gccgcggcgc cggtgaacaa taagccggcg gcgaagaagg agaagagggc 780 ggcggacgtc gaccggtcgc tgccgttcaa gaaacggtgc aagatggtcg atcacgttgc 840 tgctgccgtc gctgccacca agcccacggc tgctggagaa gtagtggccg ccgctccgaa 900 ggaccaagat cacgtcatcg tcgtcggtgg cgagaacgcc gccgccacct ccatgccggc 960 acagaacccg atatccaagg cggcggcgac cgccgctgcc gccgccgcct ctccggcgtt 1020 cttccacggc ctccctcgcg acgagatcac cgacgccgcc atgctgctca tgaccctatc 1080 ctgtggcctc gtccacagct agctagctag ctgatcaaaa ctagctagct actagtaccg 1140 ttaatttgat gagggcaaca accagagtac tatgtaccac tactagcaat attttgtgtg 1200 tgccttgtga tcttttgttg ttttgtgttg ttgaggagat cactagatca ggatgaagga 1260 gagatagtga tcacatgtct aaggacgaaa taaacgagaa caaactcgct agctagctac 1320 tagccgggat caggattata ttt 1343 (2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 353 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Rice (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Ser Thr Ile Tyr Met Ser Gln Leu Pro Ala Thr Leu Pro Leu Met Glu Gly Asp Gln Asp Gln Gly Leu Tyr Pro Ala Phe His Arg Ala Lys Asp Pro Pro Ile Leu Phe Pro Phe Met Ile Asp Ser Ala Val Glu His Gln Gly Gln Ile Tyr Gly Asp Gln Gly Leu Arg Arg Gln Gln Val Leu Gly Glu Ser Asn Gln Gln Phe Asn Asp His Met Met Met Gly Gly Ser Asp Val Phe Leu Thr Pro Ser Pro Phe Arg Pro Thr Ile Gln Ser Ile Gly Ser Asp Met Ile Gln Arg Ser Ser Tyr Asp Pro Tyr Asp Ile Glu Ser Asn Asn Lys Gln His Ala Asn Gly Ser Thr Ser Lys Trp Met Ser Thr Pro Pro Met Lys Met Arg Ile Ile Arg Lys Gly Ala Ala Thr Asp Pro Glu Gly Gly Ala Val Arg Lys Pro Arg Arg Arg Ala Gln Ala His Gln Asp Glu Ser Gln Gln Gln Leu Gln Gln Ala Leu Gly Val Val Arg Val Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Met Ala Ala Ala Ala Asn Gly Gly Ala Ala Val Ala Pro Ala Lys Ser Val Ala Ala Ala Pro Val Asn Asn Lys Pro Ala Ala Lys Lys Glu Lys Arg Ala Ala Asp Val Asp Arg Ser Leu Pro Phe Lys Lys Arg Cys Lys Met Val Asp His Val Ala Ala Ala Val Ala Ala Thr Lys Pro Thr Ala Ala Gly Glu Val Val Ala Ala Ala Pro Lys Asp Gln Asp His Val Ile Val Val Gly Gly Glu Asn Ala Ala Ala Thr Ser Met Pro Ala Gln Asn Pro Ile Ser Lys Ala Ala Ala Thr Ala Ala Ala Ala Ala Ala Ser Pro Ala Phe Phe His Gly Leu Pro Arg Asp Glu Ile Thr Asp Ala Ala Met Leu Leu Met Thr Leu Ser Cys Gly Leu Val His Ser (2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
cgtcgagcac caagggcaaa tc 22 (2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
ggatagggtc atgagcagca tgg 23 (2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
aggaggatgc cgctaacaac tttg 24 (2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence Description: Primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
aaacagcatt ggtgatttca ggc 23 (2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 352 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Arabidopsis thaliana (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
Met Gly Ser Asn Phe His Tyr Thr Ile Asp Leu Asn Glu Asp Gln Asn His Gln Pro Phe Phe Ala Ser Leu Gly Ser Ser Leu His His His Leu Gln Gin Gln Gln Gln Gln Gln Gln His Phe His His Gln Ala Ser Ser Asn Pro Ser Ser Leu Met Ser Pro Ser Leu Ser Tyr Phe Pro Phe Leu Ile Asn Ser Arg Gln Asp Gln Val Tyr Val Gly Tyr Asn Asn Asn Thr Phe His Asp Val Leu Asp Thr His Ile Ser Gln Pro Leu Glu Thr Lys Asn Phe Val Ser Asp Gly Gly Ser Ser Ser Ser Asp Gln Met Val Pro Lys Lys Glu Thr Arg Leu Lys Leu Thr Ile Lys Lys Lys Asp Asn His Gln Asp Gln Thr Asp Leu Pro Gln Ser Pro Ile Lys Asp Met Thr Gly Thr Asn Ser Leu Lys Trp Ile Ser Ser Lys Val Arg Leu Met Lys Lys Lys Lys Ala Ile Ile Thr Thr Ser Asp Ser Ser Lys Gln His Thr Asn Asn Asp Gln Ser Ser Asn Leu Ser Asn Ser Glu Arg Gln Asn Gly Tyr Asn Asn Asp Cys Val Ile Arg Ile Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Arg Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Ala Met Ala Thr Ala Thr Ala Thr Ala Val Ser Gly Val Ser Pro Pro Val Met Lys Lys Lys Met Gln Asn Lys Asn Lys Ile Ser Asn Gly Val Tyr Lys Ile Leu Ser Pro Leu Pro Leu Lys Val Asn Thr Cys Lys Arg Met Ile Thr Leu Glu Glu Thr Ala Leu Ala Glu Asp Leu Glu Thr Gln Ser Asn Ser Thr Met Leu Ser Ser Ser Asp Asn Ile Tyr Phe Asp Asp Leu Ala Leu Leu Leu Ser Lys Ser Ser Ala Tyr Gln Gln Val Phe Pro Gln Asp Glu Lys Glu Ala Ala Ile Leu Leu Met Ala Leu Ser His Gly Met Val His Gly
Claims (30)
1. A method of modulating a characteristic in a plant cell comprising modulating expression of a GATA transcription factor gene in the plant cell.
2. The method according to claim 1, wherein the expression of the GATA
transcription factor gene is modulated by administering, to the cell, an effective amount of an agent that can modulate the expression levels of a GATA transcription factor gene in the plant cell.
transcription factor gene is modulated by administering, to the cell, an effective amount of an agent that can modulate the expression levels of a GATA transcription factor gene in the plant cell.
3. The method according to claim 1 or 2, wherein the characteristic is an agronomic trait.
4. The method according to claim 3, wherein the characteristic is one that is affected by nitrogen, carbon and/or sulfur metabolism, biosynthesis of lipids, perception of nutrients, nutritional adaptation, electron transport and/or membrane associated energy conservation.
5. The method according to claim 3, wherein the characteristic is selected from one or more of nitrogen utilization, yield, cell growth, reproduction, chlorophyll synthesis, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation, abiotic stress tolerance and nutritional composition.
6. The method according to claim 5, wherein the characteristic is nitrogen utilization.
7. The method according to any one of claims 1-6, wherein the plant cell is a dicot, a gymnosperm or a monocot.
8. The method according to claim 7, wherein the monocot is selected from the group consisting of maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum sp. and teosite.
9. The method according to claim 8, wherein the dicot is selected from the group consisting of soybean, tobacco or cotton.
10. The method according to any one of claims 2-9, wherein the agent enhances the expression levels of a GATA transcription factor gene in the plant cell.
11. The method according to claim 10, wherein the modulated characteristic is an increase or improvement in one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation, abiotic stress tolerance and nutritional composition.
12. The method according to claim 10 or 11, wherein the agent that enhances the expression levels of a GATA transcription factor gene in the plant cell comprises a nucleic acid molecule encoding a GATA transcription factor.
13. The method according to claim 12, wherein the nucleic acid molecule comprises the sequence of the OsGATA11 gene of SEQ ID NO:1 or a functional fragment thereof.
14. The method according to claim 12, wherein the nucleic acid molecule comprises a sequence that hybridizes under medium stringency conditions to the OsGATA11 gene of SEQ ID NO:1 or a functional fragment thereof.
15. The method according to claim 12, wherein the nucleic acid molecule comprises a nucleic acid sequence derived from the nucleotide sequence of the OsGATA11 gene of SEQ ID NO:1 and has a nucleotide sequence comprising codons specific for expression in plants.
16. The method according to any one of claims 1-12, wherein the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c);
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
(a) a nucleotide sequence of SEQ ID NO:1 or a fragment or domain thereof;
(b) a nucleotide sequence encoding a polypeptide of SEQ ID NO:2, a fragment or domain thereof;
(c) a nucleotide sequence having substantial similarity to (a) or (b);
(d) a nucleotide sequence capable of hybridizing to (a), (b) or (c);
(e) a nucleotide sequence complementary to (a), (b), (c) or (d); or (f) a nucleotide sequence that is the reverse complement of (a), (b), (c) or (d).
17. The method according to any one of claims 1-12, wherein the agent that can modulate the expression levels of a GATA transcription factor gene in a plant cell comprises:
(a) a polypeptide sequence listed in SEQ ID NO:2, or a functional fragment, domain, repeat, or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or a functional fragment or domain thereof, or a sequence complementary thereto; or (d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto.
(a) a polypeptide sequence listed in SEQ ID NO:2, or a functional fragment, domain, repeat, or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:1, or a functional fragment or domain thereof, or a sequence complementary thereto; or (d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto.
18. The method according to any one of claims 12-17, wherein the nucleic acid sequence is expressed in a specific location or tissue of the plant.
19. The method according to claim 18, wherein the location or tissue is selected from one or more of seed, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf and flower.
20. The method according to claim 19, wherein the location or tissue is a seed.
21. The method according to any one of claims 12-20, wherein the agent that enhances the expression levels of a GATA transcription factor gene in the plant cell comprises an expression cassette for modulating a characteristic in a plant cell including a promoter sequence operably linked to the isolated nucleic acid encoding a GATA transcription factor.
22. A method of producing a transgenic plant comprising:
(1) providing an isolated nucleic acid having the sequence shown in SEQ ID NO:1; and (2) introducing the nucleic acid into the plant, wherein the nucleic acid is expressed in the plant.
(1) providing an isolated nucleic acid having the sequence shown in SEQ ID NO:1; and (2) introducing the nucleic acid into the plant, wherein the nucleic acid is expressed in the plant.
23. The method according to claim 22 wherein the plant demonstrates an increase or improvement in one or more of nitrogen utilization, yield, cell growth, reproduction, photosynthesis, nitrogen assimilation, disease resistance, differentiation, signal transduction, gene regulation, abiotic stress tolerance and nutritional composition.
24. The method according to claim 23 wherein the plant has an increase in chlorophyll synthesis, seed yield and/or stress tolerance.
25. The method of any one of claims 22 or 24 wherein the nucleic acid is introduced into the plant using a method selected from the group consisting of microparticle bombardment, Agrobacterium-mediated transformation, and whiskers-mediated transformation.
26. A plant cell of the plant produced by any one of claims 22-25.
27. A use of an nucleic acid molecule comprising a nucleotide sequence of at least 10 bases, which sequence is identical, complementary, or substantially similar to a region of any of SEQ ID NO:1 or a functional fragment thereof and wherein the use is selected from the group consisting of:
(i) use as a chromosomal marker to identify the location of the corresponding or complementary polynucleotide on a native or artificial chromosome;
(ii) use as a marker for RFLP analysis;
(iii) use as a marker for quantitative trait linked breeding;
(iv) use as a marker for marker-assisted breeding;
(v) use as a bait sequence in a two-hybrid system to identify sequence encoding polypeptides interacting with the polypeptide encoded by the bait sequence;
(vi) use as a diagnostic indicator for genotyping or identifying an individual or population of individuals; and (vii) use for genetic analysis to identify boundaries of genes or exons.
(i) use as a chromosomal marker to identify the location of the corresponding or complementary polynucleotide on a native or artificial chromosome;
(ii) use as a marker for RFLP analysis;
(iii) use as a marker for quantitative trait linked breeding;
(iv) use as a marker for marker-assisted breeding;
(v) use as a bait sequence in a two-hybrid system to identify sequence encoding polypeptides interacting with the polypeptide encoded by the bait sequence;
(vi) use as a diagnostic indicator for genotyping or identifying an individual or population of individuals; and (vii) use for genetic analysis to identify boundaries of genes or exons.
28. An antibody raised against an isolated polypeptide comprising:
(a) a polypeptide sequence of SEQ ID NO:2, or a fragment, domain, repeat or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:2, or a fragment or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
(a) a polypeptide sequence of SEQ ID NO:2, or a fragment, domain, repeat or chimera thereof;
(b) a polypeptide sequence having substantial similarity to (a);
(c) a polypeptide sequence encoded by a nucleotide sequence identical to or having substantial similarity to a nucleotide sequence listed in SEQ ID NO:2, or a fragment or domain thereof, or a sequence complementary thereto;
(d) a polypeptide sequence encoded by a nucleotide sequence capable of hybridizing under medium stringency conditions to a nucleotide sequence listed in SEQ ID NO:1, or to a sequence complementary thereto; or (e) a functional fragment of (a), (b), (c) or (d).
29. The antibody according to claim 28 wherein the polypeptide comprises the sequence of SEQ ID NO:2 or a variant thereof having a conservative amino acid modification.
30. An immunoassay kit comprising the antibody of claim 28 or 29 and instructions for the use thereof.
Priority Applications (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA 2584934 CA2584934A1 (en) | 2007-04-17 | 2007-04-17 | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
MX2009011184A MX2009011184A (en) | 2007-04-17 | 2008-04-16 | Carbon and nitrogen-regulating gene and protein and modulation thereof. |
CN200880018512A CN101688180A (en) | 2007-04-17 | 2008-04-16 | Genes and proteins regulating carbon and nitrogen and their regulation |
CN201510092990.0A CN104726464A (en) | 2007-04-17 | 2008-04-16 | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
EP08733744A EP2144996A4 (en) | 2007-04-17 | 2008-04-16 | GENE AND PROTEIN FOR REGULATING CARBON AND NITROGEN AND MODULATION THEREOF |
PCT/CA2008/000688 WO2008124933A1 (en) | 2007-04-17 | 2008-04-16 | Carbon and nitrogen-regulating gene and protein and modulation thereof |
BRPI0810652-5A2A BRPI0810652A2 (en) | 2007-04-17 | 2008-04-16 | CARBON AND NITROGEN REGULATOR GENE AND PROTEIN AND MODULATION OF THE SAME. |
CL2008001097A CL2008001097A1 (en) | 2007-04-17 | 2008-04-17 | METHOD FOR MODULATING A CHARACTERISTICS IN A PLANT OR VEGETABLE CELL THAT INCLUDES MODULATING THE EXPRESSION OF A CAT TRANSCRIPTION FACTOR; AND METHOD TO PRODUCE A TRANSGENIC PLANT. |
ARP080101583 AR066094A1 (en) | 2007-04-17 | 2008-04-17 | A METHOD TO MODULATE A CHARACTERISTICS IN A PLANT OR VEGETABLE CELL, THE VEGETABLE CELL PRODUCED BY SUCH METHOD, ITS USE AND THE ANTIBODY OBTAINED |
ZA2009/07097A ZA200907097B (en) | 2007-04-17 | 2009-10-12 | Carbon and nitrogen-regulating gene and protein and modulation thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA 2584934 CA2584934A1 (en) | 2007-04-17 | 2007-04-17 | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2584934A1 true CA2584934A1 (en) | 2008-10-17 |
Family
ID=39855358
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA 2584934 Abandoned CA2584934A1 (en) | 2007-04-17 | 2007-04-17 | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
Country Status (9)
Country | Link |
---|---|
EP (1) | EP2144996A4 (en) |
CN (2) | CN104726464A (en) |
AR (1) | AR066094A1 (en) |
BR (1) | BRPI0810652A2 (en) |
CA (1) | CA2584934A1 (en) |
CL (1) | CL2008001097A1 (en) |
MX (1) | MX2009011184A (en) |
WO (1) | WO2008124933A1 (en) |
ZA (1) | ZA200907097B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101948870B (en) * | 2010-09-08 | 2012-06-27 | 上海交通大学 | Method for reducing branch quantity of plant and improving chlorophyll and anthocyanin contents of plant |
CN107937415B (en) * | 2017-12-27 | 2020-04-07 | 宁夏农林科学院农业生物技术研究中心 | Potato GATA transcription factor and cloning method and application thereof |
CN109633151B (en) * | 2018-12-26 | 2022-03-11 | 西北农林科技大学 | A kind of Salmonella Enteritidis detection method, test strip and application |
CN116640769B (en) * | 2023-05-17 | 2024-08-20 | 青岛农业大学 | Peanut AhGATA11 gene and its application in improving plant stress resistance |
WO2024263876A1 (en) * | 2023-06-21 | 2024-12-26 | The Curators Of The University Of Missouri | Compositions and methods for improving seed yield of plants |
Family Cites Families (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4945050A (en) | 1984-11-13 | 1990-07-31 | Cornell Research Foundation, Inc. | Method for transporting substances into living cells and tissues and apparatus therefor |
US5036006A (en) | 1984-11-13 | 1991-07-30 | Cornell Research Foundation, Inc. | Method for transporting substances into living cells and tissues and apparatus therefor |
US5100792A (en) | 1984-11-13 | 1992-03-31 | Cornell Research Foundation, Inc. | Method for transporting substances into living cells and tissues |
NZ217113A (en) | 1985-08-07 | 1988-06-30 | Monsanto Co | Production of eucaryotic plants which are glyphosate resistant, vectors (transformation and expression), chimeric gene and plant cells |
US4987071A (en) | 1986-12-03 | 1991-01-22 | University Patents, Inc. | RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods |
DE3856241T2 (en) | 1987-05-20 | 1999-03-25 | Novartis Ag, Basel | Process for the production of transgenic Zea mays plants which have been regenerated from protoplasts or from cells obtained from protoplasts |
DE19775050I2 (en) | 1987-08-21 | 2010-12-16 | Syngenta Participations Ag | Benzothiadiazoles and their use in processes and agents for plant diseases. |
DE58909753D1 (en) | 1988-03-08 | 1997-01-23 | Ciba Geigy Ag | Regeneration of fertile Gramineae plants from the Pooideae subfamily based on protoplasts |
US5614395A (en) | 1988-03-08 | 1997-03-25 | Ciba-Geigy Corporation | Chemically regulatable and anti-pathogenic DNA sequences and uses thereof |
EP0332104A3 (en) | 1988-03-08 | 1991-03-20 | Ciba-Geigy Ag | Chemically regulatable dna sequences and genes and uses thereof |
CA1339684C (en) | 1988-05-17 | 1998-02-24 | Peter H. Quail | Plant ubquitin promoter system |
NZ230375A (en) | 1988-09-09 | 1991-07-26 | Lubrizol Genetics Inc | Synthetic gene encoding b. thuringiensis insecticidal protein |
EP1103616A3 (en) | 1989-02-24 | 2001-06-27 | Monsanto Company | Synthetic plant genes and method for preparation |
ATE241699T1 (en) | 1989-03-24 | 2003-06-15 | Syngenta Participations Ag | DISEASE RESISTANT TRANSGENIC PLANT |
US5501967A (en) | 1989-07-26 | 1996-03-26 | Mogen International, N.V./Rijksuniversiteit Te Leiden | Process for the site-directed integration of DNA into the genome of plants |
US4940935A (en) | 1989-08-28 | 1990-07-10 | Ried Ashman Manufacturing | Automatic SMD tester |
ATE225853T1 (en) | 1990-04-12 | 2002-10-15 | Syngenta Participations Ag | TISSUE-SPECIFIC PROMOTORS |
US5270163A (en) | 1990-06-11 | 1993-12-14 | University Research Corporation | Methods for identifying nucleic acid ligands |
US5639949A (en) | 1990-08-20 | 1997-06-17 | Ciba-Geigy Corporation | Genes for the synthesis of antipathogenic substances |
GB9304200D0 (en) | 1993-03-02 | 1993-04-21 | Sandoz Ltd | Improvements in or relating to organic compounds |
US5994629A (en) | 1991-08-28 | 1999-11-30 | Novartis Ag | Positive selection |
UA48104C2 (en) | 1991-10-04 | 2002-08-15 | Новартіс Аг | Dna fragment including sequence that codes an insecticide protein with optimization for corn, dna fragment providing directed preferable for the stem core expression of the structural gene of the plant related to it, dna fragment providing specific for the pollen expression of related to it structural gene in the plant, recombinant dna molecule, method for obtaining a coding sequence of the insecticide protein optimized for corn, method of corn plants protection at least against one pest insect |
JPH07505531A (en) | 1992-04-15 | 1995-06-22 | プラント・ジェネティック・システムズ・エヌ・ブイ | Transformation method for monocot cells |
DE69334225D1 (en) | 1992-07-07 | 2008-07-31 | Japan Tobacco Inc | METHOD FOR TRANSFORMING A MONOCOTYLEDONE PLANT |
AU704601B2 (en) | 1994-01-18 | 1999-04-29 | Scripps Research Institute, The | Zinc finger protein derivatives and methods therefor |
US6140466A (en) | 1994-01-18 | 2000-10-31 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
USRE45795E1 (en) | 1994-08-20 | 2015-11-10 | Gendaq, Ltd. | Binding proteins for recognition of DNA |
HUP9901044A3 (en) | 1996-02-28 | 2001-11-28 | Syngenta Participations Ag | Dna molecules encoding plant protoporphyrinogen oxidase and inhibitor-resistant mutants thereof |
US6506559B1 (en) | 1997-12-23 | 2003-01-14 | Carnegie Institute Of Washington | Genetic inhibition by double-stranded RNA |
EP1068311B2 (en) | 1998-04-08 | 2020-12-09 | Commonwealth Scientific and Industrial Research Organisation | Methods and means for obtaining modified phenotypes |
AR020078A1 (en) | 1998-05-26 | 2002-04-10 | Syngenta Participations Ag | METHOD FOR CHANGING THE EXPRESSION OF AN OBJECTIVE GENE IN A PLANT CELL |
JP2005185101A (en) * | 2002-05-30 | 2005-07-14 | National Institute Of Agrobiological Sciences | Plant full-length cDNA and use thereof |
ATE539158T1 (en) | 2002-09-18 | 2012-01-15 | Mendel Biotechnology Inc | POLYNUCLEOTIDES AND POLYPEPTIDES IN PLANTS |
US20070250956A1 (en) * | 2005-01-14 | 2007-10-25 | University Of Guelph | Nitrogen-Regulated Sugar Sensing Gene and Protein and Modulation Thereof |
AU2006205999B2 (en) | 2005-01-14 | 2011-09-22 | University Of Guelph | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
AU2006217847B2 (en) * | 2005-02-26 | 2011-04-07 | Basf Plant Science Gmbh | Expression cassettes for seed-preferential expression in plants |
-
2007
- 2007-04-17 CA CA 2584934 patent/CA2584934A1/en not_active Abandoned
-
2008
- 2008-04-16 EP EP08733744A patent/EP2144996A4/en not_active Withdrawn
- 2008-04-16 WO PCT/CA2008/000688 patent/WO2008124933A1/en active Application Filing
- 2008-04-16 MX MX2009011184A patent/MX2009011184A/en active IP Right Grant
- 2008-04-16 BR BRPI0810652-5A2A patent/BRPI0810652A2/en not_active IP Right Cessation
- 2008-04-16 CN CN201510092990.0A patent/CN104726464A/en active Pending
- 2008-04-16 CN CN200880018512A patent/CN101688180A/en active Pending
- 2008-04-17 CL CL2008001097A patent/CL2008001097A1/en unknown
- 2008-04-17 AR ARP080101583 patent/AR066094A1/en unknown
-
2009
- 2009-10-12 ZA ZA2009/07097A patent/ZA200907097B/en unknown
Also Published As
Publication number | Publication date |
---|---|
EP2144996A4 (en) | 2010-06-09 |
EP2144996A1 (en) | 2010-01-20 |
BRPI0810652A2 (en) | 2014-10-07 |
CN104726464A (en) | 2015-06-24 |
ZA200907097B (en) | 2016-08-31 |
WO2008124933A1 (en) | 2008-10-23 |
AR066094A1 (en) | 2009-07-22 |
CL2008001097A1 (en) | 2008-08-22 |
CN101688180A (en) | 2010-03-31 |
MX2009011184A (en) | 2009-11-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20100218282A1 (en) | Nucleic acid molecules from rice controlling abiotic stress tolerance | |
US20040219675A1 (en) | Nucleic acid molecules from rice encoding proteins for abiotic stress tolerance, enhanced yeild, disease resistance and altered nutritional quality and uses thereof | |
WO2005084331A2 (en) | Sorghum gene expression profiling | |
US8586825B2 (en) | Nitrogen-regulated sugar sensing gene and protein and modulation thereof | |
US20090328255A1 (en) | Nitrogen limitation adaptability gene and protein and modulation thereof | |
CA2584934A1 (en) | Nitrogen-regulated sugar sensing gene and protein and modulation thereof | |
US8742201B2 (en) | Nitrogen-regulated sugar sensing gene and protein and modulation thereof | |
US7230159B2 (en) | Isolated BOS1 gene promoters from arabidopsis and uses thereof | |
US6956115B2 (en) | Nucleic acid molecules from rice encoding RAR1 disease resistance proteins and uses thereof | |
WO2007060514A2 (en) | Methods and compositions for modulating root growth in plants | |
WO2007036045A1 (en) | Method of modulating flowering time and shoot branching | |
WO2005014794A2 (en) | Generation of low phytate plants by molecular disruption of inositol polyphosphate kinases |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Dead |
Effective date: 20170418 |