CA2350328A1 - Gene encoding oxalate decarboxylase from aspergillus phoenices - Google Patents
Gene encoding oxalate decarboxylase from aspergillus phoenices Download PDFInfo
- Publication number
- CA2350328A1 CA2350328A1 CA 2350328 CA2350328A CA2350328A1 CA 2350328 A1 CA2350328 A1 CA 2350328A1 CA 2350328 CA2350328 CA 2350328 CA 2350328 A CA2350328 A CA 2350328A CA 2350328 A1 CA2350328 A1 CA 2350328A1
- Authority
- CA
- Canada
- Prior art keywords
- nucleic acid
- plant
- pro
- glu
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010068005 Oxalate decarboxylase Proteins 0.000 title claims abstract description 31
- 241000228212 Aspergillus Species 0.000 title claims abstract description 24
- 108090000623 proteins and genes Proteins 0.000 title abstract description 132
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 claims abstract description 101
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 81
- 230000000593 degrading effect Effects 0.000 claims abstract description 4
- 241000196324 Embryophyta Species 0.000 claims description 141
- 108020004707 nucleic acids Proteins 0.000 claims description 62
- 102000039446 nucleic acids Human genes 0.000 claims description 62
- 238000000034 method Methods 0.000 claims description 59
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 34
- 239000002773 nucleotide Substances 0.000 claims description 26
- 125000003729 nucleotide group Chemical group 0.000 claims description 26
- 235000006408 oxalic acid Nutrition 0.000 claims description 23
- 230000014509 gene expression Effects 0.000 claims description 22
- 239000013612 plasmid Substances 0.000 claims description 20
- 230000001939 inductive effect Effects 0.000 claims description 17
- RNPABQVCNAUEIY-GUQYYFCISA-N Germine Chemical group O1[C@@]([C@H](CC[C@]23C)O)(O)[C@H]3C[C@@H](O)[C@@H]([C@]3(O)[C@@H](O)[C@H](O)[C@@H]4[C@]5(C)O)[C@@]12C[C@H]3[C@@H]4CN1[C@H]5CC[C@H](C)C1 RNPABQVCNAUEIY-GUQYYFCISA-N 0.000 claims description 16
- 108010020084 germin Proteins 0.000 claims description 16
- RNPABQVCNAUEIY-UHFFFAOYSA-N germine Natural products O1C(C(CCC23C)O)(O)C3CC(O)C(C3(O)C(O)C(O)C4C5(C)O)C12CC3C4CN1C5CCC(C)C1 RNPABQVCNAUEIY-UHFFFAOYSA-N 0.000 claims description 16
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 14
- 239000013598 vector Substances 0.000 claims description 14
- 244000020551 Helianthus annuus Species 0.000 claims description 13
- 201000010099 disease Diseases 0.000 claims description 13
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 13
- 229920001184 polypeptide Polymers 0.000 claims description 13
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 13
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 10
- 230000001717 pathogenic effect Effects 0.000 claims description 7
- 240000000385 Brassica napus var. napus Species 0.000 claims description 6
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 6
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 claims description 5
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 5
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 5
- 230000015556 catabolic process Effects 0.000 claims description 5
- 238000006731 degradation reaction Methods 0.000 claims description 5
- 239000001963 growth medium Substances 0.000 claims description 5
- 235000017060 Arachis glabrata Nutrition 0.000 claims description 4
- 244000105624 Arachis hypogaea Species 0.000 claims description 4
- 235000010777 Arachis hypogaea Nutrition 0.000 claims description 4
- 235000018262 Arachis monticola Nutrition 0.000 claims description 4
- 244000020518 Carthamus tinctorius Species 0.000 claims description 4
- 235000003255 Carthamus tinctorius Nutrition 0.000 claims description 4
- 244000068988 Glycine max Species 0.000 claims description 4
- 235000010469 Glycine max Nutrition 0.000 claims description 4
- 235000004431 Linum usitatissimum Nutrition 0.000 claims description 4
- 240000006240 Linum usitatissimum Species 0.000 claims description 4
- 240000004658 Medicago sativa Species 0.000 claims description 4
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 4
- 244000046052 Phaseolus vulgaris Species 0.000 claims description 4
- 235000010627 Phaseolus vulgaris Nutrition 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 235000020232 peanut Nutrition 0.000 claims description 4
- 241000221662 Sclerotinia Species 0.000 claims description 3
- 230000008506 pathogenesis Effects 0.000 claims description 3
- 241001558929 Sclerotium <basidiomycota> Species 0.000 claims description 2
- 241000219793 Trifolium Species 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims 3
- 235000016639 Syzygium aromaticum Nutrition 0.000 claims 2
- 244000223014 Syzygium aromaticum Species 0.000 claims 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims 1
- 102000004169 proteins and genes Human genes 0.000 abstract description 45
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 23
- 238000003556 assay Methods 0.000 abstract description 17
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 13
- 239000003550 marker Substances 0.000 abstract description 8
- 239000013615 primer Substances 0.000 description 61
- 210000004027 cell Anatomy 0.000 description 53
- 108020004414 DNA Proteins 0.000 description 43
- 210000001519 tissue Anatomy 0.000 description 32
- 229940039748 oxalate Drugs 0.000 description 31
- 102000004190 Enzymes Human genes 0.000 description 25
- 108090000790 Enzymes Proteins 0.000 description 25
- 108010013835 arginine glutamate Proteins 0.000 description 24
- 230000009466 transformation Effects 0.000 description 21
- 241000589158 Agrobacterium Species 0.000 description 20
- 238000009396 hybridization Methods 0.000 description 20
- 239000012634 fragment Substances 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 16
- 230000000694 effects Effects 0.000 description 16
- 229940024606 amino acid Drugs 0.000 description 15
- 230000009261 transgenic effect Effects 0.000 description 15
- 239000002609 medium Substances 0.000 description 14
- 239000000523 sample Substances 0.000 description 14
- 239000002299 complementary DNA Substances 0.000 description 13
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 12
- 150000001413 amino acids Chemical class 0.000 description 12
- 206010020649 Hyperkeratosis Diseases 0.000 description 11
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 11
- 108010005942 methionylglycine Proteins 0.000 description 11
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 10
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 229930027917 kanamycin Natural products 0.000 description 10
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 10
- 229960000318 kanamycin Drugs 0.000 description 10
- 229930182823 kanamycin A Natural products 0.000 description 10
- 108010003700 lysyl aspartic acid Proteins 0.000 description 10
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 8
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 8
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 8
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 8
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 8
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 8
- 238000002360 preparation method Methods 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 108010029895 rubimetide Proteins 0.000 description 8
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 8
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 7
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 7
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 7
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 7
- 108010063734 Oxalate oxidase Proteins 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 239000003795 chemical substances by application Substances 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 230000001404 mediated effect Effects 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 239000002245 particle Substances 0.000 description 7
- 108010024607 phenylalanylalanine Proteins 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 6
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 6
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 6
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 6
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 6
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 6
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- 108020004705 Codon Proteins 0.000 description 6
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 6
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 6
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 6
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 6
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 6
- YHRCLOURJWJABF-WDSOQIARSA-N Trp-His-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YHRCLOURJWJABF-WDSOQIARSA-N 0.000 description 6
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 108091033319 polynucleotide Proteins 0.000 description 6
- 102000040430 polynucleotide Human genes 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 5
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 5
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 5
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 5
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 5
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 5
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 5
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 5
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 5
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 5
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 5
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 5
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 5
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 5
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 5
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 5
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 5
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 5
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 5
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 5
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 5
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 5
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 5
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 5
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 5
- 241000701489 Cauliflower mosaic virus Species 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 5
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 5
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 5
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 5
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 5
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 5
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 5
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 5
- BHXSLRDWXIFKTP-SRVKXCTJSA-N Glu-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BHXSLRDWXIFKTP-SRVKXCTJSA-N 0.000 description 5
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 5
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 5
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 5
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 5
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 5
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 5
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 5
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 5
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 5
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 5
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 5
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 5
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 5
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 5
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 5
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 5
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 5
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 5
- CHLJXFMOQGYDNH-SZMVWBNQSA-N Met-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 CHLJXFMOQGYDNH-SZMVWBNQSA-N 0.000 description 5
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 5
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 5
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 5
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 5
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 5
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 5
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 5
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 5
- 238000012300 Sequence Analysis Methods 0.000 description 5
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 5
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 5
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 5
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 5
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 5
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 5
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 5
- VFURAIPBOIWAKP-SZMVWBNQSA-N Trp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VFURAIPBOIWAKP-SZMVWBNQSA-N 0.000 description 5
- IMYTYAWRKBYTSX-YTQUADARSA-N Trp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O IMYTYAWRKBYTSX-YTQUADARSA-N 0.000 description 5
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 5
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 5
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 5
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 5
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 5
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 5
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 5
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 5
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 5
- 108010036533 arginylvaline Proteins 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 230000002538 fungal effect Effects 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 5
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 244000052769 pathogen Species 0.000 description 5
- 108010018625 phenylalanylarginine Proteins 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 4
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 4
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 4
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 4
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 4
- 241000219194 Arabidopsis Species 0.000 description 4
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 4
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 4
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 4
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 4
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 4
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 4
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 4
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 4
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 4
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 4
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 4
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 4
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 4
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 4
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 4
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 4
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 4
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 4
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 4
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 4
- 244000061456 Solanum tuberosum Species 0.000 description 4
- 235000002595 Solanum tuberosum Nutrition 0.000 description 4
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 4
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 4
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 4
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 4
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 235000019253 formic acid Nutrition 0.000 description 4
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 4
- 238000003018 immunoassay Methods 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- 230000008018 melting Effects 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 239000000843 powder Substances 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000001988 toxicity Effects 0.000 description 4
- 231100000419 toxicity Toxicity 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 3
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 3
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 3
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 3
- 108090000698 Formate Dehydrogenases Proteins 0.000 description 3
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 3
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 3
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 3
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 3
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 3
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 3
- 241000209140 Triticum Species 0.000 description 3
- 235000021307 Triticum Nutrition 0.000 description 3
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 229960004261 cefotaxime Drugs 0.000 description 3
- AZZMGZXNTDTSME-JUZDKLSSSA-M cefotaxime sodium Chemical compound [Na+].N([C@@H]1C(N2C(=C(COC(C)=O)CS[C@@H]21)C([O-])=O)=O)C(=O)\C(=N/OC)C1=CSC(N)=N1 AZZMGZXNTDTSME-JUZDKLSSSA-M 0.000 description 3
- 238000007824 enzymatic assay Methods 0.000 description 3
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 3
- 229960005542 ethidium bromide Drugs 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 2
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 240000006499 Flammulina velutipes Species 0.000 description 2
- 235000016640 Flammulina velutipes Nutrition 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 2
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 2
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 2
- 240000005979 Hordeum vulgare Species 0.000 description 2
- 235000007340 Hordeum vulgare Nutrition 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 101100477785 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SMF3 gene Proteins 0.000 description 2
- 241000221696 Sclerotinia sclerotiorum Species 0.000 description 2
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 102220472177 Thymidine kinase 2, mitochondrial_T17V_mutation Human genes 0.000 description 2
- 244000098338 Triticum aestivum Species 0.000 description 2
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000001307 helium Substances 0.000 description 2
- 229910052734 helium Inorganic materials 0.000 description 2
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000009630 liquid culture Methods 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000003976 plant breeding Methods 0.000 description 2
- 230000008635 plant growth Effects 0.000 description 2
- 244000000003 plant pathogen Species 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000001850 reproductive effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 239000003270 steroid hormone Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical group C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- 108700004205 Arabidopsis HSP18.2 Proteins 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- KHBQMWCZKVMBLN-UHFFFAOYSA-N Benzenesulfonamide Chemical compound NS(=O)(=O)C1=CC=CC=C1 KHBQMWCZKVMBLN-UHFFFAOYSA-N 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000218631 Coniferophyta Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- ANPADMNVVOOYKW-DCAQKATOSA-N Cys-His-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ANPADMNVVOOYKW-DCAQKATOSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 101710198510 Enoyl-[acyl-carrier-protein] reductase [NADH] Proteins 0.000 description 1
- 101710129170 Extensin Proteins 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- 101150056327 HMG2 gene Proteins 0.000 description 1
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- XSEAJSPAOTZXJE-IHPCNDPISA-N His-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N XSEAJSPAOTZXJE-IHPCNDPISA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 101150062179 II gene Proteins 0.000 description 1
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 235000002678 Ipomoea batatas Nutrition 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- 239000007987 MES buffer Substances 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 101800000135 N-terminal protein Proteins 0.000 description 1
- 101150005851 NOS gene Proteins 0.000 description 1
- 241000208133 Nicotiana plumbaginifolia Species 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 101800001452 P1 proteinase Proteins 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 244000299790 Rheum rhabarbarum Species 0.000 description 1
- 235000009411 Rheum rhabarbarum Nutrition 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 101100020617 Solanum lycopersicum LAT52 gene Proteins 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 208000026723 Urinary tract disease Diseases 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 108010006886 Vitrogen Proteins 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010021384 barley lectin Proteins 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- -1 but not limited to Proteins 0.000 description 1
- HOZOZZFCZRXYEK-GSWUYBTGSA-M butylscopolamine bromide Chemical compound [Br-].C1([C@@H](CO)C(=O)O[C@H]2C[C@@H]3[N+]([C@H](C2)[C@@H]2[C@H]3O2)(C)CCCC)=CC=CC=C1 HOZOZZFCZRXYEK-GSWUYBTGSA-M 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000011088 calibration curve Methods 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 244000037666 field crops Species 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 238000011990 functional testing Methods 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 235000021384 green leafy vegetables Nutrition 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000010324 immunological assay Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000012499 inoculation medium Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000004570 mortar (masonry) Substances 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- NIFHFRBCEUSGEE-UHFFFAOYSA-N oxalic acid Chemical compound OC(=O)C(O)=O.OC(=O)C(O)=O NIFHFRBCEUSGEE-UHFFFAOYSA-N 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000005648 plant growth regulator Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 108010055896 polyornithine Proteins 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000001965 potato dextrose agar Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000004080 punching Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- ZNCPFRVNHGOPAG-UHFFFAOYSA-L sodium oxalate Chemical compound [Na+].[Na+].[O-]C(=O)C([O-])=O ZNCPFRVNHGOPAG-UHFFFAOYSA-L 0.000 description 1
- 229940039790 sodium oxalate Drugs 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 244000000000 soil microbiome Species 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 108700020534 tetracycline resistance-encoding transposon repressor Proteins 0.000 description 1
- 230000006032 tissue transformation Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 230000002485 urinary effect Effects 0.000 description 1
- 208000014001 urinary system disease Diseases 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8209—Selection, visualisation of transformants, reporter constructs, e.g. antibiotic resistance markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8282—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for fungal resistance
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
A novel nucleic acid sequence encoding Aspergillus phoenices oxalate decarboxylase (APOXD) has been determined, as well as the encoded amino acid sequence. The gene and its encoded protein are useful in degrading oxalate, in diagnostic assays of oxalate, and as a selectable marker.
Description
Applicant Ref. 0561-PCT
- I -GENE ENCODING OXALATE DECARBOXYLASE FROM
ASPERGILLUS PHOENICES
Field of the Invention This invention relates to a novel nucleic acid sequence encoding oxalate decarboylyase isolated from Aspergillus phoenices and to use of the nucleic acid sequence to produce its encoded protein.
Background of the Invention Oxalic acid (oxalate) is a diffusable toxin associated with various plant diseases, particularly those caused by fungi. Some leafy green vegetables, including spinach and rhubarb, produce oxalate as a nutritional stress factor. When plants containing oxalate are consumed in large amounts, they can be toxic to humans.
Oxalate is used by pathogens to gain access into and subsequently throughout an infected plant. See for example, Mehta and Datta, The Jourhal of Biological Chemistry, 266:23548-23553, 1991; and published PCT Application W092/14824.
Field crops such as sunflower, bean, canola, alfalfa, soybean, flax, safflower, peanut, clover, as well as numerous vegetable crops, flowers, and trees are susceptible to oxalate-secreting pathogens. For example, fungal species including Sclerotinia and Sclerotium use oxalic acid to provide an opportunistic route of entry into plants, causing serious damage to crops such as sunflower.
Because of the role of oxalate in plant disease and toxicity, compounds that inhibit oxalate mediated disease, and particularly genes encoding such inhibitory degrading molecules, are greatly needed.
Enzymes that utilize oxalate as a substrate have been identified. These include oxalate oxidase and oxalate decarboxylase. Oxalate oxidase catalyzes the conversion of oxalate to COZ and H202. A gene encoding barley oxalate oxidase has been cloned from a barley root cDNA library and sequenced (See PCT publication No.
W092/14824). A gene encoding wheat oxalate oxidase activity (Germin) has been isolated and sequenced, (PCT publication No. WO 94/13790) and the gene has been introduced into a canola variety. Canola plants harboring the gene appeared to show Applicant Ref. 0561-PCT
- I -GENE ENCODING OXALATE DECARBOXYLASE FROM
ASPERGILLUS PHOENICES
Field of the Invention This invention relates to a novel nucleic acid sequence encoding oxalate decarboylyase isolated from Aspergillus phoenices and to use of the nucleic acid sequence to produce its encoded protein.
Background of the Invention Oxalic acid (oxalate) is a diffusable toxin associated with various plant diseases, particularly those caused by fungi. Some leafy green vegetables, including spinach and rhubarb, produce oxalate as a nutritional stress factor. When plants containing oxalate are consumed in large amounts, they can be toxic to humans.
Oxalate is used by pathogens to gain access into and subsequently throughout an infected plant. See for example, Mehta and Datta, The Jourhal of Biological Chemistry, 266:23548-23553, 1991; and published PCT Application W092/14824.
Field crops such as sunflower, bean, canola, alfalfa, soybean, flax, safflower, peanut, clover, as well as numerous vegetable crops, flowers, and trees are susceptible to oxalate-secreting pathogens. For example, fungal species including Sclerotinia and Sclerotium use oxalic acid to provide an opportunistic route of entry into plants, causing serious damage to crops such as sunflower.
Because of the role of oxalate in plant disease and toxicity, compounds that inhibit oxalate mediated disease, and particularly genes encoding such inhibitory degrading molecules, are greatly needed.
Enzymes that utilize oxalate as a substrate have been identified. These include oxalate oxidase and oxalate decarboxylase. Oxalate oxidase catalyzes the conversion of oxalate to COZ and H202. A gene encoding barley oxalate oxidase has been cloned from a barley root cDNA library and sequenced (See PCT publication No.
W092/14824). A gene encoding wheat oxalate oxidase activity (Germin) has been isolated and sequenced, (PCT publication No. WO 94/13790) and the gene has been introduced into a canola variety. Canola plants harboring the gene appeared to show Applicant Ref. 0561-PCT
some resistance to Sclerotinia sclerotiorum, in vitro (Dumas, et al., 1994, Abstracts: 4th Int'l. Congress of Plant Molecular Biology, #1906).
- Oxalate decarboxylase converts oxalate to C02 and formic acid. A gene encoding oxalate decarboxylase has been isolated from Collybia velutipes (now termed Flammulina velutipes) and the cDNA clone has been sequenced (W094/12622, published 9 June 1994). Oxalate decarboxylase activities have also been described in Aspergillus niger and Aspergillus phoenices (Emiliani et al., 1964, ARCH. Biochem.
Biophys.
105:488-493), however the amino acid sequence and nucleic acid sequence encoding these enzyme activities have not been isolated or characterized.
Enzymatic assays for clinical analysis of urinary oxalate provide significant advantages in sensitivity and qualification Obzansky, et al., 1983, Clinical Chem. 29:1815-1819. For many reasons, including reactivity with interfering analytes and the high cost of available oxalate oxidase used in this diagnostic assay, alternative enzymes are needed. (Lathika et al., 1995, Analytical Letters 28: 425-442).
In this application, we disclose the isolation, cloning, and sequencing of a unique gene encoding an oxalate decarboxylase enzyme from Aspergillus phoenices. The gene is useful in producing highly purified Aspergillus phoenices oxalate decarboxylase enzyme, in producing transgenic plant cells and plants expressing the enzyme in viva, and in diagnostic assays of oxalate.
Summary of the Invention The present invention provides a nucleic acid sequence encoding oxalate decarboxylase isolated from Aspergillus phoenices (APOXD). The gene sequence [Seq ID No: l ], the recombinant protein produced therefrom [Seq ID No:2], and vectors, transformed cells, and plants containing the gene sequence are provided as individual embodiments of the invention, as well as methods using the gene or its encoded protein.
The nucleic acid is useful for producing oxalate decarboxylase for commercial applications, including degradation of oxalic acid, protection against oxalic acid toxicity, and diagnostic assays to quantify oxalate.
The nucleic acid of the invention is also useful as a selectable marker.
Growth of plant cells in the presence of oxalic acid favors survival of plant cells transformed with the coding sequence of the gene.
Applicant Ref. 0561-PCT
The present invention also includes compositions and methods for degrading oxalic acid, in providing protection against oxalic acid toxicity, and in - combating and providing protection against plant pathogens that utilize oxalate to gain access to plant tissue or otherwise in the course of the pathogenesis of the disease.
Oxalate decarboxylase from Aspergillis phoenices (APOXD) of the present invention is combined with an appropriate carrier for delivery to the soil or plants.
Alternatively, plant cells are transformed with the nucleic acid sequence of the invention for expression of APOXD in vivo.
Brief Description of the Drawings Figure 1 is a diagram showing a first primer strategy for amplification of a portion of the nucleic acid sequence encoding APOXD.
Figure 2 is a diagram showing the primer position and design of nested, gene-specific primers (arrows above diagram) for 3' RACE and the single gene specific primer (arrow beneath diagram) used for 5' RACE.
Figure 3 is a diagram showing the construction of plasmid pPHP9723 containing the l.4kb nucleic acid sequence encoding APOXD including leader and pre-sequence.
Figure 4 is a diagram of the plasmid pPHP9723.
Figure 5 is a diagram showing the plasmid pPHP9762 containing the nucleic acid sequence encoding APOXD with the fungal leader and pre-sequence replaced by the plant signal sequence of the wheat oxalate oxidase gene, Germin.
Detailed Descriution of the Invention The purified oxalate decarboxylase of the present invention has many commercial uses, including inhibiting oxalate toxicity of plants and preventing pathogenic disease in plants where oxalic acid plays a critical role. It has been suggested that degradation of oxalic acid is a preventative measure, e.g., to prevent invasion of a pathogen into a plant, or during pathogenesis, when oxalic acid concentrations rise (Dumas, et al., 1994, Supra). The gene of the invention is also useful as a selectable marker of transformed cells, for diagnostic assay of oxalate, and for production of the enzyme in plants.
- Oxalate decarboxylase converts oxalate to C02 and formic acid. A gene encoding oxalate decarboxylase has been isolated from Collybia velutipes (now termed Flammulina velutipes) and the cDNA clone has been sequenced (W094/12622, published 9 June 1994). Oxalate decarboxylase activities have also been described in Aspergillus niger and Aspergillus phoenices (Emiliani et al., 1964, ARCH. Biochem.
Biophys.
105:488-493), however the amino acid sequence and nucleic acid sequence encoding these enzyme activities have not been isolated or characterized.
Enzymatic assays for clinical analysis of urinary oxalate provide significant advantages in sensitivity and qualification Obzansky, et al., 1983, Clinical Chem. 29:1815-1819. For many reasons, including reactivity with interfering analytes and the high cost of available oxalate oxidase used in this diagnostic assay, alternative enzymes are needed. (Lathika et al., 1995, Analytical Letters 28: 425-442).
In this application, we disclose the isolation, cloning, and sequencing of a unique gene encoding an oxalate decarboxylase enzyme from Aspergillus phoenices. The gene is useful in producing highly purified Aspergillus phoenices oxalate decarboxylase enzyme, in producing transgenic plant cells and plants expressing the enzyme in viva, and in diagnostic assays of oxalate.
Summary of the Invention The present invention provides a nucleic acid sequence encoding oxalate decarboxylase isolated from Aspergillus phoenices (APOXD). The gene sequence [Seq ID No: l ], the recombinant protein produced therefrom [Seq ID No:2], and vectors, transformed cells, and plants containing the gene sequence are provided as individual embodiments of the invention, as well as methods using the gene or its encoded protein.
The nucleic acid is useful for producing oxalate decarboxylase for commercial applications, including degradation of oxalic acid, protection against oxalic acid toxicity, and diagnostic assays to quantify oxalate.
The nucleic acid of the invention is also useful as a selectable marker.
Growth of plant cells in the presence of oxalic acid favors survival of plant cells transformed with the coding sequence of the gene.
Applicant Ref. 0561-PCT
The present invention also includes compositions and methods for degrading oxalic acid, in providing protection against oxalic acid toxicity, and in - combating and providing protection against plant pathogens that utilize oxalate to gain access to plant tissue or otherwise in the course of the pathogenesis of the disease.
Oxalate decarboxylase from Aspergillis phoenices (APOXD) of the present invention is combined with an appropriate carrier for delivery to the soil or plants.
Alternatively, plant cells are transformed with the nucleic acid sequence of the invention for expression of APOXD in vivo.
Brief Description of the Drawings Figure 1 is a diagram showing a first primer strategy for amplification of a portion of the nucleic acid sequence encoding APOXD.
Figure 2 is a diagram showing the primer position and design of nested, gene-specific primers (arrows above diagram) for 3' RACE and the single gene specific primer (arrow beneath diagram) used for 5' RACE.
Figure 3 is a diagram showing the construction of plasmid pPHP9723 containing the l.4kb nucleic acid sequence encoding APOXD including leader and pre-sequence.
Figure 4 is a diagram of the plasmid pPHP9723.
Figure 5 is a diagram showing the plasmid pPHP9762 containing the nucleic acid sequence encoding APOXD with the fungal leader and pre-sequence replaced by the plant signal sequence of the wheat oxalate oxidase gene, Germin.
Detailed Descriution of the Invention The purified oxalate decarboxylase of the present invention has many commercial uses, including inhibiting oxalate toxicity of plants and preventing pathogenic disease in plants where oxalic acid plays a critical role. It has been suggested that degradation of oxalic acid is a preventative measure, e.g., to prevent invasion of a pathogen into a plant, or during pathogenesis, when oxalic acid concentrations rise (Dumas, et al., 1994, Supra). The gene of the invention is also useful as a selectable marker of transformed cells, for diagnostic assay of oxalate, and for production of the enzyme in plants.
Applicant Ref. 0561-PCT
Nucleic Acid Seguence Encoding APOXD
A nucleic acid sequence encoding APOXD [Seq. ID No: 1 ] has now been determined by methods described more fully in the Examples below. Briefly, DNA
encoding APOXD was obtained by amplification of genomic A. phoenices DNA using a RACE strategy as described in Innis et. al., eds., 1990, PCR Protocols. A
Guide to Methods and Applications, Academic Press, San Diego, CA, pages 28-38. See also pages 39-45, "Degenerate primers". The nucleic acid sequence and its deduced amino acid sequence (Seq. ID No:2] are shown below in Table 1. The predicted signal peptide [Seq.
ID No: 3] and pre-protein (Seq. ID No: 4] are shown along with the potential cleavage site between them as determined by computer analysis using PC gene software (IntelliGenetics, Inc., Mountain View, CA). The mature protein [Seq. ID No: 5]
is also indicated. This 1.4 kb sequence encodes a 458 amino acid enzyme subunit with a calculated molecular weight of 51,994 daltons. Southern hybridization indicates that the enzyme is encoded by a single gene in the Aspergillis phoenices genome. The plasmid pPHP9685 containing the nucleic acid sequence encoding APOXD as an insert was deposited with the A.T.C.C. on March 18, 1997, having Accession No. 97959.
SEQUENCE OF FULL LENGTH APOXD DNA
(Signal Peptide W
(Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly (Pre-protein ~
IGly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala (Mature Protein fi TCA TCC CCA TCC ATT TTC TTC (CAA GAC AAG CCA TTC ACC CCC GAT CAT 197 Ser Ser Pro Ser Ile Phe Phe IGln Asp Lys Pro Phe Thr Pro Asp His Applicant Ref. 0561-PCT
Nucleic Acid Seguence Encoding APOXD
A nucleic acid sequence encoding APOXD [Seq. ID No: 1 ] has now been determined by methods described more fully in the Examples below. Briefly, DNA
encoding APOXD was obtained by amplification of genomic A. phoenices DNA using a RACE strategy as described in Innis et. al., eds., 1990, PCR Protocols. A
Guide to Methods and Applications, Academic Press, San Diego, CA, pages 28-38. See also pages 39-45, "Degenerate primers". The nucleic acid sequence and its deduced amino acid sequence (Seq. ID No:2] are shown below in Table 1. The predicted signal peptide [Seq.
ID No: 3] and pre-protein (Seq. ID No: 4] are shown along with the potential cleavage site between them as determined by computer analysis using PC gene software (IntelliGenetics, Inc., Mountain View, CA). The mature protein [Seq. ID No: 5]
is also indicated. This 1.4 kb sequence encodes a 458 amino acid enzyme subunit with a calculated molecular weight of 51,994 daltons. Southern hybridization indicates that the enzyme is encoded by a single gene in the Aspergillis phoenices genome. The plasmid pPHP9685 containing the nucleic acid sequence encoding APOXD as an insert was deposited with the A.T.C.C. on March 18, 1997, having Accession No. 97959.
SEQUENCE OF FULL LENGTH APOXD DNA
(Signal Peptide W
(Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly (Pre-protein ~
IGly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala (Mature Protein fi TCA TCC CCA TCC ATT TTC TTC (CAA GAC AAG CCA TTC ACC CCC GAT CAT 197 Ser Ser Pro Ser Ile Phe Phe IGln Asp Lys Pro Phe Thr Pro Asp His Applicant Ref. 0561-PCT
NruI
Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Applicant Ref. 0561-PCT
Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Applicant Ref. 0561-PCT
Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro GIu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro GIy Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp ,. Applicant Ref. 0561-PCT
- '7 -Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (Stop 9~;i.: ~~~i ~ a _,'~;~,.
l~ ~ .. ~ .. ~~. I
,~, x u, _ , .- ~,.. , 2: .'~ a s '~h. . . I , ._ ~, r, w I ~~:.m II -s ' r ~~. .
~ ~.' 1:. , ' , ' . ~fl. S
'.~T',~1 ~ ~.E~,G ~~I ~ . , ~7 CJ
~ r r. .
~... ~a. ~~: ~ ~~ ~, ~ ~,..d' ~;~
e~. N , S. l , ' w Y.
!~ ..7" 4.~,.mr:;~ ~i . .. ~$.''~h~t~~~IF~ 1 ,16 ,c S~ t , I?;; ,., ii ..7w..~9.d.".. W~IIe'a.
:~1. 'k, ~''i4 1 :F.x._;'., "x tV'~7d~Aw~a~..'"Y~~I~ _w'y _.~..-r.Ji~
;'.~wh~, n .-1.4 kb gene 1-1437 1 Encoded 24-1397 1-458 2 Protein Signal Peptide 24-101 1-26 3 Pre-protein 102-1397 27-458 4 Mature Protein 71-1394 50-457 5 Redundancy in the genetic code permits variation in the gene sequences shown in Table 1. In particular, one skilled in the art will recognize specific codon preferences by a specific host species and can adapt the disclosed sequence as preferred for the desired host. For example, rare codons having a frequency of less than about 20%
in known sequence of the desired host are preferably replaced with higher frequency codons. Codon preferences for a specific organism may be calculated, for example, codon usage tables available on the INTERNET at the following address:
http://www.dna.affrc.go.jp/~nakamura/codon.html. One specific program available for Arabidopsis is found at: http://genome-www.stanford.edu/Arabidopsis/codon usage.html.
- '7 -Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (Stop 9~;i.: ~~~i ~ a _,'~;~,.
l~ ~ .. ~ .. ~~. I
,~, x u, _ , .- ~,.. , 2: .'~ a s '~h. . . I , ._ ~, r, w I ~~:.m II -s ' r ~~. .
~ ~.' 1:. , ' , ' . ~fl. S
'.~T',~1 ~ ~.E~,G ~~I ~ . , ~7 CJ
~ r r. .
~... ~a. ~~: ~ ~~ ~, ~ ~,..d' ~;~
e~. N , S. l , ' w Y.
!~ ..7" 4.~,.mr:;~ ~i . .. ~$.''~h~t~~~IF~ 1 ,16 ,c S~ t , I?;; ,., ii ..7w..~9.d.".. W~IIe'a.
:~1. 'k, ~''i4 1 :F.x._;'., "x tV'~7d~Aw~a~..'"Y~~I~ _w'y _.~..-r.Ji~
;'.~wh~, n .-1.4 kb gene 1-1437 1 Encoded 24-1397 1-458 2 Protein Signal Peptide 24-101 1-26 3 Pre-protein 102-1397 27-458 4 Mature Protein 71-1394 50-457 5 Redundancy in the genetic code permits variation in the gene sequences shown in Table 1. In particular, one skilled in the art will recognize specific codon preferences by a specific host species and can adapt the disclosed sequence as preferred for the desired host. For example, rare codons having a frequency of less than about 20%
in known sequence of the desired host are preferably replaced with higher frequency codons. Codon preferences for a specific organism may be calculated, for example, codon usage tables available on the INTERNET at the following address:
http://www.dna.affrc.go.jp/~nakamura/codon.html. One specific program available for Arabidopsis is found at: http://genome-www.stanford.edu/Arabidopsis/codon usage.html.
Applicant Ref. 0561-PCT
Additional sequence modifications are known to enhance protein expression in a cellular host. These include elimination of sequences encoding spurious - polyadenylation signals, exon/intron splice site signals, transposon-like repeats, and other such well-characterized sequences which may be deleterious to gene expression.
The G
C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. Where possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
Other useful modifications include the addition of a translational initiation consensus sequence at the start of the open reading frame, as described in Kozak, 1989, Mol Cell Biol. 9:5073 5080.
In addition, the native APOXD gene or a modified version of the APOXD
gene might be further optimized for expression by omitting the predicted signal and pre-sequence, replacing the signal sequence with another signal sequence, or replacing the signal and pre-sequence with another signal sequence. Any one of the possible APOXD
gene variations may work best when combined with a specific promoter and/or termination sequence.
The term "selectively hybridizes" includes a reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detestably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences typically have about at least 80% sequence identity, preferably 90% sequence identity, and most preferably 100%
sequence identity (i.e., complementary) with each other.
The terms "stringent conditions" or "stringent hybridization conditions"
include reference to conditions under which a probe will hybridize to its target sequence, to a detestably greater degree than other sequences (e.g., at least 2-fold over background).
Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences can be identified which are 100% complementary to the probe (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected Applicant Ref. 0561-PCT
(heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, optionally less than 500 nucleotides in length.
Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI, 1 % SDS (sodium dodecyl sulphate) at 37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in O.SX to 1X
SSC at 55 to 60°C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in O.1X SSC at 60 to 65°C.
Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl, Aaal. Biochem., 138:267-284 (1984): Tm = 81.5°C + 16.6 (log M) +
0.41 (%CG) -0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %CG is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs.
The Tm is the temperature (under defined ionic strength and pH) at which SO%
of a complementary target sequence hybridizes to a perfectly matched probe. Tm is reduced by about I°C for each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with > 90% identity are sought, the Tm can be decreased 10°C.
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence and its complement at a defined ionic strength and pH.
However, severely stringent conditions can utilize a hybridization and/or wash at l, 2, 3, or 4° lower than the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10°C lower than the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or wash at 11, 12, Applicant Ref. 0561-PCT
- 1~ -13, 14, 15, or 20°C lower than the thermal melting point (Tn,). Using the equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a Tm of less than 45°C (aqueous solution) or 32°C (formamide solution) it is preferred to increase the SSC
concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, Chapter "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995).
As used herein, "vector" includes reference to a nucleic acid used in transfection of a host cell and into which can be inserted a polynucleotide.
Vectors are often replicons. Expression vectors permit transcription of a nucleic acid inserted therein.
The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison windows", (c) "sequence identity", and (d) "percentage of sequence identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.
(b) As used herein, "comparison window" means includes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Applicant Ref. 0561-PCT
- m -Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman. Adv. Appl. Math. 2: 482 (1981); by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol 48: 443 (1970);
by the search for similarity method of Pearson and Lipman, PYOC. Natl. Acad.
Sci. 85:
2444 (1988); by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PCIGene program by Intelligenetics, Mountain View, California, GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wisconsin, USA; the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (I988); Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 155-65 (1992), and Pearson, et al., Methods in Molecular Biology 24: 307-331 (1994). The BLAST family of programs which can be used for database similarity searches includes: BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX for nucleotide query sequences against protein database sequences;
BLASTP for protein query sequences against protein database sequences; TBLASTN
for protein query sequences against nucleotide database sequences; and TBLASTX for nucleotide query sequences against nucleotide database sequences. See, Current Protocols in Molecular Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995).
GAP uses the algorithm of Needleman and Wunsch (J Mol Biol 48: 443-453 (1970)) to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. GAP considers all possible alignments and gap positions and creates the alignment with the largest number of matched bases and the fewest gaps. Default gap creation penalty values and gap extension penalty values in Version 10 of the Wisconsin Genetics Software Package are 8 and 2, respectively, for protein sequences. For nucleotide sequences the default gap creation penalty is 50 while the default gap extension penalty is 3. The gap creation and gap extension penalties can be expressed as an integer selected form the group of integers consisting of form 0 to 100.
Thus, for example, the gap creation and gap extension penalties can be 0, l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, or greater.
Applicant Ref. 0561-PCT
GAP presents one member of the family of best alignments. There may be many members of this family, but no other member has a better quality. GAP
displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity. The Quality is the metric maximized in order to align the sequences. Ratio is the quality divided by the number of bases in the shorter segment. Percent Identity is the percent of the symbols that actually match. Percent Similarity is the percent of the symbols that are similar.
Symbols that are across from gaps are ignored. A similarity is scored when the scoring matrix value for a pair of symbols is greater than or equal to 0.50, the similarity threshold.
The scoring matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff and Henikoff, Proc Natl Acad Sci USA 89:10915).
Unless otherwise stated, sequence identity/similarity values provided herein refer to the value obtained using GAP version 10 of Wisconsin Genetic Software Package using default parameters.
APOXD Protein The recombinant APOXD protein produced from the disclosed nucleic acid sequence provides a substantially pure protein useful to degrade oxalate, particularly in applications where highly purified enzymes are required. The recombinant protein may be used in enzymatic assays of oxalate or added to compositions containing oxalate to induce oxalate degradation.
When used externally, the enzyme can be placed in a liquid dispersion or solution, or may be mixed with a carrier solid for application as a dust or powder. The particular method of application and carrier used will be determined by the particular plant and pathogen target. Such methods are known, and are described, for example, in U.S. Patent No. 5,488,035 to Rao:
Gene Delivery The nucleic acid sequence encoding APOXD may be delivered to plant cells for transient transfections or for incorporation into the plant's genome by methods know in the art. Preferably, the gene is used to stably transform plant cells for expression of the protein in vivo.
To accomplish such delivery, the gene containing the coding sequence for APOXD may be attached to regulatory elements needed for the expression of the gene in Applicant Ref. 0561-PCT
a particular host cell or system. These regulatory elements include, for example, promoters, terminators, and other elements that permit desired expression of the enzyme in a particular plant host, in a particular tissue or organ of a host such as vascular tissue, root, leaf, or flower, or in response to a particular signal.
Promoters A promoter is a DNA sequence that directs the transcription of a structural gene, e.g., that portion of the DNA sequence that is transcribed into messenger RNA
(mRNA) and then translated into a sequence of amino acids characteristic of a specific polypeptide. Typically, a promoter is located in the 5' region of a gene, proximal to the transcriptional start site. A promoter may be inducible, increasing the rate of transcription in response to an inducing agent. In contrast, a promoter may be constitutive, whereby the rate of transcription is not regulated by an inducing agent. A promoter may be regulated in a tissue-specific or tissue-preferred manner, such that it is only active in transcribing the operably linked coding region in a specific tissue type or types, such as plant leaves, roots, or meristem.
Inducible Promoters An inducible promoter useful in the present invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the inducible promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD. With an inducible promoter, the rate of transcription increases in response to an inducing agent.
Any inducible promoter can be used in the present invention to direct transcription of APOXD, including those described in Ward, et al., 1993, Plant Molecular Biol. 22: 361:-366. Exemplary inducible promoters include that from the ACEl system which responds to copper (Mett et al., 1993, PNAS 90: 4567-4571); In2 gene promoter from maize which responds to benzenesulfonamide herbicide safeners (Hershey et al., 1991, Plant Mol. Biol. 17:679-690; and the Tet repressor from TnlO (Hersey, et al., 1991, Mol. Gen. Genetics 227:229-237; Gatz, et al., 1994, Mol.Gen. Genetics 243:32-38).
A particularly preferred inducible promoter is one that responds to an inducing agent to which plants do not normally respond. One example of such a promoter is the steroid hormone gene promoter. Transcription of the steroid hormone gene Applicant Ref. 0561-PCT
promoter is induced by glucocorticosteroid hormone. (Schena et al., 1991, PNAS
U.S.A.
88:10421) - In the present invention, an expression vector comprises an inducible promoter operably linked to a nucleotide sequence encoding APOXD. The expression vector is introduced into plant cells and presumptively transformed cells are exposed to an inducer of the inducible promoter. The cells are screened for the presence of APOXD
proteins by immunoassay methods or by analysis of the enzyme's activity.
Pathogen-Inducible Promoters A pathogen-inducible promoter of the present invention is an inducible promoter that responds specifically to the inducing agent, oxalic acid, or to plant pathogens such as oxalic acid-producing pathogens including Sclerotinia sclerotiorum.
Genes that produce transcripts in response to Sclerotinia and oxalic acid have been described in Mouley et al., 1992, Plant Science 85:51-59. One member of the prpl-1 gene family contains a promoter that is activated in potato during early stages of late blight infection and is described in Martini et al., 1993, Mol.Gen.Genet.
236:179-186.
Tissue-specific or Tissue-Preferred Promoters A tissue specific promoter of the invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the tissue-specific promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD. Plants transformed with a gene encoding APOXD operably linked to a tissue specific promoter produce APOXD
protein exclusively, or preferentially, in a specific tissue.
Any tissue-specific or tissue-preferred promoter can be utilized in the instant invention. Examples of such promoters include a root-preferred promoter such as that from the phaseolin gene as described in Murai et al., 1983, Science 222:476-482 and in Sengupta-Gopalan et al., 1985, PNAS USA 82:3320-3324; a leaf specific and light-induced promoter such as that from cab or rubisco as described in Simpson et al., 1985, EMBO J. 4(11):2723-2729, and in Timko et al., 1985, Nature 318:579-582; an anther-specific promoter such as that from LAT52 as described in Twell et al., 1989, Mol. Gen.
Genet. 217:240-245; a pollen-specific promoter such as that from Zml3 as described in Guerrero et al., 1990, Mol.Gen. Genet. 224:161-168; and a microspore-preferred Applicant Ref. 0561-PCT
promoter such as that from apg as described in Twell et al., 1993, Sex. Plant Reprod.
6:217-224.
. Other tissue-specific promoters useful in the present invention include a phloem-preferred promoter such as that associated with the Arabidopsis sucrose synthase gene as described in Martin et al., 1993, The Plant Journal 4(2):367-377; a floral-specific promoter such as that of the Arabidopsis HSP 18.2 gene described in Tsukaya et al., 1993, Mol. Gen. Genet. 237:26-32 and of the Arabidopsis HMG2 gene as described in Enjuto et al., 1995, Plant Cell 7:517-527.
An expression vector of the present invention comprises a tissue-specific or tissue-preferred promoter operably linked to a nucleotide sequence encoding APOXD.
The expression vector is introduced into plant cells. The cells are screened for the presence of APOXD protein by immunological methods or by analysis of enzyme activity.
Constitutive Promoters A constitutive promoter of the invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the constitutive promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD.
Many different constitutive promoters can be utilized in the instant invention to express APOXD. Examples include promoters from plant viruses such as the 35S promoter from cauliflower mosaic virus (CaMV), as described in Odell et al., 1985, Nature 313:810-812, and promoters from genes such as rice actin (McElroy et al., 1990, Plant Cell 2:163-171); ubiquitin (Christensen et al., 1989, Plant Mol. Biol.
12:619-632;
and Christensen et al., 1992, Plant Mol. Biol 18:675-689); pEMU (Last et al., 1991, Theor. Appl. Genet. 81:581-588); MAS (Velten et al., 1984, EMBO J. 3:2723-2730); and maize H3 histone (Lepetit et al., 1992, Mol.Gen.Genet. 231:276-285; and Atanassvoa et al., 1992, Plant Journal 2(3):291-300).
The ALS promoter, a Xba/NcoI fragment 5' to the Brassica napus ALS3 structural gene, or a nucleotide sequence having substantial sequence similarity to the XbaI/NcoI fragment, represents a particularly useful constitutive promoter, and is described in published PCT Application number WO 96/30530.
In the present invention, an expression vector comprises a constitutive promoter operably linked to a nucleotide sequence encoding APOXD. The expression r Applicant Ref. 0561-PCT
vector is introduced into plant cells and presumptively transformed cells are screened for the presence of APOXD proteins by immunoassay methods or by analysis of the enzyme's activity.
Additional regulatory elements that may be connected to the APOXD
nucleic acid sequence for expression in plant cells include terminators, polyadenylation sequences, and nucleic acid sequences encoding signal peptides that permit localization within a plant cell or secretion of the protein from the cell. Such regulatory elements and methods for adding or exchanging these elements with the regulatory elements of the APOXD gene are known, and include, but are not limited to, 3'termination and/or polyadenylation regions such as those of the Agrobacterium tumefaciens nopaline synthase (nos) gene (Bevan et al., 1983, Nucl. Acids Res. 11(2):369-385); the potato proteinase inhibitor II (PINII) gene (Keil. et al., 1986, Nucl. Acids Res.
14:5641-5650; and An et al., 1989, Plant Cell 1:115-122); and the CaMV 19S gene (Molten et al., 1990, Plant Cell 2:1261-1272).
Plant signal sequences, including, but not limited to, signal-peptide encoding DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell (Dratewka-Kos, et al., J. Biol. Chem. 264:4896-4900, 1989) and the Nicotiana plumbaginifolia extensin gene (DeLoose, et al., Gene 99:95-100, 1991), or signal peptides which target proteins to the vacuole like the sweet potato sporamin gene (Matsuoka, et al., PNAS 88:834, 1991) and the barley lectin gene (Wilkins, et al., Plant Cell, 2:301-313, 1990), or signals which cause proteins to be secreted such as that of PRIb (Lund, et al., Plant Mol. Biol. 18:47-53, 1992), or those which target proteins to the plastids such as that of rapeseed enoyl-Acp reductase (Verwoert, et al., Plant Mol. Biol.
26:189-202, 1994) are useful in the invention.
Gene Transformation Methods Numerous methods for introducing foreign genes into plants are known and can be used to insert the APOXD gene into a plant host, including biological and physical plant transformation protocols. See, for example, Miki et al., 1993, "Procedure for Introducing Foreign DNA into Plants" in: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 67-88.
The methods chosen vary with the host plant, and include chemical transfection methods such as calcium phosphate, microorganism-mediated gene transfer such as Applicant Ref. 0561-PCT
- i~ -Agrobacterium (Horsch, et al., Science 227:1229-31, 1985), electroporation, micro-injection, and biolistic bombardment.
Expression cassettes and vectors and in vitro culture methods for plant cell or tissue transformation and regeneration of plants are known and available.
See, for example, Gruber, et al., 1993, "Vectors for Plant Transformation" In: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 89-119.
A~robacte~ium-mediated Transformation The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium. A.
tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectfully, carry genes responsible for genetic transformation of plants. See, for example, Kado, 1991, Crit. Rev.Plant Sci. 10(1):1-32. Descriptions of the Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer are provided in Gruber et al., supra;
Miki, et al., supra; and Moloney, et al., 1989, Plant Cell Reports 8:238.
Direct Gene Transfer Despite the fact that the host range for Agrobacterium-mediated transformation is broad, some major cereal crop species and gymnosperms have generally be recalcitrant to this mode of gene transfer, even though some success has recently been achieved in rice (Hiei et al., 1994, The Plant Journal 6(2):271-282). Several methods of plant transformation, collectively referred to as direct gene transfer, have been developed as an alternative to Agrobacterium-mediated transformation.
A generally applicable method of plant transformation is microprojectile-mediated transformation, where DNA is carried on the surface of microprojectiles measuring about 1 to 4 pm. The expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds of 300 to 600 mls which is sufficient to penetrate the plant cell walls and membranes. (Sanford et al., 1987, Part.Sci.
Technol 5:27; Sanford, 1988, Trends Biotech 6:299; Sanford, 1990, Physiol.
Plant 79:206; Klein et al., 1992, Biotechnology 10:268) Another method for physical delivery of DNA to plants is sonication of target cells as described in Zhang et al., 1991, BiolTechnology 9:996.
Alternatively, Applicant Ref. 0561-PCT
liposome or spheroplast fusions have been used to introduce expression vectors into plants. See, for example, Deshayes et al., 1985, EMBO J. 4:2731-2737; and Christou, et - al., 1987, PNAS USA 84:3962-3966. Direct uptake of DNA into protoplasts using CaCl2 precipitation, polyvinyl alcohol or poly-L-ornithine have also been reported.
See, for example, Hain et al., 1985, Mol. Gen.Genet. 199:161; and Draper, et al., 1982, Plant &
Cell Physiol. 23:451.
Electroporation of protoplasts and whole cells and tissues has also been described. See, for example, D'Halluin, et al., 1992, Plant Cell 4:1495-1505;
and Spencer, et al., 1994, PlantMol.Biol. 24:51-61.
Particle Woundin~lA~robacterium Delivery Another useful basic transformation protocol involves a combination of wounding by particle bombardment, followed by use of Agrobacterium for DNA
delivery, as described by Bidney, et al. 1992, Plant Mol. Biol. 18:301-313. Useful plasmids for plant transformation include pPHP9762 shown in Figure 5. The binary backbone for pPHP9762 is pPHP6333. See Bevan, 1984, Nucleic Acids Research 12:8711-8721.
This protocol is preferred for transformation of sunflower plants, and employs either the "intact meristem" method or the "split meristem" method.
In general, the intact meristem transformation method (Bidney, et al., SupYa) involves imbibing seed for 24 hours in the dark, removing the cotyledons and root radical, followed by culturing of the meristem explants. Twenty-four hours later, the primary leaves are removed to expose the apical meristem. The explants are placed apical dome side up and bombarded, e.g., twice with particles, followed by co-cultivation with Agrobacterium. To start the co-cultivation for intact meristems, Agrobacterium is placed on the meristem. After about a 3-day co-cultivation period the meristems are transferred to culture medium with cefotaxime (plus kanamycin for the NPTII selection).
Selection can also be done using kanamycin.
The split meristem method involves imbibing seed, breaking of the cotyledons to produce a clean fracture at the plane of the embryonic axis, excising the root tip and then bisecting the explants longitudinally between the primordial leaves (Malone-Schoneberg et al., 1994, Plant Science 103:199-207). The two halves are placed cut surface up on the medium then bombarded twice with particles, followed by co-cultivation with Agrobacterium. For split meristems, after bombardment the meristems Applicant Ref. 0561-PCT
are placed in an Agrobacterium suspension for 30 minutes. They are then removed from the suspension onto solid culture medium for three day co-cultivation. After this period, - the meristems are transferred to fresh medium with cefotaxime (plus kanamycin for selection).
Transfer by Plant Breeding Alternatively, once a single transformed plant has been obtained by the foregoing recombinant DNA method, conventional plant breeding methods can be used to transfer the structural gene and associated regulatory sequences via crossing and backcrossing. Such intermediate methods will comprise the further steps of:
(1) sexually crossing the disease-resistant plant with a plant from the disease-susceptible taxon; (2) recovering reproductive material from the progeny of the cross; and (3) growing disease-resistant plants from the reproductive material. Where desirable or necessary, the agronomic characteristics of the susceptible taxon can be substantially preserved by expanding this method to include the further steps of repetitively: (1) backcrossing the disease-resistant progeny with disease-susceptible plants from the susceptible taxon; and (2) selecting for expression of APOXD activity (or an associated marker gene) among the progeny of the backcross, until the desired percentage of the characteristics of the susceptible taxon are present in the progeny along with the gene imparting APOXD
activity.
By the term "taxon" herein is meant a unit of botanical classification of genus or lower. It thus includes genus, species, cultivars, varieties, variants and other minor taxonomic groups which lack a consistent nomenclature.
Assay Methods Transgenic plant cells, callus, tissues, shoots, and transgenic plants are tested for the presence of the APOXD gene by DNA analysis (Southern blot or PCR) and for expression of the gene by immunoassay or by assay of oxalate decarboxylase activity.
Tolerance to exogenous oxalic acid can also be used as a functional test of enzyme expression in transformed plants.
APOXD ELISA
Transgenic cells, callus, plants and the like are screened for the expression of APOXD protein by immunological assays, including ELISA. Anti-APOXD
antibodies are generated against APOXD preparations by known methods and are used in typical Applicant Ref. 0561-PCT
ELISA reactions. Polyclonal anti-APOXD can, for example, detect a range of about 10-100 pg APOXD protein in transgenic plant tissues.
In a suitable method for an APOXD-ELISA assay, fresh leaf or callus tissue is homogenized and centrifuged. An aliquot of the supernatant is added to a microtiter plate with a first anti-APOXD antibody and incubated for sufficient time for antibody-antigen reaction. The bound antibody is then reacted with a second antibody linked to a marker, which marker is developed or otherwise converted to a detectable signal correlated to the amount of APOXD protein in the sample. Any of the known methods for producing antibodies and utilizing such antibodies in an immunoassay can be used to determine the amount of APOXD expressed in transgenic plant cells and tissues of the invention.
Oxalate Decarboxylase Assay Transgenic cells, tissue, or plants expressing the APOXD gene are assayed for enzyme activity to verify expression of the gene. In general, the cells or tissue is frozen in liquid nitrogen, placed on a lyophilizer overnight to dehydrate, then crushed into a fme powder for use in the assay reaction. Leaf tissue is homogenized as fresh tissue in the reaction mixture, or dehydrated and treated as described above.
A typical assay reaction is begun by adding 0.75 mg of powdered tissue, such as callus, to 1 ml of oxalate decarboxylase reaction mixture: 900 ~1 0.2 M sodium phosphate buffer, pH 5.0, and 100 ~l of 10 mM sodium oxalate, pH 5Ø The reaction is incubated at room temperature for 3 hours with gentle mixing, and is stopped by the addition of 150 p1 of 1 M Tris-HCI, pH 7Ø The mixture is centrifuged, and an aliquot is placed in a cuvette with NAD (600 fig) and formate dehydrogenase (200 pg). The absorbance at 340 nm is correlated to the activity of the APOXD enzyme.
Use of Oxalate Decarboxylase as a Selectable Marker Oxalate decarboxylase is useful in selecting successful transformants, e.g., as a selectable marker. Growth of plant cells in the presence of oxalic acid favors the survival of plant cells that have been transformed with a gene encoding an oxalate-degrading enzyme, such as APOXD. In published PCT application WO 94/13790, herein incorporated by reference, plant cells grown on a selection medium containing oxalic acid (and all of the elements necessary for multiplication and differentiation of plant cells) demonstrated selection of only those cells transformed with and expressing oxalate Applicant Ref. 0561-PCT
oxidase. In like manner, transformation. and expression of the gene encoding APOXD in plant cells is used to degrade oxalic acid present in the media and allow the growth of - only APOXD-gene transformed cells.
Production of APOXD in Plants Trangenic plants of the present invention, expressing the APOXD gene, are used to produce oxalate decarboxylase in commercial quantities. The gene transformation and assay selection techniques described above yield a plurality of transgenic plants which are grown and harvested in a conventional manner.
Oxalate decarboxylase is extracted from the plant tissue or from total plant biomass.
Oxalate decarboxylase extraction from biomass is accomplished by known methods. See for example, Heney and Orr, 1981, Ahal. Biochem. 114:92-96.
In any extraction methodology, losses of material are expected and costs of the procedure are also considered. Accordingly, a minimum level of expression of oxalate decarboxylase is required for the process to be deemed economically worthwhile.
The terms "commercial" and "commercial quantities" here denote a level of expression where at least 0.1% of the total extracted protein is oxalate decarboxylase.
Higher levels of oxalate decarboxylase expression are preferred.
Diagnostic Oxalate Assay Clinical measurement of oxalic acid in urine is important, for example, in the diagnosis and treatment of patients with urinary tract disorders or hyperoxaluric syndromes. The recombinant APOXD enzyme of the invention is preferably immobilized onto beads or solid support, or added in aqueous solution to a sample for quantitation of oxalate. As discussed above, oxalate decarboxylase catalyzes the conversion of oxalate to C02 and formic acid. A variety of detection systems can be utilized to quantify this enzyme catalyzed conversion, including methods for detecting an increase in CO2, or for detecting an increase in formic acid.
For example, the conversion of oxalate to formic acid and COZ is assayed by determining formate production via the reduction of NAD in the presence of formate dehydrogenase. This method is described in Lung, et al., 1994, J.
Bacteriology, 176:2468-2472 and Johnson, et al., 1964, Biochem. Biophys. Acta 89:35.
A calibration curve is generated using known amounts of oxalic acid. The amount of oxalate in a specimen is extrapolated from the standard curve.
Applicant Ref. 0561-PCT
Other enzymatic assays and the like are adapted by known methods to utilize the APOXD enzyme to detect conversion of oxalate.
EXAMPLES
The invention is described more fully below in the following Examples, which are exemplary in nature and are not intended to limit the scope of the invention in any way.
Example 1 Cloning of the Gene Encoding APOXD
Protein Seguence A commercial preparation of A. phoenices oxalate decarboxylase enzyme was obtained from Boehringer Mannheim. (Catalog #479 586) SDS polyacrylamide gel electrophoresis was used to determine the purity of the enzyme. Only one dark band appeared following Coomassie blue staining of the polyacrylamide gel (12.5%).
This band was about 49 kd in size, as determined by comparison to molecular weight markers.
Aliquots of the preparation were sent to the University of Michigan for sequence analysis by Edman degradation on an automated protein sequencer. Preparative polyacrylamide gels were run and the APOXD band was isolated from the gel prior to sequencing. The protein was first sequenced at the amino terminus. Proteins were chemically cleaved into fragments by cyanogen bromide, size separated on polyacrylamide gels, and isolated as bands on the gel for further preparation and sequencing. The results of the sequencing are shown below in Table 2.
Peptide Seq uence* Seq. ID
No.
ammo termlnLiSG1n AspLys Pro Phe ThrPro Asp HisArg 6 Asp ProTyr Asp His LysVal Asp AlaIle Gly GluX Pro Leu His Glu fragment Val IleArg Glu Leu HisTrp His ArgGlu 7 Ala Gly fragment Arg LeuAsp Glu Gly ValIle Arg GluLeu His CysHis Arg 1u G Ala Glu fragment Ser TyrPhe Lys Arg GlyArg Ala ArgTyr Thr IlePhe Ala la a g A Glu Ar Gly Asn Al fragment Ser AlaHis Thr Pro ProSer Val LeuAla Gly Asn * X = Unknown.
a Applicant Ref. 0561-PCT
PCR Amplification of Genomic A. phoenices Genomic DNA was used as the PCR template to amplify the APOXD
- sequence. Aspergillus phoenices was obtained from the American Type Culture Collection (ATCC), Rockville, MD. Cultures were established on solid potato dextrose agar medium (Difco formulation). Liquid stationary cultures were started from culture plates by innoculatory spores in a minimal growth medium previously described for the culture of Aspergillus strains (Emiliani, et al., 1964, Arch. Biochem. Biophys 105:488-493, cited above).
To isolate DNA, mycelial mats were recovered from 4-day liquid stationary cultures, washed in cold water, and blotted dry. The tissue was then frozen in liquid nitrogen, ground by mortar and pestle, and stored frozen at -80°C. DNA was extracted by the method described for fungal mycelium in Sunis et al. (eds.), 1990, PCR
protocols, pages 282-287.
PCR Strategy As diagrammed in Figure 1, primers were designed for both the N-terminal protein sequence and for an internal peptide fragment. One set of primers (PHN
[Seq ID No. 11] and PHN 11339 [Seq ID No. 12]) was designed with nearly full degeneracy. A second set of primers (PHN 11471 [Seq. ID No. 13] and PHN 11476 [Seq ID No. 14]) was designed with no degeneracy. These were based on a codon usage table for Aspergillus niger generated using the Wisconsin Sequence Analysis Package (GCG) (Genetics Computer Group, Inc., Madison, WI). The sequences of these primers is shown in Table 3, below, and diagrammatically in Figure 1. Table 3 shows the degenerate primer mixtures using ICTPAC designations, as described in Cornish-Bowden, 1985, Nucleic Acids Res. 13:3021-3030. The lUPAC nucleic acid symbols include: Y=C
or T;
N=A, T, C, or G; R=A or G; D=A, T, or G; and V=A, C, or G. Both of these PCR
strategies were successful in amplifying a DNA fragment, shown in Table 4, having homology to the protein sequence data shown in Table 2.
Primer Sets-3') Seq. ID
(5' #
CAU TA
CUA VA
CTT
Applicant Ref. 0561-PCT
PCR reactions were set up containing increasing quantities of A. phoenices genomic DNA, in the range of 1-10 nanograms, and various oligonucleotide primer sets.
Degenerate primers were added at a ten-fold higher concentration than that standardly used, due to their degeneracy. All other conditions for PCR were standard, essentially as described in Innis, et aL, 1990, PCR Protocols, pages 282-287, except for the annealing temperatures for the primers. These temperatures were determined on an individual basis using the Oligo 4.0 computer program for analysis as described in Rychlik et al., 1989, Nuc.Acids Res. 17:8543-8551. Specifically, the primers and annealing temperatures were primer first 5 cycles next 30 cycles PHN 11471 50 C 58~ C
PHN 11476 50 C 58~ C
Transformation and Seauencin~
PCR amplification products were ligated into pCR II using the TA Cloning Kit (InVitrogen, San Diego, CA), and transformed into E.coli strain DHSa competent cells 1 S (Life Technologies, Gathersburg, MD) according to the protocol provided with the strain, for cloning and sequencing. Transformed bacteria with plasmid insertions were selected on medium 34Z (LB agar plates containing 100 mg/I carbenicillin) using standard X-GAL
selection protocols (Ausubel, et al., eds, 1989, Current Protocols in Molecular Biology, pages 1Ø3-1.15.8). Briefly, white colonies were picked with an inoculating loop and inoculated directly into a PCR reaction mixture containing primers specific to the universal and reverse promoter regions just outside the multiple cloning site.
The remaining innoculum on the loop was used to streak a plate of 34Z medium and numbered to correspond to the PCR reaction. Successful amplification of an inserted PCR fragment resulted in a band on an ethidium bromide stained agaraose gel which was slightly larger than the size of the insert. Bacterial isolates with an insert of the correct Applicant Ref. 0561-PCT
size were inoculated into shaking liquid cultures and subsequently used for plasmid isolation protocols, followed by sequencing of the insert of interest.
Sequence quality plasmid was prepared by using the Nucleobond P-100 plasmid isolation kit (Machery-Nagle GmBH & Co., Cat.No. BP 101352m distributed by the Nest Group, Southboro, MA). This kit uses an alkaline lysis step and is followed by an ion exchange silica column purification step. Plasmid and gene specific primers were sent to Iowa State University to be sequenced on an automated, ABI DNA
Sequencing machine.
The degenerate primer PCR experiment resulted in the amplification of a 0.4 kb band, which was sequenced and determined to have a deduced amino acid sequence matching the protein data in Table 2. The non-degenerate primer experiment resulted in DNA fragments of various sizes. One fragment was about 0.4 kb in length and encoded a protein having homology to the protein sequence data of Table 2. The region of the APOXD gene that was amplified by both primer sets was nearly the same, so DNA
sequence data for the amplified fragments was compiled, and the sequence of the compiled APOXD genomic fragment is shown in Table 4 [Seq ID No. 15] together with its deduced amino acid sequence [Seq ID Nos. 16 and 29]. The underlined amino acid sequences were represented in the original protein sequence analysis data (Table 2).
Applicant Ref. 0561-PCT
APOXD FRAGMENT
ACG ATC ACA AGG TGG ATG CGA TCG GGG AAG GCC ATG AGC CCT TGC CCT
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro GGC GCA TGG GAG ATG GAG CCA CCA TCA TGG GAC CCC GCA ACA AGG ACC
Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp GTG AGC GCC AGA ACC CCG ACA TGC TCC GTC CTC CGA GCA CCG ACC ATG
Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His GCA ACA TGC CGA ACA TGC GGT GGA GCT TTG CTG ACT CCC ACA TTC GCA
Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg TCG AGG TAA GCC CTT CGA GGG TTT TGT GTA CGA CAA GCA AAA TAG GCT
Ile Glu AAT GCA CTG CAG GAG GGC GGC TGG ACA CGC CAG ACT ACC GTA CGC GAG
Gly Trp Thr Arg Gln Thr Thr Val Arg Glu CTG CCA ACG AGC AAG GAG CTT GCG GGT GTA AAC ATG CGC CTC GAT GAG
Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu GGT GTC ATC CGC GAG TTG CAC TGG CAA GGG CTG AAG GCG AAT TCC AGC
Gly Val Ile Arg Glu Leu His Trp ACA CTG GCG GCC GTT ACT AGT GGA TCC GAG CTC GGT ACC AAG CTT GAT
GC ATAGCT
Applicant Ref. 0561-PCT
3' RACE
Nested oligonucleotide primers were designed based on the genomic DNA
_ sequence fragment which was previously amplified (Table 4) and used for 3' RACE to enhance gene specific amplification. The nested primer design is diagrammatically shown in Figure 2 and the nucleic acid sequences of the primers is shown below in Table 5. Arrows represent the gene specific primers (from top to bottom) PHN 11811, PHN
11810, and the oligo dT based 3' primer from a commercially supplied 3' RACE
kit (Life Technologies, Gaithersburg, MD, Cat. No. 18373-019) TABLE S
3' RACE Primers (5'-3') seq 1D
No.
GGT AAG
The first round of PCR amplification using the outside gene specific primer (GSP) PHN11810 and the oligo dT based 3' primer resulted in no visible DNA
bands. The inside GSP PHNI 1811 and the oligo dT based 3' primer were then used for a second round of amplification on the same sample. A large number of bands appeared, some of which stained intensely with ethidium bromide and some which did not.
The prominent bands were 0.4, 0.8 and 1.3 kb in size. This experiment was set up using 5' and 3' primers with custom ends which only allow ligation of DNA fragments amplified by both. This method permitted the reaction to be used in the ligation protocol without further purification or characterization of the DNA fragments. All three of the prominent bands described above were ligated into pAMPl (Life Technologies, Cat. No., 016), transformed into DHSa cells (Life Technologies, Cat. No. 18263-12), cloned and sequenced. The 0.4 kb band was found to encode an amino acid sequence having homology to the APOXD sequence data of Table 1.
5' RACE
Total RNA was reverse transcribed with commercially available components and a set of oligo dT-based primers ending in G, C or A which are collectively termed Bam T17V (5' CGC GGA TCC GT1~ V) 3') [Seq >D No. 19] These Applicant Ref. 0561-PCT
primers are disclosed in published PCT Application No. U596/08582. First strand cDNA
was oligo dC-tailed and then column purified using commercially available components.
- (Life Technologies, Gaithersburg). The product of this reaction was then used in PCR
with primer set Bam G13H, an equimolar mixture of oligo dG primers ending in A, C, or T (5' TAA GGA TCC TG~3 H 3') [Seq. ID NO: 20], and a second gene specific primer, PHN 11813 [Seq ID No. f1 ]. Amplified products were characterized by Southern analysis using the protocol as described in Ausubel, et al. (eds.), 1989, Current Protocols in Molecular Biology, pages 2Ø1 - 2.12.5.
Hybridization of the 5' RACE product was done using the PCR amplified genomic DNA fragment (Table 4) as a radiolabeled probe. A 0.6 kb band was amplified by this reaction and was strongly labeled with the probe. No other bands appeared. This 0.6 kb band was ligated into the PCR II vector using the TA-cloning procedure, transformed into DHSI, cloned and sequenced. The DNA sequence analysis of the 0.6 kb PCR fragment showed it was homologous to the APOXD sequence data shown in Table 2.
5' RACE Primers SEQ. ID
No.
Bam T17V 5' CGC GGA TCC GT1,V 3' 19 Bam G13H 5' TAA GGA TCC TG13H 3' 2Q
PHN 118135' CAU CAU CAU CAU TAC CTC GAT GCG AAT 21 GTG 3' IUPAC Symbols: V=G,C, or A; H=A, T, or C.
PCR For Full Length The 5' and 3' RACE products were sequenced to their ends as determined by the initiating methionine and the poly-A tail respectively. DNA sequence at each end was analyzed by Oligo 4.0 for oligonucleotide primer design in preparation for PCR to obtain the complete gene.
Primer PHN 12566 designed to the 3' end of the sequence, was used to reverse transcribe total RNA. Primers PHN 12565 and PHN 12567 were used to amplify first strand cDNA. The PCR amplified band was ligated into PCR II using the TA
a Applicant Ref. 0561-PCT
cloning kit (In Vitrogen; San Diego, CA) then transformed into DHSI, cloned, and sequenced.
Full Length cDNA Primers (5'-~3') sEQ: ID
No.
GTA
CC
CCG
A 1.4 kb band was amplified which stained very intensely with ethidium bromide. Other, smaller bands were present, but clearly, the 1.4 kb band was prominent.
This band was sequenced and subjected to open reading frame analysis. All of the protein fragments originally sequenced (Table 2), were found in the deduced amino acid sequence of this PCR product.
Southern analysis was performed on genomic DNA using the 1.4 kb cDNA
as a radiolabeled probe. Only one band hybridized, suggesting that the gene is a single copy and unique in the A. phoenices genome.
Table 1 (pages 4-7) shows the full length cDNA sequence [Seq ID No:l]
and deduced amino acid sequence [Seq ID No:2] of the A. phoehices oxalate decarboxylase gene as amplified, using PCR primers PHN 12565 and PHN 12567.
The underlined amino acid sequences were represented in the original protein sequence analysis data (Table 2). The protein sequence encoded by the full length cDNA
includes a pre-protein, amino acid residues 27-458 [Seq ID No:4], and a mature protein, amino acid residues 50-458 [Seq ID No:S].
Example 2 Transformed plant tissue degrades oxalate CaMV35S/0'/APOXD
The insert ofpPHP9685 (1.4 kb APOXD cDNA in pCR II) was placed into a cloning vector intermediate (pLitmus 28, New England Biolabs) between a plant Applicant Ref. 0561-PCT
expressible promoter and 3' region as shown in the construction diagrams of Figure 3.
The upstream region consists of a cauliflower mosaic virus 35S promoter with a - duplicated enhancer region (2X35S; bases -421 to -90 and -421 to +2, Gardner, et al., 1985, Nucleic Acids Res. 9:2871-2888) with a flanking 5' Notl site and a 3' Pst site, and S2' RNA leader sequence. The 3' region is from potato proteinase inhibitor II.
These are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313. The 2X CaMV
promoter is described in Odell, et al., Nature 313:810-812.
The plant-expressible APOXD gene cassette was then isolated from the cloning intermediate and ligated into the ALS::NPT II:: PIN II-containing pBINl9 construct, pPHP8110. Plasmid pPHP8110 was created from pBIN 19 (Bevan, 1984, Nucleic Acids Res. 12:8711-8721) by replacing the NOS::NPTII::NOS gene cassette in pBINI9 with an ALS::NPTII::PINII cassette. As shown in Figure 3, pPHP8110 is a derivative of pBINl9 containing the NPT II gene, the aminoglycoside-3'-O-phosphotransferase coding sequence, bases 1551 to 2345 from E.coli transposon (Genbank Accession Number V00004, Beck, et al., 1982, Gene 19:327-336). The second amino acid was modified from an isoleucine to a valine in order to create a Nco I
restriction site which was used to make a translational fusion with the ALS
promoter (see copending U.S. Patent Application Serial No. 08/409,297). pPHP8110 further contains the potato proteinase inhibitor II terminator (PIN II) bases 2-310, as described in An, et al., 1989, Plant Cell 1:115-122.
As shown in Figure 4, the resultant plasmid, pPHP9723, carries the APOXD gene construct, together with the NPTII gene for selection of transgenic plant cells, positioned between Agrobacterium T-DNA borders.
Germin/APOXD
A second APOXD cDNA containing plasmid was constructed using the methods described above for producing pPHP 9723. In the second construct, the APOXD
fungal signal and presequence (49 amino acids) were replaced with a plant signal sequence obtained from the 5' end of an enzyme subunit of wheat oxalate oxidase. (Lane, et al., 1991, J. Biol. Chem. 266:10461.) This was accomplished by designing primers that were homologous to the Germin signal sequence, and having extensions to provide the addition of a Sal I restriction site at the 5' end and APOXD 5' sequence followed by a Applicant Ref. 0561-PCT
Nru I site at the 3' end. The primers were used to amplify the Germin signal sequence and are shown below in Table 8.
. Table 8 Germin Signal Sequence Primers (5'-3') Seq ID
No.
GAC
CCT TC
AT CCT
GGG TAG CC AAAA CAG CT GGAG
The amplified Germin signal sequence product [Seq ID N0:27] shown below in Table 9, and a vector containing the full length APOXD cDNA
(pPHP9648) were each digested with Sal I and Nru I. A ligation reaction was set up with the digested fragments to form a Germin signal sequence - APOXD coding sequence fusion construct.
Clones of the correct size were sequenced to verify correct results.
As shown in Table 9, the SaIIlNruI cut Germin SS - containing sequence also contained modif ed APOXD codons matched to fill in the NruI-cut APOXD
sequence. The Germin signal sequence [Seq. m No: 28] is shown in lower case.
Table 9 Amplified Germin Signal/APOXD Sequence*
Sa.ZI start 51 TACAATTACT ATTTACAATT ACAGTCGACC CGGGATCC atg ggt tac 98 tca aag acc ttg gtt get ggt ttg ttc get atg ttg ttg 137 ttg get cca get gtt ttg get acc ICAG GAT AAG CCT TTC
NruI
Applicant Ref. 0561-PCT
*The SaII (GTCGAC) and NruI (TCGCGA) restriction sites are underlined, the Germin signal sequence is in lower case, with the Germin start site in bold. APOXD
sequences modified in the PCR primer design are shown in bold.
This fusion gene was placed in the binary T-DNA plasmid to produce plasmid pPHP9762 carrying the fusion gene and the plant expressible NPTII gene positioned between Agrobacterium T-DNA borders, as described above.
Agrobacterium tumefaciens strain EHA105 (as described in Hood, et al., 1993, Transgen. Res. 2:208-218) was transformed with kanamycin resistant binary T-DNA vectors carrying the different versions of APOXD. Transformation was accomplished by the freeze-thaw method of Holsters, et al., 1978, Mol. Gen.
Genetics 1:181-7. The transformed isolates were selected on solidified 60A (YEP; 10 g/1 yeast extract, 10 g/1 bactopeptone, 5 g/1 NaCI, pH 7.0) medium with 50 mg/1 kanamycin.
Transformed bacteria were cultured in liquid culture of YEP medium containing 50 mg/1 I5 kanamycin, to log phase growth (O.D.6o~ 0.5-1.0) for use in plant transformations. Binary plasmids were re-isolated from transformed Agrobacterium to verify that integrity was maintained throughout the transformation procedures.
Sunflower leaf discs were obtained by harvesting leaves which were not fully expanded, sterilizing the surface in 20% bleach with TWEEN 20, and punching discs out of the leaf with a paper punch. Agrobacterium suspensions were centrifuged and resuspended in inoculation medium (12.5 ~M MES buffer, pH 5.7, 1 g/1 NH4C1, 0.3 g/1 MgS04) to a calculated OD6oo of 0.75 as described in Malone-Schoneberg, et al., 1994, Plant Science 103:199-207. Leaf discs were inoculated in the resuspended Agrobacterium for 10 minutes then blotted on sterile filter paper.
The tissue and bacteria were co-cultivated on 527 for 3 days, then transferred to 527E medium for the selection of transgenic plant cells. After 2 weeks of Applicant Ref. 0561-PCT
culture, the transgenic callus nodes were removed from the leaf disc and subcultured on fresh 527E medium. A number of subcultures were repeated prior to the assay of the callus tissue for enzyme activity.
To assay for enzyme activity, callus was harvested, snap frozen in liquid nitrogen, lyophilized to dryness and powdered. A quantity of 0.75 mg of powder from each prepared callus line was added to 1.0 ml reaction mixture (900 p1 200 mM
NaP04, pH 5.0, 100 p1 10 mM Na-oxalate pH 5.0). The reaction proceeded for 3 hours at room temperature and was stopped by the addition of 150 p,1 of 1M TRIS-HCI, pH 7Ø
Each sample was spun at 14,000 rpm for one minute and 1 ml was removed to a cuvette. One hundred (100) ~l of (3-NAD (6.6 mg/ml stock) and 50 p1 formate dehydrogenase (4.0 mg/ml stock) were added and the increase in absorbance was measured at 340 nm. A
slope was generated for each sample as well as for a formate standard curve.
Assay results were reported as pM oxalate metabolized /mg powder.
The results of the leaf disk assay are shown below in Table 10, and demonstrate that the APOXD gene sequence produces enzyme that is active in transgenic callus. No activity was seen in control callus, or callus transformed with the native APOXD gene (pPHP 9723).
Table 10 Oxalate Decarboxylase Activity in Trans~enic Sunflower Tissue Callus Line Binary Vector Activity pM oxalate/min/mg SMF3 None 0 9723 -1 pPHP 9723 0 -2 pPHP 9723 0 -3 pPHP 9723 0 9762-1 pPHP 9762 1.35 -2 pPHP 9762 1.40 -3 pPHP 9762 0.87 -4 pPHP 9762 0.81 -5 pPHP 9762 0.81 -6 pPHP 9762 0.90 Applicant Ref. 0561-PCT
Example 3 Transgenic Sunflower Plants Expressing APOXD
Sunflower plants were transformed using a basic transformation protocol involving a combination of wounding by particle bombardment, followed by use of Agrobacterium for DNA delivery, as described by Bidney, et al. Plant Mol.
Biol. 18:301 313. The plasmid pPHP9762, as described above for Example 2 and shown in Figure 5, was used in these experiments. pPHP9762 contains the APOXD gene with the fungal signal and presequence replaced with the Germin signal sequence and a plant expressible NPTII gene which provides kanamycin resistance to transgenic plant tissues.
Procedures for preparation of Agrobacterium and preparation of particles for wounding are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313. The Pioneer sunflower line SMF3, used in these experiments, is described in Burros, et al., 1991, Plant Cell Rep. 10:161-166. The Agrobacterium strain used in these experiments, EHA 105. Procedures for use of the helium gun, intact meristem preparation, tissue culture and co-cultivation conditions, as well as recovery of transgenic plants, are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313.
Sunflower explants were prepared by imbibing seed overnight, removing the cotyledons and radical tip, then culturing overnight on medium containing plant growth regulators. Primary leaves were then removed and explants arranged in the center of a petri plate for bombardment. The PDS 1000 helium-driven particle bombardment device (Bio-Rad) was used with 600 psi rupture discs and a vacuum of 26 inches, Hg to bombard meristem explants twice on the highest shelf position. Following bombardment, log phase Agrobacterium cultures transformed with the APOXD-plasmid pPHP 9762, as described for Example 2, were centrifuged and resuspended at a calculated OD600 (vis) of 4.0 in inoculation buffer. Agrobacterium was then dropped onto the meristem explants using a fine tipped pipettor. Inoculated explants were co-cultured for three days then transferred to medium containing 50 mg/1 kanamycin and 250 mg/1 cefotaxime for selection. Explants were cultured on this medium for two weeks then transferred to the same medium, but lacking kanamycin. Green, kanamycin-resistant shoots were recovered to the greenhouse and assayed by an NPTII ELISA assay to verify transformation.
Oxalate decarboxylase enzyme assays are performed on these plants and/or progeny to confirm the expression of APOXD.
Applicant Ref. 0561-PCT
The invention has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: PIONEER HI-BRED INTERNATIONAL, INC.
(ii) TITLE OF INVENTION: GENE ENCODING OXALATE DECARBOXYLASE FROM
ASPERGILLUS PHOENICES
(iii) NUMBER OF SEQUENCES: 30 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SMART & BIGGAR
(B) STREET: P.O. BOX 2999, STATION D
(C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,350,328 (B) FILING DATE: 26-JUN-2001 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: SMART & BIGGAR
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 75529-56 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-232-2486 (B) TELEFAX: (613)-232-8440 (2) INFORMATION FOR SEQ ID NO.: 1:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 1438 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (24)...(1394) (ix) FEATURE
(A) NAME/KEY: mat_peptide (B) LOCATION: (171)...(1394) (ix) FEATURE
(A) NAME/KEY: sig~eptide (B) LOCATION: (24)...(101) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 2:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 457 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 3:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 26 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly (2) INFORMATION FOR SEQ ID NO.: 4:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 431 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 4:
Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 5:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 408 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 5:
Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln 20 Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 6:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: VARIANT
(B) LOCATION: (1)...(27) (C) OTHER INFORMATION: Xaa = Any Amino Acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6:
Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Xaa His Glu Pro Leu (2) INFORMATION FOR SEQ ID NO.: 7:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 12 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7:
Val Ile Arg Glu Leu His Trp His Arg Glu Ala Gly (2) INFORMATION FOR SEQ ID NO.: 8:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 17 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 8:
Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Cys His Arg Glu Ala Glu (2) INFORMATION FOR SEQ ID NO.: 9:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 20 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 9:
Ser Tyr Phe Lys Arg Gly Arg Ala Arg Tyr Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg 30 (2) INFORMATION FOR SEQ ID NO.: 10:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 12 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 10:
40 Ser Ala His Thr Pro Pro Ser Val Leu Ala Gly Asn (2) INFORMATION FOR SEQ ID NO.: 11:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 35 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
50 (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(35) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 11:
(2) INFORMATION FOR SEQ ID NO.: 12:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 35 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
( i. i ) MOLECULE TYPE : DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(35) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 12:
(2) INFORMATION FOR SEQ ID NO.: 13:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 22 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 13:
(2) INFORMATION FOR SEQ ID NO.: 14:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 21 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(v1) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 14:
(2) INFORMATION FOR SEQ ID NO.: 15:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 440 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (3)...(197) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (259)...(360) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 15:
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp (2) INFORMATION FOR SEQ ID NO.: 16:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 99 40 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 16:
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp 50 Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp (2) INFORMATION FOR SEQ ID NO.: 17:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 19 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 17:
(2) INFORMATION FOR SEQ ID NO.: 18:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 30 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 18:
(2) INFORMATION FOR SEQ ID NO.: 19:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 28 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 19:
(2) INFORMATION FOR SEQ ID NO.: 20:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 24 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 20:
(2) INFORMATION FOR SEQ ID NO.: 21:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 30 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 21:
(2) INFORMATION FOR SEQ ID NO.: 22:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 28 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 22:
(2) INFORMATION FOR SEQ ID NO.: 23:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 26 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 23:
(2) INFORMATION FOR SEQ ID NO.: 24:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 24:
(2) INFORMATION FOR SEQ ID NO.: 25:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 38 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 25:
(2) INFORMATION FOR SEQ ID NO.: 26:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 56 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 26:
(2) INFORMATION FOR SEQ ID NO.: 27:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 507 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Triticum aestivum (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 27:
(2) INFORMATION FOR SEQ ID NO.: 28:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 72 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Triticum aestivum (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 28:
(2) INFORMATION FOR SEQ ID NO.: 29:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 16 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(16) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 29:
(2) INFORMATION FOR SEQ ID NO.: 30:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 17 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(17) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 30:
Additional sequence modifications are known to enhance protein expression in a cellular host. These include elimination of sequences encoding spurious - polyadenylation signals, exon/intron splice site signals, transposon-like repeats, and other such well-characterized sequences which may be deleterious to gene expression.
The G
C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. Where possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
Other useful modifications include the addition of a translational initiation consensus sequence at the start of the open reading frame, as described in Kozak, 1989, Mol Cell Biol. 9:5073 5080.
In addition, the native APOXD gene or a modified version of the APOXD
gene might be further optimized for expression by omitting the predicted signal and pre-sequence, replacing the signal sequence with another signal sequence, or replacing the signal and pre-sequence with another signal sequence. Any one of the possible APOXD
gene variations may work best when combined with a specific promoter and/or termination sequence.
The term "selectively hybridizes" includes a reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detestably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences typically have about at least 80% sequence identity, preferably 90% sequence identity, and most preferably 100%
sequence identity (i.e., complementary) with each other.
The terms "stringent conditions" or "stringent hybridization conditions"
include reference to conditions under which a probe will hybridize to its target sequence, to a detestably greater degree than other sequences (e.g., at least 2-fold over background).
Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences can be identified which are 100% complementary to the probe (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected Applicant Ref. 0561-PCT
(heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, optionally less than 500 nucleotides in length.
Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI, 1 % SDS (sodium dodecyl sulphate) at 37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in O.SX to 1X
SSC at 55 to 60°C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in O.1X SSC at 60 to 65°C.
Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl, Aaal. Biochem., 138:267-284 (1984): Tm = 81.5°C + 16.6 (log M) +
0.41 (%CG) -0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %CG is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs.
The Tm is the temperature (under defined ionic strength and pH) at which SO%
of a complementary target sequence hybridizes to a perfectly matched probe. Tm is reduced by about I°C for each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with > 90% identity are sought, the Tm can be decreased 10°C.
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence and its complement at a defined ionic strength and pH.
However, severely stringent conditions can utilize a hybridization and/or wash at l, 2, 3, or 4° lower than the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10°C lower than the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or wash at 11, 12, Applicant Ref. 0561-PCT
- 1~ -13, 14, 15, or 20°C lower than the thermal melting point (Tn,). Using the equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a Tm of less than 45°C (aqueous solution) or 32°C (formamide solution) it is preferred to increase the SSC
concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, Chapter "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995).
As used herein, "vector" includes reference to a nucleic acid used in transfection of a host cell and into which can be inserted a polynucleotide.
Vectors are often replicons. Expression vectors permit transcription of a nucleic acid inserted therein.
The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison windows", (c) "sequence identity", and (d) "percentage of sequence identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.
(b) As used herein, "comparison window" means includes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Applicant Ref. 0561-PCT
- m -Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman. Adv. Appl. Math. 2: 482 (1981); by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol 48: 443 (1970);
by the search for similarity method of Pearson and Lipman, PYOC. Natl. Acad.
Sci. 85:
2444 (1988); by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PCIGene program by Intelligenetics, Mountain View, California, GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wisconsin, USA; the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (I988); Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 155-65 (1992), and Pearson, et al., Methods in Molecular Biology 24: 307-331 (1994). The BLAST family of programs which can be used for database similarity searches includes: BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX for nucleotide query sequences against protein database sequences;
BLASTP for protein query sequences against protein database sequences; TBLASTN
for protein query sequences against nucleotide database sequences; and TBLASTX for nucleotide query sequences against nucleotide database sequences. See, Current Protocols in Molecular Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995).
GAP uses the algorithm of Needleman and Wunsch (J Mol Biol 48: 443-453 (1970)) to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. GAP considers all possible alignments and gap positions and creates the alignment with the largest number of matched bases and the fewest gaps. Default gap creation penalty values and gap extension penalty values in Version 10 of the Wisconsin Genetics Software Package are 8 and 2, respectively, for protein sequences. For nucleotide sequences the default gap creation penalty is 50 while the default gap extension penalty is 3. The gap creation and gap extension penalties can be expressed as an integer selected form the group of integers consisting of form 0 to 100.
Thus, for example, the gap creation and gap extension penalties can be 0, l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, or greater.
Applicant Ref. 0561-PCT
GAP presents one member of the family of best alignments. There may be many members of this family, but no other member has a better quality. GAP
displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity. The Quality is the metric maximized in order to align the sequences. Ratio is the quality divided by the number of bases in the shorter segment. Percent Identity is the percent of the symbols that actually match. Percent Similarity is the percent of the symbols that are similar.
Symbols that are across from gaps are ignored. A similarity is scored when the scoring matrix value for a pair of symbols is greater than or equal to 0.50, the similarity threshold.
The scoring matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff and Henikoff, Proc Natl Acad Sci USA 89:10915).
Unless otherwise stated, sequence identity/similarity values provided herein refer to the value obtained using GAP version 10 of Wisconsin Genetic Software Package using default parameters.
APOXD Protein The recombinant APOXD protein produced from the disclosed nucleic acid sequence provides a substantially pure protein useful to degrade oxalate, particularly in applications where highly purified enzymes are required. The recombinant protein may be used in enzymatic assays of oxalate or added to compositions containing oxalate to induce oxalate degradation.
When used externally, the enzyme can be placed in a liquid dispersion or solution, or may be mixed with a carrier solid for application as a dust or powder. The particular method of application and carrier used will be determined by the particular plant and pathogen target. Such methods are known, and are described, for example, in U.S. Patent No. 5,488,035 to Rao:
Gene Delivery The nucleic acid sequence encoding APOXD may be delivered to plant cells for transient transfections or for incorporation into the plant's genome by methods know in the art. Preferably, the gene is used to stably transform plant cells for expression of the protein in vivo.
To accomplish such delivery, the gene containing the coding sequence for APOXD may be attached to regulatory elements needed for the expression of the gene in Applicant Ref. 0561-PCT
a particular host cell or system. These regulatory elements include, for example, promoters, terminators, and other elements that permit desired expression of the enzyme in a particular plant host, in a particular tissue or organ of a host such as vascular tissue, root, leaf, or flower, or in response to a particular signal.
Promoters A promoter is a DNA sequence that directs the transcription of a structural gene, e.g., that portion of the DNA sequence that is transcribed into messenger RNA
(mRNA) and then translated into a sequence of amino acids characteristic of a specific polypeptide. Typically, a promoter is located in the 5' region of a gene, proximal to the transcriptional start site. A promoter may be inducible, increasing the rate of transcription in response to an inducing agent. In contrast, a promoter may be constitutive, whereby the rate of transcription is not regulated by an inducing agent. A promoter may be regulated in a tissue-specific or tissue-preferred manner, such that it is only active in transcribing the operably linked coding region in a specific tissue type or types, such as plant leaves, roots, or meristem.
Inducible Promoters An inducible promoter useful in the present invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the inducible promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD. With an inducible promoter, the rate of transcription increases in response to an inducing agent.
Any inducible promoter can be used in the present invention to direct transcription of APOXD, including those described in Ward, et al., 1993, Plant Molecular Biol. 22: 361:-366. Exemplary inducible promoters include that from the ACEl system which responds to copper (Mett et al., 1993, PNAS 90: 4567-4571); In2 gene promoter from maize which responds to benzenesulfonamide herbicide safeners (Hershey et al., 1991, Plant Mol. Biol. 17:679-690; and the Tet repressor from TnlO (Hersey, et al., 1991, Mol. Gen. Genetics 227:229-237; Gatz, et al., 1994, Mol.Gen. Genetics 243:32-38).
A particularly preferred inducible promoter is one that responds to an inducing agent to which plants do not normally respond. One example of such a promoter is the steroid hormone gene promoter. Transcription of the steroid hormone gene Applicant Ref. 0561-PCT
promoter is induced by glucocorticosteroid hormone. (Schena et al., 1991, PNAS
U.S.A.
88:10421) - In the present invention, an expression vector comprises an inducible promoter operably linked to a nucleotide sequence encoding APOXD. The expression vector is introduced into plant cells and presumptively transformed cells are exposed to an inducer of the inducible promoter. The cells are screened for the presence of APOXD
proteins by immunoassay methods or by analysis of the enzyme's activity.
Pathogen-Inducible Promoters A pathogen-inducible promoter of the present invention is an inducible promoter that responds specifically to the inducing agent, oxalic acid, or to plant pathogens such as oxalic acid-producing pathogens including Sclerotinia sclerotiorum.
Genes that produce transcripts in response to Sclerotinia and oxalic acid have been described in Mouley et al., 1992, Plant Science 85:51-59. One member of the prpl-1 gene family contains a promoter that is activated in potato during early stages of late blight infection and is described in Martini et al., 1993, Mol.Gen.Genet.
236:179-186.
Tissue-specific or Tissue-Preferred Promoters A tissue specific promoter of the invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the tissue-specific promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD. Plants transformed with a gene encoding APOXD operably linked to a tissue specific promoter produce APOXD
protein exclusively, or preferentially, in a specific tissue.
Any tissue-specific or tissue-preferred promoter can be utilized in the instant invention. Examples of such promoters include a root-preferred promoter such as that from the phaseolin gene as described in Murai et al., 1983, Science 222:476-482 and in Sengupta-Gopalan et al., 1985, PNAS USA 82:3320-3324; a leaf specific and light-induced promoter such as that from cab or rubisco as described in Simpson et al., 1985, EMBO J. 4(11):2723-2729, and in Timko et al., 1985, Nature 318:579-582; an anther-specific promoter such as that from LAT52 as described in Twell et al., 1989, Mol. Gen.
Genet. 217:240-245; a pollen-specific promoter such as that from Zml3 as described in Guerrero et al., 1990, Mol.Gen. Genet. 224:161-168; and a microspore-preferred Applicant Ref. 0561-PCT
promoter such as that from apg as described in Twell et al., 1993, Sex. Plant Reprod.
6:217-224.
. Other tissue-specific promoters useful in the present invention include a phloem-preferred promoter such as that associated with the Arabidopsis sucrose synthase gene as described in Martin et al., 1993, The Plant Journal 4(2):367-377; a floral-specific promoter such as that of the Arabidopsis HSP 18.2 gene described in Tsukaya et al., 1993, Mol. Gen. Genet. 237:26-32 and of the Arabidopsis HMG2 gene as described in Enjuto et al., 1995, Plant Cell 7:517-527.
An expression vector of the present invention comprises a tissue-specific or tissue-preferred promoter operably linked to a nucleotide sequence encoding APOXD.
The expression vector is introduced into plant cells. The cells are screened for the presence of APOXD protein by immunological methods or by analysis of enzyme activity.
Constitutive Promoters A constitutive promoter of the invention is operably linked to a nucleotide sequence encoding APOXD. Optionally, the constitutive promoter is operably linked to a nucleotide sequence encoding a signal sequence which is operably linked to a nucleotide sequence encoding APOXD.
Many different constitutive promoters can be utilized in the instant invention to express APOXD. Examples include promoters from plant viruses such as the 35S promoter from cauliflower mosaic virus (CaMV), as described in Odell et al., 1985, Nature 313:810-812, and promoters from genes such as rice actin (McElroy et al., 1990, Plant Cell 2:163-171); ubiquitin (Christensen et al., 1989, Plant Mol. Biol.
12:619-632;
and Christensen et al., 1992, Plant Mol. Biol 18:675-689); pEMU (Last et al., 1991, Theor. Appl. Genet. 81:581-588); MAS (Velten et al., 1984, EMBO J. 3:2723-2730); and maize H3 histone (Lepetit et al., 1992, Mol.Gen.Genet. 231:276-285; and Atanassvoa et al., 1992, Plant Journal 2(3):291-300).
The ALS promoter, a Xba/NcoI fragment 5' to the Brassica napus ALS3 structural gene, or a nucleotide sequence having substantial sequence similarity to the XbaI/NcoI fragment, represents a particularly useful constitutive promoter, and is described in published PCT Application number WO 96/30530.
In the present invention, an expression vector comprises a constitutive promoter operably linked to a nucleotide sequence encoding APOXD. The expression r Applicant Ref. 0561-PCT
vector is introduced into plant cells and presumptively transformed cells are screened for the presence of APOXD proteins by immunoassay methods or by analysis of the enzyme's activity.
Additional regulatory elements that may be connected to the APOXD
nucleic acid sequence for expression in plant cells include terminators, polyadenylation sequences, and nucleic acid sequences encoding signal peptides that permit localization within a plant cell or secretion of the protein from the cell. Such regulatory elements and methods for adding or exchanging these elements with the regulatory elements of the APOXD gene are known, and include, but are not limited to, 3'termination and/or polyadenylation regions such as those of the Agrobacterium tumefaciens nopaline synthase (nos) gene (Bevan et al., 1983, Nucl. Acids Res. 11(2):369-385); the potato proteinase inhibitor II (PINII) gene (Keil. et al., 1986, Nucl. Acids Res.
14:5641-5650; and An et al., 1989, Plant Cell 1:115-122); and the CaMV 19S gene (Molten et al., 1990, Plant Cell 2:1261-1272).
Plant signal sequences, including, but not limited to, signal-peptide encoding DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell (Dratewka-Kos, et al., J. Biol. Chem. 264:4896-4900, 1989) and the Nicotiana plumbaginifolia extensin gene (DeLoose, et al., Gene 99:95-100, 1991), or signal peptides which target proteins to the vacuole like the sweet potato sporamin gene (Matsuoka, et al., PNAS 88:834, 1991) and the barley lectin gene (Wilkins, et al., Plant Cell, 2:301-313, 1990), or signals which cause proteins to be secreted such as that of PRIb (Lund, et al., Plant Mol. Biol. 18:47-53, 1992), or those which target proteins to the plastids such as that of rapeseed enoyl-Acp reductase (Verwoert, et al., Plant Mol. Biol.
26:189-202, 1994) are useful in the invention.
Gene Transformation Methods Numerous methods for introducing foreign genes into plants are known and can be used to insert the APOXD gene into a plant host, including biological and physical plant transformation protocols. See, for example, Miki et al., 1993, "Procedure for Introducing Foreign DNA into Plants" in: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 67-88.
The methods chosen vary with the host plant, and include chemical transfection methods such as calcium phosphate, microorganism-mediated gene transfer such as Applicant Ref. 0561-PCT
- i~ -Agrobacterium (Horsch, et al., Science 227:1229-31, 1985), electroporation, micro-injection, and biolistic bombardment.
Expression cassettes and vectors and in vitro culture methods for plant cell or tissue transformation and regeneration of plants are known and available.
See, for example, Gruber, et al., 1993, "Vectors for Plant Transformation" In: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 89-119.
A~robacte~ium-mediated Transformation The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium. A.
tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectfully, carry genes responsible for genetic transformation of plants. See, for example, Kado, 1991, Crit. Rev.Plant Sci. 10(1):1-32. Descriptions of the Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer are provided in Gruber et al., supra;
Miki, et al., supra; and Moloney, et al., 1989, Plant Cell Reports 8:238.
Direct Gene Transfer Despite the fact that the host range for Agrobacterium-mediated transformation is broad, some major cereal crop species and gymnosperms have generally be recalcitrant to this mode of gene transfer, even though some success has recently been achieved in rice (Hiei et al., 1994, The Plant Journal 6(2):271-282). Several methods of plant transformation, collectively referred to as direct gene transfer, have been developed as an alternative to Agrobacterium-mediated transformation.
A generally applicable method of plant transformation is microprojectile-mediated transformation, where DNA is carried on the surface of microprojectiles measuring about 1 to 4 pm. The expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds of 300 to 600 mls which is sufficient to penetrate the plant cell walls and membranes. (Sanford et al., 1987, Part.Sci.
Technol 5:27; Sanford, 1988, Trends Biotech 6:299; Sanford, 1990, Physiol.
Plant 79:206; Klein et al., 1992, Biotechnology 10:268) Another method for physical delivery of DNA to plants is sonication of target cells as described in Zhang et al., 1991, BiolTechnology 9:996.
Alternatively, Applicant Ref. 0561-PCT
liposome or spheroplast fusions have been used to introduce expression vectors into plants. See, for example, Deshayes et al., 1985, EMBO J. 4:2731-2737; and Christou, et - al., 1987, PNAS USA 84:3962-3966. Direct uptake of DNA into protoplasts using CaCl2 precipitation, polyvinyl alcohol or poly-L-ornithine have also been reported.
See, for example, Hain et al., 1985, Mol. Gen.Genet. 199:161; and Draper, et al., 1982, Plant &
Cell Physiol. 23:451.
Electroporation of protoplasts and whole cells and tissues has also been described. See, for example, D'Halluin, et al., 1992, Plant Cell 4:1495-1505;
and Spencer, et al., 1994, PlantMol.Biol. 24:51-61.
Particle Woundin~lA~robacterium Delivery Another useful basic transformation protocol involves a combination of wounding by particle bombardment, followed by use of Agrobacterium for DNA
delivery, as described by Bidney, et al. 1992, Plant Mol. Biol. 18:301-313. Useful plasmids for plant transformation include pPHP9762 shown in Figure 5. The binary backbone for pPHP9762 is pPHP6333. See Bevan, 1984, Nucleic Acids Research 12:8711-8721.
This protocol is preferred for transformation of sunflower plants, and employs either the "intact meristem" method or the "split meristem" method.
In general, the intact meristem transformation method (Bidney, et al., SupYa) involves imbibing seed for 24 hours in the dark, removing the cotyledons and root radical, followed by culturing of the meristem explants. Twenty-four hours later, the primary leaves are removed to expose the apical meristem. The explants are placed apical dome side up and bombarded, e.g., twice with particles, followed by co-cultivation with Agrobacterium. To start the co-cultivation for intact meristems, Agrobacterium is placed on the meristem. After about a 3-day co-cultivation period the meristems are transferred to culture medium with cefotaxime (plus kanamycin for the NPTII selection).
Selection can also be done using kanamycin.
The split meristem method involves imbibing seed, breaking of the cotyledons to produce a clean fracture at the plane of the embryonic axis, excising the root tip and then bisecting the explants longitudinally between the primordial leaves (Malone-Schoneberg et al., 1994, Plant Science 103:199-207). The two halves are placed cut surface up on the medium then bombarded twice with particles, followed by co-cultivation with Agrobacterium. For split meristems, after bombardment the meristems Applicant Ref. 0561-PCT
are placed in an Agrobacterium suspension for 30 minutes. They are then removed from the suspension onto solid culture medium for three day co-cultivation. After this period, - the meristems are transferred to fresh medium with cefotaxime (plus kanamycin for selection).
Transfer by Plant Breeding Alternatively, once a single transformed plant has been obtained by the foregoing recombinant DNA method, conventional plant breeding methods can be used to transfer the structural gene and associated regulatory sequences via crossing and backcrossing. Such intermediate methods will comprise the further steps of:
(1) sexually crossing the disease-resistant plant with a plant from the disease-susceptible taxon; (2) recovering reproductive material from the progeny of the cross; and (3) growing disease-resistant plants from the reproductive material. Where desirable or necessary, the agronomic characteristics of the susceptible taxon can be substantially preserved by expanding this method to include the further steps of repetitively: (1) backcrossing the disease-resistant progeny with disease-susceptible plants from the susceptible taxon; and (2) selecting for expression of APOXD activity (or an associated marker gene) among the progeny of the backcross, until the desired percentage of the characteristics of the susceptible taxon are present in the progeny along with the gene imparting APOXD
activity.
By the term "taxon" herein is meant a unit of botanical classification of genus or lower. It thus includes genus, species, cultivars, varieties, variants and other minor taxonomic groups which lack a consistent nomenclature.
Assay Methods Transgenic plant cells, callus, tissues, shoots, and transgenic plants are tested for the presence of the APOXD gene by DNA analysis (Southern blot or PCR) and for expression of the gene by immunoassay or by assay of oxalate decarboxylase activity.
Tolerance to exogenous oxalic acid can also be used as a functional test of enzyme expression in transformed plants.
APOXD ELISA
Transgenic cells, callus, plants and the like are screened for the expression of APOXD protein by immunological assays, including ELISA. Anti-APOXD
antibodies are generated against APOXD preparations by known methods and are used in typical Applicant Ref. 0561-PCT
ELISA reactions. Polyclonal anti-APOXD can, for example, detect a range of about 10-100 pg APOXD protein in transgenic plant tissues.
In a suitable method for an APOXD-ELISA assay, fresh leaf or callus tissue is homogenized and centrifuged. An aliquot of the supernatant is added to a microtiter plate with a first anti-APOXD antibody and incubated for sufficient time for antibody-antigen reaction. The bound antibody is then reacted with a second antibody linked to a marker, which marker is developed or otherwise converted to a detectable signal correlated to the amount of APOXD protein in the sample. Any of the known methods for producing antibodies and utilizing such antibodies in an immunoassay can be used to determine the amount of APOXD expressed in transgenic plant cells and tissues of the invention.
Oxalate Decarboxylase Assay Transgenic cells, tissue, or plants expressing the APOXD gene are assayed for enzyme activity to verify expression of the gene. In general, the cells or tissue is frozen in liquid nitrogen, placed on a lyophilizer overnight to dehydrate, then crushed into a fme powder for use in the assay reaction. Leaf tissue is homogenized as fresh tissue in the reaction mixture, or dehydrated and treated as described above.
A typical assay reaction is begun by adding 0.75 mg of powdered tissue, such as callus, to 1 ml of oxalate decarboxylase reaction mixture: 900 ~1 0.2 M sodium phosphate buffer, pH 5.0, and 100 ~l of 10 mM sodium oxalate, pH 5Ø The reaction is incubated at room temperature for 3 hours with gentle mixing, and is stopped by the addition of 150 p1 of 1 M Tris-HCI, pH 7Ø The mixture is centrifuged, and an aliquot is placed in a cuvette with NAD (600 fig) and formate dehydrogenase (200 pg). The absorbance at 340 nm is correlated to the activity of the APOXD enzyme.
Use of Oxalate Decarboxylase as a Selectable Marker Oxalate decarboxylase is useful in selecting successful transformants, e.g., as a selectable marker. Growth of plant cells in the presence of oxalic acid favors the survival of plant cells that have been transformed with a gene encoding an oxalate-degrading enzyme, such as APOXD. In published PCT application WO 94/13790, herein incorporated by reference, plant cells grown on a selection medium containing oxalic acid (and all of the elements necessary for multiplication and differentiation of plant cells) demonstrated selection of only those cells transformed with and expressing oxalate Applicant Ref. 0561-PCT
oxidase. In like manner, transformation. and expression of the gene encoding APOXD in plant cells is used to degrade oxalic acid present in the media and allow the growth of - only APOXD-gene transformed cells.
Production of APOXD in Plants Trangenic plants of the present invention, expressing the APOXD gene, are used to produce oxalate decarboxylase in commercial quantities. The gene transformation and assay selection techniques described above yield a plurality of transgenic plants which are grown and harvested in a conventional manner.
Oxalate decarboxylase is extracted from the plant tissue or from total plant biomass.
Oxalate decarboxylase extraction from biomass is accomplished by known methods. See for example, Heney and Orr, 1981, Ahal. Biochem. 114:92-96.
In any extraction methodology, losses of material are expected and costs of the procedure are also considered. Accordingly, a minimum level of expression of oxalate decarboxylase is required for the process to be deemed economically worthwhile.
The terms "commercial" and "commercial quantities" here denote a level of expression where at least 0.1% of the total extracted protein is oxalate decarboxylase.
Higher levels of oxalate decarboxylase expression are preferred.
Diagnostic Oxalate Assay Clinical measurement of oxalic acid in urine is important, for example, in the diagnosis and treatment of patients with urinary tract disorders or hyperoxaluric syndromes. The recombinant APOXD enzyme of the invention is preferably immobilized onto beads or solid support, or added in aqueous solution to a sample for quantitation of oxalate. As discussed above, oxalate decarboxylase catalyzes the conversion of oxalate to C02 and formic acid. A variety of detection systems can be utilized to quantify this enzyme catalyzed conversion, including methods for detecting an increase in CO2, or for detecting an increase in formic acid.
For example, the conversion of oxalate to formic acid and COZ is assayed by determining formate production via the reduction of NAD in the presence of formate dehydrogenase. This method is described in Lung, et al., 1994, J.
Bacteriology, 176:2468-2472 and Johnson, et al., 1964, Biochem. Biophys. Acta 89:35.
A calibration curve is generated using known amounts of oxalic acid. The amount of oxalate in a specimen is extrapolated from the standard curve.
Applicant Ref. 0561-PCT
Other enzymatic assays and the like are adapted by known methods to utilize the APOXD enzyme to detect conversion of oxalate.
EXAMPLES
The invention is described more fully below in the following Examples, which are exemplary in nature and are not intended to limit the scope of the invention in any way.
Example 1 Cloning of the Gene Encoding APOXD
Protein Seguence A commercial preparation of A. phoenices oxalate decarboxylase enzyme was obtained from Boehringer Mannheim. (Catalog #479 586) SDS polyacrylamide gel electrophoresis was used to determine the purity of the enzyme. Only one dark band appeared following Coomassie blue staining of the polyacrylamide gel (12.5%).
This band was about 49 kd in size, as determined by comparison to molecular weight markers.
Aliquots of the preparation were sent to the University of Michigan for sequence analysis by Edman degradation on an automated protein sequencer. Preparative polyacrylamide gels were run and the APOXD band was isolated from the gel prior to sequencing. The protein was first sequenced at the amino terminus. Proteins were chemically cleaved into fragments by cyanogen bromide, size separated on polyacrylamide gels, and isolated as bands on the gel for further preparation and sequencing. The results of the sequencing are shown below in Table 2.
Peptide Seq uence* Seq. ID
No.
ammo termlnLiSG1n AspLys Pro Phe ThrPro Asp HisArg 6 Asp ProTyr Asp His LysVal Asp AlaIle Gly GluX Pro Leu His Glu fragment Val IleArg Glu Leu HisTrp His ArgGlu 7 Ala Gly fragment Arg LeuAsp Glu Gly ValIle Arg GluLeu His CysHis Arg 1u G Ala Glu fragment Ser TyrPhe Lys Arg GlyArg Ala ArgTyr Thr IlePhe Ala la a g A Glu Ar Gly Asn Al fragment Ser AlaHis Thr Pro ProSer Val LeuAla Gly Asn * X = Unknown.
a Applicant Ref. 0561-PCT
PCR Amplification of Genomic A. phoenices Genomic DNA was used as the PCR template to amplify the APOXD
- sequence. Aspergillus phoenices was obtained from the American Type Culture Collection (ATCC), Rockville, MD. Cultures were established on solid potato dextrose agar medium (Difco formulation). Liquid stationary cultures were started from culture plates by innoculatory spores in a minimal growth medium previously described for the culture of Aspergillus strains (Emiliani, et al., 1964, Arch. Biochem. Biophys 105:488-493, cited above).
To isolate DNA, mycelial mats were recovered from 4-day liquid stationary cultures, washed in cold water, and blotted dry. The tissue was then frozen in liquid nitrogen, ground by mortar and pestle, and stored frozen at -80°C. DNA was extracted by the method described for fungal mycelium in Sunis et al. (eds.), 1990, PCR
protocols, pages 282-287.
PCR Strategy As diagrammed in Figure 1, primers were designed for both the N-terminal protein sequence and for an internal peptide fragment. One set of primers (PHN
[Seq ID No. 11] and PHN 11339 [Seq ID No. 12]) was designed with nearly full degeneracy. A second set of primers (PHN 11471 [Seq. ID No. 13] and PHN 11476 [Seq ID No. 14]) was designed with no degeneracy. These were based on a codon usage table for Aspergillus niger generated using the Wisconsin Sequence Analysis Package (GCG) (Genetics Computer Group, Inc., Madison, WI). The sequences of these primers is shown in Table 3, below, and diagrammatically in Figure 1. Table 3 shows the degenerate primer mixtures using ICTPAC designations, as described in Cornish-Bowden, 1985, Nucleic Acids Res. 13:3021-3030. The lUPAC nucleic acid symbols include: Y=C
or T;
N=A, T, C, or G; R=A or G; D=A, T, or G; and V=A, C, or G. Both of these PCR
strategies were successful in amplifying a DNA fragment, shown in Table 4, having homology to the protein sequence data shown in Table 2.
Primer Sets-3') Seq. ID
(5' #
CAU TA
CUA VA
CTT
Applicant Ref. 0561-PCT
PCR reactions were set up containing increasing quantities of A. phoenices genomic DNA, in the range of 1-10 nanograms, and various oligonucleotide primer sets.
Degenerate primers were added at a ten-fold higher concentration than that standardly used, due to their degeneracy. All other conditions for PCR were standard, essentially as described in Innis, et aL, 1990, PCR Protocols, pages 282-287, except for the annealing temperatures for the primers. These temperatures were determined on an individual basis using the Oligo 4.0 computer program for analysis as described in Rychlik et al., 1989, Nuc.Acids Res. 17:8543-8551. Specifically, the primers and annealing temperatures were primer first 5 cycles next 30 cycles PHN 11471 50 C 58~ C
PHN 11476 50 C 58~ C
Transformation and Seauencin~
PCR amplification products were ligated into pCR II using the TA Cloning Kit (InVitrogen, San Diego, CA), and transformed into E.coli strain DHSa competent cells 1 S (Life Technologies, Gathersburg, MD) according to the protocol provided with the strain, for cloning and sequencing. Transformed bacteria with plasmid insertions were selected on medium 34Z (LB agar plates containing 100 mg/I carbenicillin) using standard X-GAL
selection protocols (Ausubel, et al., eds, 1989, Current Protocols in Molecular Biology, pages 1Ø3-1.15.8). Briefly, white colonies were picked with an inoculating loop and inoculated directly into a PCR reaction mixture containing primers specific to the universal and reverse promoter regions just outside the multiple cloning site.
The remaining innoculum on the loop was used to streak a plate of 34Z medium and numbered to correspond to the PCR reaction. Successful amplification of an inserted PCR fragment resulted in a band on an ethidium bromide stained agaraose gel which was slightly larger than the size of the insert. Bacterial isolates with an insert of the correct Applicant Ref. 0561-PCT
size were inoculated into shaking liquid cultures and subsequently used for plasmid isolation protocols, followed by sequencing of the insert of interest.
Sequence quality plasmid was prepared by using the Nucleobond P-100 plasmid isolation kit (Machery-Nagle GmBH & Co., Cat.No. BP 101352m distributed by the Nest Group, Southboro, MA). This kit uses an alkaline lysis step and is followed by an ion exchange silica column purification step. Plasmid and gene specific primers were sent to Iowa State University to be sequenced on an automated, ABI DNA
Sequencing machine.
The degenerate primer PCR experiment resulted in the amplification of a 0.4 kb band, which was sequenced and determined to have a deduced amino acid sequence matching the protein data in Table 2. The non-degenerate primer experiment resulted in DNA fragments of various sizes. One fragment was about 0.4 kb in length and encoded a protein having homology to the protein sequence data of Table 2. The region of the APOXD gene that was amplified by both primer sets was nearly the same, so DNA
sequence data for the amplified fragments was compiled, and the sequence of the compiled APOXD genomic fragment is shown in Table 4 [Seq ID No. 15] together with its deduced amino acid sequence [Seq ID Nos. 16 and 29]. The underlined amino acid sequences were represented in the original protein sequence analysis data (Table 2).
Applicant Ref. 0561-PCT
APOXD FRAGMENT
ACG ATC ACA AGG TGG ATG CGA TCG GGG AAG GCC ATG AGC CCT TGC CCT
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro GGC GCA TGG GAG ATG GAG CCA CCA TCA TGG GAC CCC GCA ACA AGG ACC
Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp GTG AGC GCC AGA ACC CCG ACA TGC TCC GTC CTC CGA GCA CCG ACC ATG
Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His GCA ACA TGC CGA ACA TGC GGT GGA GCT TTG CTG ACT CCC ACA TTC GCA
Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg TCG AGG TAA GCC CTT CGA GGG TTT TGT GTA CGA CAA GCA AAA TAG GCT
Ile Glu AAT GCA CTG CAG GAG GGC GGC TGG ACA CGC CAG ACT ACC GTA CGC GAG
Gly Trp Thr Arg Gln Thr Thr Val Arg Glu CTG CCA ACG AGC AAG GAG CTT GCG GGT GTA AAC ATG CGC CTC GAT GAG
Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu GGT GTC ATC CGC GAG TTG CAC TGG CAA GGG CTG AAG GCG AAT TCC AGC
Gly Val Ile Arg Glu Leu His Trp ACA CTG GCG GCC GTT ACT AGT GGA TCC GAG CTC GGT ACC AAG CTT GAT
GC ATAGCT
Applicant Ref. 0561-PCT
3' RACE
Nested oligonucleotide primers were designed based on the genomic DNA
_ sequence fragment which was previously amplified (Table 4) and used for 3' RACE to enhance gene specific amplification. The nested primer design is diagrammatically shown in Figure 2 and the nucleic acid sequences of the primers is shown below in Table 5. Arrows represent the gene specific primers (from top to bottom) PHN 11811, PHN
11810, and the oligo dT based 3' primer from a commercially supplied 3' RACE
kit (Life Technologies, Gaithersburg, MD, Cat. No. 18373-019) TABLE S
3' RACE Primers (5'-3') seq 1D
No.
GGT AAG
The first round of PCR amplification using the outside gene specific primer (GSP) PHN11810 and the oligo dT based 3' primer resulted in no visible DNA
bands. The inside GSP PHNI 1811 and the oligo dT based 3' primer were then used for a second round of amplification on the same sample. A large number of bands appeared, some of which stained intensely with ethidium bromide and some which did not.
The prominent bands were 0.4, 0.8 and 1.3 kb in size. This experiment was set up using 5' and 3' primers with custom ends which only allow ligation of DNA fragments amplified by both. This method permitted the reaction to be used in the ligation protocol without further purification or characterization of the DNA fragments. All three of the prominent bands described above were ligated into pAMPl (Life Technologies, Cat. No., 016), transformed into DHSa cells (Life Technologies, Cat. No. 18263-12), cloned and sequenced. The 0.4 kb band was found to encode an amino acid sequence having homology to the APOXD sequence data of Table 1.
5' RACE
Total RNA was reverse transcribed with commercially available components and a set of oligo dT-based primers ending in G, C or A which are collectively termed Bam T17V (5' CGC GGA TCC GT1~ V) 3') [Seq >D No. 19] These Applicant Ref. 0561-PCT
primers are disclosed in published PCT Application No. U596/08582. First strand cDNA
was oligo dC-tailed and then column purified using commercially available components.
- (Life Technologies, Gaithersburg). The product of this reaction was then used in PCR
with primer set Bam G13H, an equimolar mixture of oligo dG primers ending in A, C, or T (5' TAA GGA TCC TG~3 H 3') [Seq. ID NO: 20], and a second gene specific primer, PHN 11813 [Seq ID No. f1 ]. Amplified products were characterized by Southern analysis using the protocol as described in Ausubel, et al. (eds.), 1989, Current Protocols in Molecular Biology, pages 2Ø1 - 2.12.5.
Hybridization of the 5' RACE product was done using the PCR amplified genomic DNA fragment (Table 4) as a radiolabeled probe. A 0.6 kb band was amplified by this reaction and was strongly labeled with the probe. No other bands appeared. This 0.6 kb band was ligated into the PCR II vector using the TA-cloning procedure, transformed into DHSI, cloned and sequenced. The DNA sequence analysis of the 0.6 kb PCR fragment showed it was homologous to the APOXD sequence data shown in Table 2.
5' RACE Primers SEQ. ID
No.
Bam T17V 5' CGC GGA TCC GT1,V 3' 19 Bam G13H 5' TAA GGA TCC TG13H 3' 2Q
PHN 118135' CAU CAU CAU CAU TAC CTC GAT GCG AAT 21 GTG 3' IUPAC Symbols: V=G,C, or A; H=A, T, or C.
PCR For Full Length The 5' and 3' RACE products were sequenced to their ends as determined by the initiating methionine and the poly-A tail respectively. DNA sequence at each end was analyzed by Oligo 4.0 for oligonucleotide primer design in preparation for PCR to obtain the complete gene.
Primer PHN 12566 designed to the 3' end of the sequence, was used to reverse transcribe total RNA. Primers PHN 12565 and PHN 12567 were used to amplify first strand cDNA. The PCR amplified band was ligated into PCR II using the TA
a Applicant Ref. 0561-PCT
cloning kit (In Vitrogen; San Diego, CA) then transformed into DHSI, cloned, and sequenced.
Full Length cDNA Primers (5'-~3') sEQ: ID
No.
GTA
CC
CCG
A 1.4 kb band was amplified which stained very intensely with ethidium bromide. Other, smaller bands were present, but clearly, the 1.4 kb band was prominent.
This band was sequenced and subjected to open reading frame analysis. All of the protein fragments originally sequenced (Table 2), were found in the deduced amino acid sequence of this PCR product.
Southern analysis was performed on genomic DNA using the 1.4 kb cDNA
as a radiolabeled probe. Only one band hybridized, suggesting that the gene is a single copy and unique in the A. phoenices genome.
Table 1 (pages 4-7) shows the full length cDNA sequence [Seq ID No:l]
and deduced amino acid sequence [Seq ID No:2] of the A. phoehices oxalate decarboxylase gene as amplified, using PCR primers PHN 12565 and PHN 12567.
The underlined amino acid sequences were represented in the original protein sequence analysis data (Table 2). The protein sequence encoded by the full length cDNA
includes a pre-protein, amino acid residues 27-458 [Seq ID No:4], and a mature protein, amino acid residues 50-458 [Seq ID No:S].
Example 2 Transformed plant tissue degrades oxalate CaMV35S/0'/APOXD
The insert ofpPHP9685 (1.4 kb APOXD cDNA in pCR II) was placed into a cloning vector intermediate (pLitmus 28, New England Biolabs) between a plant Applicant Ref. 0561-PCT
expressible promoter and 3' region as shown in the construction diagrams of Figure 3.
The upstream region consists of a cauliflower mosaic virus 35S promoter with a - duplicated enhancer region (2X35S; bases -421 to -90 and -421 to +2, Gardner, et al., 1985, Nucleic Acids Res. 9:2871-2888) with a flanking 5' Notl site and a 3' Pst site, and S2' RNA leader sequence. The 3' region is from potato proteinase inhibitor II.
These are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313. The 2X CaMV
promoter is described in Odell, et al., Nature 313:810-812.
The plant-expressible APOXD gene cassette was then isolated from the cloning intermediate and ligated into the ALS::NPT II:: PIN II-containing pBINl9 construct, pPHP8110. Plasmid pPHP8110 was created from pBIN 19 (Bevan, 1984, Nucleic Acids Res. 12:8711-8721) by replacing the NOS::NPTII::NOS gene cassette in pBINI9 with an ALS::NPTII::PINII cassette. As shown in Figure 3, pPHP8110 is a derivative of pBINl9 containing the NPT II gene, the aminoglycoside-3'-O-phosphotransferase coding sequence, bases 1551 to 2345 from E.coli transposon (Genbank Accession Number V00004, Beck, et al., 1982, Gene 19:327-336). The second amino acid was modified from an isoleucine to a valine in order to create a Nco I
restriction site which was used to make a translational fusion with the ALS
promoter (see copending U.S. Patent Application Serial No. 08/409,297). pPHP8110 further contains the potato proteinase inhibitor II terminator (PIN II) bases 2-310, as described in An, et al., 1989, Plant Cell 1:115-122.
As shown in Figure 4, the resultant plasmid, pPHP9723, carries the APOXD gene construct, together with the NPTII gene for selection of transgenic plant cells, positioned between Agrobacterium T-DNA borders.
Germin/APOXD
A second APOXD cDNA containing plasmid was constructed using the methods described above for producing pPHP 9723. In the second construct, the APOXD
fungal signal and presequence (49 amino acids) were replaced with a plant signal sequence obtained from the 5' end of an enzyme subunit of wheat oxalate oxidase. (Lane, et al., 1991, J. Biol. Chem. 266:10461.) This was accomplished by designing primers that were homologous to the Germin signal sequence, and having extensions to provide the addition of a Sal I restriction site at the 5' end and APOXD 5' sequence followed by a Applicant Ref. 0561-PCT
Nru I site at the 3' end. The primers were used to amplify the Germin signal sequence and are shown below in Table 8.
. Table 8 Germin Signal Sequence Primers (5'-3') Seq ID
No.
GAC
CCT TC
AT CCT
GGG TAG CC AAAA CAG CT GGAG
The amplified Germin signal sequence product [Seq ID N0:27] shown below in Table 9, and a vector containing the full length APOXD cDNA
(pPHP9648) were each digested with Sal I and Nru I. A ligation reaction was set up with the digested fragments to form a Germin signal sequence - APOXD coding sequence fusion construct.
Clones of the correct size were sequenced to verify correct results.
As shown in Table 9, the SaIIlNruI cut Germin SS - containing sequence also contained modif ed APOXD codons matched to fill in the NruI-cut APOXD
sequence. The Germin signal sequence [Seq. m No: 28] is shown in lower case.
Table 9 Amplified Germin Signal/APOXD Sequence*
Sa.ZI start 51 TACAATTACT ATTTACAATT ACAGTCGACC CGGGATCC atg ggt tac 98 tca aag acc ttg gtt get ggt ttg ttc get atg ttg ttg 137 ttg get cca get gtt ttg get acc ICAG GAT AAG CCT TTC
NruI
Applicant Ref. 0561-PCT
*The SaII (GTCGAC) and NruI (TCGCGA) restriction sites are underlined, the Germin signal sequence is in lower case, with the Germin start site in bold. APOXD
sequences modified in the PCR primer design are shown in bold.
This fusion gene was placed in the binary T-DNA plasmid to produce plasmid pPHP9762 carrying the fusion gene and the plant expressible NPTII gene positioned between Agrobacterium T-DNA borders, as described above.
Agrobacterium tumefaciens strain EHA105 (as described in Hood, et al., 1993, Transgen. Res. 2:208-218) was transformed with kanamycin resistant binary T-DNA vectors carrying the different versions of APOXD. Transformation was accomplished by the freeze-thaw method of Holsters, et al., 1978, Mol. Gen.
Genetics 1:181-7. The transformed isolates were selected on solidified 60A (YEP; 10 g/1 yeast extract, 10 g/1 bactopeptone, 5 g/1 NaCI, pH 7.0) medium with 50 mg/1 kanamycin.
Transformed bacteria were cultured in liquid culture of YEP medium containing 50 mg/1 I5 kanamycin, to log phase growth (O.D.6o~ 0.5-1.0) for use in plant transformations. Binary plasmids were re-isolated from transformed Agrobacterium to verify that integrity was maintained throughout the transformation procedures.
Sunflower leaf discs were obtained by harvesting leaves which were not fully expanded, sterilizing the surface in 20% bleach with TWEEN 20, and punching discs out of the leaf with a paper punch. Agrobacterium suspensions were centrifuged and resuspended in inoculation medium (12.5 ~M MES buffer, pH 5.7, 1 g/1 NH4C1, 0.3 g/1 MgS04) to a calculated OD6oo of 0.75 as described in Malone-Schoneberg, et al., 1994, Plant Science 103:199-207. Leaf discs were inoculated in the resuspended Agrobacterium for 10 minutes then blotted on sterile filter paper.
The tissue and bacteria were co-cultivated on 527 for 3 days, then transferred to 527E medium for the selection of transgenic plant cells. After 2 weeks of Applicant Ref. 0561-PCT
culture, the transgenic callus nodes were removed from the leaf disc and subcultured on fresh 527E medium. A number of subcultures were repeated prior to the assay of the callus tissue for enzyme activity.
To assay for enzyme activity, callus was harvested, snap frozen in liquid nitrogen, lyophilized to dryness and powdered. A quantity of 0.75 mg of powder from each prepared callus line was added to 1.0 ml reaction mixture (900 p1 200 mM
NaP04, pH 5.0, 100 p1 10 mM Na-oxalate pH 5.0). The reaction proceeded for 3 hours at room temperature and was stopped by the addition of 150 p,1 of 1M TRIS-HCI, pH 7Ø
Each sample was spun at 14,000 rpm for one minute and 1 ml was removed to a cuvette. One hundred (100) ~l of (3-NAD (6.6 mg/ml stock) and 50 p1 formate dehydrogenase (4.0 mg/ml stock) were added and the increase in absorbance was measured at 340 nm. A
slope was generated for each sample as well as for a formate standard curve.
Assay results were reported as pM oxalate metabolized /mg powder.
The results of the leaf disk assay are shown below in Table 10, and demonstrate that the APOXD gene sequence produces enzyme that is active in transgenic callus. No activity was seen in control callus, or callus transformed with the native APOXD gene (pPHP 9723).
Table 10 Oxalate Decarboxylase Activity in Trans~enic Sunflower Tissue Callus Line Binary Vector Activity pM oxalate/min/mg SMF3 None 0 9723 -1 pPHP 9723 0 -2 pPHP 9723 0 -3 pPHP 9723 0 9762-1 pPHP 9762 1.35 -2 pPHP 9762 1.40 -3 pPHP 9762 0.87 -4 pPHP 9762 0.81 -5 pPHP 9762 0.81 -6 pPHP 9762 0.90 Applicant Ref. 0561-PCT
Example 3 Transgenic Sunflower Plants Expressing APOXD
Sunflower plants were transformed using a basic transformation protocol involving a combination of wounding by particle bombardment, followed by use of Agrobacterium for DNA delivery, as described by Bidney, et al. Plant Mol.
Biol. 18:301 313. The plasmid pPHP9762, as described above for Example 2 and shown in Figure 5, was used in these experiments. pPHP9762 contains the APOXD gene with the fungal signal and presequence replaced with the Germin signal sequence and a plant expressible NPTII gene which provides kanamycin resistance to transgenic plant tissues.
Procedures for preparation of Agrobacterium and preparation of particles for wounding are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313. The Pioneer sunflower line SMF3, used in these experiments, is described in Burros, et al., 1991, Plant Cell Rep. 10:161-166. The Agrobacterium strain used in these experiments, EHA 105. Procedures for use of the helium gun, intact meristem preparation, tissue culture and co-cultivation conditions, as well as recovery of transgenic plants, are described in Bidney, et al., 1992, Plant Mol. Biol. 18:301-313.
Sunflower explants were prepared by imbibing seed overnight, removing the cotyledons and radical tip, then culturing overnight on medium containing plant growth regulators. Primary leaves were then removed and explants arranged in the center of a petri plate for bombardment. The PDS 1000 helium-driven particle bombardment device (Bio-Rad) was used with 600 psi rupture discs and a vacuum of 26 inches, Hg to bombard meristem explants twice on the highest shelf position. Following bombardment, log phase Agrobacterium cultures transformed with the APOXD-plasmid pPHP 9762, as described for Example 2, were centrifuged and resuspended at a calculated OD600 (vis) of 4.0 in inoculation buffer. Agrobacterium was then dropped onto the meristem explants using a fine tipped pipettor. Inoculated explants were co-cultured for three days then transferred to medium containing 50 mg/1 kanamycin and 250 mg/1 cefotaxime for selection. Explants were cultured on this medium for two weeks then transferred to the same medium, but lacking kanamycin. Green, kanamycin-resistant shoots were recovered to the greenhouse and assayed by an NPTII ELISA assay to verify transformation.
Oxalate decarboxylase enzyme assays are performed on these plants and/or progeny to confirm the expression of APOXD.
Applicant Ref. 0561-PCT
The invention has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: PIONEER HI-BRED INTERNATIONAL, INC.
(ii) TITLE OF INVENTION: GENE ENCODING OXALATE DECARBOXYLASE FROM
ASPERGILLUS PHOENICES
(iii) NUMBER OF SEQUENCES: 30 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SMART & BIGGAR
(B) STREET: P.O. BOX 2999, STATION D
(C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,350,328 (B) FILING DATE: 26-JUN-2001 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: SMART & BIGGAR
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 75529-56 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-232-2486 (B) TELEFAX: (613)-232-8440 (2) INFORMATION FOR SEQ ID NO.: 1:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 1438 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (24)...(1394) (ix) FEATURE
(A) NAME/KEY: mat_peptide (B) LOCATION: (171)...(1394) (ix) FEATURE
(A) NAME/KEY: sig~eptide (B) LOCATION: (24)...(101) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 2:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 457 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 3:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 26 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3:
Met Gln Leu Thr Leu Pro Pro Arg Gln Leu Leu Leu Ser Phe Ala Thr Val Ala Ala Leu Leu Asp Pro Ser His Gly (2) INFORMATION FOR SEQ ID NO.: 4:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 431 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 4:
Gly Pro Val Pro Asn Glu Ala Tyr Gln Gln Leu Leu Gln Ile Pro Ala Ser Ser Pro Ser Ile Phe Phe Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 5:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 408 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 5:
Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln 20 Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Glu Gly Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp His Arg Glu Ala Glu Trp Ala Tyr Val Leu Ala Gly Arg Val Arg Val Thr Gly Leu Asp Leu Glu Gly Gly Ser Phe Ile Asp Asp Leu Glu Glu Gly Asp Leu Trp Tyr Phe Pro Ser Gly His Pro His Ser Leu Gln Gly Leu Ser Pro Asn Gly Thr Glu Phe Leu Leu Ile Phe Asp Asp Gly Asn Phe Ser Glu Glu Ser Thr Phe Leu Leu Thr Asp Trp Ile Ala His Thr Pro Lys Ser Val Leu Ala Gly Asn Phe Arg Met Arg Pro Gln Thr Phe Lys Asn Ile Pro Pro Ser Glu Lys Tyr Ile Phe Gln Gly Ser Val Pro Asp Ser Ile Pro Lys Glu Leu Pro Arg Asn Phe Lys Ala Ser Lys Gln Arg Phe Thr His Lys Met Leu Ala Gln Lys Pro Glu His Thr Ser Gly Gly Glu Val Arg Ile Thr Asp Ser Ser Asn Phe Pro Ile Ser Lys Thr Val Ala Ala Ala His Leu Thr Ile Asn Pro Gly Ala Ile Arg Glu Met His Trp His Pro Asn Ala Asp Glu Trp Ser Tyr Phe Lys Arg Gly Arg Ala Arg Val Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg Thr Phe Asp Tyr Val Ala Gly Asp Val Gly Ile Val Pro Arg Asn Met Gly His Phe Ile Glu Asn Leu Ser Asp Asp Glu Glu Val Glu Val Leu Glu Ile Phe Arg Ala Asp Arg Phe Arg Asp Phe Ser Leu Phe Gln Trp Met Gly Glu Thr Pro Gln Arg Met Val Ala Glu His Val Phe Lys Asp Asp Pro Asp Ala Ala Arg Glu Phe Leu Lys Ser Val Glu Ser Gly Glu Lys Asp Pro Ile Arg Ser Pro Ser Glu (2) INFORMATION FOR SEQ ID NO.: 6:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: VARIANT
(B) LOCATION: (1)...(27) (C) OTHER INFORMATION: Xaa = Any Amino Acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6:
Gln Asp Lys Pro Phe Thr Pro Asp His Arg Asp Pro Tyr Asp His Lys Val Asp Ala Ile Gly Glu Xaa His Glu Pro Leu (2) INFORMATION FOR SEQ ID NO.: 7:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 12 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7:
Val Ile Arg Glu Leu His Trp His Arg Glu Ala Gly (2) INFORMATION FOR SEQ ID NO.: 8:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 17 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 8:
Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Cys His Arg Glu Ala Glu (2) INFORMATION FOR SEQ ID NO.: 9:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 20 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 9:
Ser Tyr Phe Lys Arg Gly Arg Ala Arg Tyr Thr Ile Phe Ala Ala Glu Gly Asn Ala Arg 30 (2) INFORMATION FOR SEQ ID NO.: 10:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 12 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 10:
40 Ser Ala His Thr Pro Pro Ser Val Leu Ala Gly Asn (2) INFORMATION FOR SEQ ID NO.: 11:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 35 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
50 (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(35) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 11:
(2) INFORMATION FOR SEQ ID NO.: 12:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 35 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
( i. i ) MOLECULE TYPE : DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(35) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 12:
(2) INFORMATION FOR SEQ ID NO.: 13:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 22 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 13:
(2) INFORMATION FOR SEQ ID NO.: 14:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 21 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(v1) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 14:
(2) INFORMATION FOR SEQ ID NO.: 15:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 440 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (3)...(197) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (259)...(360) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 15:
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp (2) INFORMATION FOR SEQ ID NO.: 16:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 99 40 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus phoenices (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 16:
Asp His Lys Val Asp Ala Ile Gly Glu Gly His Glu Pro Leu Pro Trp 50 Arg Met Gly Asp Gly Ala Thr Ile Met Gly Pro Arg Asn Lys Asp Arg Glu Arg Gln Asn Pro Asp Met Leu Arg Pro Pro Ser Thr Asp His Gly Asn Met Pro Asn Met Arg Trp Ser Phe Ala Asp Ser His Ile Arg Ile Glu Gly Trp Thr Arg Gln Thr Thr Val Arg Glu Leu Pro Thr Ser Lys Glu Leu Ala Gly Val Asn Met Arg Leu Asp Glu Gly Val Ile Arg Glu Leu His Trp (2) INFORMATION FOR SEQ ID NO.: 17:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 19 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 17:
(2) INFORMATION FOR SEQ ID NO.: 18:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 30 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 18:
(2) INFORMATION FOR SEQ ID NO.: 19:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 28 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 19:
(2) INFORMATION FOR SEQ ID NO.: 20:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 24 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 20:
(2) INFORMATION FOR SEQ ID NO.: 21:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 30 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 21:
(2) INFORMATION FOR SEQ ID NO.: 22:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 28 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 22:
(2) INFORMATION FOR SEQ ID NO.: 23:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 26 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 23:
(2) INFORMATION FOR SEQ ID NO.: 24:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 24:
(2) INFORMATION FOR SEQ ID NO.: 25:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 38 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 25:
(2) INFORMATION FOR SEQ ID NO.: 26:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 56 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 26:
(2) INFORMATION FOR SEQ ID NO.: 27:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 507 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Triticum aestivum (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 27:
(2) INFORMATION FOR SEQ ID NO.: 28:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 72 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Triticum aestivum (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 28:
(2) INFORMATION FOR SEQ ID NO.: 29:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 16 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(16) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 29:
(2) INFORMATION FOR SEQ ID NO.: 30:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 17 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Artificial Sequence (ix) FEATURE
(C) OTHER INFORMATION: primer (ix) FEATURE
(A) NAME/KEY: misc_feature (B) LOCATION: (1). .(17) (C) OTHER INFORMATION: n = A,T,C or G
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 30:
Claims (28)
1. An isolated nucleic acid, having the sequence of the Aspergillus phoenices oxalate decarboxylase insert in the plasmid ATCC No. 97959.
2. An isolated nucleic acid selected from the group consisting of:
a) the nucleic acid shown in SEQ ID NO: 1; and b) nucleotide 171 to 1437 of the nucleic acid shown in SEQ ID NO: 1.
a) the nucleic acid shown in SEQ ID NO: 1; and b) nucleotide 171 to 1437 of the nucleic acid shown in SEQ ID NO: 1.
3. An isolated nucleic acid comprising a nucleic acid encoding the polypeptide of SEQ ID NO: 2 or SEQ ID NO: 5.
4. The nucleic acid of any of the claims 1, 2, or 3, further comprising a plant signal sequence.
5. A vector for delivery of a nucleic acid to a host cell, the vector comprising the nucleic acid of any of the claims 1, 2, or 3.
6. A host cell containing the vector of claim 5.
7. A host cell transformed with the nucleic acid of any of the claims 1, 2, or 3.
8. The host cell of claim 7, wherein the cell is a plant cell.
9. The host cell of claim 8, wherein the nucleic acid further comprises a plant signal sequence.
10. The host cell of claim 9, wherein said plant signal sequence comprises the Germin signal sequence contained in SEQ ID NO: 28.
11. The host cell of claim 8, wherein the plant is selected from the group consisting of sunflower, bean, canola, alfalfa, soybean, flax, safflower, peanut and clover.
12. A plant cell transformed with a nucleic acid comprising the nucleic acid of any of the claims 1, 2, or 3.
13. A plant having stably incorporated within its genome a nucleic acid comprising the nucleic acid of any of the claims 1, 2, or 3.
14. The plant of claim 13, wherein said nucleic acid further comprises a plant signal sequence.
15. The plant of claim 14, wherein said plant signal sequence comprises the Germin signal sequence contained in SEQ ID NO: 28.
16. A method for degrading oxalic acid comprising the steps of:
a) transforming a plant with a nucleic acid, wherein the nucleic acid is the nucleic acid of any of the claims 1, 2, or 3; and b) expressing said nucleic acid for a time sufficient to allow for degradation of the oxalic acid.
a) transforming a plant with a nucleic acid, wherein the nucleic acid is the nucleic acid of any of the claims 1, 2, or 3; and b) expressing said nucleic acid for a time sufficient to allow for degradation of the oxalic acid.
17. The method of claim 16, wherein said nucleic acid further comprises a plant signal sequence.
18. The method of claim 17, wherein said plant signal sequence comprises the Germin signal sequence contained in SEQ ID NO: 28.
19. The method of claim 16, wherein said plant is selected from the group consisting of sunflower, bean, canola, alfalfa, soybean, flax, safflower, peanut and clove.
20. The method of claim 19, wherein said plant is sunflower.
21. A method of identifying transformed plant cells comprising the steps of:
a) culturing cells from a selected target plant in a culture medium;
b) transforming said cells with at least one copy of an expression cassette comprising the nucleic acid of any of the claims 1, 2, or 3;
c) introducing oxalic acid into the culture medium; and d) identifying transformed cells as the surviving cells in the oxalic acid-treated culture.
a) culturing cells from a selected target plant in a culture medium;
b) transforming said cells with at least one copy of an expression cassette comprising the nucleic acid of any of the claims 1, 2, or 3;
c) introducing oxalic acid into the culture medium; and d) identifying transformed cells as the surviving cells in the oxalic acid-treated culture.
22. An isolated nucleic acid encoding a signal peptide comprising nucleotide 24 to 101 of SEQ ID NO: 1.
23. An isolated nucleic acid which hybridizes under highly stringent conditions to SEQ ID NO: 1.
24. An isolated nucleic acid which is at least 80% identical to SEQ ID NO: 1.
25. An isolated nucleic acid which is at least 90% identical to SEQ ID NO: 1.
26. A method of preventing pathogenic disease in plants where oxalic acid plays a role in pathogenesis, the method comprising:
a) transforming a plant cell with a nucleic acid of any of the claims 1, 2, or 3;
b) culturing the plant cell under plant growing conditions to produce a regenerated plant; and c) inducing expression of said polypeptide for a time sufficient to prevent pathogenic disease.
a) transforming a plant cell with a nucleic acid of any of the claims 1, 2, or 3;
b) culturing the plant cell under plant growing conditions to produce a regenerated plant; and c) inducing expression of said polypeptide for a time sufficient to prevent pathogenic disease.
27. The method of claim 26 wherein the plant cell is selected from the group consisting of sunflower, bean, canola, alfalfa, soybean, flax, safflower, peanut and clove.
28. The method of claim 26- wherein the pathogenic disease is caused by Sclerotinia or Sclerotium.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA 2350328 CA2350328A1 (en) | 2001-06-26 | 2001-06-26 | Gene encoding oxalate decarboxylase from aspergillus phoenices |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA 2350328 CA2350328A1 (en) | 2001-06-26 | 2001-06-26 | Gene encoding oxalate decarboxylase from aspergillus phoenices |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2350328A1 true CA2350328A1 (en) | 2002-12-26 |
Family
ID=4169264
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA 2350328 Abandoned CA2350328A1 (en) | 2001-06-26 | 2001-06-26 | Gene encoding oxalate decarboxylase from aspergillus phoenices |
Country Status (1)
Country | Link |
---|---|
CA (1) | CA2350328A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2433260A (en) * | 2005-12-16 | 2007-06-20 | Mologic Ltd | A selectable decarboxylase marker |
-
2001
- 2001-06-26 CA CA 2350328 patent/CA2350328A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2433260A (en) * | 2005-12-16 | 2007-06-20 | Mologic Ltd | A selectable decarboxylase marker |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3253071B2 (en) | Site-specific recombination of DNA in plant cells | |
EP0359617A2 (en) | Stress-tolerant plants | |
JP2002520058A (en) | Aminopolyolamine oxidase polynucleotides and related polypeptides and methods of use | |
US20210102218A1 (en) | Expression of transcription regulators that provide heat tolerance | |
AU716814B2 (en) | Chimeric genes comprising a fungus-responsive element | |
US6303846B1 (en) | Gene encoding oxalate decarboxylase from aspergillus phoenices | |
AU718274B2 (en) | Antifungal proteins, DNA coding therefore, and hosts incorporating same | |
CA2780989A1 (en) | Angiogenin expression in plants | |
JP5079799B2 (en) | Toxoflavin and its derivative-degrading gene tflA and transformed organisms expressing the same | |
AU746787B2 (en) | Salicylic acid pathway genes and their use for the induction of resistance in plants | |
CN101508980B (en) | Wheat cinnamyl alcohol dehydrogenase and its coding gene and application | |
JP2002520059A (en) | Aminopolyolamine oxidase polynucleotides and related polypeptides and methods of use | |
CA2350328A1 (en) | Gene encoding oxalate decarboxylase from aspergillus phoenices | |
US20040205842A1 (en) | Lipoxygenase overexpression in plants and reduction in plant sensitivity to diseases and to attacks from pathogenic organisms | |
US7314972B2 (en) | Transgenic plants expressing biocidal proteins and method of producing the same | |
AU769546B2 (en) | Method for obtaining transgenic plants expressing a protein with activity producing hydrogen peroxide by transformation by Agrobacterium rhizogenes | |
EP0945508A1 (en) | The insect-resistant use of sweet potato sporamin gene and method for controlling pests using the gene | |
CN119799718A (en) | Rice protein OsHIPP56 related to plant disease resistance and its encoding gene and application | |
WO2005037863A2 (en) | Alternative splicing factors polynucleotides, polypeptides and uses thereof | |
EA048048B1 (en) | PROMOTERS FOR REGULATION OF GENE EXPRESSION IN PLANTS | |
WO2010099320A1 (en) | Plant resistance to banana bunchy top virus | |
WO2003048365A1 (en) | Fungal elicitor | |
AU2015202777A1 (en) | Angiogenin expression in plants (2) | |
ZA200005115B (en) | Salicylic acid pathway genes and their use for the induction of resistance in plants. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |