CN113293173B - Agrobacterium tumefaciens binary expression vector - Google Patents
Agrobacterium tumefaciens binary expression vector Download PDFInfo
- Publication number
- CN113293173B CN113293173B CN202010104155.5A CN202010104155A CN113293173B CN 113293173 B CN113293173 B CN 113293173B CN 202010104155 A CN202010104155 A CN 202010104155A CN 113293173 B CN113293173 B CN 113293173B
- Authority
- CN
- China
- Prior art keywords
- gene
- agrobacterium
- vector
- nucleotide sequence
- tobacco
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000013604 expression vector Substances 0.000 title claims abstract description 30
- 241000589155 Agrobacterium tumefaciens Species 0.000 title abstract 2
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 96
- 241000589158 Agrobacterium Species 0.000 claims abstract description 43
- 241000208125 Nicotiana Species 0.000 claims abstract description 32
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims abstract description 32
- 230000014509 gene expression Effects 0.000 claims abstract description 30
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 22
- 239000003550 marker Substances 0.000 claims abstract description 18
- 238000012216 screening Methods 0.000 claims abstract description 16
- 101100192402 Arabidopsis thaliana NPF2.9 gene Proteins 0.000 claims abstract description 12
- 101710136524 X polypeptide Proteins 0.000 claims abstract description 8
- 108090000121 Aromatic-L-amino-acid decarboxylases Proteins 0.000 claims description 16
- 230000004927 fusion Effects 0.000 claims description 16
- 239000002773 nucleotide Substances 0.000 claims description 15
- 125000003729 nucleotide group Chemical group 0.000 claims description 15
- 102000003823 Aromatic-L-amino-acid decarboxylases Human genes 0.000 claims description 14
- 238000003786 synthesis reaction Methods 0.000 claims description 12
- AMBQHHVBBHTQBF-UHFFFAOYSA-N Loganin Natural products C12C(C)C(O)CC2C(C(=O)OC)=COC1OC1OC(CO)C(O)C(O)C1O AMBQHHVBBHTQBF-UHFFFAOYSA-N 0.000 claims description 8
- AMBQHHVBBHTQBF-UOUCRYGSSA-N loganin Chemical compound O([C@@H]1OC=C([C@H]2C[C@H](O)[C@H](C)[C@H]21)C(=O)OC)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O AMBQHHVBBHTQBF-UOUCRYGSSA-N 0.000 claims description 8
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 229930013930 alkaloid Natural products 0.000 claims description 4
- RHCSKNNOAZULRK-APZFVMQVSA-N 2,2-dideuterio-2-(3,4,5-trimethoxyphenyl)ethanamine Chemical compound NCC([2H])([2H])C1=CC(OC)=C(OC)C(OC)=C1 RHCSKNNOAZULRK-APZFVMQVSA-N 0.000 claims description 3
- 102000004366 Glucosidases Human genes 0.000 claims description 3
- 108010056771 Glucosidases Proteins 0.000 claims description 3
- 108090000836 Nitrate Transporters Proteins 0.000 claims description 3
- BLGXFZZNTVWLAY-DIRVCLHFSA-N rauwolscine Chemical class C1=CC=C2C(CCN3C[C@H]4CC[C@H](O)[C@H]([C@H]4C[C@H]33)C(=O)OC)=C3NC2=C1 BLGXFZZNTVWLAY-DIRVCLHFSA-N 0.000 claims description 3
- 150000003797 alkaloid derivatives Chemical class 0.000 claims description 2
- 229920001184 polypeptide Polymers 0.000 claims description 2
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 2
- 108010078791 Carrier Proteins Proteins 0.000 claims 1
- 102000014914 Carrier Proteins Human genes 0.000 claims 1
- 229910002651 NO3 Inorganic materials 0.000 claims 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 claims 1
- 239000013598 vector Substances 0.000 abstract description 37
- 102000004169 proteins and genes Human genes 0.000 abstract description 19
- 238000011065 in-situ storage Methods 0.000 abstract description 4
- CJDRUOGAGYHKKD-RQBLFBSQSA-N 1pon08459r Chemical compound CN([C@H]1[C@@]2(C[C@@]3([H])[C@@H]([C@@H](O)N42)CC)[H])C2=CC=CC=C2[C@]11C[C@@]4([H])[C@H]3[C@H]1O CJDRUOGAGYHKKD-RQBLFBSQSA-N 0.000 abstract 2
- CJDRUOGAGYHKKD-UHFFFAOYSA-N Iso-ajmalin Natural products CN1C2=CC=CC=C2C2(C(C34)O)C1C1CC3C(CC)C(O)N1C4C2 CJDRUOGAGYHKKD-UHFFFAOYSA-N 0.000 abstract 2
- 244000061121 Rauvolfia serpentina Species 0.000 abstract 2
- 229960004332 ajmaline Drugs 0.000 abstract 2
- 239000005090 green fluorescent protein Substances 0.000 description 17
- 241000196324 Embryophyta Species 0.000 description 14
- -1 Terpenoid indole alkaloids Chemical class 0.000 description 11
- 239000012634 fragment Substances 0.000 description 10
- 108020004414 DNA Proteins 0.000 description 8
- 238000000034 method Methods 0.000 description 8
- 238000010276 construction Methods 0.000 description 7
- 229930005303 indole alkaloid Natural products 0.000 description 7
- 240000001829 Catharanthus roseus Species 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 208000015181 infectious disease Diseases 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 241000863480 Vinca Species 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 108020005345 3' Untranslated Regions Proteins 0.000 description 4
- 108020003589 5' Untranslated Regions Proteins 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000006801 homologous recombination Effects 0.000 description 4
- 238000002744 homologous recombination Methods 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 101150108455 Sil1 gene Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 3
- 238000002705 metabolomic analysis Methods 0.000 description 3
- 230000001431 metabolomic effect Effects 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 101150017313 sls1 gene Proteins 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 230000010474 transient expression Effects 0.000 description 3
- XBAMJZTXGWPTRM-NTXHKPOFSA-N 3alpha(S)-strictosidine Chemical compound O([C@@H]1OC=C([C@H]([C@H]1C=C)C[C@H]1C2=C(C3=CC=CC=C3N2)CCN1)C(=O)OC)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O XBAMJZTXGWPTRM-NTXHKPOFSA-N 0.000 description 2
- 101000637861 Arabidopsis thaliana Thylakoid lumenal 19 kDa protein, chloroplastic Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 108050009160 DNA polymerase 1 Proteins 0.000 description 2
- 101000983560 Drosophila melanogaster RNA-binding protein cabeza Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 101000852980 Homo sapiens Interleukin-23 subunit alpha Proteins 0.000 description 2
- 101000607570 Mus musculus Unknown protein from 2D-PAGE of fibroblasts Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101000760984 Soil-borne wheat mosaic virus (strain United States/Nebraska/1981) Capsid protein Proteins 0.000 description 2
- 101000621441 Soil-borne wheat mosaic virus (strain United States/Nebraska/1981) Suppressor of RNA silencing Proteins 0.000 description 2
- OXAGNIAQEYWXSM-JJEHOMFVSA-N Strictosidine Natural products CC(=O)OC1=CO[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@@H](C=C)[C@@H]1C[C@@H]3NCCc4c3[nH]c5ccccc45 OXAGNIAQEYWXSM-JJEHOMFVSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- GKWYINOZGDHWRA-UHFFFAOYSA-N catharanthine Natural products C1C(CC)(O)CC(CC2C(=O)OC)CN1CCC1=C2NC2=CC=CC=C12 GKWYINOZGDHWRA-UHFFFAOYSA-N 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- XBAMJZTXGWPTRM-UHFFFAOYSA-N epi-strictosidinic acid methyl ester Natural products C=CC1C(CC2C3=C(C4=CC=CC=C4N3)CCN2)C(C(=O)OC)=COC1OC1OC(CO)C(O)C(O)C1O XBAMJZTXGWPTRM-UHFFFAOYSA-N 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000012215 gene cloning Methods 0.000 description 2
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 2
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 2
- 101150044508 key gene Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000012257 pre-denaturation Methods 0.000 description 2
- 238000011897 real-time detection Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 150000003505 terpenes Chemical class 0.000 description 2
- 235000007586 terpenes Nutrition 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- CZJDUZOWQVAEEV-XIEZEKGWSA-N (+)-19-epi-Ajmalicine Natural products O=C(OC)C=1[C@@H]2[C@@H]([C@@H](C)OC=1)C[N+]1[C@H](c3[nH]c4c(c3CC1)cccc4)C2 CZJDUZOWQVAEEV-XIEZEKGWSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 101100226347 Escherichia phage lambda exo gene Proteins 0.000 description 1
- 108010042634 F2A4-K-NS peptide Proteins 0.000 description 1
- 108050002220 Green fluorescent protein, GFP Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 229930189077 Rifamycin Natural products 0.000 description 1
- 241000589324 Solanum quadriloculatum Species 0.000 description 1
- 235000000644 Solanum quadriloculatum Nutrition 0.000 description 1
- 102000006853 Strictosidine synthase Human genes 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- JXLYSJRDGCGARV-WWYNWVTFSA-N Vinblastine Natural products O=C(O[C@H]1[C@](O)(C(=O)OC)[C@@H]2N(C)c3c(cc(c(OC)c3)[C@]3(C(=O)OC)c4[nH]c5c(c4CCN4C[C@](O)(CC)C[C@H](C3)C4)cccc5)[C@@]32[C@H]2[C@@]1(CC)C=CCN2CC3)C JXLYSJRDGCGARV-WWYNWVTFSA-N 0.000 description 1
- WVTGEXAIVZDLCR-UHFFFAOYSA-N Vindoline Natural products CC1C2CN3CCCC14CCC5Nc6ccccc6C25C34 WVTGEXAIVZDLCR-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- GRTOGORTSDXSFK-XJTZBENFSA-N ajmalicine Chemical compound C1=CC=C2C(CCN3C[C@@H]4[C@H](C)OC=C([C@H]4C[C@H]33)C(=O)OC)=C3NC2=C1 GRTOGORTSDXSFK-XJTZBENFSA-N 0.000 description 1
- 229940007897 ajmalicine Drugs 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- LSQZJLSUYDQPKJ-NJBDSQKTSA-N amoxicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=C(O)C=C1 LSQZJLSUYDQPKJ-NJBDSQKTSA-N 0.000 description 1
- 229960003022 amoxicillin Drugs 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 229930101531 artemisinin Natural products 0.000 description 1
- BLUAFEHZUWYNDE-NNWCWBAJSA-N artemisinin Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2OC(=O)[C@@H]4C BLUAFEHZUWYNDE-NNWCWBAJSA-N 0.000 description 1
- 229960004191 artemisinin Drugs 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000004531 blood pressure lowering effect Effects 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- CMKFQVZJOWHHDV-DYHNYNMBSA-N catharanthine Chemical compound C([C@@H]1C=C([C@@H]2[C@@]3(C1)C(=O)OC)CC)N2CCC1=C3NC2=CC=CC=C12 CMKFQVZJOWHHDV-DYHNYNMBSA-N 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 1
- 229960002064 kanamycin sulfate Drugs 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000011177 media preparation Methods 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- LSQZJLSUYDQPKJ-UHFFFAOYSA-N p-Hydroxyampicillin Natural products O=C1N2C(C(O)=O)C(C)(C)SC2C1NC(=O)C(N)C1=CC=C(O)C=C1 LSQZJLSUYDQPKJ-UHFFFAOYSA-N 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- ZLQMRLSBXKQKMG-UHFFFAOYSA-N rauniticine Natural products COC(=O)C1=CC2CC3N(CCc4c3[nH]c5ccccc45)CC2C(C)O1 ZLQMRLSBXKQKMG-UHFFFAOYSA-N 0.000 description 1
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 1
- 229960001225 rifampicin Drugs 0.000 description 1
- 229960003292 rifamycin Drugs 0.000 description 1
- HJYYPODYNSCCOU-ODRIEIDWSA-N rifamycin SV Chemical compound OC1=C(C(O)=C2C)C3=C(O)C=C1NC(=O)\C(C)=C/C=C/[C@H](C)[C@H](O)[C@@H](C)[C@@H](O)[C@@H](C)[C@H](OC(C)=O)[C@H](C)[C@@H](OC)\C=C\O[C@@]1(C)OC2=C3C1=O HJYYPODYNSCCOU-ODRIEIDWSA-N 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 235000017709 saponins Nutrition 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 108020005090 strictosidine synthase Proteins 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 229960003048 vinblastine Drugs 0.000 description 1
- JXLYSJRDGCGARV-XQKSVPLYSA-N vincaleukoblastine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1NC1=CC=CC=C21 JXLYSJRDGCGARV-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- CXBGOBGJHGGWIE-IYJDUVQVSA-N vindoline Chemical compound CN([C@H]1[C@](O)([C@@H]2OC(C)=O)C(=O)OC)C3=CC(OC)=CC=C3[C@]11CCN3CC=C[C@]2(CC)[C@@H]13 CXBGOBGJHGGWIE-IYJDUVQVSA-N 0.000 description 1
- GBABOYUKABKIAF-GHYRFKGUSA-N vinorelbine Chemical compound C1N(CC=2C3=CC=CC=C3NC=22)CC(CC)=C[C@H]1C[C@]2(C(=O)OC)C1=CC([C@]23[C@H]([C@]([C@H](OC(C)=O)[C@]4(CC)C=CCN([C@H]34)CC2)(O)C(=O)OC)N2C)=C2C=C1OC GBABOYUKABKIAF-GHYRFKGUSA-N 0.000 description 1
- 229960002066 vinorelbine Drugs 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
- C12N15/8205—Agrobacterium mediated transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/65—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Nutrition Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
技术领域technical field
本发明属于基因工程和生物合成技术领域,具体地说,涉及一种农杆菌双元表达载体,具体涉及一种提高农杆菌侵染植物检测效果和提高目的基因表达效果的农杆菌双元表达载体、及其在烟草中促进阿吗碱的异源生物合成的用途。The invention belongs to the technical field of genetic engineering and biosynthesis, and in particular relates to an Agrobacterium binary expression vector, in particular to an Agrobacterium binary expression vector which improves the detection effect of Agrobacterium-infected plants and improves the expression effect of target genes , and its use in promoting the heterologous biosynthesis of amoline in tobacco.
背景技术Background technique
萜类吲哚生物碱是一类重要的植物次级代谢产物,也是重要的药用化合物。部分化合物已应用于人类疾病的治疗和预防(吴世文et al.,2018):长春碱(vinblastine)、长春瑞滨(vinorelbine)和长春新碱(vincristine)可用于多种肿瘤/癌症的治疗;文多灵(vindoline)和长春质碱(catharanthine)也用于糖尿病的治疗;阿吗碱(ajmalicine)具有抗炎和降血压的作用。但是,萜类吲哚生物碱在植物体内的含量极低(约占鲜重的0.0002%),而且植物宿主多为木本,生长周期长,很难满足日益增长的市场需求。合成生物学是解决这一问题的有效手段。Terpenoid indole alkaloids are an important class of plant secondary metabolites and important medicinal compounds. Some compounds have been used in the treatment and prevention of human diseases (Wu Shiwen et al., 2018): vinblastine, vinorelbine and vincristine can be used in the treatment of various tumors/cancers; Vindoline and catharanthine are also used in the treatment of diabetes; ajmalicine has anti-inflammatory and blood pressure-lowering effects. However, the content of terpenoid indole alkaloids in plants is extremely low (accounting for about 0.0002% of fresh weight), and most of the plant hosts are woody plants with long growth cycle, so it is difficult to meet the growing market demand. Synthetic biology is an effective means to solve this problem.
基于代谢工程或合成生物学策略,异源合成阿吗碱是满足不断增长的市场需求的有效手段。目前,长春花中参与阿吗碱及其他异育亨宾生物碱的相关基因已经鉴定(Miettinen et al.,2014;Stavrinides et al.,2016;Stavrinides et al.,2015.),为后续阿吗碱及异育亨宾生物碱的合成生物学研究奠定了基础。Based on metabolic engineering or synthetic biology strategies, heterologous synthesis of amoline is an effective means to meet the growing market demands. At present, the genes involved in amoxicillin and other heterohimbine alkaloids in Vinca have been identified (Miettinen et al., 2014; Stavrinides et al., 2016; Stavrinides et al., 2015.), which is the follow-up study of amohimbine It laid the foundation for the study of synthetic biology of alkali and isoyohimbine alkaloids.
烟草因生物量大且易于表达植物来源的基因,是进行合成生物学研究的一个重要模式底盘系统。该系统已成功用于青蒿素及前体、紫杉醇前体、皂苷及环烯醚萜类化合物的异源合成(Farhi et al.,2011;Fuentes et al.,2016;Li et al.,2019;Malhotra etal.,2016;Miettinen et al.,2014;Paddon et al.,2013;Reed et al.,2017.),成为合成生物学研究中的重要植物模式底盘系统,为烟草中萜类吲哚生物碱的合成生物学研究奠定了坚实基础。Tobacco is an important model chassis system for synthetic biology research due to its large biomass and ease of expression of plant-derived genes. This system has been successfully used for the heterologous synthesis of artemisinin and its precursors, paclitaxel precursors, saponins and iridoids (Farhi et al., 2011; Fuentes et al., 2016; Li et al., 2019 ; Malhotra et al., 2016; Miettinen et al., 2014; Paddon et al., 2013; Reed et al., 2017.), an important plant model chassis system in synthetic biology research, for the terpene indole in tobacco The research on synthetic biology of alkaloids has laid a solid foundation.
专利文献CN202010092362.3公开了一种用于生产阿吗碱的烟草系统,将长春花中克隆获得的关键基因构建农杆菌表达载体,通过烟草瞬时表达体系在烟草中异源表达参与阿吗碱合成的关键基因,实现了烟草中阿吗碱的生物合成。Patent document CN202010092362.3 discloses a tobacco system for the production of amoline. The key gene cloned from periwinkle is constructed as an Agrobacterium expression vector, and heterologously expressed in tobacco through the tobacco transient expression system to participate in the synthesis of amoline The key gene for the biosynthesis of amoline in tobacco.
农杆菌侵染方法是进行植物稳定转化或瞬时转化的有效手段,农杆菌介导的瞬时转化系统是进行烟草合成生物学研究的一种有效手段。pEAQ系列载体和pCambia2301载体是两类通用的农杆菌双元表达载体,已成功用于农杆菌侵染实验。但是,这两类载体均存在一定的缺陷:pEAQ系列载体侵染烟草后,只能通过取样后的qRT-PCR或western-blot检测的方法确定目的基因的表达情况,无法实现目的蛋白表达情况的可视化;pCambia2301载体中引入了GUS筛选标记,可以通过GUS染色的方法,可视化地检测蛋白的表达情况,但是这种检测需要将样品取下后染色,无法原位、实时监测目的蛋白的表达情况。The Agrobacterium infection method is an effective method for stable or transient transformation of plants, and the Agrobacterium-mediated transient transformation system is an effective method for tobacco synthetic biology research. The pEAQ series vectors and pCambia2301 vectors are two general-purpose Agrobacterium binary expression vectors, which have been successfully used in Agrobacterium infection experiments. However, both types of vectors have certain defects: after the pEAQ series vectors infect tobacco, the expression of the target gene can only be determined by qRT-PCR or western-blot detection after sampling, and the expression of the target protein cannot be determined. Visualization: The GUS screening marker is introduced into the pCambia2301 vector, and the expression of the protein can be detected visually by GUS staining. However, this detection requires the sample to be removed and stained, and it is impossible to monitor the expression of the target protein in situ and in real time.
绿色荧光蛋白(Green fluorescent protein,GFP)是一种可视化的筛选标记,已成功用于多种植物中转基因植株筛选,或用作外源蛋白的表达筛选标签(Estrada-Navarrete et al.,2007;Harper et al.,1999;Sparkes et al.,2006)。但是,传统的GFP需要用激光共聚焦显微镜或在特定滤光片下才能观察到荧光的产生。为了更容易检测GFP的荧光,日本科学家构建了肉眼可见的GFP蛋白,并成功用于景观植物的构建(Chin etal.,2018)。为合成生物学研究过程中外源蛋白的实时检测奠定了基础。Green fluorescent protein (Green fluorescent protein, GFP) is a visual screening marker, which has been successfully used in the screening of transgenic plants in a variety of plants, or as an expression screening tag for foreign proteins (Estrada-Navarrete et al., 2007; Harper et al., 1999; Sparkes et al., 2006). However, traditional GFP needs to use a confocal laser microscope or under a specific filter to observe the generation of fluorescence. In order to detect the fluorescence of GFP more easily, Japanese scientists constructed a GFP protein visible to the naked eye and successfully used it in the construction of landscape plants (Chin et al., 2018). It laid the foundation for the real-time detection of foreign proteins in the process of synthetic biology research.
发明内容Contents of the invention
为了提高异源目的基因在植物比如烟草中的表达量、提高监测外源目的蛋白表达的可视化实时检测效果,发明人对常用的商业化农杆菌双元表达载体pCambia2301进行了系统优化,获得了表达量更高的载体系统;同时,引入了肉眼可见的绿色荧光蛋白GFP筛选标记,易于实时监测植物比如烟草中异源蛋白的表达情况。In order to improve the expression level of heterologous target genes in plants such as tobacco, and improve the effect of visual real-time detection for monitoring the expression of exogenous target proteins, the inventors systematically optimized the commonly used commercial Agrobacterium binary expression vector pCambia2301, and obtained expression At the same time, the green fluorescent protein GFP screening marker visible to the naked eye is introduced, which is easy to monitor the expression of heterologous proteins in plants such as tobacco in real time.
具体而言,本发明包含如下技术方案。Specifically, the present invention includes the following technical solutions.
一种农杆菌双元表达载体,其包含:删除了GUS筛选标记基因的载体pCambia2301骨架、35S启动子基因、P19蛋白编码基因和GFP筛选标记基因。An Agrobacterium binary expression vector, which comprises: the carrier pCambia2301 backbone in which the GUS selection marker gene has been deleted, a 35S promoter gene, a P19 protein coding gene and a GFP selection marker gene.
优选地,上述农杆菌双元表达载体中,35S启动子基因、P19蛋白编码基因与GFP筛选标记基因通过自剪切多肽2A相融合从而形成融合基因。Preferably, in the above-mentioned Agrobacterium binary expression vector, the 35S promoter gene, the P19 protein coding gene and the GFP selection marker gene are fused by self-
上述融合基因可以包括如下生物元件:35S启动子、5’UTR、P19蛋白、2A肽、eyGFPuv、3’UTR、NosT。这种情况下,所述融合基因结构为35S启动子-5’UTR-P19-2A肽-eyGFPuv-3’UTR-NosT。The above-mentioned fusion gene may include the following biological elements: 35S promoter, 5'UTR, P19 protein, 2A peptide, eyGFPuv, 3'UTR, NosT. In this case, the fusion gene structure is 35S promoter-5'UTR-P19-2A peptide-eyGFPuv-3'UTR-NosT.
其中P19来源于番茄丛矮病毒,可抑制宿主对外源基因的RNA沉默效应,提高异源基因转录本的稳定性,进而促进异源蛋白的表达。5’UTR是非编码区(UntranslatedRegion)即非翻译区,位于转录起始位点的下游,是转录的,但不翻译。3’UTR从编码区末端的终止密码子延伸至多聚A尾巴(Poly-A)的末端。eyGFPuv是一种在不可见紫外(UV)光激发下表现出明亮荧光的cPYGFP衍生物,该蛋白不仅具有强烈的荧光,而且激发光位于不可见的紫外光谱区。Nost是胭脂碱合成酶基因(nos)的终止子。P19蛋白、5’UTR、3’UTR和NosT的基因序列例如可以是在文献(Frank Sainsbury,Eva C.Thuenemann and GeorgeP.Lomonossoff,pEAQ:versatile expression vectors for easy and quick transientexpression of heterologous proteins in plants,Plant Biotechnology Journal(2009)7,682–693)中公开的序列。Among them, P19 is derived from tomato bush dwarf virus, which can inhibit the host's RNA silencing effect on foreign genes, improve the stability of heterologous gene transcripts, and promote the expression of heterologous proteins. 5'UTR is the non-coding region (UntranslatedRegion), that is, the untranslated region, which is located downstream of the transcription initiation site and is transcribed but not translated. The 3' UTR extends from the stop codon at the end of the coding region to the end of the poly-A tail (Poly-A). eyGFPuv is a cPYGFP derivative that exhibits bright fluorescence under the excitation of invisible ultraviolet (UV) light. The protein not only has strong fluorescence, but the excitation light is located in the invisible ultraviolet spectral region. Nost is the terminator of the nopaline synthase gene (nos). The gene sequences of P19 protein, 5'UTR, 3'UTR and NosT can be, for example, in the literature (Frank Sainsbury, Eva C.Thuenemann and GeorgeP.Lomonossoff, pEAQ: versatile expression vectors for easy and quick transient expression of heterologous proteins in plants, Plant Sequence published in Biotechnology Journal (2009) 7, 682-693).
自剪切多肽2A(self-cleaving 2A peptide,2A)或称可自切割2A peptide,其可以选自P2A、E2A、T2A、F2A,但不限于此。当2A肽为F2A时,上述融合基因结构为35S启动子-5’UTR-P19-F2A-eyGFPuv-3’UTR-NosT。Self-
优选地,上述融合基因的核苷酸序列为SEQ ID NO:1。Preferably, the nucleotide sequence of the fusion gene is SEQ ID NO:1.
这种情况下,构建出的农杆菌双元表达载体的核苷酸序列可以为SEQ ID NO:2。In this case, the nucleotide sequence of the constructed Agrobacterium binary expression vector can be SEQ ID NO:2.
在一种实施方式中,上述农杆菌双元表达载体可以克隆入与阿吗碱合成及其他萜类吲哚生物碱合成相关的基因。这些与阿吗碱合成相关的基因分别编码色氨酸脱羧酶(Tryptophan decaboxylase,TDC)、裂环马钱子苷合成酶(Secologanin synthase,SLS)、异胡豆苷合成酶(Strictosidine synthase,STR)、异胡豆苷beta-D型葡萄糖苷酶(Strictosidine beta-D-glucosidase,SGD)、硝酸盐转运蛋白NPF2.9、异育亨宾生物碱合酶HYS。这些酶和编码基因已经在专利文献CN202010092362.3中进行了公开,将其全部内容并入本发明中。In one embodiment, the above-mentioned Agrobacterium binary expression vector can be cloned into genes related to the synthesis of amoline and other terpenoid indole alkaloids. These genes related to the synthesis of amothine encode tryptophan decarboxylase (TDC), split loganin synthase (SLS), isososidine synthase (Strictosidine synthase, STR), Strictosidine beta-D-glucosidase (Strictosidine beta-D-glucosidase, SGD), nitrate transporter NPF2.9, isoyohimbine alkaloid synthase HYS. These enzymes and coding genes have been disclosed in the patent document CN202010092362.3, the entire contents of which are incorporated into the present invention.
当农杆菌双元表达载体包含两个以上上述基因时,两个以上基因可以通过2A肽基因相融合从而形成融合基因。When the Agrobacterium binary expression vector contains two or more of the above-mentioned genes, the two or more genes can be fused through the 2A peptide gene to form a fusion gene.
作为一种优选的实施方式,上述色氨酸脱羧酶TDC来源于长春花(Catharanthusroseus),其编码基因的核苷酸序列是SEQ ID NO:3。上述裂环马钱子苷合成酶SLS可以来源于长春花(Catharanthus roseus),或称SLS1,其编码基因的核苷酸序列是SEQ ID NO:4。上述异胡豆苷合成酶(STR)的氨基酸序列可以为GenBank登录号:CAA43936,其编码基因可以为GenBank登录号:X61932。上述异胡豆苷beta-D型葡萄糖苷酶SGD来源于长春花(Catharanthus roseus),其编码基因的核苷酸序列是SEQ ID NO:5。上述硝酸盐转运蛋白NPF2.9的氨基酸序列可以为GenBank登录号:AQM73449,其编码基因可以为GenBank登录号:KX372303。上述异育亨宾生物碱合酶HYS的氨基酸序列可以为GenBank登录号:ANQ45225,其编码基因可以为GenBank登录号:KU865325。As a preferred embodiment, the above-mentioned tryptophan decarboxylase TDC is derived from Catharanthus roseus, and the nucleotide sequence of its coding gene is SEQ ID NO:3. The above-mentioned split loganin synthase SLS may be derived from Catharanthus roseus, or SLS1, and the nucleotide sequence of its coding gene is SEQ ID NO:4. The amino acid sequence of the above-mentioned isosidine synthase (STR) can be GenBank accession number: CAA43936, and its coding gene can be GenBank accession number: X61932. The aforementioned isossididine beta-D glucosidase SGD is derived from Catharanthus roseus, and the nucleotide sequence of its coding gene is SEQ ID NO:5. The amino acid sequence of the above-mentioned nitrate transporter NPF2.9 can be GenBank accession number: AQM73449, and its coding gene can be GenBank accession number: KX372303. The amino acid sequence of the above-mentioned heteroyohimbe alkaloid synthase HYS can be GenBank accession number: ANQ45225, and its coding gene can be GenBank accession number: KU865325.
当克隆了上述TDC、SLS、STR、SGD、NPF2.9、HYS的编码基因(或称生物元件)后,上述农杆菌双元表达载体便可用作阿吗碱合成及其他萜类吲哚生物碱合成的专用载体。When the coding genes (or biological elements) of the above-mentioned TDC, SLS, STR, SGD, NPF2.9, and HYS are cloned, the above-mentioned Agrobacterium binary expression vectors can be used for amoline synthesis and other terpenoid indole biological A dedicated carrier for base synthesis.
本发明的第二个方面提供了一种转化了上述专用载体的农杆菌。The second aspect of the present invention provides an Agrobacterium transformed with the above special vector.
本发明的第三个方面提供了上述农杆菌在生产阿吗碱中的应用,即,使用上述的农杆菌侵染适合于表达阿吗碱或其他萜类吲哚生物碱的农作物,以该农作物作为生物合成系统生产阿吗碱或其他萜类吲哚生物碱。A third aspect of the present invention provides the application of the above-mentioned Agrobacterium in the production of amoline, that is, using the above-mentioned Agrobacterium to infect crops suitable for expressing amoline or other terpene indole alkaloids, and the crops As a biosynthetic system for the production of amoline or other terpenoid indole alkaloids.
优选地,上述农作物是烟草。Preferably, the aforementioned agricultural crop is tobacco.
例如,可以使用上述的农杆菌侵染烟草后,饲喂色氨酸和马钱苷(Loganin),收取烟草叶片并提取阿吗碱。For example, after infecting tobacco with the above-mentioned Agrobacterium, feeding tryptophan and loganin (Loganin), harvesting tobacco leaves and extracting amoline.
本发明以商业化的农杆菌双元表达载体pCambia2301为骨架,删除了GUS筛选标记,以便减小整个载体的大小;将抑制35S启动子沉默、具有增强表达作用的P19蛋白编码基因和紫外激发下肉眼可见的GFP筛选标记构建融合基因;进而构建出优化后的农杆菌双元表达载体pF08(pCambia2301-Optimized)。该优化的农杆菌双元表达载体既可提高异源基因的表达量,又可实时、原位监测外源蛋白的表达情况。通过将与阿吗碱合成及其他萜类吲哚生物碱合成相关的基因克隆入该优化载体中并在烟草进行瞬时表达,可构建出产阿吗碱烟草表达系统,提高了相关基因的表达量,促进了阿吗碱及其他萜类吲哚生物碱的异源生物合成,具有开发和应用潜力。The present invention uses the commercialized Agrobacterium binary expression vector pCambia2301 as the backbone, and deletes the GUS screening marker so as to reduce the size of the entire vector; The GFP screening marker visible to the naked eye was used to construct the fusion gene; then the optimized Agrobacterium binary expression vector pF08 (pCambia2301-Optimized) was constructed. The optimized Agrobacterium binary expression vector can not only increase the expression of heterologous genes, but also monitor the expression of foreign proteins in real time and in situ. By cloning genes related to the synthesis of amoline and other terpenoid indole alkaloids into the optimized vector and performing transient expression in tobacco, the expression system of amoline-producing tobacco can be constructed, and the expression of related genes can be increased. It promotes the heterologous biosynthesis of amoline and other terpenoid indole alkaloids, and has development and application potential.
附图说明Description of drawings
图1显示了本发明构建的优化载体农杆菌双元表达载体pCambia2301-Optimized(pF08)的结构示意图。Fig. 1 shows a schematic structural diagram of the optimized vector Agrobacterium binary expression vector pCambia2301-Optimized (pF08) constructed in the present invention.
图2显示了本发明构建的专用载体(pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS)在侵染烟草中GFP的表达情况(365nm紫外照射)。其中照片A是烟草叶片的部分区域注射侵染,绿色部分(浅色部分)为注射区域,表明GFP标签顺利表达;红色部分(深色部分)为未注射区域,表明无GFP标签的表达。照片B是整个叶片注射侵染,绿色(浅色)部分表明GFP标签顺利表达;红色(深色)部分表明注射部分的GFP标签未表达或者注射未能覆盖整个叶片。照片A和照片B验证了本发明构建的优化载体pF08的功能。Fig. 2 shows the expression situation of GFP (365nm ultraviolet irradiation ). Photo A is part of the injection infection of tobacco leaves, the green part (light part) is the injected area, indicating that the GFP tag is expressed smoothly; the red part (dark part) is the non-injected area, indicating that there is no expression of the GFP tag. Photo B is the injection infection of the whole leaf. The green (light color) part indicates that the GFP tag is successfully expressed; the red (dark color) part indicates that the GFP tag in the injected part is not expressed or the injection fails to cover the entire leaf. Photo A and Photo B verify the function of the optimized vector pF08 constructed by the present invention.
图3显示了克隆有TDC、SLS、STR、SGD、NPF2.9、HYS基因的原始载体pCambia2301(TDC-2A-SLS、STR-2A-NPF2.9、SGD-2A-HYS)侵染烟草中阿吗碱含量的测定与本发明构建的优化载体(pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS)侵染烟草中阿吗碱含量的测定对比情况。其中图A是侵染烟草中阿吗碱的定性分析结果,ajm:阿吗碱标准品;图B是侵染烟草中阿吗碱的含量检测(μg/g鲜重)结果。Figure 3 shows that the original vector pCambia2301 (TDC-2A-SLS, STR-2A-NPF2.9, SGD-2A-HYS) with TDC, SLS, STR, SGD, NPF2.9, HYS gene cloned infects tobacco in Arabidopsis The mensuration of moline content and the optimization carrier (pF08-TDC-2A-SLS, pF08-STR-2A-NPF2.9, pF08-SGD-2A-HYS) constructed by the present invention compare the mensuration of moline content in tobacco Condition. Among them, Figure A is the qualitative analysis result of amoline in infected tobacco, ajm: amoline standard; Figure B is the result of the content detection (μg/g fresh weight) of amoline in infected tobacco.
具体实施方式Detailed ways
本发明对商业化的原始农杆菌双元表达载体pCambia2301进行了系统优化,即以原始pCambia2301载体为骨架,删除原骨架中的GUS筛选标记(有2922个核苷酸),克隆入抑制35S启动子沉默的P19蛋白编码基因以及紫外激发下肉眼可见的GFP筛选标记基因eyGFPuv(或称eYGFPuv),获得了表达效果更高的载体系统pCambia2301-Optimized(pF08)。其中GFP筛选标记的引入方便了异源蛋白表达情况的实时监测。The present invention systematically optimizes the commercial original Agrobacterium binary expression vector pCambia2301, that is, the original pCambia2301 vector is used as the backbone, the GUS screening marker (2922 nucleotides) in the original backbone is deleted, and the 35S promoter is cloned into A vector system pCambia2301-Optimized (pF08) with higher expression effect was obtained from the silenced P19 protein coding gene and the GFP screening marker gene eyGFPuv (or called eYGFPuv) visible to the naked eye under ultraviolet excitation. The introduction of the GFP screening marker facilitates the real-time monitoring of the expression of the heterologous protein.
在优化载体中加载了异源蛋白基因后,提高了异源蛋白的表达量。该优化载体pF08的表达效果通过在烟草中的阿吗碱生物合成进行了验证,在载体pF08中克隆了与阿吗碱生物合成相关的目的基因TDC、SLS、STR、NPF2.9、SGD、HYS,构建出农杆菌,侵染烟草后,提高了阿吗碱的产量,证明这些异源蛋白的表达量明显提高。After the heterologous protein gene is loaded in the optimized vector, the expression level of the heterologous protein is increased. The expression effect of the optimized vector pF08 was verified by amoline biosynthesis in tobacco, and the target genes TDC, SLS, STR, NPF2.9, SGD, HYS related to amoline biosynthesis were cloned in the vector pF08 , Agrobacterium was constructed, and after infecting tobacco, the production of amoline was increased, which proved that the expression of these heterologous proteins was significantly increased.
在本文中,有时为了描述简便,会将某一蛋白比如TDC蛋白质名称与其编码基因(DNA)名称混用,本领域技术人员应能理解它们在不同描述场合表示不同的物质。例如,对于TDC(基因),用于描述色氨酸脱羧酶功能或类别时,指的是蛋白质;在作为一种基因描述时,指的是编码该色氨酸脱羧酶的基因,以此类推,这是本领域技术人员容易理解的。In this paper, sometimes for the sake of simplicity of description, the name of a certain protein such as TDC protein and the name of its coding gene (DNA) are mixed, and those skilled in the art should understand that they represent different substances in different description occasions. For example, TDC (gene), when used to describe the function or class of tryptophan decarboxylase, refers to the protein; when described as a gene, refers to the gene encoding that tryptophan decarboxylase, and so on , which is easily understood by those skilled in the art.
以下结合具体实施例对本发明做进一步详细说明。应理解,以下实施例仅用于说明本发明而非用于限定本发明的范围。The present invention will be described in further detail below in conjunction with specific examples. It should be understood that the following examples are only used to illustrate the present invention but not to limit the scope of the present invention.
本文的实施例中涉及到多种物质的添加量、含量及浓度,其中所述的百分含量,除特别说明外,皆指质量百分含量。The examples herein refer to the addition amount, content and concentration of various substances, and the percentage content mentioned therein refers to the mass percentage content unless otherwise specified.
实施例中,如果对于操作温度没有做出具体说明,则该温度通常指室温(15-30℃)。In the examples, if there is no specific statement about the operating temperature, the temperature usually refers to room temperature (15-30° C.).
实施例Example
材料和方法Materials and methods
实施例中的全基因合成、引物合成及测序皆由南京金斯瑞生物科技有限公司完成。The whole gene synthesis, primer synthesis and sequencing in the examples were all completed by Nanjing GenScript Biotechnology Co., Ltd.
实施例中的分子生物学实验包括质粒构建、酶切、连接、感受态细胞制备、转化、培养基配制等等,主要参照《分子克隆实验指南》(第三版),J.萨姆布鲁克,D.W.拉塞尔(美)编著,黄培堂等译,科学出版社,北京,2002)进行。必要时可以通过简单试验确定具体实验条件。The molecular biology experiments in the examples include plasmid construction, enzyme digestion, connection, competent cell preparation, transformation, medium preparation, etc., mainly referring to "Molecular Cloning Experiment Guide" (third edition), J. Sambrook, Edited by D.W. Russell (US), translated by Huang Peitang et al., Science Press, Beijing, 2002). The specific experimental conditions can be determined by simple experiments if necessary.
PCR扩增实验根据质粒或DNA模板供应商提供的反应条件或试剂盒说明书进行。必要时可以通过简单试验予以调整。PCR amplification experiments were carried out according to the reaction conditions or kit instructions provided by the plasmid or DNA template suppliers. It can be adjusted by simple experiment if necessary.
LB培养基:10g/L胰蛋白胨、5g/L酵母提取物、10g/L氯化钠,pH 7.0。LB medium: 10g/L tryptone, 5g/L yeast extract, 10g/L sodium chloride, pH 7.0.
阿吗碱及前体的HPLC-MS检测条件:HPLC-MS detection conditions of amoline and its precursors:
靶标代谢组学检测使用的是Agilent 1260Infinity HPLC-MS系统进行检测。色谱柱为Agilent C18柱(色谱柱:3.5um,4.0×100mm);流动相为0.1%甲酸水溶液(A)和色谱纯乙腈(B),按照下列梯度进行分离:0-2min,5%B;22min,40%B;25min,60%B;28min,95%B;35min,5%B.流速为0.8mL/min,柱平衡5min。总分析时间为40min。质谱为单四级杆质谱,按照Agilent厂家的默认参数进行。Agilent 1260 Infinity HPLC-MS system was used for target metabolomics detection. The chromatographic column is an Agilent C18 column (chromatographic column: 3.5um, 4.0×100mm); the mobile phase is 0.1% formic acid aqueous solution (A) and chromatographically pure acetonitrile (B), and the separation is carried out according to the following gradient: 0-2min, 5% B; 22min, 40%B; 25min, 60%B; 28min, 95%B; 35min, 5%B. The flow rate was 0.8mL/min, and the column equilibrated for 5min. The total analysis time is 40 min. The mass spectrometer was a single quadrupole mass spectrometer, and was performed according to the default parameters of the Agilent manufacturer.
实施例1原始载体pCambia2301中GUS筛选标记的删除Deletion of the GUS screening marker in the original vector pCambia2301 of embodiment 1
1.1利用重组酶Redα/β或者RecE/T介导的同源重组对DNA分子进行精确修饰的技术来进行GUS筛选标记及其启动子和终止子的删除,步骤包括:将原始pCambia2301载体(Invitrogen公司)利用限制性内切酶SnaBI和MaubI(Thermofisher公司)于37℃酶切,获得线性化的pCambia2301载体。1.1 Utilize recombinase Redα/β or RecE/T-mediated homologous recombination to precisely modify DNA molecules to delete GUS screening markers and their promoters and terminators. The steps include: adding the original pCambia2301 vector (Invitrogen Corporation ) was digested at 37° C. with restriction endonucleases SnaBI and MaubI (Thermofisher Company) to obtain a linearized pCambia2301 vector.
1.2然后将上述线性化的pCambia2301载体和引物GUS-deletion-F及GUS-deletion-R利用Red/ET技术在DH10β大肠杆菌(NEB公司)中进行体内重组。重组DH10β大肠杆菌在含硫酸卡那霉素的LB培养基中37℃培养过夜。1.2 Then, the above-mentioned linearized pCambia2301 vector and primers GUS-deletion-F and GUS-deletion-R were recombined in DH10β Escherichia coli (NEB Company) in vivo using Red/ET technology. Recombinant Escherichia coli DH10β was cultured overnight at 37°C in LB medium containing kanamycin sulfate.
所用引物序列(5’-3’)为:The primer sequences (5'-3') used are:
GUS-deletion-F:GUS-deletion-F:
CTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC;CTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC;
GUS-deletion-R:GUS-deletion-R:
FCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC。FCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC.
1.3使用质粒提取试剂盒(Axygen公司)提取质粒,经测序验证正确,获得删除GUS筛选标记的中间载体pCambia2301-Gus-Del。1.3 Use a plasmid extraction kit (Axygen Company) to extract the plasmid, and verify correctness by sequencing to obtain the intermediate vector pCambia2301-Gus-Del with the GUS screening marker deleted.
实施例2优化载体的构建Embodiment 2 optimizes the construction of carrier
2.1插入片段(Insertfragment)的合成:插入片段包含的元件有35S启动子、5’UTR、P19、F2A肽、eyGFPuv、3’UTR和NosT七部分,结构为35S启动子-5’UTR-P19-2A肽-eyGFPuv-3’UTR-NosT,该融合基因的核苷酸序列为SEQ ID NO:1。2.1 Synthesis of the insert fragment (Insert fragment): The insert fragment contains seven parts: 35S promoter, 5'UTR, P19, F2A peptide, eyGFPuv, 3'UTR and NosT, and the structure is 35S promoter-5'UTR-P19- 2A peptide-eyGFPuv-3'UTR-NosT, the nucleotide sequence of the fusion gene is SEQ ID NO:1.
该融合序列委托南京金斯瑞生物科技有限公司合成。The fusion sequence was synthesized by Nanjing GenScript Biotechnology Co., Ltd.
2.2构建优化的载体:将步骤2.1中合成的插入片段和实施例1中得到的中间载体pCambia2301-Gus-Del载体用限制性内切酶ECoRI和SmaI(NEB公司)于37℃酶切12h,获得线性化的pCambia2301-Gus-Del载体。然后利用T4 DNA Ligase(NEB公司)按照试剂盒说明书进行连接,构建优化的载体pCambia2301-Optimized,命名为pF08,经测序验证正确。该载体pF08的核苷酸序列为SEQ ID NO:2。2.2 Construction of an optimized vector: Digest the insert fragment synthesized in step 2.1 and the intermediate vector pCambia2301-Gus-Del vector obtained in Example 1 with restriction enzymes ECoRI and SmaI (NEB Company) at 37°C for 12h to obtain Linearized pCambia2301-Gus-Del vector. Then, T4 DNA Ligase (NEB Company) was used to perform ligation according to the instructions of the kit to construct an optimized vector pCambia2301-Optimized, named pF08, which was verified to be correct by sequencing. The nucleotide sequence of the vector pF08 is SEQ ID NO:2.
实施例3阿吗碱生物合成相关基因的扩增Example 3 Amplification of Amoline Biosynthesis-Related Genes
参照专利文献CN202010092362.3中公开的方法,对阿吗碱生物合成相关的目的基因TDC、SLS1、STR、NPF2.9、SGD和HYS进行扩增。包括如下步骤。Referring to the method disclosed in the patent document CN202010092362.3, the target genes TDC, SLS1, STR, NPF2.9, SGD and HYS related to amoline biosynthesis were amplified. Including the following steps.
3.1提取长春花总RNA与反转录3.1 Extraction of periwinkle total RNA and reverse transcription
长春花叶片液氮研磨成粉末,取100mg研磨好的粉末与1.5mL RNA free的离心管中,然后按照全式金的RNA提取试剂盒说明书进行RNA提取。提取后的RNA直接反转录生产cDNA。The periwinkle leaves were ground into powder with liquid nitrogen, and 100 mg of the ground powder was placed in a centrifuge tube with 1.5 mL of RNA free, and then RNA was extracted according to the instructions of Quanjin’s RNA extraction kit. The extracted RNA is directly reverse-transcribed to produce cDNA.
取1μg RNA,利用全式金的反转录试剂盒反转录生产cDNA。Take 1 μg of RNA and reverse transcribe to produce cDNA using Quanshijin reverse transcription kit.
3.2目的基因克隆3.2 Target gene cloning
以步骤3.1中反转录获得的长春花cDNA为模板,以表1所示引物按照以下体系进行扩增[5×FastPfu Flybμffer10μl,dNTP(2.5mM)4μL,引物(10μM)各2μL,cDNA 1μL,FastPfu Fly DNA聚合酶1μL,补加ddH2O至终体积50μL扩增。PCR反应程序:98℃预变性30s,98℃变性10s,58℃退火30s,72℃延伸1min 30s,35个循环,72℃终延伸5min,16℃保存。利用Axygen公司的Agarose Gel Fragment Recovery Kit Ver.2.0纯化PCR产物。分别得到TDC、SLS、STR、NPF2.9、SGD和HYS基因片段。Use the periwinkle cDNA obtained by reverse transcription in step 3.1 as a template, and use the primers shown in Table 1 to amplify according to the following system [5× 10 μl of FastPfu Flyb μffer, 4 μL of dNTP (2.5 mM), 2 μL of each primer (10 μM), 1 μL of cDNA, FastPfu Fly DNA polymerase 1 μL, add ddH 2 O to a final volume of 50 μL for amplification. PCR reaction program: pre-denaturation at 98°C for 30s, denaturation at 98°C for 10s, annealing at 58°C for 30s, extension at 72°C for 1min and 30s, 35 cycles, final extension at 72°C for 5min, storage at 16°C. The PCR product was purified using Agarose Gel Fragment Recovery Kit Ver.2.0 from Axygen. The gene fragments of TDC, SLS, STR, NPF2.9, SGD and HYS were obtained respectively.
表1、基因扩增所需引物Table 1. Primers required for gene amplification
其中,F是正向引物;R是反向引物。Wherein, F is a forward primer; R is a reverse primer.
3.3融合基因克隆3.3 Fusion gene cloning
以步骤3.2中获得的含目的基因的质粒为模板,以表2所示引物按照以下体系进行融合基因的扩增[5×FastPfu Flybμffer10μl,dNTP(2.5mM)4μL,引物(10μM)各2μL,cDNA 1μL,FastPfu Fly DNA聚合酶1μL,补加ddH2O至终体积50μL扩增。PCR反应程序:98℃预变性30s,98℃变性10s,58℃退火30s,72℃延伸1min 30s,35个循环,72℃终延伸5min,16℃保存。利用Axygen公司的Agarose Gel Fragment Recovery Kit Ver.2.0纯化PCR产物。分别得到带有融合蛋白接头2A肽基因的TDC、SLS1、STR、NPF2.9、SGD和HYS基因片段。Use the plasmid containing the target gene obtained in step 3.2 as a template, and use the primers shown in Table 2 to amplify the fusion gene according to the following system [5× 10 μl of FastPfu Flyb μffer, 4 μL of dNTP (2.5 mM), 2 μL of each primer (10 μM), 1 μL of cDNA, FastPfu Fly DNA polymerase 1 μL, add ddH 2 O to a final volume of 50 μL for amplification. PCR reaction program: pre-denaturation at 98°C for 30s, denaturation at 98°C for 10s, annealing at 58°C for 30s, extension at 72°C for 1min and 30s, 35 cycles, final extension at 72°C for 5min, storage at 16°C. The PCR product was purified using Agarose Gel Fragment Recovery Kit Ver.2.0 from Axygen. The TDC, SLS1, STR, NPF2.9, SGD and HYS gene fragments with
表2、融合基因扩增所需引物Table 2. Primers required for fusion gene amplification
其中,F是正向引物;R是反向引物。Wherein, F is a forward primer; R is a reverse primer.
实施例4农杆菌表达载体的构建The construction of embodiment 4 Agrobacterium expression vector
4.1原始载体的构建4.1 Construction of the original vector
将原始农杆菌双元表达载体pCambia2301(Invitrogen公司)用合适的限制性内切酶BamHI和SalI在37℃酶切12h,采用胶回收试剂盒(Axygen公司)纯化酶切产物,利用同源重组酶将步骤3.3中的纯化片段和酶切载体按照说明书操作进行同源重组,得到表达融合基因的原始载体。包括pCambia2301-TDC-2A-SLS、pCambia2301-STR-2A-NPF2.9、pCambia2301-SGD-2A-HYS等。The original Agrobacterium binary expression vector pCambia2301 (Invitrogen Company) was digested with appropriate restriction enzymes BamHI and SalI at 37°C for 12 hours, and the digested product was purified using a gel recovery kit (Axygen Company), and homologous recombination was used to Perform homologous recombination on the purified fragment and restriction vector in step 3.3 according to the instructions to obtain the original vector expressing the fusion gene. Including pCambia2301-TDC-2A-SLS, pCambia2301-STR-2A-NPF2.9, pCambia2301-SGD-2A-HYS, etc.
4.2优化载体的构建4.2 Construction of optimized vector
参照步骤4.1的方法,将步骤2.2中构建的优化载体pF08用合适的限制性内切酶BamHI和SalI在37℃酶切12h,采用胶回收试剂盒(Axygen公司)纯化酶切产物,利用同源重组酶将步骤3.3中的纯化片段和酶切载体按照说明书操作进行同源重组,得到表达融合基因的优化载体。包括pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS等。Referring to the method in step 4.1, the optimized vector pF08 constructed in step 2.2 was digested with the appropriate restriction enzymes BamHI and SalI at 37°C for 12 h, and the digested product was purified using a gel recovery kit (Axygen Company). The recombinase performs homologous recombination on the purified fragment and restriction vector in step 3.3 according to the instructions to obtain an optimized vector for expressing the fusion gene. Including pF08-TDC-2A-SLS, pF08-STR-2A-NPF2.9, pF08-SGD-2A-HYS, etc.
实施例5构建农杆菌Embodiment 5 constructs Agrobacterium
取1μg测序正确的实施例4中得到的表达载体和农杆菌GV3101感受态细胞(上海唯地生物有限公司)在冰上孵育30分钟,液氮速冻5min,37℃热激5min,冰上放置5min,加入1mL的LB培养基,放置在28℃摇床复苏4小时。将菌体全部涂板在卡那抗性和利福霉素抗性的LB平板上,放置在28℃培养箱培养2-4d,得到分别转化了原始表达载体(包括pCambia2301-TDC-2A-SLS、pCambia2301-STR-2A-NPF2.9、pCambia2301-SGD-2A-HYS等)和优化表达载体(包括pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS等)的农杆菌工程菌。Take 1 μg of the expression vector obtained in Example 4 with correct sequencing and Agrobacterium GV3101 competent cells (Shanghai Weidi Biological Co., Ltd.) and incubate on ice for 30 minutes, quick freeze in liquid nitrogen for 5 minutes, heat shock at 37°C for 5 minutes, and place on ice for 5 minutes , add 1 mL of LB medium, and place on a shaker at 28°C for 4 hours to recover. All the bacteria were plated on kana-resistant and rifamycin-resistant LB plates, and placed in a 28°C incubator for 2-4 days to obtain the original expression vectors (including pCambia2301-TDC-2A-SLS , pCambia2301-STR-2A-NPF2.9, pCambia2301-SGD-2A-HYS, etc.) and optimized expression vectors (including pF08-TDC-2A-SLS, pF08-STR-2A-NPF2.9, pF08-SGD-2A- HYS, etc.) Agrobacterium engineering bacteria.
实施例6农杆菌侵染Example 6 Agrobacterium Infection
从卡那霉素和利福平霉素抗性的LB培养基平板上挑取单菌落至LB(Kan+Rif)试管培养48h至OD600≥2.0。在28℃培养过夜(12h)。收集菌体后用MMA buffer(10mM MES、10mMMgCl2和100μM乙酰丁香酮)孵育3h,然后用无针头注射器注射烟草。侵染烟草4d后饲喂底物(100mg/mL色氨酸和100mg/mL马钱苷(Loganin)),1d后收取烟草叶片,进行靶标代谢组学分析(HPLC-MS分析)。Pick a single colony from the LB medium plate resistant to kanamycin and rifampicin and culture it in LB (Kan+Rif) tube for 48 hours until OD600≥2.0. Incubate overnight (12h) at 28°C. After the cells were collected, they were incubated with MMA buffer (10 mM MES, 10 mM MgCl 2 and 100 μM acetosyringone) for 3 h, and then the tobacco was injected with a needle-free syringe. The substrate (100 mg/mL tryptophan and 100 mg/mL loganin (Loganin)) was fed 4 days after the tobacco infection, and the tobacco leaves were harvested 1 day later for target metabolomics analysis (HPLC-MS analysis).
实施例7烟草叶片代谢组学分析Example 7 Metabolomics Analysis of Tobacco Leaf
如图2所示,用含有优化表达载体(pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS)的农杆菌侵染烟草叶片后,能够GFP标签顺利表达,365nm紫外照射下,侵染区域为绿色(浅色)部分,肉眼可见,与未注射侵染区域即红色(深色)部分区别明显,表明本发明构建的优化农杆菌双元表达载体pF08可实时、原位监测外源蛋白的表达情况。As shown in Figure 2, after infecting tobacco leaves with Agrobacterium containing optimized expression vectors (pF08-TDC-2A-SLS, pF08-STR-2A-NPF2.9, pF08-SGD-2A-HYS), GFP labeling Smooth expression, under 365nm ultraviolet irradiation, the infected area is a green (light-colored) part, which is visible to the naked eye, and is obviously different from the red (dark) part of the infected area without injection, indicating that the optimized Agrobacterium binary expression vector constructed by the present invention pF08 can monitor the expression of foreign proteins in real time and in situ.
图3显示,使用含有原始载体pCambia2301(TDC-2A-SLS、STR-2A-NPF2.9、SGD-2A-HYS)的农杆菌侵染的烟草叶片中阿吗碱的含量约为12.1μg/g鲜重;使用含有优化载体pF08(pF08-TDC-2A-SLS、pF08-STR-2A-NPF2.9、pF08-SGD-2A-HYS)的农杆菌侵染的烟草叶片中阿吗碱的含量约为20μg/g鲜重,明显高于原始载体,表明本发明构建的优化农杆菌双元表达载体可提高异源基因(TDC、SLS、STR、NPF2.9、SGD和HYS)的表达量,进而提高阿吗碱产量。Figure 3 shows that the content of amoline in the tobacco leaves infected by Agrobacterium containing the original vector pCambia2301 (TDC-2A-SLS, STR-2A-NPF2.9, SGD-2A-HYS) is about 12.1 μg/g Fresh weight; the content of amoline in tobacco leaves infected with Agrobacterium containing optimized vector pF08 (pF08-TDC-2A-SLS, pF08-STR-2A-NPF2.9, pF08-SGD-2A-HYS) was about It is 20 μ g/g fresh weight, obviously higher than original carrier, shows that the optimized Agrobacterium binary expression carrier that the present invention constructs can improve the expression level of heterologous gene (TDC, SLS, STR, NPF2.9, SGD and HYS), and then Increase the production of amoline.
还需说明的是,本说明书中对先前公开的文献的列举和论述不应视为承认该文献是现有技术或者是公知常识。It should also be noted that the enumeration and discussion of previously disclosed documents in this specification should not be regarded as acknowledging that the documents are prior art or common knowledge.
参考文献references
1.Chin,D.P.,Shiratori,I.,Shimizu,A.,Kato,K.,Mii,M.,and Waga,I.(2018).Generation of brilliant green fluorescent petunia plants by using a new andpotent fluorescent protein transgene.Sci Rep 8,1-10.1. Chin, D.P., Shiratori, I., Shimizu, A., Kato, K., Mii, M., and Waga, I. (2018). Generation of brilliant green fluorescent petunia plants by using a new and potent fluorescent protein transgene .Sci Rep 8,1-10.
2.Estrada-Navarrete,G.,Alvarado-Affantranger,X.,Olivares,J.-E.,Guillén,G.,Diaz-Camino,C.,Campos,F.,Quinto,C.,Gresshoff,P.M.,and Sanchez,F.(2007).Fast,efficient and reproducible genetic transformation of Phaseolus spp.byAgrobacterium rhizogenes.Nature protocols 2,1819.2. Estrada-Navarrete, G., Alvarado-Affantranger, X., Olivares, J.-E., Guillén, G., Diaz-Camino, C., Campos, F., Quinto, C., Gresshoff, P.M., and Sanchez, F. (2007). Fast, efficient and reproducible genetic transformation of Phaseolus spp. by Agrobacterium rhizogenes. Nature protocols 2, 1819.
3.Farhi,M.,Marhevka,E.,Ben-Ari,J.,Algamas-Dimantov,A.,Liang,Z.,Zeevi,V.,Edelbaum,O.,Spitzer-Rimon,B.,Abeliovich,H.,and Schwartz,B.(2011).Generation of the potent anti-malarial drug artemisinin in tobacco.NatBiotechnol 29,1072.3. Farhi, M., Marhevka, E., Ben-Ari, J., Algamas-Dimantov, A., Liang, Z., Zeevi, V., Edelbaum, O., Spitzer-Rimon, B., Abeliovich, H., and Schwartz, B. (2011). Generation of the potent anti-malarial drug artemisinin in tobacco. Nat Biotechnol 29, 1072.
4.Frank Sainsbury,Eva C.Thuenemann and George P.Lomonossoff,pEAQ:versatile expression vectors for easy and quick transient expression ofheterologous proteins in plants,Plant Biotechnology Journal(2009)7,682–693.4. Frank Sainsbury, Eva C. Thuenemann and George P. Lomonossoff, pEAQ: versatile expression vectors for easy and quick transient expression of heterologous proteins in plants, Plant Biotechnology Journal (2009) 7, 682–693.
5.Fuentes,P.,Zhou,F.,Erban,A.,Karcher,D.,Kopka,J.,and Bock,R.(2016).Anew synthetic biology approach allows transfer of an entire metabolic pathwayfrom a medicinal plant to a biomass crop.eLife 5.5. Fuentes, P., Zhou, F., Erban, A., Karcher, D., Kopka, J., and Bock, R. (2016). A new synthetic biology approach allows transfer of an entire metabolic pathway from a medicinal plant to a biomass crop. eLife 5.
6.Harper,B.K.,Mabon,S.A.,Leffel,S.M.,Halfhill,M.D.,Richards,H.A.,Moyer,K.A.,and Stewart,C.N.(1999).Green fluorescent protein as a marker forexpression of a second gene in transgenic plants.Nat Biotechnol 17,1125-1129.6. Harper, B.K., Mabon, S.A., Leffel, S.M., Halfhill, M.D., Richards, H.A., Moyer, K.A., and Stewart, C.N. (1999). Green fluorescent protein as a marker for expression of a second gene in transgenic plants.
7.Li,J.,Mutanda,I.,Wang,K.,Yang,L.,Wang,J.,and Wang,Y.(2019).Chloroplastic metabolic engineering coupled with isoprenoid pool enhancementfor committed taxanes biosynthesis in Nicotiana benthamiana.Nat Commun 10,4850.7. Li, J., Mutanda, I., Wang, K., Yang, L., Wang, J., and Wang, Y. (2019). Chloroplastic metabolic engineering coupled with isoprenoid pool enhancement for committed taxanes biosynthesis in Nicotiana benthamiana .Nat Commun 10, 4850.
8.Malhotra,K.,Subramaniyan,M.,Rawat,K.,Kalamuddin,M.,Qureshi,M.I.,Malhotra,P.,Mohmmed,A.,Cornish,K.,Daniell,H.,and Kumar,S.(2016).Compartmentalized Metabolic Engineering for Artemisinin Biosynthesis andEffective Malaria Treatment by Oral Delivery of Plant Cells.Mol Plant 9,1464-1477.8. Malhotra, K., Subramaniyan, M., Rawat, K., Kalamuddin, M., Qureshi, M.I., Malhotra, P., Mohmmed, A., Cornish, K., Daniell, H., and Kumar, S .(2016).Compartmentalized Metabolic Engineering for Artemisinin Biosynthesis and Effective Malaria Treatment by Oral Delivery of Plant Cells.Mol Plant 9,1464-1477.
9.Miettinen,K.,Dong,L.,Navrot,N.,Schneider,T.,Burlat,V.,Pollier,J.,Woittiez,L.,Der Krol,S.V.,Lugan,R.,and Ilc,T.(2014).The seco-iridoid pathwayfrom Catharanthus roseus.Nat Commun 5,3606-3606.9. Miettinen, K., Dong, L., Navrot, N., Schneider, T., Burlat, V., Pollier, J., Woittiez, L., Der Krol, S.V., Lugan, R., and Ilc, T. (2014). The seco-iridoid pathway from Catharanthus roseus. Nat Commun 5, 3606-3606.
10.Paddon,C.J.,Westfall,P.J.,Pitera,D.J.,Benjamin,K.,Fisher,K.,McPhee,D.,Leavell,M.,Tai,A.,Main,A.,and Eng,D.(2013).High-level semi-synthetic production of the potent antimalarial artemisinin.Nature 496,528.10. Paddon, C.J., Westfall, P.J., Pitera, D.J., Benjamin, K., Fisher, K., McPhee, D., Leavell, M., Tai, A., Main, A., and Eng, D. ( 2013). High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496,528.
11.Reed,J.,Stephenson,M.J.,Miettinen,K.,Brouwer,B.,Leveau,A.,Brett,P.,Goss,R.J.,Goossens,A.,O’Connell,M.A.,and Osbourn,A.(2017).A translationalsynthetic biology platform for rapid access to gram-scale quantities of noveldrug-like molecules.Metab Eng 42,185-193.11. Reed, J., Stephenson, M.J., Miettinen, K., Brouwer, B., Leveau, A., Brett, P., Goss, R.J., Goossens, A., O’Connell, M.A., and Osbourn, A. .(2017).A translationalsynthetic biology platform for rapid access to gram-scale quantities of noveldrug-like molecules.Metab Eng 42,185-193.
12.Sparkes,I.A.,Runions,J.,Kearns,A.,and Hawes,C.(2006).Rapid,transient expression of fluorescent fusion proteins in tobacco plants andgeneration of stably transformed plants.Nature protocols 1,2019.12. Sparkes, I.A., Runions, J., Kearns, A., and Hawes, C. (2006). Rapid, transient expression of fluorescent fusion proteins in tobacco plants and generation of stably transformed plants. Nature protocols 1, 2019.
13.吴世文,杨盟权,and肖友利(2018).单萜吲哚生物碱的合成生物学研究进展.Chin J Org Chem 38,2243-2258.13. Wu Shiwen, Yang Mengquan, and Xiao Youli (2018). Research progress in synthetic biology of monoterpene indole alkaloids. Chin J Org Chem 38, 2243-2258.
序列表sequence listing
<110> 中国科学院上海生命科学研究院<110> Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences
<120> 一种农杆菌双元表达载体<120> A binary expression vector for Agrobacterium
<130> SHPI2010068<130> SHPI2010068
<160> 5<160> 5
<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0
<210> 1<210> 1
<211> 2526<211> 2526
<212> DNA<212>DNA
<213> 人工序列()<213> artificial sequence ()
<400> 1<400> 1
ggaaacctcc tcggattcca ttgcccagct atctgtcact ttattgagaa gatagtggaa 60ggaaacctcc tcggattcca ttgcccagct atctgtcact ttattgagaa gatagtggaa 60
aaggaaggtg gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat 120aaggaaggtg gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat 120
gcctctgccg acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa 180gcctctgccg acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa 180
gaagacgttc caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta 240gaagacgttc caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta 240
agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca 300agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca 300
tttcatttgg agaggtatta aaatcttaat aggttttgat aaaagcgaac gtggggaaac 360tttcatttgg agaggtatta aaatcttaat aggttttgat aaaagcgaac gtggggaaac 360
ccgaaccaaa ccttcttcta aactctctct catctctctt aaagcaaact tctctcttgt 420ccgaaccaaa ccttcttcta aactctctct catctctctt aaagcaaact tctctcttgt 420
ctttcttgcg tgagcgatct tcaacgttgt cagatcgtgc ttcggcacca gtacaacgtt 480ctttcttgcg tgagcgatct tcaacgttgt cagatcgtgc ttcggcacca gtacaacgtt 480
ttctttcact gaagcgaaat caaagatctc tttgtggaca cgtagtgcgg cgccattaaa 540ttctttcact gaagcgaaat caaagatctc tttgtggaca cgtagtgcgg cgccattaaa 540
taacgtgtac ttgtcctatt cttgtcggtg tggtcttggg aaaagaaagc ttgctggagg 600taacgtgtac ttgtcctatt cttgtcggtg tggtcttggg aaaagaaagc ttgctggagg 600
ctgctgttca gccccataca ttacttgtta cgattctgct gactttcggc gggtgcaata 660ctgctgttca gccccataca ttacttgtta cgattctgct gactttcggc gggtgcaata 660
tctctacttc tgcttgacga ggtattgttg cctgtacttc tttcttcttc ttcttgctga 720tctctacttc tgcttgacga ggtattgttg cctgtacttc tttcttcttc ttcttgctga 720
ttggttctat aagaaatcta gtattttctt tgaaacagag ttttcccgtg gttttcgaac 780ttggttctat aagaaatcta gtattttctt tgaaacagag ttttcccgtg gttttcgaac 780
ttggagaaag attgttaagc ttctgtatat tctgcccaaa ttcgcgatgg aacgagctat 840ttggagaaag attgttaagc ttctgtatat tctgcccaaa ttcgcgatgg aacgagctat 840
acaaggaaac gacgctaggg aacaagctaa cagtgaacgt tgggatggag gatcaggagg 900acaaggaaac gacgctaggg aacaagctaa cagtgaacgt tgggatggag gatcaggagg 900
aagcacttct cccttcaaac ttcctgacga aagtccgagt tggactgagt ggcggctaca 960aagcacttct cccttcaaac ttcctgacga aagtccgagt tggactgagt ggcggctaca 960
taacgatgag acgaactcga atcaagataa tccccttggt ttcaaggaaa gctggggttt 1020taacgatgag acgaactcga atcaagataa tccccttggt ttcaaggaaa gctggggttt 1020
cgggaaagtt gtatttaaga gatatctcag atacgacagg acggaggcat cactgcacag 1080cgggaaagtt gtatttaaga gatatctcag atacgacagg acggaggcat cactgcacag 1080
agtccttgga tcttggacgg gagattcggt taactatgca gcatctcgat ttttcggttt 1140agtccttgga tcttggacgg gagattcggt taactatgca gcatctcgat ttttcggttt 1140
cgaccagatc ggatgtacct atagtattcg gtttcgagga gttagtatca ccgtttctgg 1200cgaccagatc ggatgtacct atagtattcg gtttcgagga gttagtatca ccgtttctgg 1200
agggtctcga actcttcagc atctctgtga gatggcaatt cggtctaagc aagaactgct 1260agggtctcga actcttcagc atctctgtga gatggcaatt cggtctaagc aagaactgct 1260
acagcttgcc ccaatcgaag tggaaagtaa tgtatcaaga ggatgccctg aaggtactga 1320acagcttgcc ccaatcgaag tggaaagtaa tgtatcaaga ggatgccctg aaggtactga 1320
gaccttcgaa aaagaaagcg agggatctgg agttaagcaa actttgaatt ttgatttgtt 1380gaccttcgaa aaagaaagcg agggatctgg agttaagcaa actttgaatt ttgatttgtt 1380
gaagttggct ggagatgttg aatctaatcc tggacctgag ctcatgacaa ccttcaaaat 1440gaagttggct ggagatgttg aatctaatcc tggacctgag ctcatgacaa ccttcaaaat 1440
cgagtcccgc atccacggca acctcaacgg ggagaagttc gagttggttg gaggtggagt 1500cgagtcccgc atccacggca acctcaacgg ggagaagttc gagttggttg gaggtggagt 1500
aggtgaggag ggtcgcctcg agattgagat gaagactaaa gataaaccac tggcattctc 1560aggtgaggag ggtcgcctcg agattgagat gaagactaaa gataaaccac tggcattctc 1560
tcccttcctg ctgaccactt gcatgggtta cgggttctac cacttcgcca gcttcccaaa 1620tcccttcctg ctgaccactt gcatgggtta cgggttctac cacttcgcca gcttcccaaa 1620
ggggattaag aacatctatc ttcatgctgc aacgaacgga ggttacacca acaccaggaa 1680ggggattaag aacatctatc ttcatgctgc aacgaacgga ggttacacca acaccaggaa 1680
ggagatctat gaagacggcg gcatcttgga ggtcaacttc cgttacactt acgagttcaa 1740ggagatctat gaagacggcg gcatcttgga ggtcaacttc cgttacactt acgagttcaa 1740
caagatcatc ggtgacgtcg agtgcattgg acatggattc ccaagtcaga gtccgatctt 1800caagatcatc ggtgacgtcg agtgcattgg acatggattc ccaagtcaga gtccgatctt 1800
caaggacacg atcgtgaaga gttgtcccac ggtggacctg atgttgccaa tgtccgggaa 1860caaggacacg atcgtgaaga gttgtcccac ggtggacctg atgttgccaa tgtccgggaa 1860
catcatcgcc agctcctacg cttacgcctt ccaactgaag gacggctctt tctacacggc 1920catcatcgcc agctcctacg cttacgcctt ccaactgaag gacggctctt tctacacggc 1920
agaagtcaag aacaacatag acttcaagaa tccaatccac gagtccttct cgaagtcagg 1980agaagtcaag aacaacatag acttcaagaa tccaatccac gagtcccttct cgaagtcagg 1980
gcccatgttc acccacagac gtgtcgagga gactctcacc aaggagaacc ttgccatagt 2040gcccatgttc accccacagac gtgtcgagga gactctcacc aaggagaacc ttgccatagt 2040
ggagtaccag caggttttca acagcgcccc aagagacatg tagactagtt taactctggt 2100ggagtaccag caggttttca acagcgcccc aagagacatg tagactagtt taactctggt 2100
ttcattaaat tttctttagt ttgaatttac tgttattcgg tgtgcatttc tatgtttggt 2160ttcattaaat tttctttagt ttgaatttac tgttattcgg tgtgcatttc tatgtttggt 2160
gagcggtttt ctgtgctcag agtgtgttta ttttatgtaa tttaatttct ttgtgagcac 2220gagcggtttt ctgtgctcag agtgtgttta ttttatgtaa tttaatttct ttgtgagcac 2220
ctgtttagca ggtcgtccct tcagcaagga cacaaaaaga ttttaatttt attgatcgtt 2280ctgtttagca ggtcgtccct tcagcaagga cacaaaaaga ttttaatttt attgatcgtt 2280
caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt gcgatgatta 2340caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt gcgatgatta 2340
tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa tgcatgacgt 2400tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa tgcatgacgt 2400
tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa tacgcgatag 2460tatttatgag atgggttttt atgattatag tcccgcaatt atacatttaa tacgcgatag 2460
aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca tctatgttac 2520aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca tctatgttac 2520
tagatc 2526tagatc 2526
<210> 2<210> 2
<211> 12724<211> 12724
<212> DNA<212>DNA
<213> 人工序列()<213> artificial sequence ()
<400> 2<400> 2
ggatcctcta gagtcgacct gcaggcatgc cctgctttaa tgagatatgc gagacgccta 60ggatcctcta gagtcgacct gcaggcatgc cctgctttaa tgagatatgc gagacgccta 60
tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180
cgtattttta tgaataatat tctccgttca atttactgat tgtccaagct tggcactggc 240cgtattttta tgaataatat tctccgttca atttactgat tgtccaagct tggcactggc 240
cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 300cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt accccaactta atcgccttgc 300
agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc 360agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc 360
cgataaatta tcgcgcgcgg tgtcatctat gttactagat cgggaattaa actatcagtg 420cgataaatta tcgcgcgcgg tgtcatctat gttactagat cgggaattaa actatcagtg 420
tttgacagga tatattggcg ggtaaaccta agagaaaaga gcgtttatta gaataacgga 480tttgacagga tatattggcg ggtaaaccta agagaaaaga gcgtttatta gaataacgga 480
tatttaaaag ggcgtgaaaa ggtttatccg ttcgtccatt tgtatgtgca tgccaaccac 540tattaaaag ggcgtgaaaa ggtttatccg ttcgtccatt tgtatgtgca tgccaaccac 540
agggttcccc tcgggatcaa agtactttga tccaacccct ccgctgctat agtgcagtcg 600agggttcccc tcgggatcaa agtactttga tccaacccct ccgctgctat agtgcagtcg 600
gcttctgacg ttcagtgcag ccgtcttctg aaaacgacat gtcgcacaag tcctaagtta 660gcttctgacg ttcagtgcag ccgtcttctg aaaacgacat gtcgcacaag tcctaagtta 660
cgcgacaggc tgccgccctg cccttttcct ggcgttttct tgtcgcgtgt tttagtcgca 720cgcgacaggc tgccgccctg cccttttcct ggcgttttct tgtcgcgtgt tttagtcgca 720
taaagtagaa tacttgcgac tagaaccgga gacattacgc catgaacaag agcgccgccg 780taaagtagaa tacttgcgac tagaaccgga gacattacgc catgaacaag agcgccgccg 780
ctggcctgct gggctatgcc cgcgtcagca ccgacgacca ggacttgacc aaccaacggg 840ctggcctgct gggctatgcc cgcgtcagca ccgacgacca ggacttgacc aaccaacggg 840
ccgaactgca cgcggccggc tgcaccaagc tgttttccga gaagatcacc ggcaccaggc 900ccgaactgca cgcggccggc tgcaccaagc tgttttccga gaagatcacc ggcaccaggc 900
gcgaccgccc ggagctggcc aggatgcttg accacctacg ccctggcgac gttgtgacag 960gcgaccgccc ggagctggcc aggatgcttg accacctacg ccctggcgac gttgtgacag 960
tgaccaggct agaccgcctg gcccgcagca cccgcgacct actggacatt gccgagcgca 1020tgaccaggct agaccgcctg gcccgcagca cccgcgacct actggacatt gccgagcgca 1020
tccaggaggc cggcgcgggc ctgcgtagcc tggcagagcc gtgggccgac accaccacgc 1080tccaggaggc cggcgcgggc ctgcgtagcc tggcagagcc gtgggccgac accaccacgc 1080
cggccggccg catggtgttg accgtgttcg ccggcattgc cgagttcgag cgttccctaa 1140cggccggccg catggtgttg accgtgttcg ccggcattgc cgagttcgag cgttccctaa 1140
tcatcgaccg cacccggagc gggcgcgagg ccgccaaggc ccgaggcgtg aagtttggcc 1200tcatcgaccg cacccggagc gggcgcgagg ccgccaaggc ccgaggcgtg aagtttggcc 1200
cccgccctac cctcaccccg gcacagatcg cgcacgcccg cgagctgatc gaccaggaag 1260cccgccctac cctcaccccg gcacagatcg cgcacgcccg cgagctgatc gaccaggaag 1260
gccgcaccgt gaaagaggcg gctgcactgc ttggcgtgca tcgctcgacc ctgtaccgcg 1320gccgcaccgt gaaagaggcg gctgcactgc ttggcgtgca tcgctcgacc ctgtaccgcg 1320
cacttgagcg cagcgaggaa gtgacgccca ccgaggccag gcggcgcggt gccttccgtg 1380cacttgagcg cagcgaggaa gtgacgccca ccgaggccag gcggcgcggt gccttccgtg 1380
aggacgcatt gaccgaggcc gacgccctgg cggccgccga gaatgaacgc caagaggaac 1440aggacgcatt gaccgaggcc gacgccctgg cggccgccga gaatgaacgc caagaggaac 1440
aagcatgaaa ccgcaccagg acggccagga cgaaccgttt ttcattaccg aagagatcga 1500aagcatgaaa ccgcaccagg acggccagga cgaaccgttt ttcattaccg aagagatcga 1500
ggcggagatg atcgcggccg ggtacgtgtt cgagccgccc gcgcacgtct caaccgtgcg 1560ggcggagatg atcgcggccg ggtacgtgtt cgagccgccc gcgcacgtct caaccgtgcg 1560
gctgcatgaa atcctggccg gtttgtctga tgccaagctg gcggcctggc cggccagctt 1620gctgcatgaa atcctggccg gtttgtctga tgccaagctg gcggcctggc cggccagctt 1620
ggccgctgaa gaaaccgagc gccgccgtct aaaaaggtga tgtgtatttg agtaaaacag 1680ggccgctgaa gaaaccgagc gccgccgtct aaaaaggtga tgtgtatttg agtaaaacag 1680
cttgcgtcat gcggtcgctg cgtatatgat gcgatgagta aataaacaaa tacgcaaggg 1740cttgcgtcat gcggtcgctg cgtatatgat gcgatgagta aataaacaaa tacgcaaggg 1740
gaacgcatga aggttatcgc tgtacttaac cagaaaggcg ggtcaggcaa gacgaccatc 1800gaacgcatga aggttatcgc tgtacttaac cagaaaggcg ggtcaggcaa gacgaccatc 1800
gcaacccatc tagcccgcgc cctgcaactc gccggggccg atgttctgtt agtcgattcc 1860gcaacccatc tagcccgcgc cctgcaactc gccggggccg atgttctgtt agtcgattcc 1860
gatccccagg gcagtgcccg cgattgggcg gccgtgcggg aagatcaacc gctaaccgtt 1920gatccccagg gcagtgcccg cgattgggcg gccgtgcggg aagatcaacc gctaaccgtt 1920
gtcggcatcg accgcccgac gattgaccgc gacgtgaagg ccatcggccg gcgcgacttc 1980gtcggcatcg accgcccgac gattgaccgc gacgtgaagg ccatcggccg gcgcgacttc 1980
gtagtgatcg acggagcgcc ccaggcggcg gacttggctg tgtccgcgat caaggcagcc 2040gtagtgatcg acggagcgcc ccaggcggcg gacttggctg tgtccgcgat caaggcagcc 2040
gacttcgtgc tgattccggt gcagccaagc ccttacgaca tatgggccac cgccgacctg 2100gacttcgtgc tgattccggt gcagccaagc ccttacgaca tatgggccac cgccgacctg 2100
gtggagctgg ttaagcagcg cattgaggtc acggatggaa ggctacaagc ggcctttgtc 2160gtggagctgg ttaagcagcg cattgaggtc acggatggaa ggctacaagc ggcctttgtc 2160
gtgtcgcggg cgatcaaagg cacgcgcatc ggcggtgagg ttgccgaggc gctggccggg 2220gtgtcgcggg cgatcaaagg cacgcgcatc ggcggtgagg ttgccgaggc gctggccggg 2220
tacgagctgc ccattcttga gtcccgtatc acgcagcgcg tgagctaccc aggcactgcc 2280tacgagctgc ccattcttga gtcccgtatc acgcagcgcg tgagctaccc aggcactgcc 2280
gccgccggca caaccgttct tgaatcagaa cccgagggcg acgctgcccg cgaggtccag 2340gccgccggca caaccgttct tgaatcagaa cccgagggcg acgctgcccg cgaggtccag 2340
gcgctggccg ctgaaattaa atcaaaactc atttgagtta atgaggtaaa gagaaaatga 2400gcgctggccg ctgaaattaa atcaaaactc atttgagtta atgaggtaaa gagaaaatga 2400
gcaaaagcac aaacacgcta agtgccggcc gtccgagcgc acgcagcagc aaggctgcaa 2460gcaaaagcac aaacacgcta agtgccggcc gtccgagcgc acgcagcagc aaggctgcaa 2460
cgttggccag cctggcagac acgccagcca tgaagcgggt caactttcag ttgccggcgg 2520cgttggccag cctggcagac acgccagcca tgaagcgggt caactttcag ttgccggcgg 2520
aggatcacac caagctgaag atgtacgcgg tacgccaagg caagaccatt accgagctgc 2580aggatcacac caagctgaag atgtacgcgg tacgccaagg caagaccatt accgagctgc 2580
tatctgaata catcgcgcag ctaccagagt aaatgagcaa atgaataaat gagtagatga 2640tatctgaata catcgcgcag ctaccagagt aaatgagcaa atgaataaat gagtagatga 2640
attttagcgg ctaaaggagg cggcatggaa aatcaagaac aaccaggcac cgacgccgtg 2700attttagcgg ctaaaggagg cggcatggaa aatcaagaac aaccaggcac cgacgccgtg 2700
gaatgcccca tgtgtggagg aacgggcggt tggccaggcg taagcggctg ggttgtctgc 2760gaatgcccca tgtgtgggagg aacgggcggt tggccaggcg taagcggctg ggttgtctgc 2760
cggccctgca atggcactgg aacccccaag cccgaggaat cggcgtgagc ggtcgcaaac 2820cggccctgca atggcactgg aacccccaag cccgaggaat cggcgtgagc ggtcgcaaac 2820
catccggccc ggtacaaatc ggcgcggcgc tgggtgatga cctggtggag aagttgaagg 2880catccggccc ggtacaaatc ggcgcggcgc tgggtgatga cctggtggag aagttgaagg 2880
ccgcgcaggc cgcccagcgg caacgcatcg aggcagaagc acgccccggt gaatcgtggc 2940ccgcgcaggc cgcccagcgg caacgcatcg aggcagaagc acgccccggt gaatcgtggc 2940
aagcggccgc tgatcgaatc cgcaaagaat cccggcaacc gccggcagcc ggtgcgccgt 3000aagcggccgc tgatcgaatc cgcaaagaat cccggcaacc gccggcagcc ggtgcgccgt 3000
cgattaggaa gccgcccaag ggcgacgagc aaccagattt tttcgttccg atgctctatg 3060cgattaggaa gccgcccaag ggcgacgagc aaccagattt tttcgttccg atgctctatg 3060
acgtgggcac ccgcgatagt cgcagcatca tggacgtggc cgttttccgt ctgtcgaagc 3120acgtgggcac ccgcgatagt cgcagcatca tggacgtggc cgttttccgt ctgtcgaagc 3120
gtgaccgacg agctggcgag gtgatccgct acgagcttcc agacgggcac gtagaggttt 3180gtgaccgacg agctggcgag gtgatccgct acgagcttcc agacgggcac gtagaggttt 3180
ccgcagggcc ggccggcatg gccagtgtgt gggattacga cctggtactg atggcggttt 3240ccgcagggcc ggccggcatg gccagtgtgt gggattacga cctggtactg atggcggttt 3240
cccatctaac cgaatccatg aaccgatacc gggaagggaa gggagacaag cccggccgcg 3300cccatctaac cgaatccatg aaccgatacc gggaagggaa gggagacaag cccggccgcg 3300
tgttccgtcc acacgttgcg gacgtactca agttctgccg gcgagccgat ggcggaaagc 3360tgttccgtcc acacgttgcg gacgtactca agttctgccg gcgagccgat ggcggaaagc 3360
agaaagacga cctggtagaa acctgcattc ggttaaacac cacgcacgtt gccatgcagc 3420agaaagacga cctggtagaa acctgcattc ggttaaacac cacgcacgtt gccatgcagc 3420
gtacgaagaa ggccaagaac ggccgcctgg tgacggtatc cgagggtgaa gccttgatta 3480gtacgaagaa ggccaagaac ggccgcctgg tgacggtatc cgagggtgaa gccttgatta 3480
gccgctacaa gatcgtaaag agcgaaaccg ggcggccgga gtacatcgag atcgagctag 3540gccgctacaa gatcgtaaag agcgaaaccg ggcggccgga gtacatcgag atcgagctag 3540
ctgattggat gtaccgcgag atcacagaag gcaagaaccc ggacgtgctg acggttcacc 3600ctgattggat gtaccgcgag atcacagaag gcaagaaccc ggacgtgctg acggttcacc 3600
ccgattactt tttgatcgat cccggcatcg gccgttttct ctaccgcctg gcacgccgcg 3660ccgattactt tttgatcgat cccggcatcg gccgttttct ctaccgcctg gcacgccgcg 3660
ccgcaggcaa ggcagaagcc agatggttgt tcaagacgat ctacgaacgc agtggcagcg 3720ccgcaggcaa ggcagaagcc agatggttgt tcaagacgat ctacgaacgc agtggcagcg 3720
ccggagagtt caagaagttc tgtttcaccg tgcgcaagct gatcgggtca aatgacctgc 3780ccggagagtt caagaagttc tgtttcaccg tgcgcaagct gatcgggtca aatgacctgc 3780
cggagtacga tttgaaggag gaggcggggc aggctggccc gatcctagtc atgcgctacc 3840cggagtacga tttgaaggag gaggcggggc aggctggccc gatcctagtc atgcgctacc 3840
gcaacctgat cgagggcgaa gcatccgccg gttcctaatg tacggagcag atgctagggc 3900gcaacctgat cgagggcgaa gcatccgccg gttcctaatg tacggagcag atgctagggc 3900
aaattgccct agcaggggaa aaaggtcgaa aaggtctctt tcctgtggat agcacgtaca 3960aaattgccct agcaggggaa aaaggtcgaa aaggtctctt tcctgtggat agcacgtaca 3960
ttgggaaccc aaagccgtac attgggaacc ggaacccgta cattgggaac ccaaagccgt 4020ttgggaaccc aaagccgtac attgggaacc ggaacccgta cattgggaac ccaaagccgt 4020
acattgggaa ccggtcacac atgtaagtga ctgatataaa agagaaaaaa ggcgattttt 4080acattgggaa ccggtcacac atgtaagtga ctgatataaa agagaaaaaa ggcgattttt 4080
ccgcctaaaa ctctttaaaa cttattaaaa ctcttaaaac ccgcctggcc tgtgcataac 4140ccgcctaaaa ctctttaaaa ctttattaaaa ctcttaaaac ccgcctggcc tgtgcataac 4140
tgtctggcca gcgcacagcc gaagagctgc aaaaagcgcc tacccttcgg tcgctgcgct 4200tgtctggcca gcgcacagcc gaagagctgc aaaaagcgcc tacccttcgg tcgctgcgct 4200
ccctacgccc cgccgcttcg cgtcggccta tcgcggccgc tggccgctca aaaatggctg 4260ccctacgccc cgccgcttcg cgtcggccta tcgcggccgc tggccgctca aaaatggctg 4260
gcctacggcc aggcaatcta ccagggcgcg gacaagccgc gccgtcgcca ctcgaccgcc 4320gcctacggcc aggcaatcta ccagggcgcg gacaagccgc gccgtcgcca ctcgaccgcc 4320
ggcgcccaca tcaaggcacc ctgcctcgcg cgtttcggtg atgacggtga aaacctctga 4380ggcgcccaca tcaaggcacc ctgcctcgcg cgtttcggtg atgacggtga aaacctctga 4380
cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4440cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4440
gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat gacccagtca 4500gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat gacccagtca 4500
cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag attgtactga 4560cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag attgtactga 4560
gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca 4620gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca 4620
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 4680ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 4680
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4740cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4740
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4800gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4800
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 4860tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 4860
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 4920agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 4920
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 4980tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 4980
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 5040cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 5040
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 5100ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 5100
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 5160ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 5160
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 5220ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 5220
ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 5280ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 5280
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 5340cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 5340
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5400gcggtggtttttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5400
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5460atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5460
ttttggtcat gcattctagg tactaaaaca attcatccag taaaatataa tattttattt 5520ttttggtcat gcattctagg tactaaaaca attcatccag taaaatataa tattttattt 5520
tctcccaatc aggcttgatc cccagtaagt caaaaaatag ctcgacatac tgttcttccc 5580tctcccaatc aggcttgatc cccagtaagt caaaaaatag ctcgacatac tgttcttccc 5580
cgatatcctc cctgatcgac cggacgcaga aggcaatgtc ataccacttg tccgccctgc 5640cgatatcctc cctgatcgac cggacgcaga aggcaatgtc ataccacttg tccgccctgc 5640
cgcttctccc aagatcaata aagccactta ctttgccatc tttcacaaag atgttgctgt 5700cgcttctccc aagatcaata aagccactta ctttgccatc tttcacaaag atgttgctgt 5700
ctcccaggtc gccgtgggaa aagacaagtt cctcttcggg cttttccgtc tttaaaaaat 5760ctcccaggtc gccgtgggaa aagacaagtt cctcttcggg cttttccgtc tttaaaaaat 5760
catacagctc gcgcggatct ttaaatggag tgtcttcttc ccagttttcg caatccacat 5820catacagctc gcgcggatct ttaaatggag tgtcttcttc ccagttttcg caatccacat 5820
cggccagatc gttattcagt aagtaatcca attcggctaa gcggctgtct aagctattcg 5880cggccagatc gttattcagt aagtaatcca attcggctaa gcggctgtct aagctattcg 5880
tatagggaca atccgatatg tcgatggagt gaaagagcct gatgcactcc gcatacagct 5940tatagggaca atccgatatg tcgatggagt gaaagagcct gatgcactcc gcatacagct 5940
cgataatctt ttcagggctt tgttcatctt catactcttc cgagcaaagg acgccatcgg 6000cgataatctt ttcagggctt tgttcatctt catactcttc cgagcaaagg acgccatcgg 6000
cctcactcat gagcagattg ctccagccat catgccgttc aaagtgcagg acctttggaa 6060cctcactcat gagcagattg ctccagccat catgccgttc aaagtgcagg acctttggaa 6060
caggcagctt tccttccagc catagcatca tgtccttttc ccgttccaca tcataggtgg 6120caggcagctt tccttccagc catagcatca tgtccttttc ccgttccaca tcataggtgg 6120
tccctttata ccggctgtcc gtcattttta aatataggtt ttcattttct cccaccagct 6180tccctttata ccggctgtcc gtcattttta aatataggtt ttcattttct cccaccagct 6180
tatatacctt agcaggagac attccttccg tatcttttac gcagcggtat ttttcgatca 6240tatatacctt agcaggagac attccttccg tatcttttac gcagcggtat ttttcgatca 6240
gttttttcaa ttccggtgat attctcattt tagccattta ttatttcctt cctcttttct 6300gttttttcaa ttccggtgat attctcattt tagccattatta ttattcctt cctcttttct 6300
acagtattta aagatacccc aagaagctaa ttataacaag acgaactcca attcactgtt 6360acagtattta aagatacccc aagaagctaa ttataacaag acgaactcca attcactgtt 6360
ccttgcattc taaaacctta aataccagaa aacagctttt tcaaagttgt tttcaaagtt 6420ccttgcattc taaaacctta aataccagaa aacagctttt tcaaagttgt tttcaaagtt 6420
ggcgtataac atagtatcga cggagccgat tttgaaaccg cggtgatcac aggcagcaac 6480ggcgtataac atagtatcga cggagccgat tttgaaaccg cggtgatcac aggcagcaac 6480
gctctgtcat cgttacaatc aacatgctac cctccgcgag atcatccgtg tttcaaaccc 6540gctctgtcat cgttacaatc aacatgctac cctccgcgag atcatccgtg tttcaaaccc 6540
ggcagcttag ttgccgttct tccgaatagc atcggtaaca tgagcaaagt ctgccgcctt 6600ggcagcttag ttgccgttct tccgaatagc atcggtaaca tgagcaaagt ctgccgcctt 6600
acaacggctc tcccgctgac gccgtcccgg actgatgggc tgcctgtatc gagtggtgat 6660acaacggctc tcccgctgac gccgtcccgg actgatgggc tgcctgtatc gagtggtgat 6660
tttgtgccga gctgccggtc ggggagctgt tggctggctg gtggcaggat atattgtggt 6720tttgtgccga gctgccggtc ggggagctgt tggctggctg gtggcaggat atattgtggt 6720
gtaaacaaat tgacgcttag acaacttaat aacacattgc ggacgttttt aatgtactga 6780gtaaacaaat tgacgcttag acaacttaat aacacattgc ggacgttttt aatgtactga 6780
attaacgccg aattaattcg ggggatctgg attttagtac tggattttgg ttttaggaat 6840attaacgccg aattaattcg ggggatctgg attttagtac tggattttgg ttttaggaat 6840
tagaaatttt attgatagaa gtattttaca aatacaaata catactaagg gtttcttata 6900tagaaatttt attgatagaa gtattttaca aatacaaata catactaagg gtttcttata 6900
tgctcaacac atgagcgaaa ccctatagga accctaattc ccttatctgg gaactactca 6960tgctcaacac atgagcgaaa ccctatagga accctaattc ccttatctgg gaactactca 6960
cacattatta tggagaaact cgagcttgtc gatcgactct agctagagga tcgatccgaa 7020cacatttatta tggagaaact cgagcttgtc gatcgactct agctagagga tcgatccgaa 7020
ccccagagtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 7080ccccagagtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 7080
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 7140tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 7140
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 7200tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac accccagccgg 7200
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 7260ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 7260
tcgccatgtg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 7320tcgccatgtg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 7320
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 7380agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 7380
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 7440gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 7440
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 7500gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 7500
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 7560gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 7560
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 7620tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 7620
agccacgata gccgcgctgc ctcgtcctgg agttcattca gggcaccgga caggtcggtc 7680agccacgata gccgcgctgc ctcgtcctgg agttcattca gggcaccgga caggtcggtc 7680
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 7740ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 7740
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 7800ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 7800
cctgcgtgca atccatcttg ttcaatcccc atggtcgatc gacagatctg cgaaagctcg 7860cctgcgtgca atccatcttg ttcaatcccc atggtcgatc gacagatctg cgaaagctcg 7860
agagagatag atttgtagag agagactggt gatttcagcg tgtcctctcc aaatgaaatg 7920agagagatag atttgtagag agagactggt gatttcagcg tgtcctctcc aaatgaaatg 7920
aacttcctta tatagaggaa ggtcttgcga aggatagtgg gattgtgcgt catcccttac 7980aacttcctta tatagaggaa ggtcttgcga aggatagtgg gattgtgcgt catcccttac 7980
gtcagtggag atatcacatc aatccacttg ctttgaagac gtggttggaa cgtcttcttt 8040gtcagtggag atatcacatc aatccacttg ctttgaagac gtggttggaa cgtcttcttt 8040
ttccacgatg ctcctcgtgg gtgggggtcc atctttggga ccactgtcgg cagaggcatc 8100ttccacgatg ctcctcgtgg gtgggggtcc atctttggga ccactgtcgg cagaggcatc 8100
ttgaacgata gcctttcctt tatcgcaatg atggcatttg taggtgccac cttccttttc 8160ttgaacgata gcctttcctt tatcgcaatg atggcatttg taggtgccac cttccttttc 8160
tactgtcctt ttgatgaagt gacagatagc tgggcaatgg aatccgagga ggtttcccga 8220tactgtcctt ttgatgaagt gacagatagc tgggcaatgg aatccgagga ggtttcccga 8220
tattaccctt tgttgaaaag tctcaatagc cctttggtct tctgagactg tatctttgat 8280tattaccctt tgttgaaaag tctcaatagc cctttggtct tctgagactg tatctttgat 8280
attcttggag tagacgagag tgtcgtgctc caccatgtta tcacatcaat ccacttgctt 8340attcttggag tagacgagag tgtcgtgctc caccatgtta tcacatcaat ccacttgctt 8340
tgaagacgtg gttggaacgt cttctttttc cacgatgctc ctcgtgggtg ggggtccatc 8400tgaagacgtg gttggaacgt cttctttttc cacgatgctc ctcgtgggtgggggtccatc 8400
tttgggacca ctgtcggcag aggcatcttg aacgatagcc tttcctttat cgcaatgatg 8460tttgggacca ctgtcggcag aggcatcttg aacgatagcc tttcctttat cgcaatgatg 8460
gcatttgtag gtgccacctt ccttttctac tgtccttttg atgaagtgac agatagctgg 8520gcatttgtag gtgccacctt ccttttctac tgtccttttg atgaagtgac agatagctgg 8520
gcaatggaat ccgaggaggt ttcccgatat taccctttgt tgaaaagtct caatagccct 8580gcaatggaat ccgaggaggt ttcccgatat taccctttgt tgaaaagtct caatagccct 8580
ttggtcttct gagactgtat ctttgatatt cttggagtag acgagagtgt cgtgctccac 8640ttggtcttct gagactgtat ctttgatatt cttggagtag acgagagtgt cgtgctccac 8640
catgttggca agctgctcta gccaatacgc aaaccgcctc tccccgcgcg ttggccgatt 8700catgttggca agctgctcta gccaatacgc aaaccgcctc tccccgcgcg ttggccgatt 8700
cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca 8760cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca 8760
attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct 8820attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct 8820
cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag ctatgacatg 8880cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag ctatgacatg 8880
attacgaatt cggaaacctc ctcggattcc attgcccagc tatctgtcac tttattgaga 8940attacgaatt cggaaacctc ctcggattcc attgcccagc tatctgtcac tttattgaga 8940
agatagtgga aaaggaaggt ggctcctaca aatgccatca ttgcgataaa ggaaaggcca 9000agatagtgga aaaggaaggt ggctcctaca aatgccatca ttgcgataaa ggaaaggcca 9000
tcgttgaaga tgcctctgcc gacagtggtc ccaaagatgg acccccaccc acgaggagca 9060tcgttgaaga tgcctctgcc gacagtggtc ccaaagatgg accccccaccc acgaggagca 9060
tcgtggaaaa agaagacgtt ccaaccacgt cttcaaagca agtggattga tgtgatatct 9120tcgtggaaaa agaagacgtt ccaaccacgt cttcaaagca agtggattga tgtgatatct 9120
ccactgacgt aagggatgac gcacaatccc actatccttc gcaagaccct tcctctatat 9180ccactgacgt aagggatgac gcacaatccc actatccttc gcaagaccct tcctctatat 9180
aaggaagttc atttcatttg gagaggtatt aaaatcttaa taggttttga taaaagcgaa 9240aaggaagttc atttcatttg gagaggtatt aaaatcttaa taggttttga taaaagcgaa 9240
cgtggggaaa cccgaaccaa accttcttct aaactctctc tcatctctct taaagcaaac 9300cgtggggaaa cccgaaccaa accttcttct aaactctctc tcatctctct taaagcaaac 9300
ttctctcttg tctttcttgc gtgagcgatc ttcaacgttg tcagatcgtg cttcggcacc 9360ttctctcttg tctttcttgc gtgagcgatc ttcaacgttg tcagatcgtg cttcggcacc 9360
agtacaacgt tttctttcac tgaagcgaaa tcaaagatct ctttgtggac acgtagtgcg 9420agtacaacgt tttctttcac tgaagcgaaa tcaaagatct ctttgtggac acgtagtgcg 9420
gcgccattaa ataacgtgta cttgtcctat tcttgtcggt gtggtcttgg gaaaagaaag 9480gcgccattaa ataacgtgta cttgtcctat tcttgtcggt gtggtcttgg gaaaagaaag 9480
cttgctggag gctgctgttc agccccatac attacttgtt acgattctgc tgactttcgg 9540cttgctggag gctgctgttc agccccatac attacktgtt acgattctgc tgactttcgg 9540
cgggtgcaat atctctactt ctgcttgacg aggtattgtt gcctgtactt ctttcttctt 9600cgggtgcaat atctctactt ctgcttgacg aggtattgtt gcctgtactt ctttcttctt 9600
cttcttgctg attggttcta taagaaatct agtattttct ttgaaacaga gttttcccgt 9660cttcttgctg attggttcta taagaaatct agtattttct ttgaaacaga gttttcccgt 9660
ggttttcgaa cttggagaaa gattgttaag cttctgtata ttctgcccaa attcgcgatg 9720ggttttcgaa cttggagaaa gattgttaag cttctgtata ttctgcccaa attcgcgatg 9720
gaacgagcta tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga 9780gaacgagcta tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga 9780
ggatcaggag gaagcacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag 9840ggatcaggag gaagcacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag 9840
tggcggctac ataacgatga gacgaactcg aatcaagata atccccttgg tttcaaggaa 9900tggcggctac ataacgatga gacgaactcg aatcaagata atccccttgg tttcaaggaa 9900
agctggggtt tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaggca 9960agctggggtt tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaggca 9960
tcactgcaca gagtccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga 10020tcactgcaca gagtcccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga 10020
tttttcggtt tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc 10080tttttcggtt tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc 10080
accgtttctg gagggtctcg aactcttcag catctctgtg agatggcaat tcggtctaag 10140accgtttctg gagggtctcg aactcttcag catctctgtg agatggcaat tcggtctaag 10140
caagaactgc tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct 10200caagaactgc tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct 10200
gaaggtactg agaccttcga aaaagaaagc gagggatctg gagttaagca aactttgaat 10260gaaggtactg agaccttcga aaaagaaagc gagggatctg gagttaagca aactttgaat 10260
tttgatttgt tgaagttggc tggagatgtt gaatctaatc ctggacctga gctcatgaca 10320tttgatttgt tgaagttggc tggagatgtt gaatctaatc ctggacctga gctcatgaca 10320
accttcaaaa tcgagtcccg catccacggc aacctcaacg gggagaagtt cgagttggtt 10380accttcaaaa tcgagtcccg catccacggc aacctcaacg gggagaagtt cgagttggtt 10380
ggaggtggag taggtgagga gggtcgcctc gagattgaga tgaagactaa agataaacca 10440ggaggtggag taggtgagga gggtcgcctc gagattgaga tgaagactaa agataaacca 10440
ctggcattct ctcccttcct gctgaccact tgcatgggtt acgggttcta ccacttcgcc 10500ctggcattct ctcccttcct gctgaccact tgcatgggtt acgggttcta ccacttcgcc 10500
agcttcccaa aggggattaa gaacatctat cttcatgctg caacgaacgg aggttacacc 10560agcttcccaa aggggattaa gaacatctat cttcatgctg caacgaacgg aggttacacc 10560
aacaccagga aggagatcta tgaagacggc ggcatcttgg aggtcaactt ccgttacact 10620aacaccagga aggagatcta tgaagacggc ggcatcttgg aggtcaactt ccgttacact 10620
tacgagttca acaagatcat cggtgacgtc gagtgcattg gacatggatt cccaagtcag 10680tacgagttca acaagatcat cggtgacgtc gagtgcattg gacatggatt cccaagtcag 10680
agtccgatct tcaaggacac gatcgtgaag agttgtccca cggtggacct gatgttgcca 10740agtccgatct tcaaggacac gatcgtgaag agttgtccca cggtggacct gatgttgcca 10740
atgtccggga acatcatcgc cagctcctac gcttacgcct tccaactgaa ggacggctct 10800atgtccggga acatcatcgc cagctcctac gcttacgcct tccaactgaa ggacggctct 10800
ttctacacgg cagaagtcaa gaacaacata gacttcaaga atccaatcca cgagtccttc 10860ttctacacgg cagaagtcaa gaacaacata gacttcaaga atccaatcca cgagtccttc 10860
tcgaagtcag ggcccatgtt cacccacaga cgtgtcgagg agactctcac caaggagaac 10920tcgaagtcag ggcccatgtt cacccacaga cgtgtcgagg agactctcac caaggagaac 10920
cttgccatag tggagtacca gcaggttttc aacagcgccc caagagacat gtagactagt 10980cttgccatag tggagtacca gcaggttttc aacagcgccc caagagacat gtagactagt 10980
ttaactctgg tttcattaaa ttttctttag tttgaattta ctgttattcg gtgtgcattt 11040ttaactctgg tttcattaaa ttttctttag tttgaattta ctgttattcg gtgtgcattt 11040
ctatgtttgg tgagcggttt tctgtgctca gagtgtgttt attttatgta atttaatttc 11100ctatgtttgg tgagcggttt tctgtgctca gagtgtgttt attttatgta atttaatttc 11100
tttgtgagca cctgtttagc aggtcgtccc ttcagcaagg acacaaaaag attttaattt 11160tttgtgagca cctgtttagc aggtcgtccc ttcagcaagg acacaaaaag attttaattt 11160
tattgatcgt tcaaacattt ggcaataaag tttcttaaga ttgaatcctg ttgccggtct 11220tattgatcgt tcaaacattt ggcaataaag tttcttaaga ttgaatcctg ttgccggtct 11220
tgcgatgatt atcatataat ttctgttgaa ttacgttaag catgtaataa ttaacatgta 11280tgcgatgatt atcatataat ttctgttgaa ttacgttaag catgtaataa ttaacatgta 11280
atgcatgacg ttatttatga gatgggtttt tatgattaga gtcccgcaat tatacattta 11340atgcatgacg ttattattga gatgggtttt tatgattaga gtcccgcaat tatacattta 11340
atacgcgata gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc 11400atacgcgata gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc 11400
atctatgtta ctagatcccc ggggatctcc tttgccccag agatcacaat ggacgacttc 11460atctatgtta ctagatcccc ggggatctcc tttgccccag agatcacaat ggacgacttc 11460
ctctatctct acgatctagt caggaagttc gacggagaag gtgacgatac catgttcacc 11520ctctatctct acgatctagt caggaagttc gacggagaag gtgacgatac catgttcacc 11520
actgataatg agaagattag ccttttcaat ttcagaaaga atgctaaccc acagatggtt 11580actgataatg agaagattag ccttttcaat ttcagaaaga atgctaaccc acagatggtt 11580
agagaggctt acgcagcagg tctcatcaag acgatctacc cgagcaataa tctccaggag 11640agagaggctt acgcagcagg tctcatcaag acgatctacc cgagcaataa tctccaggag 11640
atcaaatacc ttcccaagaa ggttaaagat gcagtcaaaa gattcaggac taactgcatc 11700atcaaatacc ttcccaagaa ggttaaagat gcagtcaaaa gattcaggac taactgcatc 11700
aagaacacag agaaagatat atttctcaag atcagaagta ctattccagt atggacgatt 11760aagaacacag agaaagatat atttctcaag atcagaagta ctattccagt atggacgatt 11760
caaggcttgc ttcacaaacc aaggcaagta atagagattg gagtctctaa aaaggtagtt 11820caaggcttgc ttcacaaacc aaggcaagta atagagattg gagtctctaa aaaggtagtt 11820
cccactgaat caaaggccat ggagtcaaag attcaaatag aggacctaac agaactcgcc 11880cccactgaat caaaggccat ggagtcaaag attcaaatag aggacctaac agaactcgcc 11880
gtaaagactg gcgaacagtt catacagagt ctcttacgac tcaatgacaa gaagaaaatc 11940gtaaagactg gcgaacagtt catacagagt ctcttacgac tcaatgacaa gaagaaaatc 11940
ttcgtcaaca tggtggagca cgacacactt gtctactcca aaaatatcaa agatacagtc 12000ttcgtcaaca tggtggagca cgacacactt gtctactcca aaaatatcaa agatacagtc 12000
tcagaagacc aaagggcaat tgagactttt caacaaaggg taatatccgg aaacctcctc 12060tcagaagacc aaagggcaat tgagactttt caacaaaggg taatatccgg aaacctcctc 12060
ggattccatt gcccagctat ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc 12120ggattccatt gcccagctat ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc 12120
tcctacaaat gccatcattg cgataaagga aaggccatcg ttgaagatgc ctctgccgac 12180tcctacaaat gccatcattg cgataaagga aaggccatcg ttgaagatgc ctctgccgac 12180
agtggtccca aagatggacc cccacccacg aggagcatcg tggaaaaaga agacgttcca 12240agtggtccca aagatggacc cccaccacg aggagcatcg tggaaaaaga agacgttcca 12240
accacgtctt caaagcaagt ggattgatgt gataacatgg tggagcacga cacacttgtc 12300accacgtctt caaagcaagt ggattgatgt gataacatgg tggagcacga cacacttgtc 12300
tactccaaaa atatcaaaga tacagtctca gaagaccaaa gggcaattga gacttttcaa 12360tactccaaaa atatcaaaga tacagtctca gaagaccaaa gggcaattga gacttttcaa 12360
caaagggtaa tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt 12420caaagggtaa tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt 12420
gtgaagatag tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 12480gtgaagatag tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 12480
gccatcgttg aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg 12540gccatcgttg aagatgcctc tgccgacagt ggtcccaaag atggaccccc accacgagg 12540
agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat 12600agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat 12600
atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 12660atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 12660
atataaggaa gttcatttca tttggagagg acacgctgaa atcaccagtc tctctctcaa 12720atataaggaa gttcatttca tttggagagg acacgctgaa atcaccagtc tctctctcaa 12720
gctt 12724gctt 12724
<210> 3<210> 3
<211> 1503<211> 1503
<212> DNA<212>DNA
<213> Catharanthus roseus<213> Catharanthus roseus
<400> 3<400> 3
atgggcagca ttgattcaac aaatgtagcc atgtccaatt ctccagttgg agaatttaag 60atgggcagca ttgattcaac aaatgtagcc atgtccaatt ctccagttgg agaatttaag 60
ccacttgaag ctgaggaatt ccgaaaacaa gcccatcgta tggtagattt catagccgat 120ccacttgaag ctgaggaatt ccgaaaacaa gcccatcgta tggtagattt catagccgat 120
tattacaaaa atgtggaaac atatccggtc cttagcgaag tcgaacctgg atatctccga 180tattacaaaa atgtggaaac atatccggtc cttagcgaag tcgaacctgg atatctccga 180
aaacgtatcc ccgaaaccgc tccttacctc cccgaatcac tcgacgacat catgaaagat 240aaacgtatcc ccgaaaccgc tccttacctc cccgaatcac tcgacgacat catgaaagat 240
attcagaagg atattatccc aggaatgaca aattggatga gccctaattt ttatgcattt 300attcagaagg atattatccc aggaatgaca aattggatga gccctaattt ttatgcattt 300
tttcctgcca ctgttagttc agctgccttt ttaggagaaa tgttgtctac tgccctaaat 360tttcctgcca ctgttagttc agctgccttt ttaggagaaa tgttgtctac tgccctaaat 360
tcagtaggct ttacttgggt ttcttcacca gccgccaccg aattagaaat gattgttatg 420tcagtaggct ttacttgggt ttcttcacca gccgccaccg aattagaaat gattgttatg 420
gattggttgg ctcagatcct taaactcccc aaatctttca tgttttcagg tacgggtggc 480gattggttgg ctcagatcct taaactcccc aaatctttca tgttttcagg tacgggtggc 480
ggcgtcatcc aaaacaccac tagtgagtcc attctttgta caatcattgc cgcccgggaa 540ggcgtcatcc aaaacaccac tagtgagtcc attctttgta caatcattgc cgcccgggaa 540
agggccctgg agaagctcgg tcccgatagt attggaaaac ttgtctgtta cggatcagat 600agggccctgg agaagctcgg tcccgatagt attggaaaac ttgtctgtta cggatcagat 600
caaacccata ccatgttccc aaaaacttgc aaattggcgg gaattttccc gaataatatt 660caaacccata ccatgttccc aaaaacttgc aaattggcgg gaattttccc gaataatatt 660
aggttaatac ctacaaccgt cgaaacggat ttcggcatct cacctcaagt tctacgaaaa 720aggttaatac ctacaaccgt cgaaacggat ttcggcatct cacctcaagt tctacgaaaa 720
atggtcgagg atgacgtggc ggccggatat gtaccgctgt tcttatgcgc taccctgggt 780atggtcgagg atgacgtggc ggccggatat gtaccgctgt tcttatgcgc taccctgggt 780
accacctcga ccacggctac cgatcctgtg gactcacttt ctgaaatcgc taacgagttt 840accacctcga ccacggctac cgatcctgtg gactcacttt ctgaaatcgc taacgagttt 840
ggtatttgga tccacgtgga tgcggcttat gcgggcagcg cctgtatatg tcccgagttc 900ggtatttgga tccacgtgga tgcggcttat gcgggcagcg cctgtatatg tcccgagttc 900
agacattact tggatggaat cgagcgagtt gactcactga gtctgagtcc acacaaatgg 960agacattact tggatggaat cgagcgagtt gactcactga gtctgagtcc acacaaatgg 960
ctactcgctt acttagattg cacttgcttg tgggtcaagc aaccacattt gttactaagg 1020ctactcgctt acttagattg cacttgcttg tgggtcaagc aaccacattt gttactaagg 1020
gcactcacta cgaatcctga gtatttaaaa aataaacaga gtgatttaga caaagttgtg 1080gcactcacta cgaatcctga gtatttaaaa aataaacaga gtgattaga caaagttgtg 1080
gacttcaaaa attggcaaat cgcaacggga cgaaaatttc ggtcgcttaa actttggctc 1140gacttcaaaa attggcaaat cgcaacggga cgaaaatttc ggtcgcttaa actttggctc 1140
attttacgta gctatggagt tgttaattta cagagtcata ttcgttctga cgtcgcaatg 1200attttacgta gctatggagt tgttaattta cagagtcata ttcgttctga cgtcgcaatg 1200
gcgaaaatgt tcgaagaatg ggttagatca gactccagat tcgaaattgt ggtaccaaga 1260gcgaaaatgt tcgaagaatg ggttagatca gactccagat tcgaaattgt ggtaccaaga 1260
aacttttctc ttgtttgttt tagattaaag cctgacgttt cgagtttaga tgtagaagaa 1320aacttttctc ttgtttgttt tagattaaag cctgacgttt cgagtttaga tgtagaagaa 1320
gtgaataaga aacttttgga tatgcttaac tcgacgggac gagtttatat gactcatact 1380gtgaataaga aacttttgga tatgcttaac tcgacgggac gagtttatat gactcatact 1380
attgtgggag gcatatacat gctaagactg gctgttggct catcgctaac tgaagaacat 1440attgtggggag gcatatacat gctaagactg gctgttggct catcgctaac tgaagaacat 1440
catgtacgcc gtgtttggga tttgattcaa aaattaaccg atgatttgct caaagaagct 1500catgtacgcc gtgtttggga tttgattcaa aaattaaccg atgatttgct caaagaagct 1500
tga 1503tga 1503
<210> 4<210> 4
<211> 1575<211> 1575
<212> DNA<212>DNA
<213> Catharanthus roseus<213> Catharanthus roseus
<400> 4<400> 4
atggagatgg atatggatac cattagaaag gcaattgctg ccactatttt tgcattggta 60atggagatgg atatggatac cattagaaag gcaattgctg ccactatttt tgcattggta 60
atggcttggg catggagagt gttggattgg gcatggttta ctcctaagag gatcgagaaa 120atggcttggg catggagagt gttggattgg gcatggtta ctcctaagag gatcgagaaa 120
cgtctaaggc agcaaggttt tagaggaaat ccttatagat tcttggttgg agatgttaag 180cgtctaaggc agcaaggttt tagaggaaat ccttatagat tcttggttgg agatgttaag 180
gagagtggaa aaatgcatca agaagccttg tctaaaccca tggagttcaa caatgatatt 240gagagtggaa aaatgcatca agaagccttg tctaaaccca tggagttcaa caatgatatt 240
gttcctcgcc tcatgccaca tattaaccac actatcaata cttacggtag gaattccttt 300gttcctcgcc tcatgccaca tattaaccac actatcaata cttacggtag gaattccttt 300
acatggatgg gaaggattcc aagaattcat gttatggaac ctgaacttat taaggaagta 360acatggatgg gaaggattcc aagaattcat gttatggaac ctgaacttat taaggaagta 360
ttgacccact caagcaaata ccaaaagaac tttgatgttc acaatcccct tgttaagttc 420ttgacccact caagcaaata ccaaaagaac tttgatgttc acaatcccct tgttaagttc 420
cttctcaccg gagttggaag ctttgagggt gcaaaatggt caaaacacag aagaattatt 480cttctcaccg gagttggaag ctttgagggt gcaaaatggt caaaacacag aagaattatt 480
tcccctgcct tcactcttga gaaactaaag tcaatgctgc cagcttttgc catatgctac 540tcccctgcct tcactcttga gaaactaaag tcaatgctgc cagcttttgc catatgctac 540
catgacatgt tgaccaaatg ggagaaaata gctgaaaaac aaggatccca tgaagttgat 600catgacatgt tgaccaaatg ggagaaaata gctgaaaaac aaggatccca tgaagttgat 600
atctttccca cgtttgatgt tttaacaagt gatgtgattt caaaggttgc atttggtagc 660atctttccca cgtttgatgt tttaacaagt gatgtgattt caaaggttgc atttggtagc 660
acatatgaag aaggaggcaa aatcttcaga ctattgaaag aactcatgga tctcacaatt 720acatatgaag aaggaggcaa aatcttcaga ctattgaaag aactcatgga tctcacaatt 720
gactgcatga gagatgtcta cattccagga tggagctact tgccaaccaa gaggaacaag 780gactgcatga gagatgtcta cattccagga tggagctact tgccaaccaa gaggaacaag 780
aggatgaaag aaattaacaa agagatcaca gatatgctaa ggttcatcat caacaagaga 840aggatgaaag aaattaacaa agagatcaca gatatgctaa ggttcatcat caacaagaga 840
atgaaggctt tgaaggctgg agagccaggt gaggatgact tgctgggagt attgttggaa 900atgaaggctt tgaaggctgg agagccaggt gaggatgact tgctgggagt attgttggaa 900
tcaaacattc aagaaattca aaagcaagga aacaagaagg atggtggaat gtcaatcaat 960tcaaacattc aagaaattca aaagcaagga aacaagaagg atggtggaat gtcaatcaat 960
gatgtaattg aggagtgcaa attgttctac tttgctggtc aagaaactac tggagtttta 1020gatgtaattg aggagtgcaa attgttctac tttgctggtc aagaaactac tggagtttta 1020
ctgacatgga ccaccatctt attgagcaag caccctgagt ggcaagagcg agctagagaa 1080ctgacatgga ccaccatctt attgagcaag caccctgagt ggcaagagcg agctagagaa 1080
gaagttctcc aagcctttgg caagaataaa cctgaatttg aacgcttaaa tcacctcaaa 1140gaagttctcc aagcctttgg caagaataaa cctgaatttg aacgcttaaa tcacctcaaa 1140
tatgtgtcta tgatcttgta cgaggttcta aggttgtacc caccagtgat tgatctaacc 1200tatgtgtcta tgatcttgta cgaggttcta aggttgtacc caccagtgat tgatctaacc 1200
aagattgtcc acgaggacac aaagttaggt ccgtacacaa ttcctgcagg aacacaagtg 1260aagattgtcc acgaggacac aaagttaggt ccgtacacaa ttcctgcagg aacacaagtg 1260
atgttgccaa cagtaatgct tcacagagag aagagcattt ggggagaaga tgcaacagaa 1320atgttgccaa cagtaatgct tcacagagag aagagcattt ggggagaaga tgcaacagaa 1320
ttcaacccaa tgagatttgc tgatggagtt gccaatgcaa ccaagaacaa tgtcacatat 1380ttcaacccaa tgagatttgc tgatggagtt gccaatgcaa ccaagaacaa tgtcacatat 1380
ttgccattca gttggggacc tagggtttgt cttggccaaa actttgcact tctgcaagca 1440ttgccattca gttggggacc tagggtttgt cttggccaaa actttgcact tctgcaagca 1440
aaattaggat tggcaatgat tttacaacgc ttcacgtttg atgttgctcc atcctatgtt 1500aaattaggat tggcaatgat tttacaacgc ttcacgtttg atgttgctcc atcctatgtt 1500
catgctcctt ttaccattct cacagttcaa ccccagtttg gttctcatgt catctacaag 1560catgctcctt ttaccattct cacagttcaa ccccagtttg gttctcatgt catctacaag 1560
aagcttgaga gctag 1575aagcttgaga gctag 1575
<210> 5<210> 5
<211> 1668<211> 1668
<212> DNA<212>DNA
<213> Catharanthus roseus<213> Catharanthus roseus
<400> 5<400> 5
atgggatcta aagatgatca gtcccttgtt gttgccattt ctccagctgc tgaaccaaat 60atgggatcta aagatgatca gtcccttgtt gttgccattt ctccagctgc tgaaccaaat 60
ggaaatcatt ctgtccccat cccattcgcc taccccagta tccccattca acctagaaag 120ggaaatcatt ctgtccccat cccattcgcc taccccagta tccccattca acctagaaag 120
cacaacaagc ccatcgttca tcgtcgagat ttcccctcag atttcatctt gggtgccgga 180cacaacaagc ccatcgttca tcgtcgagat ttcccctcag atttcatctt gggtgccgga 180
ggatctgctt atcagtgtga gggtgcatat aatgaaggca accgcggtcc cagtatatgg 240ggatctgctt atcagtgtga gggtgcatat aatgaaggca accgcggtcc cagtatatgg 240
gatactttca caaaccgata tccagccaaa atagctgatg gatctaatgg caatcaagcc 300gatactttca caaaccgata tccagccaaa atagctgatg gatctaatgg caatcaagcc 300
atcaattctt acaatttgta caaggaagat atcaagatta tgaagcaaac aggcttggaa 360atcaattctt acaatttgta caaggaagat atcaagatta tgaagcaaac aggcttggaa 360
tcatataggt tttcaatttc atggtcaaga gtattgccag gtggaaatct atccggtgga 420tcatataggt tttcaatttc atggtcaaga gtattgccag gtggaaatct atccggtgga 420
gtgaataaag atggtgtcaa gttctatcat gactttatag atgagcttct agccaatggc 480gtgaataaag atggtgtcaa gttctatcat gactttatag atgagcttct agccaatggc 480
atcaaaccct ttgcaactct cttccactgg gatcttcccc aagctcttga agacgagtat 540atcaaaccct ttgcaactct cttccactgg gatcttcccc aagctcttga agacgagtat 540
ggaggcttct tgagtgatcg aattgtggaa gattttacgg agtatgcaga attttgcttt 600ggaggcttct tgagtgatcg aattgtggaa gattttacgg agtatgcaga attttgcttt 600
tgggaattcg gtgacaaagt aaaattttgg acgactttca atgaaccaca tacttatgtt 660tgggaattcg gtgacaaagt aaaattttgg acgactttca atgaaccaca tacttatgtt 660
gcaagtggat atgccactgg tgaatttgca ccaggaagag gtggtgcaga tggcaagggg 720gcaagtggat atgccactgg tgaatttgca ccaggaagag gtggtgcaga tggcaagggg 720
aaccctggca aagaacccta tatagcgaca cataatttac ttctttctca caaagctgct 780aaccctggca aagaacccta tatagcgaca cataatttac ttctttctca caaagctgct 780
gtggaagtat ataggaaaaa ttttcagaaa tgtcaaggag gtgaaattgg aattgtactt 840gtggaagtat ataggaaaaa ttttcagaaa tgtcaaggag gtgaaattgg aattgtactt 840
aattcaatgt ggatggagcc tctcaatgaa accaaagaag atattgatgc tcgggaaagg 900aattcaatgt ggatggagcc tctcaatgaa accaaagaag atattgatgc tcgggaaagg 900
ggtcttgatt tcatgctcgg atggttcata gagccattaa caacgggtga atacccaaaa 960ggtcttgatt tcatgctcgg atggttcata gagccattaa caacgggtga atacccaaaa 960
tccatgagag ctcttgtagg aagccgtctt ccagaatttt caacagaaga ttccgaaaaa 1020tccatgagag ctcttgtagg aagccgtctt ccagaatttt caacagaaga ttccgaaaaa 1020
ttaacaggat gctatgattt tatcggaatg aattattata caactactta tgtttctaat 1080ttaacaggat gctatgattt tatcggaatg aattattata caactactta tgtttctaat 1080
gcagacaaaa ttcccgatac tccgggttac gaaacagatg ctcgaattaa taagaatatt 1140gcagacaaaa ttcccgatac tccgggttac gaaacagatg ctcgaattaa taagaatatt 1140
tttgtcaaaa aagttgatgg gaaggaagtg cgcattggtg aaccgtgcta tgggggatgg 1200tttgtcaaaa aagttgatgg gaaggaagtg cgcattggtg aaccgtgcta tgggggatgg 1200
cagcatgttg ttccatctgg actctacaat ctcttggttt acactaagga gaaataccat 1260cagcatgttg ttccatctgg actctacaat ctcttggttt acactaagga gaaataccat 1260
gttccagtga tttatgtctc agaatgtggt gtggttgagg aaaatagaac caacatatta 1320gttccagtga tttatgtctc agaatgtggt gtggttgagg aaaatagaac caacatatta 1320
cttacagaag gtaaaaccaa catattactt acagaagctc gtcacgataa actcagggtt 1380cttacagaag gtaaaaccaa catattactt acagaagctc gtcacgataa actcagggtt 1380
gattttctac aaagtcatct cgctagcgtg cgagatgcta ttgatgatgg tgtgaatgta 1440gattttctac aaagtcatct cgctagcgtg cgagatgcta ttgatgatgg tgtgaatgta 1440
aaaggattct ttgtttggtc attcttcgac aacttcgaat ggaatttggg atatatatgc 1500aaaggattct ttgtttggtc attcttcgac aacttcgaat ggaatttggg atatatatgc 1500
cgttatggaa ttatccatgt tgattataaa acttttcaaa gatatccaaa ggattctgcc 1560cgttatggaa ttatccatgt tgattataaa acttttcaaa gatatccaaa ggattctgcc 1560
atatggtaca agaatttcat tagtgaagga tttgttacga atacagctaa aaagagattc 1620atatggtaca agaatttcat tagtgaagga tttgttacga atacagctaa aaagagattc 1620
cgagaagaag ataaactagt tgagttagtc aagaagcaaa aatactaa 1668cgagaagaag ataaactagt tgagttagtc aagaagcaaa aatactaa 1668
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010104155.5A CN113293173B (en) | 2020-02-20 | 2020-02-20 | Agrobacterium tumefaciens binary expression vector |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010104155.5A CN113293173B (en) | 2020-02-20 | 2020-02-20 | Agrobacterium tumefaciens binary expression vector |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113293173A CN113293173A (en) | 2021-08-24 |
CN113293173B true CN113293173B (en) | 2023-02-03 |
Family
ID=77317509
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010104155.5A Active CN113293173B (en) | 2020-02-20 | 2020-02-20 | Agrobacterium tumefaciens binary expression vector |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113293173B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113817766A (en) * | 2021-09-24 | 2021-12-21 | 沈阳农业大学 | Gene expression cassette, recombinant expression vector, preparation method and application thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101012462B (en) * | 2006-11-14 | 2010-05-12 | 西南大学 | Rauwolfia tryptophan decarboxylase protein coding sequence |
EP2118274A4 (en) * | 2007-01-29 | 2010-09-01 | Univ Ohio State Res Found | SYSTEM FROM A VIRAL EXPRESSION VECTOR USED TO EXPRESS GENES IN PLANTS |
CN102703501B (en) * | 2012-05-24 | 2015-04-01 | 上海交通大学 | Method for increasing content of vinca alkaloids in vinca by corotation of orca3/g10h genes |
CN106222197A (en) * | 2013-07-16 | 2016-12-14 | 中国科学院上海生命科学研究院 | Plant Genome pointed decoration method |
CN104762314A (en) * | 2015-01-30 | 2015-07-08 | 西南大学 | Screening marker gene-deletable plant expression vector and use thereof |
-
2020
- 2020-02-20 CN CN202010104155.5A patent/CN113293173B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN113293173A (en) | 2021-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109957569B (en) | Base editing system and method based on CPF1 protein | |
KR102061438B1 (en) | A method for converting monocot genome sequences in which a nucleic acid base in a targeting DNA sequence is specifically converted, and a molecular complex used therein. | |
AU2016203445B2 (en) | Integration of a polynucleotide encoding a polypeptide that catalyzes pyruvate to acetolactate conversion | |
DK2931918T5 (en) | PROCEDURE FOR IDENTIFYING A CELL WITH INCREASED CONCENTRATION OF A PARTICULAR METABOLIT COMPARED TO THE SIMILAR WILD TYPE CELL ..... | |
KR101850162B1 (en) | Transformation plasmid | |
CN112941087B (en) | Application of corn ZmBES1/BZR1-2 gene in improving plant drought tolerance | |
KR20220012327A (en) | Methods and cells for production of phytocannabinoids and phytocannabinoid precursors | |
CN110511955A (en) | Infectious clone of plant cytoplasmic rhabdovirus, expression vector and its construction method and application | |
CN107603980A (en) | A kind of Kiwi berry Gene A cPDS based on PTG Cas9 edits carrier and its construction method and application | |
CN113293173B (en) | Agrobacterium tumefaciens binary expression vector | |
CN115820666B (en) | A kind of Dioscorea chrysanthemum DcW5 gene and its application in drought stress and salt stress tolerance | |
JP5913434B2 (en) | Artificial DNA sequence with optimized leader function in 5 '(5'-UTR) for improved expression of heterologous proteins in plants | |
CN118325926B (en) | Application of Chilo suppressalis salivary protein disulfide isomerase in improving resistance of tobacco to spodoptera frugiperda and aphids and method | |
CN105349553A (en) | Flower color adjusting gene VwMYB (Myoblastoma) 29 and application thereof | |
CN118703458A (en) | Taro CePDS gene and its VIGS virus vector and silencing method | |
CN114656530B (en) | Peanut bacterial wilt effector protein RipAU and application thereof in peanut bacterial wilt resistance | |
CN110747186B (en) | CRISPR/Cas9 systems and methods for efficient generation of mutants that do not carry transgenic elements in plants | |
CN114214332B (en) | Tianmu rehmannia anthocyanin related gene RcMYB1 and application thereof | |
CN114317589B (en) | Application of SpRYn-ABE base editing system in plant genome base substitution | |
CN109642236A (en) | Drought resistant polygene constructs | |
CN113265388B (en) | Tobacco system for producing ajmaline | |
CN116970046A (en) | 2A peptide mutant applicable to rhodosporidium toruloides as well as screening method and application thereof | |
CN117126865B (en) | LbaMYB44 gene for promoting carotenoid content accumulation and application thereof | |
CN114317596B (en) | A method for mutating A to G in target sequence of plant genome | |
CN110628794B (en) | Cell enrichment technology of C·T base substitution using inactivated screening agent resistance gene as reporter system and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |