CN107177625B - Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method
- Publication number: CN107177625B
- Application number: CN201710383003.1A
- Authority: CN (China)
- Prior art keywords: nucleotide sequence, regulatory element, seq, rice, site
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
- C12N15/8205—Agrobacterium mediated transformation
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
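Abstract
The present application relates to an artificial system for site-directed base substitution in the rice genome and to a site-directed mutagenesis method. The artificial system comprises: regulatory element I, which comprises a nucleotide sequence capable of encoding amino acid sequence I, wherein amino acid sequence I is selected from one of SEQ ID Nos. 1-6; and regulatory element II, which comprises, in order from the 5' end to the 3' end, nucleotide sequence II-1 and nucleotide sequence II-2. Nucleotide sequence II-1 comprises a target nucleotide sequence, and nucleotide sequence II-2 comprises an sgRNA nucleic acid sequence derived from Streptococcus pyogenes. Nucleotide sequence II-1 and nucleotide sequence II-2 are transcriptionally fused, and the resulting product can guide the protein encoded by regulatory element I to the target site to be mutated in the genome of the target organism and induce mutation of the C at the target site to one of T, A and G, or of the G to one of A, T and C.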
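Description
Technical Field
The present invention relates to an artificial system for site-directed base mutation in plant genomes and to a site-directed mutagenesis method, and in particular to an artificial system for single-base substitution in the rice genome.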
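Background Art
Rice (Oryza sativa L.) is one of the world's major food crops; nearly half of the world's population, including almost the entire population of East and Southeast Asia, relies on rice as a staple food. In China, rice accounts for 1/4 of the national sown area of grain crops but more than half of total grain output. Research to raise yield, improve grain quality, and enhance the disease resistance and stress tolerance of rice plants, so as to ensure a stable food supply, is a major issue for the sustainable development of human society. Rice is also a model system for monocotyledonous plants; its research techniques, methods, theories and results provide important guidance for other gramineous plants such as wheat, maize and sorghum.
Since the beginning of the twenty-first century, the continual emergence of technologies such as whole-genome sequencing, gene mining and gene editing has allowed modern biotechnology to play an unprecedented role in agriculture. Among these, identifying gene function and applying genes of major practical value through transformation are the most critical steps. With the rapid development of large-scale genomic data projects, massive amounts of plant genome sequence await interpretation, while gene mining and application lag seriously behind and have therefore become all the more urgent. It is worth noting that transgenic materials obtained by modern biotechnology raise biosafety issues concerning the field release of genetically modified organisms, which have caused widespread public concern in recent years.
At present, in plant research, CRISPR/Cas9 technology has achieved remarkable results in model plants and crops such as Arabidopsis, tobacco, rice, soybean, tomato, maize, barley, wheat, potato and mushroom, yielding a large number of gene mutant materials and gene-edited products. However, these editing events were essentially all obtained through the non-homologous end joining repair mechanism of plant cells and are loss-of-function mutants.
Obtaining gain-of-function mutants is of even greater significance for basic and applied research. Traditionally, however, gain-of-function mutants are obtained through the homologous recombination mechanism. In most complex organisms, the non-homologous end joining repair mechanism dominates during the insertion of exogenous DNA fragments, and homologous recombination occurs with very low probability, so that the efficiency of obtaining customized mutants is only 0.1% to 2%. The recently developed single-base substitution technology mediated by cytosine deaminase offers an entirely new approach to obtaining gain-of-function mutants efficiently. However, preliminary results show that existing single-base substitution technology, such as the rBE3 system, is based on the mouse-derived APOBEC1 protein, which prefers TC and CC target sites and edits GC and AC target sites with low efficiency.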
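Summary of the Invention
Because GC and AC sites are extremely abundant in the rice genome, and in order to overcome the insensitivity of the prior art to GC and AC sites, the present application provides an artificial system comprising:
regulatory element I, which comprises a nucleotide sequence capable of encoding amino acid sequence I, wherein amino acid sequence I is selected from one of SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5 and SEQ ID No. 6;
regulatory element II, which comprises, in order from the 5' end to the 3' end, nucleotide sequence II-1 and nucleotide sequence II-2; nucleotide sequence II-1 comprises a target nucleotide sequence; nucleotide sequence II-2 comprises an sgRNA nucleic acid sequence derived from Streptococcus pyogenes; nucleotide sequence II-1 and nucleotide sequence II-2 are transcriptionally fused, and the resulting product can guide the protein encoded by regulatory element I to the target site to be mutated in the genome of the target organism and mutate the C at the target site to one of T, A and G, or mutate the G at the target site to one of A, T and C;
when there are multiple regulatory elements II, the nucleotide sequences II-1 contained in them differ from one another pairwise. In addition, when there are multiple regulatory elements II, they may be linked together in tandem.
By using the artificial system of the present application, a specific C at an endogenous site of the rice genome can be mutated to one of T, A and G, or a specific G to one of A, T and C, and rice gain-of-function mutants can be obtained by screening. Note, however, that the target nucleotide sequence used is the nucleotide sequence on the strand that carries the C at the target site. Based on the verification of SEQ ID No. 2 in the present application, together with bioinformatic analysis and common technical knowledge in the field, it can reasonably be concluded that SEQ ID No. 1, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5 and SEQ ID No. 6 are also applicable to the artificial system of the present application.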
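In one specific embodiment, the nucleotide sequence of regulatory element I is a nucleotide sequence suitable for expression in rice, and the nucleotide sequence of regulatory element II is a nucleotide sequence suitable for transcription in rice.
In one specific embodiment, the nucleotide coding sequence capable of encoding the protein shown in SEQ ID No. 1 is shown in SEQ ID No. 7; that for the protein shown in SEQ ID No. 2 is shown in SEQ ID No. 8; that for the protein shown in SEQ ID No. 3 is shown in SEQ ID No. 9; that for the protein shown in SEQ ID No. 4 is shown in SEQ ID No. 10; that for the protein shown in SEQ ID No. 5 is shown in SEQ ID No. 11; and that for the protein shown in SEQ ID No. 6 is shown in SEQ ID No. 12. After codon optimization analysis, one of the nucleotide sequences suitable for expressing the amino acid sequence shown in SEQ ID No. 2 in rice was randomly selected in the present application and is shown in SEQ ID No. 8. Accordingly, based on the nucleotide sequence information shown in SEQ ID No. 8, together with bioinformatic analysis and common technical knowledge in the field, nucleotide sequences suitable for expressing the amino acid sequences shown in SEQ ID No. 1, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5 and SEQ ID No. 6 in rice can reasonably be derived.
In one specific embodiment, nucleotide sequence II-2 is shown in SEQ ID No. 13.
In one specific embodiment, nucleotide sequence II-1 comprises recognition sites of a type IIS restriction endonuclease, and the target nucleotide sequence is cloned by cleavage at these type IIS restriction endonuclease sites, so that nucleotide sequence II-1 is transcriptionally fused with nucleotide sequence II-2; when there are multiple regulatory elements II, the type IIS restriction endonuclease sites used to clone the different target nucleotide sequences differ from one another pairwise.
Because the target nucleotide sequence varies with the gene editing site, the other elements can be constructed in advance, including the restriction endonuclease sites cloned beforehand at the relevant positions. Before use, the target nucleotide sequence is then cloned, according to the purpose of the gene editing, by cleavage at the restriction endonuclease sites. When there are multiple regulatory elements II, the restriction endonuclease sites in the nucleotide sequences II-1 contained in them differ from one another pairwise, which effectively ensures that the different target nucleotide sequences are each cloned into the intended position. Multiple target nucleotide sequences can be used for base substitution at multiple target sites to be mutated in the genome of the target organism.
In one specific embodiment, the nucleotide sequence of the cloning site preferably comprises SEQ ID No. 14 and/or SEQ ID No. 15.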
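In one specific embodiment, the target nucleotide sequence is determined as follows:
1) determining the nucleotide sequence in the rice genome that needs to be modified;
2) determining whether the nucleotide sequence identified in step 1), or its reverse complement, carries a nucleotide C to be mutated, and determining whether the change caused by mutating that C to one of T, A and G, or by mutating the nucleotide G to be mutated to one of A, T and C, meets expectations;
3) screening for target sequences in the nucleotide sequence to be modified or in its reverse complement: searching in the 3' direction from the nucleotide C to be mutated to confirm the presence of a recognition motif that can be recognized by amino acid sequence I, with the nucleotide C to be mutated located at positions -19 to -13 upstream of the 5' end of the recognition motif; the 17 to 21 nucleotides upstream of the 5' end of the recognition motif (excluding the recognition motif itself) are thereby determined to be the target nucleotide sequence.
In one specific embodiment, when regulatory element I contains a nucleotide sequence capable of encoding one of the amino acid sequences shown in SEQ ID No. 1, SEQ ID No. 2 and SEQ ID No. 3, the recognition motif is one of 5'-NGG-3', 5'-NGA-3', 5'-GAGN-3' and 5'-AAGN-3', the target nucleotide sequence is the 17 to 21 nucleotides upstream of the 5' end of the recognition motif, and nucleotide sequences containing five consecutive T's are discarded; wherein N is one of A, G, C and T.
In one specific embodiment, when regulatory element I contains a nucleotide sequence capable of encoding one of the amino acid sequences shown in SEQ ID No. 4, SEQ ID No. 5 and SEQ ID No. 6, the recognition motif is one of 5'-NGA-3', 5'-TGCG-3', 5'-TGTG-3', 5'-GAAG-3' and 5'-CGCG-3', the target nucleotide sequence is the 17 to 21 nucleotides upstream of the 5' end of the recognition motif, and nucleotide sequences containing five consecutive T's are discarded; wherein N is one of A, G, C and T.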
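The target selection rules above can be illustrated with a minimal Python sketch. It is not part of the original disclosure. It assumes that position -1 is the nucleotide immediately 5' of the recognition motif (so that a 20-nt target spans positions -20 to -1), and the function names `find_targets` and `reverse_complement` are hypothetical. The motif `TGG` appended to the Pi-d2 target in the usage note is a placeholder, not the genomic sequence.

```python
import re

# Assumed convention: position -1 is the nucleotide immediately 5' of the
# recognition motif, so a 20-nt target spans positions -20 to -1.
WINDOW = range(-19, -12)  # editing window, positions -19 .. -13 inclusive


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_targets(strand, motif="NGG", length=20):
    """Yield (target, motif_found, c_positions) for one strand.

    A candidate is kept only if it has no run of five T's and carries
    at least one C inside the assumed editing window.
    """
    # Lookahead so that overlapping motif occurrences are all reported.
    pattern = re.compile("(?=(" + motif.replace("N", "[ACGT]") + "))")
    for m in pattern.finditer(strand):
        start = m.start() - length
        if start < 0:
            continue  # not enough sequence upstream of the motif
        target = strand[start:m.start()]
        if "TTTTT" in target:
            continue  # discard targets with five consecutive T's
        c_positions = [p for p in WINDOW if target[length + p] == "C"]
        if c_positions:
            yield target, m.group(1), c_positions
```

To search both strands, the same function could be called on `strand` and on `reverse_complement(strand)`. For example, `list(find_targets("GAGCATAATGACAATAATAA" + "TGG"))` reports the C at position -17 of the target shown in SEQ ID No. 16.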
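In one specific embodiment, the target nucleotide sequence is shown in SEQ ID No. 16.
In one specific embodiment, the artificial system further comprises, at the 5' end of regulatory element I, a first promoter that is usable in rice and capable of initiating transcription of regulatory element I; and/or the artificial system further comprises, at the 5' end of regulatory element II, a second promoter that is usable in rice and capable of initiating transcription of regulatory element II.
In one specific embodiment, the first promoter is an RNA polymerase II promoter; and/or the second promoter is an RNA polymerase III promoter.
In one specific embodiment, the first promoter is SEQ ID No. 17; and/or the second promoter is SEQ ID No. 18 and/or SEQ ID No. 19.
In one specific embodiment, the artificial system further comprises, at the 3' end of regulatory element I, a first terminator capable of terminating transcription of regulatory element I; and/or the artificial system further comprises, at the 3' end of regulatory element II, a second terminator capable of terminating transcription of regulatory element II.
In one specific embodiment, the first terminator is SEQ ID No. 20; and/or the second terminator is SEQ ID No. 21.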
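In one specific embodiment, regulatory element I and regulatory element II can be cloned into at least one vector. For example, the expression cassette of regulatory element I and the expression cassette of regulatory element II can be cloned or integrated into the same vector; alternatively, when the two expression cassettes are mixed or located on separate vectors, the two expression cassettes, or the vectors containing them, can be introduced into rice callus or rice protoplasts by the gene gun method, the Agrobacterium infection method or PEG-mediated transformation.
In one specific embodiment, regulatory element I can be cloned into pUbi-ccdB, and regulatory element II is cloned into the entry vector pENTR4. pUbi-ccdB is a binary vector based on the Gateway reaction and used for genetic transformation of rice.
In one specific embodiment, the first promoter, regulatory element I and the first terminator can be cloned into the pUbi-ccdB vector.
In one specific embodiment, the second promoter, regulatory element II and the second terminator are cloned into the pENTR4 vector.
The second aspect of the present application provides use of any artificial system of the first aspect for site-directed mutation of C in the rice genome to one of T, A and G, or of G in the rice genome to one of A, T and C.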
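The third aspect of the present application provides a method for site-directed mutation of C in the rice genome to T, A or G, comprising the following steps:
1) introducing any artificial system of the first aspect into rice callus by an Agrobacterium-mediated method, and then culturing to obtain rice plants;
2) screening to obtain rice plants containing the site-directed mutation.
Further, the rice plants can produce seeds containing the site-directed base substitution, so that the rice acquires a new economic trait.
When introducing the artificial system, the Agrobacterium infection method may be used, or alternatively the gene gun method or PEG-mediated transformation may be used to introduce the artificial system into rice callus or rice protoplasts, as will be readily understood by those skilled in the art.
As is well known to those skilled in the art, rice genomic DNA consists of two strands; the target nucleotide sequence may therefore lie on either strand, provided that the target site is a C. For example, when the target nucleotide sequence lies on one strand of a functional gene and the C at a specific site of that gene is mutated to one of T, A and G, the system can be used if one of these mutations yields the expected amino acid in the corresponding functional protein. That is, the C in a triplet codon can be replaced by one of T, A and G through direct base substitution, or the G in a triplet codon can be replaced by one of A, T and C through indirect base substitution, so that the triplet codon encodes the desired amino acid, giving a rice gain-of-function mutant. Alternatively, when the target nucleotide sequence lies on the other strand of the functional gene and the G at a specific site of that gene is mutated to one of A, T and C, the system can likewise be used if one of these mutations yields the expected amino acid in the corresponding functional protein. That is, the G on that strand can be replaced by one of A, T and C to change the amino acid encoded by the triplet codon, or the G in a triplet codon can be replaced by one of A, T and C through indirect base substitution, giving a rice gain-of-function mutant.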
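The codon-level reasoning above can be sketched in Python as follows. This is an illustrative helper rather than anything from the original disclosure. It assumes the standard genetic code, and the name `substitution_outcomes` is hypothetical. It lists the amino acid encoded after each single substitution of a chosen base within a coding-strand codon. A C on the opposite strand appears as a G on the coding strand, which is why the `base` argument can be set to `"G"`.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def substitution_outcomes(codon, base="C"):
    """Map (codon index, new base) -> amino acid for every single
    substitution of `base` within a coding-strand codon."""
    outcomes = {}
    for i, nt in enumerate(codon):
        if nt != base:
            continue
        for new in "ACGT":
            if new != base:
                mutant = codon[:i] + new + codon[i + 1:]
                outcomes[(i, new)] = CODON_TABLE[mutant]
    return outcomes
```

For instance, `substitution_outcomes("CAT")` shows that replacing the C of a histidine codon with T, A or G gives tyrosine, asparagine or aspartate respectively. The user then decides which outcome, if any, matches the intended amino acid change.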
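The beneficial effects of the present application are as follows:
a) There may be multiple regulatory elements II, so that multiple gene target sites in a rice cell can be edited simultaneously.
b) The artificial system of the present application has high base editing efficiency, particularly at GC and AC target sites, where it reaches 26.9%, whereas the existing rBE3 system cannot achieve the corresponding editing at this site. The artificial system of the present application therefore has broad application value in rice gene function research and molecular breeding.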
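Brief Description of the Drawings
Figure 1 shows the position on the Pi-d2 gene of the targeting sequence used in the example of the present application, together with the nucleotide mutation information of the Pi-d2 gain-of-function mutants obtained with the rBE5 system. The recognition motif sequence is underlined with a solid line, the target nucleotide sequence is shown in bold, and the amino acid substitution site is underlined with a dashed line.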
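Detailed Description of the Embodiments
Source of the pUbi-ccdB vector: the pUbi-ccdB vector was derived in our laboratory by modifying pCAMBIA1300; an attR1-ccdB-attR2 module was inserted so that, in the Gateway reaction, it accepts the attL1-targeting sequence transcription module-attL2 module from the entry vector.
Source of the pENTR4 vector: purchased from Invitrogen (USA).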
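Example 1
1. Vector construction
By routine DNA cloning procedures, the maize constitutive promoter Ubi-p (SEQ ID No. 17), SEQ ID No. 8 and the Nos terminator (SEQ ID No. 20) were cloned, in order from 5' to 3', into the pUbi-ccdB vector; the resulting vector was named pUbi:rBE5 and was used for studies in transgenic rice plants.
The OsU6-p promoter (SEQ ID No. 18), two BsaI restriction sites (SEQ ID No. 14), the sgRNA sequence (SEQ ID No. 13), the (T)8 terminator (SEQ ID No. 21), the japonica rice U6 snRNA promoter (SEQ ID No. 19), two BtgZI restriction sites (SEQ ID No. 15), the sgRNA sequence (SEQ ID No. 13) and the (T)8 terminator (SEQ ID No. 21) were cloned, in order from 5' to 3', into the multiple cloning site of the pENTR4 vector; the resulting vector was named pENTR4:sgRNA. The two BtgZI sites or the two BsaI sites can each be used to clone the targeting sequence of a specific gene, as described below.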
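2. Design and cloning of the recognition sequence for the Pi-d2 gene
The transcript sequence and genomic sequence of each gene were obtained from the MSU rice genome database (http://rice.plantbiology.msu.edu/). 1) Determine whether the plus or minus strand at the editing site carries the base C, and whether the amino acid change caused by mutating that C to T, A or G meets expectations; 2) search in the 3' direction for motifs such as NGG, NGA, GAGN and AAGN, where N is A, G, C or T, such that the C to be mutated lies at positions -19 to -13 upstream of the motif; 3) synthesize a 19-20 bp targeting sequence and construct the vector.
The target nucleotide sequence for the Pi-d2 gene is 5'-GAGCATAATGACAATAATAA-3' (SEQ ID No. 16). The primers gPi-d2-F1 (5'-GTGTGAGCATAATGACAATAATAA-3', SEQ ID No. 22) and gPi-d2-R1 (5'-AAACTTATTATTGTCATTATGCTC-3', SEQ ID No. 23) were synthesized, phosphorylated with T4 polynucleotide kinase, annealed to form a duplex, and cloned into the BsaI sites of the pENTR4:sgRNA vector. Sequencing confirmed that the inserted fragment (the target nucleotide sequence) was entirely correct, giving pENTR4:gPi-d2. pENTR4:gPi-d2 was linearized by digestion with AatII, and the transcription cassette of regulatory element II was then transferred into pUbi:rBE5 by the Gateway reaction, giving the final vector pUbi:rBE5-gPi-d2. The sequence of the site to be modified on the Pi-d2 gene is 5'-TTATTATTGTCATTATGCTC-3' (SEQ ID No. 24), which is the reverse complement of SEQ ID No. 16; the sequence after gene editing is 5'-TTATTATTGTCATTATACTC-3' (SEQ ID No. 25).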
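The relationship between the target sequence, the two cloning primers and the sequences before and after editing can be checked with a short Python sketch. The 4-nt prefixes `GTGT` and `AAAC` are read off the two primers given above; treating them as the overhangs required by the BsaI-digested vector is an assumption. The function names and the position convention, with -1 as the last nucleotide of the target, are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def design_oligos(target, fwd_prefix="GTGT", rev_prefix="AAAC"):
    """Return (forward, reverse) oligos for cloning a target sequence.

    The default prefixes are those seen in gPi-d2-F1 and gPi-d2-R1; other
    vectors may need different overhangs.
    """
    return fwd_prefix + target, rev_prefix + reverse_complement(target)


def edit_base(target, position, new_base):
    """Replace the base at a negative position (-1 = last nucleotide)."""
    index = len(target) + position
    return target[:index] + new_base + target[index + 1:]


target = "GAGCATAATGACAATAATAA"          # SEQ ID No. 16
forward, reverse = design_oligos(target)  # expected: SEQ ID Nos. 22 and 23
edited = edit_base(target, -17, "T")      # C to T on the targeted strand
```

Here `reverse_complement(target)` reproduces SEQ ID No. 24, and `reverse_complement(edited)` reproduces SEQ ID No. 25. A C to T change on the targeted strand therefore reads as a G to A change on the strand shown in SEQ ID Nos. 24 and 25.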
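3. Transformation of the japonica rice cultivar Kitaake with the rBE system
1) Rice callus induction:
Dehusked immature rice seeds were treated with 50% commercial disinfectant for 25 minutes and washed 3-5 times with sterile water; the seeds were then transferred to a sterile Petri dish and excess water was removed. The seeds were placed on MSD plates (4.43 g/L MS powder; 30 g/L sucrose; 2 ml/L 2,4-D; 8 g/L plant gel; pH 5.7) and cultured in a lighted culture room for 10 days to induce callus formation. The embryos and shoots of the seeds were removed, and the callus was transferred to a fresh MSD plate and cultured for 5 days, until it was ready for Agrobacterium transformation.
2) Agrobacterium transformation:
pUbi:rBE5-gPi-d2 was introduced into Agrobacterium strain EHA105 by electroporation, and the cells were cultured overnight (12 hours) in LB medium. The Agrobacterium cells were collected and resuspended in MSD solution to OD600 = 0.1 for later use.
3) Agrobacterium infection of rice callus:
The callus was immersed in the above Agrobacterium suspension for 30 minutes; the suspension was removed and the callus was transferred to sterile absorbent paper. The callus was then transferred to fresh MSD medium containing 100 μM acetosyringone and cultured at room temperature in the dark for 3 days.
4) Screening of resistant rice callus:
After the dark culture, the callus was transferred to MSD medium (200 mg/L Timentin; 50 mg/L hygromycin B) and cultured in the light for 2 weeks to 1 month, until resistant callus appeared on the surface of the callus. The resistant callus was transferred to fresh MSD medium (200 mg/L Timentin; 50 mg/L hygromycin B), and the medium was changed every 2 weeks.
5) Differentiation and rooting of resistant callus:
The resistant callus was transferred to regeneration medium (4.43 g/L MS powder; 30 g/L sucrose; 25 g/L sorbitol; 0.5 mg/L NAA; 3 mg/L BA; 100 mg/L Timentin; 50 mg/L hygromycin B; 12 g/L agar powder; pH 5.7) and subcultured every 7-10 days until plantlets developed. The plantlets were transferred to 1/2 MS medium (2.21 g/L MS powder; 15 g/L sucrose; 8 g/L plant gel; pH 5.7) for rooting.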
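4. Identification of the Pi-d2 target site in T0 transgenic rice
Genomic DNA was extracted from resistant callus and transgenic plants by the CTAB method. Specific PCR primers were designed according to the DNA sequence of the Pi-d2 target site: Pi-d2-F1: 5'-CGGGTTGTAAGAGTGCCTGT-3' (SEQ ID No. 26) and Pi-d2-R1: 5'-CTCCAGCTTCTTCACAGCAA-3' (SEQ ID No. 27). The 491 bp target fragment was amplified by PCR using I-5 high-fidelity enzyme mix (MACLAB), and the PCR product was sequenced directly or after ligation into the pGEM-T vector. The gene editing results are shown in Figure 1; the editing efficiency of the pUbi:rBE5 system at the Pi-d2 target site reached 26.9%.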
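Example 2
By routine DNA cloning procedures, the maize constitutive promoter Ubi-p (SEQ ID No. 17), SEQ ID No. 28 and the Nos terminator (SEQ ID No. 20) were cloned, in order from 5' to 3', into the pUbi-ccdB vector; the resulting vector was named pUbi:rBE3. pENTR4:gPi-d2 was linearized by digestion with AatII, and the targeting sequence transcription cassette was then transferred into pUbi:rBE3 by the Gateway reaction, giving the final vector pUbi:rBE3-gPi-d2, which was used for studies in transgenic rice plants. Its genomic target site sequence on Pi-d2 is the same as in Example 1, namely 5'-TTATTATTGTCATTATGCTC-3' (SEQ ID No. 24). The nucleotide sequence shown in SEQ ID No. 28 encodes the amino acid sequence shown in SEQ ID No. 29.
All other procedures were the same as in Example 1.
No Pi-d2-edited plants were obtained by screening the population of transgenic plants.
In the present application, transgenic rice containing the rBE3 (pUbi:rBE3-gPi-d2) and rBE5 (pUbi:rBE5-gPi-d2) systems was obtained by transforming rice callus. Sequencing of the target site showed that, for the GC target site, the rBE5 system successfully carried out site-directed single-base substitution at the Pi-d2 locus in transgenic rice plants. From 26 independent transgenic rice lines, 7 target mutants were obtained by screening, corresponding to a gene editing efficiency of 26.9%, whereas the editing efficiency of the previously reported single-base substitution vector system rBE3 at this site was zero. In summary, the method of site-directed base mutation at GC and AC target sites using the new vector set constructed here is of significant value for obtaining rice gain-of-function mutants of specified genes by gene editing.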
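The efficiency reported above follows directly from the line counts. A minimal Python sketch of the arithmetic, with a hypothetical function name, is:

```python
def editing_efficiency(edited_lines, total_lines):
    """Return the percentage of independent transgenic lines that carry
    the intended edit."""
    if total_lines <= 0:
        raise ValueError("total_lines must be positive")
    return 100.0 * edited_lines / total_lines


# Counts stated in the text: 7 target mutants among 26 independent lines.
rbe5_efficiency = round(editing_efficiency(7, 26), 1)
```

With the counts from the text, `rbe5_efficiency` evaluates to 26.9, matching the reported figure.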
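LHA1760261 Nucleotide and amino acid sequence listing
<110> Institute of Plant Protection, Chinese Academy of Agricultural Sciences
<120> Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method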
<130> LHA1760261
<160> 29
<170> PatentIn version 3.5
<210> 1
<211> 1572
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 1
<400> 1
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDRPKKKRKVGG;
<210> 2
<211> 1558
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 2
<400> 2
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTSGSETPGTSESATPESDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDRPKKKRKVGG;
<210> 3
<211> 1676
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 3
<400> 3
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTSGSETPGTSESATPESDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSPKKKRKV;
<210> 4
<211> 1572
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 4
<400> 4
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFVSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDRPKKKRKVGG;
<210> 5
<211> 1588
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 5
<400> 5
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTSGSETPGTSESATPESDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFVSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDRPKKKRKVGG;
<210> 6
<211> 1676
<212> PRT
<213> Artificial Sequence
<223> SEQ ID No. 6
<400> 6
MDSLLMNRREFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFISWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHGRTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTSGSETPGTSESATPESDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDANAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFVSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSPKKKRKV;
<210> 7
<211> 4719
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 7
<400> 7
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCGCGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGAT
CACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGAAAGATTGGGATCCGAAAAAGTACGGAGGATTCGACTCCCCCACAGTTG
CGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACCGCCCCAAAAAGAAGAGGAAAGTTGGCGGGTGA;
<210> 8
<211> 4767
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 8
<400> 8
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTAGCGGAAGCGAGACGCCAGGGACTTCTGAATCGGCCACCCCCGAGAGCGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCG
CGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGATCACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGA
AAGATTGGGATCCGAAAAAGTACGGAGGATTCGACTCCCCCACAGTTGCGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACCGCCCCAAAAAGAAGAGGAAAGTTGGCGGGTGA;
<210> 9
<211> 5031
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 9
<400> 9
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTAGCGGAAGCGAGACGCCAGGGACTTCTGAATCGGCCACCCCCGAGAGCGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCG
CGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGATCACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGA
AAGATTGGGATCCGAAAAAGTACGGAGGATTCGACTCCCCCACAGTTGCGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACTCCGGCGGAAGTACAAACCTTTCAGACATTATAGAAAAGGAAACCGGCAAGCAACTCGTCATCCAGGAATCCATACTTATGCTCCCTGAAGAGGTGGAAGAAGTGATCGGTAATAAACCAGAGAGCGACATACTTGTCCACACCGCTTATGACGAAAGTACAGACGAAAACGTCATGCTTCTGACGAGTGATGCCCCCGAATACAAACCTTGGGCGCTCGTCATCCAGGATTCCAATGGGGAGAATAAAATAAAGATGCTCTCTGGAGGCAGCCCAAAGAAGAAGAGAAAGGTCTGA;
<210> 10
<211> 4719
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 10
<400> 10
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCGCGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGAT
CACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGAAAGATTGGGATCCGAAAAAGTACGGAGGATTCGtCTCCCCCACAGTTG
CGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACCGCCCCAAAAAGAAGAGGAAAGTTGGCGGGTGA;
<210> 11
<211> 4767
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 11
<400> 11
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTAGCGGAAGCGAGACGCCAGGGACTTCTGAATCGGCCACCCCCGAGAGCGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCG
CGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGATCACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGA
AAGATTGGGATCCGAAAAAGTACGGAGGATTCGtCTCCCCCACAGTTGCGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACCGCCCCAAAAAGAAGAGGAAAGTTGGCGGGTGA;
<210> 12
<211> 5031
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 12
<400> 12
ATGGATAGCCTTCTCATGAACAGAAGAGAGTTTCTCTATCAGTTTAAAAATGTTCGGTGGGCGAAGGGGAGGAGAGAGACATATCTCTGCTATGTTGTTAAGCGGAGAGATTCTGCGACCTCATTCTCACTCGATTTTGGTTATTTGAGGAACAAGAATGGATGTCATGTCGAATTGTTGTTTCTCCGGTATATTTCCGACTGGGATTTGGACCCAGGGCGGTGTTACCGGGTCACATGGTTTATTTCCTGGAGTCCATGTTACGACTGTGCGCGCCATGTCGCCGACTTCCTCAGGGGTAATCCTAACTTGTCCTTGCGGATTTTTACAGCCAGACTCTATTTCTGTGAGGATCGGAAGGCGGAACCCGAGGGGCTGAGAAGACTGCACCGCGCTGGCGTCCAAATCGCCATCATGACTTTTAAGGATTATTTCTACTGTTGGAACACGTTCGTCGAGAACCACGGTCGGACCTTCAAAGCCTGGGAAGGGCTGCATGAAAATTCCGTGAGGTTGTCCCGGCAACTCCGCAGAATACTCCTGCCCCTTTATGAGGTCGACGATCTCAGAGACGCCTTTAGAACTAGCGGAAGCGAGACGCCAGGGACTTCTGAATCGGCCACCCCCGAGAGCGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAACGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTTGAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCG
CGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGATCACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGCCACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGA
AAGATTGGGATCCGAAAAAGTACGGAGGATTCGtCTCCCCCACAGTTGCGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACTCCGGCGGAAGTACAAACCTTTCAGACATTATAGAAAAGGAAACCGGCAAGCAACTCGTCATCCAGGAATCCATACTTATGCTCCCTGAAGAGGTGGAAGAAGTGATCGGTAATAAACCAGAGAGCGACATACTTGTCCACACCGCTTATGACGAAAGTACAGACGAAAACGTCATGCTTCTGACGAGTGATGCCCCCGAATACAAACCTTGGGCGCTCGTCATCCAGGATTCCAATGGGGAGAATAAAATAAAGATGCTCTCTGGAGGCAGCCCAAAGAAGAAGAGAAAGGTCTGA;
<210> 13
<211> 76
<212> DNA
<213> Streptococcus pyogenes
<223> SEQ ID No. 13
<400> 13
GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGC;
<210> 14
<211> 20
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 14
<400> 14
AGAGACCAAAGGAGGTCTCA;
<210> 15
<211> 37
<212> DNA
<213> Artificial Sequence
<223> SEQ ID No. 15
<400> 15
GGCTAGGATCCATCGCAGTCAGCGATGAGTACAGCAA;
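Each entry declares its length in the <211> field, so the short entries above can be checked mechanically. The following is a minimal sketch, assuming the trailing semicolon is a record terminator and not part of the sequence. The `ENTRIES` table and the helper name `check_lengths` are illustrative and are not part of the listing.

```python
# Sketch: compare the declared <211> length with the actual sequence length
# for SEQ ID No. 13-15. The trailing ";" is assumed to be a terminator.
ENTRIES = {
    13: (76, "GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGC;"),
    14: (20, "AGAGACCAAAGGAGGTCTCA;"),
    15: (37, "GGCTAGGATCCATCGCAGTCAGCGATGAGTACAGCAA;"),
}


def check_lengths(entries):
    """Return {seq_id: True/False} for whether <211> matches the sequence."""
    results = {}
    for seq_id, (declared, raw) in entries.items():
        sequence = raw.rstrip(";").replace(" ", "")
        results[seq_id] = len(sequence) == declared
    return results


if __name__ == "__main__":
    for seq_id, ok in check_lengths(ENTRIES).items():
        print(f"SEQ ID No. {seq_id}: {'length matches' if ok else 'length MISMATCH'}")
```

The same check could be applied to the longer entries once their wrapped sequence lines are joined.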
<210> 16
<211> 20
<212> DNA
<213> Rice (Oryza sativa L.)
<223> Target nucleotide sequence for Pi-d2
<400> 16
GAGCATAATGACAATAATAA;
<210> 17
<211> 1765
<212> DNA
<213> Maize (Zea mays L.)
<223> SEQ ID No. 17
<400> 17
GCAGCGTGACCCGGTCGTGCCCCTCTCTAGAGATAATGAGCATTGCATGTCTAAGTTATAAAAAATTACCACATATTTTTTTTGTCACACTTGTTTGAAGTGCAGTTTATCTATCTTTATACATATATTTAAACTTTACTCTACGAATAATATAATCTATAGTACTACAATAATATCAGTGTTTTAGAGAATCATATAAATGAACAGTTAGACATGGTCTAAAGGACAATTGAGTATTTTGACAACAGGACTCTACAGTTTTATCTTTTTAGTGTGCATGTGTTCTCCTTTTTTTTTGCAAATAGCTTCACCTATATAATACTTCATCCATTTTATTAGTACATCCATTTAGGGTTTAGGGTTAATGGTTTTTATAGACTAATTTTTTTAGTACATCTATTTTATTCTATTTTAGCCTCTAAATTAAGAAAACTAAAACTCTATTTTAGTTTTTTTATTTAATAATTTAGATATAAAATAGAATAAAATAAAGTGACTAAAAATTAAACAAATACCCTTTAAGAAATTAAAAAAACTAAGGAAACATTTTTCTTGTTTCGAGTAGATAATGCCAGCCTGTTAAACGCCGTCGACGAGTCTAACGGACACCAACCAGCGAACCAGCAGCGTCGCGTCGGGCCAAGCGAAGCAGACGGCACGGCATCTCTGTCGCTGCCTCTGGACCCCTCTCGAGAGTTCCGCTCCACCGTTGGACTTGCTCCGCTGTCGGCATCCAGAAATTGCGTGGCGGAGCGGCAGACGTGAGCCGGCACGGCAGGCGGCCTCCTCCTCCTCTCACGGCACGGCAGCTACGGGGGATTCCTTTCCCACCGCTCCTTCGCTTTCCCTTCCTCGCCCGCCGTAATAAATAGACACCCCCTCCACACCCTCTTTCCCCAACCTCGTGTTGTTCGGAGCGCACACACACACAACCAGATCTCCCCCAAATCCACCCGTCGGCACCTCCGCTTCAAGGTACGCCGCTCGTCCTCCCCCCCCCCCCCTCTCTACCTTCTCTAGATCGGCGTTCCGGTCCATGGTTAGGGCCCGGTAGTTCTACTTCTGTTCATGTTTGTGTTAGATCCGTGTTTGTGTTAGATCCGTGCTGCTAGCGTTCGTACACGGATGCGACCTGTACGTCAGACACGTTCTGATTGCTAACTTGCCAGTGTTTCTCTTTGGGGAATCCTGGGATGGCTCTAGCCGTTCCGCAGACGGGATCGATTTCATGATTTTTTTTGTTTCGTTGCATAGGGTTTGGTTTGCCCTTTTCCTTTATTTCAATATATGCCGTGCACTTGTTTGTCGGGTCATCTTTTCATGCTTTTTTTTTGTCTTGGTTGTGATGATGTGGTGTGGTTGGGCGGTCGTTCATTCGTTCTAGATCGGAGTAGAATACTGTTTCAAACTACCTGGTGTATTTATTAATTTTGGAACTGTATGTGTGTGTCATACATCTTCATAGTTACGAGTTTAAGATGGATGGAAATATCGATCTAGGATAGGTATACATGTTGATGTGGGTTTTACTGATGCATATACATGATGGCATATGCAGCATCTATTCATATGCTCTAACCTTGAGTACCTATCTATTATAATAAACAAGTATGTTTTATAATTATTTTGATCTTGATATACTTGGATGATGGCATATGCAGCAGCTATATGTGGATTTTTTTAGCCCTGCCTTCATACGCTATTTATTTGCTTGGTACTGTTTCTTTTGTCGATGCTCACCCTGTTGTTTGGTGTTACTTCTGCA;
<210> 18
<211> 326
<212> DNA
<213> Rice (Oryza sativa L.)
<223> SEQ ID No. 18
<400> 18
AAGAACGAACTAAGCCGGACAAAAAAAGGAGCACATATACAAACCGGTTTTATTCATGAATGGTCACGATGGATGATGGGGCTCAGACTTGAGCTACGAGGCCGCAGGCGAGAGAAGCCTAGTGTGCTCTCTGCTTGTTTGGGCCGTAACGGAGGATACGGCCGACGAGCGTGTACTACCGCGCGGGATGCCGCTGGGCGCTGCGGGGGCCGTTGGATGGGGATCGGTGGGTCGCGGGAGCGTTGAGGGGAGACAGGTTTAGTACCACCTCGCCTACCGAACAATGAAGAACCCACCTTATAACCCCGCGCGCTGCCGCTTGTGTT;
<210> 19
<211> 245
<212> DNA
<213> Rice (Oryza sativa L.)
<223> SEQ ID No. 19
<400> 19
GGATCATGAACCAACGGCCTGGCTGTATTTGGTGGTTGTGTAGGGAGATGGGGAGAAGAAAAGCCCGATTCTCTTCGCTGTGATGGGCTGGATGCATGCGGGGGAGCGGGAGGCCCAAGTACGTGCACGGTGAGCGGCCCACAGGGCGAGTGTGAGCGCGAGAGGCGGGAGGAACAGTTTAGTACCACATTGCCCAGCTAACTCGAACGCGACCAACTTATAAACCCGCGCGCTGTCGCTTGTGT;
<210> 20
<211> 253
<212> DNA
<213> CaMV (Cauliflower mosaic virus)
<223> SEQ ID No. 20
<400> 20
GATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAATACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC;
<210> 21
<211> 8
<212> DNA
<213> Artificial sequence
<223> SEQ ID No. 21
<400> 21
TTTTTTTT;
<210> 22
<211> 24
<212> DNA
<213> Artificial sequence
<223> gPi-d2-F1
<400> 22
GTGTGAGCATAATGACAATAATAA;
<210> 23
<211> 24
<212> DNA
<213> Artificial sequence
<223> gPi-d2-R1
<400> 23
AAACTTATTATTGTCATTATGCTC;
<210> 24
<211> 20
<212> DNA
<213> Artificial sequence
<223> Editing-site sequence in the Pi-d2 gene
<400> 24
TTATTATTGTCATTATGCTC;
<210> 25
<211> 20
<212> DNA
<213> Artificial sequence
<223> Partial nucleotide sequence of the Pi-d2 gene after editing
<400> 25
TTATTATTGTCATTATACTC;
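The relationship between SEQ ID No. 24 (editing site), SEQ ID No. 25 (edited sequence) and SEQ ID No. 16 (target sequence) can be checked with a short script. This is only an illustrative sketch: the helper names `revcomp` and `differences` are hypothetical and not part of the patent. It compares the two 20-nt records position by position, then repeats the comparison on their reverse complements, which is the strand on which SEQ ID No. 16 is written.

```python
def revcomp(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def differences(before: str, after: str):
    """List (1-based position, old base, new base) for every mismatch."""
    return [(i + 1, a, b)
            for i, (a, b) in enumerate(zip(before, after)) if a != b]


site = "TTATTATTGTCATTATGCTC"    # SEQ ID No. 24
edited = "TTATTATTGTCATTATACTC"  # SEQ ID No. 25
target = "GAGCATAATGACAATAATAA"  # SEQ ID No. 16

# The editing site is the reverse complement of the target sequence.
print(revcomp(site) == target)                       # True
# As listed, the two records differ by a single G-to-A change.
print(differences(site, edited))                     # [(17, 'G', 'A')]
# On the target strand the same change reads as C-to-T.
print(differences(revcomp(site), revcomp(edited)))   # [(4, 'C', 'T')]
```

Read this way, the single G-to-A difference between the two listed records corresponds to a C-to-T substitution at the fourth base of the 20-nt target sequence.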
<210> 26
<211> 20
<212> DNA
<213> Artificial sequence
<223> Pi-d2-F1
<400> 26
CGGGTTGTAAGAGTGCCTGT;
<210> 27
<211> 20
<212> DNA
<213> Artificial sequence
<223> Pi-d2-R1
<400> 27
CTCCAGCTTCTTCACAGCAA;
<210> 28
<211> 5133
<212> DNA
<213> Artificial sequence
<223> SEQ ID No. 28
<400> 28
ATGAGTAGCGAGACAGGTCCTGTTGCAGTTGACCCGACCCTTCGGAGAAGGATAGAGCCACACGAATTTGAAGTGTTTTTCGACCCTAGAGAACTGAGGAAGGAGACGTGCCTTCTGTACGAGATAAACTGGGGTGGTCGCCACTCTATTTGGAGGCACACTTCGCAAAACACGAACAAGCATGTGGAGGTGAACTTTATAGAAAAATTTACGACTGAGAGATACTTCTGCCCTAATACCCGGTGCTCCATCACCTGGTTCCTTAGCTGGAGCCCTTGTGGCGAATGCTCGAGGGCAATCACCGAGTTTCTGTCCAGATACCCACATGTGACGCTTTTTATATATATTGCCCGCTTGTATCACCACGCTGACCCTAGAAACCGCCAGGGTCTTCGCGATCTGATATCTTCAGGAGTTACCATCCAAATAATGACGGAACAAGAATCCGGTTACTGTTGGCGCAATTTCGTCAACTATAGCCCTTCCAATGAAGCTCATTGGCCTAGATATCCGCACCTCTGGGTCCGGCTGTATGTTCTCGAGCTTTACTGCATTATACTTGGACTTCCCCCCTGCTTGAATATTCTCCGCAGAAAGCAGCCTCAGCTTACGTTTTTTACGATTGCACTCCAAAGTTGTCATTATCAGAGACTGCCACCCCATATCTTGTGGGCTACGGGACTGAAGAGCGGAAGCGAGACGCCAGGGACTTCTGAATCGGCCACCCCCGAGAGCGATAAAAAGTATTCAATCGGACTTGCTATTGGGACAAACTCTGTGGGCTGGGCGGTAATTACCGACGAGTACAAGGTGCCTAGTAAGAAATTTAAAGTGCTCGGAAACACTGACAGGCACTCTATAAAGAAGAACCTGATCGGGGCACTGCTTTTCGACTCCGGAGAGACGGCGGAGGCGACGCGTCTCAAGCGTACCGCGCGCCGCAGGTACACAAGAAGGAAGAATAGGATCTGCTACTTGCAGGAAATCTTCAGTAACGAGATGGCGAAGGTCGACGATAGTTTCTTTCATCGGTTGGAAGAATCGTTCCTCGTAGAGGAGGACAAAAAGCACGAGCGTCACCCAATATTCGGGAATATTGTTGACGAGGTTGCCTACCATGAGAAATATCCTACAATATATCACCTCCGTAAGAAGCTTGTCGATTCAACTGATAAGGCTGATCTCAGACTCATCTATCTTGCCCTCGCACATATGATTAAGTTTCGTGGCCACTTCTTGATTGAAGGCGACCTCAACCCGGACAACTCAGATGTTGACAAGCTTTTTATACAGCTCGTCCAGACATATAACCAGCTGTTTGAAGAGAATCCCATCAATGCGAGTGGGGTTGATGCTAAGGCCATTTTGTCCGCCAGGTTGTCCAAATCTCGCAGACTGGAAAACCTGATCGCACAGCTTCCCGGTGAAAAGAAAAACGGGCTCTTCGGCAATCTCATCGCACTGTCCCTCGGCCTCACCCCAAACTTCAAGTCTAACTTCGACCTGGCCGAGGATGCGAAGCTCCAGCTGTCAAAAGATACATACGACGACGATTTGGACAATCTGCTTGCGCAAATAGGCGACCAGTATGCGGACCTGTTCCTGGCTGCCAAAAATCTGTCAGATGCAATCCTCCTGTCCGATATATTGCGTGTGAACACCGAAATCACGAAGGCACCGCTTAGCGCATCCATGATCAAGAGATACGACGAGCACCATCAGGACCTCACACTCCTCAAGGCGCTTGTTCGTCAGCAGCTTCCCGAGAAATATAAGGAAATTTTTTTCGATCAAAGCAAGAATGGATATGCTGGCTATATTGACGGTGGCGCTTCGCAGGAGGAGTTCTATAAATTCATTAAGCCGATTCTGGAGAAGATGGACGGAACGGAGGAGCTCCTCGTCAAGCTTAACCGGGAAGACCTGTTGCGGAAGCAGAGGACTTTTGATAACGGCTCTATTCCGCACCAAATCCATCTGGGTGAGTTGCACGCAATCTT
GAGAAGACAAGAGGATTTCTACCCGTTCCTTAAGGATAACAGAGAGAAGATAGAAAAAATACTGACCTTCAGGATACCATACTATGTGGGCCCACTGGCGCGCGGAAATAGTCGTTTCGCATGGATGACTAGAAAGTCCGAAGAAACGATCACGCCATGGAATTTTGAGGAAGTGGTCGACAAGGGCGCCTCTGCCCAGAGCTTCATCGAAAGGATGACCAATTTTGACAAAAATCTGCCTAACGAAAAGGTGCTTCCGAAGCACAGCCTGTTGTATGAATACTTCACAGTTTATAACGAGCTCACTAAGGTCAAGTACGTCACGGAGGGCATGCGTAAGCCTGCTTTCCTGTCTGGTGAACAAAAAAAGGCGATTGTGGACCTCCTTTTCAAGACGAACCGTAAAGTTACTGTGAAGCAACTGAAAGAGGATTACTTTAAGAAAATTGAGTGCTTCGACAGTGTGGAGATTTCCGGTGTCGAGGACCGGTTTAACGCCAGCCTGGGTACGTATCATGACCTGCTTAAAATTATCAAGGATAAAGATTTCCTGGATAATGAAGAGAACGAAGATATACTGGAGGACATTGTGTTGACTTTGACCCTCTTCGAGGACAGAGAGATGATTGAGGAAAGACTGAAGACCTACGCACACCTTTTTGATGACAAGGTCATGAAACAACTCAAGCGCCGGCGCTATACTGGCTGGGGCCGGCTTTCTCGCAAGCTCATCAATGGGATTCGGGATAAGCAATCAGGCAAGACAATTTTGGACTTCCTCAAATCCGACGGATTCGCAAATAGGAATTTTATGCAGCTGATACATGACGACTCTTTGACATTCAAAGAAGACATACAGAAGGCTCAGGTCTCCGGCCAAGGAGATTCTTTGCACGAGCATATCGCTAACTTGGCAGGTAGCCCCGCCATAAAAAAGGGCATTCTTCAAACGGTAAAAGTTGTTGACGAACTCGTGAAGGTTATGGGCCGTCATAAGCCGGAAAACATTGTTATTGAAATGGCTAGGGAAAATCAGACGACCCAGAAGGGACAGAAAAATAGCAGGGAGCGGATGAAGAGAATTGAAGAGGGAATTAAGGAGCTTGGATCTCAGATTCTTAAGGAGCACCCTGTGGAGAACACCCAACTTCAGAATGAAAAGCTCTACCTTTACTACCTTCAAAACGGCCGGGATATGTACGTCGATCAGGAACTTGACATTAACCGGTTGAGCGATTATGACGTTGACCATATTGTGCCCCAATCTTTCCTTAAAGACGACTCTATCGACAATAAAGTGCTGACGCGCAGCGATAAAAATCGCGGTAAGTCGGATAATGTCCCGTCGGAAGAGGTGGTTAAAAAAATGAAGAACTATTGGAGGCAACTCCTGAATGCCAAGCTGATCACTCAGAGGAAATTCGACAATCTCACCAAGGCAGAAAGGGGTGGACTTAGCGAGCTCGACAAGGCCGGTTTTATCAAAAGACAGCTGGTGGAGACACGCCAAATCACCAAACACGTTGCCCAGATCCTGGATTCGAGGATGAACACGAAGTATGACGAGAACGACAAGTTGATTAGGGAAGTCAAGGTCATCACTTTGAAGTCCAAGCTGGTGAGCGACTTTCGCAAAGACTTCCAGTTTTACAAAGTCAGGGAAATTAATAACTACCACCACGCCCACGACGCCTACCTTAACGCCGTGGTTGGCACAGCACTCATCAAGAAATACCCTAAGCTCGAATCTGAGTTCGTCTATGGCGACTATAAGGTCTACGACGTTAGAAAAATGATCGCGAAATCTGAGCAGGAAATAGGCAAGGCAACTGCCAAGTACTTCTTCTATTCCAATATCATGAACTTTTTTAAGACGGAGATTACCCTGGCGAATGGTGAGATCCGCAAGCGCCCTTTGATTGAGACAAACGGAGAAACAGGAGAGATCGTATGGGACAAAGGGCGGGACTTTGCTACTGTTAGGAAGGTGCTCTCTATGC
CACAAGTTAACATTGTCAAAAAAACTGAAGTGCAGACAGGTGGGTTTAGCAAGGAATCTATCCTGCCGAAGAGGAACTCTGACAAGCTGATCGCCCGCAAGAAAGATTGGGATCCGAAAAAGTACGGAGGATTCGACTCCCCCACAGTTGCGTACTCCGTGCTTGTCGTGGCCAAAGTGGAGAAGGGCAAGTCTAAGAAGCTCAAGAGCGTCAAAGAGTTGTTGGGGATCACGATTATGGAGCGGTCGTCTTTCGAAAAGAATCCGATAGATTTTCTCGAGGCCAAGGGTTATAAAGAAGTCAAGAAGGATCTTATCATCAAGCTCCCTAAGTACTCCCTCTTTGAGCTTGAAAACGGACGGAAAAGAATGCTGGCTTCAGCGGGTGAACTTCAGAAGGGTAATGAACTCGCTCTGCCCTCAAAATATGTGAATTTCCTTTACCTGGCATCACACTATGAGAAGCTTAAGGGGTCTCCAGAGGACAACGAGCAGAAGCAACTGTTCGTTGAACAACACAAGCACTACCTTGACGAGATTATCGAGCAAATCAGCGAGTTTAGCAAGCGCGTTATACTGGCAGACGCAAATCTTGATAAGGTCCTTAGCGCCTACAACAAGCATAGAGACAAACCCATCCGGGAGCAGGCCGAGAACATTATTCATCTCTTCACCTTGACGAATCTTGGGGCCCCGGCCGCGTTCAAGTACTTCGATACTACCATAGACAGAAAGCGCTATACATCGACAAAGGAAGTTCTTGACGCCACGCTGATCCACCAAAGTATAACAGGCCTCTATGAGACACGCATCGACCTTTCGCAGTTGGGCGGTGACTCCGGCGGAAGTACAAACCTTTCAGACATTATAGAAAAGGAAACCGGCAAGCAACTCGTCATCCAGGAATCCATACTTATGCTCCCTGAAGAGGTGGAAGAAGTGATCGGTAATAAACCAGAGAGCGACATACTTGTCCACACCGCTTATGACGAAAGTACAGACGAAAACGTCATGCTTCTGACGAGTGATGCCCCCGAATACAAACCTTGGGCGCTCGTCATCCAGGATTCCAATGGGGAGAATAAAATAAAGATGCTCTCTGGAGGCAGCCCAAAGAAGAAGAGAAAGGTCTGA;
<210> 29
<211> 1710
<212> PRT
<213> Artificial sequence
<223> SEQ ID No. 29
<400> 29
MSSETGPVAVDPTLRRRIEPHEFEVFFDPRELRKETCLLYEINWGGRHSIWRHTSQNTNKHVEVNFIEKFTTERYFCPNTRCSITWFLSWSPCGECSRAITEFLSRYPHVTLFIYIARLYHHADPRNRQGLRDLISSGVTIQIMTEQESGYCWRNFVNYSPSNEAHWPRYPHLWVRLYVLELYCIILGLPPCLNILRRKQPQLTFFTIALQSCHYQRLPPHILWATGLKSGSETPGTSESATPESDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSPKKKRKV。
Claims (14)
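1. An artificial gene-editing system, the artificial system comprising:
a regulatory element I, which comprises a nucleotide sequence capable of encoding amino acid sequence I, wherein amino acid sequence I is as shown in SEQ ID No. 2;
a regulatory element II, which comprises, in order from the 5ʹ end to the 3ʹ end, a nucleotide sequence II-1 and a nucleotide sequence II-2; nucleotide sequence II-1 comprises a target nucleotide sequence; nucleotide sequence II-2 comprises an sgRNA nucleic acid sequence derived from Streptococcus pyogenes; nucleotide sequence II-1 and nucleotide sequence II-2 are transcriptionally fused, and the resulting product can guide the protein encoded by regulatory element I to the target site to be mutated in the genome of the target organism and mutate the C at the target site to one of T, A and G, or mutate the G at the target site to one of A, T and C;
when there are multiple regulatory elements II, the nucleotide sequences II-1 they contain are pairwise different;
the nucleotide sequence of regulatory element I is a nucleotide sequence suitable for expression in rice, and the nucleotide sequence of regulatory element II is a nucleotide sequence suitable for transcription in rice;
the nucleotide coding sequence capable of encoding the protein shown in SEQ ID No. 2 is as shown in SEQ ID No. 8;
nucleotide sequence II-2 is as shown in SEQ ID No. 13;
the target nucleotide sequence is determined as follows:
1) determining the nucleotide sequence in the rice genome that needs to be modified;
2) judging whether the nucleotide sequence determined in step 1), or its reverse complement, carries the nucleotide C to be mutated, and judging whether the change caused by mutating that C to one of T, A and G, or by mutating the nucleotide G to be mutated to one of A, T and C, meets expectations;
3) screening for target sequences in the nucleotide sequence to be modified or in its reverse complement: searching in the 3ʹ direction from the nucleotide C to be mutated to confirm the presence of a recognition motif that can be recognized by amino acid sequence I, with the nucleotide C to be mutated located at positions -19 to -13 upstream of the 5ʹ end of the recognition motif; the 17 to 21 nucleotides upstream of the 5ʹ end of the recognition motif so identified are the target nucleotide sequence;
the recognition motif is one of 5ʹ-NGG-3ʹ, 5ʹ-NGA-3ʹ, 5ʹ-GAGN-3ʹ and 5ʹ-AAGN-3ʹ; the target nucleotide sequence is the 17 to 21 nucleotides upstream of the 5ʹ end of the recognition motif, and nucleotide sequences containing five consecutive Ts are discarded;
wherein N is one of A, G, C and T.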
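The target-selection steps in claim 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patent's own procedure: the function name `find_targets`, the fixed default target length of 20 nt (the claim allows 17 to 21), and the numbering convention (the base immediately 5ʹ of the recognition motif is position -1) are all assumptions made for the example. The recognition motifs are the four listed in the claim.

```python
import re

# Recognition motifs listed in claim 1, written as regular expressions.
PAM_PATTERNS = {
    "NGG": "[ACGT]GG",
    "NGA": "[ACGT]GA",
    "GAGN": "GAG[ACGT]",
    "AAGN": "AAG[ACGT]",
}


def find_targets(seq, pam="NGG", length=20, window=(-19, -13)):
    """Return (target, motif, C positions) for each usable motif in seq.

    Positions are counted upstream of the motif, assuming the base
    directly 5' of the motif is -1 (an assumption of this sketch).
    """
    seq = seq.upper()
    # Lookahead so that overlapping motif occurrences are all found.
    motif_re = re.compile("(?=(" + PAM_PATTERNS[pam] + "))")
    hits = []
    for m in motif_re.finditer(seq):
        start = m.start()
        if start < length:
            continue  # not enough sequence upstream of the motif
        target = seq[start - length:start]
        if "TTTTT" in target:
            continue  # discard targets with five consecutive Ts
        c_positions = [i - length for i, base in enumerate(target)
                       if base == "C" and window[0] <= i - length <= window[1]]
        if c_positions:
            hits.append((target, m.group(1), c_positions))
    return hits


# SEQ ID No. 16 followed by a hypothetical TGG motif (the motif used
# with this target is not given in this section).
print(find_targets("GAGCATAATGACAATAATAA" + "TGG"))
# [('GAGCATAATGACAATAATAA', 'TGG', [-17])]
```

To cover the "or its reverse complement" wording of steps 2) and 3), the same function would be run a second time on the reverse complement of the input sequence.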
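2. The artificial system according to claim 1, characterized in that nucleotide sequence II-1 comprises a restriction site for a Type IIS restriction endonuclease, and the target nucleotide sequence is cloned via the Type IIS restriction site so that nucleotide sequence II-1 is transcriptionally fused with nucleotide sequence II-2;
when there are multiple regulatory elements II, the Type IIS restriction sites used for cloning different target nucleotide sequences are pairwise different.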
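As an illustration of how a target sequence is turned into a pair of cloning oligonucleotides, the sketch below rebuilds the two primers given in the sequence listing (gPi-d2-F1, SEQ ID No. 22, and gPi-d2-R1, SEQ ID No. 23) from the target sequence SEQ ID No. 16. The 4-nt prefixes `GTGT` and `AAAC` are read off those two records; treating them as the overhangs matching the Type IIS-digested vector is an assumption of this example, and the function name `guide_oligos` is hypothetical.

```python
def revcomp(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def guide_oligos(target: str, fwd_overhang: str = "GTGT",
                 rev_overhang: str = "AAAC"):
    """Build forward and reverse oligos for a target sequence.

    The forward oligo is the overhang plus the target; the reverse
    oligo is the other overhang plus the target's reverse complement.
    """
    forward = fwd_overhang + target
    reverse = rev_overhang + revcomp(target)
    return forward, reverse


forward, reverse = guide_oligos("GAGCATAATGACAATAATAA")  # SEQ ID No. 16
print(forward)  # GTGTGAGCATAATGACAATAATAA  (same as SEQ ID No. 22)
print(reverse)  # AAACTTATTATTGTCATTATGCTC  (same as SEQ ID No. 23)
```

When several regulatory elements II are used, the claim requires different Type IIS sites for each target, so the two overhang arguments would presumably change per site.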
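3. The artificial system according to claim 1, characterized in that the target nucleotide sequence is as shown in SEQ ID No. 16.
4. The artificial system according to any one of claims 1-3, characterized in that the artificial system further comprises, at the 5ʹ end of regulatory element I, a first promoter that can be used in rice and can initiate transcription of regulatory element I; and/or the artificial system further comprises, at the 5ʹ end of regulatory element II, a second promoter that can be used in rice and can initiate transcription of regulatory element II;
the artificial system further comprises, at the 3ʹ end of regulatory element I, a first terminator capable of terminating transcription of regulatory element I; and/or the artificial system further comprises, at the 3ʹ end of regulatory element II, a second terminator capable of terminating transcription of regulatory element II.
5. The artificial system according to claim 4, characterized in that the first promoter is an RNA polymerase II promoter; and/or the second promoter is an RNA polymerase III promoter.
6. The artificial system according to claim 5, characterized in that the first promoter is SEQ ID No. 17; and/or the second promoter is SEQ ID No. 18 and/or SEQ ID No. 19.
7. The artificial system according to claim 4, characterized in that the first terminator is SEQ ID No. 20; and/or the second terminator is SEQ ID No. 21.
8. The artificial system according to claim 4, characterized in that regulatory element I and regulatory element II can be cloned into at least one vector.
9. The artificial system according to claim 8, characterized in that regulatory element I can be cloned into pUbi-ccdB, and regulatory element II is cloned into the entry vector pENTR4.
10. The artificial system according to claim 9, characterized in that the first promoter, regulatory element I and the first terminator can be cloned into the pUbi-ccdB vector.
11. The artificial system according to claim 9, characterized in that the second promoter, regulatory element II and the second terminator are cloned into the pENTR4 vector.
12. The artificial system according to claim 8, characterized in that regulatory element I and regulatory element II can be integrated into the same vector, or distributed across multiple vectors that are used together.
13. Use of the artificial system according to any one of claims 1-12 for site-directed mutation of a C in the rice genome to one of T, A and G, or of a G in the rice genome to one of A, T and C.
14. A method for site-directed mutation of a C in the rice genome to one of T, A and G, or of a G in the rice genome to one of A, T and C, comprising the following steps:
1) introducing the artificial system according to any one of claims 1-12 into rice callus or rice protoplasts by one of Agrobacterium-mediated transformation, gene-gun bombardment or PEG-mediated transformation, and then culturing to obtain rice plants;
2) screening to obtain rice plants containing the site-directed mutation; further, the rice plants can produce rice seeds containing the site-directed base substitution.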
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710383003.1A CN107177625B (zh) | 2017-05-26 | 2017-05-26 | Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107177625A CN107177625A (zh) | 2017-09-19 |
CN107177625B CN107177625B (zh) | 2021-05-25 |
Family
ID=59835058
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710383003.1A Active CN107177625B (zh) | Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method | 2017-05-26 | 2017-05-26 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107177625B (zh) |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10323236B2 (en) | 2011-07-22 | 2019-06-18 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
US9840699B2 (en) | 2013-12-12 | 2017-12-12 | President And Fellows Of Harvard College | Methods for nucleic acid editing |
SG10202104041PA (en) | 2015-10-23 | 2021-06-29 | Harvard College | Nucleobase editors and uses thereof |
IL264565B2 (en) | 2016-08-03 | 2024-07-01 | Harvard College | Adenosine nuclear base editors and their uses |
CN109804066A (zh) | 2016-08-09 | 2019-05-24 | President And Fellows Of Harvard College | Programmable Cas9-recombinase fusion proteins and uses thereof |
US11542509B2 (en) | 2016-08-24 | 2023-01-03 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
KR102622411B1 (ko) | 2016-10-14 | 2024-01-10 | President And Fellows Of Harvard College | AAV delivery of nucleobase editors |
WO2018119359A1 (en) | 2016-12-23 | 2018-06-28 | President And Fellows Of Harvard College | Editing of ccr5 receptor gene to protect against hiv infection |
US11898179B2 (en) | 2017-03-09 | 2024-02-13 | President And Fellows Of Harvard College | Suppression of pain by gene editing |
JP2020510439A (ja) | 2017-03-10 | 2020-04-09 | President And Fellows Of Harvard College | Cytosine to guanine base editor |
CA3057192A1 (en) | 2017-03-23 | 2018-09-27 | President And Fellows Of Harvard College | Nucleobase editors comprising nucleic acid programmable dna binding proteins |
WO2018209320A1 (en) | 2017-05-12 | 2018-11-15 | President And Fellows Of Harvard College | Aptazyme-embedded guide rnas for use with crispr-cas9 in genome editing and transcriptional activation |
JP2020534795A (ja) | 2017-07-28 | 2020-12-03 | President And Fellows Of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (PACE) |
US11319532B2 (en) | 2017-08-30 | 2022-05-03 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
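KR20200121782A (ko) | 2017-10-16 | 2020-10-26 | The Broad Institute, Inc. | Uses of adenosine base editors |
CN110066824B (zh) * | 2018-01-24 | 2021-06-08 | Institute of Plant Protection, Chinese Academy of Agricultural Sciences | A set of artificial base-editing systems for rice |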
EP3746549A4 (en) * | 2018-02-01 | 2021-10-13 | Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences | IMPROVED GENOME EDITING PROCESS |
EP3797160A1 (en) | 2018-05-23 | 2021-03-31 | The Broad Institute Inc. | Base editors and uses thereof |
WO2020041751A1 (en) * | 2018-08-23 | 2020-02-27 | The Broad Institute, Inc. | Cas9 variants having non-canonical pam specificities and uses thereof |
US12281338B2 (en) | 2018-10-29 | 2025-04-22 | The Broad Institute, Inc. | Nucleobase editors comprising GeoCas9 and uses thereof |
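CN113913454B (zh) * | 2018-11-07 | 2023-07-21 | Institute of Plant Protection, Chinese Academy of Agricultural Sciences | A set of artificial gene-editing systems for rice |
CN109666694B (zh) * | 2018-12-29 | 2022-08-16 | Beijing Academy of Agriculture and Forestry Sciences | Use of SCR7 in editing a recipient genome with a base-editing system |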
US12351837B2 (en) | 2019-01-23 | 2025-07-08 | The Broad Institute, Inc. | Supernegatively charged proteins and uses thereof |
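JP7669281B2 (ja) | 2019-03-19 | 2025-04-28 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
CN111100852B (zh) * | 2019-12-16 | 2021-04-13 | Institute of Plant Protection, Chinese Academy of Agricultural Sciences | Directed mutation of OsALS1 and method for directed evolution of endogenous crop genes |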
AU2021267940A1 (en) | 2020-05-08 | 2022-12-08 | President And Fellows Of Harvard College | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
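CN102482639B (zh) * | 2009-04-03 | 2016-01-06 | Medical Research Council | Activation-induced cytidine deaminase (AID) mutants and methods of use |
CN105934516A (zh) * | 2013-12-12 | 2016-09-07 | President And Fellows Of Harvard College | Cas variants for gene editing |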
Non-Patent Citations (4)
Title |
---|
A CRISPR/Cas9 toolkit for efficient targeted base editing to induce genetic variations in rice; Bin Ren et al.; Sci. China Life Sci.; 2017-03-03; Vol. 60, No. 5; pp. 516–519 * |
Comparison of the Differential Context-dependence of DNA Deamination by APOBEC Enzymes: Correlation with Mutation Spectra in Vivo; Rupert C. L. Beale et al.; J. Mol. Biol.; 2004-12-31; Vol. 337; pp. 585–596 * |
Improved Base Editor for Efficiently Inducing Genetic Variations in Rice with CRISPR/Cas9-Guided Hyperactive hAID Mutant; Bin Ren et al.; Molecular Plant; 2018-01-27; Vol. 11; pp. 623–626 * |
Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage; Alexis C. Komor et al.; Nature; 2016-10-20; Vol. 533, No. 7603; pp. 420–424 * |
Also Published As
Publication number | Publication date |
---|---|
CN107177625A (zh) | 2017-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107177625B (zh) | Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method | |
Cardi et al. | CRISPR/Cas-mediated plant genome editing: outstanding challenges a decade after implementation | |
US11820990B2 (en) | Method for base editing in plants | |
CN105063083B (zh) | Method for creating an engineered rice maintainer line that prevents gene flow, and use thereof | |
US20210403901A1 (en) | Targeted mutagenesis using base editors | |
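CN104846009B (zh) | Method for constructing an engineered rice maintainer line, and use thereof | |
CN108034671B (zh) | Plasmid vector and method for establishing a plant population using it | |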
Pan et al. | CRISPR-Combo–mediated orthogonal genome editing and transcriptional activation for plant breeding | |
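CN104450745A (zh) | Method for obtaining mutants of a specified rice gene, and use thereof | |
CN110066824B (zh) | A set of artificial base-editing systems for rice | |
CN114045303B (zh) | A set of artificial gene-editing systems for rice | |
CN115927381B (zh) | Rapeseed RNA processing factor NCBP gene and use thereof | |
CN113801891A (zh) | Construction method and use of a sugar beet BvCENH3 gene haploid inducer line | |
CN107338265B (zh) | Gene-editing system and method for editing plant genomes using it | |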
Char et al. | CRISPR/Cas9 for mutagenesis in rice | |
Li et al. | Creating large chromosomal deletions in rice using CRISPR/Cas9 | |
CN114854723A (zh) | Rice uracil DNA glycosylase and its use in gene-editing-induced single-base diversity in plants | |
US20220315938A1 (en) | AUGMENTED sgRNAS AND METHODS FOR THEIR USE TO ENHANCE SOMATIC AND GERMLINE PLANT GENOME ENGINEERING | |
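CN111100852B (zh) | Directed mutation of OsALS1 and method for directed evolution of endogenous crop genes | |
CN113265403A (zh) | Soybean Dt1 gene editing site and use thereof | |
CN118127073A (zh) | Use of rice alkylpurine glycosylase and its mutants in plant A-to-K single-base editing | |
CN113774082A (zh) | Method for nucleic acid expression | |
CN107365772B (zh) | Plant pollen-specific promoter PSP1 and use thereof | |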
Jiang et al. | Improving plant C-to-G base editors with a cold-adapted glycosylase and TadA-8e variants | |
CN112226458B (zh) | Method for increasing rice yield using the rice osa-miR5511 gene |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||