[go: up one dir, main page]

CN109957569A - Base editing system and method based on CPF1 protein - Google Patents

Base editing system and method based on CPF1 protein Download PDF

Info

Publication number
CN109957569A
CN109957569A CN201811578853.8A CN201811578853A CN109957569A CN 109957569 A CN109957569 A CN 109957569A CN 201811578853 A CN201811578853 A CN 201811578853A CN 109957569 A CN109957569 A CN 109957569A
Authority
CN
China
Prior art keywords
lys
leu
glu
ile
asn
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811578853.8A
Other languages
Chinese (zh)
Other versions
CN109957569B (en
Inventor
高彩霞
王延鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Qihe Biotechnology Co ltd
Original Assignee
Institute of Genetics and Developmental Biology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Genetics and Developmental Biology of CAS filed Critical Institute of Genetics and Developmental Biology of CAS
Publication of CN109957569A publication Critical patent/CN109957569A/en
Application granted granted Critical
Publication of CN109957569B publication Critical patent/CN109957569B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8213Targeted insertion of genes into the plant genome by homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • C12N15/907Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/78Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y305/00Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
    • C12Y305/04Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
    • C12Y305/04001Cytosine deaminase (3.5.4.1)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Cell Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to genetic engineering fields.Specifically, the present invention relates to a kind of base edit methods based on CPF1 albumen.More specifically, the present invention relates to a kind of methods that Cpf1- deamination enzyme fusion proteins instructed by guide RNA carry out efficient base editor to the target sequence in organism (such as plant) genome, and the genetically modified organism (such as plant) and its offspring generated by the method.

Description

基于CPF1蛋白的碱基编辑系统和方法Base editing system and method based on CPF1 protein

技术领域technical field

本发明涉及基因工程领域。具体而言,本发明涉及一种基于CPF1蛋白的碱基编辑系统和方法。更具体而言,本发明涉及一种通过向导RNA指导的Cpf1-脱氨酶融合蛋白对生物体(例如植物)基因组中的靶序列进行高效碱基编辑的系统和方法,以及通过所述方法产生的经遗传修饰的生物体(例如植物)及其后代。The present invention relates to the field of genetic engineering. Specifically, the present invention relates to a CPF1 protein-based base editing system and method. More specifically, the present invention relates to a system and method for efficient base editing of target sequences in the genome of an organism (eg, a plant) by a guide RNA-guided Cpf1-deaminase fusion protein, and production by the method genetically modified organisms (eg, plants) and their progeny.

背景技术Background technique

高效的作物改良的前提是能够获得新的遗传突变,这些突变可以容易地引入现代栽培种中。遗传研究,尤其是基于全基因组相关的研究表明,单核苷酸的改变是构成作物性状差异的主要原因。单碱基的变异会导致氨基酸替换,从而导致优良等位基因和优异性状的进化。在基因组编辑出现前,定向诱导基因组局部突变(TILLING)可以作为用于产生作物改良中迫切需要的突变的方法。然而,TILLING筛选耗时耗力,并且所鉴别的点突变经常受数目和种类的限制。基因组编辑技术,特别是基于CRISPR/Cas9系统的基因组编辑技术可以通过同源重组(HR)介导的DNA修复途径来实现在基因组位点中引入特定碱基的替换。但目前,该方法的成功使用受到很大限制,主要是由于在植物中HR介导的双链断链修复发生的频率很低。另外,有效的提供足量的DNA修复模板也是目前的一大难关。这些问题使得目前在植物中通过HR的方式高效而简单地实现定点突变成为一大挑战。A prerequisite for efficient crop improvement is the availability of new genetic mutations that can be easily introduced into modern cultivars. Genetic studies, especially those based on genome-wide associations, have shown that single nucleotide changes are the major contributors to differences in crop traits. Single-base variations lead to amino acid substitutions, leading to the evolution of superior alleles and superior traits. Before the advent of genome editing, targeted induction of localized mutagenesis in the genome (TILLING) could be used as a method for generating mutations that are sorely needed in crop improvement. However, TILLING screening is time-consuming and labor-intensive, and the point mutations identified are often limited in number and variety. Genome editing technologies, especially those based on the CRISPR/Cas9 system, can achieve the introduction of specific base substitutions in genomic loci through the homologous recombination (HR)-mediated DNA repair pathway. However, currently, the successful use of this method is greatly limited, mainly due to the low frequency of HR-mediated double-strand scission repair in plants. In addition, effectively providing sufficient DNA repair templates is also a major difficulty at present. These problems make it a challenge to efficiently and simply implement site-directed mutagenesis by HR in plants.

近年,利用Cas9与DNA的结合特性与DNA脱氨酶的特性,将Cas9 与脱氨酶融合,可以实现精准的对靶基因进行单碱基胞嘧啶(C)到胸腺嘧啶(T)的转变以及腺嘌呤(A)到鸟嘌呤(G)的转变。目前,对于C到T 的转变的系统主要包括SpnCas9-BE3、SpnCas9-AID以及Cas9变体的融合物如VQR-BE3、EQR-BE3和VRER-BE3,此外还有SaCas9-BE3以及变体 SaKKH-BE3,这些组合的使胞嘧啶(C)到胸腺嘧啶(T)转变的PAM限制降低以及编辑窗口的范围更为可变。此外,近期哈佛大学David Liu实验室通过人工进化的方式,研发出了能作用于ssDNA的腺嘌呤脱氨酶,与 Cas9融合可以实现对DNA进行A到G的转变的Cas9-ABE系统,使单碱基编辑的作用方位进一步扩大。尽管这些研究使DNA的单碱基编辑有了很大的使用范围,但当前的单碱基编辑技术仍存在许多问题:第一,由于Cas9 及Cas9变体的PAM一般限于富含G/C的区域,单碱编辑系统的PAM类型仍需要拓宽;第二,由于Cas9及编辑本身特异性较差,单碱基编辑系统在特异性方面仍需要提高;第三,由于nCas9-BE3及其变体的BE3,还有 nCas9-ABE通常在靶位点的非靶向链上产生单链的缺刻,很容易在错配修复的过程中在产生单碱基突变的同时也会产生DNA的确实和插入,所以在单碱基的高保真方面也有待提高。因此,本领域仍然迫切需要新的对植物基因组进行碱基编辑的系统和方法。In recent years, using the binding properties of Cas9 to DNA and the properties of DNA deaminase, the fusion of Cas9 and deaminase can achieve precise single-base cytosine (C) to thymine (T) conversion of target genes and Conversion of adenine (A) to guanine (G). Currently, the systems for C-to-T transition mainly include SpnCas9-BE3, SpnCas9-AID and fusions of Cas9 variants such as VQR-BE3, EQR-BE3 and VRER-BE3, in addition to SaCas9-BE3 and the variant SaKKH- BE3, these combinations have a reduced PAM restriction for cytosine (C) to thymine (T) transition and a more variable range of editing windows. In addition, recently, the Harvard University David Liu laboratory has developed adenine deaminase that can act on ssDNA through artificial evolution, and the Cas9-ABE system can realize the conversion of DNA from A to G by fusion with Cas9, making single The role of base editing has been further expanded. Although these studies have brought a wide range of DNA single-base editing, there are still many problems with the current single-base editing technology: First, since the PAM of Cas9 and Cas9 variants is generally limited to G/C-rich Region, the PAM types of single-base editing systems still need to be broadened; second, due to the poor specificity of Cas9 and the editing itself, the specificity of single-base editing systems still needs to be improved; third, due to nCas9-BE3 and its variants The BE3, and nCas9-ABE usually produce single-stranded nicks on the non-targeted strand of the target site, and it is easy to generate single-base mutations in the process of mismatch repair. , so the high fidelity of single base also needs to be improved. Therefore, there is still an urgent need in the art for new systems and methods for base editing plant genomes.

附图描述Description of drawings

图1.CPF1介导的植物基因组切割活性的优化。Figure 1. Optimization of CPF1-mediated plant genome cleavage activity.

图2.CPF1介导的植物基因组C到T的突变。Figure 2. CPF1-mediated mutation of C to T in plant genomes.

图3.CPF1介导的植物基因组A到G的突变。Figure 3. CPF1-mediated mutation of plant genomes A to G.

图4.利用CPF1的RNA切割活性进行多位点的同时碱基编辑。Figure 4. Simultaneous base editing at multiple sites utilizing the RNA cleavage activity of CPF1.

发明内容SUMMARY OF THE INVENTION

一、定义1. Definition

在本发明中,除非另有说明,否则本文中使用的科学和技术名词具有本领域技术人员所通常理解的含义。并且,本文中所用的蛋白质和核酸化学、分子生物学、细胞和组织培养、微生物学、免疫学相关术语和实验室操作步骤均为相应领域内广泛使用的术语和常规步骤。例如,本发明中使用的标准重组DNA和分子克隆技术为本领域技术人员熟知,并且在如下文献中有更全面的描述:Sambrook,J.,Fritsch,E.F.和Maniatis,T.,MolecularCloning:A Laboratory Manual;Cold Spring Harbor Laboratory Press:Cold SpringHarbor,1989(下文称为“Sambrook”)。同时,为了更好地理解本发明,下面提供相关术语的定义和解释。In the present invention, unless otherwise specified, scientific and technical terms used herein have the meanings commonly understood by those skilled in the art. Moreover, the protein and nucleic acid chemistry, molecular biology, cell and tissue culture, microbiology, immunology related terms and laboratory procedures used herein are the terms and routine procedures widely used in the corresponding fields. For example, standard recombinant DNA and molecular cloning techniques used in the present invention are well known to those of skill in the art and are more fully described in Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter "Sambrook"). Meanwhile, for a better understanding of the present invention, definitions and explanations of related terms are provided below.

“Cpf1核酸酶”、“Cpf1蛋白”和“Cpf1”在本文中可互换使用,指的是包括Cpf1蛋白或其片段的RNA指导的核酸酶。Cpf1是CRISPR-Cpf1基因组编辑系统的组分,能在向导RNA(crRNA)的指导下靶向并切割DNA靶序列形成DNA双链断裂(DSB)。Cpf1蛋白上包含一个DNA的切割结构域和一个独立的RNA切割结构域。Cpf1蛋白的RNA切割结构域能够加工 pre-crRNA以形成成熟crRNA。"Cpf1 nuclease," "Cpf1 protein," and "Cpf1" are used interchangeably herein to refer to RNA-directed nucleases that include Cpf1 protein or fragments thereof. Cpf1 is a component of the CRISPR-Cpf1 genome editing system, which can target and cleave DNA target sequences to form DNA double-strand breaks (DSBs) under the guidance of guide RNA (crRNA). The Cpf1 protein contains a DNA cleavage domain and an independent RNA cleavage domain. The RNA cleavage domain of Cpf1 protein can process pre-crRNA to form mature crRNA.

“向导RNA”和“gRNA”在本文中可互换使用。Cpf1介导的基因组编辑系统的向导RNA通常仅由成熟crRNA分子构成,其中crRNA包含的序列与靶序列具有足够相同性以便与靶序列的互补序列杂交并且指导复合物(Cpf1+crRNA)与该靶序列序列特异性结合。"Guide RNA" and "gRNA" are used interchangeably herein. Guide RNAs of Cpf1-mediated genome editing systems typically consist only of mature crRNA molecules that contain sequences that are sufficiently identical to the target sequence to hybridize to the complement of the target sequence and guide the complex (Cpf1+crRNA) to the target Sequence sequence specific binding.

“脱氨酶”是指催化脱氨基反应的酶。在本发明一些实施方式中,所述脱氨酶指的是胞嘧啶脱氨酶,其催化胞苷或脱氧胞苷分别脱氨化为尿嘧啶或脱氧尿嘧啶。在本发明一些实施方式中,所述脱氨酶指的是腺嘌呤脱氨酶,其能够催化腺苷或脱氧腺苷(A)形成肌苷(I)。"Deaminase" refers to an enzyme that catalyzes a deamination reaction. In some embodiments of the present invention, the deaminase refers to cytosine deaminase, which catalyzes the deamination of cytidine or deoxycytidine to uracil or deoxyuracil, respectively. In some embodiments of the present invention, the deaminase refers to adenine deaminase, which can catalyze the formation of inosine (I) from adenosine or deoxyadenosine (A).

“基因组”在用于植物细胞时不仅涵盖存在于细胞核中的染色体DNA,而且还包括存在于细胞的亚细胞组分(如线粒体、质体)中的细胞器DNA。"Genome" when used in plant cells encompasses not only the chromosomal DNA present in the nucleus, but also the organelle DNA present in the subcellular components of the cell (eg mitochondria, plastids).

如本文所用,“生物体”包括适于基因组编辑的任何生物体,优选真核生物。生物体的实例包括但不限于,哺乳动物如人、小鼠、大鼠、猴、犬、猪、羊、牛、猫;家禽如鸡、鸭、鹅;植物包括单子叶植物和双子叶植物,例如水稻、玉米、小麦、高粱、大麦、大豆、花生、拟南芥等。As used herein, "organism" includes any organism suitable for genome editing, preferably eukaryotes. Examples of organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, geese; plants including monocotyledonous and dicotyledonous plants, For example, rice, corn, wheat, sorghum, barley, soybean, peanut, Arabidopsis, etc.

“经遗传修饰的生物体”或“经遗传修饰的细胞”意指在其基因组内包含外源多核苷酸或修饰的基因或表达调控序列的生物体或细胞。例如外源多核苷酸能够稳定地整合进生物体或细胞的基因组中,并遗传连续的世代。外源多核苷酸可单独地或作为重组DNA构建体的部分整合进基因组中。修饰的基因或表达调控序列为在生物体或细胞基因组中所述序列包含单个或多个脱氧核苷酸取代、缺失和添加。例如,通过本发明获得的经遗传修饰的生物体可以相对于野生型(相应的未经所述遗传修饰的生物体)包含一个或多个C至T或A至G的取代。"Genetically modified organism" or "genetically modified cell" means an organism or cell that contains within its genome an exogenous polynucleotide or modified gene or expression control sequences. For example, exogenous polynucleotides can be stably integrated into the genome of an organism or cell and inherited for successive generations. The exogenous polynucleotide can be integrated into the genome alone or as part of a recombinant DNA construct. Modified gene or expression control sequences are those that comprise single or multiple deoxynucleotide substitutions, deletions and additions in the genome of an organism or cell. For example, a genetically modified organism obtained by the present invention may comprise one or more C to T or A to G substitutions relative to the wild type (the corresponding organism not having said genetic modification).

针对序列而言的“外源”意指来自外来物种的序列,或者如果来自相同物种,则指通过蓄意的人为干预而从其天然形式发生了组成和/或基因座的显著改变的序列。"Foreign" with respect to a sequence means a sequence from a foreign species, or, if from the same species, a sequence that has been significantly altered in composition and/or locus from its native form by deliberate human intervention.

“多核苷酸”、“核酸序列”、“核苷酸序列”或“核酸片段”可互换使用并且是单链或双链RNA或DNA聚合物,任选地可含有合成的、非天然的或改变的核苷酸碱基。核苷酸通过如下它们的单个字母名称来指代:“A”为腺苷或脱氧腺苷(分别对应RNA或DNA),“C”表示胞苷或脱氧胞苷,“G”表示鸟苷或脱氧鸟苷,“U”表示尿苷,“T”表示脱氧胸苷,“R”表示嘌呤 (A或G),“Y”表示嘧啶(C或T),“K”表示G或T,“H”表示A或C或 T,“I”表示肌苷,并且“N”表示任何核苷酸。"Polynucleotide", "nucleic acid sequence", "nucleotide sequence" or "nucleic acid fragment" are used interchangeably and are polymers of single- or double-stranded RNA or DNA, optionally containing synthetic, non-natural or altered nucleotide bases. Nucleotides are referred to by their single-letter names as follows: "A" for adenosine or deoxyadenosine (for RNA or DNA, respectively), "C" for cytidine or deoxycytidine, "G" for guanosine or Deoxyguanosine, "U" for uridine, "T" for deoxythymidine, "R" for purine (A or G), "Y" for pyrimidine (C or T), "K" for G or T, " H" means A or C or T, "I" means inosine, and "N" means any nucleotide.

“多肽”、“肽”、和“蛋白质”在本发明中可互换使用,指氨基酸残基的聚合物。该术语适用于其中一个或多个氨基酸残基是相应的天然存在的氨基酸的人工化学类似物的氨基酸聚合物,以及适用于天然存在的氨基酸聚合物。术语“多肽”、“肽”、“氨基酸序列”和“蛋白质”还可包括修饰形式,包括但不限于糖基化、脂质连接、硫酸盐化、谷氨酸残基的γ羧化、羟化和ADP-核糖基化。"Polypeptide," "peptide," and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The term applies to amino acid polymers in which one or more amino acid residues is an artificial chemical analog of the corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The terms "polypeptide", "peptide", "amino acid sequence" and "protein" may also include modified forms including, but not limited to, glycosylation, lipid linkage, sulfation, gamma carboxylation of glutamic acid residues, hydroxyl sylation and ADP-ribosylation.

如本发明所用,“表达构建体”是指适于感兴趣的核苷酸序列在生物体中表达的载体如重组载体。“表达”指功能产物的产生。例如,核苷酸序列的表达可指核苷酸序列的转录(如转录生成mRNA或功能RNA)和/或RNA 翻译成前体或成熟蛋白质。As used in the present invention, "expression construct" refers to a vector such as a recombinant vector suitable for expression of a nucleotide sequence of interest in an organism. "Expression" refers to the production of a functional product. For example, expression of a nucleotide sequence can refer to transcription of the nucleotide sequence (eg, transcription to produce mRNA or functional RNA) and/or translation of RNA into a precursor or mature protein.

本发明的“表达构建体”可以是线性的核酸片段、环状质粒、病毒载体,或者,在一些实施方式中,可以是能够翻译的RNA(如mRNA)。"Expression constructs" of the present invention may be linear nucleic acid fragments, circular plasmids, viral vectors, or, in some embodiments, RNA capable of translation (eg, mRNA).

本发明的“表达构建体”可包含不同来源的调控序列和感兴趣的核苷酸序列,或相同来源但以不同于通常天然存在的方式排列的调控序列和感兴趣的核苷酸序列。An "expression construct" of the present invention may comprise regulatory sequences and nucleotide sequences of interest from different sources, or regulatory sequences and nucleotide sequences of interest from the same source but arranged in a manner different from that normally found in nature.

“调控序列”和“调控元件”可互换使用,指位于编码序列的上游(5′非编码序列)、中间或下游(3′非编码序列),并且影响相关编码序列的转录、 RNA加工或稳定性或者翻译的核苷酸序列。"Regulatory sequence" and "regulatory element" are used interchangeably and refer to a coding sequence upstream (5' non-coding sequence), intermediate or downstream (3' non-coding sequence) and affecting transcription, RNA processing or Stability or translated nucleotide sequence.

调控序列可包括但不限于启动子、翻译前导序列、内含子和多腺苷酸化识别序列。Regulatory sequences can include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.

“启动子”指能够控制另一核酸片段转录的核酸片段。在本发明的一些实施方案中,启动子是能够控制生物体细胞中基因转录的启动子,无论其是否来源于所述生物体。启动子可以是组成型启动子或组织特异性启动子或发育调控启动子或诱导型启动子。"Promoter" refers to a nucleic acid segment capable of controlling the transcription of another nucleic acid segment. In some embodiments of the invention, a promoter is a promoter capable of controlling the transcription of a gene in cells of an organism, whether or not it is derived from the organism. The promoter may be a constitutive promoter or a tissue-specific promoter or a developmentally regulated promoter or an inducible promoter.

“组成型启动子”指一般将引起基因在多数细胞类型中在多数情况下表达的启动子。“组织特异性启动子”和“组织优选启动子”可互换使用,并且指主要但非必须专一地在一种组织或器官中表达,而且也可在一种特定细胞或细胞型中表达的启动子。“发育调控启动子”指其活性由发育事件决定的启动子。“诱导型启动子”响应内源性或外源性刺激(环境、激素、化学信号等)而选择性表达可操纵连接的DNA序列。A "constitutive promoter" refers to a promoter that will generally cause a gene to be expressed in most cell types under most circumstances. "Tissue-specific promoter" and "tissue-preferred promoter" are used interchangeably and refer to expression primarily, but not necessarily exclusively, in a tissue or organ, but also in a particular cell or cell type promoter. "Developmentally regulated promoter" refers to a promoter whose activity is determined by developmental events. An "inducible promoter" selectively expresses an operably linked DNA sequence in response to endogenous or exogenous stimuli (environmental, hormonal, chemical signals, etc.).

如本文中所用,术语“可操作地连接”指调控元件(例如但不限于,启动子序列、转录终止序列等)与核酸序列(例如,编码序列或开放读码框)连接,使得核苷酸序列的转录被所述转录调控元件控制和调节。用于将调控元件区域可操作地连接于核酸分子的技术为本领域已知的。As used herein, the term "operably linked" refers to regulatory elements (eg, but not limited to, promoter sequences, transcription termination sequences, etc.) are linked to a nucleic acid sequence (eg, a coding sequence or open reading frame) such that nucleotides Transcription of the sequence is controlled and regulated by the transcriptional regulatory elements. Techniques for operably linking regions of regulatory elements to nucleic acid molecules are known in the art.

将核酸分子(例如质粒、线性核酸片段、RNA等)或蛋白质“导入”生物体是指用所述核酸或蛋白质转化生物体细胞,使得所述核酸或蛋白质在细胞中能够发挥功能。本发明所用的“转化”包括稳定转化和瞬时转化。"Introducing" a nucleic acid molecule (eg, plasmid, linear nucleic acid fragment, RNA, etc.) or protein into an organism refers to transforming the cells of the organism with the nucleic acid or protein so that the nucleic acid or protein can function in the cell. "Transformation" as used in the present invention includes stable transformation and transient transformation.

“稳定转化”指将外源核苷酸序列导入基因组中,导致外源基因稳定遗传。一旦稳定转化,外源核酸序列稳定地整合进所述生物体和其任何连续世代的基因组中。"Stable transformation" refers to the introduction of a foreign nucleotide sequence into the genome, resulting in the stable inheritance of the foreign gene. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and any successive generations thereof.

“瞬时转化”指将核酸分子或蛋白质导入细胞中,执行功能而没有外源基因稳定遗传。瞬时转化中,外源核酸序列不整合进基因组中。"Transient transformation" refers to the introduction of a nucleic acid molecule or protein into a cell to perform a function without the stable inheritance of an exogenous gene. In transient transformation, the exogenous nucleic acid sequence is not integrated into the genome.

如本文所使用的,术语“植物”包括整个植物和任何后代、植物的细胞、组织、或部分。术语“植物部分”包括植物的任何部分,包括,例如但不限于:种子(包括成熟种子、没有种皮的未成熟胚、和不成熟的种子);植物插条(plant cutting);植物细胞;植物细胞培养物;植物器官(例如,花粉、胚、花、果实、芽、叶、根、茎,和相关外植体)。植物组织或植物器官可以是种子、愈伤组织、或者任何其他被组织成结构或功能单元的植物细胞群体。植物细胞或组织培养物能够再生出具有该细胞或组织所来源的植物的生理学和形态学特征的植物,并能够再生出与该植物具有基本上相同基因型的植物。与此相反,一些植物细胞不能够再生产生植物。植物细胞或组织培养物中的可再生细胞可以是胚、原生质体、分生细胞、愈伤组织、花粉、叶、花药、根、根尖、丝、花、果仁、穗、穗轴、壳、或茎。As used herein, the term "plant" includes whole plants and any progeny, cells, tissues, or parts of plants. The term "plant part" includes any part of a plant, including, for example, but not limited to: seeds (including mature seeds, immature embryos without seed coats, and immature seeds); plant cuttings; plant cells; Plant cell cultures; plant organs (eg, pollen, embryos, flowers, fruits, shoots, leaves, roots, stems, and related explants). A plant tissue or plant organ can be a seed, callus, or any other population of plant cells organized into structural or functional units. A plant cell or tissue culture is capable of regenerating plants having the physiological and morphological characteristics of the plant from which the cell or tissue is derived, and capable of regenerating plants having substantially the same genotype as the plant. In contrast, some plant cells are unable to regenerate to produce plants. The regenerable cells in plant cells or tissue culture can be embryos, protoplasts, meristems, callus, pollen, leaves, anthers, roots, root tips, silks, flowers, nuts, ears, cobs, shells , or stem.

植物部分包括可收获的部分和可用于繁殖后代植物的部分。可用于繁殖的植物部分包括,例如但不限于:种子;果实;插条;苗;块茎;和砧木。植物的可收获部分可以是植物的任何有用部分,包括,例如但不限于:花;花粉;苗;块茎;叶;茎;果实;种子;和根。Plant parts include both harvestable parts and parts that can be used to propagate progeny plants. Plant parts that can be used for propagation include, for example, but not limited to: seeds; fruits; cuttings; seedlings; tubers; and rootstocks. A harvestable part of a plant can be any useful part of a plant, including, for example, but not limited to: flowers; pollen; shoots; tubers; leaves; stems; fruits; seeds; and roots.

植物细胞是植物的结构和生理单元。如本文所使用的,植物细胞包括原生质体和具有部分细胞壁的原生质体。植物细胞可以处于分离的单个细胞或细胞聚集体的形式(例如,松散愈伤组织和培养的细胞),并且可以是更高级组织单元(例如,植物组织、植物器官、和植物)的一部分。因此,植物细胞可以是原生质体、产生配子的细胞,或者能够再生成完整植物的细胞或细胞的集合。因此,在本文的实施方案中,包含多个植物细胞并能够再生成为整株植物的种子被认为是一种“植物部分”。Plant cells are the structural and physiological units of plants. As used herein, plant cells include protoplasts and protoplasts with partial cell walls. Plant cells can be in the form of isolated individual cells or cell aggregates (eg, loose callus and cultured cells), and can be part of higher order tissue units (eg, plant tissues, plant organs, and plants). Thus, a plant cell can be a protoplast, a gamete-producing cell, or a cell or collection of cells capable of regenerating into a whole plant. Thus, in embodiments herein, a seed comprising a plurality of plant cells and capable of regenerating into a whole plant is considered a "plant part".

如本文所使用的,术语“原生质体”是指细胞壁被完全或部分地除去、其脂双层膜裸露的植物细胞。典型地,原生质体是没有细胞壁的分离植物细胞,其具有再生成细胞培养物或整株植物的潜力。As used herein, the term "protoplast" refers to a plant cell with its cell wall completely or partially removed and its lipid bilayer membrane exposed. Typically, protoplasts are isolated plant cells without a cell wall that have the potential to regenerate into cell cultures or whole plants.

植物“后代”包括植物的任何后续世代。Plant "progeny" includes any subsequent generation of the plant.

“性状”指植物或特定植物材料或细胞的生理的、形态的、生化的或物理的特征。在一些实施方式中,这些特征可以是肉眼可见的,比如种子、植株的大小等;可用生物化学技术测定的指标,如种子或叶片中蛋白、淀粉或油份的含量等;可观察的代谢或生理过程,如测定对水分胁迫、特定盐、糖或氮浓度的抗性;可检测的基因表达水平;或可观察渗透胁迫的抗性或产量等农艺性状。在一些实施方式中,性状还包括植物的倍性(ploidy),例如对植物育种重要的单倍性(h叩loidy)。在一些实施方式中,性状还包括植物对除草剂的抗性。"Trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or specific plant material or cell. In some embodiments, these characteristics may be visible to the naked eye, such as the size of seeds, plants, etc.; indicators that can be determined by biochemical techniques, such as the content of protein, starch or oil in seeds or leaves, etc.; observable metabolism or Physiological processes such as determination of resistance to water stress, specific salt, sugar, or nitrogen concentrations; detectable gene expression levels; or observable agronomic traits such as resistance to osmotic stress or yield. In some embodiments, the trait also includes the ploidy of the plant, eg, the haploidy important for plant breeding. In some embodiments, the trait also includes the resistance of the plant to herbicides.

“农艺性状”是可测量的指标参数,包括但不限于:叶片绿色、籽粒产量、生长速率、总生物量或积累速率、成熟时的鲜重、成熟时的干重、果实产量、种子产量、植物总氮含量、果实氮含量、种子氮含量、植物营养组织氮含量、植物总游离氨基酸含量、果实游离氨基酸含量、种子游离氨基酸含量、植物营养组织游离氨基酸含量、植物总蛋白含量、果实蛋白含量、种子蛋白含量、植物营养组织蛋白质含量、抗旱性、氮的吸收、根的倒伏、收获指数、茎的倒伏、株高、穗高、穗长、抗病性、抗寒性、抗盐性和分蘖数等。"Agronomic traits" are measurable indicator parameters including, but not limited to: leaf greenness, grain yield, growth rate, total biomass or accumulation rate, fresh weight at maturity, dry weight at maturity, fruit yield, seed yield, Plant total nitrogen content, fruit nitrogen content, seed nitrogen content, plant vegetative tissue nitrogen content, plant total free amino acid content, fruit free amino acid content, seed free amino acid content, plant vegetative tissue free amino acid content, plant total protein content, fruit protein content , seed protein content, plant vegetative tissue protein content, drought resistance, nitrogen uptake, root lodging, harvest index, stem lodging, plant height, ear height, ear length, disease resistance, cold resistance, salt resistance and Number of tillers, etc.

二、基于Cpf1蛋白的碱基编辑系统2. Base editing system based on Cpf1 protein

本发明提供了一种用于对生物体基因组中的靶序列进行碱基编辑的系统,其包含以下i)至v)中至少一项:The present invention provides a system for base editing a target sequence in the genome of an organism, comprising at least one of the following i) to v):

i)碱基编辑融合蛋白,和向导RNA;i) base editing fusion protein, and guide RNA;

ii)包含编码碱基编辑融合蛋白的核苷酸序列的表达构建体,和向导 RNA;ii) an expression construct comprising a nucleotide sequence encoding a base-edited fusion protein, and a guide RNA;

iii)碱基编辑融合蛋白,和包含编码向导RNA的核苷酸序列的表达构建体;iii) base editing fusion proteins, and expression constructs comprising nucleotide sequences encoding guide RNAs;

iv)包含编码碱基编辑融合蛋白的核苷酸序列的表达构建体,和包含编码向导RNA的核苷酸序列的表达构建体;iv) an expression construct comprising a nucleotide sequence encoding a base-edited fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;

v)包含编码碱基编辑融合蛋白的核苷酸序列和编码向导RNA的核苷酸序列的表达构建体;v) an expression construct comprising a nucleotide sequence encoding a base-edited fusion protein and a nucleotide sequence encoding a guide RNA;

其中所述碱基编辑融合蛋白包含DNA切割活性缺失的Cpf1和脱氨酶,所述向导RNA能够将所述碱基编辑融合蛋白靶向基因组中的靶序列,导致所述靶序列中一或多个C至T或者A至G的取代。wherein the base-editing fusion protein comprises Cpf1 and deaminase deficient in DNA cleavage activity, and the guide RNA is capable of targeting the base-editing fusion protein to a target sequence in the genome, resulting in one or more C to T or A to G substitutions.

Cpf1包含一个DNA切割结构域(RuvC),将其突变后可以使Cpf1的 DNA切割活性缺失,形成“DNA切割活性缺失的Cpf1”。所述DNA切割活性缺失的Cpf1仍然保留gRNA指导的DNA结合能力。因此,原则上,当与另外的蛋白融合时,DNA切割活性缺失的Cpf1可以简单地通过与合适的向导RNA共表达而将所述另外的蛋白靶向几乎任何DNA序列。Cpf1 contains a DNA cleavage domain (RuvC), which can be mutated to delete the DNA cleavage activity of Cpf1, forming "Cpf1 with missing DNA cleavage activity". The DNA-cleaving activity-depleted Cpf1 still retains the DNA-binding ability directed by the gRNA. Thus, in principle, when fused to an additional protein, Cpf1 deficient in DNA cleavage activity can target the additional protein to almost any DNA sequence simply by co-expression with a suitable guide RNA.

本发明所述DNA切割活性缺失的Cpf1可以衍生自不同物种的Cpf1,例如,衍生自Francisella novicida U112、Acidaminococcussp.BV3L6和 Lachnospiraceae bacteriumND2006的分别称为FnCpf1(例如野生型的氨基酸序列示于SEQ ID NO:19)、AsCpf1(例如野生型的氨基酸序列示于SEQ ID NO:18)和LbCpf1的Cpf1蛋白(例如野生型的氨基酸序列示于SEQ ID NO:20)。The DNA cleavage activity-deficient Cpf1 of the present invention can be derived from Cpf1 of different species, for example, derived from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006, respectively called FnCpf1 (for example, the amino acid sequence of the wild type is shown in SEQ ID NO: 19), AsCpf1 (eg, the amino acid sequence of wild type is shown in SEQ ID NO: 18) and Cpf1 protein of LbCpf1 (eg, the amino acid sequence of wild type is shown in SEQ ID NO: 20).

在一些实施方案中,所述DNA切割活性缺失的Cpf1是DNA切割活性缺失的FnCpf1。在一些具体实施方式中,所述DNA切割活性缺失的FnCpf1 相对于野生型FnCpf1包含D917A突变。In some embodiments, the DNA cleavage activity-depleted Cpf1 is DNA cleavage activity-depleted FnCpf1. In some embodiments, the DNA-cleaving activity-depleted FnCpf1 comprises a D917A mutation relative to wild-type FnCpf1.

在一些实施方案中,所述DNA切割活性缺失的Cpf1是DNA切割活性缺失的AsCpf1。在一些具体实施方式中,所述DNA切割活性缺失的AsCpf1 相对于野生型AsCpf1包含D908A突变。In some embodiments, the DNA cleavage activity-deficient Cpf1 is DNA cleavage activity-depleted AsCpf1. In some embodiments, the DNA cleavage activity-depleted AsCpf1 comprises a D908A mutation relative to wild-type AsCpf1.

在一些优选实施方案中,所述DNA切割活性缺失的Cpf1是DNA切割活性缺失的LbCpf1。在一些具体实施方式中,所述DNA切割活性缺失的LbCpf1相对于野生型LbCpf1包含D832A突变。In some preferred embodiments, the DNA cleavage activity-depleted Cpf1 is DNA cleavage activity-depleted LbCpf1. In some embodiments, the DNA cleavage activity-depleted LbCpf1 comprises a D832A mutation relative to wild-type LbCpf1.

在一些实施方案中,所述DNA切割活性缺失的Cpf1保留其RNA切割活性,从而能对pre-crRNA进行加工以形成成熟crRNA。因此,在一些实施方案中,本发明的系统中的包含编码向导RNA的核苷酸序列的表达构建体可以包含编码串联的多个不同向导RNA(crRNA)前体的序列,其在转录后可以被所述DNA切割活性缺失的Cpf1加工形成多个不同的向导 RNA(crRNA),从而同时靶向多个不同靶序列。In some embodiments, the DNA-cleaving activity-depleted Cpf1 retains its RNA-cleaving activity, thereby enabling processing of pre-crRNA to form mature crRNA. Thus, in some embodiments, an expression construct comprising a nucleotide sequence encoding a guide RNA in the system of the invention may comprise a sequence encoding a tandem of multiple different guide RNA (crRNA) precursors, which upon transcription may Cpf1, which is deficient in the DNA cleavage activity, is processed to form multiple different guide RNAs (crRNAs), thereby simultaneously targeting multiple different target sequences.

本发明的一些实施方案中,融合蛋白中的所述脱氨酶是胞苷脱氨酶,例如载脂蛋白B mRNA编辑复合体(APOBEC)家族脱氨酶。In some embodiments of the invention, the deaminase in the fusion protein is a cytidine deaminase, such as an apolipoprotein B mRNA editing complex (APOBEC) family deaminase.

胞苷脱氨酶可以催化DNA上胞苷(C)的脱氨化作用形成尿嘧啶(U)。本发明人令人惊奇地发现,将DNA切割活性缺失的Cpf1与胞苷脱氨酶融合,在向导RNA的指导下,融合蛋白可以靶向基因组中的靶序列,由于Cpf1 的DNA切割活性失活缺失,DNA双链不被切割,而融合蛋白中的胞苷脱氨酶能够将Cpf1-向导RNA-DNA复合物形成中产生的单链DNA的胞苷脱氨转换成U,再通过碱基错配修复实现C至T的取代。Cytidine deaminase catalyzes the deamination of cytidine (C) on DNA to form uracil (U). The inventors have surprisingly found that by fusing Cpf1 with a lack of DNA cleavage activity to cytidine deaminase, under the guidance of guide RNA, the fusion protein can target the target sequence in the genome, due to the inactivation of the DNA cleavage activity of Cpf1. In the absence, the DNA double-strand is not cleaved, and the cytidine deaminase in the fusion protein can convert the cytidine deamination of the single-stranded DNA generated in the formation of the Cpf1-guide RNA-DNA complex into U, and then through the base error. With repair to achieve C to T substitution.

本发明所述胞苷脱氨酶特别是可以接受单链DNA作为底物的胞苷脱氨酶。本发明可用的胞苷脱氨酶的实例包括但不限于:APOBEC1脱氨酶、激活诱导的胞苷脱氨酶(AID)、APOBEC3G或CDA1。在本发明的一些具体实施方式中,所述胞苷脱氨酶包含SEQ ID NO:1所示的氨基酸序列。The cytidine deaminase of the present invention is particularly a cytidine deaminase that can accept single-stranded DNA as a substrate. Examples of cytidine deaminase enzymes useful in the present invention include, but are not limited to, APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G, or CDA1. In some specific embodiments of the present invention, the cytidine deaminase comprises the amino acid sequence shown in SEQ ID NO:1.

在融合蛋白中的脱氨酶是胞苷脱氨酶的情况下,本发明的碱基编辑系统可以将基因组靶序列中的一或多个C突变为T,也称作Cpf1-PBE系统。In the case where the deaminase in the fusion protein is cytidine deaminase, the base editing system of the present invention can mutate one or more Cs to Ts in the genomic target sequence, also referred to as the Cpf1-PBE system.

在细胞中,尿嘧啶DNA糖基化酶催化U从DNA上的去除并启动碱基切除修复(BER),导致将U:G修复成C:G。因此,不受任何理论限制,在本发明的碱基编辑融合蛋白或本发明的系统中包含尿嘧啶DNA糖基化酶抑制剂将能够增加碱基编辑的效率。In cells, uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), resulting in the repair of U:G to C:G. Therefore, without being bound by any theory, the inclusion of a uracil DNA glycosylase inhibitor in the base editing fusion proteins of the present invention or the systems of the present invention will increase the efficiency of base editing.

因此,在本发明的涉及Cpf1-PBE系统的一些实施方案中,所述碱基编辑融合蛋白还包含尿嘧啶DNA糖基化酶抑制剂(UGI)。在一些具体实施方式中,所述尿嘧啶DNA糖基化酶抑制剂包含SEQ ID NO:2所示的氨基酸序列。Accordingly, in some embodiments of the invention involving the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI). In some specific embodiments, the uracil DNA glycosylase inhibitor comprises the amino acid sequence shown in SEQ ID NO:2.

本发明的一些实施方案中,所述脱氨酶是腺嘌呤脱氨酶。In some embodiments of the invention, the deaminase is adenine deaminase.

天然存在的腺嘌呤脱氨酶同常以RNA为底物,通过脱氨作用将单链RNA上的腺苷转变成肌苷(I)。最近,通过定向进化的方法,已经基于大肠杆菌的tRNA腺嘌呤脱氨酶TadA获得了能够以单链DNA为底物,将单链 DNA上的脱氧鸟苷转变为肌苷(I)的DNA依赖型腺嘌呤脱氨酶。参见Nicloe M.Gaudelli等人,doi:10.1038/nature24644,2017。Naturally occurring adenine deaminase also uses RNA as a substrate and converts adenosine on single-stranded RNA to inosine (I) by deamination. Recently, a DNA-dependent DNA-dependent conversion of deoxyguanosine on single-stranded DNA to inosine (I) using single-stranded DNA as a substrate has been obtained based on the tRNA adenine deaminase TadA of Escherichia coli by means of directed evolution. type adenine deaminase. See Nicloe M. Gaudelli et al., doi: 10.1038/nature24644, 2017.

本发明人令人惊奇地发现,将DNA切割活性缺失的Cpf1与DNA依赖型腺嘌呤脱氨酶融合,在向导RNA的指导下,融合蛋白可以靶向植物基因组中的靶序列,由于Cpf1的DNA切割活性缺失,DNA双链不被切割,,而融合蛋白中的DNA依赖型腺嘌呤脱氨酶能够将Cpf1-向导RNA-DNA复合物形成中产生的单链DNA的腺苷脱氨转换成肌苷(I),由于DNA聚合酶会将肌苷(I)当做鸟嘌呤(G)处理,因此通过碱基错配修复可以实现A至G的取代。因此,在融合蛋白中的脱氨酶是DNA依赖型腺嘌呤脱氨酶的情况下,本发明的碱基编辑系统可以将基因组靶序列中的一或多个A突变为G,也称作Cpf1-ABE系统。The inventors of the present invention surprisingly found that by fusing Cpf1, which lacks DNA cleavage activity, with DNA-dependent adenine deaminase, under the guidance of guide RNA, the fusion protein can target the target sequence in the plant genome, because the DNA of Cpf1 The cleavage activity is absent, the DNA double-strand is not cleaved, and the DNA-dependent adenine deaminase in the fusion protein can convert the adenosine deamination of single-stranded DNA generated in the formation of the Cpf1-guide RNA-DNA complex into myosine For glycoside (I), since DNA polymerase treats inosine (I) as guanine (G), substitution of A to G can be achieved by base mismatch repair. Therefore, in the case where the deaminase in the fusion protein is a DNA-dependent adenine deaminase, the base editing system of the present invention can mutate one or more A to G in the genomic target sequence, also referred to as Cpf1 -ABE system.

在本发明的一些实施方案中,所述DNA依赖型腺嘌呤脱氨酶是大肠杆菌tRNA腺嘌呤脱氨酶TadA(ecTadA)的变体,特别是可以接受单链DNA 作为底物的变体,所述变体相对于野生型ecTadA包含一或多组选自以下的突变:In some embodiments of the invention, the DNA-dependent adenine deaminase is a variant of the E. coli tRNA adenine deaminase TadA (ecTadA), particularly a variant that can accept single-stranded DNA as a substrate, The variant comprises, relative to wild-type ecTadA, one or more sets of mutations selected from the group consisting of:

1)A106V和D108N;1) A106V and D108N;

2)D147Y和E155V;2) D147Y and E155V;

3)L84F、H123Y和I156F;3) L84F, H123Y and I156F;

4)A142N;4) A142N;

5)H36L、R51L、S146C和K157N;5) H36L, R51L, S146C and K157N;

6)P48S/T/A;6) P48S/T/A;

7)A142N;7) A142N;

8)W23L/R;8) W23L/R;

9)R152H/P。9) R152H/P.

在本发明一具体实施方式中,所述DNA依赖型腺嘌呤脱氨酶(ABE版本7.9)相对于野生型ecTadA包含以下突变:W23R、H36L、R51L、S146C、 K157N、A106V、D108N、P48A、L84F、H123Y、I156F、A142N、D147Y、 E155V和R152P。In a specific embodiment of the invention, the DNA-dependent adenine deaminase (ABE version 7.9) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F , H123Y, I156F, A142N, D147Y, E155V and R152P.

在本发明一具体实施方式中,所述DNA依赖型腺嘌呤脱氨酶(ABE版本7.10)相对于野生型ecTadA包含以下突变:W23R、H36L、R51L、S146C、 K157N、A106V、D108N、P48A、L84F、H123Y、I156F、D147Y、E155V 和R152P。In a specific embodiment of the invention, the DNA-dependent adenine deaminase (ABE version 7.10) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F , H123Y, I156F, D147Y, E155V and R152P.

野生型ecTadA氨基酸序列如下所示: MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNR PIGRHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIH SRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALL SDFFRMRRQEIKAQKKAQSSTD(SEQ ID NO:3)。在一些实施方案中,其中起始甲硫氨酸可以不存在。The wild-type ecTadA amino acid sequence is shown below: MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALLSDFFRMRRQEIKAQKKAQSSTD (SEQ ID NO: 3). In some embodiments, the starting methionine may be absent.

优选的ecTadA衍生的DNA依赖型腺嘌呤脱氨酶(ABE版本7.10)氨基酸序列如下所示: MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNR AIGLHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTFEPCVMCAGAMIH SRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECAALLCYFFRMPRQVFNAQKKAQSSTD(SEQ ID NO:4)。在一些实施方案中,其中起始甲硫氨酸可以不存在。A preferred ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) amino acid sequence is shown below: MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECAALLCYFFRMPRQVFNAQKKAQSSTD (SEQ ID NO. In some embodiments, the starting methionine may be absent.

在本发明的一些实施方案中,所述脱氨酶被融合至所述DNA切割活性缺失的Cpf1的N末端。在一些实施方案中,所述脱氨酶被融合至所述DNA 切割活性缺失的Cpf1的C末端。In some embodiments of the invention, the deaminase is fused to the N-terminus of Cpfl that is deficient in DNA cleavage activity. In some embodiments, the deaminase is fused to the C-terminus of Cpf1 in which the DNA cleavage activity is deleted.

在一些优选实施方案中,所述DNA依赖型腺嘌呤脱氨酶的N端融合有对应的野生型腺嘌呤脱氨酶。预期DNA依赖型腺嘌呤脱氨酶与野生型腺嘌呤脱氨酶形成异二聚体可以显著提高融合蛋白A至G的编辑活性。In some preferred embodiments, the DNA-dependent adenine deaminase is fused to the N-terminus of the corresponding wild-type adenine deaminase. It is expected that DNA-dependent adenine deaminase forms a heterodimer with wild-type adenine deaminase, which can significantly enhance the editing activity of fusion proteins A to G.

在本发明的一些实施方案中,所述脱氨酶和所述DNA切割活性缺失的 Cpf1通过接头融合。所述接头可以是长1-50个(例如1、2、3、4、5、6、 7、8、9、10、11、12、13、14、15、16、17、18、19、20个或20-25 个、25-50个)或更多个氨基酸、无二级以上结构的非功能性氨基酸序列。例如,所述接头可以是柔性接头,例如GGGGS、GS、GAP、(GGGGS)x 3、GGS和(GGS)x7等。在一些具体的实施方案中,所述接头是XTEN接头。在一些具体实施方案中,所述接头长32个氨基酸。在一些具体的实施方案中,所述接头的氨基酸序列为: SGGSSGGSSGSETPGTSESATPESSGGSSGGS。In some embodiments of the invention, the deaminase and the DNA cleavage activity-depleted Cpf1 are fused by a linker. The linkers may be 1-50 in length (eg 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 20-25, 25-50) or more amino acids, non-functional amino acid sequence without secondary structure. For example, the linker may be a flexible linker such as GGGGS, GS, GAP, (GGGGS)x3, GGS and (GGS)x7, and the like. In some specific embodiments, the linker is an XTEN linker. In some specific embodiments, the linker is 32 amino acids in length. In some specific embodiments, the amino acid sequence of the linker is: SGGSSGGSSGSETPGTSESATPESSGGSSGGS.

在本发明的一些实施方案中,本发明的碱基编辑融合蛋白还包含核定位序列(NLS)。一般而言,所述碱基编辑融合蛋白中的一个或多个NLS应具有足够的强度,以便在植物细胞的核中驱动所述碱基编辑融合蛋白以可实现其碱基编辑功能的量积聚。一般而言,核定位活性的强度由所述碱基编辑融合蛋白中NLS的数目、位置、所使用的一个或多个特定的NLS、或这些因素的组合决定。In some embodiments of the invention, the base editing fusion proteins of the invention further comprise a nuclear localization sequence (NLS). In general, the one or more NLSs in the base editing fusion protein should be of sufficient strength to drive accumulation of the base editing fusion protein in the nucleus of a plant cell in an amount that will enable its base editing function . In general, the strength of nuclear localization activity is determined by the number, location of NLSs in the base editing fusion protein, one or more specific NLSs used, or a combination of these factors.

在本发明的一些实施方案中,本发明的碱基编辑融合蛋白的NLS可以位于N端和/或C端。在一些实施方案中,所述碱基编辑融合蛋白包含约1、 2、3、4、5、6、7、8、9、10个或更多个NLS。在一些实施方案中,所述碱基编辑融合蛋白包含在或接近于N端的约1、2、3、4、5、6、7、8、9、 10个或更多个NLS。在一些实施方案中,所述碱基编辑融合蛋白包含在或接近于C端约1、2、3、4、5、6、7、8、9、10个或更多个NLS。在一些实施方案中,所述碱基编辑融合蛋白包含这些的组合,如包含在N端的一个或多个NLS以及在C端的一个或多个NLS。当存在多于一个NLS时,每一个可以被选择为不依赖于其他NLS。在本发明的一些优选实施方式中,所述碱基编辑融合蛋白包含2个NLS,例如所述2个NLS分别位于N端和C端。In some embodiments of the invention, the NLS of the base editing fusion proteins of the invention may be located at the N-terminus and/or the C-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the N-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the C-terminus. In some embodiments, the base editing fusion protein comprises a combination of these, such as comprising one or more NLSs at the N-terminus and one or more NLSs at the C-terminus. When there is more than one NLS, each can be selected to be independent of the other NLS. In some preferred embodiments of the present invention, the base editing fusion protein comprises 2 NLSs, eg, the 2 NLSs are located at the N-terminus and the C-terminus, respectively.

一般而言,NLS由暴露于蛋白表面上的带正电的赖氨酸或精氨酸的一个或多个短序列组成,但其他类型的NLS也是已知的。NLS的非限制性实例包括:KKRKV(核苷酸序列5’-AAGAAGAGAAAGGTC-3’)、 PKKKRKV(核苷酸序列5’-CCCAAGAAGAAGAGGAAGGTG-3’或CCAAAGAAGAAGAGGAAGGTT),或SGGSPKKKRKV(核苷酸序列5’- TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG-3’)。In general, NLSs consist of one or more short sequences of positively charged lysines or arginines exposed on the surface of proteins, but other types of NLSs are also known. Non-limiting examples of NLS include: KKRKV (nucleotide sequence 5'-AAGAAGAGAAAGGTC-3'), PKKKRKV (nucleotide sequence 5'-CCCAAGAAGAAGAGGAAGGTG-3' or CCAAAGAAGAAGAGGAAGGTT), or SGGSPKKKRKV (nucleotide sequence 5'- TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG-3').

在本发明的一些实施方式中,所述碱基编辑融合蛋白的N端包含 PKKKRKV所示的氨基酸序列的NLS。在本发明的一些实施方式中,所述碱基编辑融合蛋白的C端包含SGGSPKKKRKV或 KRPAATKKAGQAKKKK所示的氨基酸序列的NLS。In some embodiments of the invention, the N-terminus of the base-edited fusion protein comprises the NLS of the amino acid sequence shown in PKKKRKV. In some embodiments of the present invention, the C-terminus of the base editing fusion protein comprises the NLS of the amino acid sequence shown in SGGSPKKKRKV or KRPAATKKAGQAKKKK.

此外,根据所需要编辑的DNA位置,本发明的碱基编辑融合蛋白还可以包括其他的定位序列,例如细胞质定位序列、叶绿体定位序列、线粒体定位序列等。In addition, according to the DNA position to be edited, the base editing fusion protein of the present invention may also include other localization sequences, such as cytoplasmic localization sequences, chloroplast localization sequences, mitochondrial localization sequences, and the like.

在本发明的涉及Cpf1-PBE系统的一些实施方案中,所述碱基编辑融合蛋白还包含尿嘧啶DNA糖基化酶抑制剂(UGI),且还在所述UGI的N端或 C端紧接两个NLS。在一些优选实施方案中,本发明的碱基编辑融合蛋白包含选自SEQ ID NO:24-29的氨基酸序列。In some embodiments of the present invention involving the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI), and is also close to the N-terminus or C-terminus of the UGI Pick up two NLS. In some preferred embodiments, the base editing fusion proteins of the present invention comprise an amino acid sequence selected from the group consisting of SEQ ID NOs: 24-29.

为了获得有效表达,在本发明的一些实施方式中,所述编码碱基编辑融合蛋白的核苷酸序列针对待进行碱基编辑的生物物种进行密码子优化。In order to obtain efficient expression, in some embodiments of the present invention, the nucleotide sequence encoding the base-edited fusion protein is codon-optimized for the biological species to be base-edited.

密码子优化是指通过用在宿主细胞的基因中更频繁地或者最频繁地使用的密码子代替天然序列的至少一个密码子(例如约或多于约1、2、3、4、 5、10、15、20、25、50个或更多个密码子同时维持该天然氨基酸序列而修饰核酸序列以便增强在感兴趣宿主细胞中的表达的方法。不同的物种对于特定氨基酸的某些密码子展示出特定的偏好。密码子偏好性(在生物之间的密码子使用的差异)经常与信使RNA(mRNA)的翻译效率相关,而该翻译效率则被认为依赖于被翻译的密码子的性质和特定的转运RNA(tRNA)分子的可用性。细胞内选定的tRNA的优势一般反映了最频繁用于肽合成的密码子。因此,可以将基因定制为基于密码子优化在给定生物中的最佳基因表达。密码子利用率表可以容易地获得,例如在www.kazusa.orjp/codon/上可获得的密码子使用数据库(“Codon Usage Database”)中,并且这些表可以通过不同的方式调整适用。参见,Nakamura Y.等,“Codon usage tabulatedfrom theinternational DNA sequencedatabases:statusfortheyear2000. Nucl.AcidsRes.,28:292(2000)。Codon optimization refers to replacing at least one codon of a native sequence (eg, about or more than about 1, 2, 3, 4, 5, 10) with a codon that is more or most frequently used in a gene in a host cell. , 15, 20, 25, 50 or more codons while maintaining the native amino acid sequence and modifying the nucleic acid sequence so as to enhance the expression in a host cell of interest. Different species display certain codons for specific amino acids Codon preference (differences in codon usage between organisms) is often related to the translation efficiency of messenger RNA (mRNA), which is thought to depend on the nature and nature of the codons being translated. Availability of a specific transfer RNA (tRNA) molecule. The predominance of a selected tRNA within a cell generally reflects the codons most frequently used for peptide synthesis. Thus, genes can be tailored based on codon optimization for the most efficient use in a given organism Optimal gene expression. Codon usage tables are readily available, for example in the "Codon Usage Database" available at www.kazusa.orjp/codon/ , and these tables can be adjusted in different ways Applicable. See, Nakamura Y. et al., "Codon usage tabulated from the international DNA sequence databases: status for the year 2000. Nucl. Acids Res., 28: 292 (2000).

在一些具体实施方案中,本发明的碱基编辑融合蛋白由选自SEQ ID NO:8-9、11-12或14-15的核苷酸序列编码。In some specific embodiments, the base editing fusion proteins of the invention are encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOs: 8-9, 11-12, or 14-15.

在本发明一些实施方式中,所述编码碱基编辑融合蛋白的核苷酸序列和/或所述编码向导RNA的核苷酸序列与表达调控元件如启动子可操作地连接。In some embodiments of the invention, the nucleotide sequence encoding a base-edited fusion protein and/or the nucleotide sequence encoding a guide RNA is operably linked to an expression control element, such as a promoter.

本发明可使用的启动子的实例包括但不限于聚合酶(pol)I、pol II或pol III启动子。pol I启动子的实例包括鸡RNApol I启动子。pol II启动子的实例包括但不限于巨细胞病毒立即早期(CMV)启动子、劳斯肉瘤病毒长末端重复(RSV-LTR)启动子和猿猴病毒40(SV40)立即早期启动子。pol III启动子的实例包括U6和H1启动子。可以使用诱导型启动子如金属硫蛋白启动子。启动子的其他实例包括T7噬菌体启动子、T3噬菌体启动子、β-半乳糖苷酶启动子和Sp6噬菌体启动子。当用于植物时,启动子可以是花椰菜花叶病毒35S启动子、玉米Ubi-1启动子、小麦U6启动子、水稻U3启动子、玉米U3启动子、水稻肌动蛋白启动子。Examples of promoters that can be used in the present invention include, but are not limited to, polymerase (pol) I, pol II, or pol III promoters. Examples of pol I promoters include the chicken RNA pol I promoter. Examples of pol II promoters include, but are not limited to, the cytomegalovirus immediate early (CMV) promoter, the Rous sarcoma virus long terminal repeat (RSV-LTR) promoter, and the simian virus 40 (SV40) immediate early promoter. Examples of pol III promoters include the U6 and H1 promoters. Inducible promoters such as the metallothionein promoter can be used. Other examples of promoters include T7 phage promoter, T3 phage promoter, β-galactosidase promoter, and Sp6 phage promoter. When used in plants, the promoter may be the cauliflower mosaic virus 35S promoter, the maize Ubi-1 promoter, the wheat U6 promoter, the rice U3 promoter, the maize U3 promoter, the rice actin promoter.

优选地,向导RNA(crRNA)使用Ubi-1启动子表达并用核酶例如HDV 核酶来切割成熟。Preferably, the guide RNA (crRNA) is expressed using the Ubi-1 promoter and matured by cleavage with a ribozyme such as HDV ribozyme.

在一实施方式中,在Ubi-1启动子后添加一内含子可以增强目的蛋白或 RNA的表达。In one embodiment, adding an intron after the Ubi-1 promoter can enhance the expression of the protein or RNA of interest.

在一些具体实施方案中,用于表达本发明的碱基融合蛋白的表达构建体包含SEQID NO:10或13所示的表达盒。或者所述表达构建体包含SEQ ID NO:30所示的表达调控序列。In some specific embodiments, the expression construct for expressing the base fusion protein of the present invention comprises the expression cassette set forth in SEQ ID NO: 10 or 13. Alternatively, the expression construct comprises the expression control sequence set forth in SEQ ID NO:30.

三、产生经遗传修饰的生物体的方法3. Methods of generating genetically modified organisms

在另一方面,本发明提供了一种产生经遗传修饰的生物体(如植物)的方法,包括将本发明的用于对生物体基因组中的靶序列进行碱基编辑的系统导入所述生物体细胞,由此所述向导RNA将所述碱基编辑融合蛋白靶向所述植物基因组中的靶序列,导致所述靶序列中的一或多个C被T取代或一或多个A被G取代。In another aspect, the present invention provides a method of producing a genetically modified organism, such as a plant, comprising introducing into said organism a system of the present invention for base editing a target sequence in the genome of an organism Somatic cells, whereby the guide RNA targets the base-editing fusion protein to a target sequence in the plant genome, resulting in one or more Cs in the target sequence being replaced by T or one or more A by G replaces.

可以被Cpf1蛋白和向导RNA(即crRNA)复合物识别并靶向的靶序列或 crRNA编码序列的设计可以参照例如Zhang et al.,Cell 163,1-13,October 22, 2015。一般而言,本发明的基因组编辑系统靶向的靶序列5’末端需包含前间区序列邻近基序(protospaceradjacent motif)(PAM)5’-TTTN或5’-YTN,其中N独立地选自A、G、C和T,Y选自C和T。The design of target sequences or crRNA coding sequences that can be recognized and targeted by the complex of Cpf1 protein and guide RNA (ie crRNA) can be referred to, for example, Zhang et al., Cell 163, 1-13, October 22, 2015. In general, the 5' end of the target sequence targeted by the genome editing system of the present invention needs to comprise a protospaceradjacent motif (PAM) 5'-TTTN or 5'-YTN, wherein N is independently selected from A, G, C and T, Y is selected from C and T.

例如,在本发明的一些实施方案中,所述靶序列具有以下结构:5’- TTTN-NX-3’或5’-YTN-NX-3’,其中N独立地选自A、G、C和T,Y选自C 和T;X为15≤X≤35的整数;Nx表示X个连续的核苷酸。For example, in some embodiments of the invention, the target sequence has the following structure: 5'-TTTN- NX -3' or 5'-YTN- NX -3', wherein N is independently selected from A, G , C and T, Y is selected from C and T; X is an integer of 15≤X≤35; Nx represents X consecutive nucleotides.

在本发明中,待进行修饰的靶序列可以位于基因组的任何位置,例如位于功能基因如蛋白编码基因内,或者例如可以位于基因表达调控区如启动子区或增强子区,从而实现对所述基因功能修饰或对基因表达的修饰。In the present invention, the target sequence to be modified can be located at any position in the genome, for example, in a functional gene such as a protein-coding gene, or, for example, in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve the Modification of gene function or modification of gene expression.

可以通过T7EI、PCR/RE或测序方法检测所述细胞靶序列中的A至G 或C至T碱基编辑。A to G or C to T base edits in the cellular target sequence can be detected by T7EI, PCR/RE or sequencing methods.

在本发明的方法中,所述碱基编辑的系统可以通过本领域技术人员熟知的各种方法导入细胞。可用于将本发明的基因组编辑系统导入细胞的方法包括但不限于:磷酸钙转染、原生质融合、电穿孔、脂质体转染、微注射、病毒感染(如杆状病毒、痘苗病毒、腺病毒、腺相关病毒、慢病毒和其他病毒)、基因枪法、PEG介导的原生质体转化、土壤农杆菌介导的转化。In the method of the present invention, the base editing system can be introduced into cells by various methods well known to those skilled in the art. Methods that can be used to introduce the genome editing system of the present invention into cells include, but are not limited to, calcium phosphate transfection, protoplast fusion, electroporation, lipofection, microinjection, viral infection (eg, baculovirus, vaccinia virus, adenovirus) virus, adeno-associated virus, lentivirus and others), biolistic, PEG-mediated protoplast transformation, Agrobacterium-mediated transformation.

可以通过本发明的方法进行基因组编辑的细胞可以来自例如,哺乳动物如人、小鼠、大鼠、猴、犬、猪、羊、牛、猫;家禽如鸡、鸭、鹅;植物,包括单子叶植物和双子叶植物,例如水稻、玉米、小麦、高粱、大麦、大豆、花生、拟南芥等。Cells that can undergo genome editing by the methods of the invention can be derived, for example, from mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, geese; plants, including monads Leafy and dicotyledonous plants such as rice, corn, wheat, sorghum, barley, soybean, peanut, Arabidopsis, and the like.

本发明的方法尤其适合于产生经遗传修饰的植物,例如作物植物。在本发明的产生经遗传修饰的植物的方法中,所述碱基编辑系统可以本领域技术人员熟知的各种方法导入植物。可用于将本发明的碱基编辑系统导入植物的方法包括但不限于:基因枪法、PEG介导的原生质体转化、土壤农杆菌介导的转化、植物病毒介导的转化、花粉管通道法和子房注射法。The methods of the present invention are particularly suitable for producing genetically modified plants, such as crop plants. In the methods of producing genetically modified plants of the present invention, the base editing system can be introduced into plants by various methods well known to those skilled in the art. Methods that can be used to introduce the base editing system of the present invention into plants include, but are not limited to: biolistic method, PEG-mediated protoplast transformation, Agrobacterium-mediated transformation, plant virus-mediated transformation, pollen tube pathway, and protoplast transformation. Atrial injection.

在本发明的产生经遗传修饰的植物的方法中,只需在植物细胞中导入或产生所述碱基编辑融合蛋白和向导RNA即可实现对靶序列的修饰,并且所述修饰可以稳定遗传,无需将所述碱基编辑系统稳定转化植物。这样避免了稳定存在的碱基编辑系统的潜在脱靶作用,也避免外源核苷酸序列在植物基因组中的整合,从而具有更高生物安全性。In the method for producing a genetically modified plant of the present invention, the modification of the target sequence can be achieved only by introducing or producing the base editing fusion protein and the guide RNA in a plant cell, and the modification can be stably inherited, There is no need to stably transform plants with the base editing system. This avoids potential off-target effects of stably existing base editing systems, and also avoids the integration of exogenous nucleotide sequences into the plant genome, resulting in higher biological safety.

在一些优选实施方式中,所述导入在不存在选择压力下进行,从而避免外源核苷酸序列在植物基因组中的整合。In some preferred embodiments, the introduction is performed in the absence of selective pressure, thereby avoiding integration of foreign nucleotide sequences into the plant genome.

在一些实施方式中,所述导入包括将本发明的碱基编辑系统转化至分离的植物细胞或组织,然后使所述经转化的植物细胞或组织再生为完整植物。优选地,在不存在选择压力下进行所述再生,也即是,在组织培养过程中不使用任何针对表达载体上携带的选择基因的选择剂。不使用选择剂可以提高植物的再生效率,获得不含外源核苷酸序列的经修饰的植物。In some embodiments, the introducing comprises transforming the base editing system of the invention into an isolated plant cell or tissue, and then regenerating the transformed plant cell or tissue into a whole plant. Preferably, the regeneration is carried out in the absence of selective pressure, that is, without the use of any selection agent for the selection gene carried on the expression vector during tissue culture. Regeneration efficiency of plants can be increased without the use of selection agents, resulting in modified plants free of exogenous nucleotide sequences.

在另一些实施方式中,可以将本发明的碱基编辑系统转化至完整植物上的特定部位,例如叶片、茎尖、花粉管、幼穗或下胚轴。这特别适合于难以进行组织培养再生的植物的转化。In other embodiments, the base editing systems of the invention can be transformed into specific sites on intact plants, such as leaves, shoot tips, pollen tubes, young ears, or hypocotyls. This is particularly suitable for the transformation of plants that are difficult to regenerate in tissue culture.

在本发明的一些实施方式中,直接将体外表达的蛋白质和/或体外转录的RNA分子转化至所述植物。所述蛋白质和/或RNA分子能够在植物细胞中实现碱基编辑,随后被细胞降解,避免了外源核苷酸序列在植物基因组中的整合。In some embodiments of the invention, the in vitro expressed protein and/or in vitro transcribed RNA molecule is directly transformed into the plant. The protein and/or RNA molecule enables base editing in plant cells and subsequent degradation by the cells, avoiding the integration of exogenous nucleotide sequences into the plant genome.

可以通过本发明的方法进行碱基编辑的植物包括单子叶植物和双子叶植物。例如,所述植物可以是作物植物,例如小麦、水稻、玉米、大豆、向日葵、高粱、油菜、苜蓿、棉花、大麦、粟、甘蔗、番茄、烟草、木薯或马铃薯。Plants that can be base edited by the methods of the present invention include monocotyledonous and dicotyledonous plants. For example, the plant can be a crop plant such as wheat, rice, corn, soybean, sunflower, sorghum, canola, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava or potato.

在本发明的一些实施方式中,其中所述靶序列与植物性状如农艺性状相关,由此所述碱基编辑导致所述植物相对于野生型植物具有改变的性状。In some embodiments of the invention, wherein the target sequence is associated with a plant trait, such as an agronomic trait, whereby the base editing results in the plant having an altered trait relative to a wild-type plant.

在本发明中,待进行修饰的靶序列可以位于基因组的任何位置,例如位于功能基因如蛋白编码基因内,或者例如可以位于基因表达调控区如启动子区或增强子区,从而实现对所述基因功能修饰或对基因表达的修饰。相应地,在本发明的一些实施方式中,所述C至T或A至G的取代导致靶蛋白中的氨基酸取代或靶蛋白的截短(生成了终止密码子)。在本发明的另一些实施方式中,所述C至T或A至G的取代导致靶基因的表达发生变化。In the present invention, the target sequence to be modified can be located at any position in the genome, for example, in a functional gene such as a protein-coding gene, or, for example, in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve the Modification of gene function or modification of gene expression. Accordingly, in some embodiments of the invention, the C to T or A to G substitutions result in amino acid substitutions in the target protein or truncation of the target protein (generating a stop codon). In other embodiments of the invention, the C to T or A to G substitution results in a change in the expression of the target gene.

在本发明的一些实施方式中,所述方法还包括获得所述经遗传修饰的植物的后代。In some embodiments of the invention, the method further comprises obtaining progeny of the genetically modified plant.

在另一方面,本发明还提供了经遗传修饰的植物或其后代或其部分,其中所述植物通过本发明上述的方法获得。In another aspect, the present invention also provides a genetically modified plant or progeny or part thereof, wherein said plant is obtained by the above-described method of the present invention.

在另一方面,本发明还提供了一种植物育种方法,包括将通过本发明上述的方法获得的经遗传修饰的第一植物与不含有所述遗传修饰的第二植物杂交,从而将所述遗传修饰导入第二植物。In another aspect, the present invention also provides a method of plant breeding, comprising crossing a first genetically modified plant obtained by the above-mentioned method of the present invention with a second plant not containing the genetic modification, so that the The genetic modification is introduced into the second plant.

实施例Example

构建Ubi-CPF1-PBE/ABE表达载体Construction of Ubi-CPF1-PBE/ABE expression vector

ABE、XTEN、dCPF1序列针对植物进行密码子优化并订购自GenScript (Nanjing)。使用引物对HindIII-F(具有HindIII限制位点)和EcoRI(具有 EcoRI限制位点)扩增全长dCPF1-ABE片段。PCR产物用HindIII和EcoRI 消化,然后插入至这两种酶消化的pJIT163-GFP载体(该载体序列示于SEQ ID NO:16)以产生融合表达载体dCPF1-ABE。ABE, XTEN, dCPF1 sequences were codon optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-ABE fragment was amplified using primer pair HindIII-F (with HindIII restriction site) and EcoRI (with EcoRI restriction site). The PCR product was digested with HindIII and EcoRI, and then inserted into the pJIT163-GFP vector (the vector sequence is shown in SEQ ID NO: 16) digested with these two enzymes to generate the fusion expression vector dCPF1-ABE.

PBE、XTEN、dCPF1序列针对植物进行密码子优化并订购自GenScript (Nanjing)。使用引物对HindIII-F(具有HindIII限制位点)和EcoRI(具有 EcoRI限制位点)扩增全长dCPF1-PBE片段。PCR产物用HindIII和EcoRI 消化,然后插入至这两种酶消化的pJIT163-GFP载体(该载体序列示于SEQ ID NO:16)以产生融合表达载体dCPF1-PBE。PBE, XTEN, dCPF1 sequences were codon optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-PBE fragment was amplified using primer pair HindIII-F (with HindIII restriction site) and EcoRI (with EcoRI restriction site). The PCR product was digested with HindIII and EcoRI and inserted into the pJIT163-GFP vector (the vector sequence is shown in SEQ ID NO: 16) digested with these two enzymes to generate the fusion expression vector dCPF1-PBE.

构建sgRNA表达载体Construction of sgRNA expression vector

根据之前描述(Wang,Y.et al.Simultaneous editing of three homoeoallelesin hexaploid bread wheat confers heritable resistance to powdery mildew.Nat.Biotechnol.32,947-951,2014;Shan,Q.et al.Targeted genome modification of cropplants using a CRISPR-Cas system.Nat.Biotechnol.31,686-688,2013;和 Liang,Z.etal.Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system.JGenet Genomics.41,63-68,2014)基于 pTaU6-sgRNA(Addgene ID53062)或pOsU3-sgRNA(Addgene ID53063)或 pZmU3-sgRNA(Addgene ID5306)或OsU3/TaU6-tRNA-sgRNA(Zhanget al. 2017.Genome Biology.DOI:10.1186/s13059-017-1325-9)构建sgRNA表达载体。此外,还有通过II型启动子来启动锤头酶和crRNA来产生crRNA(Tang et al.Nature plant,doi:10.1038/nplants.2017.18)According to the previous description (Wang, Y. et al. Simultaneous editing of three homoeoallelesin hexaploid bread wheat confers heritable resistance to powdery mildew. Nat. Biotechnol. 32, 947-951, 2014; Shan, Q. et al. Targeted genome modification of cropplants using a CRISPR-Cas system. Nat. Biotechnol. 31, 686-688, 2013; and Liang, Z. et al. Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system. JGenet Genomics. 41, 63-68, 2014 ) based on pTaU6-sgRNA (Addgene ID53062) or pOsU3-sgRNA (Addgene ID53063) or pZmU3-sgRNA (Addgene ID5306) or OsU3/TaU6-tRNA-sgRNA (Zhang et al. 2017. Genome Biology. DOI: 10.1186/s13059-017- 1325-9) Construction of sgRNA expression vector. In addition, there is a type II promoter to initiate the hammerhead enzyme and crRNA to produce crRNA (Tang et al. Nature plant, doi: 10.1038/nplants.2017.18)

pUbi-mGFPP-crRNA、pUbi-DEP1-sgRNA、pUbi-DEP1-crRNA、pUbi -DME-crRNA.pUbi-mGFPP-crRNA, pUbi-DEP1-sgRNA, pUbi-DEP1-crRNA, pUbi-DME-crRNA.

BFP和GFP表达载体BFP and GFP expression vectors

pUbi-mGFP,该载体序列示于SEQ ID NO:17。pUbi-mGFP, the vector sequence is shown in SEQ ID NO:17.

原生质体测定Protoplast assay

在本研究中使用小麦Bobwhite品种、水稻日本晴品种。如下所述进行原生质体转化。平均转化效率为55-70%。每种质粒用10μg通过PEG介导方法进行转化,48小时后,收集原生质体,提取DNA用于T7EI和PCR-RE 测定。The wheat variety Bobwhite and the rice Nipponbare variety were used in this study. Protoplast transformation was performed as described below. The average conversion efficiency was 55-70%. 10 μg of each plasmid was used for transformation by PEG-mediated methods, and after 48 hours, protoplasts were collected and DNA was extracted for T7EI and PCR-RE assays.

小麦原生质体制备及转化Preparation and transformation of wheat protoplasts

1)取小麦幼嫩的叶片,将其中间部分切成0.5-1mm的丝,放入0.6M 的Mannitol溶液中避光处理10分钟,再用滤网过滤,将其放入50ml酶液中20-25℃避光,10rmp缓慢摇晃消化5小时。1) Take the young leaves of wheat, cut the middle part into 0.5-1mm filaments, put them into 0.6M Mannitol solution for 10 minutes in the dark, filter them with a filter, and put them into 50ml of enzyme solution for 20 minutes. -25°C protected from light, 10rmp shaking slowly for 5 hours.

2)加10ml W5稀释酶解产物,用75μm尼龙滤膜过滤酶解液于圆底离心管中(50ml)。2) Add 10 ml of W5 to dilute the enzymatic hydrolysis product, and filter the enzymatic hydrolysis solution with a 75 μm nylon filter into a round-bottom centrifuge tube (50 ml).

3)23℃,100g,离心3min,弃上清。3) 23°C, 100g, centrifuge for 3min, discard the supernatant.

4)用W510ml轻轻悬起,冰上放置30min使原生质体逐渐沉降,弃上清。4) Gently suspend with W510ml, place on ice for 30min to gradually settle the protoplasts, discard the supernatant.

5)加适量MMG悬浮,至于冰上,待转化。5) Add an appropriate amount of MMG to suspend, and keep it on ice for transformation.

6)2ml离心管中加10-20μg质粒,200μl原生质体(大约4×105细胞), 220μl新配的PEG溶液,混匀,室温避光放置10-20分钟诱导转化。6) Add 10-20 μg plasmid, 200 μl protoplasts (about 4×10 5 cells) and 220 μl freshly prepared PEG solution to a 2 ml centrifuge tube, mix well, and place at room temperature in the dark for 10-20 minutes to induce transformation.

7)诱导转化结束后缓慢加880μl W5溶液,轻轻颠倒混匀,100g水平离心3min,吸弃上清。7) After the induction and transformation, slowly add 880 μl of W5 solution, gently invert and mix, centrifuge horizontally at 100 g for 3 min, and aspirate and discard the supernatant.

8)加2ml W5溶液重悬,转移到六孔板中,室温(或25℃)暗处培养。若用于提取原生质体基因组DNA,需培养48h。8) Add 2ml of W5 solution to resuspend, transfer to a six-well plate, and cultivate at room temperature (or 25°C) in the dark. If it is used to extract protoplast genomic DNA, it needs to be cultured for 48h.

水稻原生质体制备及转化:Preparation and transformation of rice protoplasts:

1)选取幼苗叶鞘部分分离原生质体,用锋利刀片切成大约0.5mm宽。1) Select the leaf sheath part of the seedling to separate the protoplast, and cut it into a width of about 0.5mm with a sharp blade.

2)切开后立刻转移到0.6M Mannitol溶液中,避光放置10min。2) Immediately after incision, transfer to 0.6M Mannitol solution and place in the dark for 10min.

3)过滤掉Mannitol溶液,转移到酶解液中,避光抽真空30min。3) Filter out the Mannitol solution, transfer it to the enzymatic hydrolysis solution, and vacuumize for 30 minutes in the dark.

4)避光酶解5-6h,同时缓慢摇动(脱色摇床,速度10)。4) Enzyme hydrolysis in the dark for 5-6 hours, while shaking slowly (decolorizing shaker, speed 10).

5)酶解结束后,加入等体积的W5,水平摇动10sec,释放原生质体。5) After the enzymatic hydrolysis, add an equal volume of W5 and shake it horizontally for 10 sec to release the protoplasts.

6)使用40μm尼龙膜过滤原生质体到50ml圆底离心管,再加W5溶液冲洗。6) Use a 40 μm nylon membrane to filter the protoplasts into a 50 ml round-bottom centrifuge tube, and then rinse with W5 solution.

7)250g水平离心3min沉淀原生质体,吸弃上清。7) Horizontal centrifugation at 250 g for 3 min to precipitate protoplasts, and the supernatant was discarded.

8)加10ml W5重悬原生质体,250g离心3min,弃上清。8) Add 10ml of W5 to resuspend the protoplasts, centrifuge at 250g for 3min, and discard the supernatant.

9)加适量MMG溶液重悬原生质体浓度为2×106/ml。9) Add an appropriate amount of MMG solution to resuspend the protoplast with a concentration of 2×10 6 /ml.

注:以上所有步骤在室温进行。NOTE: All the above steps are performed at room temperature.

10)2ml离心管中加10-20μg质粒,200μl原生质体(大约4×105细胞), 220μl新配的PEG溶液,混匀,室温避光放置10-20分钟诱导转化。10) Add 10-20 μg plasmid, 200 μl protoplasts (about 4×10 5 cells) and 220 μl freshly prepared PEG solution to a 2 ml centrifuge tube, mix well, and place at room temperature in the dark for 10-20 minutes to induce transformation.

11)诱导转化结束后缓慢加880μl W5溶液,轻轻颠倒混匀,250g水平离心3min,吸弃上清。11) After the induction and transformation, slowly add 880 μl of W5 solution, gently invert and mix, centrifuge horizontally at 250 g for 3 min, and aspirate and discard the supernatant.

12)加2ml WI溶液重悬,转移到六孔板中,室温(或25℃)暗处培养,若用于提取原生质体基因组DNA,需培养48h。12) Add 2 ml of WI solution to resuspend, transfer to a six-well plate, and culture at room temperature (or 25°C) in the dark. If it is used to extract protoplast genomic DNA, it needs to be cultured for 48 hours.

PCR/RE检测:PCR/RE detection:

1)提取植物基因组DNA。1) Extracting plant genomic DNA.

2)合成基因特异引物,扩增含有靶位点的片段,长度为350-1000bp 之间:2) Synthesize gene-specific primers to amplify fragments containing the target site, with a length between 350-1000bp:

10×EasyTaq Buffer10×EasyTaq Buffer 5μl5μl dNTP(2.5mM)dNTP (2.5mM) 4μl4μl 正向引物(10μM)Forward primer (10μM) 2μl2μl 正向引物(10μM)Forward primer (10μM) 2μl2μl Easy TaqEasy Taq 0.5μl0.5μl DNADNA 2μl2μl ddH<sub>2</sub>OddH<sub>2</sub>O 至50μl to 50μl

3)一般反应条件是:94℃变性5min;94℃变性30s,58℃复性30s,72℃延伸30s,扩增30至35个循环;72℃保温5min;12℃保温。取5μl PCR 产物电泳检测。3) The general reaction conditions are: denaturation at 94°C for 5 minutes; denaturation at 94°C for 30s, renaturation at 58°C for 30s, extension at 72°C for 30s, and amplification for 30 to 35 cycles; incubation at 72°C for 5 minutes; incubation at 12°C. Take 5μl PCR product for electrophoresis detection.

4)限制性内切酶酶切PCR产物,一般的酶切体系如下:4) Restriction endonuclease digestion of PCR products, the general digestion system is as follows:

10×Fastdigest Buffer10×Fastdigest Buffer 2ul2ul 限制性内切酶restriction endonuclease 1 μl1 μl PCR产物PCR product 3-5μl3-5μl ddH<sub>2</sub>OddH<sub>2</sub>O 至20μl to 20μl

5)37℃,酶切2-3h。1.2%琼脂糖凝胶电泳检测。5) 37℃, digested with enzyme for 2-3h. 1.2% agarose gel electrophoresis detection.

6)回收纯化PCR产物中未切开的突变条带,进行TA克隆。反应体系如下:6) Recover the uncut mutant band in the purified PCR product, and perform TA cloning. The reaction system is as follows:

pEasy-T VectorpEasy-T Vector 1μl1μl 回收的未切开的PCR产物Recovered uncut PCR product 4μl 4μl

7)22℃连接10min,转化E.coli感受态细胞,涂LB固体平板(Amp100、 IPTG和X-gal),培养12-16h,挑选白色菌落鉴定阳性克隆,送测序。7) Connect at 22°C for 10min, transform E.coli competent cells, coat on LB solid plate (Amp100, IPTG and X-gal), cultivate for 12-16h, select white colonies to identify positive clones, and send for sequencing.

深度测序deep sequencing

不同的sgRNA表达载体分别与Ubi-CPF1-PBE/ABE表达载体转化至小麦、水稻质体48小时后,收集原生质体,提取DNA进行深度测序。在第一轮PCR中,靶区域使用位点特异性引物进行扩增。在二轮PCR中,将正向和反向标签添加至PCR产物末端进行文库构建。合并等量不同PCR产物。样品然后在Beijing Genomics Institute使用Illumina High-Seq 4000测序。Different sgRNA expression vectors and Ubi-CPF1-PBE/ABE expression vectors were transformed into wheat and rice plastids for 48 hours, and the protoplasts were collected, and DNA was extracted for deep sequencing. In the first round of PCR, the target region is amplified using site-specific primers. In two rounds of PCR, forward and reverse tags are added to the ends of PCR products for library construction. Equal amounts of different PCR products were pooled. Samples were then sequenced using an Illumina High-Seq 4000 at Beijing Genomics Institute.

实施例Example

实施例1.CPF1介导的植物基因组切割活性的优化。Example 1. Optimization of CPF1-mediated plant genome cleavage activity.

CPF1在植物细胞中的编辑活性在不同文章中的差别较大,并且不同类型的CPF1之间的切割活性差别也很大。The editing activity of CPF1 in plant cells varies widely from article to article, and the cleavage activity of CPF1 varies greatly among different types of CPF1.

本实施例通过对AsCPF1、FnCPF1以及LbCPF1的入核状态进行优化,同时也对crRNA的启动子进行优化,来提高CPF1在植物细胞里的切割活性。构建了AsCPF1、FnCPF1和LbCPF1的1-4个NLS的载体,并且构建了U3/U6以及UBI启动的通过核酶来产生crRNA的不同载体(如图1)。通过PCR/RE的结果可以看出,2个NLS的三种CPF1都可以工作,而且 LbCPF1效率较高(SEQ ID NO:5-7分别为ASCPF1-2NLS、FNCPF1-2NLS 和LBCPF1-2NLS的编码序列,可容易获得对应的氨基酸序列)。针对OsPDS 这个基因的靶位点,可以看出2NLS-LbCPF1的效率要高于NLS-LbCPF1,高于其他一些报道的构建体。In this example, the cleavage activity of CPF1 in plant cells is improved by optimizing the nuclear state of AsCPF1, FnCPF1 and LbCPF1, and also optimizing the promoter of crRNA. 1-4 NLS vectors of AsCPF1, FnCPF1 and LbCPF1 were constructed, and U3/U6 and UBI-promoted different vectors for ribozyme-generated crRNA were constructed (Fig. 1). It can be seen from the results of PCR/RE that the three CPF1s of the two NLSs can work, and the efficiency of LbCPF1 is higher (SEQ ID NOs: 5-7 are the coding sequences of ASCPF1-2NLS, FNCPF1-2NLS and LBCPF1-2NLS, respectively. , the corresponding amino acid sequence can be easily obtained). For the target site of the OsPDS gene, it can be seen that the efficiency of 2NLS-LbCPF1 is higher than that of NLS-LbCPF1 and higher than some other reported constructs.

实施例2.CPF1介导的植物基因组C到T的突变(CPF1-PBE)Example 2. CPF1-mediated mutation of plant genomes C to T (CPF1-PBE)

参考CPF1在植物细胞切割活性的特点,构建了以下dCPF1-PBE系统: dAsCPF 1-2NLS-PBE、dFnCPF1-2NLS-PBE、dLbCPF1-2NLS-PBE。其中C 端的NLS分别放在了UGI的一端和分别放在了UGI的两端。crRNA用UBI1 来启动并用核酶切割。利用PCR/RE检测结果表明,dFnCPf1和dLbCPF1 检测出了编辑活性,并且NLS在UGI的一端的活性较高(SEQ ID NO:8,9分别示出dFNCPF1-PBE-2NLS和dLbCPF1-2NLS-PBE的编码序列,可容易获得对应的氨基酸序列)。此外,还构建了加强版的dCPF1-PBE2-X,即在用ZmUbi-1启动子后面加入了一个内含子来增加dCPF1-PBE的表达(SEQ ID NO:10示出了包含ZmUbi-1启动子和内含子序列的dLBCPF1-PBE-2NLS 表达盒)。Referring to the cleavage activity of CPF1 in plant cells, the following dCPF1-PBE systems were constructed: dAsCPF1-2NLS-PBE, dFnCPF1-2NLS-PBE, dLbCPF1-2NLS-PBE. The NLS at the C end is placed at one end of the UGI and at both ends of the UGI respectively. crRNA was initiated with UBI1 and cleaved with ribozymes. The results of PCR/RE assay showed that dFnCPf1 and dLbCPF1 detected editing activity, and NLS had higher activity at one end of UGI (SEQ ID NOs: 8, 9 show the dFNCPF1-PBE-2NLS and dLbCPF1-2NLS-PBE, respectively coding sequence, the corresponding amino acid sequence is readily available). In addition, an enhanced version of dCPF1-PBE2-X was constructed by adding an intron after the ZmUbi-1 promoter to increase the expression of dCPF1-PBE (SEQ ID NO: 10 shows the inclusion of the ZmUbi-1 promoter dLBCPF1-PBE-2NLS expression cassette of intron and intron sequences).

实施例3.CPF1介导的植物基因组A到G的突变(CPF1-ABE)Example 3. CPF1-mediated mutation of plant genomes A to G (CPF1-ABE)

构建了以下CPF1-ABE系统:dAsCPF1-1NLS-ABE、 dFnCPF1-NLS-ABE、dLbCPF1-1NLS-ABE、以及dAsCPF1-2NLS-ABE、 dFnCPF1-2NLS-ABE、dLbCPF1-2NLS-ABE,其中ABE又包括ABE7.9和 ABE7.10两个版本。crRNA用UBI1来启动并用核酶切割。The following CPF1-ABE systems were constructed: dAsCPF1-1NLS-ABE, dFnCPF1-NLS-ABE, dLbCPF1-1NLS-ABE, and dAsCPF1-2NLS-ABE, dFnCPF1-2NLS-ABE, dLbCPF1-2NLS-ABE, where ABEs in turn included ABE7 .9 and ABE7.10 two versions. crRNA was initiated with UBI1 and cleaved with ribozymes.

利用图3E的GFP碱基编辑报告系统的结果表明:dFnCPF1-ABE7.10 (SEQ ID NO:11)和dLbCPF1-ABE7.9和dLbCPF1-ABE7.10(SEQ ID NO:12) 都可以工作,而7.10要高于7.9(图3F)。The results using the GFP base editing reporter system in Figure 3E show that dFnCPF1-ABE7.10 (SEQ ID NO: 11) and dLbCPF1-ABE7.9 and dLbCPF1-ABE7.10 (SEQ ID NO: 12) all work, while 7.10 is higher than 7.9 (Fig. 3F).

利用PCR/RE检测结果表明,dLbCPF1-ABE7.10检测出了活性,并且 2NLS要高于1NLS。此外,还构建了两个加强版的dCPF1-ABE2,即在UBI1 启动子后面加入了一个内含子来增加dCPF1-ABE的表达(dCPF1-ABE2-X1)(SEQ ID NO.13),还有将ABE构建在了CPF1的C端(dCPF1-ABE2-X2/X3)(SEQ ID NO.14、15),利用图3E的GFP碱基编辑报告系统的结果:dCPF1-ABE2-X2/X3编辑活性要高于 dLbCPF1-ABE7.10(图3G)。The results of PCR/RE assay showed that dLbCPF1-ABE7.10 detected the activity, and the 2NLS was higher than the 1NLS. In addition, two enhanced versions of dCPF1-ABE2 were constructed by adding an intron after the UBI1 promoter to increase the expression of dCPF1-ABE (dCPF1-ABE2-X1) (SEQ ID NO. 13), and ABE was constructed at the C-terminus of CPF1 (dCPF1-ABE2-X2/X3) (SEQ ID NO. 14, 15), and the results of using the GFP base editing reporter system in Figure 3E: dCPF1-ABE2-X2/X3 editing activity higher than dLbCPF1-ABE7.10 (Fig. 3G).

实施例4.CPF1介导的基因编辑优化Example 4. Optimization of CPF1-mediated gene editing

为了继续提高CPF1的编辑效率,我们继续对CPF1系统进行优化,首先对CPF1介导的编辑的所有表达载体采用BdUbi10启动子进行启动,以便增加其表达量。此外,我们还对crRNA的启动采用II型启动子启动,并将crRNAArray放入表达基因的5’UTR或3’UTR区域,来通过提高mRNA 的表达来提高CPF1的编辑效率。In order to continue to improve the editing efficiency of CPF1, we continued to optimize the CPF1 system. First, all expression vectors for CPF1-mediated editing were initiated with the BdUbi10 promoter to increase their expression levels. In addition, we also used type II promoter for the initiation of crRNA, and put the crRNAArray into the 5'UTR or 3'UTR region of the expressed gene to improve the editing efficiency of CPF1 by increasing the expression of mRNA.

相关序列描述:Related sequence description:

SEQ ID NO.1胞苷脱氨酶氨基酸序列SEQ ID NO.1 amino acid sequence of cytidine deaminase

SEQ ID NO.2尿嘧啶DNA糖基化酶抑制剂(UGI)氨基酸序列SEQ ID NO.2 amino acid sequence of uracil DNA glycosylase inhibitor (UGI)

SEQ ID NO.3野生型ecTadA氨基酸序列SEQ ID NO.3 Wild-type ecTadA amino acid sequence

SEQ ID NO.4 ecTadA衍生的DNA依赖型腺嘌呤脱氨酶(ABE版本7.10)氨基酸序列SEQ ID NO.4 ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) amino acid sequence

SEQ ID NO.5 ASCPF1-2NLS编码序列SEQ ID NO.5 ASCPF1-2NLS coding sequence

SEQ ID NO.6 FNCPF1-2NLS编码序列SEQ ID NO.6 FNCPF1-2NLS coding sequence

SEQ ID NO.7 LBCPF1-2NLS编码序列SEQ ID NO.7 LBCPF1-2NLS coding sequence

SEQ ID NO.8 dFNCPF1-PBE-2NLS编码序列SEQ ID NO.8 dFNCPF1-PBE-2NLS coding sequence

SEQ ID NO.9 dLBCPF1-PBE-2NLS编码序列SEQ ID NO.9 dLBCPF1-PBE-2NLS coding sequence

SEQ ID NO.10启动子+内含子+dLBCPF1-PBE-2NLS编码序列SEQ ID NO.10 promoter+intron+dLBCPF1-PBE-2NLS coding sequence

SEQ ID NO.11 dFNCPF1-ABE7.10-2NLS编码序列SEQ ID NO.11 dFNCPF1-ABE7.10-2NLS coding sequence

SEQ ID NO.12 dLBCPF1-ABE7.10-2NLS编码序列SEQ ID NO.12 dLBCPF1-ABE7.10-2NLS coding sequence

SEQ ID NO.13启动子+内含子+dLBCPF1-ABE2-x编码序列SEQ ID NO.13 promoter+intron+dLBCPF1-ABE2-x coding sequence

SEQ ID NO.14 LBCPF1-ABE2-x2编码序列SEQ ID NO. 14 LBCPF1-ABE2-x2 coding sequence

SEQ ID NO.15 LBCPF1-ABE2-x3编码序列SEQ ID NO.15 LBCPF1-ABE2-x3 coding sequence

SEQ ID NO.16 PJIT163-GFPSEQ ID NO. 16 PJIT163-GFP

SEQ ID NO:17 pBUI-mGFPSEQ ID NO: 17 pBUI-mGFP

SEQ ID NO:18 ASCPF1氨基酸序列SEQ ID NO: 18 ASCPF1 amino acid sequence

SEQ ID NO:19 FNCPF1氨基酸序列SEQ ID NO: 19 FNCPF1 amino acid sequence

SEQ ID NO:20 LBCPF1氨基酸序列SEQ ID NO: 20 LBCPF1 amino acid sequence

SEQ ID NO:21 ASCPF1-2NLS氨基酸序列SEQ ID NO: 21 ASCPF1-2NLS amino acid sequence

SEQ ID NO:22 FNCPF1-2NLS氨基酸序列SEQ ID NO: 22 FNCPF1-2NLS amino acid sequence

SEQ ID NO:23 LBCPF1-2NLS氨基酸序列SEQ ID NO: 23 LBCPF1-2NLS amino acid sequence

SEQ ID NO:24 dFNCPF1-PBE-2NLS氨基酸序列SEQ ID NO: 24 dFNCPF1-PBE-2NLS amino acid sequence

SEQ ID NO:25 dLBCPF1-PBE-2NLS氨基酸序列SEQ ID NO: 25 dLBCPF1-PBE-2NLS amino acid sequence

SEQ ID NO:26 dFNCPF1-ABE7.10-2NLS氨基酸序列SEQ ID NO: 26 dFNCPF1-ABE7.10-2NLS amino acid sequence

SEQ ID NO:27 dLBCPF1-ABE7.10-2NLS氨基酸序列SEQ ID NO: 27 dLBCPF1-ABE7.10-2NLS amino acid sequence

SEQ ID NO:28 LBCPF1-ABE2-x2氨基酸序列SEQ ID NO: 28 LBCPF1-ABE2-x2 amino acid sequence

SEQ ID NO:29 LBCPF1-ABE2-x3氨基酸序列SEQ ID NO: 29 LBCPF1-ABE2-x3 amino acid sequence

SEQ ID NO:30启动子+内含子核苷酸序列。SEQ ID NO: 30 Promoter+intron nucleotide sequence.

序列表sequence listing

<110> 中国科学院遗传与发育生物学研究所<110> Institute of Genetics and Developmental Biology, Chinese Academy of Sciences

<120> 基于CPF1蛋白的碱基编辑系统和方法<120> Base editing system and method based on CPF1 protein

<130> 149132<130> 149132

<150> 201711403490.X<150> 201711403490.X

<151> 2017-12-22<151> 2017-12-22

<160> 30<160> 30

<170> PatentIn version 3.5<170> PatentIn version 3.5

<210> 1<210> 1

<211> 227<211> 227

<212> PRT<212> PRT

<213> Rattus norvegicus<213> Rattus norvegicus

<400> 1<400> 1

Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg ArgSer Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg Arg

1 5 10 151 5 10 15

Ile Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu ArgIle Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu Arg

20 25 30 20 25 30

Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly Gly Arg His SerLys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly Gly Arg His Ser

35 40 45 35 40 45

Ile Trp Arg His Thr Ser Gln Asn Thr Asn Lys His Val Glu Val AsnIle Trp Arg His Thr Ser Gln Asn Thr Asn Lys His Val Glu Val Asn

50 55 60 50 55 60

Phe Ile Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr ArgPhe Ile Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr Arg

65 70 75 8065 70 75 80

Cys Ser Ile Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys SerCys Ser Ile Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys Ser

85 90 95 85 90 95

Arg Ala Ile Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu PheArg Ala Ile Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu Phe

100 105 110 100 105 110

Ile Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg GlnIle Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg Gln

115 120 125 115 120 125

Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met ThrGly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met Thr

130 135 140 130 135 140

Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser ProGlu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser Pro

145 150 155 160145 150 155 160

Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg LeuSer Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg Leu

165 170 175 165 170 175

Tyr Val Leu Glu Leu Tyr Cys Ile Ile Leu Gly Leu Pro Pro Cys LeuTyr Val Leu Glu Leu Tyr Cys Ile Ile Leu Gly Leu Pro Pro Cys Leu

180 185 190 180 185 190

Asn Ile Leu Arg Arg Lys Gln Pro Gln Leu Thr Phe Phe Thr Ile AlaAsn Ile Leu Arg Arg Lys Gln Pro Gln Leu Thr Phe Phe Thr Ile Ala

195 200 205 195 200 205

Leu Gln Ser Cys His Tyr Gln Arg Leu Pro Pro His Ile Leu Trp AlaLeu Gln Ser Cys His Tyr Gln Arg Leu Pro Pro His Ile Leu Trp Ala

210 215 220 210 215 220

Thr Gly LeuThr Gly Leu

225225

<210> 2<210> 2

<211> 87<211> 87

<212> PRT<212> PRT

<213> Bacillus phage<213> Bacillus phage

<400> 2<400> 2

Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu ValThr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val

1 5 10 151 5 10 15

Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val IleIle Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile

20 25 30 20 25 30

Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp GluGly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu

35 40 45 35 40 45

Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu TyrSer Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr

50 55 60 50 55 60

Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys IleLys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile

65 70 75 8065 70 75 80

Lys Met Leu Ser Gly Gly SerLys Met Leu Ser Gly Gly Ser

85 85

<210> 3<210> 3

<211> 167<211> 167

<212> PRT<212> PRT

<213> Escherichia coli<213> Escherichia coli

<400> 3<400> 3

Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala LeuMet Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu

1 5 10 151 5 10 15

Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly AlaThr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala

20 25 30 20 25 30

Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg ProVal Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro

35 40 45 35 40 45

Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu ArgIle Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg

50 55 60 50 55 60

Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr LeuGln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu

65 70 75 8065 70 75 80

Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile HisTyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His

85 90 95 85 90 95

Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr GlySer Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly

100 105 110 100 105 110

Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn HisAla Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His

115 120 125 115 120 125

Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala LeuArg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu

130 135 140 130 135 140

Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln LysLeu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys

145 150 155 160145 150 155 160

Lys Ala Gln Ser Ser Thr AspLys Ala Gln Ser Ser Thr Asp

165 165

<210> 4<210> 4

<211> 167<211> 167

<212> PRT<212> PRT

<213> Escherichia coli<213> Escherichia coli

<400> 4<400> 4

Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala LeuMet Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu

1 5 10 151 5 10 15

Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly AlaThr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala

20 25 30 20 25 30

Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg AlaVal Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala

35 40 45 35 40 45

Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu ArgIle Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg

50 55 60 50 55 60

Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr LeuGln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu

65 70 75 8065 70 75 80

Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile HisTyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His

85 90 95 85 90 95

Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr GlySer Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr Gly

100 105 110 100 105 110

Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn HisAla Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn His

115 120 125 115 120 125

Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala LeuArg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu

130 135 140 130 135 140

Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln LysLeu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys

145 150 155 160145 150 155 160

Lys Ala Gln Ser Ser Thr AspLys Ala Gln Ser Ser Thr Asp

165 165

<210> 5<210> 5

<211> 3990<211> 3990

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> ASCPF1-2NLS<223> ASCPF1-2NLS

<400> 5<400> 5

atgacgcagt tcgaggggtt caccaacctc taccaggtca gcaagacgct gcggttcgag 60atgacgcagt tcgaggggtt caccaacctc taccaggtca gcaagacgct gcggttcgag 60

ctgattccgc agggcaagac cctcaagcac atccaggagc agggctttat cgaggaggac 120ctgattccgc agggcaagac cctcaagcac atccaggagc agggctttat cgaggaggac 120

aaagcgcgga acgaccacta caaggagctc aagccgatca tcgaccggat ctacaagacc 180aaagcgcgga acgaccacta caaggagctc aagccgatca tcgaccggat ctacaagacc 180

tacgcggacc agtgcctgca gctcgtgcag ctcgactggg agaacctctc ggccgccatc 240tacgcggacc agtgcctgca gctcgtgcag ctcgactggg agaacctctc ggccgccatc 240

gactcctacc gcaaggagaa gacggaggag acacgcaacg cgctcatcga agaacaggct 300gactcctacc gcaaggagaa gacggaggag acacgcaacg cgctcatcga agaacaggct 300

acgtatcgca acgctatcca cgactacttc atcgggcgca cagataatct caccgacgcc 360acgtatcgca acgctatcca cgactacttc atcgggcgca cagataatct caccgacgcc 360

atcaacaagc gccacgccga aatctataag gggctgttta aggccgagct gttcaatggc 420atcaacaagc gccacgccga aatctataag gggctgttta aggccgagct gttcaatggc 420

aaagtgctga agcaactggg caccgtcacg accacagagc atgagaacgc cctcctccgg 480aaagtgctga agcaactggg caccgtcacg accacagagc atgagaacgc cctcctccgg 480

tcgttcgaca agtttacgac atactttagc ggcttttacg agaaccgcaa aaacgtgttc 540tcgttcgaca agtttacgac atactttagc ggcttttacg agaaccgcaa aaacgtgttc 540

agcgccgagg acattagcac cgccatcccg cataggatcg tgcaagacaa cttcccgaag 600agcgccgagg acattagcac cgccatcccg cataggatcg tgcaagacaa cttcccgaag 600

ttcaaggaga actgccacat cttcacccgc ctcatcacgg ccgtgccgtc cctccgcgag 660ttcaaggaga actgccacat cttcacccgc ctcatcacgg ccgtgccgtc cctccgcgag 660

cacttcgaga acgtgaagaa ggccatcggg atcttcgtgt ccacctcgat cgaagaggtg 720cacttcgaga acgtgaagaa ggccatcggg atcttcgtgt ccacctcgat cgaagaggtg 720

ttctccttcc ccttctacaa tcagctgctg acgcagaccc agatcgacct ctacaaccag 780ttctccttcc ccttctacaa tcagctgctg acgcagaccc agatcgacct ctacaaccag 780

ctcctcggcg gcatctcccg cgaggccggc accgagaaga tcaagggcct gaacgaggtg 840ctcctcggcg gcatctcccg cgaggccggc accgagaaga tcaagggcct gaacgaggtg 840

ctcaatctcg ccatccagaa gaacgacgaa accgcgcaca tcattgcctc actcccacat 900ctcaatctcg ccatccagaa gaacgacgaa accgcgcaca tcattgcctc actcccacat 900

aggtttatcc ccctgtttaa gcagatcctc tccgaccgca acacgttgtc cttcatcctc 960aggtttatcc ccctgtttaa gcagatcctc tccgaccgca acacgttgtc cttcatcctc 960

gaggagttca agagcgacga ggaggtcatc cagtccttct gcaagtacaa gaccctcctc 1020gaggagttca agagcgacga ggaggtcatc cagtccttct gcaagtacaa gaccctcctc 1020

cgcaacgaga atgtgctcga aaccgccgag gcgctgttca atgagctcaa tagcatcgat 1080cgcaacgaga atgtgctcga aaccgccgag gcgctgttca atgagctcaa tagcatcgat 1080

ctcacccaca tcttcatctc ccataagaag ctggaaacca tctccagcgc cctgtgcgac 1140ctcacccaca tcttcatctc ccataagaag ctggaaacca tctccagcgc cctgtgcgac 1140

cactgggaca ccctccgcaa cgccctctac gagcggcgca tcagcgagct caccggcaag 1200cactgggaca ccctccgcaa cgccctctac gagcggcgca tcagcgagct caccggcaag 1200

atcacgaagt cggcgaaaga gaaagtgcaa aggagcctca agcacgagga cattaacctg 1260atcacgaagt cggcgaaaga gaaagtgcaa aggagcctca agcacgagga cattaacctg 1260

caggagatca tctccgccgc gggcaaggag ctgtccgagg ctttcaagca gaagacctcg 1320caggagatca tctccgccgc gggcaaggag ctgtccgagg ctttcaagca gaagacctcg 1320

gagatcctct ctcacgccca cgcggccctg gaccagccgc tgccgacgac cctgaaaaag 1380gagatcctct ctcacgccca cgcggccctg gaccagccgc tgccgacgac cctgaaaaag 1380

caagaggaaa aggaaatcct caagtcccag ctggacagcc tcctggggct gtaccacctt 1440caagaggaaa aggaaatcct caagtcccag ctggacagcc tcctggggct gtaccacctt 1440

ctcgactggt tcgccgtgga cgagtccaac gaggtcgatc cggagtttag cgcccgcctc 1500ctcgactggt tcgccgtgga cgagtccaac gaggtcgatc cggagtttag cgcccgcctc 1500

accgggatca agcttgagat ggaacctagc ttgagcttct ataataaggc gcgcaattac 1560accgggatca agcttgagat ggaacctagc ttgagcttct ataataaggc gcgcaattac 1560

gcgaccaaga agccgtattc cgtggagaag ttcaagctga acttccaaat gcccaccctg 1620gcgaccaaga agccgtattc cgtggagaag ttcaagctga acttccaaat gcccaccctg 1620

gccagcgggt gggacgttaa caaagagaag aacaacggag ccattctttt cgtgaaaaat 1680gccagcgggt gggacgttaa caaagagaag aacaacggag ccattctttt cgtgaaaaat 1680

gggttgtatt atttgggaat tatgccgaaa caaaaaggca ggtacaaggc gctcagcttc 1740gggttgtatt atttgggaat tatgccgaaa caaaaaggca ggtacaaggc gctcagcttc 1740

gagccaactg agaagacctc cgagggcttc gacaagatgt actacgatta ttttcctgac 1800gagccaactg agaagacctc cgagggcttc gacaagatgt actacgatta ttttcctgac 1800

gctgcaaaga tgataccgaa gtgcagcact cagcttaagg cggtgacggc gcactttcag 1860gctgcaaaga tgataccgaa gtgcagcact cagcttaagg cggtgacggc gcactttcag 1860

acccatacca cccccatcct cctctccaac aacttcatcg agccgctcga gatcaccaag 1920acccatacca cccccatcct cctctccaac aacttcatcg agccgctcga gatcaccaag 1920

gagatatacg atctgaataa tccagaaaag gaacccaaga agttccagac cgcctacgcc 1980gagatatacg atctgaataa tccagaaaag gaacccaaga agttccagac cgcctacgcc 1980

aagaagacgg gcgatcaaaa ggggtataga gaggcgctct gcaagtggat cgacttcacg 2040aagaagacgg gcgatcaaaa ggggtataga gaggcgctct gcaagtggat cgacttcacg 2040

cgcgatttcc tcagcaagta caccaagaca acctccatcg atctctcttc cctccgcccc 2100cgcgatttcc tcagcaagta caccaagaca acctccatcg atctctcttc cctccgcccc 2100

tcttcccagt acaaggacct cggggagtac tacgccgaac tcaacccact cctgtatcac 2160tcttcccagt acaaggacct cggggagtac tacgccgaac tcaacccact cctgtatcac 2160

atctcgtttc agcgtatcgc ggaaaaggag atcatggacg ccgtcgaaac cggcaagttg 2220atctcgtttc agcgtatcgc ggaaaaggag atcatggacg ccgtcgaaac cggcaagttg 2220

tatcttttcc aaatctataa caaggacttc gcgaagggcc accacgggaa gccaaacctg 2280tatcttttcc aaatctataa caaggacttc gcgaagggcc accacgggaa gccaaacctg 2280

cacaccctct actggacagg cctcttcagc ccggaaaatc tcgcgaagac gagcataaag 2340cacaccctct actggacagg cctcttcagc ccggaaaatc tcgcgaagac gagcataaag 2340

ctgaacggcc aggcagaact cttctacagg ccgaagtcca ggatgaagcg catggctcat 2400ctgaacggcc aggcagaact cttctacagg ccgaagtcca ggatgaagcg catggctcat 2400

cgcctcggtg agaagatgct gaacaagaag ctgaaagatc aaaagacgcc aatccctgat 2460cgcctcggtg agaagatgct gaacaagaag ctgaaagatc aaaagacgcc aatccctgat 2460

acactgtatc aggagctgta cgattacgtg aaccaccgcc tctcacacga cctcagcgac 2520acactgtatc aggagctgta cgattacgtg aaccaccgcc tctcacacga cctcagcgac 2520

gaggcccgcg cgctcctgcc aaacgtcatc acgaaggagg tcagccacga gatcataaag 2580gaggcccgcg cgctcctgcc aaacgtcatc acgaaggagg tcagccacga gatcataaag 2580

gatcggcggt ttacctctga caagttcttt ttccatgtcc ccatcacgct gaactaccag 2640gatcggcggt ttacctctga caagttcttt ttccatgtcc ccatcacgct gaactaccag 2640

gccgcgaata gcccgtccaa gttcaaccag cgggtcaacg cgtatctcaa ggagcaccca 2700gccgcgaata gcccgtccaa gttcaaccag cgggtcaacg cgtatctcaa ggagcaccca 2700

gagacaccca taatcgggat tgaccggggg gagcggaacc tcatctacat caccgtcatc 2760gagacaccca taatcgggat tgaccggggg gagcggaacc tcatctacat caccgtcatc 2760

gactccaccg gaaagattct cgagcaacgg tcgctcaata ccatccagca gttcgactac 2820gactccaccg gaaagattct cgagcaacgg tcgctcaata ccatccagca gttcgactac 2820

cagaagaagc tcgacaaccg ggagaaggaa cgcgtcgccg cgaggcaggc ctggtccgta 2880cagaagaagc tcgacaaccg ggagaaggaa cgcgtcgccg cgaggcaggc ctggtccgta 2880

gtgggcacga tcaaagacct gaagcagggc tatctcagcc aggtcatcca tgagatagtg 2940gtgggcacga tcaaagacct gaagcagggc tatctcagcc aggtcatcca tgagatagtg 2940

gatctcatga tccactacca agccgtcgtg gtcctcgaga atctcaattt cggattcaaa 3000gatctcatga tccactacca agccgtcgtg gtcctcgaga atctcaattt cggattcaaa 3000

tccaagcgca caggcatcgc cgagaaggcg gtgtaccaac agttcgagaa aatgcttatc 3060tccaagcgca caggcatcgc cgagaaggcg gtgtaccaac agttcgagaa aatgcttatc 3060

gacaagctca attgcctggt gctcaaggac tatccggcgg agaaggtcgg gggggtcctc 3120gacaagctca attgcctggt gctcaaggac tatccggcgg agaaggtcgg gggggtcctc 3120

aatccgtatc agctgaccga ccagtttacg tcatttgcga agatgggcac ccagagcggc 3180aatccgtatc agctgaccga ccagtttacg tcatttgcga agatgggcac ccagagcggc 3180

ttccttttct atgtcccggc cccatatacc tcaaagattg atcccttgac cggattcgtg 3240ttccttttct atgtcccggc cccatatacc tcaaagattg atcccttgac cggattcgtg 3240

gacccgtttg tctggaagac catcaagaac catgagtcgc gtaagcattt cctggagggt 3300gacccgtttg tctggaagac catcaagaac catgagtcgc gtaagcattt cctggagggt 3300

ttcgacttcc tgcactatga tgtaaaaacc ggagacttca tcctgcattt caagatgaat 3360ttcgacttcc tgcactatga tgtaaaaacc ggagacttca tcctgcattt caagatgaat 3360

cggaacctct ccttccagcg gggactccct ggcttcatgc ccgcttggga tatcgttttt 3420cggaacctct ccttccagcg gggactccct ggcttcatgc ccgcttggga tatcgttttt 3420

gagaaaaatg aaacccaatt cgacgccaaa ggcacgcctt tcatcgcggg caagaggatt 3480gagaaaaatg aaacccaatt cgacgccaaa ggcacgcctt tcatcgcggg caagaggatt 3480

gtccctgtaa ttgagaacca tagattcacc gggcgttacc gtgacctgta ccccgcaaac 3540gtccctgtaa ttgagaacca tagattcacc gggcgttacc gtgacctgta ccccgcaaac 3540

gaactcatcg ccctcctgga ggagaaaggc atcgttttcc gcgacgggtc aaatatcctc 3600gaactcatcg ccctcctgga ggagaaaggc atcgttttcc gcgacgggtc aaatatcctc 3600

cccaaactgc tcgagaacga tgatagccac gctattgaca cgatggtagc gctcatcaga 3660cccaaactgc tcgagaacga tgatagccac gctattgaca cgatggtagc gctcatcaga 3660

tccgtgctgc aaatgagaaa ttcaaatgct gccactgggg aggattacat caactcccct 3720tccgtgctgc aaatgagaaa ttcaaatgct gccactgggg aggattacat caactcccct 3720

gtgcgtgatc tcaatggcgt gtgcttcgat tctagatttc agaatcctga gtggccgatg 3780gtgcgtgatc tcaatggcgt gtgcttcgat tctagatttc agaatcctga gtggccgatg 3780

gatgccgatg ctaacggcgc ataccacata gcattgaaag gacaactgtt gttgaaccat 3840gatgccgatg ctaacggcgc ataccacata gcattgaaag gacaactgtt gttgaaccat 3840

ctcaaggaga gcaaggacct taagctgcag aacggcatca gcaaccagga ttggcttgcc 3900ctcaaggaga gcaaggacct taagctgcag aacggcatca gcaaccagga ttggcttgcc 3900

tatatccaag agctccgcaa ttccggcggc agcccaaaga agaagaggaa ggtgagcggc 3960tatatccaag agctccgcaa ttccggcggc agcccaaaga agaagaggaa ggtgagcggc 3960

ggcagcccaa agaagaagcg caaggtctag 3990ggcagcccaa agaagaagcg caaggtctag 3990

<210> 6<210> 6

<211> 3969<211> 3969

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> FNCPF1-2NLS<223> FNCPF1-2NLS

<400> 6<400> 6

atgtccatct accaggagtt cgtcaataag tactcactct ctaagaccct gcggttcgag 60atgtccatct accaggagtt cgtcaataag tactcactct ctaagaccct gcggttcgag 60

ctgatcccgc agggcaagac actcgagaac atcaaggcgc gcggcctgat tctcgacgat 120ctgatcccgc agggcaagac actcgagaac atcaaggcgc gcggcctgat tctcgacgat 120

gagaagcggg ccaaggacta caagaaggcg aagcagatca ttgataagta ccaccagttc 180gagaagcggg ccaaggacta caagaaggcg aagcagatca ttgataagta ccaccagttc 180

ttcatcgagg agattctgtc cagcgtgtgc atctctgagg atctcctgca gaattactcc 240ttcatcgagg agattctgtc cagcgtgtgc atctctgagg atctcctgca gaattactcc 240

gacgtctact tcaagctcaa gaagtctgac gatgacaacc tgcagaagga tttcaagtcc 300gacgtctact tcaagctcaa gaagtctgac gatgacaacc tgcagaagga tttcaagtcc 300

gccaaggaca ccatcaagaa gcagatttct gagtacatca aggattccga gaagttcaag 360gccaaggaca ccatcaagaa gcagatttct gagtacatca aggattccga gaagttcaag 360

aatctcttca accagaatct gattgatgcg aagaagggcc aggagtctga cctgatcctc 420aatctcttca accagaatct gattgatgcg aagaagggcc aggagtctga cctgatcctc 420

tggctgaagc agtccaagga caatggcatt gagctgttca aggccaacag cgatatcacc 480tggctgaagc agtccaagga caatggcatt gagctgttca aggccaacag cgatatcacc 480

gatattgacg aggcgctgga gatcattaag tcattcaagg gctggaccac atacttcaag 540gatattgacg aggcgctgga gatcattaag tcattcaagg gctggaccac atacttcaag 540

ggcttccatg agaaccggaa gaatgtgtac tcatctaacg acattccgac ctccatcatc 600ggcttccatg agaaccggaa gaatgtgtac tcatctaacg acattccgac ctccatcatc 600

tacaggatcg tcgatgacaa tctgccaaag ttcctcgaga acaaggccaa gtacgagtcc 660tacaggatcg tcgatgacaa tctgccaaag ttcctcgaga acaaggccaa gtacgagtcc 660

ctcaaggaca aggccccgga ggcgattaat tacgagcaga tcaagaagga tctggcggag 720ctcaaggaca aggccccgga ggcgattaat tacgagcaga tcaagaagga tctggcggag 720

gagctgacct tcgatatcga ctacaagaca agcgaggtga accagagggt gttctccctc 780gagctgacct tcgatatcga ctacaagaca agcgaggtga accagagggt gttctccctc 780

gatgaggtgt tcgagatcgc caatttcaac aattacctga accagtccgg cattaccaag 840gatgaggtgt tcgagatcgc caatttcaac aattacctga accagtccgg cattaccaag 840

ttcaatacaa tcattggcgg caagttcgtc aacggcgaga ataccaagcg caagggcatt 900ttcaatacaa tcattggcgg caagttcgtc aacggcgaga ataccaagcg caagggcatt 900

aacgagtaca tcaatctcta ctcccagcag atcaacgaca agaccctgaa gaagtacaag 960aacgagtaca tcaatctcta ctcccagcag atcaacgaca agaccctgaa gaagtacaag 960

atgtctgtgc tcttcaagca gatcctgtcc gatacagagt ccaagagctt cgtcattgat 1020atgtctgtgc tcttcaagca gatcctgtcc gatacagagt ccaagagctt cgtcattgat 1020

aagctcgagg acgacagcga cgtggtcacc acaatgcagt cattctacga gcagatcgcc 1080aagctcgagg acgacagcga cgtggtcacc acaatgcagt cattctacga gcagatcgcc 1080

gcgttcaaga ccgtggagga gaagagcatt aaggagacac tctcactcct gttcgatgac 1140gcgttcaaga ccgtggagga gaagagcatt aaggagacac tctcactcct gttcgatgac 1140

ctgaaggccc agaagctcga cctgagcaag atctacttca agaacgataa gagcctcaca 1200ctgaaggccc agaagctcga cctgagcaag atctacttca agaacgataa gagcctcaca 1200

gacctgtcac agcaggtgtt cgatgactac tcagtgattg gcaccgccgt cctcgagtac 1260gacctgtcac agcaggtgtt cgatgactac tcagtgattg gcaccgccgt cctcgagtac 1260

attacacagc agatcgcgcc aaagaacctc gataatcctt ctaagaagga gcaggagctg 1320attacacagc agatcgcgcc aaagaacctc gataatcctt ctaagaagga gcaggagctg 1320

atcgccaaga aaaccgagaa ggcgaagtac ctctccctgg agacaattaa gctcgccctg 1380atcgccaaga aaaccgagaa ggcgaagtac ctctccctgg agacaattaa gctcgccctg 1380

gaggagttca ataagcacag ggatattgac aagcagtgcc gcttcgagga gatcctcgcg 1440gaggagttca ataagcacag ggatattgac aagcagtgcc gcttcgagga gatcctcgcg 1440

aacttcgccg cgatcccaat gattttcgat gagatcgccc agaacaagga caatctggcg 1500aacttcgccg cgatcccaat gattttcgat gagatcgccc agaacaagga caatctggcg 1500

cagatctcta ttaagtacca gaaccagggc aagaaggacc tcctgcaggc ctccgcagag 1560cagatctcta ttaagtacca gaaccagggc aagaaggacc tcctgcaggc ctccgcagag 1560

gacgacgtga aggccatcaa ggatctcctg gaccagacca acaatctcct gcacaagctc 1620gacgacgtga aggccatcaa ggatctcctg gaccagacca acaatctcct gcacaagctc 1620

aagatcttcc atatttcaca gtctgaggat aaggccaata tcctcgataa ggacgagcat 1680aagatcttcc atatttcaca gtctgaggat aaggccaata tcctcgataa ggacgagcat 1680

ttctacctgg tgttcgagga gtgctacttc gagctggcga acattgtccc tctgtacaac 1740ttctacctgg tgttcgagga gtgctacttc gagctggcga acattgtccc tctgtacaac 1740

aagattagga attacatcac acagaagccg tacagcgacg agaagttcaa gctcaacttc 1800aagattagga attacatcac acagaagccg tacagcgacg agaagttcaa gctcaacttc 1800

gagaattcaa ccctggccaa cggctgggat aagaataagg agcctgacaa cacagcgatc 1860gagaattcaa ccctggccaa cggctgggat aagaataagg agcctgacaa cacagcgatc 1860

ctcttcatca aggacgacaa gtactacctg ggcgtgatga ataagaagaa caataagatc 1920ctcttcatca aggacgacaa gtactacctg ggcgtgatga ataagaagaa caataagatc 1920

ttcgatgaca aggccattaa ggagaacaag ggcgagggct acaagaagat cgtgtacaag 1980ttcgatgaca aggccattaa ggagaacaag ggcgagggct acaagaagat cgtgtacaag 1980

ctcctgcctg gcgccaataa gatgctcccg aaggtgttct tctccgcgaa gtccattaag 2040ctcctgcctg gcgccaataa gatgctcccg aaggtgttct tctccgcgaa gtccattaag 2040

ttctacaacc caagcgagga tatcctcagg atcaggaacc actctaccca tacaaagaac 2100ttctacaacc caagcgagga tatcctcagg atcaggaacc actctaccca tacaaagaac 2100

ggctcccctc agaagggcta cgagaagttc gagttcaata tcgaggattg ccggaagttc 2160ggctcccctc agaagggcta cgagaagttc gagttcaata tcgaggattg ccggaagttc 2160

attgacttct acaagcagtc catcagcaag caccctgagt ggaaggattt cggcttccgc 2220attgacttct acaagcagtc catcagcaag caccctgagt ggaaggattt cggcttccgc 2220

ttcagcgaca cccagcggta caactcaatc gatgagttct acagggaggt ggagaatcag 2280ttcagcgaca cccagcggta caactcaatc gatgagttct acagggaggt ggagaatcag 2280

ggctacaagc tcacattcga gaacatttca gagtcttaca tcgactccgt ggtcaatcag 2340ggctacaagc tcacattcga gaacatttca gagtcttaca tcgactccgt ggtcaatcag 2340

ggcaagctct acctgttcca gatctacaac aaggatttca gcgcctactc aaagggcagg 2400ggcaagctct acctgttcca gatctacaac aaggatttca gcgcctactc aaagggcagg 2400

ccgaacctcc ataccctgta ctggaaggcg ctcttcgatg agcgcaatct gcaggacgtg 2460ccgaacctcc ataccctgta ctggaaggcg ctcttcgatg agcgcaatct gcaggacgtg 2460

gtctacaagc tcaacggcga ggccgagctg ttctaccgca agcagtctat tccgaagaag 2520gtctacaagc tcaacggcga ggccgagctg ttctaccgca agcagtctat tccgaagaag 2520

atcacacacc cagcgaagga ggccatcgcg aacaagaata aggacaatcc gaagaaggag 2580atcacacacc cagcgaagga ggccatcgcg aacaagaata aggacaatcc gaagaaggag 2580

tccgtgttcg agtacgatct cattaaggac aagcggttca ccgaggataa gttcttcttc 2640tccgtgttcg agtacgatct cattaaggac aagcggttca ccgaggataa gttcttcttc 2640

cattgcccaa tcacaattaa cttcaagtcc agcggcgcca acaagttcaa tgacgagatc 2700cattgcccaa tcacaattaa cttcaagtcc agcggcgcca acaagttcaa tgacgagatc 2700

aatctcctgc tcaaggagaa ggcgaacgat gtgcatatcc tctccattga ccgcggcgag 2760aatctcctgc tcaaggagaa ggcgaacgat gtgcatatcc tctccattga ccgcggcgag 2760

cggcatctcg cctactacac cctggtcgat ggcaagggca atatcattaa gcaggatacc 2820cggcatctcg cctactacac cctggtcgat ggcaagggca atatcattaa gcaggatacc 2820

ttcaacatca ttggcaatga ccgcatgaaa accaactacc acgataagct cgccgcgatc 2880ttcaacatca ttggcaatga ccgcatgaaa accaactacc acgataagct cgccgcgatc 2880

gagaaggata gggacagcgc gcgcaaggac tggaagaaga tcaacaatat taaggagatg 2940gagaaggata gggacagcgc gcgcaaggac tggaagaaga tcaacaatat taaggagatg 2940

aaggagggct acctctcaca ggtggtccat gagattgcca agctcgtgat cgagtacaat 3000aaggagggct acctctcaca ggtggtccat gagattgcca agctcgtgat cgagtacaat 3000

gcgattgtgg tgttcgagga cctcaacttc ggcttcaagc ggggcaggtt caaggtggag 3060gcgattgtgg tgttcgagga cctcaacttc ggcttcaagc ggggcaggtt caaggtggag 3060

aagcaggtct accagaagct ggagaagatg ctcatcgaga agctcaatta cctggtgttc 3120aagcaggtct accagaagct ggagaagatg ctcatcgaga agctcaatta cctggtgttc 3120

aaggataacg agttcgacaa gaccggagga gtcctcaggg cataccagct gaccgcgcca 3180aaggataacg agttcgacaa gaccggagga gtcctcaggg cataccagct gaccgcgcca 3180

ttcgagacat tcaagaagat gggcaagcag acaggcatca tctactacgt gccagccggc 3240ttcgagacat tcaagaagat gggcaagcag acaggcatca tctactacgt gccagccggc 3240

ttcacctcca agatctgccc tgtgacaggc ttcgtcaacc agctctaccc taagtacgag 3300ttcacctcca agatctgccc tgtgacaggc ttcgtcaacc agctctaccc taagtacgag 3300

tccgtcagca agtcacagga gttcttcagc aagttcgata agatttgcta caatctggac 3360tccgtcagca agtcacagga gttcttcagc aagttcgata agatttgcta caatctggac 3360

aagggctact tcgagttctc attcgattac aagaacttcg gcgacaaggc cgcgaagggc 3420aagggctact tcgagttctc attcgattac aagaacttcg gcgacaaggc cgcgaagggc 3420

aagtggacca ttgcgtcttt cggctcccgg ctcatcaact tcaggaatag cgataagaac 3480aagtggacca ttgcgtcttt cggctcccgg ctcatcaact tcaggaatag cgataagaac 3480

cacaattggg acaccaggga ggtgtacccg acaaaggagc tggagaagct gctcaaggac 3540cacaattggg acaccaggga ggtgtacccg acaaaggagc tggagaagct gctcaaggac 3540

tactcaatcg agtacggcca tggcgagtgc attaaggccg cgatctgcgg cgagagcgat 3600tactcaatcg agtacggcca tggcgagtgc attaaggccg cgatctgcgg cgagagcgat 3600

aagaagttct tcgccaagct gacctcagtc ctcaatacaa tcctgcagat gaggaactct 3660aagaagttct tcgccaagct gacctcagtc ctcaatacaa tcctgcagat gaggaactct 3660

aagaccggca cagagctgga ctacctgatt tccccagtgg ccgatgtcaa cggcaatttc 3720aagaccggca cagagctgga ctacctgatt tccccagtgg ccgatgtcaa cggcaatttc 3720

ttcgacagcc gccaggcgcc gaagaatatg ccacaggacg ccgacgccaa cggcgcatac 3780ttcgacagcc gccaggcgcc gaagaatatg ccacaggacg ccgacgccaa cggcgcatac 3780

cacatcggcc tcaagggcct gatgctgctc ggacgcatta agaacaatca ggagggcaag 3840cacatcggcc tcaagggcct gatgctgctc ggacgcatta agaacaatca ggagggcaag 3840

aagctcaacc tcgtgatcaa gaacgaggag tacttcgagt tcgtccaaaa ccggaacaat 3900aagctcaacc tcgtgatcaa gaacgaggag tacttcgagt tcgtccaaaa ccggaacaat 3900

tccggcggca gcccaaagaa gaagaggaag gtgagcggcg gcagcccaaa gaagaagcgc 3960tccggcggca gcccaaagaa gaagaggaag gtgagcggcg gcagcccaaa gaagaagcgc 3960

aaggtctag 3969aaggtctag 3969

<210> 7<210> 7

<211> 3753<211> 3753

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-2NLS<223> LBCPF1-2NLS

<400> 7<400> 7

atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60

gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120

gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180

tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240

ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300

ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360

aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420

gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480

atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540

actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600

gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660

gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720gaaggtgagt ttttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720

atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780

ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840

ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900

ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960

ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020

ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080

aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140

tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200

caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagt aatcatccaa 1260

aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320

gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380

gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440

aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500

gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560

ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620

gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680

aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740

atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800

aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860

ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920

gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980

gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040

tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100

atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160

acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220

ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280

gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340

tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400

gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460

aagcatgacg acaatcctta cgtcattggg attgatcggg gcgagaggaa cctcctctat 2520aagcatgacg acaatcctta cgtcattggg attgatcggg gcgagaggaa cctcctctat 2520

attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580

aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640

aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700

gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760

gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820

caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880

aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940

gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000

acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060

attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120

gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180

aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240

aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300

aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360

aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420

attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480

ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540

gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600

gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660

tatgctcaga catccgtaaa gcattccggc ggcagcccaa agaagaagag gaaggtgagc 3720tatgctcaga catccgtaaa gcattccggc ggcagcccaa agaagaagag gaaggtgagc 3720

ggcggcagcc caaagaagaa gcgcaaggtc tag 3753ggcggcagcc caaagaagaa gcgcaaggtc tag 3753

<210> 8<210> 8

<211> 4995<211> 4995

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dFNCPF1-PBE-2NLS<223> dFNCPF1-PBE-2NLS

<400> 8<400> 8

atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60

ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120

aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180

accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240

aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300

ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360

atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420

ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480

aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540

tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600

aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660

cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720

acgccaggca cctccgagag cgctacgcct gaacttaagt ccatctacca ggagttcgtc 780acgccaggca cctccgagag cgctacgcct gaacttaagt ccatctacca ggagttcgtc 780

aataagtact cactctctaa gaccctgcgg ttcgagctga tcccgcaggg caagacactc 840aataagtact cactctctaa gaccctgcgg ttcgagctga tcccgcaggg caagacactc 840

gagaacatca aggcgcgcgg cctgattctc gacgatgaga agcgggccaa ggactacaag 900gagaacatca aggcgcgcgg cctgattctc gacgatgaga agcgggccaa ggactacaag 900

aaggcgaagc agatcattga taagtaccac cagttcttca tcgaggagat tctgtccagc 960aaggcgaagc agatcattga taagtaccac cagttcttca tcgaggagat tctgtccagc 960

gtgtgcatct ctgaggatct cctgcagaat tactccgacg tctacttcaa gctcaagaag 1020gtgtgcatct ctgaggatct cctgcagaat tactccgacg tctacttcaa gctcaagaag 1020

tctgacgatg acaacctgca gaaggatttc aagtccgcca aggacaccat caagaagcag 1080tctgacgatg acaacctgca gaaggatttc aagtccgcca aggacaccat caagaagcag 1080

atttctgagt acatcaagga ttccgagaag ttcaagaatc tcttcaacca gaatctgatt 1140atttctgagt acatcaagga ttccgagaag ttcaagaatc tcttcaacca gaatctgatt 1140

gatgcgaaga agggccagga gtctgacctg atcctctggc tgaagcagtc caaggacaat 1200gatgcgaaga agggccagga gtctgacctg atcctctggc tgaagcagtc caaggacaat 1200

ggcattgagc tgttcaaggc caacagcgat atcaccgata ttgacgaggc gctggagatc 1260ggcattgagc tgttcaaggc caacagcgat atcaccgata ttgacgaggc gctggagatc 1260

attaagtcat tcaagggctg gaccacatac ttcaagggct tccatgagaa ccggaagaat 1320attaagtcat tcaagggctg gaccacatac ttcaagggct tccatgagaa ccggaagaat 1320

gtgtactcat ctaacgacat tccgacctcc atcatctaca ggatcgtcga tgacaatctg 1380gtgtactcat ctaacgacat tccgacctcc atcatctaca ggatcgtcga tgacaatctg 1380

ccaaagttcc tcgagaacaa ggccaagtac gagtccctca aggacaaggc cccggaggcg 1440ccaaagttcc tcgagaacaa ggccaagtac gagtccctca aggacaaggc cccggaggcg 1440

attaattacg agcagatcaa gaaggatctg gcggaggagc tgaccttcga tatcgactac 1500attaattacg agcagatcaa gaaggatctg gcggaggagc tgaccttcga tatcgactac 1500

aagacaagcg aggtgaacca gagggtgttc tccctcgatg aggtgttcga gatcgccaat 1560aagacaagcg aggtgaacca gagggtgttc tccctcgatg aggtgttcga gatcgccaat 1560

ttcaacaatt acctgaacca gtccggcatt accaagttca atacaatcat tggcggcaag 1620ttcaacaatt acctgaacca gtccggcatt accaagttca atacaatcat tggcggcaag 1620

ttcgtcaacg gcgagaatac caagcgcaag ggcattaacg agtacatcaa tctctactcc 1680ttcgtcaacg gcgagaatac caagcgcaag ggcattaacg agtacatcaa tctctactcc 1680

cagcagatca acgacaagac cctgaagaag tacaagatgt ctgtgctctt caagcagatc 1740cagcagatca acgacaagac cctgaagaag tacaagatgt ctgtgctctt caagcagatc 1740

ctgtccgata cagagtccaa gagcttcgtc attgataagc tcgaggacga cagcgacgtg 1800ctgtccgata cagagtccaa gagcttcgtc attgataagc tcgaggacga cagcgacgtg 1800

gtcaccacaa tgcagtcatt ctacgagcag atcgccgcgt tcaagaccgt ggaggagaag 1860gtcaccacaa tgcagtcatt ctacgagcag atcgccgcgt tcaagaccgt ggaggagaag 1860

agcattaagg agacactctc actcctgttc gatgacctga aggcccagaa gctcgacctg 1920agcattaagg agacactctc actcctgttc gatgacctga aggcccagaa gctcgacctg 1920

agcaagatct acttcaagaa cgataagagc ctcacagacc tgtcacagca ggtgttcgat 1980agcaagatct acttcaagaa cgataagagc ctcacagacc tgtcacagca ggtgttcgat 1980

gactactcag tgattggcac cgccgtcctc gagtacatta cacagcagat cgcgccaaag 2040gactactcag tgattggcac cgccgtcctc gagtacatta cacagcagat cgcgccaaag 2040

aacctcgata atccttctaa gaaggagcag gagctgatcg ccaagaaaac cgagaaggcg 2100aacctcgata atccttctaa gaaggagcag gagctgatcg ccaagaaaac cgagaaggcg 2100

aagtacctct ccctggagac aattaagctc gccctggagg agttcaataa gcacagggat 2160aagtacctct ccctggagac aattaagctc gccctggagg agttcaataa gcacagggat 2160

attgacaagc agtgccgctt cgaggagatc ctcgcgaact tcgccgcgat cccaatgatt 2220attgacaagc agtgccgctt cgaggagatc ctcgcgaact tcgccgcgat cccaatgatt 2220

ttcgatgaga tcgcccagaa caaggacaat ctggcgcaga tctctattaa gtaccagaac 2280ttcgatgaga tcgcccagaa caaggacaat ctggcgcaga tctctattaa gtaccagaac 2280

cagggcaaga aggacctcct gcaggcctcc gcagaggacg acgtgaaggc catcaaggat 2340cagggcaaga aggacctcct gcaggcctcc gcagaggacg acgtgaaggc catcaaggat 2340

ctcctggacc agaccaacaa tctcctgcac aagctcaaga tcttccatat ttcacagtct 2400ctcctggacc agaccaacaa tctcctgcac aagctcaaga tcttccatat ttcacagtct 2400

gaggataagg ccaatatcct cgataaggac gagcatttct acctggtgtt cgaggagtgc 2460gaggataagg ccaatatcct cgataaggac gagcatttct acctggtgtt cgaggagtgc 2460

tacttcgagc tggcgaacat tgtccctctg tacaacaaga ttaggaatta catcacacag 2520tacttcgagc tggcgaacat tgtccctctg tacaacaaga ttaggaatta catcacacag 2520

aagccgtaca gcgacgagaa gttcaagctc aacttcgaga attcaaccct ggccaacggc 2580aagccgtaca gcgacgagaa gttcaagctc aacttcgaga attcaaccct ggccaacggc 2580

tgggataaga ataaggagcc tgacaacaca gcgatcctct tcatcaagga cgacaagtac 2640tgggataaga ataaggagcc tgacaacaca gcgatcctct tcatcaagga cgacaagtac 2640

tacctgggcg tgatgaataa gaagaacaat aagatcttcg atgacaaggc cattaaggag 2700tacctgggcg tgatgaataa gaagaacaat aagatcttcg atgacaaggc cattaaggag 2700

aacaagggcg agggctacaa gaagatcgtg tacaagctcc tgcctggcgc caataagatg 2760aacaagggcg agggctacaa gaagatcgtg tacaagctcc tgcctggcgc caataagatg 2760

ctcccgaagg tgttcttctc cgcgaagtcc attaagttct acaacccaag cgaggatatc 2820ctcccgaagg tgttcttctc cgcgaagtcc attaagttct acaacccaag cgaggatatc 2820

ctcaggatca ggaaccactc tacccataca aagaacggct cccctcagaa gggctacgag 2880ctcaggatca ggaaccactc tacccataca aagaacggct cccctcagaa gggctacgag 2880

aagttcgagt tcaatatcga ggattgccgg aagttcattg acttctacaa gcagtccatc 2940aagttcgagt tcaatatcga ggattgccgg aagttcattg acttctacaa gcagtccatc 2940

agcaagcacc ctgagtggaa ggatttcggc ttccgcttca gcgacaccca gcggtacaac 3000agcaagcacc ctgagtggaa ggatttcggc ttccgcttca gcgacaccca gcggtacaac 3000

tcaatcgatg agttctacag ggaggtggag aatcagggct acaagctcac attcgagaac 3060tcaatcgatg agttctacag ggaggtggag aatcagggct acaagctcac attcgagaac 3060

atttcagagt cttacatcga ctccgtggtc aatcagggca agctctacct gttccagatc 3120atttcagagt cttacatcga ctccgtggtc aatcagggca agctctacct gttccagatc 3120

tacaacaagg atttcagcgc ctactcaaag ggcaggccga acctccatac cctgtactgg 3180tacaacaagg atttcagcgc ctactcaaag ggcaggccga acctccatac cctgtactgg 3180

aaggcgctct tcgatgagcg caatctgcag gacgtggtct acaagctcaa cggcgaggcc 3240aaggcgctct tcgatgagcg caatctgcag gacgtggtct acaagctcaa cggcgaggcc 3240

gagctgttct accgcaagca gtctattccg aagaagatca cacacccagc gaaggaggcc 3300gagctgttct accgcaagca gtctattccg aagaagatca cacacccagc gaaggaggcc 3300

atcgcgaaca agaataagga caatccgaag aaggagtccg tgttcgagta cgatctcatt 3360atcgcgaaca agaataagga caatccgaag aaggagtccg tgttcgagta cgatctcatt 3360

aaggacaagc ggttcaccga ggataagttc ttcttccatt gcccaatcac aattaacttc 3420aaggacaagc ggttcaccga ggataagttc ttcttccatt gcccaatcac aattaacttc 3420

aagtccagcg gcgccaacaa gttcaatgac gagatcaatc tcctgctcaa ggagaaggcg 3480aagtccagcg gcgccaacaa gttcaatgac gagatcaatc tcctgctcaa ggagaaggcg 3480

aacgatgtgc atatcctctc cattgcccgc ggcgagcggc atctcgccta ctacaccctg 3540aacgatgtgc atatcctctc cattgcccgc ggcgagcggc atctcgccta ctacaccctg 3540

gtcgatggca agggcaatat cattaagcag gataccttca acatcattgg caatgaccgc 3600gtcgatggca agggcaatat cattaagcag gataccttca acatcattgg caatgaccgc 3600

atgaaaacca actaccacga taagctcgcc gcgatcgaga aggataggga cagcgcgcgc 3660atgaaaacca actaccacga taagctcgcc gcgatcgaga aggataggga cagcgcgcgc 3660

aaggactgga agaagatcaa caatattaag gagatgaagg agggctacct ctcacaggtg 3720aaggactgga agaagatcaa caatattaag gagatgaagg agggctacct ctcacaggtg 3720

gtccatgaga ttgccaagct cgtgatcgag tacaatgcga ttgtggtgtt cgaggacctc 3780gtccatgaga ttgccaagct cgtgatcgag tacaatgcga ttgtggtgtt cgaggacctc 3780

aacttcggct tcaagcgggg caggttcaag gtggagaagc aggtctacca gaagctggag 3840aacttcggct tcaagcgggg caggttcaag gtggagaagc aggtctacca gaagctggag 3840

aagatgctca tcgagaagct caattacctg gtgttcaagg ataacgagtt cgacaagacc 3900aagatgctca tcgagaagct caattacctg gtgttcaagg ataacgagtt cgacaagacc 3900

ggaggagtcc tcagggcata ccagctgacc gcgccattcg agacattcaa gaagatgggc 3960ggaggagtcc tcagggcata ccagctgacc gcgccattcg agacattcaa gaagatgggc 3960

aagcagacag gcatcatcta ctacgtgcca gccggcttca cctccaagat ctgccctgtg 4020aagcagacag gcatcatcta ctacgtgcca gccggcttca cctccaagat ctgccctgtg 4020

acaggcttcg tcaaccagct ctaccctaag tacgagtccg tcagcaagtc acaggagttc 4080acaggcttcg tcaaccagct ctaccctaag tacgagtccg tcagcaagtc acaggagttc 4080

ttcagcaagt tcgataagat ttgctacaat ctggacaagg gctacttcga gttctcattc 4140ttcagcaagt tcgataagat ttgctacaat ctggacaagg gctacttcga gttctcattc 4140

gattacaaga acttcggcga caaggccgcg aagggcaagt ggaccattgc gtctttcggc 4200gattacaaga acttcggcga caaggccgcg aagggcaagt ggaccattgc gtctttcggc 4200

tcccggctca tcaacttcag gaatagcgat aagaaccaca attgggacac cagggaggtg 4260tcccggctca tcaacttcag gaatagcgat aagaaccaca attgggacac cagggaggtg 4260

tacccgacaa aggagctgga gaagctgctc aaggactact caatcgagta cggccatggc 4320tacccgacaa aggagctgga gaagctgctc aaggactact caatcgagta cggccatggc 4320

gagtgcatta aggccgcgat ctgcggcgag agcgataaga agttcttcgc caagctgacc 4380gagtgcatta aggccgcgat ctgcggcgag agcgataaga agttcttcgc caagctgacc 4380

tcagtcctca atacaatcct gcagatgagg aactctaaga ccggcacaga gctggactac 4440tcagtcctca atacaatcct gcagatgagg aactctaaga ccggcacaga gctggactac 4440

ctgatttccc cagtggccga tgtcaacggc aatttcttcg acagccgcca ggcgccgaag 4500ctgatttccc cagtggccga tgtcaacggc aatttcttcg acagccgcca ggcgccgaag 4500

aatatgccac aggacgccga cgccaacggc gcataccaca tcggcctcaa gggcctgatg 4560aatatgccac aggacgccga cgccaacggc gcataccaca tcggcctcaa gggcctgatg 4560

ctgctcggac gcattaagaa caatcaggag ggcaagaagc tcaacctcgt gatcaagaac 4620ctgctcggac gcattaagaa caatcaggag ggcaagaagc tcaacctcgt gatcaagaac 4620

gaggagtact tcgagttcgt ccaaaaccgg aacaatacgc gtgactccgg cggcagcacc 4680gaggagtact tcgagttcgt ccaaaaccgg aacaatacgc gtgactccgg cggcagcacc 4680

aacctgtccg acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc 4740aacctgtccg acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc 4740

ctcatgctgc cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg 4800ctcatgctgc cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg 4800

gtgcacaccg cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc 4860gtgcacaccg cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc 4860

ccagagtaca agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag 4920ccagagtaca agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag 4920

atgctgtcgg gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag 4980atgctgtcgg gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag 4980

aagcgcaagg tctag 4995aagcgcaagg tctag 4995

<210> 9<210> 9

<211> 4779<211> 4779

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dLBCPF1-PBE-2NLS<223> dLBCPF1-PBE-2NLS

<400> 9<400> 9

atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60

ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120

aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180

accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240

aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300

ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360

atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420

ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480

aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540

tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600

aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660

cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720

acgccaggca cctccgagag cgctacgcct gaacttaagt caaagctcga gaaattcacc 780acgccaggca cctccgagag cgctacgcct gaacttaagt caaagctcga gaaattcacc 780

aactgttatt cgttgagcaa aacactgcgg tttaaagcga ttccagtcgg caagactcaa 840aactgttatt cgttgagcaa aacactgcgg tttaaagcga ttccagtcgg caagactcaa 840

gagaatatag acaataagcg gctgttggtg gaagatgaaa agcgcgcgga agactacaaa 900gagaatatag acaataagcg gctgttggtg gaagatgaaa agcgcgcgga agactacaaa 900

ggggtgaaga agttgttgga cagatactac ctctctttta tcaatgatgt cttgcactca 960ggggtgaaga agttgttgga cagatactac ctctctttta tcaatgatgt cttgcactca 960

atcaaattga agaatctgaa caactacatc tccctcttca gaaagaaaac aaggacagaa 1020atcaaattga agaatctgaa caactacatc tccctcttca gaaagaaaac aaggacagaa 1020

aaggagaata aggaacttga aaatttggag atcaatctga ggaaagagat cgcgaaagcc 1080aaggagaata aggaacttga aaatttggag atcaatctga ggaaagagat cgcgaaagcc 1080

tttaaaggca acgaaggata caaaagtctg ttcaagaagg atataattga gacaattttg 1140tttaaaggca acgaaggata caaaagtctg ttcaagaagg atataattga gacaattttg 1140

ccagagttcc tcgatgacaa ggacgagatt gcgctggtca attcgttcaa cggattcaca 1200ccagagttcc tcgatgacaa ggacgagatt gcgctggtca attcgttcaa cggattcaca 1200

acagcattca caggcttctt tgataatcgg gaaaatatgt tctctgagga ggcaaagtcc 1260acagcattca caggcttctt tgataatcgg gaaaatatgt tctctgagga ggcaaagtcc 1260

acttctattg cgttcaggtg tatcaatgag aatctcacta ggtacatttc caacatggat 1320acttctattg cgttcaggtg tatcaatgag aatctcacta ggtacatttc caacatggat 1320

atctttgaga aggttgacgc aatttttgac aagcacgaag ttcaggagat taaggagaag 1380atctttgaga aggttgacgc aatttttgac aagcacgaag ttcaggagat taaggagaag 1380

atcctcaatt ccgattatga cgttgaggac ttcttcgaag gtgagttttt taatttcgtg 1440atcctcaatt ccgattatga cgttgaggac ttcttcgaag gtgagttttt taatttcgtg 1440

ctcactcaag agggtatcga cgtgtataat gcgatcatcg gtgggttcgt gactgagtcc 1500ctcactcaag agggtatcga cgtgtataat gcgatcatcg gtgggttcgt gactgagtcc 1500

ggtgaaaaga ttaagggatt gaacgagtat atcaaccttt acaaccaaaa gacgaaacag 1560ggtgaaaaga ttaagggatt gaacgagtat atcaaccttt acaaccaaaa gacgaaacag 1560

aagctgccaa agttcaagcc tctttacaaa caggttcttt cagaccgcga gtcactctcg 1620aagctgccaa agttcaagcc tctttacaaa caggttcttt cagaccgcga gtcactctcg 1620

ttctatgggg agggctacac ttcggatgag gaagtcctgg aggtgttcag gaatactctc 1680ttctatgggg agggctacac ttcggatgag gaagtcctgg aggtgttcag gaatactctc 1680

aataagaatt cggagatttt ctcttctata aaaaaactgg aaaagttgtt taagaatttt 1740aataagaatt cggagatttt ctcttctata aaaaaactgg aaaagttgtt taagaatttt 1740

gacgaatact ctagcgccgg catatttgtg aaaaacggcc cggccatatc aacgataagt 1800gacgaatact ctagcgccgg catatttgtg aaaaacggcc cggccatatc aacgataagt 1800

aaagatatct tcggcgaatg gaacgtgatc agagacaaat ggaacgcgga gtatgacgat 1860aaagatatct tcggcgaatg gaacgtgatc agagacaaat ggaacgcgga gtatgacgat 1860

attcacctga agaagaaggc tgtcgtaacg gagaagtacg aggatgatcg caggaaaagc 1920attcacctga agaagaaggc tgtcgtaacg gagaagtacg aggatgatcg caggaaaagc 1920

ttcaaaaaga tcggaagttt cagcctggaa cagttgcagg agtatgctga cgccgatctt 1980ttcaaaaaga tcggaagttt cagcctggaa cagttgcagg agtatgctga cgccgatctt 1980

agcgtcgtcg agaagttgaa ggagataatc atccaaaagg tcgacgagat atataaagtc 2040agcgtcgtcg agaagttgaa ggagataatc atccaaaagg tcgacgagat atataaagtc 2040

tatggatcaa gtgaaaaact gttcgacgcc gacttcgttt tggagaagtc cctgaagaag 2100tatggatcaa gtgaaaaact gttcgacgcc gacttcgttt tggagaagtc cctgaagaag 2100

aacgacgctg ttgttgccat tatgaaggat ctgctcgaca gcgtgaagag tttcgagaac 2160aacgacgctg ttgttgccat tatgaaggat ctgctcgaca gcgtgaagag tttcgagaac 2160

tatattaagg cttttttcgg ggaggggaag gagactaaca gagatgagtc cttctacgga 2220tatattaagg cttttttcgg ggaggggaag gagactaaca gagatgagtc cttctacgga 2220

gacttcgtcc tcgcgtacga tatactcctt aaggtagacc acatctacga cgcaatcaga 2280gacttcgtcc tcgcgtacga tatactcctt aaggtagacc acatctacga cgcaatcaga 2280

aattacgtga cacaaaagcc gtacagcaag gacaagttca aactctactt ccagaacccc 2340aattacgtga cacaaaagcc gtacagcaag gacaagttca aactctactt ccagaacccc 2340

cagttcatgg gcggctggga caaggacaag gaaacggatt acagggctac gatcctgagg 2400cagttcatgg gcggctggga caaggacaag gaaacggatt acagggctac gatcctgagg 2400

tatggttcaa aatactactt ggcgattatg gacaagaagt acgccaagtg tctccagaag 2460tatggttcaa aatactactt ggcgattatg gacaagaagt acgccaagtg tctccagaag 2460

attgacaaag acgatgtcaa tggcaattat gagaagatca actacaagct gcttccgggt 2520attgacaaag acgatgtcaa tggcaattat gagaagatca actacaagct gcttccgggt 2520

ccgaacaaga tgctcccaaa ggttttcttc agcaagaaat ggatggccta ctataaccca 2580ccgaacaaga tgctcccaaa ggttttcttc agcaagaaat ggatggccta ctataaccca 2580

agcgaggaca tccagaagat ttataagaac ggtacgttca agaagggcga catgttcaat 2640agcgaggaca tccagaagat ttataagaac ggtacgttca agaagggcga catgttcaat 2640

cttaacgact gtcacaagct gatcgacttc ttcaaagact caattagccg gtacccaaag 2700cttaacgact gtcacaagct gatcgacttc ttcaaagact caattagccg gtacccaaag 2700

tggtctaacg cctatgactt caacttttcg gaaaccgaga agtacaagga tatagccgga 2760tggtctaacg cctatgactt caacttttcg gaaaccgaga agtacaagga tatagccgga 2760

ttttatagag aggtggaaga gcagggctac aaggtgtcat tcgagtccgc cagcaagaag 2820ttttatagag aggtggaaga gcagggctac aaggtgtcat tcgagtccgc cagcaagaag 2820

gaagtggaca agctcgtgga agagggtaag ctctacatgt tccagattta taataaagac 2880gaagtggaca agctcgtgga agagggtaag ctctacatgt tccagattta taataaagac 2880

tttagcgata agagccacgg gacacctaat ctccacacaa tgtatttcaa gctgctcttc 2940tttagcgata agagccacgg gacacctaat ctccacacaa tgtatttcaa gctgctcttc 2940

gacgagaata accacggcca aatcaggttg tcaggagggg ctgaactctt catgcggcgc 3000gacgagaata accacggcca aatcaggttg tcaggagggg ctgaactctt catgcggcgc 3000

gctagcctta agaaggagga gcttgtagtc caccctgcga atagtccaat tgcgaataag 3060gctagcctta agaaggagga gcttgtagtc caccctgcga atagtccaat tgcgaataag 3060

aacccggaca atcctaaaaa gactacaaca ttgagctacg acgtgtacaa ggataagagg 3120aacccggaca atcctaaaaa gactacaaca ttgagctacg acgtgtacaa ggataagagg 3120

ttttccgagg atcagtacga gctccacatc ccgattgcga tcaacaagtg cccaaagaat 3180ttttccgagg atcagtacga gctccacatc ccgattgcga tcaacaagtg cccaaagaat 3180

attttcaaga taaacacaga ggtgcgtgta ctcctgaagc atgacgacaa tccttacgtc 3240attttcaaga taaacacaga ggtgcgtgta ctcctgaagc atgacgacaa tccttacgtc 3240

attgggattg ctcggggcga gaggaacctc ctctatattg tggtggtgga cgggaagggg 3300attgggattg ctcggggcga gaggaacctc ctctatattg tggtggtgga cgggaagggg 3300

aacatagtcg aacagtactc ccttaacgaa ataattaaca atttcaacgg catccgtatc 3360aacatagtcg aacagtactc ccttaacgaa ataattaaca atttcaacgg catccgtatc 3360

aagaccgact accattcgtt gctggacaag aaggagaagg agagatttga ggcgcggcaa 3420aagaccgact accattcgtt gctggacaag aaggagaagg agagatttga ggcgcggcaa 3420

aattggacaa gtatcgagaa catcaaggaa ctcaaagcag gttatatctc tcaagttgtg 3480aattggacaa gtatcgagaa catcaaggaa ctcaaagcag gttatatctc tcaagttgtg 3480

cataagatat gcgagctggt tgagaagtat gacgcagtga tcgctcttga ggacctcaac 3540cataagatat gcgagctggt tgagaagtat gacgcagtga tcgctcttga ggacctcaac 3540

tcgggcttta agaattctag agttaaagtg gagaagcagg tctatcaaaa gttcgagaag 3600tcgggcttta agaattctag agttaaagtg gagaagcagg tctatcaaaa gttcgagaag 3600

atgcttatag ataagctcaa ctacatggtc gataagaaat cgaacccatg tgccaccggc 3660atgcttatag ataagctcaa ctacatggtc gataagaaat cgaacccatg tgccaccggc 3660

ggcgcactca aaggttacca aataacaaac aaattcgagt ccttcaaatc gatgagtact 3720ggcgcactca aaggttacca aataacaaac aaattcgagt ccttcaaatc gatgagtact 3720

cagaatgggt tcatatttta tataccggcg tggcttacgt ctaagatcga cccgtcaact 3780cagaatgggt tcatatttta tataccggcg tggcttacgt ctaagatcga cccgtcaact 3780

ggttttgtca acctgttgaa gacgaaatac acgtccattg ccgattcgaa aaagttcata 3840ggttttgtca acctgttgaa gacgaaatac acgtccattg ccgattcgaa aaagttcata 3840

tctagttttg atcgtattat gtacgtccca gaggaagatc ttttcgagtt tgctctcgac 3900tctagttttg atcgtattat gtacgtccca gaggaagatc ttttcgagtt tgctctcgac 3900

tacaaaaact tttcgcggac cgatgcggat tacattaaaa aatggaaact ctattcgtac 3960tacaaaaact tttcgcggac cgatgcggat tacattaaaa aatggaaact ctattcgtac 3960

ggcaacagaa tcaggatttt tcgcaaccct aagaagaata acgtctttga ttgggaggaa 4020ggcaacagaa tcaggatttt tcgcaaccct aagaagaata acgtctttga ttgggaggaa 4020

gtttgcttga ctagcgcgta caaggagctc tttaataagt atggcattaa ctaccaacag 4080gtttgcttga ctagcgcgta caaggagctc tttaataagt atggcattaa ctaccaacag 4080

ggtgatatca gagcactgct ttgcgaacaa tctgacaagg ctttctactc atccttcatg 4140ggtgatatca gagcactgct ttgcgaacaa tctgacaagg ctttctactc atccttcatg 4140

gctttgatga gcctgatgct ccagatgaga aattcaatta caggcagaac cgacgtggat 4200gctttgatga gcctgatgct ccagatgaga aattcaatta caggcagaac cgacgtggat 4200

ttcttgatct ccccggttaa aaattctgat ggcatctttt acgatagcag gaactatgaa 4260ttcttgatct ccccggttaa aaattctgat ggcatctttt acgatagcag gaactatgaa 4260

gcgcaagaga atgcgattct gccaaaaaat gcagacgcca acggtgccta taacatcgcc 4320gcgcaagaga atgcgattct gccaaaaaat gcagacgcca acggtgccta taacatcgcc 4320

aggaaagtcc tgtgggcgat cggccagttc aaaaaggccg aagacgaaaa attggacaag 4380aggaaagtcc tgtgggcgat cggccagttc aaaaaggccg aagacgaaaa attggacaag 4380

gtcaaaatcg ctatcagcaa caaagagtgg ctggagtatg ctcagacatc cgtaaagcat 4440gtcaaaatcg ctatcagcaa caaagagtgg ctggagtatg ctcagacatc cgtaaagcat 4440

acgcgtgact ccggcggcag caccaacctg tccgacatca tcgagaagga gacgggcaag 4500acgcgtgact ccggcggcag caccaacctg tccgacatca tcgagaagga gacgggcaag 4500

caactcgtga tccaggagag catcctcatg ctgccagagg aggtggagga ggtcatcggc 4560caactcgtga tccaggagag catcctcatg ctgccagagg aggtggagga ggtcatcggc 4560

aacaagccag agtccgacat cctggtgcac accgcctacg acgagtccac cgacgagaac 4620aacaagccag agtccgacat cctggtgcac accgcctacg acgagtccac cgacgagaac 4620

gtcatgctcc tgaccagcga cgccccagag tacaagccat gggccctcgt catccaggac 4680gtcatgctcc tgaccagcga cgccccagag tacaagccat gggccctcgt catccaggac 4680

agcaacgggg agaacaagat caagatgctg tcggggggga gcccaaagaa gaagcggaag 4740agcaacgggg agaacaagat caagatgctg tcggggggga gcccaaagaa gaagcggaag 4740

gtgagcggcg gcagcccaaa gaagaagcgc aaggtctag 4779gtgagcggcg gcagcccaaa gaagaagcgc aaggtctag 4779

<210> 10<210> 10

<211> 6305<211> 6305

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> BDUBI10-dLBCPF1-PBE-2NLS<223> BDUBI10-dLBCPF1-PBE-2NLS

<400> 10<400> 10

ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60

agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120

tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180

tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240

gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300

ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360

gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420gtttagggtt aatggtttttt atagactaat tttttagta catctatttt attctatttt 420

agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480

taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540

aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600

tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660

cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720

ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780

gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840

caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900

ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960

tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020

taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080

atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140

cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200

ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260

gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320

cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380

acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440

ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500

tctgcaggtc gaagcttgaa gcaaacatgc caaagaagaa gaggaaggtt tcatcggaga 1560tctgcaggtc gaagcttgaa gcaaacatgc caaagaagaa gaggaaggtt tcatcggaga 1560

ccggccctgt tgctgttgac cccaccctgc ggcggagaat cgagccacac gagttcgagg 1620ccggccctgt tgctgttgac cccaccctgc ggcggagaat cgagccacac gagttcgagg 1620

tgttcttcga cccaagggag ctccgcaagg agacgtgcct cctgtacgag atcaactggg 1680tgttcttcga cccaagggag ctccgcaagg agacgtgcct cctgtacgag atcaactggg 1680

gcggcaggca ctccatctgg aggcacacca gccaaaacac caacaagcac gtggaggtca 1740gcggcaggca ctccatctgg aggcacacca gccaaaacac caacaagcac gtggaggtca 1740

acttcatcga gaagttcacc accgagaggt acttctgccc aaacacccgc tgctccatca 1800acttcatcga gaagttcacc accgagaggt acttctgccc aaacacccgc tgctccatca 1800

cctggttcct gtcctggagc ccatgcggcg agtgctccag ggccatcacc gagttcctca 1860cctggttcct gtcctggagc ccatgcggcg agtgctccag ggccatcacc gagttcctca 1860

gccgctaccc acacgtcacc ctgttcatct acatcgccag gctctaccac cacgccgacc 1920gccgctaccc acacgtcacc ctgttcatct acatcgccag gctctaccac cacgccgacc 1920

caaggaacag gcagggcctc cgcgacctga tctccagcgg cgtgaccatc caaatcatga 1980caaggaacag gcagggcctc cgcgacctga tctccagcgg cgtgaccatc caaatcatga 1980

ccgagcagga gtccggctac tgctggagga acttcgtcaa ctactcccca agcaacgagg 2040ccgagcagga gtccggctac tgctggagga acttcgtcaa ctactcccca agcaacgagg 2040

cccactggcc aaggtaccca cacctctggg tgcgcctcta cgtgctcgag ctgtactgca 2100cccactggcc aaggtaccca cacctctggg tgcgcctcta cgtgctcgag ctgtactgca 2100

tcatcctcgg cctgccacca tgcctcaaca tcctgaggcg caagcaacca cagctgacct 2160tcatcctcgg cctgccacca tgcctcaaca tcctgaggcg caagcaacca cagctgacct 2160

tcttcaccat cgccctccaa agctgccact accagaggct cccaccacac atcctgtggg 2220tcttcaccat cgccctccaa agctgccact accagaggct cccaccacac atcctgtggg 2220

ctaccggcct caagtccggc agcgagacgc caggcacctc cgagagcgct acgcctgaac 2280ctaccggcct caagtccggc agcgagacgc caggcacctc cgagagcgct acgcctgaac 2280

ttaagtcaaa gctcgagaaa ttcaccaact gttattcgtt gagcaaaaca ctgcggttta 2340ttaagtcaaa gctcgagaaa ttcaccaact gttattcgtt gagcaaaaca ctgcggttta 2340

aagcgattcc agtcggcaag actcaagaga atatagacaa taagcggctg ttggtggaag 2400aagcgattcc agtcggcaag actcaagaga atatagacaa taagcggctg ttggtggaag 2400

atgaaaagcg cgcggaagac tacaaagggg tgaagaagtt gttggacaga tactacctct 2460atgaaaagcg cgcggaagac tacaaagggg tgaagaagtt gttggacaga tactacctct 2460

cttttatcaa tgatgtcttg cactcaatca aattgaagaa tctgaacaac tacatctccc 2520cttttatcaa tgatgtcttg cactcaatca aattgaagaa tctgaacaac tacatctccc 2520

tcttcagaaa gaaaacaagg acagaaaagg agaataagga acttgaaaat ttggagatca 2580tcttcagaaa gaaaacaagg acagaaaagg agaataagga acttgaaaat ttggagatca 2580

atctgaggaa agagatcgcg aaagccttta aaggcaacga aggatacaaa agtctgttca 2640atctgaggaa agagatcgcg aaagccttta aaggcaacga aggatacaaa agtctgttca 2640

agaaggatat aattgagaca attttgccag agttcctcga tgacaaggac gagattgcgc 2700agaaggatat aattgagaca attttgccag agttcctcga tgacaaggac gagattgcgc 2700

tggtcaattc gttcaacgga ttcacaacag cattcacagg cttctttgat aatcgggaaa 2760tggtcaattc gttcaacgga ttcacaacag cattcacagg cttctttgat aatcgggaaa 2760

atatgttctc tgaggaggca aagtccactt ctattgcgtt caggtgtatc aatgagaatc 2820atatgttctc tgaggaggca aagtccactt ctattgcgtt caggtgtatc aatgagaatc 2820

tcactaggta catttccaac atggatatct ttgagaaggt tgacgcaatt tttgacaagc 2880tcactaggta catttccaac atggatatct ttgagaaggt tgacgcaatt tttgacaagc 2880

acgaagttca ggagattaag gagaagatcc tcaattccga ttatgacgtt gaggacttct 2940acgaagttca ggagattaag gagaagatcc tcaattccga ttatgacgtt gaggacttct 2940

tcgaaggtga gttttttaat ttcgtgctca ctcaagaggg tatcgacgtg tataatgcga 3000tcgaaggtga gttttttaat ttcgtgctca ctcaagaggg tatcgacgtg tataatgcga 3000

tcatcggtgg gttcgtgact gagtccggtg aaaagattaa gggattgaac gagtatatca 3060tcatcggtgg gttcgtgact gagtccggtg aaaagattaa gggattgaac gagtatatca 3060

acctttacaa ccaaaagacg aaacagaagc tgccaaagtt caagcctctt tacaaacagg 3120acctttacaa ccaaaagacg aaacagaagc tgccaaagtt caagcctctt tacaaacagg 3120

ttctttcaga ccgcgagtca ctctcgttct atggggaggg ctacacttcg gatgaggaag 3180ttctttcaga ccgcgagtca ctctcgttct atggggaggg ctacacttcg gatgaggaag 3180

tcctggaggt gttcaggaat actctcaata agaattcgga gattttctct tctataaaaa 3240tcctggaggt gttcaggaat actctcaata agaattcgga gattttctct tctataaaaa 3240

aactggaaaa gttgtttaag aattttgacg aatactctag cgccggcata tttgtgaaaa 3300aactggaaaa gttgtttaag aattttgacg aatactctag cgccggcata tttgtgaaaa 3300

acggcccggc catatcaacg ataagtaaag atatcttcgg cgaatggaac gtgatcagag 3360acggcccggc catatcaacg ataagtaaag atatcttcgg cgaatggaac gtgatcagag 3360

acaaatggaa cgcggagtat gacgatattc acctgaagaa gaaggctgtc gtaacggaga 3420acaaatggaa cgcggagtat gacgatattc acctgaagaa gaaggctgtc gtaacggaga 3420

agtacgagga tgatcgcagg aaaagcttca aaaagatcgg aagtttcagc ctggaacagt 3480agtacgagga tgatcgcagg aaaagcttca aaaagatcgg aagtttcagc ctggaacagt 3480

tgcaggagta tgctgacgcc gatcttagcg tcgtcgagaa gttgaaggag ataatcatcc 3540tgcaggagta tgctgacgcc gatcttagcg tcgtcgagaa gttgaaggag ataatcatcc 3540

aaaaggtcga cgagatatat aaagtctatg gatcaagtga aaaactgttc gacgccgact 3600aaaaggtcga cgagatatat aaagtctatg gatcaagtga aaaactgttc gacgccgact 3600

tcgttttgga gaagtccctg aagaagaacg acgctgttgt tgccattatg aaggatctgc 3660tcgttttgga gaagtccctg aagaagaacg acgctgttgt tgccattatg aaggatctgc 3660

tcgacagcgt gaagagtttc gagaactata ttaaggcttt tttcggggag gggaaggaga 3720tcgacagcgt gaagagtttc gagaactata ttaaggcttt tttcggggag gggaaggaga 3720

ctaacagaga tgagtccttc tacggagact tcgtcctcgc gtacgatata ctccttaagg 3780ctaacagaga tgagtccttc tacggagact tcgtcctcgc gtacgatata ctccttaagg 3780

tagaccacat ctacgacgca atcagaaatt acgtgacaca aaagccgtac agcaaggaca 3840tagaccacat ctacgacgca atcagaaatt acgtgacaca aaagccgtac agcaaggaca 3840

agttcaaact ctacttccag aacccccagt tcatgggcgg ctgggacaag gacaaggaaa 3900agttcaaact ctacttccag aacccccagt tcatgggcgg ctgggacaag gacaaggaaa 3900

cggattacag ggctacgatc ctgaggtatg gttcaaaata ctacttggcg attatggaca 3960cggattacag ggctacgatc ctgaggtatg gttcaaaata ctacttggcg attatggaca 3960

agaagtacgc caagtgtctc cagaagattg acaaagacga tgtcaatggc aattatgaga 4020agaagtacgc caagtgtctc cagaagattg acaaagacga tgtcaatggc aattatgaga 4020

agatcaacta caagctgctt ccgggtccga acaagatgct cccaaaggtt ttcttcagca 4080agatcaacta caagctgctt ccgggtccga acaagatgct cccaaaggtt ttcttcagca 4080

agaaatggat ggcctactat aacccaagcg aggacatcca gaagatttat aagaacggta 4140agaaatggat ggcctactat aacccaagcg aggacatcca gaagatttat aagaacggta 4140

cgttcaagaa gggcgacatg ttcaatctta acgactgtca caagctgatc gacttcttca 4200cgttcaagaa gggcgacatg ttcaatctta acgactgtca caagctgatc gacttcttca 4200

aagactcaat tagccggtac ccaaagtggt ctaacgccta tgacttcaac ttttcggaaa 4260aagactcaat tagccggtac ccaaagtggt ctaacgccta tgacttcaac ttttcggaaa 4260

ccgagaagta caaggatata gccggatttt atagagaggt ggaagagcag ggctacaagg 4320ccgagaagta caaggatata gccggatttt atagagaggt ggaagagcag ggctacaagg 4320

tgtcattcga gtccgccagc aagaaggaag tggacaagct cgtggaagag ggtaagctct 4380tgtcattcga gtccgccagc aagaaggaag tggacaagct cgtggaagag ggtaagctct 4380

acatgttcca gatttataat aaagacttta gcgataagag ccacgggaca cctaatctcc 4440acatgttcca gatttataat aaagacttta gcgataagag ccacgggaca cctaatctcc 4440

acacaatgta tttcaagctg ctcttcgacg agaataacca cggccaaatc aggttgtcag 4500acacaatgta tttcaagctg ctcttcgacg agaataacca cggccaaatc aggttgtcag 4500

gaggggctga actcttcatg cggcgcgcta gccttaagaa ggaggagctt gtagtccacc 4560gaggggctga actcttcatg cggcgcgcta gccttaagaa ggaggagctt gtagtccacc 4560

ctgcgaatag tccaattgcg aataagaacc cggacaatcc taaaaagact acaacattga 4620ctgcgaatag tccaattgcg aataagaacc cggacaatcc taaaaagact acaacattga 4620

gctacgacgt gtacaaggat aagaggtttt ccgaggatca gtacgagctc cacatcccga 4680gctacgacgt gtacaaggat aagaggtttt ccgaggatca gtacgagctc cacatcccga 4680

ttgcgatcaa caagtgccca aagaatattt tcaagataaa cacagaggtg cgtgtactcc 4740ttgcgatcaa caagtgccca aagaatattt tcaagataaa cacagaggtg cgtgtactcc 4740

tgaagcatga cgacaatcct tacgtcattg ggattgctcg gggcgagagg aacctcctct 4800tgaagcatga cgacaatcct tacgtcattg ggattgctcg gggcgagagg aacctcctct 4800

atattgtggt ggtggacggg aaggggaaca tagtcgaaca gtactccctt aacgaaataa 4860atattgtggt ggtggacggg aaggggaaca tagtcgaaca gtactccctt aacgaaataa 4860

ttaacaattt caacggcatc cgtatcaaga ccgactacca ttcgttgctg gacaagaagg 4920ttaacaattt caacggcatc cgtatcaaga ccgactacca ttcgttgctg gacaagaagg 4920

agaaggagag atttgaggcg cggcaaaatt ggacaagtat cgagaacatc aaggaactca 4980agaaggagag atttgaggcg cggcaaaatt ggacaagtat cgagaacatc aaggaactca 4980

aagcaggtta tatctctcaa gttgtgcata agatatgcga gctggttgag aagtatgacg 5040aagcaggtta tatctctcaa gttgtgcata agatatgcga gctggttgag aagtatgacg 5040

cagtgatcgc tcttgaggac ctcaactcgg gctttaagaa ttctagagtt aaagtggaga 5100cagtgatcgc tcttgaggac ctcaactcgg gctttaagaa ttctagagtt aaagtggaga 5100

agcaggtcta tcaaaagttc gagaagatgc ttatagataa gctcaactac atggtcgata 5160agcaggtcta tcaaaagttc gagaagatgc ttatagataa gctcaactac atggtcgata 5160

agaaatcgaa cccatgtgcc accggcggcg cactcaaagg ttaccaaata acaaacaaat 5220agaaatcgaa cccatgtgcc accggcggcg cactcaaagg ttaccaaata acaaacaaat 5220

tcgagtcctt caaatcgatg agtactcaga atgggttcat attttatata ccggcgtggc 5280tcgagtcctt caaatcgatg agtactcaga atgggttcat attttatata ccggcgtggc 5280

ttacgtctaa gatcgacccg tcaactggtt ttgtcaacct gttgaagacg aaatacacgt 5340ttacgtctaa gatcgacccg tcaactggtt ttgtcaacct gttgaagacg aaatacacgt 5340

ccattgccga ttcgaaaaag ttcatatcta gttttgatcg tattatgtac gtcccagagg 5400ccattgccga ttcgaaaaag ttcatatcta gttttgatcg tattatgtac gtcccagagg 5400

aagatctttt cgagtttgct ctcgactaca aaaacttttc gcggaccgat gcggattaca 5460aagatctttt cgagtttgct ctcgactaca aaaacttttc gcggaccgat gcggattaca 5460

ttaaaaaatg gaaactctat tcgtacggca acagaatcag gatttttcgc aaccctaaga 5520ttaaaaaatg gaaactctat tcgtacggca acagaatcag gatttttcgc aaccctaaga 5520

agaataacgt ctttgattgg gaggaagttt gcttgactag cgcgtacaag gagctcttta 5580agaataacgt ctttgattgg gaggaagttt gcttgactag cgcgtacaag gagctcttta 5580

ataagtatgg cattaactac caacagggtg atatcagagc actgctttgc gaacaatctg 5640ataagtatgg cattaactac caacagggtg atatcagagc actgctttgc gaacaatctg 5640

acaaggcttt ctactcatcc ttcatggctt tgatgagcct gatgctccag atgagaaatt 5700acaaggcttt ctactcatcc ttcatggctt tgatgagcct gatgctccag atgagaaatt 5700

caattacagg cagaaccgac gtggatttct tgatctcccc ggttaaaaat tctgatggca 5760caattacagg cagaaccgac gtggatttct tgatctcccc ggttaaaaat tctgatggca 5760

tcttttacga tagcaggaac tatgaagcgc aagagaatgc gattctgcca aaaaatgcag 5820tcttttacga tagcaggaac tatgaagcgc aagagaatgc gattctgcca aaaaatgcag 5820

acgccaacgg tgcctataac atcgccagga aagtcctgtg ggcgatcggc cagttcaaaa 5880acgccaacgg tgcctataac atcgccagga aagtcctgtg ggcgatcggc cagttcaaaa 5880

aggccgaaga cgaaaaattg gacaaggtca aaatcgctat cagcaacaaa gagtggctgg 5940aggccgaaga cgaaaaattg gacaaggtca aaatcgctat cagcaacaaa gagtggctgg 5940

agtatgctca gacatccgta aagcatacgc gtgactccgg cggcagcacc aacctgtccg 6000agtatgctca gacatccgta aagcatacgc gtgactccgg cggcagcacc aacctgtccg 6000

acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc ctcatgctgc 6060acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc ctcatgctgc 6060

cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg gtgcacaccg 6120cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg gtgcacaccg 6120

cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc ccagagtaca 6180cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc ccagagtaca 6180

agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag atgctgtcgg 6240agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag atgctgtcgg 6240

gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag aagcgcaagg 6300gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag aagcgcaagg 6300

tctag 6305tctag 6305

<210> 11<210> 11

<211> 5196<211> 5196

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dFNCPF1-ABE7.10-2NLS<223> dFNCPF1-ABE7.10-2NLS

<400> 11<400> 11

atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60

tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120

gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180

ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240

atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300

tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360

aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420

gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480

aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540

ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600

gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660

atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720

gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780

catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840

aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900

ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960

ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020

atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080

ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140

tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200

tctggaggat ctagcggagg atctcttaag tccatctacc aggagttcgt caataagtac 1260tctggaggat ctagcggagg atctcttaag tccatctacc aggagttcgt caataagtac 1260

tcactctcta agaccctgcg gttcgagctg atcccgcagg gcaagacact cgagaacatc 1320tcactctcta agaccctgcg gttcgagctg atcccgcagg gcaagacact cgagaacatc 1320

aaggcgcgcg gcctgattct cgacgatgag aagcgggcca aggactacaa gaaggcgaag 1380aaggcgcgcg gcctgattct cgacgatgag aagcgggcca aggactacaa gaaggcgaag 1380

cagatcattg ataagtacca ccagttcttc atcgaggaga ttctgtccag cgtgtgcatc 1440cagatcattg ataagtacca ccagttcttc atcgaggaga ttctgtccag cgtgtgcatc 1440

tctgaggatc tcctgcagaa ttactccgac gtctacttca agctcaagaa gtctgacgat 1500tctgaggatc tcctgcagaa ttactccgac gtctacttca agctcaagaa gtctgacgat 1500

gacaacctgc agaaggattt caagtccgcc aaggacacca tcaagaagca gatttctgag 1560gacaacctgc agaaggattt caagtccgcc aaggacacca tcaagaagca gatttctgag 1560

tacatcaagg attccgagaa gttcaagaat ctcttcaacc agaatctgat tgatgcgaag 1620tacatcaagg attccgagaa gttcaagaat ctcttcaacc agaatctgat tgatgcgaag 1620

aagggccagg agtctgacct gatcctctgg ctgaagcagt ccaaggacaa tggcattgag 1680aagggccagg agtctgacct gatcctctgg ctgaagcagt ccaaggacaa tggcattgag 1680

ctgttcaagg ccaacagcga tatcaccgat attgacgagg cgctggagat cattaagtca 1740ctgttcaagg ccaacagcga tatcaccgat attgacgagg cgctggagat cattaagtca 1740

ttcaagggct ggaccacata cttcaagggc ttccatgaga accggaagaa tgtgtactca 1800ttcaagggct ggaccacata cttcaagggc ttccatgaga accggaagaa tgtgtactca 1800

tctaacgaca ttccgacctc catcatctac aggatcgtcg atgacaatct gccaaagttc 1860tctaacgaca ttccgacctc catcatctac aggatcgtcg atgacaatct gccaaagttc 1860

ctcgagaaca aggccaagta cgagtccctc aaggacaagg ccccggaggc gattaattac 1920ctcgagaaca aggccaagta cgagtccctc aaggacaagg ccccggaggc gattaattac 1920

gagcagatca agaaggatct ggcggaggag ctgaccttcg atatcgacta caagacaagc 1980gagcagatca agaaggatct ggcggaggag ctgaccttcg atatcgacta caagacaagc 1980

gaggtgaacc agagggtgtt ctccctcgat gaggtgttcg agatcgccaa tttcaacaat 2040gaggtgaacc agagggtgtt ctccctcgat gaggtgttcg agatcgccaa tttcaacaat 2040

tacctgaacc agtccggcat taccaagttc aatacaatca ttggcggcaa gttcgtcaac 2100tacctgaacc agtccggcat taccaagttc aatacaatca ttggcggcaa gttcgtcaac 2100

ggcgagaata ccaagcgcaa gggcattaac gagtacatca atctctactc ccagcagatc 2160ggcgagaata ccaagcgcaa gggcattaac gagtacatca atctctactc ccagcagatc 2160

aacgacaaga ccctgaagaa gtacaagatg tctgtgctct tcaagcagat cctgtccgat 2220aacgacaaga ccctgaagaa gtacaagatg tctgtgctct tcaagcagat cctgtccgat 2220

acagagtcca agagcttcgt cattgataag ctcgaggacg acagcgacgt ggtcaccaca 2280acagagtcca agagcttcgt cattgataag ctcgaggacg acagcgacgt ggtcaccaca 2280

atgcagtcat tctacgagca gatcgccgcg ttcaagaccg tggaggagaa gagcattaag 2340atgcagtcat tctacgagca gatcgccgcg ttcaagaccg tggaggagaa gagcattaag 2340

gagacactct cactcctgtt cgatgacctg aaggcccaga agctcgacct gagcaagatc 2400gagacactct cactcctgtt cgatgacctg aaggcccaga agctcgacct gagcaagatc 2400

tacttcaaga acgataagag cctcacagac ctgtcacagc aggtgttcga tgactactca 2460tacttcaaga acgataagag cctcacagac ctgtcacagc aggtgttcga tgactactca 2460

gtgattggca ccgccgtcct cgagtacatt acacagcaga tcgcgccaaa gaacctcgat 2520gtgattggca ccgccgtcct cgagtacatt acacagcaga tcgcgccaaa gaacctcgat 2520

aatccttcta agaaggagca ggagctgatc gccaagaaaa ccgagaaggc gaagtacctc 2580aatccttcta agaaggagca ggagctgatc gccaagaaaa ccgagaaggc gaagtacctc 2580

tccctggaga caattaagct cgccctggag gagttcaata agcacaggga tattgacaag 2640tccctggaga caattaagct cgccctggag gagttcaata agcacaggga tattgacaag 2640

cagtgccgct tcgaggagat cctcgcgaac ttcgccgcga tcccaatgat tttcgatgag 2700cagtgccgct tcgaggagat cctcgcgaac ttcgccgcga tcccaatgat tttcgatgag 2700

atcgcccaga acaaggacaa tctggcgcag atctctatta agtaccagaa ccagggcaag 2760atcgcccaga acaaggacaa tctggcgcag atctctatta agtaccagaa ccagggcaag 2760

aaggacctcc tgcaggcctc cgcagaggac gacgtgaagg ccatcaagga tctcctggac 2820aaggacctcc tgcaggcctc cgcagaggac gacgtgaagg ccatcaagga tctcctggac 2820

cagaccaaca atctcctgca caagctcaag atcttccata tttcacagtc tgaggataag 2880cagaccaaca atctcctgca caagctcaag atcttccata tttcacagtc tgaggataag 2880

gccaatatcc tcgataagga cgagcatttc tacctggtgt tcgaggagtg ctacttcgag 2940gccaatatcc tcgataagga cgagcatttc tacctggtgt tcgaggagtg ctacttcgag 2940

ctggcgaaca ttgtccctct gtacaacaag attaggaatt acatcacaca gaagccgtac 3000ctggcgaaca ttgtccctct gtacaacaag attaggaatt acatcacaca gaagccgtac 3000

agcgacgaga agttcaagct caacttcgag aattcaaccc tggccaacgg ctgggataag 3060agcgacgaga agttcaagct caacttcgag aattcaaccc tggccaacgg ctgggataag 3060

aataaggagc ctgacaacac agcgatcctc ttcatcaagg acgacaagta ctacctgggc 3120aataaggagc ctgacaacac agcgatcctc ttcatcaagg acgacaagta ctacctgggc 3120

gtgatgaata agaagaacaa taagatcttc gatgacaagg ccattaagga gaacaagggc 3180gtgatgaata agaagaacaa taagatcttc gatgacaagg ccattaagga gaacaagggc 3180

gagggctaca agaagatcgt gtacaagctc ctgcctggcg ccaataagat gctcccgaag 3240gagggctaca agaagatcgt gtacaagctc ctgcctggcg ccaataagat gctcccgaag 3240

gtgttcttct ccgcgaagtc cattaagttc tacaacccaa gcgaggatat cctcaggatc 3300gtgttcttct ccgcgaagtc cattaagttc tacaacccaa gcgaggatat cctcaggatc 3300

aggaaccact ctacccatac aaagaacggc tcccctcaga agggctacga gaagttcgag 3360aggaaccact ctacccatac aaagaacggc tcccctcaga agggctacga gaagttcgag 3360

ttcaatatcg aggattgccg gaagttcatt gacttctaca agcagtccat cagcaagcac 3420ttcaatatcg aggattgccg gaagttcatt gacttctaca agcagtccat cagcaagcac 3420

cctgagtgga aggatttcgg cttccgcttc agcgacaccc agcggtacaa ctcaatcgat 3480cctgagtgga aggatttcgg cttccgcttc agcgacaccc agcggtacaa ctcaatcgat 3480

gagttctaca gggaggtgga gaatcagggc tacaagctca cattcgagaa catttcagag 3540gagttctaca gggaggtgga gaatcagggc tacaagctca cattcgagaa catttcagag 3540

tcttacatcg actccgtggt caatcagggc aagctctacc tgttccagat ctacaacaag 3600tcttacatcg actccgtggt caatcagggc aagctctacc tgttccagat ctacaacaag 3600

gatttcagcg cctactcaaa gggcaggccg aacctccata ccctgtactg gaaggcgctc 3660gatttcagcg cctactcaaa gggcaggccg aacctccata ccctgtactg gaaggcgctc 3660

ttcgatgagc gcaatctgca ggacgtggtc tacaagctca acggcgaggc cgagctgttc 3720ttcgatgagc gcaatctgca ggacgtggtc tacaagctca acggcgaggc cgagctgttc 3720

taccgcaagc agtctattcc gaagaagatc acacacccag cgaaggaggc catcgcgaac 3780taccgcaagc agtctattcc gaagaagatc acacacccag cgaaggaggc catcgcgaac 3780

aagaataagg acaatccgaa gaaggagtcc gtgttcgagt acgatctcat taaggacaag 3840aagaataagg acaatccgaa gaaggagtcc gtgttcgagt acgatctcat taaggacaag 3840

cggttcaccg aggataagtt cttcttccat tgcccaatca caattaactt caagtccagc 3900cggttcaccg aggataagtt cttcttccat tgcccaatca caattaactt caagtccagc 3900

ggcgccaaca agttcaatga cgagatcaat ctcctgctca aggagaaggc gaacgatgtg 3960ggcgccaaca agttcaatga cgagatcaat ctcctgctca aggagaaggc gaacgatgtg 3960

catatcctct ccattgcccg cggcgagcgg catctcgcct actacaccct ggtcgatggc 4020catatcctct ccattgcccg cggcgagcgg catctcgcct actacaccct ggtcgatggc 4020

aagggcaata tcattaagca ggataccttc aacatcattg gcaatgaccg catgaaaacc 4080aagggcaata tcattaagca ggataccttc aacatcattg gcaatgaccg catgaaaacc 4080

aactaccacg ataagctcgc cgcgatcgag aaggataggg acagcgcgcg caaggactgg 4140aactaccacg ataagctcgc cgcgatcgag aaggataggg acagcgcgcg caaggactgg 4140

aagaagatca acaatattaa ggagatgaag gagggctacc tctcacaggt ggtccatgag 4200aagaagatca acaatattaa ggagatgaag gagggctacc tctcacaggt ggtccatgag 4200

attgccaagc tcgtgatcga gtacaatgcg attgtggtgt tcgaggacct caacttcggc 4260attgccaagc tcgtgatcga gtacaatgcg attgtggtgt tcgaggacct caacttcggc 4260

ttcaagcggg gcaggttcaa ggtggagaag caggtctacc agaagctgga gaagatgctc 4320ttcaagcggg gcaggttcaa ggtggagaag caggtctacc agaagctgga gaagatgctc 4320

atcgagaagc tcaattacct ggtgttcaag gataacgagt tcgacaagac cggaggagtc 4380atcgagaagc tcaattacct ggtgttcaag gataacgagt tcgacaagac cggaggagtc 4380

ctcagggcat accagctgac cgcgccattc gagacattca agaagatggg caagcagaca 4440ctcagggcat accagctgac cgcgccattc gagacattca agaagatggg caagcagaca 4440

ggcatcatct actacgtgcc agccggcttc acctccaaga tctgccctgt gacaggcttc 4500ggcatcatct actacgtgcc agccggcttc acctccaaga tctgccctgt gacaggcttc 4500

gtcaaccagc tctaccctaa gtacgagtcc gtcagcaagt cacaggagtt cttcagcaag 4560gtcaaccagc tctaccctaa gtacgagtcc gtcagcaagt cacaggagtt cttcagcaag 4560

ttcgataaga tttgctacaa tctggacaag ggctacttcg agttctcatt cgattacaag 4620ttcgataaga tttgctacaa tctggacaag ggctacttcg agttctcatt cgattacaag 4620

aacttcggcg acaaggccgc gaagggcaag tggaccattg cgtctttcgg ctcccggctc 4680aacttcggcg acaaggccgc gaagggcaag tggaccattg cgtctttcgg ctcccggctc 4680

atcaacttca ggaatagcga taagaaccac aattgggaca ccagggaggt gtacccgaca 4740atcaacttca ggaatagcga taagaaccac aattgggaca ccagggaggt gtacccgaca 4740

aaggagctgg agaagctgct caaggactac tcaatcgagt acggccatgg cgagtgcatt 4800aaggagctgg agaagctgct caaggactac tcaatcgagt acggccatgg cgagtgcatt 4800

aaggccgcga tctgcggcga gagcgataag aagttcttcg ccaagctgac ctcagtcctc 4860aaggccgcga tctgcggcga gagcgataag aagttcttcg ccaagctgac ctcagtcctc 4860

aatacaatcc tgcagatgag gaactctaag accggcacag agctggacta cctgatttcc 4920aatacaatcc tgcagatgag gaactctaag accggcacag agctggacta cctgatttcc 4920

ccagtggccg atgtcaacgg caatttcttc gacagccgcc aggcgccgaa gaatatgcca 4980ccagtggccg atgtcaacgg caatttcttc gacagccgcc aggcgccgaa gaatatgcca 4980

caggacgccg acgccaacgg cgcataccac atcggcctca agggcctgat gctgctcgga 5040caggacgccg acgccaacgg cgcataccac atcggcctca agggcctgat gctgctcgga 5040

cgcattaaga acaatcagga gggcaagaag ctcaacctcg tgatcaagaa cgaggagtac 5100cgcattaaga acaatcagga gggcaagaag ctcaacctcg tgatcaagaa cgaggagtac 5100

ttcgagttcg tccaaaaccg gaacaattcc ggcggcagcc caaagaagaa gaggaaggtg 5160ttcgagttcg tccaaaaccg gaacaattcc ggcggcagcc caaagaagaa gaggaaggtg 5160

agcggcggca gcccaaagaa gaagcgcaag gtctag 5196agcggcggca gcccaaagaa gaagcgcaag gtctag 5196

<210> 12<210> 12

<211> 4980<211> 4980

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dLBCPF1-ABE7.10-2NLS<223> dLBCPF1-ABE7.10-2NLS

<400> 12<400> 12

atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60

tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120

gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180

ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240

atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300

tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360

aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420

gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480

aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540

ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600

gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660

atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720

gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780

catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840

aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900

ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960

ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020

atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080

ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140

tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200

tctggaggat ctagcggagg atctcttaag tcaaagctcg agaaattcac caactgttat 1260tctggaggat ctagcggagg atctcttaag tcaaagctcg agaaattcac caactgttat 1260

tcgttgagca aaacactgcg gtttaaagcg attccagtcg gcaagactca agagaatata 1320tcgttgagca aaacactgcg gtttaaagcg attccagtcg gcaagactca agagaatata 1320

gacaataagc ggctgttggt ggaagatgaa aagcgcgcgg aagactacaa aggggtgaag 1380gacaataagc ggctgttggt ggaagatgaa aagcgcgcgg aagactacaa aggggtgaag 1380

aagttgttgg acagatacta cctctctttt atcaatgatg tcttgcactc aatcaaattg 1440aagttgttgg acagatacta cctctctttt atcaatgatg tcttgcactc aatcaaattg 1440

aagaatctga acaactacat ctccctcttc agaaagaaaa caaggacaga aaaggagaat 1500aagaatctga acaactacat ctccctcttc agaaagaaaa caaggacaga aaaggagaat 1500

aaggaacttg aaaatttgga gatcaatctg aggaaagaga tcgcgaaagc ctttaaaggc 1560aaggaacttg aaaatttgga gatcaatctg aggaaagaga tcgcgaaagc ctttaaaggc 1560

aacgaaggat acaaaagtct gttcaagaag gatataattg agacaatttt gccagagttc 1620aacgaaggat acaaaagtct gttcaagaag gatataattg agacaatttt gccagagttc 1620

ctcgatgaca aggacgagat tgcgctggtc aattcgttca acggattcac aacagcattc 1680ctcgatgaca aggacgagat tgcgctggtc aattcgttca acggattcac aacagcattc 1680

acaggcttct ttgataatcg ggaaaatatg ttctctgagg aggcaaagtc cacttctatt 1740acaggcttct ttgataatcg ggaaaatatg ttctctgagg aggcaaagtc cacttctatt 1740

gcgttcaggt gtatcaatga gaatctcact aggtacattt ccaacatgga tatctttgag 1800gcgttcaggt gtatcaatga gaatctcact aggtacattt ccaacatgga tatctttgag 1800

aaggttgacg caatttttga caagcacgaa gttcaggaga ttaaggagaa gatcctcaat 1860aaggttgacg caatttttga caagcacgaa gttcaggaga ttaaggagaa gatcctcaat 1860

tccgattatg acgttgagga cttcttcgaa ggtgagtttt ttaatttcgt gctcactcaa 1920tccgattatg acgttgagga cttcttcgaa ggtgagtttt ttaatttcgt gctcactcaa 1920

gagggtatcg acgtgtataa tgcgatcatc ggtgggttcg tgactgagtc cggtgaaaag 1980gagggtatcg acgtgtataa tgcgatcatc ggtgggttcg tgactgagtc cggtgaaaag 1980

attaagggat tgaacgagta tatcaacctt tacaaccaaa agacgaaaca gaagctgcca 2040attaagggat tgaacgagta tatcaacctt tacaaccaaa agacgaaaca gaagctgcca 2040

aagttcaagc ctctttacaa acaggttctt tcagaccgcg agtcactctc gttctatggg 2100aagttcaagc ctctttacaa acaggttctt tcagaccgcg agtcactctc gttctatggg 2100

gagggctaca cttcggatga ggaagtcctg gaggtgttca ggaatactct caataagaat 2160gagggctaca cttcggatga ggaagtcctg gaggtgttca ggaatactct caataagaat 2160

tcggagattt tctcttctat aaaaaaactg gaaaagttgt ttaagaattt tgacgaatac 2220tcggagattt tctcttctat aaaaaaactg gaaaagttgt ttaagaattt tgacgaatac 2220

tctagcgccg gcatatttgt gaaaaacggc ccggccatat caacgataag taaagatatc 2280tctagcgccg gcatatttgt gaaaaacggc ccggccatat caacgataag taaagatatc 2280

ttcggcgaat ggaacgtgat cagagacaaa tggaacgcgg agtatgacga tattcacctg 2340ttcggcgaat ggaacgtgat cagagacaaa tggaacgcgg agtatgacga tattcacctg 2340

aagaagaagg ctgtcgtaac ggagaagtac gaggatgatc gcaggaaaag cttcaaaaag 2400aagaagaagg ctgtcgtaac ggagaagtac gaggatgatc gcaggaaaag cttcaaaaag 2400

atcggaagtt tcagcctgga acagttgcag gagtatgctg acgccgatct tagcgtcgtc 2460atcggaagtt tcagcctgga acagttgcag gagtatgctg acgccgatct tagcgtcgtc 2460

gagaagttga aggagataat catccaaaag gtcgacgaga tatataaagt ctatggatca 2520gagaagttga aggagataat catccaaaag gtcgacgaga tatataaagt ctatggatca 2520

agtgaaaaac tgttcgacgc cgacttcgtt ttggagaagt ccctgaagaa gaacgacgct 2580agtgaaaaac tgttcgacgc cgacttcgtt ttggagaagt ccctgaagaa gaacgacgct 2580

gttgttgcca ttatgaagga tctgctcgac agcgtgaaga gtttcgagaa ctatattaag 2640gttgttgcca ttatgaagga tctgctcgac agcgtgaaga gtttcgagaa ctatattaag 2640

gcttttttcg gggaggggaa ggagactaac agagatgagt ccttctacgg agacttcgtc 2700gcttttttcg gggagggggaa ggagactaac agagatgagt ccttctacgg agacttcgtc 2700

ctcgcgtacg atatactcct taaggtagac cacatctacg acgcaatcag aaattacgtg 2760ctcgcgtacg atatactcct taaggtagac cacatctacg acgcaatcag aaattacgtg 2760

acacaaaagc cgtacagcaa ggacaagttc aaactctact tccagaaccc ccagttcatg 2820acacaaaagc cgtacagcaa ggacaagttc aaactctact tccagaaccc ccagttcatg 2820

ggcggctggg acaaggacaa ggaaacggat tacagggcta cgatcctgag gtatggttca 2880ggcggctggg acaaggacaa ggaaacggat tacagggcta cgatcctgag gtatggttca 2880

aaatactact tggcgattat ggacaagaag tacgccaagt gtctccagaa gattgacaaa 2940aaatactact tggcgattat ggacaagaag tacgccaagt gtctccagaa gattgacaaa 2940

gacgatgtca atggcaatta tgagaagatc aactacaagc tgcttccggg tccgaacaag 3000gacgatgtca atggcaatta tgagaagatc aactacaagc tgcttccggg tccgaacaag 3000

atgctcccaa aggttttctt cagcaagaaa tggatggcct actataaccc aagcgaggac 3060atgctcccaa aggttttctt cagcaagaaa tggatggcct actataaccc aagcgaggac 3060

atccagaaga tttataagaa cggtacgttc aagaagggcg acatgttcaa tcttaacgac 3120atccagaaga tttataagaa cggtacgttc aagaagggcg acatgttcaa tcttaacgac 3120

tgtcacaagc tgatcgactt cttcaaagac tcaattagcc ggtacccaaa gtggtctaac 3180tgtcacaagc tgatcgactt cttcaaagac tcaattagcc ggtacccaaa gtggtctaac 3180

gcctatgact tcaacttttc ggaaaccgag aagtacaagg atatagccgg attttataga 3240gcctatgact tcaacttttc ggaaaccgag aagtacaagg atatagccgg attttataga 3240

gaggtggaag agcagggcta caaggtgtca ttcgagtccg ccagcaagaa ggaagtggac 3300gaggtggaag agcagggcta caaggtgtca ttcgagtccg ccagcaagaa ggaagtggac 3300

aagctcgtgg aagagggtaa gctctacatg ttccagattt ataataaaga ctttagcgat 3360aagctcgtgg aagagggtaa gctctacatg ttccagattt ataataaaga ctttagcgat 3360

aagagccacg ggacacctaa tctccacaca atgtatttca agctgctctt cgacgagaat 3420aagagccacg ggacacctaa tctccacaca atgtatttca agctgctctt cgacgagaat 3420

aaccacggcc aaatcaggtt gtcaggaggg gctgaactct tcatgcggcg cgctagcctt 3480aaccacggcc aaatcaggtt gtcaggaggg gctgaactct tcatgcggcg cgctagcctt 3480

aagaaggagg agcttgtagt ccaccctgcg aatagtccaa ttgcgaataa gaacccggac 3540aagaaggagg agcttgtagt ccaccctgcg aatagtccaa ttgcgaataa gaacccggac 3540

aatcctaaaa agactacaac attgagctac gacgtgtaca aggataagag gttttccgag 3600aatcctaaaa agactacaac attgagctac gacgtgtaca aggataagag gttttccgag 3600

gatcagtacg agctccacat cccgattgcg atcaacaagt gcccaaagaa tattttcaag 3660gatcagtacg agctccacat cccgattgcg atcaacaagt gcccaaagaa tattttcaag 3660

ataaacacag aggtgcgtgt actcctgaag catgacgaca atccttacgt cattgggatt 3720ataaacacag aggtgcgtgt actcctgaag catgacgaca atccttacgt cattgggatt 3720

gctcggggcg agaggaacct cctctatatt gtggtggtgg acgggaaggg gaacatagtc 3780gctcggggcg agaggaacct cctctatatt gtggtggtgg acgggaaggg gaacatagtc 3780

gaacagtact cccttaacga aataattaac aatttcaacg gcatccgtat caagaccgac 3840gaacagtact cccttaacga aataattaac aatttcaacg gcatccgtat caagaccgac 3840

taccattcgt tgctggacaa gaaggagaag gagagatttg aggcgcggca aaattggaca 3900taccattcgt tgctggacaa gaaggagaag gagagatttg aggcgcggca aaattggaca 3900

agtatcgaga acatcaagga actcaaagca ggttatatct ctcaagttgt gcataagata 3960agtatcgaga acatcaagga actcaaagca ggttatatct ctcaagttgt gcataagata 3960

tgcgagctgg ttgagaagta tgacgcagtg atcgctcttg aggacctcaa ctcgggcttt 4020tgcgagctgg ttgagaagta tgacgcagtg atcgctcttg aggacctcaa ctcgggcttt 4020

aagaattcta gagttaaagt ggagaagcag gtctatcaaa agttcgagaa gatgcttata 4080aagaattcta gagttaaagt ggagaagcag gtctatcaaa agttcgagaa gatgcttata 4080

gataagctca actacatggt cgataagaaa tcgaacccat gtgccaccgg cggcgcactc 4140gataagctca actacatggt cgataagaaa tcgaacccat gtgccaccgg cggcgcactc 4140

aaaggttacc aaataacaaa caaattcgag tccttcaaat cgatgagtac tcagaatggg 4200aaaggttacc aaataacaaa caaattcgag tccttcaaat cgatgagtac tcagaatggg 4200

ttcatatttt atataccggc gtggcttacg tctaagatcg acccgtcaac tggttttgtc 4260ttcatatttt atataccggc gtggcttacg tctaagatcg acccgtcaac tggttttgtc 4260

aacctgttga agacgaaata cacgtccatt gccgattcga aaaagttcat atctagtttt 4320aacctgttga agacgaaata cacgtccatt gccgattcga aaaagttcat atctagtttt 4320

gatcgtatta tgtacgtccc agaggaagat cttttcgagt ttgctctcga ctacaaaaac 4380gatcgtatta tgtacgtccc agaggaagat cttttcgagt ttgctctcga ctacaaaaac 4380

ttttcgcgga ccgatgcgga ttacattaaa aaatggaaac tctattcgta cggcaacaga 4440ttttcgcgga ccgatgcgga ttacattaaa aaatggaaac tctattcgta cggcaacaga 4440

atcaggattt ttcgcaaccc taagaagaat aacgtctttg attgggagga agtttgcttg 4500atcaggattt ttcgcaaccc taagaagaat aacgtctttg attgggagga agtttgcttg 4500

actagcgcgt acaaggagct ctttaataag tatggcatta actaccaaca gggtgatatc 4560actagcgcgt acaaggagct ctttaataag tatggcatta actaccaaca gggtgatatc 4560

agagcactgc tttgcgaaca atctgacaag gctttctact catccttcat ggctttgatg 4620agagcactgc tttgcgaaca atctgacaag gctttctact catccttcat ggctttgatg 4620

agcctgatgc tccagatgag aaattcaatt acaggcagaa ccgacgtgga tttcttgatc 4680agcctgatgc tccagatgag aaattcaatt acaggcagaa ccgacgtgga tttcttgatc 4680

tccccggtta aaaattctga tggcatcttt tacgatagca ggaactatga agcgcaagag 4740tccccggtta aaaattctga tggcatcttt tacgatagca ggaactatga agcgcaagag 4740

aatgcgattc tgccaaaaaa tgcagacgcc aacggtgcct ataacatcgc caggaaagtc 4800aatgcgattc tgccaaaaaa tgcagacgcc aacggtgcct ataacatcgc caggaaagtc 4800

ctgtgggcga tcggccagtt caaaaaggcc gaagacgaaa aattggacaa ggtcaaaatc 4860ctgtgggcga tcggccagtt caaaaaggcc gaagacgaaa aattggacaa ggtcaaaatc 4860

gctatcagca acaaagagtg gctggagtat gctcagacat ccgtaaagca ttccggcggc 4920gctatcagca acaaagagtg gctggagtat gctcagacat ccgtaaagca ttccggcggc 4920

agcccaaaga agaagaggaa ggtgagcggc ggcagcccaa agaagaagcg caaggtctag 4980agcccaaaga agaagaggaa ggtgagcggc ggcagcccaa agaagaagcg caaggtctag 4980

<210> 13<210> 13

<211> 6506<211> 6506

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dLBCPF1-ABE2-X<223> dLBCPF1-ABE2-X

<400> 13<400> 13

ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60

agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120

tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180

tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240

gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300

ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360

gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420gtttagggtt aatggtttttt atagactaat tttttagta catctatttt attctatttt 420

agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480

taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540

aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600

tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660

cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720

ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780

gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840

caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900

ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960

tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020

taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080

atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140

cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200

ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260

gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320

cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380

acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440

ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500

tctgcaggtc gaagcttgaa gcaaacatgc caaaaaagaa gagaaaggtt tcaggcggct 1560tctgcaggtc gaagcttgaa gcaaacatgc caaaaaagaa gagaaaggtt tcaggcggct 1560

cctccgaggt ggagttctct cacgagtatt ggatgaggca cgctcttaca cttgctaaga 1620cctccgaggt ggagttctct cacgagtatt ggatgaggca cgctcttaca cttgctaaga 1620

gagcttggga cgaaagagaa gtgccagttg gcgccgttct tgtgcataat aatagggtga 1680gagcttggga cgaaagagaa gtgccagttg gcgccgttct tgtgcataat aatagggtga 1680

tcggcgaggg ttggaataga ccaattggaa ggcatgatcc aacagctcac gcagagatta 1740tcggcgaggg ttggaataga ccaattggaa ggcatgatcc aacagctcac gcagagatta 1740

tggctctcag acaaggcggc ctcgttatgc agaactacag gctcattgac gctacactct 1800tggctctcag acaaggcggc ctcgttatgc agaactacag gctcattgac gctacactct 1800

acgtgacact cgaaccttgc gttatgtgcg ccggagctat gattcattct aggattggca 1860acgtgacact cgaaccttgc gttatgtgcg ccggagctat gattcattct aggattggca 1860

gggtcgtgtt tggagctagg gacgctaaaa caggagccgc cggatctctt atggacgtgt 1920gggtcgtgtt tggagctagg gacgctaaaa caggagccgc cggatctctt atggacgtgt 1920

tgcatcatcc aggcatgaac catagggtgg agattacaga gggcattctt gcagacgagt 1980tgcatcatcc aggcatgaac catagggtgg agattacaga gggcattctt gcagacgagt 1980

gcgctgctct tctttccgat ttcttcagga tgagaaggca ggagattaag gcccagaaga 2040gcgctgctct tctttccgat ttcttcagga tgagaaggca ggagattaag gcccagaaga 2040

aggctcagtc ttctacagat agcggaggat cttccggagg atctagcggc tccgagacac 2100aggctcagtc ttctacagat agcggaggat cttccggagg atctagcggc tccgagacac 2100

caggaacatc cgaaagcgct acaccagaat ctagcggagg ctcttccgga ggatcttctg 2160caggaacatc cgaaagcgct acaccagaat ctagcggagg ctcttccgga ggatcttctg 2160

aagtggagtt ctcccacgag tattggatga ggcacgctct tacacttgct aaaagggcta 2220aagtggagtt ctcccacgag tattggatga ggcacgctct tacacttgct aaaagggcta 2220

gggacgaaag ggaagttcca gttggagctg ttctcgtgct caataacagg gtgattggcg 2280gggacgaaag ggaagttcca gttggagctg ttctcgtgct caataacagg gtgattggcg 2280

agggttggaa tagagccatt ggactccatg atccaacagc tcacgcagag attatggctc 2340agggttggaa tagagccatt ggactccatg atccaacagc tcacgcagag attatggctc 2340

ttagacaagg cggcctcgtt atgcagaatt acagactcat cgacgccaca ctctacgtta 2400ttagacaagg cggcctcgtt atgcagaatt acagactcat cgacgccaca ctctacgtta 2400

ccttcgaacc ttgcgttatg tgcgccggag ctatgatcca ttctaggatt ggcagggtcg 2460ccttcgaacc ttgcgttatg tgcgccggag ctatgatcca ttctaggatt ggcagggtcg 2460

tgttcggcgt tagaaacgct aagacaggag ctgcaggctc tcttatggac gttcttcatt 2520tgttcggcgt tagaaacgct aagacaggag ctgcaggctc tcttatggac gttcttcatt 2520

acccaggcat gaatcataga gtggagatca cagaaggcat tcttgcagac gagtgcgcag 2580acccaggcat gaatcataga gtggagatca cagaaggcat tcttgcagac gagtgcgcag 2580

ctctcctttg ctatttcttc aggatgccga ggcaagtttt caacgctcag aagaaggccc 2640ctctcctttg ctatttcttc aggatgccga ggcaagtttt caacgctcag aagaaggccc 2640

agtcttctac agattccggc ggatcttctg gaggatctag cggctccgag acaccaggaa 2700agtcttctac agattccggc ggatcttctg gaggatctag cggctccgag acaccaggaa 2700

catccgaatc cgctacacca gagtcttctg gaggatctag cggaggatct cttaagtcaa 2760catccgaatc cgctacacca gagtcttctg gaggatctag cggaggatct cttaagtcaa 2760

agctcgagaa attcaccaac tgttattcgt tgagcaaaac actgcggttt aaagcgattc 2820agctcgagaa attcaccaac tgttattcgt tgagcaaaac actgcggttt aaagcgattc 2820

cagtcggcaa gactcaagag aatatagaca ataagcggct gttggtggaa gatgaaaagc 2880cagtcggcaa gactcaagag aatatagaca ataagcggct gttggtggaa gatgaaaagc 2880

gcgcggaaga ctacaaaggg gtgaagaagt tgttggacag atactacctc tcttttatca 2940gcgcggaaga ctacaaaggg gtgaagaagt tgttggacag atactacctc tcttttatca 2940

atgatgtctt gcactcaatc aaattgaaga atctgaacaa ctacatctcc ctcttcagaa 3000atgatgtctt gcactcaatc aaattgaaga atctgaacaa ctacatctcc ctcttcagaa 3000

agaaaacaag gacagaaaag gagaataagg aacttgaaaa tttggagatc aatctgagga 3060agaaaacaag gacagaaaag gagaataagg aacttgaaaa tttggagatc aatctgagga 3060

aagagatcgc gaaagccttt aaaggcaacg aaggatacaa aagtctgttc aagaaggata 3120aagagatcgc gaaagccttt aaaggcaacg aaggatacaa aagtctgttc aagaaggata 3120

taattgagac aattttgcca gagttcctcg atgacaagga cgagattgcg ctggtcaatt 3180taattgagac aattttgcca gagttcctcg atgacaagga cgagattgcg ctggtcaatt 3180

cgttcaacgg attcacaaca gcattcacag gcttctttga taatcgggaa aatatgttct 3240cgttcaacgg attcacaaca gcattcacag gcttctttga taatcgggaa aatatgttct 3240

ctgaggaggc aaagtccact tctattgcgt tcaggtgtat caatgagaat ctcactaggt 3300ctgaggaggc aaagtccact tctattgcgt tcaggtgtat caatgagaat ctcactaggt 3300

acatttccaa catggatatc tttgagaagg ttgacgcaat ttttgacaag cacgaagttc 3360acatttccaa catggatatc tttgagaagg ttgacgcaat ttttgacaag cacgaagttc 3360

aggagattaa ggagaagatc ctcaattccg attatgacgt tgaggacttc ttcgaaggtg 3420aggagattaa ggagaagatc ctcaattccg attatgacgt tgaggacttc ttcgaaggtg 3420

agttttttaa tttcgtgctc actcaagagg gtatcgacgt gtataatgcg atcatcggtg 3480agttttttaa tttcgtgctc actcaagagg gtatcgacgt gtataatgcg atcatcggtg 3480

ggttcgtgac tgagtccggt gaaaagatta agggattgaa cgagtatatc aacctttaca 3540ggttcgtgac tgagtccggt gaaaagatta agggattgaa cgagtatatc aacctttaca 3540

accaaaagac gaaacagaag ctgccaaagt tcaagcctct ttacaaacag gttctttcag 3600accaaaagac gaaacagaag ctgccaaagt tcaagcctct ttacaaacag gttctttcag 3600

accgcgagtc actctcgttc tatggggagg gctacacttc ggatgaggaa gtcctggagg 3660accgcgagtc actctcgttc tatggggagg gctacacttc ggatgaggaa gtcctggagg 3660

tgttcaggaa tactctcaat aagaattcgg agattttctc ttctataaaa aaactggaaa 3720tgttcaggaa tactctcaat aagaattcgg agattttctc ttctataaaa aaactggaaa 3720

agttgtttaa gaattttgac gaatactcta gcgccggcat atttgtgaaa aacggcccgg 3780agttgtttaa gaattttgac gaatactcta gcgccggcat atttgtgaaa aacggcccgg 3780

ccatatcaac gataagtaaa gatatcttcg gcgaatggaa cgtgatcaga gacaaatgga 3840ccatatcaac gataagtaaa gatatcttcg gcgaatggaa cgtgatcaga gacaaatgga 3840

acgcggagta tgacgatatt cacctgaaga agaaggctgt cgtaacggag aagtacgagg 3900acgcggagta tgacgatatt cacctgaaga agaaggctgt cgtaacggag aagtacgagg 3900

atgatcgcag gaaaagcttc aaaaagatcg gaagtttcag cctggaacag ttgcaggagt 3960atgatcgcag gaaaagcttc aaaaagatcg gaagtttcag cctggaacag ttgcaggagt 3960

atgctgacgc cgatcttagc gtcgtcgaga agttgaagga gataatcatc caaaaggtcg 4020atgctgacgc cgatcttagc gtcgtcgaga agttgaagga gataatcatc caaaaggtcg 4020

acgagatata taaagtctat ggatcaagtg aaaaactgtt cgacgccgac ttcgttttgg 4080acgagatata taaagtctat ggatcaagtg aaaaactgtt cgacgccgac ttcgttttgg 4080

agaagtccct gaagaagaac gacgctgttg ttgccattat gaaggatctg ctcgacagcg 4140agaagtccct gaagaagaac gacgctgttg ttgccattat gaaggatctg ctcgacagcg 4140

tgaagagttt cgagaactat attaaggctt ttttcgggga ggggaaggag actaacagag 4200tgaagagttt cgagaactat attaaggctt ttttcgggga ggggaaggag actaacagag 4200

atgagtcctt ctacggagac ttcgtcctcg cgtacgatat actccttaag gtagaccaca 4260atgagtcctt ctacggagac ttcgtcctcg cgtacgatat actccttaag gtagaccaca 4260

tctacgacgc aatcagaaat tacgtgacac aaaagccgta cagcaaggac aagttcaaac 4320tctacgacgc aatcagaaat tacgtgacac aaaagccgta cagcaaggac aagttcaaac 4320

tctacttcca gaacccccag ttcatgggcg gctgggacaa ggacaaggaa acggattaca 4380tctacttcca gaacccccag ttcatgggcg gctgggacaa ggacaaggaa acggattaca 4380

gggctacgat cctgaggtat ggttcaaaat actacttggc gattatggac aagaagtacg 4440gggctacgat cctgaggtat ggttcaaaat actacttggc gattatggac aagaagtacg 4440

ccaagtgtct ccagaagatt gacaaagacg atgtcaatgg caattatgag aagatcaact 4500ccaagtgtct ccagaagatt gacaaagacg atgtcaatgg caattatgag aagatcaact 4500

acaagctgct tccgggtccg aacaagatgc tcccaaaggt tttcttcagc aagaaatgga 4560acaagctgct tccgggtccg aacaagatgc tcccaaaggt tttcttcagc aagaaatgga 4560

tggcctacta taacccaagc gaggacatcc agaagattta taagaacggt acgttcaaga 4620tggcctacta taacccaagc gaggacatcc agaagattta taagaacggt acgttcaaga 4620

agggcgacat gttcaatctt aacgactgtc acaagctgat cgacttcttc aaagactcaa 4680agggcgacat gttcaatctt aacgactgtc acaagctgat cgacttcttc aaagactcaa 4680

ttagccggta cccaaagtgg tctaacgcct atgacttcaa cttttcggaa accgagaagt 4740ttagccggta cccaaagtgg tctaacgcct atgacttcaa cttttcggaa accgagaagt 4740

acaaggatat agccggattt tatagagagg tggaagagca gggctacaag gtgtcattcg 4800acaaggatat agccggattt tatagagagg tggaagagca gggctacaag gtgtcattcg 4800

agtccgccag caagaaggaa gtggacaagc tcgtggaaga gggtaagctc tacatgttcc 4860agtccgccag caagaaggaa gtggacaagc tcgtggaaga gggtaagctc tacatgttcc 4860

agatttataa taaagacttt agcgataaga gccacgggac acctaatctc cacacaatgt 4920agatttataa taaagacttt agcgataaga gccacgggac acctaatctc cacacaatgt 4920

atttcaagct gctcttcgac gagaataacc acggccaaat caggttgtca ggaggggctg 4980atttcaagct gctcttcgac gagaataacc acggccaaat caggttgtca ggaggggctg 4980

aactcttcat gcggcgcgct agccttaaga aggaggagct tgtagtccac cctgcgaata 5040aactcttcat gcggcgcgct agccttaaga aggaggagct tgtagtccac cctgcgaata 5040

gtccaattgc gaataagaac ccggacaatc ctaaaaagac tacaacattg agctacgacg 5100gtccaattgc gaataagaac ccggacaatc ctaaaaagac tacaacattg agctacgacg 5100

tgtacaagga taagaggttt tccgaggatc agtacgagct ccacatcccg attgcgatca 5160tgtacaagga taagaggttt tccgaggatc agtacgagct ccacatcccg attgcgatca 5160

acaagtgccc aaagaatatt ttcaagataa acacagaggt gcgtgtactc ctgaagcatg 5220acaagtgccc aaagaatatt ttcaagataa acacagaggt gcgtgtactc ctgaagcatg 5220

acgacaatcc ttacgtcatt gggattgctc ggggcgagag gaacctcctc tatattgtgg 5280acgacaatcc ttacgtcatt gggattgctc ggggcgagag gaacctcctc tatattgtgg 5280

tggtggacgg gaaggggaac atagtcgaac agtactccct taacgaaata attaacaatt 5340tggtggacgg gaaggggaac atagtcgaac agtactccct taacgaaata attaacaatt 5340

tcaacggcat ccgtatcaag accgactacc attcgttgct ggacaagaag gagaaggaga 5400tcaacggcat ccgtatcaag accgactacc attcgttgct ggacaagaag gagaaggaga 5400

gatttgaggc gcggcaaaat tggacaagta tcgagaacat caaggaactc aaagcaggtt 5460gatttgaggc gcggcaaaat tggacaagta tcgagaacat caaggaactc aaagcaggtt 5460

atatctctca agttgtgcat aagatatgcg agctggttga gaagtatgac gcagtgatcg 5520atatctctca agttgtgcat aagatatgcg agctggttga gaagtatgac gcagtgatcg 5520

ctcttgagga cctcaactcg ggctttaaga attctagagt taaagtggag aagcaggtct 5580ctcttgagga cctcaactcg ggctttaaga attctagagt taaagtggag aagcaggtct 5580

atcaaaagtt cgagaagatg cttatagata agctcaacta catggtcgat aagaaatcga 5640atcaaaagtt cgagaagatg cttatagata agctcaacta catggtcgat aagaaatcga 5640

acccatgtgc caccggcggc gcactcaaag gttaccaaat aacaaacaaa ttcgagtcct 5700acccatgtgc caccggcggc gcactcaaag gttaccaaat aacaaacaaa ttcgagtcct 5700

tcaaatcgat gagtactcag aatgggttca tattttatat accggcgtgg cttacgtcta 5760tcaaatcgat gagtactcag aatgggttca tattttatat accggcgtgg cttacgtcta 5760

agatcgaccc gtcaactggt tttgtcaacc tgttgaagac gaaatacacg tccattgccg 5820agatcgaccc gtcaactggt tttgtcaacc tgttgaagac gaaatacacg tccattgccg 5820

attcgaaaaa gttcatatct agttttgatc gtattatgta cgtcccagag gaagatcttt 5880attcgaaaaa gttcatatct agttttgatc gtattatgta cgtcccagag gaagatcttt 5880

tcgagtttgc tctcgactac aaaaactttt cgcggaccga tgcggattac attaaaaaat 5940tcgagtttgc tctcgactac aaaaactttt cgcggaccga tgcggattac attaaaaaat 5940

ggaaactcta ttcgtacggc aacagaatca ggatttttcg caaccctaag aagaataacg 6000ggaaactcta ttcgtacggc aacagaatca ggatttttcg caaccctaag aagaataacg 6000

tctttgattg ggaggaagtt tgcttgacta gcgcgtacaa ggagctcttt aataagtatg 6060tctttgattg ggaggaagtt tgcttgacta gcgcgtacaa ggagctcttt aataagtatg 6060

gcattaacta ccaacagggt gatatcagag cactgctttg cgaacaatct gacaaggctt 6120gcattaacta ccaacagggt gatatcagag cactgctttg cgaacaatct gacaaggctt 6120

tctactcatc cttcatggct ttgatgagcc tgatgctcca gatgagaaat tcaattacag 6180tctactcatc cttcatggct ttgatgagcc tgatgctcca gatgagaaat tcaattacag 6180

gcagaaccga cgtggatttc ttgatctccc cggttaaaaa ttctgatggc atcttttacg 6240gcagaaccga cgtggatttc ttgatctccc cggttaaaaa ttctgatggc atcttttacg 6240

atagcaggaa ctatgaagcg caagagaatg cgattctgcc aaaaaatgca gacgccaacg 6300atagcaggaa ctatgaagcg caagagaatg cgattctgcc aaaaaatgca gacgccaacg 6300

gtgcctataa catcgccagg aaagtcctgt gggcgatcgg ccagttcaaa aaggccgaag 6360gtgcctataa catcgccagg aaagtcctgt gggcgatcgg ccagttcaaa aaggccgaag 6360

acgaaaaatt ggacaaggtc aaaatcgcta tcagcaacaa agagtggctg gagtatgctc 6420acgaaaaatt ggacaaggtc aaaatcgcta tcagcaacaa agagtggctg gagtatgctc 6420

agacatccgt aaagcattcc ggcggcagcc caaagaagaa gaggaaggtg agcggcggca 6480agacatccgt aaagcattcc ggcggcagcc caaagaagaa gaggaaggtg agcggcggca 6480

gcccaaagaa gaagcgcaag gtctag 6506gcccaaagaa gaagcgcaag gtctag 6506

<210> 14<210> 14

<211> 4971<211> 4971

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-ABE2-X2<223> LBCPF1-ABE2-X2

<400> 14<400> 14

atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60

gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120

gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180

tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240

ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300

ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360

aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420

gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480

atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540

actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600

gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660

gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720gaaggtgagt ttttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720

atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780

ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840

ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900

ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960

ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020

ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080

aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140

tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200

caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagt aatcatccaa 1260

aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320

gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380

gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440

aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500

gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560

ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620

gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680

aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740

atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800

aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860

ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920

gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980

gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040

tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100

atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160

acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220

ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280

gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340

tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400

gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460

aagcatgacg acaatcctta cgtcattggg attgctcggg gcgagaggaa cctcctctat 2520aagcatgacg acaatcctta cgtcattggg attgctcggg gcgagaggaa cctcctctat 2520

attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580

aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640

aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700

gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760

gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820

caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880

aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940

gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000

acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060

attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120

gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180

aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240

aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300

aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360

aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420

attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480

ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540

gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600

gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660

tatgctcaga catccgtaaa gcataagctt atgccaaaaa agaagagaaa ggtttcaggc 3720tatgctcaga catccgtaaa gcataagctt atgccaaaaa agaagagaaa ggtttcaggc 3720

ggctcctccg aggtggagtt ctctcacgag tattggatga ggcacgctct tacacttgct 3780ggctcctccg aggtggagtt ctctcacgag tattggatga ggcacgctct tacacttgct 3780

aagagagctt gggacgaaag agaagtgcca gttggcgccg ttcttgtgca taataatagg 3840aagagagctt gggacgaaag agaagtgcca gttggcgccg ttcttgtgca taataatagg 3840

gtgatcggcg agggttggaa tagaccaatt ggaaggcatg atccaacagc tcacgcagag 3900gtgatcggcg agggttggaa tagaccaatt ggaaggcatg atccaacagc tcacgcagag 3900

attatggctc tcagacaagg cggcctcgtt atgcagaact acaggctcat tgacgctaca 3960attatggctc tcagacaagg cggcctcgtt atgcagaact acaggctcat tgacgctaca 3960

ctctacgtga cactcgaacc ttgcgttatg tgcgccggag ctatgattca ttctaggatt 4020ctctacgtga cactcgaacc ttgcgttatg tgcgccggag ctatgattca ttctaggatt 4020

ggcagggtcg tgtttggagc tagggacgct aaaacaggag ccgccggatc tcttatggac 4080ggcagggtcg tgtttggagc tagggacgct aaaacaggag ccgccggatc tcttatggac 4080

gtgttgcatc atccaggcat gaaccatagg gtggagatta cagagggcat tcttgcagac 4140gtgttgcatc atccaggcat gaaccatagg gtggagatta cagagggcat tcttgcagac 4140

gagtgcgctg ctcttctttc cgatttcttc aggatgagaa ggcaggagat taaggcccag 4200gagtgcgctg ctcttctttc cgatttcttc aggatgagaa ggcaggagat taaggcccag 4200

aagaaggctc agtcttctac agatagcgga ggatcttccg gaggatctag cggctccgag 4260aagaaggctc agtcttctac agatagcgga ggatcttccg gaggatctag cggctccgag 4260

acaccaggaa catccgaaag cgctacacca gaatctagcg gaggctcttc cggaggatct 4320acaccaggaa catccgaaag cgctacacca gaatctagcg gaggctcttc cggaggatct 4320

tctgaagtgg agttctccca cgagtattgg atgaggcacg ctcttacact tgctaaaagg 4380tctgaagtgg agttctccca cgagtattgg atgaggcacg ctcttacact tgctaaaagg 4380

gctagggacg aaagggaagt tccagttgga gctgttctcg tgctcaataa cagggtgatt 4440gctagggacg aaagggaagt tccagttgga gctgttctcg tgctcaataa cagggtgatt 4440

ggcgagggtt ggaatagagc cattggactc catgatccaa cagctcacgc agagattatg 4500ggcgagggtt ggaatagagc cattggactc catgatccaa cagctcacgc agagattatg 4500

gctcttagac aaggcggcct cgttatgcag aattacagac tcatcgacgc cacactctac 4560gctcttagac aaggcggcct cgttatgcag aattacagac tcatcgacgc cacactctac 4560

gttaccttcg aaccttgcgt tatgtgcgcc ggagctatga tccattctag gattggcagg 4620gttaccttcg aaccttgcgt tatgtgcgcc ggagctatga tccattctag gattggcagg 4620

gtcgtgttcg gcgttagaaa cgctaagaca ggagctgcag gctctcttat ggacgttctt 4680gtcgtgttcg gcgttagaaa cgctaagaca ggagctgcag gctctcttat ggacgttctt 4680

cattacccag gcatgaatca tagagtggag atcacagaag gcattcttgc agacgagtgc 4740cattacccag gcatgaatca tagagtggag atcacagaag gcattcttgc agacgagtgc 4740

gcagctctcc tttgctattt cttcaggatg ccgaggcaag ttttcaacgc tcagaagaag 4800gcagctctcc tttgctattt cttcaggatg ccgaggcaag ttttcaacgc tcagaagaag 4800

gcccagtctt ctacagattc cggcggatct tctggaggat ctagcggctc cgagacacca 4860gcccagtctt ctacagattc cggcggatct tctggaggat ctagcggctc cgagacacca 4860

ggaacatccg aatccgctac accagagtct tctggaggat ctagcggagg atctcttaag 4920ggaacatccg aatccgctac accagagtct tctggaggat ctagcggagg atctcttaag 4920

aagagaccag cagctacaaa gaaggccgga caagctaaga agaagaagta g 4971aagagaccag cagctacaaa gaaggccgga caagctaaga agaagaagta g 4971

<210> 15<210> 15

<211> 4992<211> 4992

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-ABE2-X3<223> LBCPF1-ABE2-X3

<400> 15<400> 15

atgccaaaaa agaagagaaa ggtttcaaag ctcgagaaat tcaccaactg ttattcgttg 60atgccaaaaa agaagagaaa ggtttcaaag ctcgagaaat tcaccaactg ttattcgttg 60

agcaaaacac tgcggtttaa agcgattcca gtcggcaaga ctcaagagaa tatagacaat 120agcaaaacac tgcggtttaa agcgattcca gtcggcaaga ctcaagagaa tatagacaat 120

aagcggctgt tggtggaaga tgaaaagcgc gcggaagact acaaaggggt gaagaagttg 180aagcggctgt tggtggaaga tgaaaagcgc gcggaagact acaaaggggt gaagaagttg 180

ttggacagat actacctctc ttttatcaat gatgtcttgc actcaatcaa attgaagaat 240ttggacagat actacctctc ttttatcaat gatgtcttgc actcaatcaa attgaagaat 240

ctgaacaact acatctccct cttcagaaag aaaacaagga cagaaaagga gaataaggaa 300ctgaacaact acatctccct cttcagaaag aaaacaagga cagaaaagga gaataaggaa 300

cttgaaaatt tggagatcaa tctgaggaaa gagatcgcga aagcctttaa aggcaacgaa 360cttgaaaatt tggagatcaa tctgaggaaa gagatcgcga aagcctttaa aggcaacgaa 360

ggatacaaaa gtctgttcaa gaaggatata attgagacaa ttttgccaga gttcctcgat 420ggatacaaaa gtctgttcaa gaaggatata attgagacaa ttttgccaga gttcctcgat 420

gacaaggacg agattgcgct ggtcaattcg ttcaacggat tcacaacagc attcacaggc 480gacaaggacg agattgcgct ggtcaattcg ttcaacggat tcacaacagc attcacaggc 480

ttctttgata atcgggaaaa tatgttctct gaggaggcaa agtccacttc tattgcgttc 540ttctttgata atcgggaaaa tatgttctct gaggaggcaa agtccacttc tattgcgttc 540

aggtgtatca atgagaatct cactaggtac atttccaaca tggatatctt tgagaaggtt 600aggtgtatca atgagaatct cactaggtac atttccaaca tggatatctt tgagaaggtt 600

gacgcaattt ttgacaagca cgaagttcag gagattaagg agaagatcct caattccgat 660gacgcaattt ttgacaagca cgaagttcag gagattaagg agaagatcct caattccgat 660

tatgacgttg aggacttctt cgaaggtgag ttttttaatt tcgtgctcac tcaagagggt 720tatgacgttg aggacttctt cgaaggtgag ttttttaatt tcgtgctcac tcaagagggt 720

atcgacgtgt ataatgcgat catcggtggg ttcgtgactg agtccggtga aaagattaag 780atcgacgtgt ataatgcgat catcggtggg ttcgtgactg agtccggtga aaagattaag 780

ggattgaacg agtatatcaa cctttacaac caaaagacga aacagaagct gccaaagttc 840ggattgaacg agtatatcaa cctttacaac caaaagacga aacagaagct gccaaagttc 840

aagcctcttt acaaacaggt tctttcagac cgcgagtcac tctcgttcta tggggagggc 900aagcctcttt acaaacaggt tctttcagac cgcgagtcac tctcgttcta tggggagggc 900

tacacttcgg atgaggaagt cctggaggtg ttcaggaata ctctcaataa gaattcggag 960tacacttcgg atgaggaagt cctggaggtg ttcaggaata ctctcaataa gaattcggag 960

attttctctt ctataaaaaa actggaaaag ttgtttaaga attttgacga atactctagc 1020attttctctt ctataaaaaa actggaaaag ttgtttaaga attttgacga atactctagc 1020

gccggcatat ttgtgaaaaa cggcccggcc atatcaacga taagtaaaga tatcttcggc 1080gccggcatat ttgtgaaaaa cggcccggcc atatcaacga taagtaaaga tatcttcggc 1080

gaatggaacg tgatcagaga caaatggaac gcggagtatg acgatattca cctgaagaag 1140gaatggaacg tgatcagaga caaatggaac gcggagtatg acgatattca cctgaagaag 1140

aaggctgtcg taacggagaa gtacgaggat gatcgcagga aaagcttcaa aaagatcgga 1200aaggctgtcg taacggagaa gtacgaggat gatcgcagga aaagcttcaa aaagatcgga 1200

agtttcagcc tggaacagtt gcaggagtat gctgacgccg atcttagcgt cgtcgagaag 1260agtttcagcc tggaacagtt gcaggagtat gctgacgccg atcttagcgt cgtcgagaag 1260

ttgaaggaga taatcatcca aaaggtcgac gagatatata aagtctatgg atcaagtgaa 1320ttgaaggaga taatcatcca aaaggtcgac gagatatata aagtctatgg atcaagtgaa 1320

aaactgttcg acgccgactt cgttttggag aagtccctga agaagaacga cgctgttgtt 1380aaactgttcg acgccgactt cgttttggag aagtccctga agaagaacga cgctgttgtt 1380

gccattatga aggatctgct cgacagcgtg aagagtttcg agaactatat taaggctttt 1440gccattatga aggatctgct cgacagcgtg aagagtttcg agaactatat taaggctttt 1440

ttcggggagg ggaaggagac taacagagat gagtccttct acggagactt cgtcctcgcg 1500ttcggggagg ggaaggagac taacagagat gagtccttct acggagactt cgtcctcgcg 1500

tacgatatac tccttaaggt agaccacatc tacgacgcaa tcagaaatta cgtgacacaa 1560tacgatatac tccttaaggt agaccacatc tacgacgcaa tcagaaatta cgtgacacaa 1560

aagccgtaca gcaaggacaa gttcaaactc tacttccaga acccccagtt catgggcggc 1620aagccgtaca gcaaggacaa gttcaaactc tacttccaga acccccagtt catgggcggc 1620

tgggacaagg acaaggaaac ggattacagg gctacgatcc tgaggtatgg ttcaaaatac 1680tgggacaagg acaaggaaac ggattacagg gctacgatcc tgaggtatgg ttcaaaatac 1680

tacttggcga ttatggacaa gaagtacgcc aagtgtctcc agaagattga caaagacgat 1740tacttggcga ttatggacaa gaagtacgcc aagtgtctcc agaagattga caaagacgat 1740

gtcaatggca attatgagaa gatcaactac aagctgcttc cgggtccgaa caagatgctc 1800gtcaatggca attatgagaa gatcaactac aagctgcttc cgggtccgaa caagatgctc 1800

ccaaaggttt tcttcagcaa gaaatggatg gcctactata acccaagcga ggacatccag 1860ccaaaggttt tcttcagcaa gaaatggatg gcctactata acccaagcga ggacatccag 1860

aagatttata agaacggtac gttcaagaag ggcgacatgt tcaatcttaa cgactgtcac 1920aagatttata agaacggtac gttcaagaag ggcgacatgt tcaatcttaa cgactgtcac 1920

aagctgatcg acttcttcaa agactcaatt agccggtacc caaagtggtc taacgcctat 1980aagctgatcg acttcttcaa agactcaatt agccggtacc caaagtggtc taacgcctat 1980

gacttcaact tttcggaaac cgagaagtac aaggatatag ccggatttta tagagaggtg 2040gacttcaact tttcggaaac cgagaagtac aaggatatag ccggatttta tagagaggtg 2040

gaagagcagg gctacaaggt gtcattcgag tccgccagca agaaggaagt ggacaagctc 2100gaagagcagg gctacaaggt gtcattcgag tccgccagca agaaggaagt ggacaagctc 2100

gtggaagagg gtaagctcta catgttccag atttataata aagactttag cgataagagc 2160gtggaagagg gtaagctcta catgttccag atttataata aagactttag cgataagagc 2160

cacgggacac ctaatctcca cacaatgtat ttcaagctgc tcttcgacga gaataaccac 2220cacgggacac ctaatctcca cacaatgtat ttcaagctgc tcttcgacga gaataaccac 2220

ggccaaatca ggttgtcagg aggggctgaa ctcttcatgc ggcgcgctag ccttaagaag 2280ggccaaatca ggttgtcagg aggggctgaa ctcttcatgc ggcgcgctag ccttaagaag 2280

gaggagcttg tagtccaccc tgcgaatagt ccaattgcga ataagaaccc ggacaatcct 2340gaggagcttg tagtccaccc tgcgaatagt ccaattgcga ataagaaccc ggacaatcct 2340

aaaaagacta caacattgag ctacgacgtg tacaaggata agaggttttc cgaggatcag 2400aaaaagacta caacattgag ctacgacgtg tacaaggata agaggttttc cgaggatcag 2400

tacgagctcc acatcccgat tgcgatcaac aagtgcccaa agaatatttt caagataaac 2460tacgagctcc acatcccgat tgcgatcaac aagtgcccaa agaatatttt caagataaac 2460

acagaggtgc gtgtactcct gaagcatgac gacaatcctt acgtcattgg gattgctcgg 2520acagaggtgc gtgtactcct gaagcatgac gacaatcctt acgtcattgg gattgctcgg 2520

ggcgagagga acctcctcta tattgtggtg gtggacggga aggggaacat agtcgaacag 2580ggcgagagga acctcctcta tattgtggtg gtggacggga aggggaacat agtcgaacag 2580

tactccctta acgaaataat taacaatttc aacggcatcc gtatcaagac cgactaccat 2640tactccctta acgaaataat taacaatttc aacggcatcc gtatcaagac cgactaccat 2640

tcgttgctgg acaagaagga gaaggagaga tttgaggcgc ggcaaaattg gacaagtatc 2700tcgttgctgg acaagaagga gaaggagaga tttgaggcgc ggcaaaattg gacaagtatc 2700

gagaacatca aggaactcaa agcaggttat atctctcaag ttgtgcataa gatatgcgag 2760gagaacatca aggaactcaa agcaggttat atctctcaag ttgtgcataa gatatgcgag 2760

ctggttgaga agtatgacgc agtgatcgct cttgaggacc tcaactcggg ctttaagaat 2820ctggttgaga agtatgacgc agtgatcgct cttgaggacc tcaactcggg ctttaagaat 2820

tctagagtta aagtggagaa gcaggtctat caaaagttcg agaagatgct tatagataag 2880tctagagtta aagtggagaa gcaggtctat caaaagttcg agaagatgct tatagataag 2880

ctcaactaca tggtcgataa gaaatcgaac ccatgtgcca ccggcggcgc actcaaaggt 2940ctcaactaca tggtcgataa gaaatcgaac ccatgtgcca ccggcggcgc actcaaaggt 2940

taccaaataa caaacaaatt cgagtccttc aaatcgatga gtactcagaa tgggttcata 3000taccaaataa caaacaaatt cgagtccttc aaatcgatga gtactcagaa tgggttcata 3000

ttttatatac cggcgtggct tacgtctaag atcgacccgt caactggttt tgtcaacctg 3060ttttatatac cggcgtggct tacgtctaag atcgacccgt caactggttt tgtcaacctg 3060

ttgaagacga aatacacgtc cattgccgat tcgaaaaagt tcatatctag ttttgatcgt 3120ttgaagacga aatacacgtc cattgccgat tcgaaaaagt tcatatctag ttttgatcgt 3120

attatgtacg tcccagagga agatcttttc gagtttgctc tcgactacaa aaacttttcg 3180attatgtacg tcccagagga agatcttttc gagtttgctc tcgactacaa aaacttttcg 3180

cggaccgatg cggattacat taaaaaatgg aaactctatt cgtacggcaa cagaatcagg 3240cggaccgatg cggattacat taaaaaatgg aaactctatt cgtacggcaa cagaatcagg 3240

atttttcgca accctaagaa gaataacgtc tttgattggg aggaagtttg cttgactagc 3300atttttcgca accctaagaa gaataacgtc tttgattggg aggaagtttg cttgactagc 3300

gcgtacaagg agctctttaa taagtatggc attaactacc aacagggtga tatcagagca 3360gcgtacaagg agctctttaa taagtatggc attaactacc aacagggtga tatcagagca 3360

ctgctttgcg aacaatctga caaggctttc tactcatcct tcatggcttt gatgagcctg 3420ctgctttgcg aacaatctga caaggctttc tactcatcct tcatggcttt gatgagcctg 3420

atgctccaga tgagaaattc aattacaggc agaaccgacg tggatttctt gatctccccg 3480atgctccaga tgagaaattc aattacaggc agaaccgacg tggatttctt gatctccccg 3480

gttaaaaatt ctgatggcat cttttacgat agcaggaact atgaagcgca agagaatgcg 3540gttaaaaatt ctgatggcat cttttacgat agcaggaact atgaagcgca agagaatgcg 3540

attctgccaa aaaatgcaga cgccaacggt gcctataaca tcgccaggaa agtcctgtgg 3600attctgccaa aaaatgcaga cgccaacggt gcctataaca tcgccaggaa agtcctgtgg 3600

gcgatcggcc agttcaaaaa ggccgaagac gaaaaattgg acaaggtcaa aatcgctatc 3660gcgatcggcc agttcaaaaa ggccgaagac gaaaaattgg acaaggtcaa aatcgctatc 3660

agcaacaaag agtggctgga gtatgctcag acatccgtaa agcataagct tatgccaaaa 3720agcaacaaag agtggctgga gtatgctcag acatccgtaa agcataagct tatgccaaaa 3720

aagaagagaa aggtttcagg cggctcctcc gaggtggagt tctctcacga gtattggatg 3780aagaagagaa aggtttcagg cggctcctcc gaggtggagt tctctcacga gtattggatg 3780

aggcacgctc ttacacttgc taagagagct tgggacgaaa gagaagtgcc agttggcgcc 3840aggcacgctc ttacacttgc taagagagct tgggacgaaa gagaagtgcc agttggcgcc 3840

gttcttgtgc ataataatag ggtgatcggc gagggttgga atagaccaat tggaaggcat 3900gttcttgtgc ataataatag ggtgatcggc gagggttgga atagaccaat tggaaggcat 3900

gatccaacag ctcacgcaga gattatggct ctcagacaag gcggcctcgt tatgcagaac 3960gatccaacag ctcacgcaga gattatggct ctcagacaag gcggcctcgt tatgcagaac 3960

tacaggctca ttgacgctac actctacgtg acactcgaac cttgcgttat gtgcgccgga 4020tacaggctca ttgacgctac actctacgtg acactcgaac cttgcgttat gtgcgccgga 4020

gctatgattc attctaggat tggcagggtc gtgtttggag ctagggacgc taaaacagga 4080gctatgattc attctaggat tggcagggtc gtgtttggag ctagggacgc taaaacagga 4080

gccgccggat ctcttatgga cgtgttgcat catccaggca tgaaccatag ggtggagatt 4140gccgccggat ctcttatgga cgtgttgcat catccaggca tgaaccatag ggtggagatt 4140

acagagggca ttcttgcaga cgagtgcgct gctcttcttt ccgatttctt caggatgaga 4200acagagggca ttcttgcaga cgagtgcgct gctcttcttt ccgatttctt caggatgaga 4200

aggcaggaga ttaaggccca gaagaaggct cagtcttcta cagatagcgg aggatcttcc 4260aggcaggaga ttaaggccca gaagaaggct cagtcttcta cagatagcgg aggatcttcc 4260

ggaggatcta gcggctccga gacaccagga acatccgaaa gcgctacacc agaatctagc 4320ggaggatcta gcggctccga gacaccagga acatccgaaa gcgctacacc agaatctagc 4320

ggaggctctt ccggaggatc ttctgaagtg gagttctccc acgagtattg gatgaggcac 4380ggaggctctt ccggaggatc ttctgaagtg gagttctccc acgagtattg gatgaggcac 4380

gctcttacac ttgctaaaag ggctagggac gaaagggaag ttccagttgg agctgttctc 4440gctcttacac ttgctaaaag ggctagggac gaaagggaag ttccagttgg agctgttctc 4440

gtgctcaata acagggtgat tggcgagggt tggaatagag ccattggact ccatgatcca 4500gtgctcaata acagggtgat tggcgagggt tggaatagag ccattggact ccatgatcca 4500

acagctcacg cagagattat ggctcttaga caaggcggcc tcgttatgca gaattacaga 4560acagctcacg cagagattat ggctcttaga caaggcggcc tcgttatgca gaattacaga 4560

ctcatcgacg ccacactcta cgttaccttc gaaccttgcg ttatgtgcgc cggagctatg 4620ctcatcgacg ccacactcta cgttaccttc gaaccttgcg ttatgtgcgc cggagctatg 4620

atccattcta ggattggcag ggtcgtgttc ggcgttagaa acgctaagac aggagctgca 4680atccattcta ggattggcag ggtcgtgttc ggcgttagaa acgctaagac aggagctgca 4680

ggctctctta tggacgttct tcattaccca ggcatgaatc atagagtgga gatcacagaa 4740ggctctctta tggacgttct tcattaccca ggcatgaatc atagagtgga gatcacagaa 4740

ggcattcttg cagacgagtg cgcagctctc ctttgctatt tcttcaggat gccgaggcaa 4800ggcattcttg cagacgagtg cgcagctctc ctttgctatt tcttcaggat gccgaggcaa 4800

gttttcaacg ctcagaagaa ggcccagtct tctacagatt ccggcggatc ttctggagga 4860gttttcaacg ctcagaagaa ggcccagtct tctacagatt ccggcggatc ttctggagga 4860

tctagcggct ccgagacacc aggaacatcc gaatccgcta caccagagtc ttctggagga 4920tctagcggct ccgagacacc aggaacatcc gaatccgcta caccagagtc ttctggagga 4920

tctagcggag gatctcttaa gaagagacca gcagctacaa agaaggccgg acaagctaag 4980tctagcggag gatctcttaa gaagagacca gcagctacaa agaaggccgg acaagctaag 4980

aagaagaagt ag 4992aagaagaagt ag 4992

<210> 16<210> 16

<211> 5686<211> 5686

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> PJIT163-GFP<223> PJIT163-GFP

<400> 16<400> 16

gagctcggta cctgacccgg tcgtgcccct ctctagagat aatgagcatt gcatgtctaa 60gagctcggta cctgacccgg tcgtgcccct ctctagagat aatgagcatt gcatgtctaa 60

gttataaaaa attaccacat attttttttg tcacacttgt ttgaagtgca gtttatctat 120gttataaaaa attaccacat atttttttttg tcacacttgt ttgaagtgca gtttatctat 120

ctttatacat atatttaaac tttactctac gaataatata atctatagta ctacaataat 180ctttatacat atatttaaac tttactctac gaataatata atctatagta ctacaataat 180

atcagtgttt tagagaatca tataaatgaa cagttagaca tggtctaaag gacaattgag 240atcagtgttt tagagaatca tataaatgaa cagttagaca tggtctaaag gacaattgag 240

tattttgaca acaggactct acagttttat ctttttagtg tgcatgtgtt ctcctttttt 300tattttgaca acaggactct acagttttat ctttttagtg tgcatgtgtt ctcctttttt 300

tttgcaaata gcttcaccta tataatactt catccatttt attagtacat ccatttaggg 360tttgcaaata gcttcaccta tataatactt catccatttt attagtacat ccatttaggg 360

tttagggtta atggttttta tagactaatt tttttagtac atctatttta ttctatttta 420tttagggtta atggttttta tagactaatt tttttagtac atctatttta ttctatttta 420

gcctctaaat taagaaaact aaaactctat tttagttttt ttatttaata atttagatat 480gcctctaaat taagaaaact aaaactctat tttagtttttt ttatttaata atttagatat 480

aaaatagaat aaaataaagt gactaaaaat taaacaaata ccctttaaga aattaaaaaa 540aaaatagaat aaaataaagt gactaaaaat taaacaaata ccctttaaga aattaaaaaa 540

actaaggaaa catttttctt gtttcgagta gataatgcca gcctgttaaa cgccgtcgac 600actaaggaaa catttttctt gtttcgagta gataatgcca gcctgttaaa cgccgtcgac 600

gagtctaacg gacaccaacc agcgaaccag cagcgtcgcg tcgggccaag cgaagcagac 660gagtctaacg gacaccaacc agcgaaccag cagcgtcgcg tcgggccaag cgaagcagac 660

ggcacggcat ctctgtcgct gcctctggac ccctctcgat cgagagttcc gctccaccgt 720ggcacggcat ctctgtcgct gcctctggac ccctctcgat cgagagttcc gctccaccgt 720

tggacttgct ccgctgtcgg catccagaaa ttgcgtggcg gagcggcaga cgtgagccgg 780tggacttgct ccgctgtcgg catccagaaa ttgcgtggcg gagcggcaga cgtgagccgg 780

cacggcaggc ggcctcctcc tcctctcacg gcaccggcag ctacggggga ttcctttccc 840cacggcaggc ggcctcctcc tcctctcacg gcaccggcag ctacggggga ttcctttccc 840

accgctcctt cgctttccct tcctcgcccg ccgtaataaa tagacacccc ctccacaccc 900accgctcctt cgctttccct tcctcgcccg ccgtaataaa tagacacccc ctccacaccc 900

tctttcccca acctcgtgtt gttcggagcg cacacacaca caaccagatc tcccccaaat 960tctttcccca acctcgtgtt gttcggagcg cacacacaca caaccagatc tcccccaaat 960

ccacccgtcg gcacctccgc ttcaaggtac gccgctcgtc ctcccccccc ccccctctct 1020ccacccgtcg gcacctccgc ttcaaggtac gccgctcgtc ctcccccccc ccccctctct 1020

accttctcta gatcggcgtt ccggtccatg gttagggccc ggtagttcta cttctgttca 1080accttctcta gatcggcgtt ccggtccatg gttagggccc ggtagttcta cttctgttca 1080

tgtttgtgtt agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc 1140tgtttgtgtt agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc 1140

gacctgtacg tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc 1200gacctgtacg tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc 1200

tgggatggct ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg 1260tgggatggct ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg 1260

catagggttt ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg 1320catagggttt ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg 1320

gtcatctttt catgcttttt tttgtcttgg ttgtgatgat gtggtctggt tgggcggtcg 1380gtcatctttt catgcttttt tttgtcttgg ttgtgatgat gtggtctggt tgggcggtcg 1380

ttctagatcg gagtagaatt aattctgttt caaactacct ggtggattta ttaattttgg 1440ttctagatcg gagtagaatt aattctgttt caaactacct ggtggattta ttaattttgg 1440

atctgtatgt gtgtgccata catattcata gttacgaatt gaagatgatg gatggaaata 1500atctgtatgt gtgtgccata catattcata gttacgaatt gaagatgatg gatggaaata 1500

tcgatctagg ataggtatac atgttgatgc gggttttact gatgcatata cagagatgct 1560tcgatctagg ataggtatac atgttgatgc gggttttact gatgcatata cagagatgct 1560

ttttgttcgc ttggttgtga tgatgtggtg tggttgggcg gtcgttcatt cgttctagat 1620ttttgttcgc ttggttgtga tgatgtggtg tggttgggcg gtcgttcatt cgttctagat 1620

cggagtagaa tactgtttca aactacctgg tgtatttatt aattttggaa ctgtatgtgt 1680cggagtagaa tactgtttca aactacctgg tgtatttatt aattttggaa ctgtatgtgt 1680

gtgtcataca tcttcatagt tacgagttta agatggatgg aaatatcgat ctaggatagg 1740gtgtcataca tcttcatagt tacgagttta agatggatgg aaatatcgat ctaggatagg 1740

tatacatgtt gatgtgggtt ttactgatgc atatacatga tggcatatgc agcatctatt 1800tatacatgtt gatgtgggtt ttactgatgc atatacatga tggcatatgc agcatctatt 1800

catatgctct aaccttgagt acctatctat tataataaac aagtatgttt tataattatt 1860catatgctct aaccttgagt acctatctat tataataaac aagtatgttt tataattatt 1860

ttgatcttga tatacttgga tgatggcata tgcagcagct atatgtggat ttttttagcc 1920ttgatcttga tatacttgga tgatggcata tgcagcagct atatgtggat ttttttagcc 1920

ctgccttcat acgctattta tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt 1980ctgccttcat acgctattta tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt 1980

gtttggtgtt acttctgcaa agcttgtcga cggatccatg gtgagcaagg gcgaggagct 2040gtttggtgtt acttctgcaa agcttgtcga cggatccatg gtgagcaagg gcgaggagct 2040

gttcaccggg gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt 2100gttcaccggg gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt 2100

cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat 2160cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat 2160

ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc gtgaccacct tcacctacgg 2220ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc gtgaccacct tcacctacgg 2220

cgtgcagtgc ttcagccgct accccgacca catgaagcag cacgacttct tcaagtccgc 2280cgtgcagtgc ttcagccgct accccgacca catgaagcag cacgacttct tcaagtccgc 2280

catgcccgaa ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa 2340catgcccgaa ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa 2340

gacccgcgcc gaggtgaagt tcgagggcga caccctggtg aaccgcatcg agctgaaggg 2400gacccgcgcc gaggtgaagt tcgagggcga caccctggtg aaccgcatcg agctgaaggg 2400

catcgacttc aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag 2460catcgacttc aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag 2460

ccacaacgtc tatatcatgg ccgacaagca gaagaacggc atcaaggtga acttcaagat 2520ccacaacgtc tatatcatgg ccgacaagca gaagaacggc atcaaggtga acttcaagat 2520

ccgccacaac atcgaggacg gcagcgtgca gctcgccgac cactaccagc agaacacccc 2580ccgccacaac atcgaggacg gcagcgtgca gctcgccgac cactaccagc agaacacccc 2580

catcggcgac ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct 2640catcggcgac ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct 2640

gagcaaagac cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc 2700gagcaaagac cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc 2700

cgggatcact cacggcatgg acgagctgta caagtaaccc gggaattcgg tacgctgaaa 2760cgggatcact cacggcatgg acgagctgta caagtaaccc gggaattcgg tacgctgaaa 2760

tcaccagtct ctctctacaa atctatctct ctctattttc tccataaata atgtgtgagt 2820tcaccagtct ctctctacaa atctatctct ctctattttc tccataaata atgtgtgagt 2820

agtttcccga taagggaaat tagggttctt atagggtttc gctcatgtgt tgagcatata 2880agtttcccga taagggaaat tagggttctt atagggtttc gctcatgtgt tgagcatata 2880

agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa tttctaattc 2940agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa tttctaattc 2940

ctaaaaccaa aatccagtac taaaatccag atctcctaaa gtccctatag atctttgtcg 3000ctaaaaccaa aatccagtac taaaatccag atctcctaaa gtccctatag atctttgtcg 3000

tgaatataaa ccagacacga gacgactaaa cctggagccc agacgccgtt cgaagctaga 3060tgaatataaa ccagacacga gacgactaaa cctggagccc agacgccgtt cgaagctaga 3060

agtaccgctt aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg 3120agtaccgctt aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg 3120

actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga ggaagacaag 3180actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga ggaagacaag 3180

gaaggataag gttgcaggcc ctgtgcaagg taagaagatg gaaatttgat agaggtacgc 3240gaaggataag gttgcaggcc ctgtgcaagg taagaagatg gaaatttgat agaggtacgc 3240

tactatactt atactatacg ctaagggaat gcttgtattt ataccctata ccccctaata 3300tactatactt atactatacg ctaagggaat gcttgtattt ataccctata ccccctaata 3300

accccttatc aatttaagaa ataatccgca taagcccccg cttaaaaatt ggtatcagag 3360accccttatc aatttaagaa ataatccgca taagccccccg cttaaaaatt ggtatcagag 3360

ccatgaatag gtctatgacc aaaactcaag aggataaaac ctcaccaaaa tacgaaagag 3420ccatgaatag gtctatgacc aaaactcaag aggataaaac ctcaccaaaa tacgaaagag 3420

ttcttaactc taaagataaa agatctttca agatcaaaac tagttccctc acaccggagc 3480ttcttaactc taaagataaa agatctttca agatcaaaac tagttccctc acaccggagc 3480

atgcgatatc ctcgagagat ctaggcgtaa tcatggtcat agctgtttcc tgtgtgaaat 3540atgcgatatc ctcgagagat ctaggcgtaa tcatggtcat agctgtttcc tgtgtgaaat 3540

tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg 3600tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg 3600

ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag 3660ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag 3660

tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 3720tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 3720

ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 3780ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 3780

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 3840ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 3840

gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3900gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3900

gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3960gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3960

cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 4020cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 4020

ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 4080ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 4080

tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta tctcagttcg 4140tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta tctcagttcg 4140

gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4200gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4200

tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4260tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4260

ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4320ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4320

ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4380ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4380

ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4440ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4440

accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4500accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4500

tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4560tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4560

cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4620cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4620

taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 4680taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 4680

caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 4740caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 4740

gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 4800gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 4800

gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 4860gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 4860

ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 4920ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 4920

attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 4980attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 4980

gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 5040gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 5040

tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 5100tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 5100

agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 5160agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 5160

gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 5220gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 5220

actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 5280actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 5280

tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 5340tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 5340

attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 5400attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 5400

tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 5460tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 5460

tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 5520tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 5520

aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 5580aaatgttgaa tactcatact cttcctttttt caatattatt gaagcattta tcagggttat 5580

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 5640tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 5640

cgcacatttc cccgaaaagt gccacctgcc agtgccaagc taattc 5686cgcacatttc cccgaaaagt gccacctgcc agtgccaagc taattc 5686

<210> 17<210> 17

<211> 720<211> 720

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> pBUI-mGFP<223> pBUI-mGFP

<400> 17<400> 17

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60

ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120

ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180

ctcgtgacca ccttcaccta cggcgtgtag tgcttcagcc gctaccccga ccacatgaag 240ctcgtgacca ccttcaccta cggcgtgtag tgcttcagcc gctaccccga ccacatgaag 240

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420

aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480

ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600

tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660

ctgctggagt tcgtgaccgc cgccgggatc actcacggca tggacgagct gtacaagtaa 720ctgctggagt tcgtgaccgc cgccgggatc actcacggca tggacgagct gtacaagtaa 720

<210> 18<210> 18

<211> 1307<211> 1307

<212> PRT<212> PRT

<213> Acidaminococcus sp.<213> Acidaminococcus sp.

<400> 18<400> 18

Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys ThrMet Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile GlnLeu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln

20 25 30 20 25 30

Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr LysGlu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys

35 40 45 35 40 45

Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp GlnGlu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln

50 55 60 50 55 60

Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala IleCys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile

65 70 75 8065 70 75 80

Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu IleAsp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile

85 90 95 85 90 95

Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile GlyGlu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly

100 105 110 100 105 110

Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu IleArg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile

115 120 125 115 120 125

Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu LysTyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys

130 135 140 130 135 140

Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu ArgGln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg

145 150 155 160145 150 155 160

Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn ArgSer Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg

165 170 175 165 170 175

Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His ArgLys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg

180 185 190 180 185 190

Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile PheIle Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe

195 200 205 195 200 205

Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu AsnThr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn

210 215 220 210 215 220

Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu ValVal Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val

225 230 235 240225 230 235 240

Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile AspPhe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp

245 250 255 245 250 255

Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr GluLeu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu

260 265 270 260 265 270

Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys AsnLys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn

275 280 285 275 280 285

Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile ProAsp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro

290 295 300 290 295 300

Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile LeuLeu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu

305 310 315 320305 310 315 320

Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys TyrGlu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr

325 330 335 325 330 335

Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala LeuLys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu

340 345 350 340 345 350

Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser HisPhe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His

355 360 365 355 360 365

Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp ThrLys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr

370 375 380 370 375 380

Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly LysLeu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys

385 390 395 400385 390 395 400

Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His GluIle Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu

405 410 415 405 410 415

Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu SerAsp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser

420 425 430 420 425 430

Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His AlaGlu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala

435 440 445 435 440 445

Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu LysAla Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys

450 455 460 450 455 460

Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His LeuGlu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu

465 470 475 480465 470 475 480

Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu PheLeu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe

485 490 495 485 490 495

Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu SerSer Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser

500 505 510 500 505 510

Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser ValPhe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val

515 520 525 515 520 525

Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly TrpGlu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp

530 535 540 530 535 540

Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys AsnAsp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn

545 550 555 560545 550 555 560

Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr LysGly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys

565 570 575 565 570 575

Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp LysAla Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys

580 585 590 580 585 590

Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys CysMet Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys

595 600 605 595 600 605

Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr ThrSer Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr

610 615 620 610 615 620

Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr LysPro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys

625 630 635 640625 630 635 640

Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe GlnGlu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln

645 650 655 645 650 655

Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu AlaThr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala

660 665 670 660 665 670

Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr ThrLeu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr

675 680 685 675 680 685

Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln TyrLys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr

690 695 700 690 695 700

Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr HisLys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His

705 710 715 720705 710 715 720

Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val GluIle Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu

725 730 735 725 730 735

Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala LysThr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys

740 745 750 740 745 750

Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly LeuGly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu

755 760 765 755 760 765

Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly GlnPhe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln

770 775 780 770 775 780

Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala HisAla Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His

785 790 795 800785 790 795 800

Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys ThrArg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr

805 810 815 805 810 815

Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn HisPro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His

820 825 830 820 825 830

Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro AsnArg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn

835 840 845 835 840 845

Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg PheVal Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe

850 855 860 850 855 860

Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr GlnThr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln

865 870 875 880865 870 875 880

Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr LeuAla Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu

885 890 895 885 890 895

Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu ArgLys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg

900 905 910 900 905 910

Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu GluAsn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu

915 920 925 915 920 925

Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys LeuGln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu

930 935 940 930 935 940

Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser ValAsp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val

945 950 955 960945 950 955 960

Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val IleVal Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile

965 970 975 965 970 975

His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val LeuHis Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu

980 985 990 980 985 990

Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala GluGlu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu

995 1000 1005 995 1000 1005

Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys LeuLys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu

1010 1015 1020 1010 1015 1020

Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly GlyAsn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly

1025 1030 1035 1025 1030 1035

Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe AlaVal Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala

1040 1045 1050 1040 1045 1050

Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala ProLys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro

1055 1060 1065 1055 1060 1065

Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro PheTyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe

1070 1075 1080 1070 1075 1080

Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe LeuVal Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu

1085 1090 1095 1085 1090 1095

Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp PheGlu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe

1100 1105 1110 1100 1105 1110

Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg GlyIle Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly

1115 1120 1125 1115 1120 1125

Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys AsnLeu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn

1130 1135 1140 1130 1135 1140

Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly LysGlu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys

1145 1150 1155 1145 1150 1155

Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg TyrArg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr

1160 1165 1170 1160 1165 1170

Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu GluArg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu

1175 1180 1185 1175 1180 1185

Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys LeuLys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu

1190 1195 1200 1190 1195 1200

Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala LeuLeu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu

1205 1210 1215 1205 1210 1215

Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr GlyIle Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly

1220 1225 1230 1220 1225 1230

Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val CysGlu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys

1235 1240 1245 1235 1240 1245

Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala AspPhe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp

1250 1255 1260 1250 1255 1260

Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu LeuAla Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu

1265 1270 1275 1265 1270 1275

Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly IleAsn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile

1280 1285 1290 1280 1285 1290

Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg AsnSer Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn

1295 1300 1305 1295 1300 1305

<210> 19<210> 19

<211> 1300<211> 1300

<212> PRT<212> PRT

<213> Francisella novicida<213> Francisella novicida

<400> 19<400> 19

Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys ThrMet Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile LysLeu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys

20 25 30 20 25 30

Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr LysAla Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys

35 40 45 35 40 45

Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu GluLys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu

50 55 60 50 55 60

Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr SerIle Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser

65 70 75 8065 70 75 80

Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln LysAsp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys

85 90 95 85 90 95

Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu TyrAsp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr

100 105 110 100 105 110

Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu IleIle Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile

115 120 125 115 120 125

Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys GlnAsp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln

130 135 140 130 135 140

Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile ThrSer Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr

145 150 155 160145 150 155 160

Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp ThrAsp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr

165 170 175 165 170 175

Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser SerThr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser

180 185 190 180 185 190

Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn LeuAsn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu

195 200 205 195 200 205

Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp LysPro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys

210 215 220 210 215 220

Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala GluAla Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu

225 230 235 240225 230 235 240

Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln ArgGlu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg

245 250 255 245 250 255

Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn TyrVal Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr

260 265 270 260 265 270

Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly LysLeu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys

275 280 285 275 280 285

Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr IlePhe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile

290 295 300 290 295 300

Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr LysAsn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys

305 310 315 320305 310 315 320

Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys SerMet Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser

325 330 335 325 330 335

Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr MetPhe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met

340 345 350 340 345 350

Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu LysGln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys

355 360 365 355 360 365

Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala GlnSer Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln

370 375 380 370 375 380

Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu ThrLys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr

385 390 395 400385 390 395 400

Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr AlaAsp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala

405 410 415 405 410 415

Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp AsnVal Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn

420 425 430 420 425 430

Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys AlaPro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala

435 440 445 435 440 445

Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe AsnLys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn

450 455 460 450 455 460

Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu AlaLys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala

465 470 475 480465 470 475 480

Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn LysAsn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys

485 490 495 485 490 495

Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys LysAsp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys

500 505 510 500 505 510

Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys AspAsp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp

515 520 525 515 520 525

Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe HisLeu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His

530 535 540 530 535 540

Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu HisIle Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His

545 550 555 560545 550 555 560

Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile ValPhe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val

565 570 575 565 570 575

Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr SerPro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser

580 585 590 580 585 590

Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn GlyAsp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly

595 600 605 595 600 605

Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile LysTrp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys

610 615 620 610 615 620

Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys IleAsp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile

625 630 635 640625 630 635 640

Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys LysPhe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys

645 650 655 645 650 655

Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys ValIle Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val

660 665 670 660 665 670

Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp IlePhe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile

675 680 685 675 680 685

Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro GlnLeu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln

690 695 700 690 695 700

Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys PheLys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe

705 710 715 720705 710 715 720

Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys AspIle Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp

725 730 735 725 730 735

Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp GluPhe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu

740 745 750 740 745 750

Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu AsnPhe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn

755 760 765 755 760 765

Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu TyrIle Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr

770 775 780 770 775 780

Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly ArgLeu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg

785 790 795 800785 790 795 800

Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg AsnPro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn

805 810 815 805 810 815

Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe TyrLeu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr

820 825 830 820 825 830

Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu AlaArg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala

835 840 845 835 840 845

Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe GluIle Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu

850 855 860 850 855 860

Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe PheTyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe

865 870 875 880865 870 875 880

His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys PheHis Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe

885 890 895 885 890 895

Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val HisAsn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His

900 905 910 900 905 910

Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr LeuIle Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu

915 920 925 915 920 925

Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile IleVal Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile

930 935 940 930 935 940

Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala IleGly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile

945 950 955 960945 950 955 960

Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn AsnGlu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn

965 970 975 965 970 975

Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu IleIle Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile

980 985 990 980 985 990

Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp LeuAla Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu

995 1000 1005 995 1000 1005

Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln ValAsn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val

1010 1015 1020 1010 1015 1020

Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr LeuTyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu

1025 1030 1035 1025 1030 1035

Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu ArgVal Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg

1040 1045 1050 1040 1045 1050

Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met GlyAla Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly

1055 1060 1065 1055 1060 1065

Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr SerLys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser

1070 1075 1080 1070 1075 1080

Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro LysLys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys

1085 1090 1095 1085 1090 1095

Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe AspTyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp

1100 1105 1110 1100 1105 1110

Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser PheLys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe

1115 1120 1125 1115 1120 1125

Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp ThrAsp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr

1130 1135 1140 1130 1135 1140

Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser AspIle Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp

1145 1150 1155 1145 1150 1155

Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys GluLys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu

1160 1165 1170 1160 1165 1170

Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His GlyLeu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly

1175 1180 1185 1175 1180 1185

Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys PheGlu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe

1190 1195 1200 1190 1195 1200

Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met ArgPhe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg

1205 1210 1215 1205 1210 1215

Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro ValAsn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val

1220 1225 1230 1220 1225 1230

Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro LysAla Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys

1235 1240 1245 1235 1240 1245

Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile GlyAsn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly

1250 1255 1260 1250 1255 1260

Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln GluLeu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu

1265 1270 1275 1265 1270 1275

Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe GluGly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu

1280 1285 1290 1280 1285 1290

Phe Val Gln Asn Arg Asn AsnPhe Val Gln Asn Arg Asn Asn

1295 1300 1295 1300

<210> 20<210> 20

<211> 1228<211> 1228

<212> PRT<212> PRT

<213> Lachnospiraceae bacterium<213> Lachnospiraceae bacterium

<400> 20<400> 20

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys ThrMet Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile AspLeu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp

20 25 30 20 25 30

Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr LysAsn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys

35 40 45 35 40 45

Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn AspGly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp

50 55 60 50 55 60

Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser LeuVal Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu

65 70 75 8065 70 75 80

Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu AsnPhe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn

85 90 95 85 90 95

Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly AsnLeu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn

100 105 110 100 105 110

Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile LeuGlu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu

115 120 125 115 120 125

Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser PhePro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe

130 135 140 130 135 140

Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu AsnAsn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn

145 150 155 160145 150 155 160

Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys IleMet Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile

165 170 175 165 170 175

Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu LysAsn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys

180 185 190 180 185 190

Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu LysVal Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys

195 200 205 195 200 205

Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu PheIle Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe

210 215 220 210 215 220

Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala IlePhe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile

225 230 235 240225 230 235 240

Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu AsnIle Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn

245 250 255 245 250 255

Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro LysGlu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys

260 265 270 260 265 270

Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu SerPhe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser

275 280 285 275 280 285

Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val PhePhe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe

290 295 300 290 295 300

Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys LysArg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys

305 310 315 320305 310 315 320

Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly IleLeu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile

325 330 335 325 330 335

Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile PhePhe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe

340 345 350 340 345 350

Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp AspGly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp

355 360 365 355 360 365

Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp AspIle His Leu Lys Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp

370 375 380 370 375 380

Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln LeuArg Arg Lys Ser Phe Lys Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu

385 390 395 400385 390 395 400

Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys GluGln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu

405 410 415 405 410 415

Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser SerIle Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser

420 425 430 420 425 430

Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys LysGlu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys

435 440 445 435 440 445

Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val LysAsn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys

450 455 460 450 455 460

Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu ThrSer Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr

465 470 475 480465 470 475 480

Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp IleAsn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile

485 490 495 485 490 495

Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val ThrLeu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr

500 505 510 500 505 510

Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn ProGln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro

515 520 525 515 520 525

Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg AlaGln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala

530 535 540 530 535 540

Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp LysThr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys

545 550 555 560545 550 555 560

Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn GlyLys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly

565 570 575 565 570 575

Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys MetAsn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met

580 585 590 580 585 590

Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn ProLeu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro

595 600 605 595 600 605

Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys GlySer Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly

610 615 620 610 615 620

Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe LysAsp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys

625 630 635 640625 630 635 640

Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe AsnAsp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn

645 650 655 645 650 655

Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg GluPhe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu

660 665 670 660 665 670

Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys LysVal Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys

675 680 685 675 680 685

Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln IleGlu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile

690 695 700 690 695 700

Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu HisTyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His

705 710 715 720705 710 715 720

Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln IleThr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile

725 730 735 725 730 735

Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu LysArg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys

740 745 750 740 745 750

Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn LysLys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys

755 760 765 755 760 765

Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val TyrAsn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr

770 775 780 770 775 780

Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro IleLys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile

785 790 795 800785 790 795 800

Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu ValAla Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val

805 810 815 805 810 815

Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile AspArg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp

820 825 830 820 825 830

Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys GlyArg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly

835 840 845 835 840 845

Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe AsnAsn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn

850 855 860 850 855 860

Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys GluGly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu

865 870 875 880865 870 875 880

Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn IleLys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile

885 890 895 885 890 895

Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile CysLys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys

900 905 910 900 905 910

Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu AsnGlu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn

915 920 925 915 920 925

Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr GlnSer Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln

930 935 940 930 935 940

Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp LysLys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys

945 950 955 960945 950 955 960

Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln IleLys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile

965 970 975 965 970 975

Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly PheThr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe

980 985 990 980 985 990

Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser ThrIle Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr

995 1000 1005 995 1000 1005

Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala AspGly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp

1010 1015 1020 1010 1015 1020

Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val ProSer Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro

1025 1030 1035 1025 1030 1035

Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe SerGlu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser

1040 1045 1050 1040 1045 1050

Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser TyrArg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr

1055 1060 1065 1055 1060 1065

Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn ValGly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val

1070 1075 1080 1070 1075 1080

Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu LeuPhe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu

1085 1090 1095 1085 1090 1095

Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg AlaPhe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala

1100 1105 1110 1100 1105 1110

Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe MetLeu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met

1115 1120 1125 1115 1120 1125

Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr GlyAla Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly

1130 1135 1140 1130 1135 1140

Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser AspArg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp

1145 1150 1155 1145 1150 1155

Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn AlaGly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala

1160 1165 1170 1160 1165 1170

Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile AlaIle Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala

1175 1180 1185 1175 1180 1185

Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu AspArg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp

1190 1195 1200 1190 1195 1200

Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu TrpGlu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp

1205 1210 1215 1205 1210 1215

Leu Glu Tyr Ala Gln Thr Ser Val Lys HisLeu Glu Tyr Ala Gln Thr Ser Val Lys His

1220 1225 1220 1225

<210> 21<210> 21

<211> 1329<211> 1329

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> ASCPF1-2NLS<223> ASCPF1-2NLS

<400> 21<400> 21

Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys ThrMet Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile GlnLeu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln

20 25 30 20 25 30

Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr LysGlu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys

35 40 45 35 40 45

Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp GlnGlu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln

50 55 60 50 55 60

Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala IleCys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile

65 70 75 8065 70 75 80

Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu IleAsp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile

85 90 95 85 90 95

Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile GlyGlu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly

100 105 110 100 105 110

Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu IleArg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile

115 120 125 115 120 125

Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu LysTyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys

130 135 140 130 135 140

Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu ArgGln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg

145 150 155 160145 150 155 160

Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn ArgSer Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg

165 170 175 165 170 175

Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His ArgLys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg

180 185 190 180 185 190

Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile PheIle Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe

195 200 205 195 200 205

Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu AsnThr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn

210 215 220 210 215 220

Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu ValVal Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val

225 230 235 240225 230 235 240

Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile AspPhe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp

245 250 255 245 250 255

Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr GluLeu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu

260 265 270 260 265 270

Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys AsnLys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn

275 280 285 275 280 285

Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile ProAsp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro

290 295 300 290 295 300

Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile LeuLeu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu

305 310 315 320305 310 315 320

Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys TyrGlu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr

325 330 335 325 330 335

Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala LeuLys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu

340 345 350 340 345 350

Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser HisPhe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His

355 360 365 355 360 365

Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp ThrLys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr

370 375 380 370 375 380

Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly LysLeu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys

385 390 395 400385 390 395 400

Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His GluIle Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu

405 410 415 405 410 415

Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu SerAsp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser

420 425 430 420 425 430

Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His AlaGlu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala

435 440 445 435 440 445

Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu LysAla Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys

450 455 460 450 455 460

Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His LeuGlu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu

465 470 475 480465 470 475 480

Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu PheLeu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe

485 490 495 485 490 495

Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu SerSer Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser

500 505 510 500 505 510

Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser ValPhe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val

515 520 525 515 520 525

Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly TrpGlu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp

530 535 540 530 535 540

Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys AsnAsp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn

545 550 555 560545 550 555 560

Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr LysGly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys

565 570 575 565 570 575

Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp LysAla Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys

580 585 590 580 585 590

Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys CysMet Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys

595 600 605 595 600 605

Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr ThrSer Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr

610 615 620 610 615 620

Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr LysPro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys

625 630 635 640625 630 635 640

Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe GlnGlu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln

645 650 655 645 650 655

Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu AlaThr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala

660 665 670 660 665 670

Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr ThrLeu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr

675 680 685 675 680 685

Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln TyrLys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr

690 695 700 690 695 700

Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr HisLys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His

705 710 715 720705 710 715 720

Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val GluIle Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu

725 730 735 725 730 735

Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala LysThr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys

740 745 750 740 745 750

Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly LeuGly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu

755 760 765 755 760 765

Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly GlnPhe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln

770 775 780 770 775 780

Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala HisAla Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His

785 790 795 800785 790 795 800

Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys ThrArg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr

805 810 815 805 810 815

Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn HisPro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His

820 825 830 820 825 830

Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro AsnArg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn

835 840 845 835 840 845

Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg PheVal Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe

850 855 860 850 855 860

Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr GlnThr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln

865 870 875 880865 870 875 880

Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr LeuAla Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu

885 890 895 885 890 895

Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu ArgLys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg

900 905 910 900 905 910

Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu GluAsn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu

915 920 925 915 920 925

Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys LeuGln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu

930 935 940 930 935 940

Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser ValAsp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val

945 950 955 960945 950 955 960

Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val IleVal Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile

965 970 975 965 970 975

His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val LeuHis Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu

980 985 990 980 985 990

Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala GluGlu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu

995 1000 1005 995 1000 1005

Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys LeuLys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu

1010 1015 1020 1010 1015 1020

Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly GlyAsn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly

1025 1030 1035 1025 1030 1035

Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe AlaVal Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala

1040 1045 1050 1040 1045 1050

Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala ProLys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro

1055 1060 1065 1055 1060 1065

Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro PheTyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe

1070 1075 1080 1070 1075 1080

Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe LeuVal Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu

1085 1090 1095 1085 1090 1095

Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp PheGlu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe

1100 1105 1110 1100 1105 1110

Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg GlyIle Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly

1115 1120 1125 1115 1120 1125

Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys AsnLeu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn

1130 1135 1140 1130 1135 1140

Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly LysGlu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys

1145 1150 1155 1145 1150 1155

Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg TyrArg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr

1160 1165 1170 1160 1165 1170

Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu GluArg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu

1175 1180 1185 1175 1180 1185

Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys LeuLys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu

1190 1195 1200 1190 1195 1200

Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala LeuLeu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu

1205 1210 1215 1205 1210 1215

Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr GlyIle Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly

1220 1225 1230 1220 1225 1230

Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val CysGlu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys

1235 1240 1245 1235 1240 1245

Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala AspPhe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp

1250 1255 1260 1250 1255 1260

Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu LeuAla Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu

1265 1270 1275 1265 1270 1275

Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly IleAsn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile

1280 1285 1290 1280 1285 1290

Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn SerSer Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn Ser

1295 1300 1305 1295 1300 1305

Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser ProGly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro

1310 1315 1320 1310 1315 1320

Lys Lys Lys Arg Lys ValLys Lys Lys Arg Lys Val

1325 1325

<210> 22<210> 22

<211> 1322<211> 1322

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> FNCPF1-2NLS<223> FNCPF1-2NLS

<400> 22<400> 22

Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys ThrMet Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile LysLeu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys

20 25 30 20 25 30

Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr LysAla Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys

35 40 45 35 40 45

Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu GluLys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu

50 55 60 50 55 60

Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr SerIle Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser

65 70 75 8065 70 75 80

Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln LysAsp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys

85 90 95 85 90 95

Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu TyrAsp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr

100 105 110 100 105 110

Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu IleIle Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile

115 120 125 115 120 125

Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys GlnAsp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln

130 135 140 130 135 140

Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile ThrSer Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr

145 150 155 160145 150 155 160

Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp ThrAsp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr

165 170 175 165 170 175

Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser SerThr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser

180 185 190 180 185 190

Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn LeuAsn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu

195 200 205 195 200 205

Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp LysPro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys

210 215 220 210 215 220

Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala GluAla Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu

225 230 235 240225 230 235 240

Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln ArgGlu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg

245 250 255 245 250 255

Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn TyrVal Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr

260 265 270 260 265 270

Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly LysLeu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys

275 280 285 275 280 285

Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr IlePhe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile

290 295 300 290 295 300

Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr LysAsn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys

305 310 315 320305 310 315 320

Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys SerMet Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser

325 330 335 325 330 335

Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr MetPhe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met

340 345 350 340 345 350

Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu LysGln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys

355 360 365 355 360 365

Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala GlnSer Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln

370 375 380 370 375 380

Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu ThrLys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr

385 390 395 400385 390 395 400

Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr AlaAsp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala

405 410 415 405 410 415

Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp AsnVal Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn

420 425 430 420 425 430

Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys AlaPro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala

435 440 445 435 440 445

Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe AsnLys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn

450 455 460 450 455 460

Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu AlaLys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala

465 470 475 480465 470 475 480

Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn LysAsn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys

485 490 495 485 490 495

Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys LysAsp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys

500 505 510 500 505 510

Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys AspAsp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp

515 520 525 515 520 525

Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe HisLeu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His

530 535 540 530 535 540

Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu HisIle Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His

545 550 555 560545 550 555 560

Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile ValPhe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val

565 570 575 565 570 575

Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr SerPro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser

580 585 590 580 585 590

Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn GlyAsp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly

595 600 605 595 600 605

Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile LysTrp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys

610 615 620 610 615 620

Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys IleAsp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile

625 630 635 640625 630 635 640

Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys LysPhe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys

645 650 655 645 650 655

Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys ValIle Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val

660 665 670 660 665 670

Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp IlePhe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile

675 680 685 675 680 685

Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro GlnLeu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln

690 695 700 690 695 700

Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys PheLys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe

705 710 715 720705 710 715 720

Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys AspIle Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp

725 730 735 725 730 735

Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp GluPhe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu

740 745 750 740 745 750

Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu AsnPhe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn

755 760 765 755 760 765

Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu TyrIle Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr

770 775 780 770 775 780

Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly ArgLeu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg

785 790 795 800785 790 795 800

Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg AsnPro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn

805 810 815 805 810 815

Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe TyrLeu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr

820 825 830 820 825 830

Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu AlaArg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala

835 840 845 835 840 845

Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe GluIle Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu

850 855 860 850 855 860

Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe PheTyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe

865 870 875 880865 870 875 880

His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys PheHis Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe

885 890 895 885 890 895

Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val HisAsn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His

900 905 910 900 905 910

Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr LeuIle Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu

915 920 925 915 920 925

Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile IleVal Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile

930 935 940 930 935 940

Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala IleGly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile

945 950 955 960945 950 955 960

Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn AsnGlu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn

965 970 975 965 970 975

Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu IleIle Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile

980 985 990 980 985 990

Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp LeuAla Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu

995 1000 1005 995 1000 1005

Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln ValAsn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val

1010 1015 1020 1010 1015 1020

Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr LeuTyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu

1025 1030 1035 1025 1030 1035

Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu ArgVal Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg

1040 1045 1050 1040 1045 1050

Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met GlyAla Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly

1055 1060 1065 1055 1060 1065

Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr SerLys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser

1070 1075 1080 1070 1075 1080

Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro LysLys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys

1085 1090 1095 1085 1090 1095

Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe AspTyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp

1100 1105 1110 1100 1105 1110

Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser PheLys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe

1115 1120 1125 1115 1120 1125

Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp ThrAsp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr

1130 1135 1140 1130 1135 1140

Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser AspIle Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp

1145 1150 1155 1145 1150 1155

Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys GluLys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu

1160 1165 1170 1160 1165 1170

Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His GlyLeu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly

1175 1180 1185 1175 1180 1185

Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys PheGlu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe

1190 1195 1200 1190 1195 1200

Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met ArgPhe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg

1205 1210 1215 1205 1210 1215

Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro ValAsn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val

1220 1225 1230 1220 1225 1230

Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro LysAla Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys

1235 1240 1245 1235 1240 1245

Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile GlyAsn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly

1250 1255 1260 1250 1255 1260

Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln GluLeu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu

1265 1270 1275 1265 1270 1275

Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe GluGly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu

1280 1285 1290 1280 1285 1290

Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser Pro Lys Lys LysPhe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser Pro Lys Lys Lys

1295 1300 1305 1295 1300 1305

Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys ValArg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val

1310 1315 1320 1310 1315 1320

<210> 23<210> 23

<211> 1252<211> 1252

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-2NLS<223> LBCPF1-2NLS

<400> 23<400> 23

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys ThrMet Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile AspLeu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp

20 25 30 20 25 30

Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr LysAsn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys

35 40 45 35 40 45

Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn AspGly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp

50 55 60 50 55 60

Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser LeuVal Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu

65 70 75 8065 70 75 80

Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu AsnPhe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn

85 90 95 85 90 95

Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly AsnLeu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn

100 105 110 100 105 110

Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile LeuGlu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu

115 120 125 115 120 125

Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser PhePro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe

130 135 140 130 135 140

Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu AsnAsn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn

145 150 155 160145 150 155 160

Met Glu Thr Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe ArgMet Glu Thr Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg

165 170 175 165 170 175

Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile PheCys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe

180 185 190 180 185 190

Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile LysGlu Lys Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys

195 200 205 195 200 205

Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu GlyGlu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly

210 215 220 210 215 220

Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr AsnGlu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn

225 230 235 240225 230 235 240

Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys GlyAla Ile Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly

245 250 255 245 250 255

Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys LeuLeu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu

260 265 270 260 265 270

Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu SerPro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser

275 280 285 275 280 285

Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu GluLeu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu

290 295 300 290 295 300

Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser IleVal Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile

305 310 315 320305 310 315 320

Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser AlaLys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala

325 330 335 325 330 335

Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys AspGly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp

340 345 350 340 345 350

Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu TyrIle Phe Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr

355 360 365 355 360 365

Asp Asp Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr GluAsp Asp Ile His Leu Lys Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu

370 375 380 370 375 380

Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu GluAsp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu

385 390 395 400385 390 395 400

Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys LeuGln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu

405 410 415 405 410 415

Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr GlyLys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly

420 425 430 420 425 430

Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser LeuSer Ser Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu

435 440 445 435 440 445

Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp SerLys Lys Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser

450 455 460 450 455 460

Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly LysVal Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys

465 470 475 480465 470 475 480

Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala TyrGlu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr

485 490 495 485 490 495

Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn TyrAsp Ile Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr

500 505 510 500 505 510

Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe GlnVal Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln

515 520 525 515 520 525

Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp TyrAsn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr

530 535 540 530 535 540

Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile MetArg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met

545 550 555 560545 550 555 560

Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp ValAsp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val

565 570 575 565 570 575

Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro AsnAsn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn

580 585 590 580 585 590

Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr TyrLys Met Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr

595 600 605 595 600 605

Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe LysAsn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys

610 615 620 610 615 620

Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp PheLys Gly Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe

625 630 635 640625 630 635 640

Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr AspPhe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp

645 650 655 645 650 655

Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe TyrPhe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr

660 665 670 660 665 670

Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala SerArg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser

675 680 685 675 680 685

Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met PheLys Lys Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe

690 695 700 690 695 700

Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro AsnGln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn

705 710 715 720705 710 715 720

Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His GlyLeu His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly

725 730 735 725 730 735

Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala SerGln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser

740 745 750 740 745 750

Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile AlaLeu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala

755 760 765 755 760 765

Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr AspAsn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp

770 775 780 770 775 780

Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His IleVal Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile

785 790 795 800785 790 795 800

Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn ThrPro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr

805 810 815 805 810 815

Glu Val Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile GlyGlu Val Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly

820 825 830 820 825 830

Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp GlyIle Asp Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly

835 840 845 835 840 845

Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn AsnLys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn

850 855 860 850 855 860

Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp LysPhe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys

865 870 875 880865 870 875 880

Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile GluLys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu

885 890 895 885 890 895

Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His LysAsn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys

900 905 910 900 905 910

Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu AspIle Cys Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp

915 920 925 915 920 925

Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln ValLeu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val

930 935 940 930 935 940

Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met ValTyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val

945 950 955 960945 950 955 960

Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly TyrAsp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr

965 970 975 965 970 975

Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln AsnGln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn

980 985 990 980 985 990

Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp ProGly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro

995 1000 1005 995 1000 1005

Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser IleSer Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile

1010 1015 1020 1010 1015 1020

Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met TyrAla Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr

1025 1030 1035 1025 1030 1035

Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys AsnVal Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn

1040 1045 1050 1040 1045 1050

Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu TyrPhe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr

1055 1060 1065 1055 1060 1065

Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys AsnSer Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn

1070 1075 1080 1070 1075 1080

Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr LysAsn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys

1085 1090 1095 1085 1090 1095

Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp IleGlu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile

1100 1105 1110 1100 1105 1110

Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser SerArg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser

1115 1120 1125 1115 1120 1125

Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser IlePhe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile

1130 1135 1140 1130 1135 1140

Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys AsnThr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn

1145 1150 1155 1145 1150 1155

Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln GluSer Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu

1160 1165 1170 1160 1165 1170

Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr AsnAsn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn

1175 1180 1185 1175 1180 1185

Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys AlaIle Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala

1190 1195 1200 1190 1195 1200

Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn LysGlu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys

1205 1210 1215 1205 1210 1215

Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser Gly GlyGlu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser Gly Gly

1220 1225 1230 1220 1225 1230

Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys LysSer Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys

1235 1240 1245 1235 1240 1245

Lys Arg Lys ValLys Arg Lys Val

1250 1250

<210> 24<210> 24

<211> 1664<211> 1664

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dFNCPF1-PBE-2NLS<223> dFNCPF1-PBE-2NLS

<400> 24<400> 24

Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val AlaMet Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala

1 5 10 151 5 10 15

Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu ValVal Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val

20 25 30 20 25 30

Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr GluPhe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu

35 40 45 35 40 45

Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln AsnIle Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn

50 55 60 50 55 60

Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr GluThr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu

65 70 75 8065 70 75 80

Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu SerArg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser

85 90 95 85 90 95

Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu SerTrp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser

100 105 110 100 105 110

Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr HisArg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His

115 120 125 115 120 125

His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser SerHis Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser

130 135 140 130 135 140

Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys TrpGly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp

145 150 155 160145 150 155 160

Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro ArgArg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg

165 170 175 165 170 175

Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys IleTyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile

180 185 190 180 185 190

Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln ProIle Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro

195 200 205 195 200 205

Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln ArgGln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg

210 215 220 210 215 220

Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser GluLeu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu

225 230 235 240225 230 235 240

Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Ile TyrThr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Ile Tyr

245 250 255 245 250 255

Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe GluGln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu

260 265 270 260 265 270

Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly LeuLeu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu

275 280 285 275 280 285

Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys GlnIle Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln

290 295 300 290 295 300

Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser SerIle Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser

305 310 315 320305 310 315 320

Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr PheVal Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe

325 330 335 325 330 335

Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys SerLys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser

340 345 350 340 345 350

Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp SerAla Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser

355 360 365 355 360 365

Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys LysGlu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys

370 375 380 370 375 380

Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp AsnGly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn

385 390 395 400385 390 395 400

Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp GluGly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu

405 410 415 405 410 415

Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe LysAla Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys

420 425 430 420 425 430

Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile ProGly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro

435 440 445 435 440 445

Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe LeuThr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu

450 455 460 450 455 460

Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu AlaGlu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala

465 470 475 480465 470 475 480

Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr PheIle Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe

485 490 495 485 490 495

Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser LeuAsp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu

500 505 510 500 505 510

Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln SerAsp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser

515 520 525 515 520 525

Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn GlyGly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly

530 535 540 530 535 540

Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr SerGlu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser

545 550 555 560545 550 555 560

Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val LeuGln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu

565 570 575 565 570 575

Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile AspPhe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp

580 585 590 580 585 590

Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe TyrLys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr

595 600 605 595 600 605

Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys GluGlu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu

610 615 620 610 615 620

Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp LeuThr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu

625 630 635 640625 630 635 640

Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser GlnSer Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln

645 650 655 645 650 655

Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu TyrGln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr

660 665 670 660 665 670

Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys LysIle Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys

675 680 685 675 680 685

Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu SerGlu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser

690 695 700 690 695 700

Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg AspLeu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp

705 710 715 720705 710 715 720

Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala AlaIle Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala

725 730 735 725 730 735

Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu AlaIle Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala

740 745 750 740 745 750

Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu GlnGln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln

755 760 765 755 760 765

Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp GlnAla Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln

770 775 780 770 775 780

Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln SerThr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser

785 790 795 800785 790 795 800

Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu ValGlu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val

805 810 815 805 810 815

Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr AsnPhe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn

820 825 830 820 825 830

Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys PheLys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe

835 840 845 835 840 845

Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys AsnLys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn

850 855 860 850 855 860

Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys TyrLys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr

865 870 875 880865 870 875 880

Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp LysTyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys

885 890 895 885 890 895

Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr LysAla Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr Lys

900 905 910 900 905 910

Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe Ser AlaLeu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe Ser Ala

915 920 925 915 920 925

Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu Arg Ile ArgLys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu Arg Ile Arg

930 935 940 930 935 940

Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln Lys Gly Tyr GluAsn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln Lys Gly Tyr Glu

945 950 955 960945 950 955 960

Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe Ile Asp Phe TyrLys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe Ile Asp Phe Tyr

965 970 975 965 970 975

Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp Phe Gly Phe ArgLys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp Phe Gly Phe Arg

980 985 990 980 985 990

Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu Phe Tyr Arg GluPhe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu Phe Tyr Arg Glu

995 1000 1005 995 1000 1005

Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn Ile Ser GluVal Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn Ile Ser Glu

1010 1015 1020 1010 1015 1020

Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr Leu PheSer Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr Leu Phe

1025 1030 1035 1025 1030 1035

Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg ProGln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg Pro

1040 1045 1050 1040 1045 1050

Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg AsnAsn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn

1055 1060 1065 1055 1060 1065

Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu PheLeu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe

1070 1075 1080 1070 1075 1080

Tyr Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala LysTyr Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys

1085 1090 1095 1085 1090 1095

Glu Ala Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu SerGlu Ala Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser

1100 1105 1110 1100 1105 1110

Val Phe Glu Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu AspVal Phe Glu Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp

1115 1120 1125 1115 1120 1125

Lys Phe Phe Phe His Cys Pro Ile Thr Ile Asn Phe Lys Ser SerLys Phe Phe Phe His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser

1130 1135 1140 1130 1135 1140

Gly Ala Asn Lys Phe Asn Asp Glu Ile Asn Leu Leu Leu Lys GluGly Ala Asn Lys Phe Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu

1145 1150 1155 1145 1150 1155

Lys Ala Asn Asp Val His Ile Leu Ser Ile Ala Arg Gly Glu ArgLys Ala Asn Asp Val His Ile Leu Ser Ile Ala Arg Gly Glu Arg

1160 1165 1170 1160 1165 1170

His Leu Ala Tyr Tyr Thr Leu Val Asp Gly Lys Gly Asn Ile IleHis Leu Ala Tyr Tyr Thr Leu Val Asp Gly Lys Gly Asn Ile Ile

1175 1180 1185 1175 1180 1185

Lys Gln Asp Thr Phe Asn Ile Ile Gly Asn Asp Arg Met Lys ThrLys Gln Asp Thr Phe Asn Ile Ile Gly Asn Asp Arg Met Lys Thr

1190 1195 1200 1190 1195 1200

Asn Tyr His Asp Lys Leu Ala Ala Ile Glu Lys Asp Arg Asp SerAsn Tyr His Asp Lys Leu Ala Ala Ile Glu Lys Asp Arg Asp Ser

1205 1210 1215 1205 1210 1215

Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn Ile Lys Glu Met LysAla Arg Lys Asp Trp Lys Lys Ile Asn Asn Ile Lys Glu Met Lys

1220 1225 1230 1220 1225 1230

Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile Ala Lys Leu ValGlu Gly Tyr Leu Ser Gln Val Val His Glu Ile Ala Lys Leu Val

1235 1240 1245 1235 1240 1245

Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu Asn Phe GlyIle Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu Asn Phe Gly

1250 1255 1260 1250 1255 1260

Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val Tyr Gln LysPhe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val Tyr Gln Lys

1265 1270 1275 1265 1270 1275

Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu Val Phe LysLeu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu Val Phe Lys

1280 1285 1290 1280 1285 1290

Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg Ala Tyr GlnAsp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg Ala Tyr Gln

1295 1300 1305 1295 1300 1305

Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly Lys Gln ThrLeu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly Lys Gln Thr

1310 1315 1320 1310 1315 1320

Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser Lys Ile CysGly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser Lys Ile Cys

1325 1330 1335 1325 1330 1335

Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys Tyr Glu SerPro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys Tyr Glu Ser

1340 1345 1350 1340 1345 1350

Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp Lys Ile CysVal Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp Lys Ile Cys

1355 1360 1365 1355 1360 1365

Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe Asp Tyr LysTyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe Asp Tyr Lys

1370 1375 1380 1370 1375 1380

Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr Ile Ala SerAsn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr Ile Ala Ser

1385 1390 1395 1385 1390 1395

Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp Lys Asn HisPhe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp Lys Asn His

1400 1405 1410 1400 1405 1410

Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu Leu Glu LysAsn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu Leu Glu Lys

1415 1420 1425 1415 1420 1425

Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly Glu Cys IleLeu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly Glu Cys Ile

1430 1435 1440 1430 1435 1440

Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe Phe Ala LysLys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe Phe Ala Lys

1445 1450 1455 1445 1450 1455

Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg Asn Ser LysLeu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg Asn Ser Lys

1460 1465 1470 1460 1465 1470

Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val Ala Asp ValThr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val Ala Asp Val

1475 1480 1485 1475 1480 1485

Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys Asn Met ProAsn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys Asn Met Pro

1490 1495 1500 1490 1495 1500

Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly Leu Lys GlyGln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly Leu Lys Gly

1505 1510 1515 1505 1510 1515

Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu Gly Lys LysLeu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu Gly Lys Lys

1520 1525 1530 1520 1525 1530

Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu Phe Val GlnLeu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu Phe Val Gln

1535 1540 1545 1535 1540 1545

Asn Arg Asn Asn Thr Arg Asp Ser Gly Gly Ser Thr Asn Leu SerAsn Arg Asn Asn Thr Arg Asp Ser Gly Gly Ser Thr Asn Leu Ser

1550 1555 1560 1550 1555 1560

Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val Ile Gln GluAsp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val Ile Gln Glu

1565 1570 1575 1565 1570 1575

Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile Gly AsnSer Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile Gly Asn

1580 1585 1590 1580 1585 1590

Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu SerLys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu Ser

1595 1600 1605 1595 1600 1605

Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu TyrThr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr

1610 1615 1620 1610 1615 1620

Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn LysLys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys

1625 1630 1635 1625 1630 1635

Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys ValIle Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val

1640 1645 1650 1640 1645 1650

Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys ValSer Gly Gly Ser Pro Lys Lys Lys Arg Lys Val

1655 1660 1655 1660

<210> 25<210> 25

<211> 1592<211> 1592

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dLBCPF1-PBE-2NLS<223> dLBCPF1-PBE-2NLS

<400> 25<400> 25

Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val AlaMet Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala

1 5 10 151 5 10 15

Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu ValVal Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val

20 25 30 20 25 30

Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr GluPhe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu

35 40 45 35 40 45

Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln AsnIle Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn

50 55 60 50 55 60

Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr GluThr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu

65 70 75 8065 70 75 80

Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu SerArg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser

85 90 95 85 90 95

Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu SerTrp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser

100 105 110 100 105 110

Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr HisArg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His

115 120 125 115 120 125

His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser SerHis Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser

130 135 140 130 135 140

Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys TrpGly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp

145 150 155 160145 150 155 160

Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro ArgArg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg

165 170 175 165 170 175

Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys IleTyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile

180 185 190 180 185 190

Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln ProIle Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro

195 200 205 195 200 205

Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln ArgGln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg

210 215 220 210 215 220

Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser GluLeu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu

225 230 235 240225 230 235 240

Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Lys LeuThr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Lys Leu

245 250 255 245 250 255

Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe LysGlu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys

260 265 270 260 265 270

Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg LeuAla Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu

275 280 285 275 280 285

Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys LysLeu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys

290 295 300 290 295 300

Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His SerLeu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser

305 310 315 320305 310 315 320

Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys LysIle Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys

325 330 335 325 330 335

Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile AsnThr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn

340 345 350 340 345 350

Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr LysLeu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys

355 360 365 355 360 365

Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe LeuSer Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu

370 375 380 370 375 380

Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe ThrAsp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr

385 390 395 400385 390 395 400

Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser GluThr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu

405 410 415 405 410 415

Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn LeuGlu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu

420 425 430 420 425 430

Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala IleThr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile

435 440 445 435 440 445

Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn SerPhe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser

450 455 460 450 455 460

Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe ValAsp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val

465 470 475 480465 470 475 480

Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly PheLeu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe

485 490 495 485 490 495

Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile AsnVal Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn

500 505 510 500 505 510

Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro LeuLeu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu

515 520 525 515 520 525

Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly GluTyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu

530 535 540 530 535 540

Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr LeuGly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu

545 550 555 560545 550 555 560

Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys LeuAsn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu

565 570 575 565 570 575

Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys AsnPhe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn

580 585 590 580 585 590

Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp AsnGly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn

595 600 605 595 600 605

Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu LysVal Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys

610 615 620 610 615 620

Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys SerLys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser

625 630 635 640625 630 635 640

Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr AlaPhe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala

645 650 655 645 650 655

Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile GlnAsp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln

660 665 670 660 665 670

Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu PheLys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe

675 680 685 675 680 685

Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala ValAsp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val

690 695 700 690 695 700

Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu AsnVal Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn

705 710 715 720705 710 715 720

Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp GluTyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu

725 730 735 725 730 735

Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys ValSer Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val

740 745 750 740 745 750

Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro TyrAsp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr

755 760 765 755 760 765

Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met GlySer Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly

770 775 780 770 775 780

Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu ArgGly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg

785 790 795 800785 790 795 800

Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala LysTyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys

805 810 815 805 810 815

Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu LysCys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys

820 825 830 820 825 830

Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys ValIle Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val

835 840 845 835 840 845

Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp IlePhe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile

850 855 860 850 855 860

Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe AsnGln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn

865 870 875 880865 870 875 880

Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile SerLeu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser

885 890 895 885 890 895

Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu ThrArg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr

900 905 910 900 905 910

Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu GlnGlu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln

915 920 925 915 920 925

Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp LysGly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys

930 935 940 930 935 940

Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys AspLeu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp

945 950 955 960945 950 955 960

Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His Thr Met Tyr PhePhe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe

965 970 975 965 970 975

Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser GlyLys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly

980 985 990 980 985 990

Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu LeuGly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu

995 1000 1005 995 1000 1005

Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro AspVal Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp

1010 1015 1020 1010 1015 1020

Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr Lys AspAsn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp

1025 1030 1035 1025 1030 1035

Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile AlaLys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile Ala

1040 1045 1050 1040 1045 1050

Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu ValIle Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val

1055 1060 1065 1055 1060 1065

Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly IleArg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile

1070 1075 1080 1070 1075 1080

Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp GlyAla Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly

1085 1090 1095 1085 1090 1095

Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile AsnLys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn

1100 1105 1110 1100 1105 1110

Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu LeuAsn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu

1115 1120 1125 1115 1120 1125

Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp ThrAsp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr

1130 1135 1140 1130 1135 1140

Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser GlnSer Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln

1145 1150 1155 1145 1150 1155

Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala ValVal Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val

1160 1165 1170 1160 1165 1170

Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg ValIle Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val

1175 1180 1185 1175 1180 1185

Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu IleLys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile

1190 1195 1200 1190 1195 1200

Asp Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys AlaAsp Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala

1205 1210 1215 1205 1210 1215

Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe GluThr Gly Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu

1220 1225 1230 1220 1225 1230

Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr IleSer Phe Lys Ser Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile

1235 1240 1245 1235 1240 1245

Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr Gly Phe ValPro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr Gly Phe Val

1250 1255 1260 1250 1255 1260

Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp Ser Lys LysAsn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys

1265 1270 1275 1265 1270 1275

Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro Glu Glu AspPhe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro Glu Glu Asp

1280 1285 1290 1280 1285 1290

Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser Arg Thr AspLeu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp

1295 1300 1305 1295 1300 1305

Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr Gly Asn ArgAla Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg

1310 1315 1320 1310 1315 1320

Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val Phe Asp TrpIle Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val Phe Asp Trp

1325 1330 1335 1325 1330 1335

Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu Phe Asn LysGlu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys

1340 1345 1350 1340 1345 1350

Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala Leu Leu CysTyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys

1355 1360 1365 1355 1360 1365

Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met Ala Leu MetGlu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met Ala Leu Met

1370 1375 1380 1370 1375 1380

Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly Arg Thr AspSer Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly Arg Thr Asp

1385 1390 1395 1385 1390 1395

Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp Gly Ile PheVal Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp Gly Ile Phe

1400 1405 1410 1400 1405 1410

Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala Ile Leu ProTyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro

1415 1420 1425 1415 1420 1425

Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala Arg Lys ValLys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val

1430 1435 1440 1430 1435 1440

Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp Glu Lys LeuLeu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu

1445 1450 1455 1445 1450 1455

Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp Leu Glu TyrAsp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr

1460 1465 1470 1460 1465 1470

Ala Gln Thr Ser Val Lys His Thr Arg Asp Ser Gly Gly Ser ThrAla Gln Thr Ser Val Lys His Thr Arg Asp Ser Gly Gly Ser Thr

1475 1480 1485 1475 1480 1485

Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu ValAsn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val

1490 1495 1500 1490 1495 1500

Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu ValIle Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val

1505 1510 1515 1505 1510 1515

Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala TyrIle Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr

1520 1525 1530 1520 1525 1530

Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp AlaAsp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala

1535 1540 1545 1535 1540 1545

Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn GlyPro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly

1550 1555 1560 1550 1555 1560

Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys LysGlu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys

1565 1570 1575 1565 1570 1575

Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys ValArg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val

1580 1585 1590 1580 1585 1590

<210> 26<210> 26

<211> 1731<211> 1731

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dFNCPF1-ABE7.10-2NLS<223> dFNCPF1-ABE7.10-2NLS

<400> 26<400> 26

Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val GluMet Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu

1 5 10 151 5 10 15

Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys ArgPhe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg

20 25 30 20 25 30

Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His AsnAla Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn

35 40 45 35 40 45

Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His AspAsn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp

50 55 60 50 55 60

Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu ValPro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val

65 70 75 8065 70 75 80

Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu GluMet Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu

85 90 95 85 90 95

Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly ArgPro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg

100 105 110 100 105 110

Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser LeuVal Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu

115 120 125 115 120 125

Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile ThrMet Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr

130 135 140 130 135 140

Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe PheGlu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe

145 150 155 160145 150 155 160

Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser SerArg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser

165 170 175 165 170 175

Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr ProThr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro

180 185 190 180 185 190

Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser GlyGly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly

195 200 205 195 200 205

Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His AlaGly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala

210 215 220 210 215 220

Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val GlyLeu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly

225 230 235 240225 230 235 240

Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn ArgAla Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg

245 250 255 245 250 255

Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala LeuAla Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu

260 265 270 260 265 270

Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala ThrArg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr

275 280 285 275 280 285

Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met IleLeu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile

290 295 300 290 295 300

His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys ThrHis Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr

305 310 315 320305 310 315 320

Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met AsnGly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn

325 330 335 325 330 335

His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala AlaHis Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala

340 345 350 340 345 350

Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala GlnLeu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln

355 360 365 355 360 365

Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly SerLys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser

370 375 380 370 375 380

Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu SerSer Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser

385 390 395 400385 390 395 400

Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Ile Tyr Gln Glu PheSer Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Ile Tyr Gln Glu Phe

405 410 415 405 410 415

Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu Leu Ile ProVal Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu Leu Ile Pro

420 425 430 420 425 430

Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu Ile Leu AspGln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu Ile Leu Asp

435 440 445 435 440 445

Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln Ile Ile AspAsp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln Ile Ile Asp

450 455 460 450 455 460

Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser Val Cys IleLys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser Val Cys Ile

465 470 475 480465 470 475 480

Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe Lys Leu LysSer Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe Lys Leu Lys

485 490 495 485 490 495

Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser Ala Lys AspLys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser Ala Lys Asp

500 505 510 500 505 510

Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser Glu Lys PheThr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser Glu Lys Phe

515 520 525 515 520 525

Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys Gly Gln GluLys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys Gly Gln Glu

530 535 540 530 535 540

Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn Gly Ile GluSer Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn Gly Ile Glu

545 550 555 560545 550 555 560

Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu Ala Leu GluLeu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu Ala Leu Glu

565 570 575 565 570 575

Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys Gly Phe HisIle Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys Gly Phe His

580 585 590 580 585 590

Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro Thr Ser IleGlu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro Thr Ser Ile

595 600 605 595 600 605

Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu Glu Asn LysIle Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu Glu Asn Lys

610 615 620 610 615 620

Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala Ile Asn TyrAla Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala Ile Asn Tyr

625 630 635 640625 630 635 640

Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe Asp Ile AspGlu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe Asp Ile Asp

645 650 655 645 650 655

Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu Asp Glu ValTyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu Asp Glu Val

660 665 670 660 665 670

Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser Gly Ile ThrPhe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser Gly Ile Thr

675 680 685 675 680 685

Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly Glu Asn ThrLys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly Glu Asn Thr

690 695 700 690 695 700

Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser Gln Gln IleLys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser Gln Gln Ile

705 710 715 720705 710 715 720

Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu Phe Lys GlnAsn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu Phe Lys Gln

725 730 735 725 730 735

Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp Lys Leu GluIle Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp Lys Leu Glu

740 745 750 740 745 750

Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr Glu Gln IleAsp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr Glu Gln Ile

755 760 765 755 760 765

Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu Thr Leu SerAla Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu Thr Leu Ser

770 775 780 770 775 780

Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu Ser Lys IleLeu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu Ser Lys Ile

785 790 795 800785 790 795 800

Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln Gln Val PheTyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln Gln Val Phe

805 810 815 805 810 815

Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr Ile Thr GlnAsp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr Ile Thr Gln

820 825 830 820 825 830

Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys Glu Gln GluGln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys Glu Gln Glu

835 840 845 835 840 845

Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser Leu Glu ThrLeu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser Leu Glu Thr

850 855 860 850 855 860

Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp Ile Asp LysIle Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp Ile Asp Lys

865 870 875 880865 870 875 880

Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala Ile Pro MetGln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala Ile Pro Met

885 890 895 885 890 895

Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala Gln Ile SerIle Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala Gln Ile Ser

900 905 910 900 905 910

Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln Ala Ser AlaIle Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln Ala Ser Ala

915 920 925 915 920 925

Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln Thr Asn AsnGlu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln Thr Asn Asn

930 935 940 930 935 940

Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser Glu Asp LysLeu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser Glu Asp Lys

945 950 955 960945 950 955 960

Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val Phe Glu GluAla Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val Phe Glu Glu

965 970 975 965 970 975

Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn Lys Ile ArgCys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn Lys Ile Arg

980 985 990 980 985 990

Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe Lys Leu AsnAsn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe Lys Leu Asn

995 1000 1005 995 1000 1005

Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn Lys GluPhe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn Lys Glu

1010 1015 1020 1010 1015 1020

Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr TyrPro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr Tyr

1025 1030 1035 1025 1030 1035

Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp LysLeu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys

1040 1045 1050 1040 1045 1050

Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val TyrAla Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr

1055 1060 1065 1055 1060 1065

Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe PheLys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe

1070 1075 1080 1070 1075 1080

Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile LeuSer Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu

1085 1090 1095 1085 1090 1095

Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro GlnArg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln

1100 1105 1110 1100 1105 1110

Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg LysLys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys

1115 1120 1125 1115 1120 1125

Phe Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu TrpPhe Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp

1130 1135 1140 1130 1135 1140

Lys Asp Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn SerLys Asp Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser

1145 1150 1155 1145 1150 1155

Ile Asp Glu Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys LeuIle Asp Glu Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu

1160 1165 1170 1160 1165 1170

Thr Phe Glu Asn Ile Ser Glu Ser Tyr Ile Asp Ser Val Val AsnThr Phe Glu Asn Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn

1175 1180 1185 1175 1180 1185

Gln Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe SerGln Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser

1190 1195 1200 1190 1195 1200

Ala Tyr Ser Lys Gly Arg Pro Asn Leu His Thr Leu Tyr Trp LysAla Tyr Ser Lys Gly Arg Pro Asn Leu His Thr Leu Tyr Trp Lys

1205 1210 1215 1205 1210 1215

Ala Leu Phe Asp Glu Arg Asn Leu Gln Asp Val Val Tyr Lys LeuAla Leu Phe Asp Glu Arg Asn Leu Gln Asp Val Val Tyr Lys Leu

1220 1225 1230 1220 1225 1230

Asn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln Ser Ile Pro LysAsn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln Ser Ile Pro Lys

1235 1240 1245 1235 1240 1245

Lys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn Lys Asn LysLys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn Lys Asn Lys

1250 1255 1260 1250 1255 1260

Asp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu Ile LysAsp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu Ile Lys

1265 1270 1275 1265 1270 1275

Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro IleAsp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro Ile

1280 1285 1290 1280 1285 1290

Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp GluThr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp Glu

1295 1300 1305 1295 1300 1305

Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile LeuIle Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile Leu

1310 1315 1320 1310 1315 1320

Ser Ile Ala Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu ValSer Ile Ala Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu Val

1325 1330 1335 1325 1330 1335

Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile IleAsp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile

1340 1345 1350 1340 1345 1350

Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala AlaGly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala

1355 1360 1365 1355 1360 1365

Ile Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys IleIle Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile

1370 1375 1380 1370 1375 1380

Asn Asn Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val ValAsn Asn Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val

1385 1390 1395 1385 1390 1395

His Glu Ile Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val ValHis Glu Ile Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val

1400 1405 1410 1400 1405 1410

Phe Glu Asp Leu Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys ValPhe Glu Asp Leu Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val

1415 1420 1425 1415 1420 1425

Glu Lys Gln Val Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu LysGlu Lys Gln Val Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys

1430 1435 1440 1430 1435 1440

Leu Asn Tyr Leu Val Phe Lys Asp Asn Glu Phe Asp Lys Thr GlyLeu Asn Tyr Leu Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly

1445 1450 1455 1445 1450 1455

Gly Val Leu Arg Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr PheGly Val Leu Arg Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe

1460 1465 1470 1460 1465 1470

Lys Lys Met Gly Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro AlaLys Lys Met Gly Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala

1475 1480 1485 1475 1480 1485

Gly Phe Thr Ser Lys Ile Cys Pro Val Thr Gly Phe Val Asn GlnGly Phe Thr Ser Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln

1490 1495 1500 1490 1495 1500

Leu Tyr Pro Lys Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe PheLeu Tyr Pro Lys Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe

1505 1510 1515 1505 1510 1515

Ser Lys Phe Asp Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr PheSer Lys Phe Asp Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe

1520 1525 1530 1520 1525 1530

Glu Phe Ser Phe Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala LysGlu Phe Ser Phe Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys

1535 1540 1545 1535 1540 1545

Gly Lys Trp Thr Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn PheGly Lys Trp Thr Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe

1550 1555 1560 1550 1555 1560

Arg Asn Ser Asp Lys Asn His Asn Trp Asp Thr Arg Glu Val TyrArg Asn Ser Asp Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr

1565 1570 1575 1565 1570 1575

Pro Thr Lys Glu Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile GluPro Thr Lys Glu Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu

1580 1585 1590 1580 1585 1590

Tyr Gly His Gly Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu SerTyr Gly His Gly Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser

1595 1600 1605 1595 1600 1605

Asp Lys Lys Phe Phe Ala Lys Leu Thr Ser Val Leu Asn Thr IleAsp Lys Lys Phe Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile

1610 1615 1620 1610 1615 1620

Leu Gln Met Arg Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr LeuLeu Gln Met Arg Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu

1625 1630 1635 1625 1630 1635

Ile Ser Pro Val Ala Asp Val Asn Gly Asn Phe Phe Asp Ser ArgIle Ser Pro Val Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg

1640 1645 1650 1640 1645 1650

Gln Ala Pro Lys Asn Met Pro Gln Asp Ala Asp Ala Asn Gly AlaGln Ala Pro Lys Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala

1655 1660 1665 1655 1660 1665

Tyr His Ile Gly Leu Lys Gly Leu Met Leu Leu Gly Arg Ile LysTyr His Ile Gly Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys

1670 1675 1680 1670 1675 1680

Asn Asn Gln Glu Gly Lys Lys Leu Asn Leu Val Ile Lys Asn GluAsn Asn Gln Glu Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu

1685 1690 1695 1685 1690 1695

Glu Tyr Phe Glu Phe Val Gln Asn Arg Asn Asn Ser Gly Gly SerGlu Tyr Phe Glu Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser

1700 1705 1710 1700 1705 1710

Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys LysPro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys

1715 1720 1725 1715 1720 1725

Arg Lys ValArg Lys Val

1730 1730

<210> 27<210> 27

<211> 1659<211> 1659

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> dLBCPF1-ABE7.10-2NLS<223> dLBCPF1-ABE7.10-2NLS

<400> 27<400> 27

Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val GluMet Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu

1 5 10 151 5 10 15

Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys ArgPhe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg

20 25 30 20 25 30

Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His AsnAla Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn

35 40 45 35 40 45

Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His AspAsn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp

50 55 60 50 55 60

Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu ValPro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val

65 70 75 8065 70 75 80

Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu GluMet Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu

85 90 95 85 90 95

Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly ArgPro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg

100 105 110 100 105 110

Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser LeuVal Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu

115 120 125 115 120 125

Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile ThrMet Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr

130 135 140 130 135 140

Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe PheGlu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe

145 150 155 160145 150 155 160

Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser SerArg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser

165 170 175 165 170 175

Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr ProThr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro

180 185 190 180 185 190

Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser GlyGly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly

195 200 205 195 200 205

Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His AlaGly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala

210 215 220 210 215 220

Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val GlyLeu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly

225 230 235 240225 230 235 240

Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn ArgAla Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg

245 250 255 245 250 255

Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala LeuAla Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu

260 265 270 260 265 270

Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala ThrArg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr

275 280 285 275 280 285

Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met IleLeu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile

290 295 300 290 295 300

His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys ThrHis Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr

305 310 315 320305 310 315 320

Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met AsnGly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn

325 330 335 325 330 335

His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala AlaHis Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala

340 345 350 340 345 350

Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala GlnLeu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln

355 360 365 355 360 365

Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly SerLys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser

370 375 380 370 375 380

Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu SerSer Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser

385 390 395 400385 390 395 400

Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Lys Leu Glu Lys PheSer Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Lys Leu Glu Lys Phe

405 410 415 405 410 415

Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile ProThr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro

420 425 430 420 425 430

Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val GluVal Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu

435 440 445 435 440 445

Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu AspAsp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp

450 455 460 450 455 460

Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys LeuArg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu

465 470 475 480465 470 475 480

Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg ThrLys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr

485 490 495 485 490 495

Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg LysGlu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys

500 505 510 500 505 510

Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu PheGlu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe

515 520 525 515 520 525

Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp LysLys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys

530 535 540 530 535 540

Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala PheAsp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe

545 550 555 560545 550 555 560

Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala LysThr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys

565 570 575 565 570 575

Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg TyrSer Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr

580 585 590 580 585 590

Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp LysIle Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys

595 600 605 595 600 605

His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr AspHis Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp

610 615 620 610 615 620

Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr GlnVal Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln

625 630 635 640625 630 635 640

Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr GluGlu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu

645 650 655 645 650 655

Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr AsnSer Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn

660 665 670 660 665 670

Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys GlnGln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln

675 680 685 675 680 685

Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr ThrVal Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr

690 695 700 690 695 700

Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys AsnSer Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn

705 710 715 720705 710 715 720

Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys AsnSer Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn

725 730 735 725 730 735

Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro AlaPhe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala

740 745 750 740 745 750

Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile ArgIle Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg

755 760 765 755 760 765

Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys AlaAsp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala

770 775 780 770 775 780

Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys LysVal Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys

785 790 795 800785 790 795 800

Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala AspIle Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp

805 810 815 805 810 815

Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val AspLeu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp

820 825 830 820 825 830

Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala AspGlu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp

835 840 845 835 840 845

Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala IlePhe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile

850 855 860 850 855 860

Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile LysMet Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys

865 870 875 880865 870 875 880

Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe TyrAla Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr

885 890 895 885 890 895

Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His IleGly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile

900 905 910 900 905 910

Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys AspTyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp

915 920 925 915 920 925

Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp AspLys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp

930 935 940 930 935 940

Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly SerLys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser

945 950 955 960945 950 955 960

Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu GlnLys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln

965 970 975 965 970 975

Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn TyrLys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr

980 985 990 980 985 990

Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe SerLys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser

995 1000 1005 995 1000 1005

Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln LysLys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys

1010 1015 1020 1010 1015 1020

Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn LeuIle Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu

1025 1030 1035 1025 1030 1035

Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile SerAsn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser

1040 1045 1050 1040 1045 1050

Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser GluArg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu

1055 1060 1065 1055 1060 1065

Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val GluThr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu

1070 1075 1080 1070 1075 1080

Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys GluGlu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu

1085 1090 1095 1085 1090 1095

Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln IleVal Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile

1100 1105 1110 1100 1105 1110

Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn LeuTyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu

1115 1120 1125 1115 1120 1125

His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His GlyHis Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly

1130 1135 1140 1130 1135 1140

Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg AlaGln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala

1145 1150 1155 1145 1150 1155

Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser ProSer Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro

1160 1165 1170 1160 1165 1170

Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr LeuIle Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu

1175 1180 1185 1175 1180 1185

Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln TyrSer Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr

1190 1195 1200 1190 1195 1200

Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn IleGlu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile

1205 1210 1215 1205 1210 1215

Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp AspPhe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp

1220 1225 1230 1220 1225 1230

Asn Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu LeuAsn Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu

1235 1240 1245 1235 1240 1245

Tyr Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln TyrTyr Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr

1250 1255 1260 1250 1255 1260

Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile LysSer Leu Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys

1265 1270 1275 1265 1270 1275

Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg PheThr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe

1280 1285 1290 1280 1285 1290

Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu LeuGlu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu

1295 1300 1305 1295 1300 1305

Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys Glu LeuLys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys Glu Leu

1310 1315 1320 1310 1315 1320

Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn SerVal Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn Ser

1325 1330 1335 1325 1330 1335

Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr GlnGly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln

1340 1345 1350 1340 1345 1350

Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val AspLys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp

1355 1360 1365 1355 1360 1365

Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly TyrLys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr

1370 1375 1380 1370 1375 1380

Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr GlnGln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln

1385 1390 1395 1385 1390 1395

Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys IleAsn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile

1400 1405 1410 1400 1405 1410

Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr ThrAsp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr

1415 1420 1425 1415 1420 1425

Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg IleSer Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile

1430 1435 1440 1430 1435 1440

Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp TyrMet Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr

1445 1450 1455 1445 1450 1455

Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp LysLys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys

1460 1465 1470 1460 1465 1470

Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro LysLeu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys

1475 1480 1485 1475 1480 1485

Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser AlaLys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala

1490 1495 1500 1490 1495 1500

Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln GlyTyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly

1505 1510 1515 1505 1510 1515

Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe TyrAsp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr

1520 1525 1530 1520 1525 1530

Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg AsnSer Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn

1535 1540 1545 1535 1540 1545

Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro ValSer Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val

1550 1555 1560 1550 1555 1560

Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu AlaLys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala

1565 1570 1575 1565 1570 1575

Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly AlaGln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala

1580 1585 1590 1580 1585 1590

Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe LysTyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys

1595 1600 1605 1595 1600 1605

Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile SerLys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser

1610 1615 1620 1610 1615 1620

Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His SerAsn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser

1625 1630 1635 1625 1630 1635

Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser ProGly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro

1640 1645 1650 1640 1645 1650

Lys Lys Lys Arg Lys ValLys Lys Lys Arg Lys Val

1655 1655

<210> 28<210> 28

<211> 1656<211> 1656

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-ABE2-X2<223> LBCPF1-ABE2-X2

<400> 28<400> 28

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys ThrMet Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr

1 5 10 151 5 10 15

Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile AspLeu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp

20 25 30 20 25 30

Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr LysAsn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys

35 40 45 35 40 45

Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn AspGly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp

50 55 60 50 55 60

Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser LeuVal Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu

65 70 75 8065 70 75 80

Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu AsnPhe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn

85 90 95 85 90 95

Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly AsnLeu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn

100 105 110 100 105 110

Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile LeuGlu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu

115 120 125 115 120 125

Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser PhePro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe

130 135 140 130 135 140

Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu AsnAsn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn

145 150 155 160145 150 155 160

Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys IleMet Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile

165 170 175 165 170 175

Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu LysAsn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys

180 185 190 180 185 190

Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu LysVal Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys

195 200 205 195 200 205

Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu PheIle Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe

210 215 220 210 215 220

Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala IlePhe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile

225 230 235 240225 230 235 240

Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu AsnIle Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn

245 250 255 245 250 255

Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro LysGlu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys

260 265 270 260 265 270

Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu SerPhe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser

275 280 285 275 280 285

Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val PhePhe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe

290 295 300 290 295 300

Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys LysArg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys

305 310 315 320305 310 315 320

Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly IleLeu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile

325 330 335 325 330 335

Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile PhePhe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe

340 345 350 340 345 350

Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp AspGly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp

355 360 365 355 360 365

Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp AspIle His Leu Lys Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp

370 375 380 370 375 380

Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln LeuArg Arg Lys Ser Phe Lys Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu

385 390 395 400385 390 395 400

Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys GluGln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu

405 410 415 405 410 415

Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser SerIle Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser

420 425 430 420 425 430

Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys LysGlu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys

435 440 445 435 440 445

Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val LysAsn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys

450 455 460 450 455 460

Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu ThrSer Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr

465 470 475 480465 470 475 480

Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp IleAsn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile

485 490 495 485 490 495

Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val ThrLeu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr

500 505 510 500 505 510

Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn ProGln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro

515 520 525 515 520 525

Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg AlaGln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala

530 535 540 530 535 540

Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp LysThr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys

545 550 555 560545 550 555 560

Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn GlyLys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly

565 570 575 565 570 575

Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys MetAsn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met

580 585 590 580 585 590

Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn ProLeu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro

595 600 605 595 600 605

Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys GlySer Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly

610 615 620 610 615 620

Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe LysAsp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys

625 630 635 640625 630 635 640

Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe AsnAsp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn

645 650 655 645 650 655

Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg GluPhe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu

660 665 670 660 665 670

Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys LysVal Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys

675 680 685 675 680 685

Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln IleGlu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile

690 695 700 690 695 700

Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu HisTyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His

705 710 715 720705 710 715 720

Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln IleThr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile

725 730 735 725 730 735

Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu LysArg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys

740 745 750 740 745 750

Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn LysLys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys

755 760 765 755 760 765

Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val TyrAsn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr

770 775 780 770 775 780

Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro IleLys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile

785 790 795 800785 790 795 800

Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu ValAla Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val

805 810 815 805 810 815

Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile AlaArg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Ala

820 825 830 820 825 830

Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys GlyArg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly

835 840 845 835 840 845

Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe AsnAsn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn

850 855 860 850 855 860

Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys GluGly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu

865 870 875 880865 870 875 880

Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn IleLys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile

885 890 895 885 890 895

Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile CysLys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys

900 905 910 900 905 910

Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu AsnGlu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn

915 920 925 915 920 925

Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr GlnSer Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln

930 935 940 930 935 940

Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp LysLys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys

945 950 955 960945 950 955 960

Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln IleLys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile

965 970 975 965 970 975

Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly PheThr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe

980 985 990 980 985 990

Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser ThrIle Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr

995 1000 1005 995 1000 1005

Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala AspGly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp

1010 1015 1020 1010 1015 1020

Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val ProSer Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro

1025 1030 1035 1025 1030 1035

Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe SerGlu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser

1040 1045 1050 1040 1045 1050

Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser TyrArg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr

1055 1060 1065 1055 1060 1065

Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn ValGly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val

1070 1075 1080 1070 1075 1080

Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu LeuPhe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu

1085 1090 1095 1085 1090 1095

Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg AlaPhe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala

1100 1105 1110 1100 1105 1110

Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe MetLeu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met

1115 1120 1125 1115 1120 1125

Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr GlyAla Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly

1130 1135 1140 1130 1135 1140

Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser AspArg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp

1145 1150 1155 1145 1150 1155

Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn AlaGly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala

1160 1165 1170 1160 1165 1170

Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile AlaIle Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala

1175 1180 1185 1175 1180 1185

Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu AspArg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp

1190 1195 1200 1190 1195 1200

Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu TrpGlu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp

1205 1210 1215 1205 1210 1215

Leu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Leu Met Pro LysLeu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Leu Met Pro Lys

1220 1225 1230 1220 1225 1230

Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu Phe SerLys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu Phe Ser

1235 1240 1245 1235 1240 1245

His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg AlaHis Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala

1250 1255 1260 1250 1255 1260

Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His AsnTrp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn

1265 1270 1275 1265 1270 1275

Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg HisAsn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His

1280 1285 1290 1280 1285 1290

Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly GlyAsp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly

1295 1300 1305 1295 1300 1305

Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr ValLeu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val

1310 1315 1320 1310 1315 1320

Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His SerThr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser

1325 1330 1335 1325 1330 1335

Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr GlyArg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly

1340 1345 1350 1340 1345 1350

Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met AsnAla Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn

1355 1360 1365 1355 1360 1365

His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys AlaHis Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala

1370 1375 1380 1370 1375 1380

Ala Leu Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile LysAla Leu Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys

1385 1390 1395 1385 1390 1395

Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser SerAla Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser

1400 1405 1410 1400 1405 1410

Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser AlaGly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala

1415 1420 1425 1415 1420 1425

Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ser Glu ValThr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ser Glu Val

1430 1435 1440 1430 1435 1440

Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu AlaGlu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala

1445 1450 1455 1445 1450 1455

Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala Val LeuLys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu

1460 1465 1470 1460 1465 1470

Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala IleVal Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala Ile

1475 1480 1485 1475 1480 1485

Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu ArgGly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg

1490 1495 1500 1490 1495 1500

Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala ThrGln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr

1505 1510 1515 1505 1510 1515

Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala MetLeu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met

1520 1525 1530 1520 1525 1530

Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn AlaIle His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala

1535 1540 1545 1535 1540 1545

Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr ProLys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro

1550 1555 1560 1550 1555 1560

Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala AspGly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp

1565 1570 1575 1565 1570 1575

Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg GlnGlu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln

1580 1585 1590 1580 1585 1590

Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser GlyVal Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly

1595 1600 1605 1595 1600 1605

Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr SerGly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser

1610 1615 1620 1610 1615 1620

Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly SerGlu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser

1625 1630 1635 1625 1630 1635

Leu Lys Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala LysLeu Lys Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys

1640 1645 1650 1640 1645 1650

Lys Lys LysLys Lys Lys

1655 1655

<210> 29<210> 29

<211> 1663<211> 1663

<212> PRT<212> PRT

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> LBCPF1-ABE2-X3<223> LBCPF1-ABE2-X3

<400> 29<400> 29

Met Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr AsnMet Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr Asn

1 5 10 151 5 10 15

Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val GlyCys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val Gly

20 25 30 20 25 30

Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp GluLys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp Glu

35 40 45 35 40 45

Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg TyrLys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg Tyr

50 55 60 50 55 60

Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys AsnTyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys Asn

65 70 75 8065 70 75 80

Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu LysLeu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu Lys

85 90 95 85 90 95

Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu IleGlu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu Ile

100 105 110 100 105 110

Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys LysAla Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys Lys

115 120 125 115 120 125

Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp GluAsp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp Glu

130 135 140 130 135 140

Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr GlyIle Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr Gly

145 150 155 160145 150 155 160

Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser ThrPhe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser Thr

165 170 175 165 170 175

Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile SerSer Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser

180 185 190 180 185 190

Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His GluAsn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu

195 200 205 195 200 205

Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val GluVal Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu

210 215 220 210 215 220

Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu GlyAsp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly

225 230 235 240225 230 235 240

Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser GlyIle Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly

245 250 255 245 250 255

Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln LysGlu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys

260 265 270 260 265 270

Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val LeuThr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu

275 280 285 275 280 285

Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser AspSer Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp

290 295 300 290 295 300

Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser GluGlu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu

305 310 315 320305 310 315 320

Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe AspIle Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp

325 330 335 325 330 335

Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile SerGlu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser

340 345 350 340 345 350

Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp LysThr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys

355 360 365 355 360 365

Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val ValTrp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val Val

370 375 380 370 375 380

Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile GlyThr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly

385 390 395 400385 390 395 400

Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu SerSer Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser

405 410 415 405 410 415

Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu IleVal Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile

420 425 430 420 425 430

Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe ValTyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val

435 440 445 435 440 445

Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met LysLeu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys

450 455 460 450 455 460

Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala PheAsp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe

465 470 475 480465 470 475 480

Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly AspPhe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp

485 490 495 485 490 495

Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr AspPhe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp

500 505 510 500 505 510

Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys PheAla Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe

515 520 525 515 520 525

Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys AspLys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp

530 535 540 530 535 540

Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys TyrLys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr

545 550 555 560545 550 555 560

Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys IleTyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile

565 570 575 565 570 575

Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys LeuAsp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu

580 585 590 580 585 590

Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys LysLeu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys

595 600 605 595 600 605

Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr LysTrp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys

610 615 620 610 615 620

Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys HisAsn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His

625 630 635 640625 630 635 640

Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys TrpLys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp

645 650 655 645 650 655

Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys AspSer Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp

660 665 670 660 665 670

Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val SerIle Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser

675 680 685 675 680 685

Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu GlyPhe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly

690 695 700 690 695 700

Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys SerLys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser

705 710 715 720705 710 715 720

His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe AspHis Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp

725 730 735 725 730 735

Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu PheGlu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe

740 745 750 740 745 750

Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro AlaMet Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala

755 760 765 755 760 765

Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr ThrAsn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr

770 775 780 770 775 780

Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp GlnThr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln

785 790 795 800785 790 795 800

Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn IleTyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile

805 810 815 805 810 815

Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp AsnPhe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp Asn

820 825 830 820 825 830

Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu Tyr IlePro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile

835 840 845 835 840 845

Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu AsnVal Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn

850 855 860 850 855 860

Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr HisGlu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His

865 870 875 880865 870 875 880

Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln AsnSer Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn

885 890 895 885 890 895

Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile SerTrp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser

900 905 910 900 905 910

Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala ValGln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val

915 920 925 915 920 925

Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val LysIle Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys

930 935 940 930 935 940

Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp LysVal Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys

945 950 955 960945 950 955 960

Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly GlyLeu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly

965 970 975 965 970 975

Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys SerAla Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser

980 985 990 980 985 990

Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu ThrMet Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr

995 1000 1005 995 1000 1005

Ser Lys Ile Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys ThrSer Lys Ile Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr

1010 1015 1020 1010 1015 1020

Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser PheLys Tyr Thr Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe

1025 1030 1035 1025 1030 1035

Asp Arg Ile Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe AlaAsp Arg Ile Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala

1040 1045 1050 1040 1045 1050

Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile LysLeu Asp Tyr Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys

1055 1060 1065 1055 1060 1065

Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe ArgLys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg

1070 1075 1080 1070 1075 1080

Asn Pro Lys Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys LeuAsn Pro Lys Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu

1085 1090 1095 1085 1090 1095

Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn TyrThr Ser Ala Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr

1100 1105 1110 1100 1105 1110

Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp LysGln Gln Gly Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys

1115 1120 1125 1115 1120 1125

Ala Phe Tyr Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu GlnAla Phe Tyr Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln

1130 1135 1140 1130 1135 1140

Met Arg Asn Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu IleMet Arg Asn Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile

1145 1150 1155 1145 1150 1155

Ser Pro Val Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg AsnSer Pro Val Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn

1160 1165 1170 1160 1165 1170

Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp AlaTyr Glu Ala Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala

1175 1180 1185 1175 1180 1185

Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile GlyAsn Gly Ala Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly

1190 1195 1200 1190 1195 1200

Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys IleGln Phe Lys Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile

1205 1210 1215 1205 1210 1215

Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser ValAla Ile Ser Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val

1220 1225 1230 1220 1225 1230

Lys His Lys Leu Met Pro Lys Lys Lys Arg Lys Val Ser Gly GlyLys His Lys Leu Met Pro Lys Lys Lys Lys Arg Lys Val Ser Gly Gly

1235 1240 1245 1235 1240 1245

Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His AlaSer Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala

1250 1255 1260 1250 1255 1260

Leu Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro ValLeu Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val

1265 1270 1275 1265 1270 1275

Gly Ala Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly TrpGly Ala Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp

1280 1285 1290 1280 1285 1290

Asn Arg Pro Ile Gly Arg His Asp Pro Thr Ala His Ala Glu IleAsn Arg Pro Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile

1295 1300 1305 1295 1300 1305

Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg LeuMet Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu

1310 1315 1320 1310 1315 1320

Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu Pro Cys Val Met CysIle Asp Ala Thr Leu Tyr Val Thr Leu Glu Pro Cys Val Met Cys

1325 1330 1335 1325 1330 1335

Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe GlyAla Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly

1340 1345 1350 1340 1345 1350

Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp ValAla Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val

1355 1360 1365 1355 1360 1365

Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr Glu GlyLeu His His Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly

1370 1375 1380 1370 1375 1380

Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe ArgIle Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe Arg

1385 1390 1395 1385 1390 1395

Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser SerMet Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser

1400 1405 1410 1400 1405 1410

Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu ThrThr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr

1415 1420 1425 1415 1420 1425

Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly SerPro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser

1430 1435 1440 1430 1435 1440

Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp MetSer Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met

1445 1450 1455 1445 1450 1455

Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg GluArg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu

1460 1465 1470 1460 1465 1470

Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile GlyVal Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly

1475 1480 1485 1475 1480 1485

Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala HisGlu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His

1490 1495 1500 1490 1495 1500

Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln AsnAla Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn

1505 1510 1515 1505 1510 1515

Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro CysTyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys

1520 1525 1530 1520 1525 1530

Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg ValVal Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val

1535 1540 1545 1535 1540 1545

Val Phe Gly Val Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser LeuVal Phe Gly Val Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu

1550 1555 1560 1550 1555 1560

Met Asp Val Leu His Tyr Pro Gly Met Asn His Arg Val Glu IleMet Asp Val Leu His Tyr Pro Gly Met Asn His Arg Val Glu Ile

1565 1570 1575 1565 1570 1575

Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Cys TyrThr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Cys Tyr

1580 1585 1590 1580 1585 1590

Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys Lys AlaPhe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys Lys Ala

1595 1600 1605 1595 1600 1605

Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser GlyGln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly

1610 1615 1620 1610 1615 1620

Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser SerSer Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser

1625 1630 1635 1625 1630 1635

Gly Gly Ser Ser Gly Gly Ser Leu Lys Lys Arg Pro Ala Ala ThrGly Gly Ser Ser Gly Gly Ser Leu Lys Lys Arg Pro Ala Ala Thr

1640 1645 1650 1640 1645 1650

Lys Lys Ala Gly Gln Ala Lys Lys Lys LysLys Lys Ala Gly Gln Ala Lys Lys Lys Lys

1655 1660 1655 1660

<210> 30<210> 30

<211> 1526<211> 1526

<212> DNA<212> DNA

<213> Artificial Sequence<213> Artificial Sequence

<220><220>

<223> Artificial Sequence<223> Artificial Sequence

<400> 30<400> 30

ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60

agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120

tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180

tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240

gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300

ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360

gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420gtttagggtt aatggtttttt atagactaat tttttagta catctatttt attctatttt 420

agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480

taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540

aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600

tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660

cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720

ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780

gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840

caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900

ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960

tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020

taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080

atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140

cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200

ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260

gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320

cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380

acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440

ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500

tctgcaggtc gaagcttgaa gcaaac 1526tctgcaggtc gaagcttgaa gcaaac 1526

Claims (20)

1. a kind of system for carrying out base editor to the target sequence in organism genome, it includes it is following i) in v) extremely One item missing:
I) base editor fusion protein and guide RNA;
Ii) the expression construct and guide RNA of the nucleotide sequence comprising encoding base editor fusion protein;
Iii) base editor fusion protein, and the expression construct of the nucleotide sequence comprising encoding guide RNA;
Iv) the expression construct of the nucleotide sequence comprising encoding base editor fusion protein, and the core comprising encoding guide RNA The expression construct of nucleotide sequence;
V) the expression building of the nucleotide sequence of the nucleotide sequence comprising encoding base editor fusion protein and coding guide RNA Body;
Wherein the base editor fusion protein includes the Cpf1 and deaminase of DNA cleavage activity missing, and the guide RNA can By the target sequence in the base editor fusion protein target gene group, lead to one or more C to T or A in the target sequence To the substitution of G.
2. the system of claim 1, wherein the Cpf1 of DNA cleavage activity missing is the FnCpf1 of DNA cleavage activity missing, Such as the FnCpf1 of the DNA cleavage activity missing is mutated relative to wild type FnCpf1 comprising D917A.
3. the system of claim 1, wherein the Cpf1 of DNA cleavage activity missing is the AsCpf1 of DNA cleavage activity missing, Such as the AsCpf1 of the DNA cleavage activity missing is mutated relative to wild type AsCpf1 comprising D908A.
4. the system of claim 1, wherein the Cpf1 of DNA cleavage activity missing is the LbCpf1 of DNA cleavage activity missing, Such as the LbCpf1 of the DNA cleavage activity missing is mutated relative to wild type LbCpf1 comprising D832A.
5. the system of claim 1, wherein the deaminase is cytidine deaminase, such as apolipoprotein B mRNA edits complex (APOBEC) family's deaminase.
6. the system of claim 5, wherein the cytidine deaminase is the cytidine deaminase of APOBEC1 deaminase or activation-inducing (AID)。
7. the system of claim 5, wherein the base editor fusion protein also includes uracil dna glycosylase inhibitor (UGI)。
8. the system of claim 1, wherein the deaminase is DNA dependent form adenine deaminase, preferably single stranded DNA dependent form Adenine deaminase.
9. the system of claim 8, wherein the DNA dependent form adenine deaminase is Escherichia coli tRNA adenine deaminase The variant of TadA (ecTadA) can especially receive variant of the single stranded DNA as substrate.
10. the system of claim 9, wherein the DNA dependent form adenine deaminase includes one relative to wild type ecTadA Or multiple groups mutation selected from the following:
1) A106V and D108N;
2) D147Y and E155V;
3) L84F, H123Y and I156F;
4)A142N;
5) H36L, R51L, S146C and K157N;
6)P48S/T/A;
7)A142N;
8)W23L/R;With
9)R152H/P。
11. the system of claim 10, wherein the DNA dependent form adenine deaminase relative to wild type ecTadA include with Lower mutation: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, D147Y, E155V and R152P.
12. the N-terminal fusion of the system of claim 9, the DNA dependent form adenine deaminase has corresponding wild type adenine The N-terminal of deaminase, the preferably described DNA dependent form adenine deaminase passes through connector and corresponding wild type adenine deaminase Fusion.
13. the system of claim 1, wherein the deaminase is fused the end N of the Cpf1 lacked to the DNA cleavage activity End or in which the deaminase are fused the C-terminal of the Cpf1 lacked to DNA cleavage activity.
14. the system of claim 1, wherein the deaminase and the Cpf1 of DNA cleavage activity missing are merged by connector.
15. the system of claim 1, wherein the base editor fusion protein includes also nuclear location sequence in its N-terminal and/or C-terminal It arranges (NLS).
16. the system of claim 1, wherein the nucleotide sequence of the encoding base editor fusion protein is directed to pending base The organism of editor carries out codon optimization.
17. the system of claim 1, the nucleotide sequence of the encoding base editor fusion protein and/or the coding guide The nucleotide sequence of RNA is operably connected with expression regulation element.
18. the system of claim 17, wherein the controlling element is promoter, such as 35S promoter, corn Ubi-1 starting Son, wheat U6 promoter, rice U3 promoter or corn U3 promoter.
19. a kind of method for generating genetically modified organism, including by the system introducing of any one of claim 1-18 Biological cell, thus the base editor fusion protein is targeted the target sequence in the cellular genome by the guide RNA, Lead to the substitution of one or more C to T or A to G in the target sequence.
20. the method for claim 19, wherein the organism be selected from mammal for example people, mouse, rat, monkey, dog, pig, sheep, Ox, cat;Poultry such as chicken, duck, goose;Plant, including monocotyledon and dicotyledon, for example, rice, corn, wheat, sorghum, Barley, soybean, peanut, arabidopsis.
CN201811578853.8A 2017-12-22 2018-12-21 Base editing system and method based on CPF1 protein Active CN109957569B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711403490 2017-12-22
CN201711403490X 2017-12-22

Publications (2)

Publication Number Publication Date
CN109957569A true CN109957569A (en) 2019-07-02
CN109957569B CN109957569B (en) 2022-10-25

Family

ID=66992485

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811578853.8A Active CN109957569B (en) 2017-12-22 2018-12-21 Base editing system and method based on CPF1 protein

Country Status (3)

Country Link
CN (1) CN109957569B (en)
AR (1) AR114014A1 (en)
WO (1) WO2019120310A1 (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110964741A (en) * 2019-12-20 2020-04-07 北京市农林科学院 Nuclear localization signal FNB and application thereof in improving base editing efficiency
CN111019967A (en) * 2019-11-27 2020-04-17 南京农业大学 Application of GmU3-19g-1 and GmU6-16g-1 promoters in soybean polygene editing system
CN111518794A (en) * 2020-04-13 2020-08-11 中山大学 Preparation and application of induced mutant protein based on activation-induced cytidine deaminase
WO2021032155A1 (en) * 2019-08-20 2021-02-25 中国科学院遗传与发育生物学研究所 Base editing system and use method therefor
CN112851776A (en) * 2020-04-20 2021-05-28 中国科学院天津工业生物技术研究所 Gene site-directed mutagenesis method and stress resistance breeding application thereof
CN113005141A (en) * 2021-01-05 2021-06-22 温州医科大学 Gene editing tool composed of high-activity mutant, preparation method and method for repairing congenital retinoschisis disease pathogenic gene
CN114045302A (en) * 2021-11-12 2022-02-15 三亚中国农业科学院国家南繁研究院 Single-base editing vector and construction and application thereof
CN114375335A (en) * 2019-07-19 2022-04-19 成对植物服务股份有限公司 Optimized protein linkers and methods of use
CN114667149A (en) * 2019-09-09 2022-06-24 比姆医疗股份有限公司 Novel nucleobase editor and methods of use
CN114761547A (en) * 2019-09-26 2022-07-15 先正达农作物保护股份公司 Methods and compositions for DNA base editing
CN114829595A (en) * 2019-10-17 2022-07-29 成对植物服务股份有限公司 Variants of CAS12A nuclease, methods of making and uses thereof
CN114835818A (en) * 2022-03-17 2022-08-02 江南大学 A gene editing fusion protein, its constructed adenine base editor and its application
CN114867852A (en) * 2019-10-30 2022-08-05 成对植物服务股份有限公司 V-type CRISPR-CAS base editor and method of use thereof
WO2022223010A1 (en) * 2021-04-21 2022-10-27 浙江大学 Negative-strand rna viral vector and plant genome editing method without transformation
CN115704015A (en) * 2021-08-12 2023-02-17 清华大学 Targeted mutagenesis system based on adenine and cytosine dual base editor
CN116286734A (en) * 2022-11-29 2023-06-23 武汉大学 Mutants of wild-type LbCas12a protein and its use in SNP detection
CN116376948A (en) * 2022-07-25 2023-07-04 广州医科大学 A kind of preparation method of plasmid carrier and MS2 bacteriophage similar particles displaying foreign protein
CN116497067A (en) * 2019-02-13 2023-07-28 比姆医疗股份有限公司 Compositions and methods for treating heme lesions
CN116751799A (en) * 2023-06-14 2023-09-15 江南大学 Multi-site double-base editor and application thereof
CN116836962A (en) * 2023-06-28 2023-10-03 微光基因(苏州)有限公司 Engineered adenosine deaminase and base editor
WO2023207607A1 (en) * 2022-04-29 2023-11-02 北京大学 Deaminase mutant, composition, and method for modifying mitochondrial dna
US12133884B2 (en) 2018-05-11 2024-11-05 Beam Therapeutics Inc. Methods of substituting pathogenic amino acids using programmable base editor systems

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG11202106977PA (en) * 2018-12-27 2021-07-29 Lifeedit Therapeutics Inc Polypeptides useful for gene editing and methods of use
CN110551752B (en) * 2019-08-30 2023-03-14 北京市农林科学院 xCas9n-epBE base editing system and application thereof in genome base replacement
US20230257761A1 (en) 2019-09-12 2023-08-17 Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences Herbicide resistant plant
EP4185699A1 (en) 2020-07-21 2023-05-31 Pairwise Plants Services, Inc. Optimized protein linkers and methods of use
CN114317596B (en) * 2020-09-30 2024-01-16 北京市农林科学院 A method for mutating A to G in target sequence of plant genome
CN114317518B (en) * 2020-09-30 2024-01-12 北京市农林科学院 Application of SpRYn-CBE base editing system in base replacement in plant genomes
CN114317589B (en) * 2020-09-30 2024-01-16 北京市农林科学院 Application of SpRYn-ABE base editing system in plant genome base substitution
CN112430622A (en) * 2020-10-26 2021-03-02 扬州大学 FokI and dCpf1 fusion protein expression vector and site-directed gene editing method mediated by same
EP4499819A1 (en) * 2022-03-30 2025-02-05 Basf Agricultural Solutions Seed Us Llc Optimized base editors
CN114686456B (en) * 2022-05-10 2023-02-17 中山大学 Base editing system based on bimolecular deaminase complementation and application thereof
CN115820691B (en) * 2022-07-25 2023-08-22 安徽农业大学 LbCPf1 variant-based rice base editing system and application

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017184786A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Cpf1 complexes with reduced indel activity
WO2017184768A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Novel crispr enzymes and systems
WO2017189308A1 (en) * 2016-04-19 2017-11-02 The Broad Institute Inc. Novel crispr enzymes and systems
CN110214183A (en) * 2016-08-03 2019-09-06 哈佛大学的校长及成员们 Adenosine nucleobase editing machine and application thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3374494A4 (en) * 2015-11-11 2019-05-01 Coda Biotherapeutics, Inc. Crispr compositions and methods of using the same for gene therapy
EP3405570A1 (en) * 2016-01-22 2018-11-28 The Broad Institute, Inc. Crystal structure of crispr cpf1

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017184786A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Cpf1 complexes with reduced indel activity
WO2017184768A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Novel crispr enzymes and systems
WO2017189308A1 (en) * 2016-04-19 2017-11-02 The Broad Institute Inc. Novel crispr enzymes and systems
CN110214183A (en) * 2016-08-03 2019-09-06 哈佛大学的校长及成员们 Adenosine nucleobase editing machine and application thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
NICOLE M. GAUDELLI等: "《Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage》", 《NATURE》 *

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12133884B2 (en) 2018-05-11 2024-11-05 Beam Therapeutics Inc. Methods of substituting pathogenic amino acids using programmable base editor systems
US12016908B2 (en) 2019-02-13 2024-06-25 Beam Therapeutics Inc. Compositions and methods for treating hemoglobinopathies
CN116497067B (en) * 2019-02-13 2025-01-24 比姆医疗股份有限公司 Compositions and methods for treating hemoglobinopathies
CN116497067A (en) * 2019-02-13 2023-07-28 比姆医疗股份有限公司 Compositions and methods for treating heme lesions
CN114375335A (en) * 2019-07-19 2022-04-19 成对植物服务股份有限公司 Optimized protein linkers and methods of use
WO2021032155A1 (en) * 2019-08-20 2021-02-25 中国科学院遗传与发育生物学研究所 Base editing system and use method therefor
CN114945670B (en) * 2019-08-20 2024-12-10 苏州齐禾生科生物科技有限公司 A base editing system and method of use thereof
CN114945670A (en) * 2019-08-20 2022-08-26 上海蓝十字医学科学研究所 Base editing system and use method thereof
CN114667149A (en) * 2019-09-09 2022-06-24 比姆医疗股份有限公司 Novel nucleobase editor and methods of use
CN114761547B (en) * 2019-09-26 2025-02-18 先正达农作物保护股份公司 Methods and compositions for DNA base editing
CN114761547A (en) * 2019-09-26 2022-07-15 先正达农作物保护股份公司 Methods and compositions for DNA base editing
CN114829595A (en) * 2019-10-17 2022-07-29 成对植物服务股份有限公司 Variants of CAS12A nuclease, methods of making and uses thereof
CN114867852A (en) * 2019-10-30 2022-08-05 成对植物服务股份有限公司 V-type CRISPR-CAS base editor and method of use thereof
CN111019967A (en) * 2019-11-27 2020-04-17 南京农业大学 Application of GmU3-19g-1 and GmU6-16g-1 promoters in soybean polygene editing system
CN110964741B (en) * 2019-12-20 2022-03-01 北京市农林科学院 A nuclear localization signal FNB and its application in improving base editing efficiency
CN110964741A (en) * 2019-12-20 2020-04-07 北京市农林科学院 Nuclear localization signal FNB and application thereof in improving base editing efficiency
WO2021208501A1 (en) * 2020-04-13 2021-10-21 中山大学 Preparation and use of induced mutant protein based on activated induced cytidine deaminase
CN111518794B (en) * 2020-04-13 2023-05-16 中山大学 Preparation and application of inducible mutant protein based on activation-inducible cytidine deaminase
CN111518794A (en) * 2020-04-13 2020-08-11 中山大学 Preparation and application of induced mutant protein based on activation-induced cytidine deaminase
CN112851776A (en) * 2020-04-20 2021-05-28 中国科学院天津工业生物技术研究所 Gene site-directed mutagenesis method and stress resistance breeding application thereof
CN113005141A (en) * 2021-01-05 2021-06-22 温州医科大学 Gene editing tool composed of high-activity mutant, preparation method and method for repairing congenital retinoschisis disease pathogenic gene
WO2022223010A1 (en) * 2021-04-21 2022-10-27 浙江大学 Negative-strand rna viral vector and plant genome editing method without transformation
CN115704015A (en) * 2021-08-12 2023-02-17 清华大学 Targeted mutagenesis system based on adenine and cytosine dual base editor
CN114045302A (en) * 2021-11-12 2022-02-15 三亚中国农业科学院国家南繁研究院 Single-base editing vector and construction and application thereof
CN114835818A (en) * 2022-03-17 2022-08-02 江南大学 A gene editing fusion protein, its constructed adenine base editor and its application
CN114835818B (en) * 2022-03-17 2024-03-22 江南大学 Gene editing fusion protein, adenine base editor constructed by same and application thereof
WO2023207607A1 (en) * 2022-04-29 2023-11-02 北京大学 Deaminase mutant, composition, and method for modifying mitochondrial dna
CN116376948A (en) * 2022-07-25 2023-07-04 广州医科大学 A kind of preparation method of plasmid carrier and MS2 bacteriophage similar particles displaying foreign protein
CN116376948B (en) * 2022-07-25 2023-12-15 广州医科大学 Plasmid vector and preparation method of MS2 phage similar particles for displaying exogenous proteins
CN116286734B (en) * 2022-11-29 2024-04-02 武汉大学 Mutants of wild-type LbCas12a protein and their uses for SNP detection
CN116286734A (en) * 2022-11-29 2023-06-23 武汉大学 Mutants of wild-type LbCas12a protein and its use in SNP detection
CN116751799A (en) * 2023-06-14 2023-09-15 江南大学 Multi-site double-base editor and application thereof
CN116751799B (en) * 2023-06-14 2024-01-26 江南大学 Multi-site double-base editor and application thereof
CN116836962B (en) * 2023-06-28 2024-04-05 微光基因(苏州)有限公司 Engineered adenosine deaminase and base editors
CN116836962A (en) * 2023-06-28 2023-10-03 微光基因(苏州)有限公司 Engineered adenosine deaminase and base editor

Also Published As

Publication number Publication date
AR114014A1 (en) 2020-07-08
CN109957569B (en) 2022-10-25
WO2019120310A1 (en) 2019-06-27

Similar Documents

Publication Publication Date Title
CN109957569B (en) Base editing system and method based on CPF1 protein
CN110157727A (en) Alkaloid edit methods
KR102061438B1 (en) A method for converting monocot genome sequences in which a nucleic acid base in a targeting DNA sequence is specifically converted, and a molecular complex used therein.
AU2022201319B2 (en) Viral vectors encoding recombinant fviii variants with increased expression for gene therapy of hemophilia a
KR102589523B1 (en) Cell lines and methods for producing polyketides in yeast
DK2324120T3 (en) Manipulating SNF1 protein kinase OF REVISION OF OIL CONTENT IN OLEAGINOUS ORGANISMS
BRPI0806354A2 (en) transgender oilseeds, seeds, oils, food or food analogues, medicinal food products or medicinal food analogues, pharmaceuticals, beverage formulas for babies, nutritional supplements, pet food, aquaculture feed, animal feed, whole seed products , mixed oil products, partially processed products, by-products and by-products
AU2016354550A1 (en) Viral vectors encoding recombinant fviii variants with increased expression for gene therapy of hemophilia a
CN106544361A (en) Mammalian cell expression vector, expression system, preparation method and application
CN108913718A (en) A kind of preparation method and application of the CAR-T cell of targeting EGFR v III
CN112608940B (en) Construction method and application of animal model of congenital cataract disease
CN114540424A (en) IGFBP7 muscle tissue specific knockout mouse animal model and construction method thereof
KR20210034013A (en) Genetic treatment of hemophilia using viral vectors encoding recombinant FVIII variants with increased expression
CN113943737A (en) Application of chicken CTGF gene in inhibiting differentiation of chicken preadipocytes
KR102009270B1 (en) Recombinant foot-and-mouth disease virus expressing protective antigen of type O-Thi60
CN114395020B (en) Application of GmRALF1 protein in promoting phosphorus element absorption of plants
CN111100874B (en) Targeting vector and method and application of integrating exogenous gene into mouse DC-SIGN exon 7 site to construct BAC clone
CN114134170A (en) Preparation method and application of HA tag fusion expression vector
CN101300358A (en) Astaxanthine biosynthesis in eukaryotes
KR102247462B1 (en) Recombinant Lactic acid bacteria having enhanced Productivity for Riboflavin
KR101246884B1 (en) Transformed Pseudomonas Aeruginosa for Over-expression of Fatty Acid Biosynthesis Pathway and Method of Preparing the Same
JP2003274972A (en) Flowering control gene and its use
CN107345210A (en) The disinsection engineering bacteria strain and its application of one plant of furocoumarin genetic transformation
CN116555061A (en) A mutant of Saccharomyces cerevisiae and its application in the preparation of alcohol
KR20120094788A (en) The developed pseudomonas aeruginosa strains for improving the content of fatty acids and manufacturing method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220623

Address after: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120

Applicant after: Shanghai Blue Cross Medical Science Research Institute

Address before: No. 2, No. 1, Beichen West Road, Beichen, Beijing

Applicant before: INSTITUTE OF GENETICS AND DEVELOPMENTAL BIOLOGY, CHINESE ACADEMY OF SCIENCES

TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220921

Address after: Unit E598, 5th Floor, Lecheng Plaza, Phase II, Biomedical Industrial Park, No. 218, Sangtian Street, Suzhou Industrial Park, Suzhou Area, China (Jiangsu) Pilot Free Trade Zone, Suzhou City, Jiangsu Province, 215127

Applicant after: Suzhou Qihe Biotechnology Co.,Ltd.

Address before: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120

Applicant before: Shanghai Blue Cross Medical Science Research Institute

GR01 Patent grant
GR01 Patent grant